Accurate identification of RNA editing sites from primitive sequence with deep neural networks.
Ouyang, Zhangyi; Liu, Feng; Zhao, Chenghui; Ren, Chao; An, Gaole; Mei, Chuan; Bo, Xiaochen; Shu, Wenjie
2018-04-16
RNA editing is a post-transcriptional RNA sequence alteration. Current methods have identified editing sites and facilitated research but require sufficient genomic annotations and prior-knowledge-based filtering steps, resulting in a cumbersome, time-consuming identification process. Moreover, these methods have limited generalizability and applicability in species with insufficient genomic annotations or in conditions of limited prior knowledge. We developed DeepRed, a deep learning-based method that identifies RNA editing from primitive RNA sequences without prior-knowledge-based filtering steps or genomic annotations. DeepRed achieved 98.1% and 97.9% area under the curve (AUC) in training and test sets, respectively. We further validated DeepRed using experimentally verified U87 cell RNA-seq data, achieving 97.9% positive predictive value (PPV). We demonstrated that DeepRed offers better prediction accuracy and computational efficiency than current methods with large-scale, mass RNA-seq data. We used DeepRed to assess the impact of multiple factors on editing identification with RNA-seq data from the Association of Biomolecular Resource Facilities and Sequencing Quality Control projects. We explored developmental RNA editing pattern changes during human early embryogenesis and evolutionary patterns in Drosophila species and the primate lineage using DeepRed. Our work illustrates DeepRed's state-of-the-art performance; it may decipher the hidden principles behind RNA editing, making editing detection convenient and effective.
A deep learning method for lincRNA detection using auto-encoder algorithm.
Yu, Ning; Yu, Zeng; Pan, Yi
2017-12-06
RNA sequencing (RNA-seq) enables scientists to develop novel data-driven methods for discovering unidentified lincRNAs. Meanwhile, knowledge-based technologies are experiencing a potential revolution ignited by new deep learning methods. By scanning the newly found data set from RNA-seq, scientists have found that: (1) the expression of lincRNAs appears to be regulated, that is, the relevance exists along the DNA sequences; (2) lincRNAs contain some conserved patterns/motifs tethered together by non-conserved regions. These two lines of evidence provide the rationale for adopting knowledge-based deep learning methods in lincRNA detection. Similar to coding region transcription, non-coding regions are split at transcriptional sites. However, regulatory RNAs rather than messenger RNAs are generated. That is, the transcribed RNAs participate in biological processes as regulatory units instead of generating proteins. Identifying these transcriptional regions from non-coding regions is the first step towards lincRNA recognition. The auto-encoder method achieves 100% and 92.4% prediction accuracy on transcription sites over the putative data sets. The experimental results also show the excellent performance of the predictive deep neural network on the lincRNA data sets compared with a support vector machine and a traditional neural network. In addition, the method is validated on the newly discovered lincRNA data set, and one unreported transcription site is found by feeding the whole annotated sequences through the deep learning machine, which indicates that the deep learning method has broad capability for lincRNA prediction. The transcriptional sequences of lincRNAs are collected from the annotated human DNA genome data. Subsequently, a two-layer deep neural network is developed for lincRNA detection, which adopts the auto-encoder algorithm and utilizes different encoding schemes to obtain the best performance over intergenic DNA sequence data.
Driven by these newly annotated lincRNA data, deep learning methods based on the auto-encoder algorithm can exert their capability in knowledge learning in order to capture the useful features and the information correlation along DNA genome sequences for lincRNA detection. To our knowledge, this is the first application of deep learning techniques to identifying lincRNA transcription sequences.
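The abstract above mentions "different encoding schemes" for DNA sequences without naming them. As an illustration only, the following minimal sketch shows one widely used scheme, one-hot encoding, which turns a sequence into a fixed-width numeric input suitable for an auto-encoder. The function names and the choice to encode unknown bases as all zeros are assumptions made here, not details taken from the paper.

```python
from typing import List

ALPHABET = "ACGT"  # assumed DNA alphabet; any other symbol maps to all zeros


def one_hot(seq: str) -> List[List[int]]:
    """Encode each base as a 4-element indicator vector (illustrative scheme)."""
    rows = []
    for base in seq.upper():
        row = [0] * len(ALPHABET)
        idx = ALPHABET.find(base)
        if idx != -1:  # unknown bases such as N stay all-zero
            row[idx] = 1
        rows.append(row)
    return rows


def flatten(rows: List[List[int]]) -> List[int]:
    """Concatenate the per-base vectors into one input vector."""
    return [value for row in rows for value in row]
```

A sequence of length n becomes a vector of length 4n, so all inputs must share the same length before they are fed to a fixed-size network.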
Making sense of deep sequencing
Goldman, D.; Domschke, K.
2016-01-01
This review, the first of an occasional series, tries to make sense of the concepts and uses of deep sequencing of polynucleic acids (DNA and RNA). Deep sequencing, synonymous with next-generation sequencing, high-throughput sequencing and massively parallel sequencing, includes whole genome sequencing but is more often and diversely applied to specific parts of the genome captured in different ways, for example the highly expressed portion of the genome known as the exome and portions of the genome that are epigenetically marked either by DNA methylation, the binding of proteins including histones, or that are in different configurations and thus more or less accessible to enzymes that cleave DNA. Deep sequencing of RNA (RNASeq) reverse-transcribed to complementary DNA is invaluable for measuring RNA expression and detecting changes in RNA structure. Important concepts in deep sequencing include the length and depth of sequence reads, mapping and assembly of reads, sequencing error, haplotypes, and the propensity of deep sequencing, as with other types of ‘big data’, to generate large numbers of errors, requiring monitoring for methodologic biases and strategies for replication and validation. Deep sequencing yields a unique genetic fingerprint that can be used to identify a person, and a trove of predictors of genetic medical diseases. Deep sequencing to identify epigenetic events including changes in DNA methylation and RNA expression can reveal the history and impact of environmental exposures. Because of the power of sequencing to identify and deliver biomedically significant information about a person and their blood relatives, it creates ethical dilemmas and practical challenges in research and clinical care, for example the decision and procedures to report incidental findings that will increasingly and frequently be discovered. PMID:24925306
Pan, Xiaoyong; Shen, Hong-Bin
2018-05-02
RNA-binding proteins (RBPs) account for 5-10% of the eukaryotic proteome and play key roles in many biological processes, e.g. gene regulation. Experimental detection of RBP binding sites is still time-consuming and costly. Instead, computational prediction of RBP binding sites using patterns learned from existing annotation knowledge is a fast approach. From the biological point of view, the local structure context derived from local sequences will be recognized by specific RBPs. However, in computational modeling using deep learning, to the best of our knowledge, only global representations of entire RNA sequences are employed. So far, the local sequence information is ignored in the deep model construction process. In this study, we present a computational method, iDeepE, to predict RNA-protein binding sites from RNA sequences by combining global and local convolutional neural networks (CNNs). For the global CNN, we pad the RNA sequences to the same length. For the local CNN, we split an RNA sequence into multiple overlapping fixed-length subsequences, where each subsequence is a signal channel of the whole sequence. Next, we train deep CNNs for multiple subsequences and the padded sequences to learn high-level features, respectively. Finally, the outputs from local and global CNNs are combined to improve the prediction. iDeepE demonstrates better performance than state-of-the-art methods on two large-scale datasets derived from CLIP-seq. We also find that the local CNN runs 1.8 times faster than the global CNN with comparable performance when using GPUs. Our results show that iDeepE has captured experimentally verified binding motifs. Availability: https://github.com/xypan1232/iDeepE. Contact: xypan172436@gmail.com or hbshen@sjtu.edu.cn. Supplementary data are available at Bioinformatics online.
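The two input preparations described in this abstract (padding to a common length for the global CNN, and splitting into overlapping fixed-length subsequences for the local CNN) can be sketched as follows. This is a minimal illustration, not the iDeepE implementation: the window size, overlap, padding character "N", and function names are all assumptions, since the abstract does not state them.

```python
from typing import List


def pad_sequence(seq: str, length: int, pad_char: str = "N") -> str:
    """Global view: truncate or right-pad a sequence to a fixed length."""
    return seq[:length].ljust(length, pad_char)


def split_overlapping(seq: str, window: int, overlap: int,
                      pad_char: str = "N") -> List[str]:
    """Local view: cut a sequence into overlapping fixed-length windows.

    Consecutive windows share `overlap` characters; the last window is
    right-padded so that every window has exactly `window` characters.
    """
    if not 0 <= overlap < window:
        raise ValueError("overlap must satisfy 0 <= overlap < window")
    step = window - overlap
    windows = []
    start = 0
    while True:
        chunk = seq[start:start + window]
        windows.append(chunk.ljust(window, pad_char))
        if start + window >= len(seq):  # this window reached the end
            break
        start += step
    return windows
```

For example, `split_overlapping("ACGUA", window=4, overlap=2)` yields two windows, the second padded to full width. Each window would then be encoded and treated as one channel of the local network's input.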
miRBase: integrating microRNA annotation and deep-sequencing data.
Kozomara, Ana; Griffiths-Jones, Sam
2011-01-01
miRBase is the primary online repository for all microRNA sequences and annotation. The current release (miRBase 16) contains over 15,000 microRNA gene loci in over 140 species, and over 17,000 distinct mature microRNA sequences. Deep-sequencing technologies have delivered a sharp rise in the rate of novel microRNA discovery. We have mapped reads from short RNA deep-sequencing experiments to microRNAs in miRBase and developed web interfaces to view these mappings. The user can view all read data associated with a given microRNA annotation, filter reads by experiment and count, and search for microRNAs by tissue- and stage-specific expression. These data can be used as a proxy for relative expression levels of microRNA sequences, provide detailed evidence for microRNA annotations and alternative isoforms of mature microRNAs, and allow us to revisit previous annotations. miRBase is available online at: http://www.mirbase.org/.
Swenson, Luke C; Moores, Andrew; Low, Andrew J; Thielen, Alexander; Dong, Winnie; Woods, Conan; Jensen, Mark A; Wynhoven, Brian; Chan, Dennison; Glascock, Christopher; Harrigan, P Richard
2010-08-01
Tropism testing should rule out CXCR4-using HIV before treatment with CCR5 antagonists. Currently, the recombinant phenotypic Trofile assay (Monogram) is most widely utilized; however, genotypic tests may represent alternative methods. Independent triplicate amplifications of the HIV gp120 V3 region were made from either plasma HIV RNA or proviral DNA. These underwent standard, population-based sequencing with an ABI3730 (RNA n = 63; DNA n = 40), or "deep" sequencing with a Roche/454 Genome Sequencer-FLX (RNA n = 12; DNA n = 12). Position-specific scoring matrices (PSSMX4/R5) (-6.96 cutoff) and geno2pheno[coreceptor] (5% false-positive rate) inferred tropism from V3 sequence. These methods were then independently validated with a separate, blinded dataset (n = 278) of screening samples from the maraviroc MOTIVATE trials. Standard sequencing of HIV RNA with PSSM yielded 69% sensitivity and 91% specificity, relative to Trofile. The validation dataset gave 75% sensitivity and 83% specificity. Proviral DNA plus PSSM gave 77% sensitivity and 71% specificity. "Deep" sequencing of HIV RNA detected >2% inferred-CXCR4-using virus in 8/8 samples called non-R5 by Trofile, and <2% in 4/4 samples called R5. Triplicate analyses of V3 standard sequence data detect greater proportions of CXCR4-using samples than previously achieved. Sequencing proviral DNA and "deep" V3 sequencing may also be useful tools for assessing tropism.
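The sensitivity and specificity figures in this abstract are computed relative to a reference assay (here, Trofile). A minimal sketch of that calculation is given below; the function name and the convention that `True` means a non-R5 (CXCR4-using) call are assumptions for illustration, and no data from the study are reproduced.

```python
import math
from typing import Iterable, Tuple


def sensitivity_specificity(reference: Iterable[bool],
                            test: Iterable[bool]) -> Tuple[float, float]:
    """Compare paired calls; True means a non-R5 (CXCR4-using) call.

    Sensitivity = TP / (TP + FN): fraction of reference positives detected.
    Specificity = TN / (TN + FP): fraction of reference negatives confirmed.
    """
    tp = fn = tn = fp = 0
    for ref_call, test_call in zip(reference, test):
        if ref_call and test_call:
            tp += 1
        elif ref_call and not test_call:
            fn += 1
        elif not ref_call and not test_call:
            tn += 1
        else:
            fp += 1
    sensitivity = tp / (tp + fn) if (tp + fn) else math.nan
    specificity = tn / (tn + fp) if (tn + fp) else math.nan
    return sensitivity, specificity
```

Note that both values depend on which assay is treated as the reference, so the numbers above describe agreement with Trofile rather than absolute accuracy.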
Comprehensive discovery of noncoding RNAs in acute myeloid leukemia cell transcriptomes.
Zhang, Jin; Griffith, Malachi; Miller, Christopher A; Griffith, Obi L; Spencer, David H; Walker, Jason R; Magrini, Vincent; McGrath, Sean D; Ly, Amy; Helton, Nichole M; Trissal, Maria; Link, Daniel C; Dang, Ha X; Larson, David E; Kulkarni, Shashikant; Cordes, Matthew G; Fronick, Catrina C; Fulton, Robert S; Klco, Jeffery M; Mardis, Elaine R; Ley, Timothy J; Wilson, Richard K; Maher, Christopher A
2017-11-01
To detect diverse and novel RNA species comprehensively, we compared deep small RNA and RNA sequencing (RNA-seq) methods applied to a primary acute myeloid leukemia (AML) sample. We were able to discover previously unannotated small RNAs using deep sequencing of a library method using broader insert size selection. We analyzed the long noncoding RNA (lncRNA) landscape in AML by comparing deep sequencing from multiple RNA-seq library construction methods for the sample that we studied and then integrating RNA-seq data from 179 AML cases. This identified lncRNAs that are completely novel, differentially expressed, and associated with specific AML subtypes. Our study revealed the complexity of the noncoding RNA transcriptome through a combined strategy of strand-specific small RNA and total RNA-seq. This dataset will serve as an invaluable resource for future RNA-based analyses.
Burroughs, A Maxwell; Ando, Yoshinari; de Hoon, Michiel J L; Tomaru, Yasuhiro; Nishibu, Takahiro; Ukekawa, Ryo; Funakoshi, Taku; Kurokawa, Tsutomu; Suzuki, Harukazu; Hayashizaki, Yoshihide; Daub, Carsten O
2010-10-01
Animal microRNA sequences are subject to 3' nucleotide addition. Through detailed analysis of deep-sequenced short RNA data sets, we show adenylation and uridylation of miRNA is globally present and conserved across Drosophila and vertebrates. To better understand 3' adenylation function, we deep-sequenced RNA after knockdown of nucleotidyltransferase enzymes. The PAPD4 nucleotidyltransferase adenylates a wide range of miRNA loci, but adenylation does not appear to affect miRNA stability on a genome-wide scale. Adenine addition appears to reduce effectiveness of miRNA targeting of mRNA transcripts while deep-sequencing of RNA bound to immunoprecipitated Argonaute (AGO) subfamily proteins EIF2C1-EIF2C3 revealed substantial reduction of adenine addition in miRNA associated with EIF2C2 and EIF2C3. Our findings show 3' addition events are widespread and conserved across animals, PAPD4 is a primary miRNA adenylating enzyme, and suggest a role for 3' adenine addition in modulating miRNA effectiveness, possibly through interfering with incorporation into the RNA-induced silencing complex (RISC), a regulatory role that would complement the role of miRNA uridylation in blocking DICER1 uptake.
DeepBase: annotation and discovery of microRNAs and other noncoding RNAs from deep-sequencing data.
Yang, Jian-Hua; Qu, Liang-Hu
2012-01-01
Recent advances in high-throughput deep-sequencing technology have produced large numbers of short and long RNA sequences and enabled the detection and profiling of known and novel microRNAs (miRNAs) and other noncoding RNAs (ncRNAs) at unprecedented sensitivity and depth. In this chapter, we describe the use of deepBase, a database that we have developed to integrate all public deep-sequencing data and to facilitate the comprehensive annotation and discovery of miRNAs and other ncRNAs from these data. deepBase provides an integrative, interactive, and versatile web graphical interface to evaluate miRBase-annotated miRNA genes and other known ncRNAs, explore the expression patterns of miRNAs and other ncRNAs, and discover novel miRNAs and other ncRNAs from deep-sequencing data. deepBase also provides a deepView genome browser to comparatively analyze these data at multiple levels. deepBase is available at http://deepbase.sysu.edu.cn/.
3' terminal diversity of MRP RNA and other human noncoding RNAs revealed by deep sequencing.
Goldfarb, Katherine C; Cech, Thomas R
2013-09-21
Post-transcriptional 3' end processing is a key component of RNA regulation. The abundant and essential RNA subunit of RNase MRP has been proposed to function in three distinct cellular compartments and therefore may utilize this mode of regulation. Here we employ 3' RACE coupled with high-throughput sequencing to characterize the 3' terminal sequences of human MRP RNA and other noncoding RNAs that form RNP complexes. The 3' terminal sequence of MRP RNA from HEK293T cells has a distinctive distribution of genomically encoded termini (including an assortment of U residues) with a portion of these selectively tagged by oligo(A) tails. This profile contrasts with the relatively homogenous 3' terminus of an in vitro transcribed MRP RNA control and the differing 3' terminal profiles of U3 snoRNA, RNase P RNA, and telomerase RNA (hTR). 3' RACE coupled with deep sequencing provides a valuable framework for the functional characterization of 3' terminal sequences of noncoding RNAs.
An, Xiaoping; Fan, Hang; Ma, Maijuan; Anderson, Benjamin D.; Jiang, Jiafu; Liu, Wei; Cao, Wuchun; Tong, Yigang
2014-01-01
This paper explored our hypothesis that the sRNA (18-30 bp) deep sequencing technique can be used as an efficient strategy to identify microorganisms other than viruses, such as prokaryotic and eukaryotic pathogens. In the study, the clean reads derived from the sRNA deep sequencing data of wild-caught ticks and mosquitoes were compared against the NCBI nucleotide collection (non-redundant nt database) using Blastn. The blast results were then analyzed with in-house Python scripts. An empirical formula was proposed to identify the putative pathogens. Results showed that not only viruses but also prokaryotic and eukaryotic species of interest can be screened out and were subsequently confirmed with experiments. Specifically, a novel Rickettsia spp. was indicated to exist in Haemaphysalis longicornis ticks collected in Beijing. Our study demonstrated that reusing sRNA deep sequencing data has the potential to trace the origin of pathogens or discover novel agents of emerging/re-emerging infectious diseases. PMID:24618575
Deep Sequencing to Identify the Causes of Viral Encephalitis
Chan, Benjamin K.; Wilson, Theodore; Fischer, Kael F.; Kriesel, John D.
2014-01-01
Deep sequencing allows for a rapid, accurate characterization of microbial DNA and RNA sequences in many types of samples. Deep sequencing (also called next generation sequencing or NGS) is being developed to assist with the diagnosis of a wide variety of infectious diseases. In this study, seven frozen brain samples from deceased subjects with recent encephalitis were investigated. RNA from each sample was extracted, randomly reverse transcribed and sequenced. The sequence analysis was performed in a blinded fashion and confirmed with pathogen-specific PCR. This analysis successfully identified measles virus sequences in two brain samples and herpes simplex virus type-1 sequences in three brain samples. No pathogen was identified in the other two brain specimens. These results were concordant with pathogen-specific PCR and partially concordant with prior neuropathological examinations, demonstrating that deep sequencing can accurately identify viral infections in frozen brain tissue. PMID:24699691
3′ terminal diversity of MRP RNA and other human noncoding RNAs revealed by deep sequencing
2013-01-01
Background: Post-transcriptional 3′ end processing is a key component of RNA regulation. The abundant and essential RNA subunit of RNase MRP has been proposed to function in three distinct cellular compartments and therefore may utilize this mode of regulation. Here we employ 3′ RACE coupled with high-throughput sequencing to characterize the 3′ terminal sequences of human MRP RNA and other noncoding RNAs that form RNP complexes. Results: The 3′ terminal sequence of MRP RNA from HEK293T cells has a distinctive distribution of genomically encoded termini (including an assortment of U residues) with a portion of these selectively tagged by oligo(A) tails. This profile contrasts with the relatively homogenous 3′ terminus of an in vitro transcribed MRP RNA control and the differing 3′ terminal profiles of U3 snoRNA, RNase P RNA, and telomerase RNA (hTR). Conclusions: 3′ RACE coupled with deep sequencing provides a valuable framework for the functional characterization of 3′ terminal sequences of noncoding RNAs. PMID:24053768
DSAP: deep-sequencing small RNA analysis pipeline.
Huang, Po-Jung; Liu, Yi-Chung; Lee, Chi-Ching; Lin, Wei-Chen; Gan, Richie Ruei-Chi; Lyu, Ping-Chiang; Tang, Petrus
2010-07-01
DSAP is an automated multiple-task web service designed to provide a total solution to analyzing deep-sequencing small RNA datasets generated by next-generation sequencing technology. DSAP uses a tab-delimited file as an input format, which holds the unique sequence reads (tags) and their corresponding number of copies generated by the Solexa sequencing platform. The input data will go through four analysis steps in DSAP: (i) cleanup: removal of adaptors and poly-A/T/C/G/N nucleotides; (ii) clustering: grouping of cleaned sequence tags into unique sequence clusters; (iii) non-coding RNA (ncRNA) matching: sequence homology mapping against a transcribed sequence library from the ncRNA database Rfam (http://rfam.sanger.ac.uk/); and (iv) known miRNA matching: detection of known miRNAs in miRBase (http://www.mirbase.org/) based on sequence homology. The expression levels corresponding to matched ncRNAs and miRNAs are summarized in multi-color clickable bar charts linked to external databases. DSAP is also capable of displaying miRNA expression levels from different jobs using a log2-scaled color matrix. Furthermore, a cross-species comparative function is also provided to show the distribution of identified miRNAs in different species as deposited in miRBase. DSAP is available at http://dsap.cgu.edu.tw.
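The first two steps of the pipeline described above (cleanup and clustering of a tab-delimited tag/count file) can be sketched as follows. This is a hedged illustration and not DSAP's code: the adaptor sequence, the minimum tag length, the function names, and the interpretation of "poly-A/T/C/G/N removal" as discarding tags made of a single repeated nucleotide are all assumptions.

```python
from collections import Counter
from typing import Iterable, Optional

ADAPTER = "TCGTATGCCG"  # hypothetical 3' adaptor prefix, for illustration only


def clean_tag(tag: str, adapter: str = ADAPTER,
              min_len: int = 15) -> Optional[str]:
    """Step (i): trim the adaptor, then drop short or homopolymer tags."""
    tag = tag.upper()
    idx = tag.find(adapter)
    if idx != -1:
        tag = tag[:idx]  # keep only the insert before the adaptor
    if len(tag) < min_len:
        return None
    if len(set(tag)) == 1:  # poly-A/T/C/G/N tag (assumed interpretation)
        return None
    return tag


def cluster_tags(lines: Iterable[str]) -> Counter:
    """Step (ii): merge cleaned tags into unique sequences, summing copies."""
    clusters: Counter = Counter()
    for line in lines:
        line = line.strip()
        if not line:
            continue
        seq, count = line.split("\t")
        cleaned = clean_tag(seq)
        if cleaned is not None:
            clusters[cleaned] += int(count)
    return clusters
```

The resulting counter maps each unique cleaned sequence to its total copy number, which is the form that the later homology-matching steps (iii) and (iv) would consume.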
Optimization of conditions to sequence long cDNAs from viruses
USDA-ARS's Scientific Manuscript database
Fourth generation sequencing with the MinION nanopore sequencer provides an opportunity to obtain deep coverage and long reads for single molecules. This will benefit studies on RNA viruses. In the past, Sanger, Illumina, and Ion Torrent sequencing have been utilized to study RNA viruses. Both technique...
Deep sequencing of cardiac microRNA-mRNA interactomes in clinical and experimental cardiomyopathy
Matkovich, Scot J.; Dorn, Gerald W.
2018-01-01
Summary: MicroRNAs are a family of short (~21 nucleotide) noncoding RNAs that serve key roles in cellular growth and differentiation and the response of the heart to stress stimuli. As the sequence-specific recognition element of RNA-induced silencing complexes (RISCs), microRNAs bind mRNAs and prevent their translation via mechanisms that may include transcript degradation and/or prevention of ribosome binding. Short microRNA sequences and the ability of microRNAs to bind to mRNA sites having only partial/imperfect sequence complementarity complicate purely computational analyses of microRNA-mRNA interactomes. Furthermore, computational microRNA target prediction programs typically ignore biological context, and therefore the principal determinants of microRNA-mRNA binding: the presence and quantity of each. To address these deficiencies we describe an empirical method, developed via studies of stressed and failing hearts, to determine disease-induced changes in microRNAs, mRNAs, and the mRNAs targeted to the RISC, without cross-linking mRNAs to RISC proteins. Deep sequencing methods are used to determine RNA abundances, delivering unbiased, quantitative RNA data limited only by their annotation in the genome of interest. We describe the laboratory bench steps required to perform these experiments, experimental design strategies to achieve an appropriate number of sequencing reads per biological replicate, and computer-based processing tools and procedures to convert large raw sequencing data files into gene expression measures useful for differential expression analyses. PMID:25836573
Deep sequencing of cardiac microRNA-mRNA interactomes in clinical and experimental cardiomyopathy.
Matkovich, Scot J; Dorn, Gerald W
2015-01-01
MicroRNAs are a family of short (~21 nucleotide) noncoding RNAs that serve key roles in cellular growth and differentiation and the response of the heart to stress stimuli. As the sequence-specific recognition element of RNA-induced silencing complexes (RISCs), microRNAs bind mRNAs and prevent their translation via mechanisms that may include transcript degradation and/or prevention of ribosome binding. Short microRNA sequences and the ability of microRNAs to bind to mRNA sites having only partial/imperfect sequence complementarity complicate purely computational analyses of microRNA-mRNA interactomes. Furthermore, computational microRNA target prediction programs typically ignore biological context, and therefore the principal determinants of microRNA-mRNA binding: the presence and quantity of each. To address these deficiencies we describe an empirical method, developed via studies of stressed and failing hearts, to determine disease-induced changes in microRNAs, mRNAs, and the mRNAs targeted to the RISC, without cross-linking mRNAs to RISC proteins. Deep sequencing methods are used to determine RNA abundances, delivering unbiased, quantitative RNA data limited only by their annotation in the genome of interest. We describe the laboratory bench steps required to perform these experiments, experimental design strategies to achieve an appropriate number of sequencing reads per biological replicate, and computer-based processing tools and procedures to convert large raw sequencing data files into gene expression measures useful for differential expression analyses.
Samad, Abdul Fatah A; Nazaruddin, Nazaruddin; Murad, Abdul Munir Abdul; Jani, Jaeyres; Zainal, Zamri; Ismail, Ismanizan
2018-03-01
In the current era, the majority of microRNAs (miRNAs) are discovered through computational approaches, which are largely confined to model plants. Here, for the first time, we describe the identification and characterization of novel miRNAs in a non-model plant, Persicaria minor (P. minor), using a computational approach. Unannotated sequences from deep sequencing were analyzed based on previously well-established parameters. Around 24 putative novel miRNAs were identified from 6,417,780 reads of the unannotated sequences, representing 11 unique putative miRNA sequences. The PsRobot target prediction tool was used to identify the target transcripts of the putative novel miRNAs. Most of the predicted target transcripts (mRNAs) were known to be involved in plant development and stress responses. Gene ontology analysis showed that the majority of the putative novel miRNA targets were involved in cellular component (69.07%), followed by molecular function (30.08%) and biological process (0.85%). Out of 11 unique putative miRNAs, 7 were validated through semi-quantitative PCR. These novel miRNA discoveries in P. minor may extend and update the current public miRNA database.
Position-specific binding of FUS to nascent RNA regulates mRNA length
Masuda, Akio; Takeda, Jun-ichi; Okuno, Tatsuya; Okamoto, Takaaki; Ohkawara, Bisei; Ito, Mikako; Ishigaki, Shinsuke; Sobue, Gen
2015-01-01
More than half of all human genes produce prematurely terminated polyadenylated short mRNAs. However, the underlying mechanisms remain largely elusive. CLIP-seq (cross-linking immunoprecipitation [CLIP] combined with deep sequencing) of FUS (fused in sarcoma) in neuronal cells showed that FUS is frequently clustered around an alternative polyadenylation (APA) site of nascent RNA. ChIP-seq (chromatin immunoprecipitation [ChIP] combined with deep sequencing) of RNA polymerase II (RNAP II) demonstrated that FUS stalls RNAP II and prematurely terminates transcription. When an APA site is located upstream of an FUS cluster, FUS enhances polyadenylation by recruiting CPSF160 and up-regulates the alternative short transcript. In contrast, when an APA site is located downstream from an FUS cluster, polyadenylation is not activated, and the RNAP II-suppressing effect of FUS leads to down-regulation of the alternative short transcript. CAGE-seq (cap analysis of gene expression [CAGE] combined with deep sequencing) and PolyA-seq (a strand-specific and quantitative method for high-throughput sequencing of 3' ends of polyadenylated transcripts) revealed that position-specific regulation of mRNA lengths by FUS is operational in two-thirds of transcripts in neuronal cells, with enrichment in genes involved in synaptic activities. PMID:25995189
High Class-Imbalance in pre-miRNA Prediction: A Novel Approach Based on deepSOM.
Stegmayer, Georgina; Yones, Cristian; Kamenetzky, Laura; Milone, Diego H
2017-01-01
The computational prediction of novel microRNA within a full genome involves identifying sequences having the highest chance of being a miRNA precursor (pre-miRNA). These sequences are usually called miRNA candidates. The well-known pre-miRNAs are usually only a few in comparison to the hundreds of thousands of potential miRNA candidates that have to be analyzed, which makes this task a high class-imbalance classification problem. The classical way of approaching it has been training a binary classifier in a supervised manner, using well-known pre-miRNAs as the positive class and artificially defining the negative class. However, although the selection of positive labeled examples is straightforward, it is very difficult to build a set of negative examples in order to obtain a good set of training samples for a supervised method. In this work, we propose a novel and effective way of approaching this problem using machine learning, without the definition of negative examples. The proposal is based on clustering unlabeled sequences of a genome together with well-known miRNA precursors for the organism under study, which allows for the quick identification of the best miRNA candidates as those sequences clustered with known precursors. Furthermore, we propose a deep model to overcome the problem of having very few positive class labels. They are always maintained in the deep levels as the positive class while less likely pre-miRNA sequences are filtered level after level. Our approach has been compared with other methods for pre-miRNA prediction in several species, showing effective predictivity of novel miRNAs. Additionally, we show that our approach has a lower training time and allows for better graphical navigability and interpretation of the results. A web-demo interface to try deepSOM is available at http://fich.unl.edu.ar/sinc/web-demo/deepsom/.
USDA-ARS's Scientific Manuscript database
The complete genome sequence of a double-stranded RNA (dsRNA) virus, southern tomato virus (STV), infecting tomatoes in China was elucidated using small RNA deep sequencing. The identified STV_CN12 shares 99% sequence identity with other isolates from Mexico, France, Spain, and the U.S. This is the first report ...
Plant MicroRNA Prediction by Supervised Machine Learning Using C5.0 Decision Trees.
Williams, Philip H; Eyles, Rod; Weiller, Georg
2012-01-01
MicroRNAs (miRNAs) are nonprotein coding RNAs between 20 and 22 nucleotides long that attenuate protein production. Different types of sequence data are being investigated for novel miRNAs, including genomic and transcriptomic sequences. A variety of machine learning methods have successfully predicted miRNA precursors, mature miRNAs, and other nonprotein coding sequences. MirTools, mirDeep2, and miRanalyzer require "read count" to be included with the input sequences, which restricts their use to deep-sequencing data. Our aim was to train a predictor using a cross-section of different species to accurately predict miRNAs outside the training set. We wanted a system that did not require read-count for prediction and could therefore be applied to short sequences extracted from genomic, EST, or RNA-seq sources. A miRNA-predictive decision-tree model has been developed by supervised machine learning. It only requires that the corresponding genome or transcriptome is available within a sequence window that includes the precursor candidate so that the required sequence features can be collected. Some of the most critical features for training the predictor are the miRNA:miRNA(∗) duplex energy and the number of mismatches in the duplex. We present a cross-species plant miRNA predictor with 84.08% sensitivity and 98.53% specificity based on rigorous testing by leave-one-out validation.
Transcriptome and Small RNA Deep Sequencing Reveals Deregulation of miRNA Biogenesis in Human Glioma
Moore, Lynette M.; Kivinen, Virpi; Liu, Yuexin; Annala, Matti; Cogdell, David; Liu, Xiuping; Liu, Chang-Gong; Sawaya, Raymond; Yli-Harja, Olli; Shmulevich, Ilya; Fuller, Gregory N.; Zhang, Wei; Nykter, Matti
2013-01-01
Altered expression of oncogenic and tumor-suppressing microRNAs (miRNAs) is widely associated with tumorigenesis. However, the regulatory mechanisms underlying these alterations are poorly understood. We sought to shed light on the deregulation of miRNA biogenesis promoting the aberrant miRNA expression profiles identified in these tumors. Using sequencing technology to perform both whole-transcriptome and small RNA sequencing of glioma patient samples, we examined precursor and mature miRNAs to directly evaluate the miRNA maturation process, and interrogated expression profiles for genes involved in the major steps of miRNA biogenesis. We found that ratios of mature to precursor forms of a large number of miRNAs increased with the progression from normal brain to low-grade and then to high-grade gliomas. The expression levels of genes involved in each of the three major steps of miRNA biogenesis (nuclear processing, nucleo-cytoplasmic transport, and cytoplasmic processing) were systematically altered in glioma tissues. Survival analysis of an independent data set demonstrated that the alteration of genes involved in miRNA maturation correlates with survival in glioma patients. Direct quantification of miRNA maturation with deep sequencing demonstrated that deregulation of the miRNA biogenesis pathway is a hallmark for glioma genesis and progression. PMID:23007860
Complete genome sequence of a novel genotype of squash mosaic virus
USDA-ARS's Scientific Manuscript database
Complete genome sequence of a novel genotype of Squash mosaic virus (SqMV) infecting squash plants in Spain was obtained using deep sequencing of small ribonucleic acids and assembly. The low nucleotide sequence identities, with 87-88% on RNA1 and 84-86% on RNA2 to known SqMV isolates, suggest a new...
Using small RNA (sRNA) deep sequencing to understand global virus distribution in plants
USDA-ARS's Scientific Manuscript database
Small RNAs (sRNAs), a class of regulatory RNAs, have been used to serve as the specificity determinants of suppressing gene expression in plants and animals. Next generation sequencing (NGS) uncovered the sRNA landscape in most organisms including their associated microbes. In the current study, w...
USDA-ARS's Scientific Manuscript database
Deep sequencing of viruses isolated from infected hosts is an efficient way to measure population-genetic variation and can reveal patterns of dispersal and natural selection. In this study, we mined existing Illumina sequence reads to investigate single-nucleotide polymorphisms (SNPs) within two RN...
Sittka, Alexandra; Sharma, Cynthia M; Rolle, Katarzyna; Vogel, Jörg
2009-01-01
The bacterial Sm-like protein, Hfq, is a key factor for the stability and function of small non-coding RNAs (sRNAs) in Escherichia coli. Homologues of this protein have been predicted in many distantly related organisms, yet their functional conservation as sRNA-binding proteins has not been entirely clear. To address this, we expressed in Salmonella the Hfq proteins of two eubacteria (Neisseria meningitidis, Aquifex aeolicus) and an archaeon (Methanocaldococcus jannaschii), and analyzed the associated RNA by deep sequencing. This in vivo approach identified endogenous Salmonella sRNAs as a major target of the foreign Hfq proteins. New Salmonella sRNA species were also identified, and some of these accumulated specifically in the presence of a foreign Hfq protein. In addition, we observed specific RNA processing defects, e.g., suppression of precursor processing of SraH sRNA by Methanocaldococcus Hfq, or aberrant accumulation of extracytoplasmic target mRNAs of the Salmonella GcvB, MicA or RybB sRNAs. Taken together, our study provides evidence of a conserved inherent sRNA-binding property of Hfq, which may facilitate the lateral transmission of regulatory sRNAs among distantly related species. It also suggests that the expression of heterologous RNA-binding proteins combined with deep sequencing analysis of RNA ligands can be used as a molecular tool to dissect individual steps of RNA metabolism in vivo.
Takai, Ken; Horikoshi, Koki
1999-01-01
Molecular phylogenetic analysis of a naturally occurring microbial community in a deep-subsurface geothermal environment indicated that the phylogenetic diversity of the microbial population in the environment was extremely limited and that only hyperthermophilic archaeal members closely related to Pyrobaculum were present. All archaeal ribosomal DNA sequences contained intron-like sequences, some of which had open reading frames with repeated homing-endonuclease motifs. The sequence similarity analysis and the phylogenetic analysis of these homing endonucleases suggested the possible phylogenetic relationship among archaeal rRNA-encoded homing endonucleases. PMID:10584021
Oasis 2: improved online analysis of small RNA-seq data.
Rahman, Raza-Ur; Gautam, Abhivyakti; Bethune, Jörn; Sattar, Abdul; Fiosins, Maksims; Magruder, Daniel Sumner; Capece, Vincenzo; Shomroni, Orr; Bonn, Stefan
2018-02-14
Small RNA molecules play important roles in many biological processes and their dysregulation or dysfunction can cause disease. The current method of choice for genome-wide sRNA expression profiling is deep sequencing. Here we present Oasis 2, a new major release of the Oasis web application for the detection, differential expression, and classification of small RNAs in deep sequencing data. Compared to its predecessor Oasis, Oasis 2 features a novel and speed-optimized sRNA detection module that supports the identification of small RNAs in any organism with higher accuracy. In addition to the improved detection of small RNAs in a target organism, the software now also recognizes potential cross-species miRNAs and viral and bacterial sRNAs in infected samples. Novel miRNAs can now also be queried and visualized interactively, providing essential information for over 700 high-quality miRNA predictions across 14 organisms. Robust biomarker signatures can now be obtained using the novel enhanced classification module. Oasis 2 enables biologists and medical researchers to rapidly analyze and query small RNA deep sequencing data with improved precision, recall, and speed, in an interactive and user-friendly environment. Oasis 2 is implemented in Java, J2EE, MySQL, Python, R, PHP and JavaScript. It is freely available at https://oasis.dzne.de.
Danielsson, Frida; Wiking, Mikaela; Mahdessian, Diana; Skogs, Marie; Ait Blal, Hammou; Hjelmare, Martin; Stadler, Charlotte; Uhlén, Mathias; Lundberg, Emma
2013-01-04
One of the major challenges of a chromosome-centric proteome project is to explore in a systematic manner the potential proteins identified from the chromosomal genome sequence, but not yet characterized on a protein level. Here, we describe the use of RNA deep sequencing to screen human cell lines for RNA profiles and to use this information to select cell lines suitable for characterization of the corresponding gene product. In this manner, the subcellular localization of proteins can be analyzed systematically using antibody-based confocal microscopy. We demonstrate the usefulness of selecting cell lines with high expression levels of RNA transcripts to increase the likelihood of high quality immunofluorescence staining and subsequent successful subcellular localization of the corresponding protein. The results show a path to combine transcriptomics with affinity proteomics to characterize the proteins in a gene- or chromosome-centric manner.
USDA-ARS's Scientific Manuscript database
Butyrate is a nutritional element with strong epigenetic regulatory activity as an inhibitor of histone deacetylases (HDACs). Based on the analysis of differentially expressed genes induced by butyrate in the bovine epithelial cell using deep RNA-sequencing technology (RNA-seq), a set of unique gen...
Zeng, Cong; Thomas, Leighton J; Kelly, Michelle; Gardner, Jonathan P A
2016-05-01
The complete mitochondrial genome of a New Zealand specimen of the deep-sea sponge Poecillastra laminaris (Sollas, 1886) (Astrophorida, Vulcanellidae), from the Colville Ridge, New Zealand, was sequenced using the 454 Life Sciences pyrosequencing system. To identify homologous mitochondrial sequences, the 454 reads were mapped to the complete mitochondrial genome sequence of Geodia neptuni (GenBank No. NC_006990). The P. laminaris genome is 18,413 bp in length and includes 14 protein-coding genes, 24 transfer RNA genes and 2 ribosomal RNA genes. Gene order resembled that of other demosponges. The base composition of the genome is A (29.1%), T (35.2%), C (14.0%) and G (21.7%). This is the second published mitogenome for a sponge of the order Astrophorida and will be useful in future phylogenetic analysis of deep-sea sponges.
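As an illustration of how a base composition such as the one reported above can be derived from an assembled sequence, the following is a minimal Python sketch. The function name `base_composition` and the toy sequence are hypothetical; the actual P. laminaris mitogenome is not reproduced here.

```python
from collections import Counter


def base_composition(seq: str) -> dict:
    """Return the percentage of A, T, C and G in seq.

    Symbols other than A/T/C/G (for example N) are ignored, which is an
    assumption of this sketch rather than something stated in the abstract.
    """
    counts = Counter(b for b in seq.upper() if b in "ATCG")
    total = sum(counts.values())
    if total == 0:
        raise ValueError("sequence contains no A/T/C/G bases")
    return {b: 100.0 * counts[b] / total for b in "ATCG"}


# Toy sequence for illustration only (3 A, 4 T, 1 C, 2 G).
print(base_composition("ATTGCATTAG"))
```

The four percentages always sum to 100, as the reported values do (29.1 + 35.2 + 14.0 + 21.7).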
Microbial Diversity in Deep-sea Methane Seep Sediments Presented by SSU rRNA Gene Tag Sequencing
Nunoura, Takuro; Takaki, Yoshihiro; Kazama, Hiromi; Hirai, Miho; Ashi, Juichiro; Imachi, Hiroyuki; Takai, Ken
2012-01-01
Microbial community structures in methane seep sediments in the Nankai Trough were analyzed by tag-sequencing analysis for the small subunit (SSU) rRNA gene using a newly developed primer set. The dominant members of Archaea were Deep-sea Hydrothermal Vent Euryarchaeotic Group 6 (DHVEG 6), Marine Group I (MGI) and Deep Sea Archaeal Group (DSAG), and those in Bacteria were Alpha-, Gamma-, Delta- and Epsilonproteobacteria, Chloroflexi, Bacteroidetes, Planctomycetes and Acidobacteria. Diversity and richness were examined by 8,709 and 7,690 tag-sequences from sediments at 5 and 25 cm below the seafloor (cmbsf), respectively. The estimated diversity and richness in the methane seep sediment are as high as those in soil and deep-sea hydrothermal environments, although the tag-sequences obtained in this study were not sufficient to show whole microbial diversity in this analysis. We also compared the diversity and richness of each taxon/division between the sediments from the two depths, and found that the diversity and richness of some taxa/divisions varied significantly along with the depth. PMID:22510646
Chan, Wen-Ling; Yang, Wen-Kuang; Huang, Hsien-Da; Chang, Jan-Gowth
2013-01-01
RNA interference (RNAi) is a gene silencing process within living cells, which is controlled by the RNA-induced silencing complex in a sequence-specific manner. In flies and mice, pseudogene transcripts can be processed into short interfering RNAs (siRNAs) that regulate protein-coding genes through the RNAi pathway. Following these findings, we constructed an innovative and comprehensive database to elucidate siRNA-mediated mechanisms in human transcribed pseudogenes (TPGs). To investigate TPGs producing siRNAs that regulate protein-coding genes, we mapped the TPGs to small RNAs (sRNAs) that were supported by publicly available deep sequencing data from various sRNA libraries and constructed the TPG-derived siRNA-target interactions. In addition, we showed that TPGs can act as a target for miRNAs that actually regulate the parental gene. To enable the systematic compilation and updating of these results and additional information, we have developed a database, pseudoMap, capturing various types of information, including sequence data, TPG and cognate annotation, deep sequencing data, RNA-folding structure, gene expression profiles, miRNA annotation and target prediction. To our knowledge, pseudoMap is the first database to demonstrate two mechanisms of human TPGs: encoding siRNAs and decoying miRNAs that target the parental gene. pseudoMap is freely accessible at http://pseudomap.mbc.nctu.edu.tw/.
Sequence-specific bias correction for RNA-seq data using recurrent neural networks.
Zhang, Yao-Zhong; Yamaguchi, Rui; Imoto, Seiya; Miyano, Satoru
2017-01-25
The recent success of deep learning techniques in machine learning and artificial intelligence has stimulated a great deal of interest among bioinformaticians, who now wish to bring the power of deep learning to bear on a host of bioinformatical problems. Deep learning is ideally suited for biological problems that require automatic or hierarchical feature representation for biological data when prior knowledge is limited. In this work, we address the sequence-specific bias correction problem for RNA-seq data using Recurrent Neural Networks (RNNs) to model nucleotide sequences without pre-determining sequence structures. The sequence-specific bias of a read is then calculated based on the sequence probabilities estimated by RNNs, and used in the estimation of gene abundance. We explore the application of two popular RNN recurrent units for this task and demonstrate that RNN-based approaches provide a flexible way to model nucleotide sequences without knowledge of predetermined sequence structures. Our experiments show that training an RNN-based nucleotide sequence model is efficient and that RNN-based bias correction methods compare well with the state-of-the-art sequence-specific bias correction method on the commonly used MAQC-III data set. RNNs provide an alternative and flexible way to calculate sequence-specific bias without explicitly pre-determining sequence structures.
Shiroguchi, Katsuyuki; Jia, Tony Z.; Sims, Peter A.; Xie, X. Sunney
2012-01-01
RNA sequencing (RNA-Seq) is a powerful tool for transcriptome profiling, but is hampered by sequence-dependent bias and inaccuracy at low copy numbers intrinsic to exponential PCR amplification. We developed a simple strategy for mitigating these complications, allowing truly digital RNA-Seq. Following reverse transcription, a large set of barcode sequences is added in excess, and nearly every cDNA molecule is uniquely labeled by random attachment of barcode sequences to both ends. After PCR, we applied paired-end deep sequencing to read the two barcodes and cDNA sequences. Rather than counting the number of reads, RNA abundance is measured based on the number of unique barcode sequences observed for a given cDNA sequence. We optimized the barcodes to be unambiguously identifiable, even in the presence of multiple sequencing errors. This method allows counting with single-copy resolution despite sequence-dependent bias and PCR-amplification noise, and is analogous to digital PCR but amenable to quantifying a whole transcriptome. We demonstrated transcriptome profiling of Escherichia coli with more accurate and reproducible quantification than conventional RNA-Seq. PMID:22232676
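The counting idea described above (abundance as the number of distinct barcode pairs per cDNA rather than the number of reads) can be sketched as follows. This is a simplified illustration, not the authors' implementation: the function name `digital_counts`, the tuple layout and the toy reads are hypothetical, and the sketch assumes barcodes have already been error-corrected, which the abstract says is handled by the barcode design.

```python
from collections import defaultdict


def digital_counts(reads):
    """Count distinct (5' barcode, 3' barcode) pairs per cDNA sequence.

    reads: iterable of (barcode_5p, cdna, barcode_3p) tuples, one per
    paired-end read. PCR duplicates share a barcode pair and are therefore
    counted once.
    """
    pairs_seen = defaultdict(set)
    for bc5, cdna, bc3 in reads:
        pairs_seen[cdna].add((bc5, bc3))
    return {cdna: len(pairs) for cdna, pairs in pairs_seen.items()}


# Toy data: the first three reads are PCR copies of one labeled molecule.
toy_reads = [
    ("AAC", "GENE1", "TTG"),
    ("AAC", "GENE1", "TTG"),
    ("AAC", "GENE1", "TTG"),
    ("CGT", "GENE1", "GGA"),
    ("TTA", "GENE2", "CAC"),
]
print(digital_counts(toy_reads))  # {'GENE1': 2, 'GENE2': 1}
```

A conventional read count would give 4 for GENE1 here; the barcode-based count of 2 reflects the two original molecules and is insensitive to how strongly each was amplified.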
Analysis of alternative cleavage and polyadenylation by 3′ region extraction and deep sequencing
Hoque, Mainul; Ji, Zhe; Zheng, Dinghai; Luo, Wenting; Li, Wencheng; You, Bei; Park, Ji Yeon; Yehia, Ghassan; Tian, Bin
2012-01-01
Alternative cleavage and polyadenylation (APA) leads to mRNA isoforms with different coding sequences (CDS) and/or 3′ untranslated regions (3′UTRs). Using 3′ Region Extraction And Deep Sequencing (3′READS), a method which addresses the internal priming and oligo(A) tail issues that commonly plague polyA site (pA) identification, we comprehensively mapped pAs in the mouse genome, thoroughly annotating 3′ ends of genes and revealing over five thousand pAs (~8% of total) flanked by A-rich sequences, which have hitherto been overlooked. About 79% of mRNA genes and 66% of long non-coding RNA (lncRNA) genes have APA; but these two gene types have distinct usage patterns for pAs in introns and upstream exons. Promoter-distal pAs become relatively more abundant during embryonic development and cell differentiation, a trend affecting pAs in both 3′-most exons and upstream regions. Upregulated isoforms generally have stronger pAs, suggesting global modulation of the 3′ end processing activity in development and differentiation. PMID:23241633
Wang, Chen; Han, Jian; Liu, Chonghuai; Kibet, Korir Nicholas; Kayesh, Emrul; Shangguan, Lingfei; Li, Xiaoying; Fang, Jinggui
2012-03-29
MicroRNAs (miRNAs) are a class of functional non-coding small RNAs 19-25 nucleotides in length. Amur grape (Vitis amurensis Rupr.) is an important wild fruit crop with the strongest cold resistance among the Vitis species; it is used as an excellent breeding parent for grapevine and has elicited growing interest in wine production. To date, a relatively large number of grapevine miRNAs (vv-miRNAs) have been reported from cultivated grapevine varieties such as Vitis vinifera L. and hybrids of V. vinifera and V. labrusca, but there is no report on miRNAs from Vitis amurensis Rupr., a wild grapevine species. A small RNA library from Amur grape was constructed, and Solexa technology was used to perform deep sequencing of the library, followed by bioinformatics analysis to identify new miRNAs. In total, 126 conserved miRNAs belonging to 27 miRNA families were identified, and 34 known but non-conserved miRNAs were also found. Significantly, 72 new potential Amur grape-specific miRNAs were discovered. The sequences of these new potential va-miRNAs were further validated through miR-RACE, and accumulation of 18 new va-miRNAs in seven tissues of grapevines was confirmed by real-time RT-PCR (qRT-PCR) analysis. The expression levels of va-miRNAs in flowers and berries were found to be largely consistent with those from the deep-sequenced sRNA libraries of the combined corresponding tissues. We also describe the conservation and variation of va-miRNAs using miR-SNPs and miR-LDs during plant evolution based on comparison of orthologous sequences, and further reveal that the number and sites of miR-SNPs in diverse miRNA families exhibit distinct divergence. Finally, 346 target genes for the new miRNAs were predicted; they include a number of Amur grape stress tolerance genes and many genes regulating anthocyanin synthesis and sugar metabolism.
Deep sequencing of short RNAs from Amur grape flowers and berries identified 72 new potential miRNAs and 34 known but non-conserved miRNAs, indicating that specific miRNAs exist in Amur grape. These results show that a number of regulatory miRNAs exist in Amur grape and play an important role in Amur grape growth, development, and response to abiotic or biotic stress.
Biosynthesis and genetic encoding of phosphothreonine through parallel selection and deep sequencing
Huguenin-Dezot, Nicolas; Liang, Alexandria D.; Schmied, Wolfgang H.; Rogerson, Daniel T.; Chin, Jason W.
2017-01-01
The phosphorylation of threonine residues in proteins regulates diverse processes in eukaryotic cells, and thousands of threonine phosphorylations have been identified. An understanding of how threonine phosphorylation regulates biological function will be accelerated by general methods to bio-synthesize defined phospho-proteins. Here we address limitations in current methods for discovering aminoacyl-tRNA synthetase/tRNA pairs for incorporating non-natural amino acids into proteins, by combining parallel positive selections with deep sequencing and statistical analysis, to create a rapid approach for directly discovering aminoacyl-tRNA synthetase/tRNA pairs that selectively incorporate non-natural substrates. Our approach is scalable and enables the direct discovery of aminoacyl-tRNA synthetase/tRNA pairs with mutually orthogonal substrate specificity. We biosynthesize phosphothreonine in cells, and use our new selection approach to discover a phosphothreonyl-tRNA synthetase/tRNACUA pair. By combining these advances we create an entirely biosynthetic route to incorporating phosphothreonine in proteins and biosynthesize several phosphoproteins; enabling phosphoprotein structure determination and synthetic protein kinase activation. PMID:28553966
Liang, Tingming; Liu, Chang; Ye, Zhenchao
2013-01-01
Obesity and associated metabolic disorders contribute importantly to the metabolic syndrome. MicroRNAs (miRNAs) are a class of small non-coding RNAs that repress target gene expression by inducing mRNA degradation and/or translation repression. Dysregulation of specific miRNAs in obesity may influence energy metabolism and cause insulin resistance, which leads to dyslipidemia, steatosis hepatis and type 2 diabetes. In the present study, we comprehensively analyzed and validated dysregulated miRNAs in ob/ob mouse liver, as well as miRNA groups based on miRNA gene cluster and gene family, by using deep sequencing miRNA datasets. We found that over 13.8% of the total analyzed miRNAs were dysregulated, of which 37 miRNA species showed significantly differential expression. Further RT-qPCR analysis of selected miRNAs validated the expression patterns observed in deep sequencing. Interestingly, we found that miRNA gene clusters and families always showed consistent dysregulation patterns in ob/ob mouse liver, although they had various enrichment levels. Functional enrichment analysis revealed the versatile physiological roles (over six signal pathways and five human diseases) of these miRNAs. Biological studies indicated that overexpression of miR-126 or inhibition of miR-24 in AML-12 cells attenuated free fatty acid-induced fat accumulation. Taken together, our data strongly suggest that obesity and metabolic disturbance are tightly associated with functional miRNAs. We also identified hepatic miRNA candidates serving as potential biomarkers for the diagnosis of the metabolic syndrome.
Kravatsky, Yuri; Chechetkin, Vladimir; Fedoseeva, Daria; Gorbacheva, Maria; Kravatskaya, Galina; Kretova, Olga; Tchurikov, Nickolai
2017-11-23
The efficient development of antiviral drugs, including efficient antiviral small interfering RNAs (siRNAs), requires continuous monitoring of the strict correspondence between a drug and the related highly variable viral DNA/RNA target(s). Deep sequencing is able to provide an assessment of both the general target conservation and the frequency of particular mutations in the different target sites. The aim of this study was to develop a reliable bioinformatic pipeline for the analysis of millions of short, deep sequencing reads corresponding to selected highly variable viral sequences that are drug target(s). The suggested bioinformatic pipeline combines the available programs and the ad hoc scripts based on an original algorithm of the search for the conserved targets in the deep sequencing data. We also present the statistical criteria for the threshold of reliable mutation detection and for the assessment of variations between corresponding data sets. These criteria are robust against the possible sequencing errors in the reads. As an example, the bioinformatic pipeline is applied to the study of the conservation of RNA interference (RNAi) targets in human immunodeficiency virus 1 (HIV-1) subtype A. The developed pipeline is freely available to download at the website http://virmut.eimb.ru/. Brief comments and comparisons between VirMut and other pipelines are also presented.
Geoseq: a tool for dissecting deep-sequencing datasets.
Gurtowski, James; Cancio, Anthony; Shah, Hardik; Levovitz, Chaya; George, Ajish; Homann, Robert; Sachidanandam, Ravi
2010-10-12
Datasets generated on deep-sequencing platforms have been deposited in various public repositories such as the Gene Expression Omnibus (GEO), the Sequence Read Archive (SRA) hosted by the NCBI, or the DNA Data Bank of Japan (DDBJ). Despite being rich data sources, they have not been used much due to the difficulty in locating and analyzing datasets of interest. Geoseq (http://geoseq.mssm.edu) provides a new method of analyzing short reads from deep sequencing experiments. Instead of mapping the reads to reference genomes or sequences, Geoseq maps a reference sequence against the sequencing data. It is web-based, and holds pre-computed data from public libraries. The analysis reduces the input sequence to tiles and measures the coverage of each tile in a sequence library through the use of suffix arrays. The user can upload custom target sequences or use gene/miRNA names for the search and get back results as plots and spreadsheet files. Geoseq organizes the public sequencing data using a controlled vocabulary, allowing identification of relevant libraries by organism, tissue and type of experiment. Analysis of small sets of sequences against deep-sequencing datasets, as well as identification of public datasets of interest, is simplified by Geoseq. We applied Geoseq to (a) identify differential isoform expression in mRNA-seq datasets, (b) identify miRNAs (microRNAs) in libraries and identify mature and star sequences in miRNAs, and (c) identify potentially mis-annotated miRNAs. The ease of using Geoseq for these analyses suggests its utility and uniqueness as an analysis tool.
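The tiling step described above can be illustrated with a small Python sketch. It is a toy stand-in, not Geoseq's code: a plain sorted list of suffixes replaces a real suffix array (fine for a handful of reads, far too memory-hungry for an actual library), and the names `tiles`, `build_suffix_list`, `count_occurrences` and `tile_coverage` are hypothetical. Reads are assumed to contain only ordinary nucleotide letters.

```python
from bisect import bisect_left, bisect_right


def tiles(seq, size, step=1):
    """Cut seq into overlapping tiles of the given size."""
    return [seq[i:i + size] for i in range(0, len(seq) - size + 1, step)]


def build_suffix_list(reads):
    """Sorted suffixes of all reads joined by '$' (naive suffix-array stand-in).

    The '$' separator prevents a tile from matching across two reads.
    """
    text = "$".join(reads)
    return sorted(text[i:] for i in range(len(text)))


def count_occurrences(suffixes, pattern):
    """Number of suffixes that start with pattern, found by binary search."""
    lo = bisect_left(suffixes, pattern)
    # Every suffix starting with pattern sorts below pattern + max code point.
    hi = bisect_right(suffixes, pattern + "\U0010FFFF")
    return hi - lo


def tile_coverage(reference, reads, size, step=1):
    """Return (tile, occurrence count in the library) for each reference tile."""
    suffixes = build_suffix_list(reads)
    return [(t, count_occurrences(suffixes, t)) for t in tiles(reference, size, step)]


library = ["ACGT", "CGTA", "ACGTAC"]
print(tile_coverage("ACGTAC", library, size=4))
# [('ACGT', 2), ('CGTA', 2), ('GTAC', 1)]
```

Building the index once per library and querying it with many small reference sequences matches the direction of search described in the abstract (reference against reads, rather than reads against reference).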
Kretova, Olga V; Chechetkin, Vladimir R; Fedoseeva, Daria M; Kravatsky, Yuri V; Sosin, Dmitri V; Alembekov, Ildar R; Gorbacheva, Maria A; Gashnikova, Natalya M; Tchurikov, Nickolai A
2017-02-01
Any method for silencing the activity of the HIV-1 retrovirus should tackle the extremely high variability of HIV-1 sequences and mutational escape. We studied sequence variability in the vicinity of selected RNA interference (RNAi) targets from isolates of HIV-1 subtype A in Russia, and we propose that using artificial RNAi is a potential alternative to traditional antiretroviral therapy. We prove that using multiple RNAi targets overcomes the variability in HIV-1 isolates. The optimal number of targets critically depends on the conservation of the target sequences. The total number of targets that are conserved with a probability of 0.7-0.8 should exceed at least 2. Combining deep sequencing and multitarget RNAi may provide an efficient approach to cure HIV/AIDS.
Detection of non-coding RNA in bacteria and archaea using the DETR'PROK Galaxy pipeline.
Toffano-Nioche, Claire; Luo, Yufei; Kuchly, Claire; Wallon, Claire; Steinbach, Delphine; Zytnicki, Matthias; Jacq, Annick; Gautheret, Daniel
2013-09-01
RNA-seq experiments are now routinely used for the large scale sequencing of transcripts. In bacteria or archaea, such deep sequencing experiments typically produce 10-50 million fragments that cover most of the genome, including intergenic regions. In this context, the precise delineation of the non-coding elements is challenging. Non-coding elements include untranslated regions (UTRs) of mRNAs, independent small RNA genes (sRNAs) and transcripts produced from the antisense strand of genes (asRNA). Here we present a computational pipeline (DETR'PROK: detection of ncRNAs in prokaryotes) based on the Galaxy framework that takes as input a mapping of deep sequencing reads and performs successive steps of clustering, comparison with existing annotation and identification of transcribed non-coding fragments classified into putative 5' UTRs, sRNAs and asRNAs. We provide a step-by-step description of the protocol using real-life example data sets from Vibrio splendidus and Escherichia coli. Copyright © 2013 The Authors. Published by Elsevier Inc. All rights reserved.
miRanalyzer: a microRNA detection and analysis tool for next-generation sequencing experiments.
Hackenberg, Michael; Sturm, Martin; Langenberger, David; Falcón-Pérez, Juan Manuel; Aransay, Ana M
2009-07-01
Next-generation sequencing now allows the sequencing of small RNA molecules and the estimation of their expression levels. Consequently, there will be a high demand for bioinformatics tools to cope with the several gigabytes of sequence data generated in each single deep-sequencing experiment. Given this situation, we developed miRanalyzer, a web server tool for the analysis of deep-sequencing experiments for small RNAs. The web server tool requires a simple input file containing a list of unique reads and their copy numbers (expression levels). Using these data, miRanalyzer (i) detects all known microRNA sequences annotated in miRBase, (ii) finds all perfect matches against other libraries of transcribed sequences and (iii) predicts new microRNAs. The prediction of new microRNAs is an especially important point as there are many species with very few known microRNAs. Therefore, we implemented a highly accurate machine learning algorithm for the prediction of new microRNAs that reaches AUC values of 97.9% and recall values of up to 75% on unseen data. The web tool summarizes all the described steps in a single output page, which provides a comprehensive overview of the analysis, adding links to more detailed output pages for each analysis module. miRanalyzer is available at http://web.bioinformatics.cicbiogune.es/microRNA/.
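The input described above, a list of unique reads with their copy numbers, can be produced from raw reads with a simple collapsing step. The sketch below is illustrative only: the function name `collapse_reads` is hypothetical, and the exact file layout that miRanalyzer expects is not given in the abstract, so no particular output format is assumed.

```python
from collections import Counter


def collapse_reads(reads):
    """Collapse raw reads into (unique_sequence, copy_number) pairs.

    Reads are upper-cased and stripped of whitespace; empty lines are skipped.
    The result is ordered by copy number, highest first.
    """
    cleaned = (r.strip().upper() for r in reads)
    counts = Counter(r for r in cleaned if r)
    return counts.most_common()


raw = [
    "TGAGGTAGTAGGTTGTATAGTT",
    "tgaggtagtaggttgtatagtt\n",
    "ACGTACGTACGTACGTACGT",
    "",
]
for sequence, copies in collapse_reads(raw):
    print(sequence, copies)
```

Collapsing first keeps the input small, since one line per distinct sequence replaces potentially millions of identical reads.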
A robust and cost-effective approach to sequence and analyze complete genomes of small RNA viruses
USDA-ARS's Scientific Manuscript database
Background: Next-generation sequencing (NGS) allows ultra-deep sequencing of nucleic acids. The use of sequence-independent amplification of viral nucleic acids without utilization of target-specific primers provides advantages over traditional sequencing methods and allows detection of unsuspected ...
Using small RNA deep sequencing data to detect siRNA duplexes induced by plant viruses
USDA-ARS's Scientific Manuscript database
Small interfering RNA (siRNA) duplexes are produced in plants during virus infection, which are short (usually 21 to 24-base pair) double-stranded RNAs (dsRNAs) with several overhanging nucleotides on the 5' end and 3' end. The investigation of the siRNA duplexes is useful to better understand the R...
Discriminative Prediction of A-To-I RNA Editing Events from DNA Sequence
Sun, Jiangming; Singh, Pratibha; Bagge, Annika; Valtat, Bérengère; Vikman, Petter; Spégel, Peter; Mulder, Hindrik
2016-01-01
RNA editing is a post-transcriptional alteration of RNA sequences that, via insertions, deletions or base substitutions, can affect protein structure as well as RNA and protein expression. Recently, it has been suggested that RNA editing may be more frequent than previously thought. A great impediment, however, to a deeper understanding of this process is the paramount sequencing effort that needs to be undertaken to identify RNA editing events. Here, we describe an in silico approach, based on machine learning, that ameliorates this problem. Using 41-nucleotide-long DNA sequences, we show that novel A-to-I RNA editing events can be predicted from known A-to-I RNA editing events intra- and interspecies. The validity of the proposed method was verified in an independent experimental dataset. Using our approach, 203,202 putative A-to-I RNA editing events were predicted in the whole human genome. Out of these, 9% were previously reported. The remaining sites require further validation, e.g., by targeted deep sequencing. In conclusion, the approach described here is a useful tool to identify potential A-to-I RNA editing events without the requirement of extensive RNA sequencing. PMID:27764195
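One way to prepare 41-nucleotide inputs of the kind mentioned above is to cut a fixed window around each candidate adenosine. The sketch below assumes the candidate A sits at the centre of the window with 20 flanking bases on each side; the abstract gives only the window length, so the centring, the function name `adenosine_windows` and the toy sequence are all assumptions made for illustration.

```python
def adenosine_windows(seq, flank=20):
    """Return (position, window) for every A with a full flank on both sides.

    With flank=20 each window is 41 nt long and the candidate A is at
    index 20 of the window. Adenosines too close to either end are skipped.
    """
    seq = seq.upper()
    windows = []
    for i, base in enumerate(seq):
        if base == "A" and i >= flank and i + flank < len(seq):
            windows.append((i, seq[i - flank:i + flank + 1]))
    return windows


# Toy sequence: one central A with full flanks, one A at the very end.
toy = "C" * 20 + "A" + "G" * 20 + "A"
for position, window in adenosine_windows(toy):
    print(position, len(window), window[20])  # 20 41 A
```

Each window could then be encoded (for example one-hot) and passed to whatever classifier is used; that step is left out because the abstract does not specify the model.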
Isakov, Ofer; Bordería, Antonio V; Golan, David; Hamenahem, Amir; Celniker, Gershon; Yoffe, Liron; Blanc, Hervé; Vignuzzi, Marco; Shomron, Noam
2015-07-01
The study of RNA virus populations is a challenging task. Each population of RNA virus is composed of a collection of different, yet related genomes often referred to as mutant spectra or quasispecies. Virologists using deep sequencing technologies face major obstacles when studying virus population dynamics, both experimentally and in natural settings, due to the relatively high error rates of these technologies and the lack of high-performance pipelines. In order to overcome these hurdles we developed a computational pipeline, termed ViVan (Viral Variance Analysis). ViVan is a complete pipeline facilitating the identification, characterization and comparison of sequence variance in deep sequenced virus populations. Applying ViVan on deep sequenced data obtained from samples that were previously characterized by more classical approaches, we uncovered novel and potentially crucial aspects of virus populations. With our experimental work, we illustrate how ViVan can be used for studies ranging from the more practical, detection of resistant mutations and effects of antiviral treatments, to the more theoretical temporal characterization of the population in evolutionary studies. Freely available on the web at http://www.vivanbioinfo.org. Contact: nshomron@post.tau.ac.il. Supplementary data are available at Bioinformatics online. © The Author 2015. Published by Oxford University Press.
Azzouzi, Imane; Moest, Hansjoerg; Wollscheid, Bernd; Schmugge, Markus; Eekels, Julia J M; Speer, Oliver
2015-05-01
During maturation, erythropoietic cells extrude their nuclei but retain their ability to respond to oxidant stress by tightly regulating protein translation. Several studies have reported microRNA-mediated regulation of translation during terminal stages of erythropoiesis, even after enucleation. In the present study, we performed a detailed examination of the endogenous microRNA machinery in human red blood cells using a combination of deep sequencing analysis of microRNAs and proteomic analysis of the microRNA-induced silencing complex. Among the 197 different microRNAs detected, miR-451a was the most abundant, representing more than 60% of all read sequences. In addition, miR-451a and its known target, 14-3-3ζ mRNA, were bound to the microRNA-induced silencing complex, implying their direct interaction in red blood cells. The proteomic characterization of endogenous Argonaute 2-associated microRNA-induced silencing complex revealed 26 cofactor candidates. Among these cofactors, we identified several RNA-binding proteins, as well as motor proteins and vesicular trafficking proteins. Our results demonstrate that red blood cells contain complex microRNA machinery, which might enable immature red blood cells to control protein translation independent of de novo nuclei information. Copyright © 2015 ISEH - International Society for Experimental Hematology. Published by Elsevier Inc. All rights reserved.
Byers, Helen; Wallis, Yvonne; van Veen, Elke M; Lalloo, Fiona; Reay, Kim; Smith, Philip; Wallace, Andrew J; Bowers, Naomi; Newman, William G; Evans, D Gareth
2016-11-01
The sensitivity of testing BRCA1 and BRCA2 remains unresolved as the frequency of deep intronic splicing variants has not been defined in high-risk familial breast/ovarian cancer families. This variant category is reported at significant frequency in other tumour predisposition genes, including NF1 and MSH2. We carried out comprehensive whole gene RNA analysis on 45 high-risk breast/ovary and male breast cancer families with no identified pathogenic variant on exonic sequencing and copy number analysis of BRCA1/2. In addition, we undertook variant screening of a 10-gene high/moderate risk breast/ovarian cancer panel by next-generation sequencing. DNA testing identified the causative variant in 50/56 (89%) breast/ovarian/male breast cancer families with Manchester scores of ≥50 with two variants being confirmed to affect splicing on RNA analysis. RNA sequencing of BRCA1/BRCA2 on 45 individuals from high-risk families identified no deep intronic variants and did not suggest loss of RNA expression as a cause of lost sensitivity. Panel testing in 42 samples identified a known RAD51D variant, a high-risk ATM variant in another breast ovary family and a truncating CHEK2 mutation. Current exonic sequencing and copy number analysis variant detection methods of BRCA1/2 have high sensitivity in high-risk breast/ovarian cancer families. Sequence analysis of RNA does not identify any variants undetected by current analysis of BRCA1/2. However, RNA analysis clarified the pathogenicity of variants of unknown significance detected by current methods. The low diagnostic uplift achieved through sequence analysis of the other known breast/ovarian cancer susceptibility genes indicates that further high-risk genes remain to be identified.
Cavalier-Smith, Thomas
2015-04-01
Contradictory and confusing results can arise if sequenced 'monoprotist' samples really contain DNA of very different species. Eukaryote-wide phylogenetic analyses using five genes from the amoeboflagellate culture ATCC 50646 previously implied it was an undescribed percolozoan related to percolatean flagellates (Stephanopogon, Percolomonas). Contrastingly, three phylogenetic analyses of 18S rRNA alone, did not place it within Percolozoa, but as an isolated deep-branching excavate. I resolve that contradiction by sequence phylogenies for all five genes individually, using up to 652 taxa. Its 18S rRNA sequence (GQ377652) is near-identical to one from stained-glass windows, somewhat more distant from one from cooling-tower water, all three related to terrestrial actinocephalid gregarines Hoplorhynchus and Pyxinia. All four protein-gene sequences (Hsp90; α-tubulin; β-tubulin; actin) are from an amoeboflagellate heterolobosean percolozoan, not especially deeply branching. Contrary to previous conclusions from trees combining protein and rRNA sequences or rDNA trees including Eozoa only, this culture does not represent a major novel deep-branching eukaryote lineage distinct from Heterolobosea, and thus lacks special significance for deep eukaryote phylogeny, though the rDNA sequence is important for gregarine phylogeny. α-Tubulin trees for over 250 eukaryotes refute earlier suggestions of lateral gene transfer within eukaryotes, being largely congruent with morphology and other gene trees. Copyright © 2015. Published by Elsevier GmbH.
Tang, Zhonghui; Zhang, Liping; Xu, Chenguang; Yuan, Shaohua; Zhang, Fengting; Zheng, Yonglian; Zhao, Changping
2012-01-01
The male sterility of thermosensitive genic male sterile (TGMS) lines of wheat (Triticum aestivum) is strictly controlled by temperature. The early phase of anther development is especially susceptible to cold stress. MicroRNAs (miRNAs) play an important role in plant development and in responses to environmental stress. In this study, deep sequencing of small RNA (smRNA) libraries obtained from spike tissues of the TGMS line under cold and control conditions identified a total of 78 unique miRNA sequences from 30 families and trans-acting small interfering RNAs (tasiRNAs) derived from two TAS3 genes. To identify smRNA targets in the wheat TGMS line, we applied the degradome sequencing method, which globally and directly identifies the remnants of smRNA-directed target cleavage. We identified 26 targets of 16 miRNA families and three targets of tasiRNAs. Comparing smRNA sequencing data sets and TaqMan quantitative polymerase chain reaction results, we identified six miRNAs and one tasiRNA (tasiRNA-ARF [for Auxin-Responsive Factor]) as cold stress-responsive smRNAs in spike tissues of the TGMS line. We also determined the expression profiles of target genes that encode transcription factors in response to cold stress. Interestingly, the expression of cold stress-responsive smRNAs integrated in the auxin-signaling pathway and their target genes was largely noncorrelated. We investigated the tissue-specific expression of smRNAs using a tissue microarray approach. Our data indicated that miR167 and tasiRNA-ARF play roles in regulating the auxin-signaling pathway and possibly in the developmental response to cold stress. These data provide evidence that smRNA regulatory pathways are linked with male sterility in the TGMS line during cold stress. PMID:22508932
Li, Guo; Liu, Yong; Liu, Chao; Su, Zhongwu; Ren, Shuling; Wang, Yunyun; Deng, Tengbo; Huang, Donghai; Tian, Yongquan; Qiu, Yuanzheng
2016-09-06
Radioresistance is one of the major factors limiting the therapeutic efficacy and prognosis of patients with nasopharyngeal carcinoma (NPC). Accumulating evidence has suggested that aberrant expression of long noncoding RNAs (lncRNAs) contributes to cancer progression. Therefore, here we identified lncRNAs associated with radioresistance in NPC. The differential expression profiles of lncRNAs associated with NPC radioresistance were constructed by next-generation deep sequencing by comparing radioresistant NPC cells with their parental cells. LncRNA-related mRNAs were predicted and analyzed using bioinformatics algorithms compared with the mRNA profiles related to radioresistance obtained in our previous study. Several lncRNAs and associated mRNAs were validated in established NPC radioresistant cell models and NPC tissues. By comparison between radioresistant CNE-2-Rs and parental CNE-2 cells by next-generation deep sequencing, a total of 781 known lncRNAs and 2054 novel lncRNAs were annotated. The top five upregulated and downregulated known/novel lncRNAs were detected using quantitative real-time reverse transcription-polymerase chain reaction, and 7/10 known lncRNAs and 3/10 novel lncRNAs were demonstrated to have significant differential expression trends that were the same as those predicted by deep sequencing. From the prediction process, 13 pairs of lncRNAs and their associated genes were acquired, and the prediction trends of three pairs were validated in both radioresistant CNE-2-Rs and 6-10B-Rs cell lines, including lncRNA n373932 and SLITRK5, n409627 and PRSS12, and n386034 and RIMKLB. LncRNA n373932 and its related SLITRK5 showed dramatic expression changes in post-irradiation radioresistant cells and a negative expression correlation in NPC tissues (R = -0.595, p < 0.05). 
Our study provides an overview of the expression profiles of radioresistant lncRNAs and potentially related mRNAs, which will facilitate future investigations into the function of lncRNAs in NPC radioresistance.
2012-01-01
Background MicroRNA (miRNA) is a class of functional non-coding small RNA 19-25 nucleotides in length. Amur grape (Vitis amurensis Rupr.) is an important wild fruit crop with the strongest cold resistance among the Vitis species; it is used as an excellent breeding parent for grapevine and has elicited growing interest in wine production. To date, a relatively large number of grapevine miRNAs (vv-miRNAs) have been reported from cultivated grapevine varieties such as Vitis vinifera L. and hybrids of V. vinifera and V. labrusca, but there is no report on miRNAs from Vitis amurensis Rupr., a wild grapevine species. Results A small RNA library from Amur grape was constructed, and Solexa technology was used to perform deep sequencing of the library, followed by bioinformatics analysis to identify new miRNAs. In total, 126 conserved miRNAs belonging to 27 miRNA families were identified, and 34 known but non-conserved miRNAs were also found. Significantly, 72 new potential Amur grape-specific miRNAs were discovered. The sequences of these new potential va-miRNAs were further validated through miR-RACE, and accumulation of 18 new va-miRNAs in seven tissues of grapevines was confirmed by real-time RT-PCR (qRT-PCR) analysis. The expression levels of va-miRNAs in flowers and berries were found to be largely consistent with those from the deep sequenced sRNA libraries of the combined corresponding tissues. We also describe the conservation and variation of va-miRNAs using miR-SNPs and miR-LDs during plant evolution based on comparison of orthologous sequences, and further reveal that the number and sites of miR-SNPs in diverse miRNA families exhibit distinct divergence. Finally, 346 target genes for the new miRNAs were predicted; they include a number of Amur grape stress tolerance genes and many genes regulating anthocyanin synthesis and sugar metabolism. 
Conclusions Deep sequencing of short RNAs from Amur grape flowers and berries identified 72 new potential miRNAs and 34 known but non-conserved miRNAs, indicating that specific miRNAs exist in Amur grape. These results show that a number of regulatory miRNAs exist in Amur grape and play an important role in Amur grape growth, development, and response to abiotic or biotic stress. PMID:22455456
Nucleotide sequence of an exceptionally long 5.8S ribosomal RNA from Crithidia fasciculata.
Schnare, M N; Gray, M W
1982-01-01
In Crithidia fasciculata, a trypanosomatid protozoan, the large ribosomal subunit contains five small RNA species (e, f, g, i, j) in addition to 5S rRNA [Gray, M.W. (1981) Mol. Cell. Biol. 1, 347-357]. The complete primary sequence of species i is shown here to be pAACGUGUmCGCGAUGGAUGACUUGGCUUCCUAUCUCGUUGA ... AGAmACGCAGUAAAGUGCGAUAAGUGGUApsiCAAUUGmCAGAAUCAUUCAAUUACCGAAUCUUUGAACGAAACGG ... CGCAUGGGAGAAGCUCUUUUGAGUCAUCCCCGUGCAUGCCAUAUUCUCCAmGUGUCGAA(C)OH. This sequence establishes that species i is a 5.8S rRNA, despite its exceptional length (171-172 nucleotides). The extra nucleotides in C. fasciculata 5.8S rRNA are located in a region whose primary sequence and length are highly variable among 5.8S rRNAs, but which is capable of forming a stable hairpin loop structure (the "G+C-rich hairpin"). The sequence of C. fasciculata 5.8S rRNA is no more closely related to that of another protozoan, Acanthamoeba castellanii, than it is to representative 5.8S rRNA sequences from the other eukaryotic kingdoms, emphasizing the deep phylogenetic divisions that seem to exist within the Kingdom Protista. PMID:7079176
Tsuchiya, Mariko; Amano, Kojiro; Abe, Masaya; Seki, Misato; Hase, Sumitaka; Sato, Kengo; Sakakibara, Yasubumi
2016-06-15
Deep sequencing of the transcripts of regulatory non-coding RNA generates footprints of post-transcriptional processes. After sequence reads are obtained, the short reads are mapped to a reference genome, and specific mapping patterns, called read mapping profiles, can be detected that are distinct from random non-functional degradation patterns. These patterns reflect the maturation processes that lead to the production of shorter RNA sequences. Recent next-generation sequencing studies have revealed not only the typical maturation process of miRNAs but also the various processing mechanisms of small RNAs derived from tRNAs and snoRNAs. We developed an algorithm termed SHARAKU to align two read mapping profiles of next-generation sequencing outputs for non-coding RNAs. In contrast with previous work, SHARAKU incorporates the primary and secondary sequence structures into an alignment of read mapping profiles to allow for the detection of common processing patterns. Using a benchmark simulated dataset, SHARAKU exhibited superior performance to previous methods in correctly clustering the read mapping profiles with respect to 5'-end processing and 3'-end processing from degradation patterns and in detecting similar processing patterns in deriving the shorter RNAs. Further, using experimental data of small RNA sequencing for the common marmoset brain, SHARAKU succeeded in identifying the significant clusters of read mapping profiles for similar processing patterns of small derived RNA families expressed in the brain. The source code of our program SHARAKU is available at http://www.dna.bio.keio.ac.jp/sharaku/, and the simulated dataset used in this work is available at the same link. Accession code: The sequence data from the whole RNA transcripts in the hippocampus of the left brain used in this work is available from the DNA DataBank of Japan (DDBJ) Sequence Read Archive (DRA) under the accession number DRA004502. 
Contact: yasu@bio.keio.ac.jp. Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press.
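As a rough illustration of what a read mapping profile is, the following minimal sketch computes per-position read depth along a reference from mapped read intervals. It is not the SHARAKU algorithm, which aligns two such profiles and also uses sequence and secondary structure; the function name `read_mapping_profile`, the 0-based end-exclusive interval convention, and the toy reads are assumptions made only for this example.

```python
from typing import List, Tuple


def read_mapping_profile(reads: List[Tuple[int, int]], length: int) -> List[int]:
    """Return per-position read depth over a reference of the given length.

    Each read is a (start, end) interval, 0-based and end-exclusive
    (an assumed convention for this sketch). Reads are clipped to the
    reference boundaries; empty intervals are ignored.
    """
    # Difference array: +1 where a read starts, -1 just past where it ends.
    diff = [0] * (length + 1)
    for start, end in reads:
        start = max(start, 0)
        end = min(end, length)
        if start >= end:
            continue
        diff[start] += 1
        diff[end] -= 1

    # A running sum of the difference array gives the depth at each position.
    profile = []
    depth = 0
    for delta in diff[:length]:
        depth += delta
        profile.append(depth)
    return profile


# Toy example (hypothetical data): three reads pile up at the 5' end of a
# 10-nt precursor and one read covers positions 6-8.
toy_reads = [(0, 4), (0, 4), (1, 4), (6, 9)]
print(read_mapping_profile(toy_reads, 10))
```

A sharp block of depth at one end of the precursor, as in the toy data, is the kind of pattern that distinguishes specific processing from random degradation, where read starts and ends would be scattered.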
Cornelissen, Marion; Gall, Astrid; Vink, Monique; Zorgdrager, Fokla; Binter, Špela; Edwards, Stephanie; Jurriaans, Suzanne; Bakker, Margreet; Ong, Swee Hoe; Gras, Luuk; van Sighem, Ard; Bezemer, Daniela; de Wolf, Frank; Reiss, Peter; Kellam, Paul; Berkhout, Ben; Fraser, Christophe; van der Kuyl, Antoinette C
2017-07-15
The BEEHIVE (Bridging the Evolution and Epidemiology of HIV in Europe) project aims to analyse nearly-complete viral genomes from >3000 HIV-1 infected Europeans using high-throughput deep sequencing techniques to investigate the virus genetic contribution to virulence. Following the development of a computational pipeline, including a new de novo assembler for RNA virus genomes, to generate larger contiguous sequences (contigs) from the abundance of short sequence reads that characterise the data, another area that determines genome sequencing success is the quality and quantity of the input RNA. A pilot experiment with 125 patient plasma samples was performed to investigate the optimal method for isolation of HIV-1 viral RNA for long amplicon genome sequencing. Manual isolation with the QIAamp Viral RNA Mini Kit (Qiagen) was superior to robotic extraction using either the QIAcube robotic system, the mSample Preparation Systems RNA kit with automated extraction by the m2000sp system (Abbott Molecular), or the MagNA Pure 96 System in combination with the MagNA Pure 96 Instrument (Roche Diagnostics). We scored amplification of a set of four HIV-1 amplicons of ∼1.9, 3.6, 3.0 and 3.5kb, and subsequent recovery of near-complete viral genomes. Subsequently, 616 BEEHIVE patient samples were analysed to determine factors that influence successful amplification of the genome in four overlapping amplicons using the QIAamp Viral RNA Kit for viral RNA isolation. Both low plasma viral load and high sample age (stored before 1999) negatively influenced the amplification of viral amplicons >3kb. A plasma viral load of >100,000 copies/ml resulted in successful amplification of all four amplicons for 86% of the samples; this value dropped to only 46% for samples with viral loads of <20,000 copies/ml. Copyright © 2016 The Authors. Published by Elsevier B.V. All rights reserved.
Porter, Danielle P.; Daeumer, Martin; Thielen, Alexander; Chang, Silvia; Martin, Ross; Cohen, Cal; Miller, Michael D.; White, Kirsten L.
2015-01-01
At Week 96 of the Single-Tablet Regimen (STaR) study, more treatment-naïve subjects that received rilpivirine/emtricitabine/tenofovir DF (RPV/FTC/TDF) developed resistance mutations compared to those treated with efavirenz (EFV)/FTC/TDF by population sequencing. Furthermore, more RPV/FTC/TDF-treated subjects with baseline HIV-1 RNA >100,000 copies/mL developed resistance compared to subjects with baseline HIV-1 RNA ≤100,000 copies/mL. Here, deep sequencing was utilized to assess the presence of pre-existing low-frequency variants in subjects with and without resistance development in the STaR study. Deep sequencing (Illumina MiSeq) was performed on baseline and virologic failure samples for all subjects analyzed for resistance by population sequencing during the clinical study (n = 33), as well as baseline samples from control subjects with virologic response (n = 118). Primary NRTI or NNRTI drug resistance mutations present at low frequency (≥2% to 20%) were detected in 6.6% of baseline samples by deep sequencing, all of which occurred in control subjects. Deep sequencing results were generally consistent with population sequencing but detected additional primary NNRTI and NRTI resistance mutations at virologic failure in seven samples. HIV-1 drug resistance mutations emerging while on RPV/FTC/TDF or EFV/FTC/TDF treatment were not present at low frequency at baseline in the STaR study. PMID:26690199
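The low-frequency range quoted in the abstract above (≥2% to 20%) can be made concrete with a small sketch that bins a variant by its read fraction. This is only an illustration under stated assumptions, not the analysis pipeline used in the STaR study: the function names, the category labels, and the choice to treat the 20% upper bound as inclusive are all hypothetical.

```python
def variant_frequency(alt_reads: int, depth: int) -> float:
    """Fraction of reads at a position that carry the variant."""
    if depth <= 0:
        raise ValueError("depth must be positive")
    if not 0 <= alt_reads <= depth:
        raise ValueError("alt_reads must be between 0 and depth")
    return alt_reads / depth


def classify_variant(alt_reads: int, depth: int,
                     low: float = 0.02, high: float = 0.20) -> str:
    """Bin a variant by frequency using the 2% and 20% cut-offs.

    Assumption for this sketch: frequencies in [low, high] count as
    low frequency; anything above `high` is labelled high frequency.
    """
    freq = variant_frequency(alt_reads, depth)
    if freq < low:
        return "below_threshold"
    if freq <= high:
        return "low_frequency"
    return "high_frequency"


# Hypothetical read counts at a resistance-associated position.
for alt, depth in [(10, 1000), (50, 1000), (600, 1000)]:
    print(alt, depth, classify_variant(alt, depth))
```

In practice the lower cut-off has to sit above the sequencing error rate of the platform, which is why a fixed floor is applied before a variant is reported.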
2013-01-01
Background MicroRNAs (miRNAs) are small non-coding RNAs that play critical roles in regulating post-transcriptional gene expression. Gall midges encompass a large group of insects that are of economic importance and also possess fascinating biological traits. The gall midge Mayetiola destructor, commonly known as the Hessian fly, is a destructive pest of wheat and a model organism for studying gall midge biology and insect – host plant interactions. Results In this study, we systematically analyzed miRNAs from the Hessian fly. Deep-sequencing a Hessian fly larval transcriptome led to the identification of 89 miRNA species that are either identical or very similar to known miRNAs from other insects, and 184 novel miRNAs that have not been reported from other species. A genome-wide search through a draft Hessian fly genome sequence identified a total of 611 putative miRNA-encoding genes based on sequence similarity and the existence of a stem-loop structure for miRNA precursors. Analysis of the 611 putative genes revealed a striking feature: the dramatic expansion of several miRNA gene families. The largest family contained 91 genes that encoded 20 different miRNAs. Microarray analyses revealed that the expression of miRNA genes was strictly regulated during Hessian fly larval development and that the abundance of many miRNA genes was affected by host genotypes. Conclusion The identification of a large number of miRNAs for the first time from a gall midge provides a foundation for further studies of miRNA functions in gall midge biology and behavior. The dramatic expansion of identical or similar miRNAs provides a unique system to study functional relations among miRNA iso-genes as well as changes in sequence specificity due to small changes in miRNAs and in their mRNA targets. These results may also facilitate the identification of miRNA genes for potential pest control through transgenic approaches. PMID:23496979
Identifying MicroRNAs and Transcript Targets in Jatropha Seeds
Galli, Vanessa; Guzman, Frank; de Oliveira, Luiz F. V.; Loss-Morais, Guilherme; Körbes, Ana P.; Silva, Sérgio D. A.; Margis-Pinheiro, Márcia M. A. N.; Margis, Rogério
2014-01-01
MicroRNAs, or miRNAs, are endogenously encoded small RNAs that play a key role in diverse plant biological processes. Jatropha curcas L. has received significant attention as a potential oilseed crop for the production of renewable oil. Here, a sRNA library of mature seeds and three mRNA libraries from three different seed development stages were generated by deep sequencing to identify and characterize the miRNAs and pre-miRNAs of J. curcas. Computational analysis was used for the identification of 180 conserved miRNAs and 41 precursors (pre-miRNAs) as well as 16 novel pre-miRNAs. The predicted miRNA target genes are involved in a broad range of physiological functions, including cellular structure, nuclear function, translation, transport, hormone synthesis, defense, and lipid metabolism. Some pre-miRNA and miRNA targets vary in abundance between the three stages of seed development. A search for sequences that produce siRNA was performed, and the results indicated that J. curcas siRNAs play a role in nuclear functions, transport, catalytic processes and disease resistance. This study presents the first large scale identification of J. curcas miRNAs and their targets in mature seeds based on deep sequencing, and it contributes to a functional understanding of these miRNAs. PMID:24551031
2012-01-01
Background MicroRNAs (miRNAs) are one of the functional non-coding small RNAs involved in the epigenetic control of the plant genome. Although plants contain both evolutionary conserved miRNAs and species-specific miRNAs within their genomes, computational methods often only identify evolutionary conserved miRNAs. The recent sequencing of the Brassica rapa genome enables us to identify miRNAs and their putative target genes. In this study, we sought to provide a more comprehensive prediction of B. rapa miRNAs based on high throughput small RNA deep sequencing. Results We sequenced small RNAs from five types of tissue: seedlings, roots, petioles, leaves, and flowers. By analyzing 2.75 million unique reads that mapped to the B. rapa genome, we identified 216 novel and 196 conserved miRNAs that were predicted to target approximately 20% of the genome’s protein coding genes. Quantitative analysis of miRNAs from the five types of tissue revealed that novel miRNAs were expressed in diverse tissues but their expression levels were lower than those of the conserved miRNAs. Comparative analysis of the miRNAs between the B. rapa and Arabidopsis thaliana genomes demonstrated that redundant copies of conserved miRNAs in the B. rapa genome may have been deleted after whole genome triplication. Novel miRNA members seemed to have spontaneously arisen from the B. rapa and A. thaliana genomes, suggesting the species-specific expansion of miRNAs. We have made this data publicly available in a miRNA database of B. rapa called BraMRs. The database allows the user to retrieve miRNA sequences, their expression profiles, and a description of their target genes from the five tissue types investigated here. Conclusions This is the first report to identify novel miRNAs from Brassica crops using genome-wide high throughput techniques. The combination of computational methods and small RNA deep sequencing provides robust predictions of miRNAs in the genome. 
The finding of numerous novel miRNAs, many with few target genes and low expression levels, suggests the rapid evolution of miRNA genes. The development of a miRNA database, BraMRs, enables us to integrate miRNA identification, target prediction, and functional annotation of target genes. BraMRs will represent a valuable public resource with which to study the epigenetic control of B. rapa and other closely related Brassica species. The database is available at the following link: http://bramrs.rna.kr [1]. PMID:23163954
Li, Chenghua; Feng, Weida; Qiu, Lihua; Xia, Changge; Su, Xiurong; Jin, Chunhua; Zhou, Tingting; Zeng, Yuan; Li, Taiwu
2012-08-01
MicroRNAs (miRNAs) constitute a family of small RNA species that have been demonstrated to be key effectors in mediating host-pathogen interactions. In this study, two haemocyte miRNA libraries, from healthy (L1) and skin ulceration syndrome-affected (L2) Apostichopus japonicus, were constructed and deep sequenced on the Illumina HiSeq 2000. The high-throughput Solexa sequencing yielded 9,579,038 and 7,742,558 clean reads from L1 and L2, respectively. Sequence analysis revealed 40 conserved miRNAs in both libraries, among which let-7 and mir-125 were speculated to be clustered together and expressed accordingly. Eighty-six miRNA candidates were also identified by reference genome search and stem-loop structure prediction. Importantly, mir-31 and mir-2008 displayed significant differential expression between the two libraries according to the FPKM model and might be considered promising targets for elucidating the intrinsic mechanism of skin ulceration syndrome outbreaks in the species. Copyright © 2012 Elsevier Ltd. All rights reserved.
Unusual RNA plant virus integration in the soybean genome leads to the production of small RNAs.
da Fonseca, Guilherme Cordenonsi; de Oliveira, Luiz Felipe Valter; de Morais, Guilherme Loss; Abdelnor, Ricardo Vilela; Nepomuceno, Alexandre Lima; Waterhouse, Peter M; Farinelli, Laurent; Margis, Rogerio
2016-05-01
Horizontal gene transfer (HGT) is known to be a major force in genome evolution. The acquisition of genes from viruses by eukaryotic genomes is a well-studied example of HGT, including rare cases of non-retroviral RNA virus integration. The present study describes the integration of cucumber mosaic virus RNA-1 into the soybean genome. After an initial metatranscriptomic analysis of small RNAs derived from soybean, de novo assembly resulted in a 3029-nt contig homologous to RNA-1. The integration of this sequence in the soybean genome was confirmed by DNA deep sequencing. The locus where the integration occurred harbors the full RNA-1 sequence followed by the partial sequence of an endogenous mRNA and another sequence of RNA-1 as an inverted repeat, allowing the formation of a hairpin structure. This region recombined into a retrotransposon located inside an exon of a soybean gene. The nucleotide similarity of the integrated sequence compared to other cucumber mosaic virus sequences indicates that the integration event occurred recently. We described a rare event of non-retroviral RNA virus integration in soybean that leads to the production of a double-stranded RNA in a similar fashion to virus resistance RNAi plants. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.
USDA-ARS's Scientific Manuscript database
The complete genome sequence of a Southern tomato virus (STV) isolate on tomato plants in a seed production field in Bangladesh was obtained for the first time using next generation sequencing. The identified isolate STV_BD-13 shares a high degree of sequence identity (99%) with several known STV isol...
Complete genome sequence of a tomato infecting tomato mottle mosaic virus in New York
USDA-ARS's Scientific Manuscript database
The complete genome sequence of an emerging isolate of tomato mottle mosaic virus (ToMMV) infecting experimental Nicotiana benthamiana plants in upstate New York was obtained using small RNA deep sequencing. ToMMV_NY-13 shared 99% sequence identity with ToMMV isolates from Mexico and Florida. Broader d...
Deep sequencing reveals cell-type-specific patterns of single-cell transcriptome variation.
Dueck, Hannah; Khaladkar, Mugdha; Kim, Tae Kyung; Spaethling, Jennifer M; Francis, Chantal; Suresh, Sangita; Fisher, Stephen A; Seale, Patrick; Beck, Sheryl G; Bartfai, Tamas; Kuhn, Bernhard; Eberwine, James; Kim, Junhyong
2015-06-09
Differentiation of metazoan cells requires execution of different gene expression programs, but recent single-cell transcriptome profiling has revealed considerable variation within cells of seemingly identical phenotype. This brings into question the relationship between transcriptome states and cell phenotypes. Additionally, single-cell transcriptomics presents unique analysis challenges that need to be addressed to answer this question. We present high quality deep read-depth single-cell RNA sequencing for 91 cells from five mouse tissues and 18 cells from two rat tissues, along with 30 control samples of bulk RNA diluted to single-cell levels. We find that transcriptomes differ globally across tissues with regard to the number of genes expressed, the average expression patterns, and within-cell-type variation patterns. We develop methods to filter genes for reliable quantification and to calibrate biological variation. All cell types include genes with high variability in expression, in a tissue-specific manner. We also find evidence that single-cell variability of neuronal genes in mice is correlated with that in rats, consistent with the hypothesis that levels of variation may be conserved. Single-cell RNA-sequencing data provide a unique view of transcriptome function; however, careful analysis is required in order to use single-cell RNA-sequencing measurements for this purpose. Technical variation must be considered in single-cell RNA-sequencing studies of expression variation. For a subset of genes, biological variability within each cell type appears to be regulated in order to perform dynamic functions, rather than being solely molecular noise.
Ge, Xie; Zhang, Yong; Jiang, Jianhao; Zhong, Yi; Yang, Xiaonan; Li, Zhiqian; Huang, Yongping; Tan, Anjiang
2013-01-01
The current identification of microRNAs (miRNAs) in insects is largely dependent on genome sequences. However, the lack of available genome sequences inhibits the identification of miRNAs in various insect species. In this study, we used a miRNA database of the silkworm Bombyx mori as a reference to identify miRNAs in Helicoverpa armigera and Spodoptera litura using deep sequencing and homology analysis. Because all three species belong to the Lepidoptera, the experiment produced reliable results. Our study identified 97 and 91 conserved miRNAs in H. armigera and S. litura, respectively. Using the genome of B. mori and BAC sequences of H. armigera as references, 1 novel miRNA and 8 novel miRNA candidates were identified in H. armigera, and 4 novel miRNA candidates were identified in S. litura. An evolutionary analysis revealed that most of the identified miRNAs were insect-specific, and more than 20 miRNAs were Lepidoptera-specific. The investigation of the expression patterns of miR-2a, miR-34, miR-2796-3p and miR-11 revealed their potential roles in insect development. miRNA target prediction revealed that conserved miRNA target sites exist in various genes in the 3 species. Conserved miRNA target sites for the Hsp90 gene among the 3 species were validated in the mammalian 293T cell line using a dual-luciferase reporter assay. Our study provides a new approach with which to identify miRNAs in insects lacking genome information and contributes to the functional analysis of insect miRNAs. PMID:23289012
2012-01-01
Background Plants respond to external stimuli through fine regulation of gene expression partially ensured by small RNAs. Of these, microRNAs (miRNAs) play a crucial role. They negatively regulate gene expression by targeting the cleavage or translational inhibition of target messenger RNAs (mRNAs). In Hevea brasiliensis, environmental and harvesting stresses are known to affect natural rubber production. This study set out to identify abiotic stress-related miRNAs in Hevea using next-generation sequencing and bioinformatic analysis. Results Deep sequencing of small RNAs was carried out on plantlets subjected to severe abiotic stress using the Solexa technique. By combining the LeARN pipeline, data from the Plant microRNA database (PMRD) and Hevea EST sequences, we identified 48 conserved miRNA families already characterized in other plant species, and 10 putatively novel miRNA families. The results showed the most abundant size for miRNAs to be 24 nucleotides, except for seven families. Several MIR genes produced both 20-22 nucleotides and 23-27 nucleotides. The two miRNA class sizes were detected for both conserved and putative novel miRNA families, suggesting their functional duality. The EST databases were scanned with conserved and novel miRNA sequences. MiRNA targets were computationally predicted and analysed. The predicted targets involved in "responses to stimuli" and to "antioxidant" and "transcription activities" are presented. Conclusions Deep sequencing of small RNAs combined with transcriptomic data is a powerful tool for identifying conserved and novel miRNAs when the complete genome is not yet available. Our study provided additional information for evolutionary studies and revealed potentially specific regulation of the control of redox status in Hevea. PMID:22330773
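The size-class observation in the abstract above (a most abundant length, with families falling into 20-22 nt and 23-27 nt classes) rests on a simple length tally of small RNA sequences. The sketch below shows one way such a tally could be computed; it is not the LeARN pipeline, and the function names, the tie-breaking rule, and the toy sequences are assumptions for illustration only.

```python
from collections import Counter
from typing import Iterable


def length_distribution(sequences: Iterable[str]) -> Counter:
    """Count how many sequences occur at each length (in nucleotides)."""
    return Counter(len(seq) for seq in sequences)


def most_abundant_length(sequences: Iterable[str]) -> int:
    """Return the most frequent sequence length.

    Ties are broken in favour of the shorter length (an arbitrary
    choice made for this sketch).
    """
    counts = length_distribution(sequences)
    if not counts:
        raise ValueError("no sequences given")
    return min(counts, key=lambda n: (-counts[n], n))


def size_class(length: int) -> str:
    """Assign a length to the two size classes named in the abstract."""
    if 20 <= length <= 22:
        return "20-22 nt"
    if 23 <= length <= 27:
        return "23-27 nt"
    return "other"


# Toy sequences (hypothetical, not real miRNAs): two 21-mers, three 24-mers.
toy = ["A" * 21, "C" * 21, "G" * 24, "U" * 24, "A" * 24]
print(length_distribution(toy))
print(most_abundant_length(toy), size_class(most_abundant_length(toy)))
```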
Carissimo, Guillaume; Eiglmeier, Karin; Reveillaud, Julie; Holm, Inge; Diallo, Mawlouth; Diallo, Diawo; Vantaux, Amélie; Kim, Saorin; Ménard, Didier; Siv, Sovannaroth; Belda, Eugeni; Bischoff, Emmanuel; Antoniewski, Christophe; Vernick, Kenneth D.
2016-01-01
Mosquitoes of the Anopheles gambiae complex display strong preference for human bloodmeals and are major malaria vectors in Africa. However, their interaction with viruses or role in arbovirus transmission during epidemics has been little examined, with the exception of O’nyong-nyong virus, closely related to Chikungunya virus. Deep-sequencing has revealed different RNA viruses in natural insect viromes, but none have been previously described in the Anopheles gambiae species complex. Here, we describe two novel insect RNA viruses, a Dicistrovirus and a Cypovirus, found in laboratory colonies of An. gambiae taxa using small-RNA deep sequencing. Sequence analysis was done with Metavisitor, an open-source bioinformatic pipeline for virus discovery and de novo genome assembly. Wild-collected Anopheles from Senegal and Cambodia were positive for the Dicistrovirus and Cypovirus, displaying high sequence identity to the laboratory-derived virus. Thus, the Dicistrovirus (Anopheles C virus, AnCV) and Cypovirus (Anopheles Cypovirus, AnCPV) are components of the natural virome of at least some anopheline species. Their possible influence on mosquito immunity or transmission of other pathogens is unknown. These natural viruses could be developed as models for the study of Anopheles-RNA virus interactions in low security laboratory settings, in an analogous manner to the use of rodent malaria parasites for studies of mosquito anti-parasite immunity. PMID:27138938
Ono, Hiroyuki; Saitsu, Hirotomo; Horikawa, Reiko; Nakashima, Shinichi; Ohkubo, Yumiko; Yanagi, Kumiko; Nakabayashi, Kazuhiko; Fukami, Maki; Fujisawa, Yasuko; Ogata, Tsutomu
2018-02-02
Although partial androgen insensitivity syndrome (PAIS) is caused by attenuated responsiveness to androgens, androgen receptor gene (AR) mutations on the coding regions and their splice sites have been identified only in <25% of patients with a diagnosis of PAIS. We performed extensive molecular studies including whole exome sequencing in a Japanese family with PAIS, identifying a deep intronic variant beyond the branch site at intron 6 of AR (NM_000044.4:c.2450-42 G > A). This variant created the splice acceptor motif that was accompanied by pyrimidine-rich sequence and two candidate branch sites. Consistent with this, reverse transcriptase (RT)-PCR experiments for cycloheximide-treated lymphoblastoid cell lines revealed a relatively large amount of aberrant mRNA produced by the newly created splice acceptor site and a relatively small amount of wildtype mRNA produced by the normal splice acceptor site. Furthermore, most of the aberrant mRNA was shown to undergo nonsense mediated decay (NMD) and, if a small amount of aberrant mRNA may have escaped NMD, such mRNA was predicted to generate a truncated AR protein missing some functional domains. These findings imply that the deep intronic mutation creating an alternative splice acceptor site resulted in the production of a relatively small amount of wildtype AR mRNA, leading to PAIS.
Genomic maps of lincRNA occupancy reveal principles of RNA-chromatin interactions
Chu, Ci; Qu, Kun; Zhong, Franklin; Artandi, Steven E.; Chang, Howard Y.
2011-01-01
SUMMARY Long intergenic noncoding RNAs (lincRNAs) are key regulators of chromatin state, yet the nature and sites of RNA-chromatin interaction are mostly unknown. Here we introduce Chromatin Isolation by RNA Purification (ChIRP), where tiling oligonucleotides retrieve specific lincRNAs with bound protein and DNA sequences, which are enumerated by deep sequencing. ChIRP-seq of three lincRNAs reveals that RNA occupancy sites in the genome are focal, sequence-specific, and numerous. Drosophila roX2 RNA occupies male X-linked gene bodies with increasing tendency toward the 3’ end, peaking at CES sites. Human telomerase RNA TERC occupies telomeres and Wnt pathway genes. HOTAIR lincRNA preferentially occupies a GA-rich DNA motif to nucleate broad domains of Polycomb occupancy and histone H3 lysine 27 trimethylation. HOTAIR occupancy occurs independently of EZH2, suggesting the order of RNA guidance of Polycomb occupancy. ChIRP-seq is generally applicable to illuminate the intersection of RNA and chromatin with newfound precision genome-wide. PMID:21963238
USDA-ARS's Scientific Manuscript database
The complete genome sequence (6,423 nt) of an emerging Cucumber green mottle mosaic virus (CGMMV) isolate on cucumber in North America was determined through deep sequencing of sRNA and rapid amplification of cDNA ends. It shares 99% nucleotide sequence identity to the Asian genotype, but only 90% t...
Fu, Lu-Lu; Xu, Ying; Li, Dan-Dan; Dai, Xiao-Wei; Xu, Xin; Zhang, Jing-Shun; Ming, Hao; Zhang, Xue-Ying; Zhang, Guo-Qing; Ma, Ya-Lan; Zheng, Lian-Wen
2018-05-30
Polycystic ovary syndrome (PCOS) is one of the most common endocrine disorders in reproductive-aged women. However, the exact pathophysiology of PCOS remains largely unclear. We performed deep sequencing to investigate the mRNA and long noncoding RNA (lncRNA) expression profiles in the ovarian tissues of a letrozole-induced PCOS rat model and control rats. A total of 2147 mRNAs and 158 lncRNAs were differentially expressed between the PCOS models and controls. Gene ontology analysis indicated that differentially expressed mRNAs were associated with biological adhesion, reproduction, and metabolic process. Pathway analysis results indicated that these aberrantly expressed mRNAs were related to several specific signaling pathways, including insulin resistance, steroid hormone biosynthesis, PPAR signaling pathway, cell adhesion molecules, autoimmune thyroid disease, and AMPK signaling pathway. The relative expression levels of mRNAs and lncRNAs were validated through qRT-PCR. An lncRNA-miRNA-mRNA network was constructed to explore ceRNAs involved in the PCOS model, and these were also verified by qRT-PCR. These findings may provide insight into the pathogenesis of PCOS and clues to the key diagnostic and therapeutic roles of lncRNAs in PCOS. Copyright © 2018 Elsevier B.V. All rights reserved.
Purpose: High-risk neuroblastoma is an aggressive disease. DNA sequencing studies have revealed a paucity of actionable genomic alterations and a low mutation burden, posing challenges to develop effective novel therapies. We used RNA sequencing (RNA-seq) to investigate the biology of this disease including a focus on tumor-infiltrating lymphocytes (TILs). Experimental Design: We performed deep RNA-seq on pre-treatment diagnostic tumors from 129 high-risk and 21 low- or intermediate-risk patients with neuroblastomas.
Brouilette, Scott; Kuersten, Scott; Mein, Charles; Bozek, Monika; Terry, Anna; Dias, Kerith-Rae; Bhaw-Rosun, Leena; Shintani, Yasunori; Coppen, Steven; Ikebe, Chiho; Sawhney, Vinit; Campbell, Niall; Kaneko, Masahiro; Tano, Nobuko; Ishida, Hidekazu; Suzuki, Ken; Yashiro, Kenta
2012-10-01
Deep sequencing of single cell-derived cDNAs offers novel insights into oncogenesis and embryogenesis. However, traditional library preparation for RNA-seq analysis requires multiple steps with consequent sample loss and stochastic variation at each step significantly affecting output. Thus, a simpler and better protocol is desirable. The recently developed hyperactive Tn5-mediated library preparation, which brings high quality libraries, is likely one of the solutions. Here, we tested the applicability of hyperactive Tn5-mediated library preparation to deep sequencing of single cell cDNA, optimized the protocol, and compared it with the conventional method based on sonication. This new technique does not require any expensive or special equipment, which secures wider availability. A library was constructed from only 100 ng of cDNA, which enables the saving of precious specimens. Only a few steps of robust enzymatic reaction resulted in saved time, enabling more specimens to be prepared at once, and with a more reproducible size distribution among the different specimens. The obtained RNA-seq results were comparable to the conventional method. Thus, this Tn5-mediated preparation is applicable for anyone who aims to carry out deep sequencing for single cell cDNAs. Copyright © 2012 Wiley Periodicals, Inc.
Liu, Tong; Hu, John; Zuo, Yuhu; Jin, Yazhong; Hou, Jumei
2016-04-01
Deep sequencing of small RNAs is a useful tool to identify novel small RNAs that may be involved in fungal growth and pathogenesis. In this study, we used HiSeq deep sequencing to identify 747,487 unique small RNAs from Curvularia lunata. Among these small RNAs, 1012 microRNA-like RNAs (milRNAs) similar to other known microRNAs and 48 potential novel milRNAs without homologs in other organisms were identified using the miRBase database. We used quantitative PCR to analyze the expression of four of these milRNAs from C. lunata at different developmental stages. The analysis revealed several changes associated with germinating conidia and mycelial growth, suggesting that these milRNAs may play a role in pathogen infection and mycelial growth. Computational analysis predicted a total of 8334 target mRNAs for the 1012 identified milRNAs and 256 target mRNAs for the 48 novel milRNAs. These target mRNAs were also subjected to gene ontology and Kyoto Encyclopedia of Genes and Genomes pathway analysis. To our knowledge, this study is the first report of C. lunata's milRNA profiles. This information will provide a better understanding of pathogen development and infection mechanisms.
Rathe, Susan K; Moriarity, Branden S; Stoltenberg, Christopher B; Kurata, Morito; Aumann, Natalie K; Rahrmann, Eric P; Bailey, Natashay J; Melrose, Ellen G; Beckmann, Dominic A; Liska, Chase R; Largaespada, David A
2014-08-13
The evolution from microarrays to transcriptome deep-sequencing (RNA-seq) and from RNA interference to gene knockouts using Clustered Regularly Interspaced Short Palindromic Repeats (CRISPRs) and Transcription Activator-Like Effector Nucleases (TALENs) has provided a new experimental partnership for identifying and quantifying the effects of gene changes on drug resistance. Here we describe the results from deep-sequencing of RNA derived from two cytarabine (Ara-C) resistant acute myeloid leukemia (AML) cell lines, and present CRISPR and TALEN based methods for accomplishing complete gene knockout (KO) in AML cells. We found protein modifying loss-of-function mutations in Dck in both Ara-C resistant cell lines. CRISPR and TALEN-based KO of Dck dramatically increased the IC₅₀ of Ara-C, and introduction of a DCK overexpression vector into Dck KO clones resulted in a significant increase in Ara-C sensitivity. This effort demonstrates the power of using transcriptome analysis and CRISPR/TALEN-based KOs to identify and verify genes associated with drug resistance.
Wu, Jieying; Gao, Weimin; Zhang, Weiwen; Meldrum, Deirdre R
2011-01-01
Limitation in sample quality and quantity is one of the big obstacles for applying metatranscriptomic technologies to explore gene expression and functionality of microbial communities in natural environments. In this study, several amplification methods were evaluated for whole-transcriptome amplification of deep-sea microbial samples, which are of low cell density and high impurity. The best amplification method was identified and incorporated into a complete protocol to isolate and amplify deep-sea microbial samples. In the protocol, total RNA was first isolated by a modified method combining Trizol (Invitrogen, CA) and RNeasy (QIAGEN, CA) method, amplified with a WT-Ovation™ Pico RNA Amplification System (NuGEN, CA), and then converted to double-strand DNA from single-strand cDNA with a WT-Ovation™ Exon Module (NuGEN, CA). The products from the whole-transcriptome amplification of deep-sea microbial samples were assessed first through random clone library sequencing. The BLAST search results showed that marine-based sequences are dominant in the libraries, consistent with the ecological source of the samples. The products were then used for next-generation Roche GS FLX Titanium sequencing to obtain metatranscriptome data. Preliminary analysis of the metatranscriptomic data showed good sequencing quality. Although the protocol was designed and demonstrated to be effective for deep-sea microbial samples, it should be applicable to similar samples from other extreme environments in exploring community structure and functionality of microbial communities. Copyright © 2010 Elsevier B.V. All rights reserved.
Bass, David; Moureau, Gregory; Tang, Shuoya; McAlister, Erica; Culverwell, C. Lorna; Glücksman, Edvard; Wang, Hui; Brown, T. David K.; Gould, Ernest A.; Harbach, Ralph E.; de Lamballerie, Xavier; Firth, Andrew E.
2013-01-01
We investigated whether small RNA (sRNA) sequenced from field-collected mosquitoes and chironomids (Diptera) can be used as a proxy signature of viral prevalence within a range of species and viral groups, using sRNAs sequenced from wild-caught specimens, to inform total RNA deep sequencing of samples of particular interest. Using this strategy, we sequenced from adult Anopheles maculipennis s.l. mosquitoes the apparently nearly complete genome of one previously undescribed virus related to chronic bee paralysis virus, and, from a pool of Ochlerotatus caspius and Oc. detritus mosquitoes, a nearly complete entomobirnavirus genome. We also reconstructed long sequences (1503-6557 nt) related to at least nine other viruses. Crucially, several of the sequences detected were reconstructed from host organisms highly divergent from those in which related viruses have been previously isolated or discovered. It is clear that viral transmission and maintenance cycles in nature are likely to be significantly more complex and taxonomically diverse than previously expected. PMID:24260463
Sequence, Structure, and Context Preferences of Human RNA Binding Proteins.
Dominguez, Daniel; Freese, Peter; Alexis, Maria S; Su, Amanda; Hochman, Myles; Palden, Tsultrim; Bazile, Cassandra; Lambert, Nicole J; Van Nostrand, Eric L; Pratt, Gabriel A; Yeo, Gene W; Graveley, Brenton R; Burge, Christopher B
2018-06-07
RNA binding proteins (RBPs) orchestrate the production, processing, and function of mRNAs. Here, we present the affinity landscapes of 78 human RBPs using an unbiased assay that determines the sequence, structure, and context preferences of these proteins in vitro by deep sequencing of bound RNAs. These data enable construction of "RNA maps" of RBP activity without requiring crosslinking-based assays. We found an unexpectedly low diversity of RNA motifs, implying frequent convergence of binding specificity toward a relatively small set of RNA motifs, many with low compositional complexity. Offsetting this trend, however, we observed extensive preferences for contextual features distinct from short linear RNA motifs, including spaced "bipartite" motifs, biased flanking nucleotide composition, and bias away from or toward RNA structure. Our results emphasize the importance of contextual features in RNA recognition, which likely enable targeting of distinct subsets of transcripts by different RBPs that recognize the same linear motif. Copyright © 2018 The Authors. Published by Elsevier Inc. All rights reserved.
Identifying active foraminifera in the Sea of Japan using metatranscriptomic approach
NASA Astrophysics Data System (ADS)
Lejzerowicz, Franck; Voltsky, Ivan; Pawlowski, Jan
2013-02-01
Metagenetics represents an efficient and rapid tool to describe environmental diversity patterns of microbial eukaryotes based on ribosomal DNA sequences. However, the results of metagenetic studies are often biased by the presence of extracellular DNA molecules that are persistent in the environment, especially in deep-sea sediment. As an alternative, short-lived RNA molecules constitute a good proxy for the detection of active species. Here, we used a metatranscriptomic approach based on RNA-derived (cDNA) sequences to study the diversity of the deep-sea benthic foraminifera and compared it to the metagenetic approach. We analyzed 257 ribosomal DNA and cDNA sequences obtained from seven sediment samples collected in the Sea of Japan at depths ranging from 486 to 3665 m. The DNA and RNA-based approaches gave a similar view of the taxonomic composition of the foraminiferal assemblage, but differed in some important points. First, the cDNA dataset was dominated by sequences of rotaliids and robertiniids, suggesting that these calcareous species, some of which have been observed in Rose Bengal stained samples, are the most active component of the foraminiferal community. Second, the richness of monothalamous (single-chambered) foraminifera was particularly high in DNA extracts from the deepest samples, confirming that this group of foraminifera is abundant but not necessarily very active in the deep-sea sediments. Finally, the high divergence of undetermined sequences in the cDNA dataset indicates the limits of our database and a lack of knowledge about some active but possibly rare species. Our study demonstrates the capability of the metatranscriptomic approach to detect active foraminiferal species and prompts its use in future high-throughput sequencing-based environmental surveys.
Microbial community structure in three deep-sea carbonate crusts.
Heijs, S K; Aloisi, G; Bouloubassi, I; Pancost, R D; Pierre, C; Sinninghe Damsté, J S; Gottschal, J C; van Elsas, J D; Forney, L J
2006-10-01
Carbonate crusts in marine environments can act as sinks for carbon dioxide. Therefore, understanding carbonate crust formation could be important for understanding global warming. In the present study, the microbial communities of three carbonate crust samples from deep-sea mud volcanoes in the eastern Mediterranean were characterized by sequencing 16S ribosomal RNA (rRNA) genes amplified from DNA directly retrieved from the samples. In combination with the mineralogical composition of the crusts and lipid analyses, sequence data were used to assess the possible role of prokaryotes in crust formation. Collectively, the obtained data showed the presence of highly diverse communities, which were distinct in each of the carbonate crusts studied. Bacterial 16S rRNA gene sequences were found in all crusts and the majority was classified as alpha-, gamma-, and delta- Proteobacteria. Interestingly, sequences of Proteobacteria related to Halomonas and Halovibrio sp., which can play an active role in carbonate mineral formation, were present in all crusts. Archaeal 16S rRNA gene sequences were retrieved from two of the crusts studied. Several of those were closely related to archaeal sequences of organisms that have previously been linked to the anaerobic oxidation of methane (AOM). However, the majority of archaeal sequences were not related to sequences of organisms known to be involved in AOM. In combination with the strongly negative delta 13C values of archaeal lipids, these results open the possibility that organisms with a role in AOM may be more diverse within the Archaea than previously suggested. Different communities found in the crusts could carry out similar processes that might play a role in carbonate crust formation.
Characterization of the mammalian miRNA turnover landscape
Guo, Yanwen; Liu, Jun; Elfenbein, Sarah J.; Ma, Yinghong; Zhong, Mei; Qiu, Caihong; Ding, Ye; Lu, Jun
2015-01-01
Steady state cellular microRNA (miRNA) levels represent the balance between miRNA biogenesis and turnover. The kinetics and sequence determinants of mammalian miRNA turnover during and after miRNA maturation are not fully understood. Through a large-scale study on mammalian miRNA turnover, we report the co-existence of multiple cellular miRNA pools with distinct turnover kinetics and biogenesis properties and reveal previously unrecognized sequence features for fast turnover miRNAs. We measured miRNA turnover rates in eight mammalian cell types with a combination of expression profiling and deep sequencing. While most miRNAs are stable, a subset of miRNAs, mostly miRNA*s, turn over quickly, and many of these display two-step turnover kinetics. Moreover, different sequence isoforms of the same miRNA can possess vastly different turnover rates. Fast turnover miRNA isoforms are enriched for 5′ nucleotide bias against Argonaute-(AGO)-loading, but also additional 3′ and central sequence features. Modeling based on two fast turnover miRNA*s, miR-222-5p and miR-125b-1-3p, we unexpectedly found that while both miRNA*s are associated with AGO, they strongly differ in HSP90 association and sensitivity to HSP90 inhibition. Our data characterize the landscape of genome-wide miRNA turnover in cultured mammalian cells and reveal differential HSP90 requirements for different miRNA*s. Our findings also implicate rules for designing stable small RNAs, such as siRNAs. PMID:25653157
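As a rough illustration of what a turnover rate means in the abstract above, the sketch below converts two abundance measurements into a decay constant and a half-life. It assumes simple first-order (single-exponential) decay after biogenesis is blocked, which is an assumption made here for illustration and would not capture the two-step kinetics the authors report; the function names and the numbers are hypothetical.

```python
import math


def decay_rate(level_t0, level_t1, hours):
    """First-order decay constant k (per hour) from two abundance measurements.

    Assumes level(t) = level_t0 * exp(-k * t), an illustrative model only.
    """
    if level_t0 <= 0 or level_t1 <= 0 or hours <= 0:
        raise ValueError("levels and elapsed time must be positive")
    return math.log(level_t0 / level_t1) / hours


def half_life(k):
    """Half-life in hours for a first-order decay constant k."""
    if k <= 0:
        raise ValueError("no net decay, so the half-life is undefined")
    return math.log(2) / k


# Invented numbers: abundance falls from 100 to 25 units over 8 hours.
k = decay_rate(100.0, 25.0, 8.0)
print(f"k = {k:.4f} per hour, half-life = {half_life(k):.2f} hours")
```

A miRNA with two-step kinetics would need at least two such constants fitted to more than two time points.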
Shahinas, Dea; Silverman, Michael; Sittler, Taylor; Chiu, Charles; Kim, Peter; Allen-Vercoe, Emma; Weese, Scott; Wong, Andrew; Low, Donald E.; Pillai, Dylan R.
2012-01-01
ABSTRACT Fecal microbiome transplantation by low-volume enema is an effective, safe, and inexpensive alternative to antibiotic therapy for patients with chronic relapsing Clostridium difficile infection (CDI). We explored the microbial diversity of pre- and posttransplant stool specimens from CDI patients (n = 6) using deep sequencing of the 16S rRNA gene. While interindividual variability in microbiota change occurs with fecal transplantation and vancomycin exposure, in this pilot study we note that clinical cure of CDI is associated with an increase in diversity and richness. Genus- and species-level analysis may reveal a cocktail of microorganisms or products thereof that will ultimately be used as a probiotic to treat CDI. PMID:23093385
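The abstract above reports changes in diversity and richness without naming the indices used. As a minimal sketch, assuming richness is the count of observed taxa and diversity is the Shannon index, both can be computed from a table of 16S read counts per taxon. The taxon labels and counts below are invented for illustration.

```python
import math


def richness(counts):
    """Number of taxa observed at least once."""
    return sum(1 for n in counts.values() if n > 0)


def shannon_index(counts):
    """Shannon diversity H = -sum(p_i * ln p_i) over observed taxa."""
    total = sum(counts.values())
    if total <= 0:
        raise ValueError("at least one read is required")
    h = 0.0
    for n in counts.values():
        if n > 0:
            p = n / total
            h -= p * math.log(p)
    return h


# Invented read counts per genus, for illustration only.
pre_transplant = {"genus_A": 900, "genus_B": 100}
post_transplant = {"genus_A": 300, "genus_B": 300, "genus_C": 250, "genus_D": 150}

for label, sample in (("pre", pre_transplant), ("post", post_transplant)):
    print(label, richness(sample), round(shannon_index(sample), 3))
```

In practice samples are usually rarefied or otherwise normalized to a common depth before such indices are compared, a step this sketch omits.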
Vernick, Kenneth D.
2017-01-01
Metavisitor is a software package that allows biologists and clinicians without specialized bioinformatics expertise to detect and assemble viral genomes from deep sequence datasets. The package is composed of a set of modular bioinformatic tools and workflows that are implemented in the Galaxy framework. Using the graphical Galaxy workflow editor, users with minimal computational skills can use existing Metavisitor workflows or adapt them to suit specific needs by adding or modifying analysis modules. Metavisitor works with DNA, RNA or small RNA sequencing data over a range of read lengths and can use a combination of de novo and guided approaches to assemble genomes from sequencing reads. We show that the software has the potential for quick diagnosis as well as discovery of viruses from a vast array of organisms. Importantly, we provide here executable Metavisitor use cases, which increase the accessibility and transparency of the software, ultimately enabling biologists or clinicians to focus on biological or medical questions. PMID:28045932
Bazak, Lily; Haviv, Ami; Barak, Michal; Jacob-Hirsch, Jasmine; Deng, Patricia; Zhang, Rui; Isaacs, Farren J; Rechavi, Gideon; Li, Jin Billy; Eisenberg, Eli; Levanon, Erez Y
2014-03-01
RNA molecules transmit the information encoded in the genome and generally reflect its content. Adenosine-to-inosine (A-to-I) RNA editing by ADAR proteins converts a genomically encoded adenosine into inosine. It is known that most RNA editing in human takes place in the primate-specific Alu sequences, but the extent of this phenomenon and its effect on transcriptome diversity are not yet clear. Here, we analyzed large-scale RNA-seq data and detected ∼1.6 million editing sites. As detection sensitivity increases with sequencing coverage, we performed ultradeep sequencing of selected Alu sequences and showed that the scope of editing is much larger than anticipated. We found that virtually all adenosines within Alu repeats that form double-stranded RNA undergo A-to-I editing, although most sites exhibit editing at only low levels (<1%). Moreover, using high coverage sequencing, we observed editing of transcripts resulting from residual antisense expression, doubling the number of edited sites in the human genome. Based on bioinformatic analyses and deep targeted sequencing, we estimate that there are over 100 million human Alu RNA editing sites, located in the majority of human genes. These findings set the stage for exploring how this primate-specific massive diversification of the transcriptome is utilized.
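The link between coverage and detection sensitivity noted above can be made concrete with a small sketch. Inosine is read as guanosine during sequencing, so the editing level at a site is commonly estimated as the fraction of G reads among A and G reads. The binomial detection model below is an assumption made for illustration (it ignores sequencing errors and mapping artifacts) and is not the authors' pipeline; the function names are hypothetical.

```python
from math import comb


def editing_level(a_reads, g_reads):
    """Fraction of reads showing G at a genomically encoded A."""
    total = a_reads + g_reads
    if total == 0:
        raise ValueError("site has no coverage")
    return g_reads / total


def detection_probability(level, coverage, min_edited_reads=1):
    """P(at least min_edited_reads edited reads) under a binomial model."""
    p_below = sum(
        comb(coverage, i) * level**i * (1 - level) ** (coverage - i)
        for i in range(min_edited_reads)
    )
    return 1.0 - p_below


# A site edited at 1% is often missed at 100x but rarely at 1000x coverage.
for coverage in (100, 1000):
    print(coverage, round(detection_probability(0.01, coverage), 4))
```

Requiring more than one edited read (`min_edited_reads`) guards against sequencing errors at the cost of needing still deeper coverage.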
Chen, Muyan; Zhang, Xiumei; Liu, Jianning; Storey, Kenneth B.
2013-01-01
The regulatory role of miRNA in gene expression is an emerging topic in the control of hypometabolism. Sea cucumber aestivation is a complicated physiological process that includes obvious hypometabolism, as evidenced by a decrease in the rates of oxygen consumption and ammonia nitrogen excretion, as well as a serious degeneration of the intestine into a very tiny filament. To determine whether miRNAs play regulatory roles in this process, the present study analyzed profiles of miRNA expression in the intestine of the sea cucumber (Apostichopus japonicus), using Solexa deep sequencing technology. We identified 308 sea cucumber miRNAs, including 18 novel miRNAs specific to sea cucumber. Animals sampled during deep aestivation (DA; after at least 15 days of continuous torpor) were compared with animals from a non-aestivation (NA) state (animals that had passed through aestivation and returned to the active state). We identified 42 differentially expressed miRNAs [RPM (reads per million) >10, |FC| (|fold change|) ≥1, FDR (false discovery rate) <0.01] during aestivation, which were validated by two other miRNA profiling methods: miRNA microarray and real-time PCR. Among the most prominent miRNA species, miR-200-3p, miR-2004, miR-2010, miR-22, miR-252a, miR-252a-3p and miR-92 were significantly over-expressed during deep aestivation compared with non-aestivation animals. Preliminary analyses of their putative target genes and GO analysis suggest that these miRNAs could play important roles in global transcriptional depression and cell differentiation during aestivation. High-throughput sequencing data and microarray data have been submitted to the GEO database. PMID:24143179
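The three thresholds quoted in the abstract above (RPM >10, |FC| ≥1, FDR <0.01) amount to a simple filter over per-miRNA statistics. The sketch below is illustrative only: the record layout, the field names, and the choice to apply the RPM cutoff to the higher of the two conditions are assumptions, and the abstract does not state the scale on which FC is expressed. The miRNA names and values are invented.

```python
from dataclasses import dataclass


@dataclass
class MirnaStat:
    name: str
    rpm_da: float       # reads per million, deep aestivation (hypothetical field)
    rpm_na: float       # reads per million, non-aestivation (hypothetical field)
    fold_change: float  # FC on whatever scale the analysis used
    fdr: float          # false discovery rate


def is_differential(s, min_rpm=10.0, min_abs_fc=1.0, max_fdr=0.01):
    """Apply the RPM > 10, |FC| >= 1, FDR < 0.01 filter to one record."""
    expressed = max(s.rpm_da, s.rpm_na) > min_rpm
    return expressed and abs(s.fold_change) >= min_abs_fc and s.fdr < max_fdr


# Invented records for illustration.
stats = [
    MirnaStat("mir-example-1", 250.0, 60.0, 2.1, 0.001),  # passes all filters
    MirnaStat("mir-example-2", 4.0, 2.0, 1.0, 0.001),     # too lowly expressed
    MirnaStat("mir-example-3", 90.0, 80.0, 0.2, 0.0005),  # fold change too small
    MirnaStat("mir-example-4", 30.0, 120.0, -2.0, 0.2),   # not significant
]
selected = [s.name for s in stats if is_differential(s)]
print(selected)
```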
Comparing sequencing assays and human-machine analyses in actionable genomics for glioblastoma.
Wrzeszczynski, Kazimierz O; Frank, Mayu O; Koyama, Takahiko; Rhrissorrakrai, Kahn; Robine, Nicolas; Utro, Filippo; Emde, Anne-Katrin; Chen, Bo-Juen; Arora, Kanika; Shah, Minita; Vacic, Vladimir; Norel, Raquel; Bilal, Erhan; Bergmann, Ewa A; Moore Vogel, Julia L; Bruce, Jeffrey N; Lassman, Andrew B; Canoll, Peter; Grommes, Christian; Harvey, Steve; Parida, Laxmi; Michelini, Vanessa V; Zody, Michael C; Jobanputra, Vaidehi; Royyuru, Ajay K; Darnell, Robert B
2017-08-01
To analyze a glioblastoma tumor specimen with 3 different platforms and compare potentially actionable calls from each. Tumor DNA was analyzed by a commercial targeted panel. In addition, tumor-normal DNA was analyzed by whole-genome sequencing (WGS) and tumor RNA was analyzed by RNA sequencing (RNA-seq). The WGS and RNA-seq data were analyzed by a team of bioinformaticians and cancer oncologists, and separately by IBM Watson Genomic Analytics (WGA), an automated system for prioritizing somatic variants and identifying drugs. More variants were identified by WGS/RNA analysis than by targeted panels. WGA completed a comparable analysis in a fraction of the time required by the human analysts. The development of an effective human-machine interface in the analysis of deep cancer genomic datasets may provide potentially clinically actionable calls for individual patients in a more timely and efficient manner than currently possible. NCT02725684.
Chen, Zhouwei; Li, Lufeng; Shan, Zhan; Huang, Hannian; Chen, Huan; Ding, Xianfeng; Guo, Jiangfeng; Liu, Lili
2016-11-01
Kineococcus radiotolerans is a Gram-positive, radio-resistant bacterium isolated from a radioactive environment. The small noncoding RNAs (sRNAs) in bacteria are reported to play roles in the immediate response to stress and/or the recovery from stress. Genome-wide analysis of K. radiotolerans transcriptome data obtained by deep RNA sequencing (RNA-seq) can identify these sRNAs. In this study, the raw data of radiation-exposed samples (RS) and control samples (CS) were acquired separately from the sequencing platform. Genome-wide bioinformatic screening identified 217 sRNA candidates common to the two samples. Forty-three sRNA candidates were differentially expressed, including 28 up-regulated and 15 down-regulated ones. The down-regulated sRNAs were selected for sRNA target prediction; of these, 12 sRNAs that may modulate genes related to transcription regulation and DNA repair were considered candidates involved in the radio-resistance regulation system. Copyright © 2016 Elsevier GmbH. All rights reserved.
Wang, Zheng Jia; Huang, Jian Qin; Huang, You Jun; Li, Zheng; Zheng, Bing Song
2012-08-01
Hickory (Carya cathayensis Sarg.) is an economically important woody plant in China, but its long juvenile phase delays yield. MicroRNAs (miRNAs) are critical regulators of genes and important for normal plant development and physiology, including flower development. We used Solexa technology to sequence two small RNA libraries from two floral differentiation stages in hickory to identify miRNAs related to flower development. We identified 39 conserved miRNA sequences from 114 loci belonging to 23 families as well as two novel and ten potential novel miRNAs belonging to nine families. Moreover, 35 conserved miRNA*s and two novel miRNA*s were detected. Twenty miRNA sequences from 49 loci belonging to 11 families were differentially expressed; all were up-regulated at the later stage of flower development in hickory. Quantitative real-time PCR of 12 conserved miRNA sequences, five novel miRNA families, and two novel miRNA*s validated that all were expressed during hickory flower development, and the expression patterns were similar to those detected with Solexa sequencing. Finally, a total of 146 targets of the novel and conserved miRNAs were predicted. This study identified a diverse set of miRNAs that were closely related to hickory flower development and that could help in plant floral induction.
Wang, Peter Lincoln; Lacayo, Norman; Brown, Patrick O.
2012-01-01
Most human pre-mRNAs are spliced into linear molecules that retain the exon order defined by the genomic sequence. By deep sequencing of RNA from a variety of normal and malignant human cells, we found RNA transcripts from many human genes in which the exons were arranged in a non-canonical order. Statistical estimates and biochemical assays provided strong evidence that a substantial fraction of the spliced transcripts from hundreds of genes are circular RNAs. Our results suggest that a non-canonical mode of RNA splicing, resulting in a circular RNA isoform, is a general feature of the gene expression program in human cells. PMID:22319583
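The notion of exons "arranged in a non-canonical order" can be sketched as a check on splice junctions. The example assumes each junction has already been reduced to a pair of exon numbers within one gene (donor exon, acceptor exon), counted along the genomic sequence. That representation and the function name are hypothetical and are not the authors' method.

```python
def classify_junction(donor_exon, acceptor_exon):
    """Label a splice junction by exon order within a single gene.

    A canonical junction joins an exon to a later one. A junction that
    joins an exon to the same or an earlier exon has scrambled order.
    """
    if donor_exon < 1 or acceptor_exon < 1:
        raise ValueError("exon numbers start at 1")
    return "canonical" if acceptor_exon > donor_exon else "scrambled"


# Invented junctions: (donor exon, acceptor exon).
junctions = [(1, 2), (2, 3), (3, 2), (4, 2), (2, 2)]
labels = [classify_junction(d, a) for d, a in junctions]
scrambled_fraction = labels.count("scrambled") / len(labels)
print(labels, scrambled_fraction)
```

Scrambled exon order alone does not establish circularity, which is why the abstract also cites statistical estimates and biochemical assays.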
Shenoy, Archana; Blelloch, Robert
2009-09-11
The Microprocessor, containing the RNA binding protein Dgcr8 and RNase III enzyme Drosha, is responsible for processing primary microRNAs to precursor microRNAs. The Microprocessor regulates its own levels by cleaving hairpins in the 5'UTR and coding region of the Dgcr8 mRNA, thereby destabilizing the mature transcript. To determine whether the Microprocessor has a broader role in directly regulating other coding mRNA levels, we integrated results from expression profiling and ultra high-throughput deep sequencing of small RNAs. Expression analysis of mRNAs in wild-type, Dgcr8 knockout, and Dicer knockout mouse embryonic stem (ES) cells uncovered mRNAs that were specifically upregulated in the Dgcr8 null background. A number of these transcripts had evolutionarily conserved predicted hairpin targets for the Microprocessor. However, analysis of deep sequencing data of 18 to 200nt small RNAs in mouse ES, HeLa, and HepG2 indicates that exonic sequence reads that map in a pattern consistent with Microprocessor activity are unique to Dgcr8. We conclude that the Microprocessor's role in directly destabilizing coding mRNAs is likely specifically targeted to Dgcr8 itself, suggesting a specialized cellular mechanism for gene auto-regulation.
Asha, Srinivasan; Sreekumar, Sweda; Soniya, E V
2016-01-01
Analysis of high-throughput small RNA deep sequencing data, in combination with black pepper transcriptome sequences, revealed microRNA-mediated gene regulation in black pepper (Piper nigrum L.). Black pepper is an important spice crop and its berries are used worldwide as a natural food additive that contributes unique flavour to foods. In the present study, to characterize microRNAs from black pepper, we generated a small RNA library from black pepper leaf and sequenced it by Illumina high-throughput sequencing technology. MicroRNAs belonging to a total of 303 conserved miRNA families were identified from the sRNAome data. Subsequent analysis of the recently sequenced black pepper transcriptome confirmed precursor sequences of 50 conserved miRNAs and four potential novel miRNA candidates. Stem-loop qRT-PCR experiments demonstrated differential expression of eight conserved miRNAs in black pepper. Computational analysis of targets of the miRNAs showed 223 potential black pepper unigene targets that encode diverse transcription factors and enzymes involved in plant development, disease resistance, metabolic and signalling pathways. RLM-RACE experiments further mapped miRNA-mediated cleavage at five of the mRNA targets. In addition, miRNA isoforms corresponding to 18 miRNA families were also identified from black pepper. This study presents the first large-scale identification of microRNAs from black pepper and provides the foundation for future studies of miRNA-mediated gene regulation of stress responses and diverse metabolic processes in black pepper.
Deep sequencing of foot-and-mouth disease virus reveals RNA sequences involved in genome packaging.
Logan, Grace; Newman, Joseph; Wright, Caroline F; Lasecka-Dykes, Lidia; Haydon, Daniel T; Cottam, Eleanor M; Tuthill, Tobias J
2017-10-18
Non-enveloped viruses protect their genomes by packaging them into an outer shell or capsid of virus-encoded proteins. Packaging and capsid assembly in RNA viruses can involve interactions between capsid proteins and secondary structures in the viral genome as exemplified by the RNA bacteriophage MS2 and as proposed for other RNA viruses of plants, animals and human. In the picornavirus family of non-enveloped RNA viruses, the requirements for genome packaging remain poorly understood. Here we show a novel and simple approach to identify predicted RNA secondary structures involved in genome packaging in the picornavirus foot-and-mouth disease virus (FMDV). By interrogating deep sequencing data generated from both packaged and unpackaged populations of RNA we have determined multiple regions of the genome with constrained variation in the packaged population. Predicted secondary structures of these regions revealed stem loops with conservation of structure and a common motif at the loop. Disruption of these features resulted in attenuation of virus growth in cell culture due to a reduction in assembly of mature virions. This study provides evidence for the involvement of predicted RNA structures in picornavirus packaging and offers a readily transferable methodology for identifying packaging requirements in many other viruses. Importance In order to transmit their genetic material to a new host, non-enveloped viruses must protect their genomes by packaging them into an outer shell or capsid of virus-encoded proteins. For many non-enveloped RNA viruses the requirements for this critical part of the viral life cycle remain poorly understood. We have identified RNA sequences involved in genome packaging of the picornavirus foot-and-mouth disease virus. This virus causes an economically devastating disease of livestock affecting both the developed and developing world. 
The experimental methods developed to carry out this work are novel, simple and transferable to the study of packaging signals in other RNA viruses. Improved understanding of RNA packaging may lead to novel vaccine approaches or targets for antiviral drugs with broad spectrum activity.
Deep Sequencing Reveals a Divergent Ugandan cassava brown streak virus Isolate from Malawi
Winter, Stephan; Mukasa, Settumba; Tairo, Fred; Sseruwagi, Peter; Ndunguru, Joseph; Duffy, Siobain
2017-01-01
ABSTRACT Illumina sequencing of RNA from a cassava cutting from northern Malawi produced a genome of Ugandan cassava brown streak virus (UCBSV-MW-NB7_2013). Sequence comparisons revealed stronger similarity to an isolate from nearby Tanzania (93.4% pairwise nucleotide identity) than to those previously reported from Malawi (86.9 to 87.0%). PMID:28818908
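The pairwise nucleotide identity figures quoted above (93.4% versus 86.9 to 87.0%) are the kind of value that can be computed from two aligned sequences. The following is a minimal sketch, assuming the sequences are already aligned to equal length and that gapped columns are skipped; the abstract does not say how gaps were treated, and the function name `pairwise_identity` is hypothetical.

```python
def pairwise_identity(seq_a: str, seq_b: str) -> float:
    """Percent identity over aligned columns in which neither sequence has a gap.

    Assumption: the inputs are rows of an existing pairwise alignment, with
    '-' marking gaps. Skipping gapped columns is one common convention only.
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    compared = 0
    matches = 0
    for a, b in zip(seq_a.upper(), seq_b.upper()):
        if a == "-" or b == "-":
            continue  # ignore columns where either sequence has a gap
        compared += 1
        if a == b:
            matches += 1
    if compared == 0:
        raise ValueError("no ungapped columns to compare")
    return 100.0 * matches / compared


if __name__ == "__main__":
    # Toy sequences for illustration, not taken from the UCBSV genomes.
    print(pairwise_identity("ACGTACGTAC", "ACGTACGTAA"))
```

Other conventions (counting gaps as mismatches, or dividing by the shorter sequence length) give slightly different percentages, so the convention should be stated alongside any reported value.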
Generic Amplicon Deep Sequencing to Determine Ilarvirus Species Diversity in Australian Prunus
Kinoti, Wycliff M.; Constable, Fiona E.; Nancarrow, Narelle; Plummer, Kim M.; Rodoni, Brendan
2017-01-01
The distribution of Ilarvirus species populations amongst 61 Australian Prunus trees was determined by next generation sequencing (NGS) of amplicons generated using a genus-based generic RT-PCR targeting a conserved region of the Ilarvirus RNA2 component that encodes the RNA dependent RNA polymerase (RdRp) gene. Presence of Ilarvirus sequences in each positive sample was further validated by Sanger sequencing of cloned amplicons of regions of each of RNA1, RNA2 and/or RNA3 that were generated by species specific PCRs and by metagenomic NGS. Prunus necrotic ringspot virus (PNRSV) was the most frequently detected Ilarvirus, occurring in 48 of the 61 Ilarvirus-positive trees and Prune dwarf virus (PDV) and Apple mosaic virus (ApMV) were detected in three trees and one tree, respectively. American plum line pattern virus (APLPV) was detected in three trees and represents the first report of APLPV detection in Australia. Two novel and distinct groups of Ilarvirus-like RNA2 amplicon sequences were also identified in several trees by the generic amplicon NGS approach. The high read depth from the amplicon NGS of the generic PCR products allowed the detection of distinct RNA2 RdRp sequence variant populations of PNRSV, PDV, ApMV, APLPV and the two novel Ilarvirus-like sequences. Mixed infections of ilarviruses were also detected in seven Prunus trees. Sanger sequencing of specific RNA1, RNA2, and/or RNA3 genome segments of each virus and total nucleic acid metagenomics NGS confirmed the presence of PNRSV, PDV, ApMV and APLPV detected by RNA2 generic amplicon NGS. However, the two novel groups of Ilarvirus-like RNA2 amplicon sequences detected by the generic amplicon NGS could not be associated to the presence of sequence from RNA1 or RNA3 genome segments or full Ilarvirus genomes, and their origin is unclear. 
This work highlights the sensitivity of genus-specific amplicon NGS in detection of virus sequences and their distinct populations in multiple samples, and the need for a standardized approach to accurately determine what constitutes an active, viable virus infection after detection by molecular based methods. PMID:28713347
Deep learning improves prediction of CRISPR-Cpf1 guide RNA activity.
Kim, Hui Kwon; Min, Seonwoo; Song, Myungjae; Jung, Soobin; Choi, Jae Woo; Kim, Younggwang; Lee, Sangeun; Yoon, Sungroh; Kim, Hyongbum Henry
2018-03-01
We present two algorithms to predict the activity of AsCpf1 guide RNAs. Indel frequencies for 15,000 target sequences were used in a deep-learning framework based on a convolutional neural network to train Seq-deepCpf1. We then incorporated chromatin accessibility information to create the better-performing DeepCpf1 algorithm for cell lines for which such information is available and show that both algorithms outperform previous machine learning algorithms on our own and published data sets.
Characterization of microRNAs from goat (Capra hircus) by Solexa deep-sequencing technology.
Ling, Y H; Ding, J P; Zhang, X D; Wang, L J; Zhang, Y H; Li, Y S; Zhang, Z J; Zhang, X R
2013-06-13
MicroRNAs (miRNAs) are an important class of small noncoding RNAs that are highly conserved in plants and animals. Many miRNAs are known to mediate a myriad of cell processes, including proliferation and differentiation, via the regulation of some transcription and signaling factors, which are closely related to muscle development and disease. In this study, small RNA cDNA libraries of Boer goats were constructed. We then obtained goat muscle miRNAs using Solexa deep-sequencing technology and characterized them by bioinformatics analysis. Based on Solexa sequencing and bioinformatics analysis, 562 species-conserved and 5 goat genome-specific miRNAs were identified, 322 of which exceeded 100 in expression level. The results of real-time quantitative polymerase chain reaction from 8 randomly selected miRNAs showed that the 8 miRNAs were expressed in goat muscle, and the expression patterns were consistent with the Solexa sequencing results. The identification and characterization of miRNAs in goat muscle provide important information on the role of miRNA regulation in muscle growth and development. These data will help to facilitate studies on the regulatory roles played by miRNAs during goat growth and development.
Breaking the 1000-gene barrier for Mimivirus using ultra-deep genome and transcriptome sequencing.
Legendre, Matthieu; Santini, Sébastien; Rico, Alain; Abergel, Chantal; Claverie, Jean-Michel
2011-03-04
Mimivirus, a giant dsDNA virus infecting Acanthamoeba, is the prototype of the mimiviridae family, the latest addition to the family of the nucleocytoplasmic large DNA viruses (NCLDVs). Its 1.2 Mb-genome was initially predicted to encode 917 genes. A subsequent RNA-Seq analysis precisely mapped many transcript boundaries and identified 75 new genes. We now report a much deeper analysis using the SOLiD™ technology combining RNA-Seq of the Mimivirus transcriptome during the infectious cycle (202.4 Million reads), and a complete genome re-sequencing (45.3 Million reads). This study corrected the genome sequence and identified several single nucleotide polymorphisms. Our results also provided clear evidence of previously overlooked transcription units, including an important RNA polymerase subunit distantly related to Euryarchea homologues. The total Mimivirus gene count is now 1018, 11% greater than the original annotation. This study highlights the huge progress brought about by ultra-deep sequencing for the comprehensive annotation of virus genomes, opening the door to a complete one-nucleotide resolution level description of their transcriptional activity, and to the realistic modeling of the viral genome expression at the ultimate molecular level. This work also illustrates the need to go beyond bioinformatics-only approaches for the annotation of short protein and non-coding genes in viral genomes.
Integrated design, execution, and analysis of arrayed and pooled CRISPR genome-editing experiments.
Canver, Matthew C; Haeussler, Maximilian; Bauer, Daniel E; Orkin, Stuart H; Sanjana, Neville E; Shalem, Ophir; Yuan, Guo-Cheng; Zhang, Feng; Concordet, Jean-Paul; Pinello, Luca
2018-05-01
CRISPR (clustered regularly interspaced short palindromic repeats) genome-editing experiments offer enormous potential for the evaluation of genomic loci using arrayed single guide RNAs (sgRNAs) or pooled sgRNA libraries. Numerous computational tools are available to help design sgRNAs with optimal on-target efficiency and minimal off-target potential. In addition, computational tools have been developed to analyze deep-sequencing data resulting from genome-editing experiments. However, these tools are typically developed in isolation and oftentimes are not readily translatable into laboratory-based experiments. Here, we present a protocol that describes in detail both the computational and benchtop implementation of an arrayed and/or pooled CRISPR genome-editing experiment. This protocol provides instructions for sgRNA design with CRISPOR (computational tool for the design, evaluation, and cloning of sgRNA sequences), experimental implementation, and analysis of the resulting high-throughput sequencing data with CRISPResso (computational tool for analysis of genome-editing outcomes from deep-sequencing data). This protocol allows for design and execution of arrayed and pooled CRISPR experiments in 4-5 weeks by non-experts, as well as computational data analysis that can be performed in 1-2 d by both computational and noncomputational biologists alike using web-based and/or command-line versions.
Chen, Muyan; Storey, Kenneth B
2014-02-01
The sea cucumber Apostichopus japonicus withstands high water temperatures in the summer by suppressing its metabolic rate and entering a state of aestivation. We hypothesized that changes in the expression of miRNAs could provide important post-transcriptional regulation of gene expression during hypometabolism via control over mRNA translation. The present study analyzed profiles of miRNA expression in the sea cucumber respiratory tree using Solexa deep sequencing technology. We identified 279 sea cucumber miRNAs, including 15 novel miRNAs specific to sea cucumber. Animals sampled during deep aestivation (DA; after at least 15 days of continuous torpor) were compared with animals from a non-aestivation (NA) state (animals that had passed through aestivation and returned to an active state). We identified 30 differentially expressed miRNAs ([RPM (reads per million) >10, |FC| (|fold change|)≥1, FDR (false discovery rate)<0.01]) during aestivation, which were validated by two other miRNA profiling methods: miRNA microarray and real-time PCR. Among the most prominent miRNA species, miR-124, miR-124-3p, miR-79, miR-9 and miR-2010 were significantly over-expressed during deep aestivation compared with non-aestivation animals, suggesting that these miRNAs may play important roles in metabolic rate suppression during aestivation. High-throughput sequencing data and microarray data have been submitted to the GEO database with accession number: 16902695.
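The selection rule quoted in this abstract (RPM > 10, |FC| ≥ 1, FDR < 0.01) can be sketched in a few lines. This is a minimal illustration under stated assumptions, not the authors' pipeline: it assumes that |FC| refers to a log2 fold change, that the RPM threshold applies to the higher of the two conditions, and that a pseudocount of 1 is added before taking the ratio. The function names `rpm` and `is_differential` are hypothetical.

```python
import math


def rpm(count: int, total_mapped_reads: int) -> float:
    """Reads per million: raw count scaled to a library of one million reads."""
    return count * 1_000_000 / total_mapped_reads


def is_differential(rpm_da: float, rpm_na: float, fdr: float,
                    min_rpm: float = 10.0,
                    min_abs_log2fc: float = 1.0,
                    max_fdr: float = 0.01) -> bool:
    """Apply the three thresholds from the abstract to one miRNA.

    Assumptions (not stated in the abstract): FC is a log2 ratio, the RPM
    cut-off is checked against the more abundant condition, and a
    pseudocount of 1 avoids division by zero.
    """
    if max(rpm_da, rpm_na) <= min_rpm:
        return False
    log2fc = math.log2((rpm_da + 1.0) / (rpm_na + 1.0))
    return abs(log2fc) >= min_abs_log2fc and fdr < max_fdr


if __name__ == "__main__":
    # Toy numbers for illustration only, not values from the study.
    print(rpm(50, 2_000_000))
    print(is_differential(rpm_da=40.0, rpm_na=9.0, fdr=0.001))
```

The FDR values themselves would come from a separate multiple-testing correction over all miRNAs; here they are simply taken as inputs.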
Deep Sequencing Insights in Therapeutic shRNA Processing and siRNA Target Cleavage Precision.
Denise, Hubert; Moschos, Sterghios A; Sidders, Benjamin; Burden, Frances; Perkins, Hannah; Carter, Nikki; Stroud, Tim; Kennedy, Michael; Fancy, Sally-Ann; Lapthorn, Cris; Lavender, Helen; Kinloch, Ross; Suhy, David; Corbau, Romu
2014-02-04
TT-034 (PF-05095808) is a recombinant adeno-associated virus serotype 8 (AAV8) agent expressing three short hairpin RNA (shRNA) pro-drugs that target the hepatitis C virus (HCV) RNA genome. The cytosolic enzyme Dicer cleaves each shRNA into multiple, potentially active small interfering RNA (siRNA) drugs. Using next-generation sequencing (NGS) to identify and characterize active shRNA maturation products, we observed that each TT-034-encoded shRNA could be processed into as many as 95 separate siRNA strands. Few of these appeared active as determined by Sanger 5' RNA Ligase-Mediated Rapid Amplification of cDNA Ends (5'-RACE) and through synthetic shRNA and siRNA analogue studies. Moreover, NGS scrutiny applied on 5'-RACE products (RACE-seq) suggested that synthetic siRNAs could direct cleavage in not one, but up to five separate positions on targeted RNA, in a sequence-dependent manner. These data support an on-target mechanism of action for TT-034 without cytotoxicity and question the accepted precision of substrate processing by the key RNA interference (RNAi) enzymes Dicer and siRNA-induced silencing complex (siRISC). Molecular Therapy-Nucleic Acids (2014) 3, e145; doi:10.1038/mtna.2013.73; published online 4 February 2014.
Zhang, De-Chao; Liu, Yan-Xia; Li, Xin-Zheng
2015-09-01
Deep sea ferromanganese (FeMn) nodules contain metallic mineral resources and have great economic potential. In this study, a combination of culture-dependent and culture-independent (16S rRNA genes clone library and pyrosequencing) methods was used to investigate the bacterial diversity in FeMn nodules from Jiaolong Seamount, the South China Sea. Eleven bacterial strains including some moderate thermophiles were isolated. The majority of strains belonged to the phylum Proteobacteria; one isolate belonged to the phylum Firmicutes. A total of 259 near full-length bacterial 16S rRNA gene sequences in a clone library and 67,079 valid reads obtained using pyrosequencing indicated that members of the Gammaproteobacteria dominated, with the most abundant bacterial genera being Pseudomonas and Alteromonas. Sequence analysis indicated the presence of many organisms whose closest relatives are known manganese oxidizers, iron reducers, hydrogen-oxidizing bacteria and methylotrophs. This is the first reported investigation of bacterial diversity associated with deep sea FeMn nodules from the South China Sea.
TAM: a method for enrichment and depletion analysis of a microRNA category in a list of microRNAs.
Lu, Ming; Shi, Bing; Wang, Juan; Cao, Qun; Cui, Qinghua
2010-08-09
MicroRNAs (miRNAs) are a class of important gene regulators. The number of identified miRNAs has been increasing dramatically in recent years. An emerging major challenge is the interpretation of genome-scale miRNA datasets, including those derived from microarray and deep-sequencing. It is interesting and important to know the common rules or patterns behind a list of miRNAs (e.g. the deregulated miRNAs resulting from a miRNA microarray or deep-sequencing experiment). For this purpose, this study presents a method and develops a tool (TAM) for annotation of meaningful human miRNA categories. We first integrated miRNAs into various meaningful categories according to prior knowledge, such as miRNA family, miRNA cluster, miRNA function, miRNA associated diseases, and tissue specificity. Using TAM, given lists of miRNAs can be rapidly annotated and summarized according to the integrated miRNA categorical data. Moreover, given a list of miRNAs, TAM can be used to predict novel related miRNAs. Finally, we confirmed the usefulness and reliability of TAM by applying it to deregulated miRNAs in acute myocardial infarction (AMI) from two independent experiments. TAM can efficiently identify meaningful categories for given miRNAs. In addition, TAM can be used to identify novel miRNA biomarkers. TAM tool, source codes, and miRNA category data are freely available at http://cmbi.bjmu.edu.cn/tam.
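Enrichment of a miRNA category in a user-supplied list is commonly scored with a hypergeometric tail probability. The abstract does not state which test TAM uses, so the following is only a sketch of that common approach, with a hypothetical function name `enrichment_p_value` and toy numbers.

```python
from math import comb


def enrichment_p_value(population: int, category_size: int,
                       list_size: int, overlap: int) -> float:
    """Upper-tail hypergeometric probability P(X >= overlap).

    population    : number of miRNAs in the background set
    category_size : background miRNAs that belong to the category
    list_size     : miRNAs in the query list
    overlap       : query-list miRNAs that belong to the category

    Assumption: the hypergeometric model is illustrative; it is not
    confirmed to be the test implemented in TAM.
    """
    upper = min(category_size, list_size)
    total = comb(population, list_size)
    tail = sum(
        comb(category_size, k) * comb(population - category_size, list_size - k)
        for k in range(overlap, upper + 1)
    )
    return tail / total


if __name__ == "__main__":
    # Toy background of 10 miRNAs, 4 in the category, query list of 3, 2 shared.
    print(enrichment_p_value(population=10, category_size=4, list_size=3, overlap=2))
```

Depletion could be scored analogously with the lower tail, P(X <= overlap), and p-values across many categories would normally be corrected for multiple testing.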
USDA-ARS's Scientific Manuscript database
Squash mosaic virus (SqMV), a seed-borne virus belonging to the genus Comovirus in the family Comoviridae, can cause serious yield losses in cucurbit crops worldwide. SqMV has a bipartite single-stranded ribonucleic acid (RNA) genome (RNA-1 and RNA-2) encapsidated separately with two capsid prote...
Poudel, Saroj; Aryal, Niranjan; Lu, Chaofu; ...
2015-03-31
Camelina sativa is an annual oilseed crop that is under intensive development for renewable resources of biofuels and industrial oils. MicroRNAs, or miRNAs, are endogenously encoded small RNAs that play key roles in diverse plant biological processes. Here, we conducted deep sequencing on small RNA libraries prepared from camelina leaves, flower buds and two stages of developing seeds corresponding to initial and peak storage products accumulation. Computational analyses identified 207 known miRNAs belonging to 63 families, as well as 5 novel miRNAs. These miRNAs, especially members of the miRNA families, varied greatly in different tissues and developmental stages. The predicted miRNA target genes are involved in a broad range of physiological functions including lipid metabolism. This report is the first step toward elucidating roles of miRNAs in C. sativa and will provide additional tools to improve this oilseed crop for biofuels and biomaterials.
MicroRNAs play critical roles during plant development and in response to abiotic stresses.
de Lima, Júlio César; Loss-Morais, Guilherme; Margis, Rogerio
2012-12-01
MicroRNAs (miRNAs) have been identified as key molecules in regulatory networks. The fine-tuning role of miRNAs in addition to the regulatory role of transcription factors has shown that molecular events during development are tightly regulated. In addition, several miRNAs play crucial roles in the response to abiotic stress induced by drought, salinity, low temperatures, and metals such as aluminium. Interestingly, several miRNAs have overlapping roles with regard to development, stress responses, and nutrient homeostasis. Moreover, in response to the same abiotic stresses, different expression patterns for some conserved miRNA families among different plant species revealed different metabolic adjustments. The use of deep sequencing technologies for the characterisation of miRNA frequency and the identification of new miRNAs adds complexity to regulatory networks in plants. In this review, we consider the regulatory role of miRNAs in plant development and abiotic stresses, as well as the impact of deep sequencing technologies on the generation of miRNA data.
Zhang, Likui; Kang, Manyu; Huang, Yangchao; Yang, Lixiang
2016-05-01
The diversity and ecological significance of bacteria and archaea in deep-sea environments have been thoroughly investigated, but eukaryotic microorganisms in these areas, such as fungi, are poorly understood. To elucidate fungal diversity in calcareous deep-sea sediments in the Southwest Indian Ridge (SWIR), the internal transcribed spacer (ITS) regions of rRNA genes from two sediment metagenomic DNA samples were amplified and sequenced using the Illumina sequencing platform. The results revealed that 58-63% and 36-42% of the ITS sequences (97% similarity) belonged to Basidiomycota and Ascomycota, respectively. These findings suggest that Basidiomycota and Ascomycota are the predominant fungal phyla in the two samples. We also found that Agaricomycetes, Leotiomycetes, and Pezizomycetes were the major fungal classes in the two samples. At the species level, Thelephoraceae sp. and Phialocephala fortinii were major fungal species in the two samples. Despite the low relative abundance, unidentified fungal sequences were also observed in the two samples. Furthermore, we found that there were slight differences in fungal diversity between the two sediment samples, although both were collected from the SWIR. Thus, our results demonstrate that calcareous deep-sea sediments in the SWIR harbor diverse fungi, which augment the fungal groups in deep-sea sediments. This is the first report of fungal communities in calcareous deep-sea sediments in the SWIR revealed by Illumina sequencing.
Dell'Anno, Antonio; Carugati, Laura; Corinaldesi, Cinzia; Riccioni, Giulia; Danovaro, Roberto
2015-01-01
Nematodes inhabiting benthic deep-sea ecosystems account for >90% of the total metazoan abundances and they have been hypothesised to be hyper-diverse, but their biodiversity is still largely unknown. Metabarcoding could facilitate the census of biodiversity, especially for those tiny metazoans for which morphological identification is difficult. We compared, for the first time, different DNA extraction procedures based on the use of two commercial kits and a previously published laboratory protocol and tested their suitability for sequencing analyses of 18S rDNA of marine nematodes. We also investigated the reliability of Roche 454 sequencing analyses for assessing the biodiversity of deep-sea nematode assemblages previously morphologically identified. Finally, intra-genomic variation in 18S rRNA gene repeats was investigated by Illumina MiSeq in different deep-sea nematode morphospecies to assess the influence of polymorphisms on nematode biodiversity estimates. Our results indicate that the two commercial kits should be preferred for the molecular analysis of biodiversity of deep-sea nematodes since they consistently provide amplifiable DNA suitable for sequencing. We report that the morphological identification of deep-sea nematodes matches the results obtained by metabarcoding analysis only at the order-family level and that a large portion of Operational Clustered Taxonomic Units (OCTUs) was not assigned. We also show that independently from the cut-off criteria and bioinformatic pipelines used, the number of OCTUs largely exceeds the number of individuals and that 18S rRNA gene of different morpho-species of nematodes displayed intra-genomic polymorphisms. Our results indicate that metabarcoding is an important tool to explore the diversity of deep-sea nematodes, but still fails in identifying most of the species due to limited number of sequences deposited in the public databases, and in providing quantitative data on the species encountered. 
These aspects should be carefully taken into account before using metabarcoding in quantitative ecological research and monitoring programmes of marine biodiversity. PMID:26701112
Low-abundant bacteria drive compositional changes in the gut microbiota after dietary alteration.
Benjamino, Jacquelynn; Lincoln, Stephen; Srivastava, Ranjan; Graf, Joerg
2018-05-10
As the importance of beneficial bacteria is better recognized, understanding the dynamics of symbioses becomes increasingly crucial. In many gut symbioses, it is essential to understand whether changes in host diet play a role in the persistence of the bacterial gut community. In this study, termites were fed six dietary sources and the microbial community was monitored over a 49-day period using 16S rRNA gene sequencing. A deep backpropagation artificial neural network (ANN) was used to learn how the six different lignocellulose food sources affected the temporal composition of the hindgut microbiota of the termite as well as taxon-taxon and taxon-substrate interactions. Shifts in the termite gut microbiota after diet change in each colony were observed using 16S rRNA gene sequencing and beta diversity analyses. The artificial neural network accurately predicted the relative abundances of taxa at random points in the temporal study and showed that low-abundant taxa maintain community driving correlations in the hindgut. This combinatorial approach utilizing 16S rRNA gene sequencing and deep learning revealed that low-abundant bacteria that often do not belong to the core community are drivers of the termite hindgut bacterial community composition.
Juan, Li; Tong, Hong-li; Zhang, Pengjun; Guo, Guanghong; Wang, Zi; Wen, Xinyu; Dong, Zhennan; Tian, Ya-ping
2014-09-03
Small non-coding microRNAs (miRNAs) are involved in cancer development and progression, and serum profiles of cervical cancer patients may be useful for identifying novel miRNAs. We performed deep sequencing on serum pools of cervical cancer patients and healthy controls with 3 replicates and constructed a small RNA library. We used MIREAP to predict novel miRNAs and identified 2 putative novel miRNAs between serum pools of cervical cancer patients and healthy controls after filtering out pseudo-pre-miRNAs using Triplet-SVM analysis. The 2 putative novel miRNAs were validated by real time PCR and were significantly decreased in cervical cancer patients compared with healthy controls. One novel miRNA had an area under curve (AUC) of 0.921 (95% CI: 0.883, 0.959) with a sensitivity of 85.7% and a specificity of 88.2% when discriminating between cervical cancer patients and healthy controls. Our results suggest that characterizing serum profiles of cervical cancers by Solexa sequencing may be a good method for identifying novel miRNAs and that the validated novel miRNAs described here may be cervical cancer-associated biomarkers.
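The diagnostic figures reported above (AUC, sensitivity, specificity) follow from standard definitions, sketched below. The counts and scores in the example are hypothetical and are not the study's cohort data; the function names are illustrative. The AUC function uses the rank-based (Mann-Whitney) definition and assumes that higher scores indicate the positive class, so a marker that is decreased in patients would need its sign flipped first.

```python
from typing import Sequence, Tuple


def sensitivity_specificity(tp: int, fn: int, tn: int, fp: int) -> Tuple[float, float]:
    """Sensitivity = TP / (TP + FN); specificity = TN / (TN + FP)."""
    return tp / (tp + fn), tn / (tn + fp)


def auc(pos_scores: Sequence[float], neg_scores: Sequence[float]) -> float:
    """Probability that a random positive scores above a random negative.

    Ties count as half a win. This quadratic-time version is meant for
    small illustrative inputs only.
    """
    wins = 0.0
    for p in pos_scores:
        for n in neg_scores:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos_scores) * len(neg_scores))


if __name__ == "__main__":
    # Hypothetical confusion-matrix counts, chosen only to illustrate the formulas.
    sens, spec = sensitivity_specificity(tp=6, fn=1, tn=15, fp=2)
    print(round(sens, 3), round(spec, 3))
    # Hypothetical classifier scores for cases and controls.
    print(auc([0.9, 0.8, 0.4], [0.5, 0.3]))
```

Sensitivity and specificity depend on the chosen decision threshold, whereas the AUC summarizes performance across all thresholds, which is why the abstract reports both.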
Rapid Creation and Quantitative Monitoring of High Coverage shRNA Libraries
Bassik, Michael C.; Lebbink, Robert Jan; Churchman, L. Stirling; Ingolia, Nicholas T.; Patena, Weronika; LeProust, Emily M.; Schuldiner, Maya; Weissman, Jonathan S.; McManus, Michael T.
2009-01-01
Short hairpin RNA (shRNA) libraries are limited by the low efficacy of many shRNAs, giving false negatives, and off-target effects, giving false positives. Here we present a strategy for rapidly creating expanded shRNA pools (∼30 shRNAs/gene) that are analyzed by deep-sequencing (EXPAND). This approach enables identification of multiple effective target-specific shRNAs from a complex pool, allowing a rigorous statistical evaluation of whether a gene is a true hit. PMID:19448642
NASA Astrophysics Data System (ADS)
Yakimov, Michail M.; Cono, Violetta La; Denaro, Renata
2009-05-01
The autotrophic and ammonia-oxidizing crenarchaeal assemblage at an offshore site in deep Mediterranean water (Tyrrhenian Sea, depth 3000 m) was studied by PCR amplification of the key functional genes involved in energy (ammonia mono-oxygenase alpha subunit, amoA) and central metabolism (acetyl-CoA carboxylase alpha subunit, accA). Using two recently annotated genomes of marine crenarchaeons, an initial set of primers targeting archaeal accA-like genes was designed. Approximately 300 clones were analyzed, of which 100% of the amoA library and almost 70% of the accA library were unambiguously related to the corresponding genes from marine Crenarchaeota. Even though the acetyl-CoA carboxylase is phylogenetically not well conserved and the remaining clones were affiliated to various bacterial acetyl-CoA/propionyl-CoA carboxylase genes, the pool of archaeal sequences was applied for development of quantitative PCR analysis of accA-like distribution using TaqMan® methodology. The archaeal accA gene fragments, together with alignable gene fragments from the Sargasso Sea and North Pacific Subtropical Gyre (ALOHA Station) metagenome databases, were analyzed by multiple sequence alignment. Two accA-like sequences, found in ALOHA Station at the depth of 4000 m, formed a deeply branched clade with 64% of all archaeal Tyrrhenian clones. No close relatives of the remaining 36% of clones, except for those recovered from the Eastern Mediterranean, were found, suggesting the existence of a specific lineage of the crenarchaeal accA genes in deep Mediterranean water. Alignment of Mediterranean amoA sequences defined four cosmopolitan phylotypes of Crenarchaeota putative ammonia mono-oxygenase subunit A gene occurring in the water sample from the 3000 m depth. Without exception, all phylotypes fell into the Deep Marine Group I cluster, which contains the vast majority of known sequences recovered from global deep-sea environments. 
Remarkably, three phylotypes accounted for 91% of all Mediterranean amoA clones and corresponded to the sequences retrieved from the shallower compartments of the world's ocean, most likely reflecting the higher temperature at the depth of the Mediterranean Sea. In order to verify whether these phylotypes might represent important Crenarchaeota in the functioning of the Mediterranean bathypelagic ecosystem, expression of the crenarchaeal amoA gene was monitored by direct RNA retrieval and subsequent analysis of amoA-related mRNA transcripts. Surprisingly, all mRNA-derived sequences formed a tight monophyletic group, which fell into the large Shallow Marine Group I cluster with sequences retrieved from shallow (up to 200 m) waters, sediments and corals. This group was not detected in the DNA-based clone library, evidently owing to the overwhelming dominance of Deep Marine Group I. The failure to recover amoA transcripts related to Deep Marine Group I of Crenarchaeota was unanticipated and likely resulted from the physiology of these strongly adapted deep-sea organisms. Because all seawater samples were processed on board under atmospheric pressure and sunlight, decompression and/or photoinhibition likely affected their metabolic activity, leading to a strong decay of gene expression.
Small RNA Deep Sequencing and the Effects of microRNA408 on Root Gravitropic Bending in Arabidopsis
NASA Astrophysics Data System (ADS)
Li, Huasheng; Lu, Jinying; Sun, Qiao; Chen, Yu; He, Dacheng; Liu, Min
2015-11-01
MicroRNA (miRNA) is a non-coding small RNA composed of 20 to 24 nucleotides that influences plant root development. This study analyzed miRNA expression in Arabidopsis root tip cells using Illumina sequencing and real-time PCR before (sample 0) and 15 min after (sample 15) a 3-D clinostat rotational treatment was administered. After stimulation, the expression levels of seven miRNA genes, including Arabidopsis miR160, miR161, miR394, miR402, miR403, miR408, and miR823, were significantly upregulated. Illumina sequencing results also revealed two novel miRNAs that have not been previously reported. The target genes of these miRNAs included pentatricopeptide repeat-containing protein and diadenosine tetraphosphate hydrolase. An overexpression vector of Arabidopsis miR408 was constructed and transferred into Arabidopsis plants. The roots of plants overexpressing miR408 exhibited slower reorientation upon gravistimulation than those of the wild type. This result indicates that miR408 could play a role in the root gravitropic response.
NASA Technical Reports Server (NTRS)
Woese, C. R.; Achenbach, L.; Rouviere, P.; Mandelco, L.
1991-01-01
A major and too little recognized source of artifact in phylogenetic analysis of molecular sequence data is compositional difference among sequences. The problem becomes particularly acute when alignments contain ribosomal RNAs from both mesophilic and thermophilic species. Among prokaryotes the latter are considerably higher in G + C content than the former, which often results in artificial clustering of thermophilic lineages and their being placed artificially deep in phylogenetic trees. In this communication we review archaeal phylogeny in the light of this consideration, focusing in particular on the phylogenetic position of the sulfate reducing species Archaeoglobus fulgidus, using both 16S rRNA and 23S rRNA sequences. The analysis shows clearly that the previously reported deep branching of the A. fulgidus lineage (very near the base of the euryarchaeal side of the archaeal tree) is incorrect, and that the lineage actually groups with a previously recognized unit that comprises the Methanomicrobiales and extreme halophiles.
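The compositional difference described in the abstract above can be checked before tree building by computing the G + C fraction of each aligned sequence. The following is a minimal sketch only: the function name `gc_content` and the toy fragments are hypothetical illustrations, not data or code from the cited study.

```python
def gc_content(seq: str) -> float:
    """Return the fraction of G and C among the unambiguous bases in seq."""
    seq = seq.upper()
    # Ignore gaps and ambiguity codes; count only A, C, G, T and U.
    counted = [base for base in seq if base in "ACGTU"]
    if not counted:
        raise ValueError("sequence contains no unambiguous bases")
    return sum(base in "GC" for base in counted) / len(counted)


# Hypothetical toy fragments (assumption: not real rRNA sequences).
toy_alignment = {
    "mesophile_a": "AUGCAUUAAUGCAUAU",
    "thermophile_b": "GCGCGGCCAUGCGGCC",
}

gc_by_taxon = {name: gc_content(seq) for name, seq in toy_alignment.items()}
for name, gc in gc_by_taxon.items():
    print(f"{name}: G+C = {gc:.3f}")
```

A large spread in these values across taxa would suggest that clustering in the resulting tree may partly reflect base composition rather than ancestry.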
mRNA deep sequencing reveals 75 new genes and a complex transcriptional landscape in Mimivirus.
Legendre, Matthieu; Audic, Stéphane; Poirot, Olivier; Hingamp, Pascal; Seltzer, Virginie; Byrne, Deborah; Lartigue, Audrey; Lescot, Magali; Bernadac, Alain; Poulain, Julie; Abergel, Chantal; Claverie, Jean-Michel
2010-05-01
Mimivirus, a virus infecting Acanthamoeba, is the prototype of the Mimiviridae, the latest addition to the nucleocytoplasmic large DNA viruses. The Mimivirus genome encodes close to 1000 proteins, many of them never before encountered in a virus, such as four amino-acyl tRNA synthetases. To explore the physiology of this exceptional virus and identify the genes involved in the building of its characteristic intracytoplasmic "virion factory," we coupled electron microscopy observations with the massively parallel pyrosequencing of the polyadenylated RNA fractions of Acanthamoeba castellanii cells at various times post-infection. We generated 633,346 reads, of which 322,904 correspond to Mimivirus transcripts. This first application of deep mRNA sequencing (454 Life Sciences [Roche] FLX) to a large DNA virus allowed the precise delineation of the 5' and 3' extremities of Mimivirus mRNAs and revealed 75 new transcripts including several noncoding RNAs. Mimivirus genes are expressed across a wide dynamic range, in a finely regulated manner broadly described by three main temporal classes: early, intermediate, and late. This RNA-seq study confirmed the AAAATTGA sequence as an early promoter element, as well as the presence of palindromes at most of the polyadenylation sites. It also revealed a new promoter element correlating with late gene expression, which is also prominent in Sputnik, the recently described Mimivirus "virophage." These results, validated genome-wide by the hybridization of total RNA extracted from infected Acanthamoeba cells on a tiling array (Agilent), will constitute the foundation on which to build subsequent functional studies of the Mimivirus/Acanthamoeba system.
RNA-Seq analysis to capture the transcriptome landscape of a single cell
Tang, Fuchou; Barbacioru, Catalin; Nordman, Ellen; Xu, Nanlan; Bashkirov, Vladimir I; Lao, Kaiqin; Surani, M. Azim
2013-01-01
We describe here a protocol for digital transcriptome analysis in a single mouse blastomere using a deep sequencing approach. An individual blastomere was first isolated and put into lysate buffer by mouth pipette. Reverse transcription was then performed directly on the whole cell lysate. After this, the free primers were removed by Exonuclease I and a poly(A) tail was added to the 3′ end of the first-strand cDNA by Terminal Deoxynucleotidyl Transferase. Then the single cell cDNAs were amplified by 20 plus 9 cycles of PCR. Then 100-200 ng of these amplified cDNAs were used to construct a sequencing library. The sequencing library can be used for deep sequencing using the SOLiD system. Compared with the cDNA microarray technique, our assay can capture up to 75% more genes expressed in early embryos. The protocol can generate deep sequencing libraries within 6 days for 16 single cell samples. PMID:20203668
Comparing sequencing assays and human-machine analyses in actionable genomics for glioblastoma
Wrzeszczynski, Kazimierz O.; Frank, Mayu O.; Koyama, Takahiko; Rhrissorrakrai, Kahn; Robine, Nicolas; Utro, Filippo; Emde, Anne-Katrin; Chen, Bo-Juen; Arora, Kanika; Shah, Minita; Vacic, Vladimir; Norel, Raquel; Bilal, Erhan; Bergmann, Ewa A.; Moore Vogel, Julia L.; Bruce, Jeffrey N.; Lassman, Andrew B.; Canoll, Peter; Grommes, Christian; Harvey, Steve; Parida, Laxmi; Michelini, Vanessa V.; Zody, Michael C.; Jobanputra, Vaidehi; Royyuru, Ajay K.
2017-01-01
Objective: To analyze a glioblastoma tumor specimen with 3 different platforms and compare potentially actionable calls from each. Methods: Tumor DNA was analyzed by a commercial targeted panel. In addition, tumor-normal DNA was analyzed by whole-genome sequencing (WGS) and tumor RNA was analyzed by RNA sequencing (RNA-seq). The WGS and RNA-seq data were analyzed by a team of bioinformaticians and cancer oncologists, and separately by IBM Watson Genomic Analytics (WGA), an automated system for prioritizing somatic variants and identifying drugs. Results: More variants were identified by WGS/RNA analysis than by targeted panels. WGA completed a comparable analysis in a fraction of the time required by the human analysts. Conclusions: The development of an effective human-machine interface in the analysis of deep cancer genomic datasets may provide potentially clinically actionable calls for individual patients in a more timely and efficient manner than currently possible. ClinicalTrials.gov identifier: NCT02725684. PMID:28740869
Asymmetric purine-pyrimidine distribution in cellular small RNA population of papaya
2012-01-01
Background The small RNAs (sRNA) are a regulatory class of RNA mainly represented by the 21 and 24-nucleotide size classes. The cellular sRNAs are processed by RNase III family enzyme dicer (Dicer like in plant) from a self-complementary hairpin loop or other type of RNA duplexes. The papaya genome has been sequenced, but its microRNAs and other regulatory RNAs are yet to be analyzed. Results We analyzed the genomic features of the papaya sRNA population from three sRNA deep sequencing libraries made from leaves, flowers, and leaves infected with Papaya Ringspot Virus (PRSV). We also used the deep sequencing data to annotate the micro RNA (miRNA) in papaya. We identified 60 miRNAs, 24 of which were conserved in other species, and 36 of which were novel miRNAs specific to papaya. In contrast to the Chargaff’s purine-pyrimidine equilibrium, cellular sRNA was significantly biased towards a purine rich population. Of the two purine bases, higher frequency of adenine was present in 23nt or longer sRNAs, while 22nt or shorter sRNAs were over represented by guanine bases. However, this bias was not observed in the annotated miRNAs in plants. The 21nt species were expressed from fewer loci but expressed at higher levels relative to the 24nt species. The highly expressed 21nt species were clustered in a few isolated locations of the genome. The PRSV infected leaves showed higher accumulation of 21 and 22nt sRNA compared to uninfected leaves. We observed higher accumulation of miRNA* of seven annotated miRNAs in virus-infected tissue, indicating the potential function of miRNA* under stressed conditions. Conclusions We have identified 60 miRNAs in papaya. Our study revealed the asymmetric purine-pyrimidine distribution in cellular sRNA population. The 21nt species of sRNAs have higher expression levels than 24nt sRNA. The miRNA* of some miRNAs shows higher accumulation in PRSV infected tissues, suggesting that these strands are not totally functionally redundant. 
The findings open a new avenue for further investigation of the sRNA silencing pathway in plants. PMID:23216749
Asymmetric purine-pyrimidine distribution in cellular small RNA population of papaya.
Aryal, Rishi; Yang, Xiaozeng; Yu, Qingyi; Sunkar, Ramanjulu; Li, Lei; Ming, Ray
2012-12-05
The small RNAs (sRNA) are a regulatory class of RNA mainly represented by the 21 and 24-nucleotide size classes. The cellular sRNAs are processed by RNase III family enzyme dicer (Dicer like in plant) from a self-complementary hairpin loop or other type of RNA duplexes. The papaya genome has been sequenced, but its microRNAs and other regulatory RNAs are yet to be analyzed. We analyzed the genomic features of the papaya sRNA population from three sRNA deep sequencing libraries made from leaves, flowers, and leaves infected with Papaya Ringspot Virus (PRSV). We also used the deep sequencing data to annotate the micro RNA (miRNA) in papaya. We identified 60 miRNAs, 24 of which were conserved in other species, and 36 of which were novel miRNAs specific to papaya. In contrast to the Chargaff's purine-pyrimidine equilibrium, cellular sRNA was significantly biased towards a purine rich population. Of the two purine bases, higher frequency of adenine was present in 23nt or longer sRNAs, while 22nt or shorter sRNAs were over represented by guanine bases. However, this bias was not observed in the annotated miRNAs in plants. The 21nt species were expressed from fewer loci but expressed at higher levels relative to the 24nt species. The highly expressed 21nt species were clustered in a few isolated locations of the genome. The PRSV infected leaves showed higher accumulation of 21 and 22nt sRNA compared to uninfected leaves. We observed higher accumulation of miRNA* of seven annotated miRNAs in virus-infected tissue, indicating the potential function of miRNA* under stressed conditions. We have identified 60 miRNAs in papaya. Our study revealed the asymmetric purine-pyrimidine distribution in cellular sRNA population. The 21nt species of sRNAs have higher expression levels than 24nt sRNA. The miRNA* of some miRNAs shows higher accumulation in PRSV infected tissues, suggesting that these strands are not totally functionally redundant. 
The findings open a new avenue for further investigation of the sRNA silencing pathway in plants.
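The purine bias reported in the abstract above amounts to asking what fraction of bases in each sRNA length class are A or G. The following is a minimal sketch under stated assumptions: the function name `purine_fraction_by_length` and the toy reads are hypothetical and far shorter than real 21 to 24 nt sRNAs; this is not the authors' pipeline.

```python
from collections import defaultdict

PURINES = frozenset("AG")


def purine_fraction_by_length(reads):
    """Map each read length to the pooled fraction of purine bases (A or G)."""
    totals = defaultdict(lambda: [0, 0])  # length -> [purine count, base count]
    for read in reads:
        read = read.upper()
        if not read:
            continue  # skip empty reads to avoid a zero-length class
        totals[len(read)][0] += sum(base in PURINES for base in read)
        totals[len(read)][1] += len(read)
    return {n: purines / bases for n, (purines, bases) in sorted(totals.items())}


# Hypothetical toy reads (assumption: illustrative only).
toy_reads = ["AGGA", "ACGU", "AGCUA"]
fractions = purine_fraction_by_length(toy_reads)
for length, frac in fractions.items():
    print(f"{length} nt: purine fraction = {frac:.2f}")
```

A value near 0.5 in a length class would be consistent with purine-pyrimidine balance, while values well above 0.5 would indicate the kind of purine enrichment the abstract describes.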
Identification of microRNAs differentially expressed involved in male flower development.
Wang, Zhengjia; Huang, Jianqin; Sun, Zhichao; Zheng, Bingsong
2015-03-01
Hickory (Carya cathayensis Sarg.) is one of the most economically important woody trees in eastern China, but its long flowering phase delays yield. Our understanding of the regulatory roles of microRNAs (miRNAs) in male flower development in hickory remains poor. Using high-throughput sequencing technology, we have pyrosequenced two small RNA libraries from two male flower differentiation stages in hickory. Analysis of the sequencing data identified 114 conserved miRNAs that belonged to 23 miRNA families, five novel miRNAs including their corresponding miRNA*s, and 22 plausible miRNA candidates. Differential expression analysis revealed 12 miRNA sequences that were upregulated in the later (reproductive) stage of male flower development. Quantitative real-time PCR showed expression trends similar to those obtained by deep sequencing. Novel miRNAs and plausible miRNA candidates were predicted using bioinformatic analysis methods. The miRNAs newly identified in this study have increased the number of known miRNAs in hickory, and the identification of differentially expressed miRNAs will provide new avenues for studies into miRNAs involved in the process of male flower development in hickory and other related trees.
Zhang, Hanyuan; Vieira Resende E Silva, Bruno; Cui, Juan
2018-05-01
Small RNA sequencing is the most widely used tool for microRNA (miRNA) discovery, and shows great potential for the efficient study of miRNA cross-species transport, i.e., by detecting the presence of exogenous miRNA sequences in the host species. Because of the increased appreciation of dietary miRNAs and their far-reaching implication in human health, research interests are currently growing with regard to exogenous miRNAs bioavailability, mechanisms of cross-species transport and miRNA function in cellular biological processes. In this article, we present microRNA Discovery (miRDis), a new small RNA sequencing data analysis pipeline for both endogenous and exogenous miRNA detection. Specifically, we developed and deployed a Web service that supports the annotation and expression profiling data of known host miRNAs and the detection of novel miRNAs, other noncoding RNAs, and the exogenous miRNAs from dietary species. As a proof-of-concept, we analyzed a set of human plasma sequencing data from a milk-feeding study where 225 human miRNAs were detected in the plasma samples and 44 show elevated expression after milk intake. By examining the bovine-specific sequences, data indicate that three bovine miRNAs (bta-miR-378, -181* and -150) are present in human plasma possibly because of the dietary uptake. Further evaluation based on different sets of public data demonstrates that miRDis outperforms other state-of-the-art tools in both detection and quantification of miRNA from either animal or plant sources. The miRDis Web server is available at: http://sbbi.unl.edu/miRDis/index.php.
Deep Sequence Analysis of AgoshRNA Processing Reveals 3' A Addition and Trimming.
Harwig, Alex; Herrera-Carrillo, Elena; Jongejan, Aldo; van Kampen, Antonius Hubertus; Berkhout, Ben
2015-07-14
The RNA interference (RNAi) pathway, in which microprocessor and Dicer collaborate to process microRNAs (miRNA), was recently expanded by the description of alternative processing routes. In one of these noncanonical pathways, Dicer action is replaced by the Argonaute2 (Ago2) slicer function. It was recently shown that the stem-length of precursor-miRNA or short hairpin RNA (shRNA) molecules is a major determinant for Dicer versus Ago2 processing. Here we present the results of a deep sequence study on the processing of shRNAs with different stem lengths and a top G·U wobble base pair (bp). This analysis revealed some unexpected properties of these so-called AgoshRNA molecules that are processed by Ago2 instead of Dicer. First, we confirmed the gradual shift from Dicer to Ago2 processing upon shortening of the hairpin length. Second, hairpins with a stem longer than 19 base pairs are inefficiently cleaved by Ago2, and we noticed a shift in the cleavage site. Third, the introduction of a top G·U bp in a regular shRNA can promote Ago2-cleavage, which coincides with a loss of Ago2-loading of the Dicer-cleaved 3' strand. Fourth, the Ago2-processed AgoshRNAs acquire a short 3' tail of 1-3 A-nucleotides (nt) and we present evidence that this product is subsequently trimmed by the poly(A)-specific ribonuclease (PARN).
Deep Sequence Analysis of AgoshRNA Processing Reveals 3' A Addition and Trimming
Harwig, Alex; Herrera-Carrillo, Elena; Jongejan, Aldo; van Kampen, Antonius Hubertus; Berkhout, Ben
2015-01-01
The RNA interference (RNAi) pathway, in which microprocessor and Dicer collaborate to process microRNAs (miRNA), was recently expanded by the description of alternative processing routes. In one of these noncanonical pathways, Dicer action is replaced by the Argonaute2 (Ago2) slicer function. It was recently shown that the stem-length of precursor-miRNA or short hairpin RNA (shRNA) molecules is a major determinant for Dicer versus Ago2 processing. Here we present the results of a deep sequence study on the processing of shRNAs with different stem lengths and a top G·U wobble base pair (bp). This analysis revealed some unexpected properties of these so-called AgoshRNA molecules that are processed by Ago2 instead of Dicer. First, we confirmed the gradual shift from Dicer to Ago2 processing upon shortening of the hairpin length. Second, hairpins with a stem longer than 19 base pairs are inefficiently cleaved by Ago2, and we noticed a shift in the cleavage site. Third, the introduction of a top G·U bp in a regular shRNA can promote Ago2-cleavage, which coincides with a loss of Ago2-loading of the Dicer-cleaved 3' strand. Fourth, the Ago2-processed AgoshRNAs acquire a short 3' tail of 1–3 A-nucleotides (nt) and we present evidence that this product is subsequently trimmed by the poly(A)-specific ribonuclease (PARN). PMID:26172504
Chen, Ping; Zhang, Limin; Guo, Xiaoxuan; Dai, Xin; Liu, Li; Xi, Lijun; Wang, Jian; Song, Lei; Wang, Yuezhu; Zhu, Yaxin; Huang, Li; Huang, Ying
2016-01-01
The phylum Actinobacteria has been reported to be common or even abundant in deep marine sediments; however, knowledge about the diversity, distribution, and function of actinobacteria is limited. In this study, actinobacterial diversity in the deep sea along the Southwest Indian Ridge (SWIR) was investigated using both 16S rRNA gene pyrosequencing and culture-based methods. The samples were collected at depths of 1662–4000 m below water surface. Actinobacterial sequences represented 1.2–9.1% of all microbial 16S rRNA gene amplicon sequences in each sample. A total of 5 actinobacterial classes, 17 orders, 28 families, and 52 genera were detected by pyrosequencing, dominated by the classes Acidimicrobiia and Actinobacteria. Differences in actinobacterial community compositions were found among the samples. The community structure showed significant correlations to geochemical factors, notably pH, calcium, total organic carbon, total phosphorus, and total nitrogen, rather than to spatial distance at the scale of the investigation. In addition, 176 strains of the Actinobacteria class, belonging to 9 known orders, 18 families, and 29 genera, were isolated. Among these cultivated taxa, 8 orders, 13 families, and 15 genera were also recovered by pyrosequencing. At a 97% 16S rRNA gene sequence similarity, the pyrosequencing data encompassed 77.3% of the isolates but the isolates represented only 10.3% of the actinobacterial reads. Phylogenetic analysis of all the representative actinobacterial sequences and isolates indicated that at least four new orders within the phylum Actinobacteria were detected by pyrosequencing. More than half of the isolates spanning 23 genera and all samples demonstrated activity in the degradation of refractory organics, including polycyclic aromatic hydrocarbons and polysaccharides, suggesting their potential ecological functions and biotechnological applications for carbon recycling. PMID:27621725
Jima, Dereje D.; Zhang, Jenny; Jacobs, Cassandra; Richards, Kristy L.; Dunphy, Cherie H.; Choi, William W. L.; Yan Au, Wing; Srivastava, Gopesh; Czader, Magdalena B.; Rizzieri, David A.; Lagoo, Anand S.; Lugar, Patricia L.; Mann, Karen P.; Flowers, Christopher R.; Bernal-Mizrachi, Leon; Naresh, Kikkeri N.; Evens, Andrew M.; Gordon, Leo I.; Luftig, Micah; Friedman, Daphne R.; Weinberg, J. Brice; Thompson, Michael A.; Gill, Javed I.; Liu, Qingquan; How, Tam; Grubor, Vladimir; Gao, Yuan; Patel, Amee; Wu, Han; Zhu, Jun; Blobe, Gerard C.; Lipsky, Peter E.; Chadburn, Amy
2010-01-01
A role for microRNA (miRNA) has been recognized in nearly every biologic system examined thus far. A complete delineation of their role must be preceded by the identification of all miRNAs present in any system. We elucidated the complete small RNA transcriptome of normal and malignant B cells through deep sequencing of 31 normal and malignant human B-cell samples that comprise the spectrum of B-cell differentiation and common malignant phenotypes. We identified the expression of 333 known miRNAs, which is more than twice the number previously recognized in any tissue type. We further identified the expression of 286 candidate novel miRNAs in normal and malignant B cells. These miRNAs were validated at a high rate (92%) using quantitative polymerase chain reaction, and we demonstrated their application in the distinction of clinically relevant subgroups of lymphoma. We further demonstrated that a novel miRNA cluster, previously annotated as a hypothetical gene LOC100130622, contains 6 novel miRNAs that regulate the transforming growth factor-β pathway. Thus, our work suggests that more than a third of the miRNAs present in most cellular types are currently unknown and that these miRNAs may regulate important cellular functions. PMID:20733160
Jobst-Schwan, Tilman; Schmidt, Johanna Magdalena; Schneider, Ronen; Hoogstraten, Charlotte A; Ullmann, Jeremy F P; Schapiro, David; Majmundar, Amar J; Kolb, Amy; Eddy, Kaitlyn; Shril, Shirlee; Braun, Daniela A; Poduri, Annapurna; Hildebrandt, Friedhelm
2018-01-01
Until recently, morpholino oligonucleotides have been widely employed in zebrafish as an acute and efficient loss-of-function assay. However, off-target effects and reproducibility issues when compared to stable knockout lines have compromised their further use. Here we employed an acute CRISPR/Cas approach using multiple single guide RNAs targeting simultaneously different positions in two exemplar genes (osgep or tprkb) to increase the likelihood of generating mutations on both alleles in the injected F0 generation and to achieve a similar effect as morpholinos but with the reproducibility of stable lines. This multi single guide RNA approach resulted in median likelihoods for at least one mutation on each allele of >99% and sgRNA specific insertion/deletion profiles as revealed by deep-sequencing. Immunoblot showed a significant reduction for Osgep and Tprkb proteins. For both genes, the acute multi-sgRNA knockout recapitulated the microcephaly phenotype and reduction in survival that we observed previously in stable knockout lines, though milder in the acute multi-sgRNA knockout. Finally, we quantify the degree of mutagenesis by deep sequencing, and provide a mathematical model to quantitate the chance for a biallelic loss-of-function mutation. Our findings can be generalized to acute and stable CRISPR/Cas targeting for any zebrafish gene of interest.
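To illustrate the kind of calculation the abstract above alludes to, here is a deliberately simplified sketch of the chance that both alleles carry at least one mutation when several sgRNAs are injected together. It is not the authors' model. It assumes that each guide mutates a given allele with its own probability, that guides and alleles act independently, and that every mutation counts equally (in practice not every indel causes loss of function). The function names and the example efficiencies are hypothetical.

```python
import math


def p_allele_mutated(guide_efficiencies):
    """Probability that one allele carries at least one mutation.

    Assumes each guide hits the allele independently with the given
    per-allele probability (a simplifying assumption).
    """
    for p in guide_efficiencies:
        if not 0.0 <= p <= 1.0:
            raise ValueError(f"efficiency out of range: {p}")
    return 1.0 - math.prod(1.0 - p for p in guide_efficiencies)


def p_biallelic(guide_efficiencies):
    """Probability that both alleles are mutated, assuming independent alleles."""
    return p_allele_mutated(guide_efficiencies) ** 2


# Hypothetical per-guide efficiencies (assumption: illustrative values only).
two_guides = p_biallelic([0.5, 0.5])
four_guides = p_biallelic([0.5, 0.5, 0.5, 0.5])
print(f"two guides:  {two_guides:.4f}")
print(f"four guides: {four_guides:.4f}")
```

Under these assumptions, adding guides raises the biallelic probability quickly, which is the intuition behind targeting several positions in the same gene at once.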
Gu, Yifeng; Zhang, Lei; Chen, Xiaowu
2014-08-01
MicroRNAs (miRNAs) play an important role in gonadal development and differentiation in fish. However, understanding of the mechanism of this process is hindered by our poor knowledge of miRNA expression patterns in fish gonads. In this study, miRNA libraries derived from adult gonads of Paralichthys olivaceus were generated by using next-generation sequencing (NGS) technology. Bioinformatics analysis was performed to distinguish mature miRNA sequences from two classes of small RNAs represented in the sequencing data. A total of 141 mature miRNAs were identified, of which 21 were found in P. olivaceus for the first time. Variation and bias in miRNA expression were inferred from the deep sequencing reads. Some miRNAs, such as pol-miR-143, pol-miR-26a and pol-let-7a, were found at quite high expression levels in both gonads, while others exhibited clearly sex-biased expression. Approximately 20.0% and 13.1% of the isolated miRNAs were preferentially expressed in the testis (FC<0.5) or ovary (FC>2), respectively. The identification and preliminary analysis of the sex-biased expression of miRNAs in P. olivaceus gonads by NGS provide a basic catalog of miRNAs to facilitate future investigation of sexual regulatory mechanisms in P. olivaceus.
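The fold-change thresholds quoted in the abstract above (FC<0.5 for testis, FC>2 for ovary) can be applied as a simple classification rule. The sketch below assumes that FC is the ovary-to-testis expression ratio, which is consistent with the stated thresholds but not spelled out in the abstract. The pseudocount, the function name and the toy counts are likewise assumptions for illustration.

```python
def classify_sex_bias(ovary, testis, pseudocount=1.0):
    """Label a miRNA by its ovary/testis fold change (FC).

    The pseudocount (an assumption, not from the abstract) avoids
    division by zero for miRNAs absent from one library.
    """
    fc = (ovary + pseudocount) / (testis + pseudocount)
    if fc < 0.5:
        return "testis-biased"
    if fc > 2:
        return "ovary-biased"
    return "unbiased"


# Hypothetical normalized read counts (ovary, testis); names are placeholders.
toy_counts = {
    "miR-example-1": (99, 9),
    "miR-example-2": (9, 99),
    "miR-example-3": (50, 50),
}

labels = {name: classify_sex_bias(o, t) for name, (o, t) in toy_counts.items()}
for name, label in labels.items():
    print(f"{name}: {label}")
```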
USDA-ARS's Scientific Manuscript database
While breast milk has unique health advantages for infants, the mechanisms by which it regulates the physiology of newborns are incompletely understood. miRNAs have been described as functioning transcellularly, and have been previously isolated in cell-free and exosomal form from bodily liquids (se...
omiRas: a Web server for differential expression analysis of miRNAs derived from small RNA-Seq data.
Müller, Sören; Rycak, Lukas; Winter, Peter; Kahl, Günter; Koch, Ina; Rotter, Björn
2013-10-15
Small RNA deep sequencing is widely used to characterize non-coding RNAs (ncRNAs) differentially expressed between two conditions, e.g. healthy and diseased individuals, and to reveal insights into molecular mechanisms underlying condition-specific phenotypic traits. The ncRNAome is composed of a multitude of RNAs, such as transfer RNA, small nucleolar RNA and microRNA (miRNA), to name a few. Here we present omiRas, a Web server for the annotation, comparison and visualization of interaction networks of ncRNAs derived from next-generation sequencing experiments of two different conditions. The Web tool allows the user to submit raw sequencing data and results are presented as: (i) static annotation results including length distribution, mapping statistics, alignments and quantification tables for each library as well as lists of differentially expressed ncRNAs between conditions and (ii) an interactive network visualization of user-selected miRNAs and their target genes based on the combination of several miRNA-mRNA interaction databases. The omiRas Web server is implemented in Python, PostgreSQL and R, and can be accessed at: http://tools.genxpro.net/omiras/.
The green ash transcriptome and identification of genes responding to abiotic and biotic stresses
Thomas Lane; Teodora Best; Nicole Zembower; Jack Davitt; Nathan Henry; Yi Xu; Jennifer Koch; Haiying Liang; John McGraw; Stephan Schuster; Donghwan Shim; Mark V. Coggeshall; John E. Carlson; Margaret E. Staton
2016-01-01
Background: To develop a set of transcriptome sequences to support research on environmental stress responses in green ash (Fraxinus pennsylvanica), we undertook deep RNA sequencing of green ash tissues under various stress treatments. The treatments, including emerald ash borer (EAB) feeding, heat, drought, cold and ozone, were selected to mimic...
An integrated expression atlas of miRNAs and their promoters in human and mouse
de Rie, Derek; Abugessaisa, Imad; Alam, Tanvir; Arner, Erik; Arner, Peter; Ashoor, Haitham; Åström, Gaby; Babina, Magda; Bertin, Nicolas; Burroughs, A. Maxwell; Carlisle, Ailsa J.; Daub, Carsten O.; Detmar, Michael; Deviatiiarov, Ruslan; Fort, Alexandre; Gebhard, Claudia; Goldowitz, Daniel; Guhl, Sven; Ha, Thomas J.; Harshbarger, Jayson; Hasegawa, Akira; Hashimoto, Kosuke; Herlyn, Meenhard; Heutink, Peter; Hitchens, Kelly J.; Hon, Chung Chau; Huang, Edward; Ishizu, Yuri; Kai, Chieko; Kasukawa, Takeya; Klinken, Peter; Lassmann, Timo; Lecellier, Charles-Henri; Lee, Weonju; Lizio, Marina; Makeev, Vsevolod; Mathelier, Anthony; Medvedeva, Yulia A.; Mejhert, Niklas; Mungall, Christopher J.; Noma, Shohei; Ohshima, Mitsuhiro; Okada-Hatakeyama, Mariko; Persson, Helena; Rizzu, Patrizia; Roudnicky, Filip; Sætrom, Pål; Sato, Hiroki; Severin, Jessica; Shin, Jay W.; Swoboda, Rolf K.; Tarui, Hiroshi; Toyoda, Hiroo; Vitting-Seerup, Kristoffer; Winteringham, Louise; Yamaguchi, Yoko; Yasuzawa, Kayoko; Yoneda, Misako; Yumoto, Noriko; Zabierowski, Susan; Zhang, Peter G.; Wells, Christine A.; Summers, Kim M.; Kawaji, Hideya; Sandelin, Albin; Rehli, Michael; Hayashizaki, Yoshihide; Carninci, Piero; Forrest, Alistair R. R.; de Hoon, Michiel J. L.
2018-01-01
MicroRNAs (miRNAs) are short non-coding RNAs with key roles in cellular regulation. As part of the fifth edition of the Functional Annotation of Mammalian Genome (FANTOM5) project, we created an integrated expression atlas of miRNAs and their promoters by deep-sequencing 492 short RNA (sRNA) libraries, with matching Cap Analysis Gene Expression (CAGE) data, from 396 human and 47 mouse RNA samples. Promoters were identified for 1,357 human and 804 mouse miRNAs and showed strong sequence conservation between species. We also found that primary and mature miRNA expression levels were correlated, allowing us to use the primary miRNA measurements as a proxy for mature miRNA levels in a total of 1,829 human and 1,029 mouse CAGE libraries. We thus provide a broad atlas of miRNA expression and promoters in primary mammalian cells, establishing a foundation for detailed analysis of miRNA expression patterns and transcriptional control regions. PMID:28829439
Zhang, Jiayao; Zhao, Shuqi; Zheng, Hong; Gao, Ge; Wei, Liping; Li, Yi
2011-01-01
RNA silencing, mediated by small RNAs including microRNAs (miRNAs) and small interfering RNAs (siRNAs), is a potent antiviral or antibacterial mechanism, besides regulating normal cellular gene expression critical for development and physiology. To gain insights into host small RNA metabolism under infections by different viruses, we used Solexa/Illumina deep sequencing to characterize the small RNA profiles of rice plants infected by two distinct viruses, Rice dwarf virus (RDV, dsRNA virus) and Rice stripe virus (RSV, a negative sense and ambisense RNA virus), respectively, as compared with those from non-infected plants. Our analyses showed that RSV infection enhanced the accumulation of some rice miRNA*s, but not their corresponding miRNAs, as well as accumulation of phased siRNAs from a particular precursor. Furthermore, RSV infection also induced the expression of novel miRNAs in a phased pattern from several conserved miRNA precursors. In comparison, no such changes in host small RNA expression were observed in RDV-infected rice plants. Significantly, RSV infection elevated the expression levels of selective OsDCLs and OsAGOs, whereas RDV infection only affected the expression of certain OsRDRs. Our results provide a comparative analysis, via deep sequencing, of changes in the small RNA profiles and in the genes of RNA silencing machinery induced by different viruses in a natural and economically important crop host plant. They uncover new mechanisms and complexity of virus-host interactions that may have important implications for further studies on the evolution of cellular small RNA biogenesis that impact pathogen infection, pathogenesis, as well as organismal development. PMID:21901091
Kimura, Hiroyuki; Sugihara, Maki; Kato, Kenji; Hanada, Satoshi
2006-01-01
Deep-subsurface samples obtained by deep drilling are likely to be contaminated with mesophilic microorganisms in the drilling fluid, and this could affect determination of the community structure of the geothermal microflora using 16S rRNA gene clone library analysis. To eliminate possible contamination by PCR-amplified 16S rRNA genes from mesophiles, a combined thermal denaturation and enzyme digestion method, based on a strong correlation between the G+C content of the 16S rRNA gene and the optimum growth temperatures of most known prokaryotic cultures, was used prior to clone library construction. To validate this technique, hot spring fluid (76°C) and river water (14°C) were used to mimic a deep-subsurface sample contaminated with drilling fluid. After DNA extraction and PCR amplification of the 16S rRNA genes from individual samples separately, the amplified products from river water were observed to be denatured at 82°C and completely digested by exonuclease I (Exo I), while the amplified products from hot spring fluid remained intact after denaturation at 84°C and enzyme digestion with Exo I. DNAs extracted from the two samples were mixed and used as a template for amplification of the 16S rRNA genes. The amplified rRNA genes were denatured at 84°C and digested with Exo I before clone library construction. The results indicated that the 16S rRNA gene sequences from the river water were almost completely eliminated, whereas those from the hot spring fluid remained. PMID:16391020
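The selection step described above relies on the G+C content of the 16S rRNA gene. As a rough illustration only, G+C content can be computed as the fraction of G and C bases in a sequence. The function name and the toy sequences below are assumptions made for this sketch and do not come from the study.

```python
def gc_content(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    seq = seq.upper()
    if not seq:
        raise ValueError("sequence must not be empty")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# Toy sequences (invented for illustration, not real 16S rRNA genes).
high_gc = "GGCCGGCCAT"
low_gc = "ATATATGCAT"
print(gc_content(high_gc), gc_content(low_gc))
```

Amplicons with higher G+C content are expected to denature at higher temperatures, which is the property the combined denaturation and Exo I digestion exploits.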
Yang, Kai-Chien; Yamada, Kathryn A; Patel, Akshar Y; Topkara, Veli K; George, Isaac; Cheema, Faisal H; Ewald, Gregory A; Mann, Douglas L; Nerbonne, Jeanne M
2014-03-04
Microarrays have been used extensively to profile transcriptome remodeling in failing human heart, although the genomic coverage provided is limited and fails to provide a detailed picture of the myocardial transcriptome landscape. Here, we describe sequencing-based transcriptome profiling, providing comprehensive analysis of myocardial mRNA, microRNA (miRNA), and long noncoding RNA (lncRNA) expression in failing human heart before and after mechanical support with a left ventricular (LV) assist device (LVAD). Deep sequencing of RNA isolated from paired nonischemic (NICM; n=8) and ischemic (ICM; n=8) human failing LV samples collected before and after LVAD and from nonfailing human LV (n=8) was conducted. These analyses revealed high abundance of mRNA (37%) and lncRNA (71%) of mitochondrial origin. miRNASeq revealed 160 and 147 differentially expressed miRNAs in ICM and NICM, respectively, compared with nonfailing LV. Among these, only 2 (ICM) and 5 (NICM) miRNAs are normalized with LVAD. RNASeq detected 18 480, including 113 novel, lncRNAs in human LV. Among the 679 (ICM) and 570 (NICM) lncRNAs differentially expressed with heart failure, ≈10% are improved or normalized with LVAD. In addition, the expression signature of lncRNAs, but not miRNAs or mRNAs, distinguishes ICM from NICM. Further analysis suggests that cis-gene regulation represents a major mechanism of action of human cardiac lncRNAs. The myocardial transcriptome is dynamically regulated in advanced heart failure and after LVAD support. The expression profiles of lncRNAs, but not mRNAs or miRNAs, can discriminate failing hearts of different pathologies and are markedly altered in response to LVAD support. These results suggest an important role for lncRNAs in the pathogenesis of heart failure and in reverse remodeling observed with mechanical support.
Guo, Feng; Wang, Zhi-Ping; Yu, Ke; Zhang, T.
2015-01-01
Foaming of activated sludge (AS) causes adverse impacts on wastewater treatment operation and hygiene. In this study, we investigated the microbial communities of foam, foaming AS and non-foaming AS in a sewage treatment plant via deep-sequencing of the taxonomic marker genes 16S rRNA and mycobacterial rpoB and a metagenomic approach. In addition to Actinobacteria, many genera (e.g., Clostridium XI, Arcobacter, Flavobacterium) were more abundant in the foam than in the AS. On the other hand, deep-sequencing of rpoB did not detect any obligate pathogenic mycobacteria in the foam. We found that unknown factors other than the abundance of Gordonia sp. could determine the foaming process, because abundance of the same species was stable before and after a foaming event over six months. More interestingly, although the dominant Gordonia foam former was most closely related to G. amarae, it was identified as an undescribed Gordonia species by referring to the 16S rRNA gene, gyrB and, most convincingly, the reconstructed draft genome from metagenomic reads. Our results, based on metagenomics and deep sequencing, reveal that foams are derived from diverse taxa, which expands previous understanding and provides new insight into the underlying complications of the foaming phenomenon in AS. PMID:25560234
mRNA deep sequencing reveals 75 new genes and a complex transcriptional landscape in Mimivirus
Legendre, Matthieu; Audic, Stéphane; Poirot, Olivier; Hingamp, Pascal; Seltzer, Virginie; Byrne, Deborah; Lartigue, Audrey; Lescot, Magali; Bernadac, Alain; Poulain, Julie; Abergel, Chantal; Claverie, Jean-Michel
2010-01-01
Mimivirus, a virus infecting Acanthamoeba, is the prototype of the Mimiviridae, the latest addition to the nucleocytoplasmic large DNA viruses. The Mimivirus genome encodes close to 1000 proteins, many of them never before encountered in a virus, such as four amino-acyl tRNA synthetases. To explore the physiology of this exceptional virus and identify the genes involved in the building of its characteristic intracytoplasmic “virion factory,” we coupled electron microscopy observations with the massively parallel pyrosequencing of the polyadenylated RNA fractions of Acanthamoeba castellanii cells at various times post-infection. We generated 633,346 reads, of which 322,904 correspond to Mimivirus transcripts. This first application of deep mRNA sequencing (454 Life Sciences [Roche] FLX) to a large DNA virus allowed the precise delineation of the 5′ and 3′ extremities of Mimivirus mRNAs and revealed 75 new transcripts including several noncoding RNAs. Mimivirus genes are expressed across a wide dynamic range, in a finely regulated manner broadly described by three main temporal classes: early, intermediate, and late. This RNA-seq study confirmed the AAAATTGA sequence as an early promoter element, as well as the presence of palindromes at most of the polyadenylation sites. It also revealed a new promoter element correlating with late gene expression, which is also prominent in Sputnik, the recently described Mimivirus “virophage.” These results, validated genome-wide by the hybridization of total RNA extracted from infected Acanthamoeba cells on a tiling array (Agilent), will constitute the foundation on which to build subsequent functional studies of the Mimivirus/Acanthamoeba system. PMID:20360389
The small RNA profile in latex from Hevea brasiliensis trees is affected by tapping panel dryness.
Gébelin, Virginie; Leclercq, Julie; Kuswanhadi; Argout, Xavier; Chaidamsari, Tetty; Hu, Songnian; Tang, Chaorong; Sarah, Gautier; Yang, Meng; Montoro, Pascal
2013-10-01
Natural rubber is harvested by tapping Hevea brasiliensis (Willd. ex A. Juss.) Müll. Arg. Harvesting stress can lead to tapping panel dryness (TPD). MicroRNAs (miRNAs) are induced by abiotic stress and regulate gene expression by targeting the cleavage or translational inhibition of target messenger RNAs. This study set out to sequence miRNAs expressed in latex cells and to identify TPD-related putative targets. Deep sequencing of small RNAs was carried out on latex from trees affected by TPD using Solexa technology. The most abundant small RNA class size was 21 nucleotides for TPD trees compared with 24 nucleotides in healthy trees. By combining the LeARN pipeline, data from the Plant MicroRNA database and Hevea EST sequences, we identified 19 additional conserved and four putative species-specific miRNA families not found in previous studies on rubber. The relative transcript abundance of the Hbpre-MIR159b gene increased with TPD. This study revealed a small RNA-specific signature of TPD-affected trees. Both RNA degradation and a shift in miRNA biogenesis are suggested to explain the general decline in small RNAs and, particularly, in miRNAs.
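The comparison of the most abundant small RNA size class (21 versus 24 nucleotides) comes down to a read-length distribution. A minimal sketch of that tally follows, assuming adapter-trimmed reads held as plain strings; the function names and the toy reads are hypothetical and are not taken from the study's pipeline.

```python
from collections import Counter


def length_distribution(reads):
    """Count reads per length, returned as a dict sorted by length."""
    counts = Counter(len(read) for read in reads)
    return dict(sorted(counts.items()))


def most_abundant_length(reads):
    """Return the read length with the highest count (shortest wins a tie)."""
    counts = Counter(len(read) for read in reads)
    return max(counts, key=lambda length: (counts[length], -length))


# Toy libraries (invented): three 21-nt reads and one 24-nt read, and the reverse.
library_a = ["A" * 21, "C" * 21, "G" * 21, "T" * 24]
library_b = ["A" * 24, "C" * 24, "G" * 24, "T" * 21]
print(length_distribution(library_a), most_abundant_length(library_a))
print(length_distribution(library_b), most_abundant_length(library_b))
```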
Smola, Matthew J; Rice, Greggory M; Busan, Steven; Siegfried, Nathan A; Weeks, Kevin M
2015-11-01
Selective 2'-hydroxyl acylation analyzed by primer extension (SHAPE) chemistries exploit small electrophilic reagents that react with 2'-hydroxyl groups to interrogate RNA structure at single-nucleotide resolution. Mutational profiling (MaP) identifies modified residues by using reverse transcriptase to misread a SHAPE-modified nucleotide and then counting the resulting mutations by massively parallel sequencing. The SHAPE-MaP approach measures the structure of large and transcriptome-wide systems as accurately as can be done for simple model RNAs. This protocol describes the experimental steps, implemented over 3 d, that are required to perform SHAPE probing and to construct multiplexed SHAPE-MaP libraries suitable for deep sequencing. Automated processing of MaP sequencing data is accomplished using two software packages. ShapeMapper converts raw sequencing files into mutational profiles, creates SHAPE reactivity plots and provides useful troubleshooting information. SuperFold uses these data to model RNA secondary structures, identify regions with well-defined structures and visualize probable and alternative helices, often in under 1 d. SHAPE-MaP can be used to make nucleotide-resolution biophysical measurements of individual RNA motifs, rare components of complex RNA ensembles and entire transcriptomes.
Whole-Genome Characterization of Prunus necrotic ringspot virus Infecting Sweet Cherry in China
2018-01-01
Prunus necrotic ringspot virus (PNRSV) causes yield loss in most cultivated stone fruits, including sweet cherry. Using a small RNA deep-sequencing approach combined with end-genome sequence cloning, we identified the complete genomes of all three PNRSV strands from PNRSV-infected sweet cherry trees and compared them with those of two previously reported isolates. PMID:29496825
High fungal diversity and abundance recovered in the deep-sea sediments of the Pacific Ocean.
Xu, Wei; Pang, Ka-Lai; Luo, Zhu-Hua
2014-11-01
The presence and ecological significance of bacteria and archaea in deep-sea environments are well recognized, but eukaryotic microorganisms, such as fungi, have rarely been reported. The present study investigated the composition and abundance of the fungal community in the deep-sea sediments of the Pacific Ocean. A total of 1,947 internal transcribed spacer (ITS) regions of fungal rRNA gene clones were recovered from five sediment samples in the Pacific Ocean (water depths ranging from 5,017 to 6,986 m) using three different PCR primer sets. There were 16, 17, and 15 different operational taxonomic units (OTUs) identified from fungal-universal, Ascomycota-, and Basidiomycota-specific clone libraries, respectively. The majority of the recovered sequences belonged to diverse phylotypes of Ascomycota (25 phylotypes) and Basidiomycota (18 phylotypes). The multiple primer approach recovered a total of 27 phylotypes that showed low similarities (≤97 %) with available fungal sequences in GenBank, suggesting possible new fungal taxa occurring in the deep-sea environments or belonging to taxa not represented in GenBank. Our results also recovered high fungal LSU rRNA gene copy numbers (3.52 × 10^6 to 5.23 × 10^7 copies/g wet sediment) from the Pacific Ocean sediment samples, suggesting that the fungi might be involved in important ecological functions in the deep-sea environments.
Bi, Yaqi; Tugume, Arthur K.; Valkonen, Jari P. T.
2012-01-01
Background: Arctium species (Asteraceae) are distributed worldwide and are used as food and rich sources of secondary metabolites for the pharmaceutical industry, e.g., against avian influenza virus. RNA silencing is an antiviral defense mechanism that detects and destroys virus-derived double-stranded RNA, resulting in accumulation of virus-derived small RNAs (21–24 nucleotides) that can be used for generic detection of viruses by small-RNA deep sequencing (SRDS). Methodology/Principal Findings: SRDS was used to detect viruses in the biennial wild plant species Arctium tomentosum (woolly burdock; family Asteraceae) displaying virus-like symptoms of vein yellowing and leaf mosaic in southern Finland. Assembly of the small-RNA reads resulted in contigs homologous to Alstroemeria virus X (AlsVX), a positive/single-stranded RNA virus of genus Potexvirus (family Alphaflexiviridae), or related to negative/single-stranded RNA viruses of the genus Emaravirus. The coat protein gene of AlsVX was 81% and 89% identical to the two AlsVX isolates from Japan and Norway, respectively. The deduced, partial nucleocapsid protein amino acid sequence of the emara-like virus was only 78% or less identical to reported emaraviruses and showed no variability among the virus isolates characterized. This virus, tentatively named Woolly burdock yellow vein virus, was exclusively associated with yellow vein and leaf mosaic symptoms in woolly burdock, whereas AlsVX was detected in only one of the 52 plants tested. Conclusions/Significance: These results provide novel information about natural virus infections in Arctium species and reveal woolly burdock as the first natural host of AlsVX besides Alstroemeria (family Alstroemeriaceae). Results also revealed a new virus related to the recently emerged Emaravirus genus and demonstrated applicability of SRDS to detect negative-strand RNA viruses.
SRDS potentiates virus surveys of wild plants, a research area underrepresented in plant virology, and helps reveal natural reservoirs of viruses that cause yield losses in cultivated plants. PMID:22912734
microRNA expression profiling in fetal single ventricle malformation identified by deep sequencing.
Yu, Zhang-Bin; Han, Shu-Ping; Bai, Yun-Fei; Zhu, Chun; Pan, Ya; Guo, Xi-Rong
2012-01-01
microRNAs (miRNAs) have emerged as key regulators in many biological processes, particularly cardiac growth and development, although the specific miRNA expression profile associated with this process remains to be elucidated. This study aimed to characterize the cellular microRNA profile involved in the development of congenital heart malformation, through the investigation of single ventricle (SV) defects. Comprehensive miRNA profiling in human fetal SV cardiac tissue was performed by deep sequencing. Differential expression of 48 miRNAs was revealed by sequencing by oligonucleotide ligation and detection (SOLiD) analysis. Of these, 38 were down-regulated and 10 were up-regulated in differentiated SV cardiac tissue, compared to control cardiac tissue. This was confirmed by real-time quantitative reverse transcription-polymerase chain reaction (qRT-PCR) analysis. Predicted target genes of the 48 differentially expressed miRNAs were analyzed by gene ontology and categorized according to cellular process, regulation of biological process and metabolic process. Pathway-Express analysis identified the WNT and mTOR signaling pathways as the most significant processes putatively affected by the differential expression of these miRNAs. The candidate genes involved in cardiac development were identified as potential targets for these differentially expressed microRNAs and the collaborative network of microRNAs and cardiac development related-mRNAs was constructed. These data provide the basis for future investigation of the mechanism of the occurrence and development of fetal SV malformations.
West, L; Powers, D
1993-01-01
Although it is generally accepted that the first multicellular organisms arose from unicellular ancestors, the phylogenetic relationships linking these groups remain unclear. Anatomical, physiological, and molecular studies of current multicellular organisms with relatively simple body organization suggest key characteristics of the earliest multicellular lineages. Glass sponges, the Hexactinellida, possess cellular characteristics that resemble some unicellular protistan organisms. These unique sponges were abundant in shallow seas of the early Cambrian, but they are currently restricted to polar habitats or very deep regions of the world oceans. Due in part to their relative inaccessibility, their potential significance to the early phylogeny of the eukaryotic kingdoms has been largely overlooked. We used sequences of the 18S ribosomal RNA gene of Farrea occa, a representative of the deep-water hexactinellid sponges, and Coelocarteria singaporense, a representative of the more common demosponges, and compared them with selected ribosomal RNA gene sequences available within the Protista. Using four computational methods for phylogenetic analysis of ribosomal DNA sequences, we found that the hexactinellid sponge-demosponge cluster is most closely related to Volvox and Acanthamoeba.
Ding, Jian; Ruan, Chengjiang; Guan, Ying; Krishna, Priti
2018-03-05
Sea buckthorn is a plant of medicinal and nutritional importance owing in part to the high levels of essential fatty acids, linoleic (up to 42%) and α-linolenic (up to 39%) acids in the seed oil. Sea buckthorn can produce seeds either via the sexual pathway or by apomixis. The seed development and maturation programs are critically dependent on miRNAs. To understand miRNA-mediated regulation of sea buckthorn seed development, eight small RNA libraries were constructed for deep sequencing from developing seeds of a low oil content line 'SJ1' and a high oil content line 'XE3'. High-throughput sequencing identified 137 known miRNAs from 27 families and 264 novel miRNAs. The potential targets of the identified miRNAs were predicted based on sequence homology. Nineteen (four known and 15 novel) and 22 (six known and 16 novel) miRNAs were found to be involved in lipid biosynthesis and seed size, respectively. An integrated analysis of mRNA and miRNA transcriptome and qRT-PCR identified some key miRNAs and their targets (miR164d-ARF2, miR168b-Δ9D, novelmiRNA-108-ACC, novelmiRNA-23-GPD1, novelmiRNA-58-DGAT1, and novelmiRNA-191-DGAT2) potentially involved in seed size and lipid biosynthesis of sea buckthorn seed. These results indicate the potential importance of miRNAs in regulating lipid biosynthesis and seed size in sea buckthorn.
Ding, Jiarui; Condon, Anne; Shah, Sohrab P
2018-05-21
Single-cell RNA-sequencing has great potential to discover cell types, identify cell states, trace development lineages, and reconstruct the spatial organization of cells. However, dimension reduction to interpret structure in single-cell sequencing data remains a challenge. Existing algorithms are either not able to uncover the clustering structures in the data or lose global information such as groups of clusters that are close to each other. We present a robust statistical model, scvis, to capture and visualize the low-dimensional structures in single-cell gene expression data. Simulation results demonstrate that low-dimensional representations learned by scvis preserve both the local and global neighbor structures in the data. In addition, scvis is robust to the number of data points and learns a probabilistic parametric mapping function to add new data points to an existing embedding. We then use scvis to analyze four single-cell RNA-sequencing datasets, exemplifying interpretable two-dimensional representations of the high-dimensional single-cell RNA-sequencing data.
HSA: a heuristic splice alignment tool.
Bu, Jingde; Chi, Xuebin; Jin, Zhong
2013-01-01
RNA-Seq is a revolutionary transcriptome sequencing technology and a representative application of next-generation sequencing (NGS). The high throughput of RNA-Seq makes it possible to obtain much more information, such as differential expression and novel splice variants, from deep sequence analysis and data mining. However, the short read length poses a great challenge for alignment, especially when a read spans two or more exons. This investigation presents a two-step heuristic splice alignment tool. First, raw reads are mapped to the reference with the unspliced aligner BWA; second, each initially unmapped read is split into three equal short reads (seeds), each seed is aligned to the reference, the hits are filtered, the possible split position of the read is searched for, and the hits are extended to a complete match. Compared with other splice alignment tools such as SOAPsplice and TopHat2, HSA performs better in call rate and efficiency, but its results are somewhat less accurate. HSA is an effective spliced aligner for RNA-Seq read mapping and is available at https://github.com/vlcc/HSA.
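The second step (split into three seeds, anchor the seeds, search for the split position, extend to a full match) can be sketched in a few lines. This is a toy illustration under strong assumptions and not HSA's implementation: exact string matching stands in for a real aligner such as BWA, only the first and last seeds are used as anchors, mismatches are not tolerated, and all function names and sequences are invented for the example.

```python
def split_into_seeds(read, n_seeds=3):
    """Split a read into n_seeds contiguous seeds; the last seed takes any remainder."""
    size = len(read) // n_seeds
    seeds = []
    for i in range(n_seeds):
        start = i * size
        end = (i + 1) * size if i < n_seeds - 1 else len(read)
        seeds.append((start, read[start:end]))
    return seeds


def find_exact_hits(seed, reference):
    """Return every 0-based reference position where the seed matches exactly."""
    hits = []
    pos = reference.find(seed)
    while pos != -1:
        hits.append(pos)
        pos = reference.find(seed, pos + 1)
    return hits


def find_split(read, reference):
    """Return (left_start, split, right_start) for a spliced match, or None.

    read[:split] matches the reference at left_start and read[split:]
    matches at right_start, with a gap (putative intron) in between.
    """
    seeds = split_into_seeds(read)
    _, first_seed = seeds[0]
    last_offset, last_seed = seeds[-1]
    for left_start in find_exact_hits(first_seed, reference):
        for right_hit in find_exact_hits(last_seed, reference):
            # Reference position that read position 0 would have on the right side.
            right_origin = right_hit - last_offset
            if right_origin <= left_start:
                continue  # no gap between the anchors: not a spliced placement
            # Extend the left anchor base by base as far as it matches.
            split = len(first_seed)
            while (split < len(read)
                   and left_start + split < len(reference)
                   and reference[left_start + split] == read[split]):
                split += 1
            # Try split positions from the longest left extension downwards.
            for s in range(split, len(first_seed) - 1, -1):
                right_start = right_origin + s
                if reference[right_start:right_start + len(read) - s] == read[s:]:
                    return left_start, s, right_start
    return None


# Toy reference: exon1 + intron + exon2 (sequences invented for illustration).
exon1, intron, exon2 = "ATGGCCATTGTA", "GTAAGTCCCCCCCCTTTTCAG", "CGGGTCTAAGCT"
reference = exon1 + intron + exon2
read = exon1[-9:] + exon2[:9]  # a read spanning the exon1/exon2 junction
print(find_split(read, reference))
```

For the toy read, the left part aligns from reference position 3, the split falls after read base 9, and the right part resumes at position 33, immediately after the intron.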
NASA Astrophysics Data System (ADS)
Ward, N.; Page, S.; Heidelberg, J.; Eisen, J. A.; Fraser, C. M.
2002-12-01
The composition of microbial communities associated with deep-sea hydrothermal vent animals is of interest because of the key role of bacterial symbionts in driving the chemosynthetic food chain of the vent system, and also because bacterial biofilms attached to animal exterior surfaces may play a part in settlement of larval forms. Sequence analysis of 16S ribosomal RNA (rRNA) genes from such communities provides a snapshot of community structure, as this gene is present in all Bacteria and Archaea, and a useful phylogenetic marker for both cultivated microbial species, and uncultivated species such as many of those found in the deep-sea environment. Specimens of giant tube worms (Riftia pachyptila), mussels (Bathymodiolus thermophilus), and clams (Calyptogena magnifica) were collected during the 2002 R/V Atlantis research cruises to the East Pacific Rise (9N) and Galápagos Rift. Microbial biofilms attached to the exterior surfaces of individual animals were sampled, as were tissues known to harbor chemosynthetic bacterial endosymbionts. Genomic DNA was extracted from the samples using a commercially available kit, and 16S rRNA genes amplified from the mixed bacterial communities using the polymerase chain reaction (PCR) and oligonucleotide primers targeting conserved terminal regions of the 16S rRNA gene. The PCR products obtained were cloned into a plasmid vector and the recombinant plasmids transformed into cells of Escherichia coli. Individual cloned 16S rRNA genes were sequenced at the 5' end of the gene (the most phylogenetically informative region in most taxa) and the sequence data compared to publicly available gene sequence databases, to allow a preliminary assignment of clones to taxonomic groups within the Bacteria and Archaea, and to determine the overall composition and phylogenetic diversity of the animal-associated microbial communities. 
Analysis of Riftia pachyptila exterior biofilm samples revealed the presence of members of the delta and epsilon proteobacteria, low GC Gram positive bacteria (firmicutes), spirochetes, CFB (Cytophaga-Flavobacterium-Bacteroides) group, green nonsulfur bacteria, acidobacteria, verrucomicrobia, and planctomycetes. The presence of the latter three taxonomic groups is of special interest, as they represent phylogenetically distinct groups within the Bacteria for which specific ecological functions have not yet been identified, but which have been found to be widely distributed and often numerically significant in diverse terrestrial and aquatic habitats. Although further sequencing is required to demonstrate the presence of a Riftia-associated microbial population distinct from that of the surrounding seawater, results available from three Riftia individuals from the East Pacific Rise suggest this to be the case. Analysis of microbial communities associated with the gill tissue of the mussel Bathymodiolus thermophilus shows a population dominated by gamma-Proteobacterial chemoautotrophic symbionts, although lower frequency novel phylotypes have been detected. Representatives of specific taxonomic groups have been selected for sequencing of the complete 16S rRNA gene, and the sequences used to reconstruct phylogenetic trees to more accurately determine the evolutionary relationships between the novel sequences, and available sequences for both cultured and non-cultured bacteria.
Spermine Condenses DNA, but Not RNA Duplexes
DOE Office of Scientific and Technical Information (OSTI.GOV)
Katz, Andrea M.; Tolokh, Igor S.; Pabit, Suzette A.
Interactions between the polyamine spermine and nucleic acids drive important cellular processes. Spermine condenses DNA, and some RNAs such as poly(rA):poly(rU). A large fraction of the spermine present in cells is bound to RNA, but apparently does not condense it. Here, we study the effect of spermine binding to short duplex RNA and DNA and compare our findings with predictions of molecular dynamics simulations. When small numbers of spermine are introduced, RNA with a designed sequence, containing a mixture of 14 GC pairs and 11 AU pairs, resists condensation relative to DNA of an equivalent sequence or to 25 base pair poly(rA):poly(rU) RNA. Comparison of wide-angle x-ray scattering profiles with simulation suggests that spermine is sequestered deep within the major groove of mixed sequence RNA, preventing condensation by limiting opportunities to bridge to other molecules as well as stabilizing the RNA by locking it into a particular conformation. In contrast, for DNA, simulations suggest that spermine binds external to the duplex, offering opportunities for intermolecular interaction. The goal of this study is to explain how RNA can remain soluble, and available for interaction with other molecules in the cell, despite the presence of spermine at concentrations high enough to precipitate DNA.
Schneider, Ronen; Hoogstraten, Charlotte A.; Schapiro, David; Majmundar, Amar J.; Kolb, Amy; Eddy, Kaitlyn; Shril, Shirlee; Braun, Daniela A.; Poduri, Annapurna
2018-01-01
Until recently, morpholino oligonucleotides have been widely employed in zebrafish as an acute and efficient loss-of-function assay. However, off-target effects and reproducibility issues when compared to stable knockout lines have compromised their further use. Here we employed an acute CRISPR/Cas approach using multiple single guide RNAs targeting simultaneously different positions in two exemplar genes (osgep or tprkb) to increase the likelihood of generating mutations on both alleles in the injected F0 generation and to achieve a similar effect as morpholinos but with the reproducibility of stable lines. This multi single guide RNA approach resulted in median likelihoods for at least one mutation on each allele of >99% and sgRNA specific insertion/deletion profiles as revealed by deep-sequencing. Immunoblot showed a significant reduction for Osgep and Tprkb proteins. For both genes, the acute multi-sgRNA knockout recapitulated the microcephaly phenotype and reduction in survival that we observed previously in stable knockout lines, though milder in the acute multi-sgRNA knockout. Finally, we quantify the degree of mutagenesis by deep sequencing, and provide a mathematical model to quantitate the chance for a biallelic loss-of-function mutation. Our findings can be generalized to acute and stable CRISPR/Cas targeting for any zebrafish gene of interest. PMID:29346415
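The abstract does not spell out the mathematical model, so the following is only a simplified sketch of how such a probability could be estimated, not the authors' model. It assumes each sgRNA mutates a given allele independently with a known per-allele efficiency, that the two alleles are independent, and it does not distinguish loss-of-function mutations from in-frame ones. Function names and efficiency values are hypothetical.

```python
from math import prod


def p_allele_mutated(sgrna_efficiencies):
    """Probability that at least one sgRNA mutates a single allele (independence assumed)."""
    return 1.0 - prod(1.0 - p for p in sgrna_efficiencies)


def p_biallelic(sgrna_efficiencies):
    """Probability that both alleles carry at least one mutation (alleles assumed independent)."""
    return p_allele_mutated(sgrna_efficiencies) ** 2


# Hypothetical per-allele efficiencies for four sgRNAs targeting the same gene.
efficiencies = [0.8, 0.8, 0.8, 0.8]
print(p_allele_mutated(efficiencies), p_biallelic(efficiencies))
```

Under these assumptions, adding guides raises the per-allele probability quickly, which is consistent in spirit with the multi-sgRNA rationale described above.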
Smola, Matthew J.; Rice, Greggory M.; Busan, Steven; Siegfried, Nathan A.; Weeks, Kevin M.
2016-01-01
SHAPE chemistries exploit small electrophilic reagents that react with the 2′-hydroxyl group to interrogate RNA structure at single-nucleotide resolution. Mutational profiling (MaP) identifies modified residues based on the ability of reverse transcriptase to misread a SHAPE-modified nucleotide and then counting the resulting mutations by massively parallel sequencing. The SHAPE-MaP approach measures the structure of large and transcriptome-wide systems as accurately as for simple model RNAs. This protocol describes the experimental steps, implemented over three days, required to perform SHAPE probing and construct multiplexed SHAPE-MaP libraries suitable for deep sequencing. These steps include RNA folding and SHAPE structure probing, mutational profiling by reverse transcription, library construction, and sequencing. Automated processing of MaP sequencing data is accomplished using two software packages. ShapeMapper converts raw sequencing files into mutational profiles, creates SHAPE reactivity plots, and provides useful troubleshooting information, often within an hour. SuperFold uses these data to model RNA secondary structures, identify regions with well-defined structures, and visualize probable and alternative helices, often in under a day. We illustrate these algorithms with the E. coli thiamine pyrophosphate riboswitch, E. coli 16S rRNA, and HIV-1 genomic RNAs. SHAPE-MaP can be used to make nucleotide-resolution biophysical measurements of individual RNA motifs, rare components of complex RNA ensembles, and entire transcriptomes. The straightforward MaP strategy greatly expands the number, length, and complexity of analyzable RNA structures. PMID:26426499
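Per nucleotide, mutational profiling amounts to comparing mutation rates between samples. The abstract does not state the formula, so the sketch below assumes one commonly described form: the mutation rate of the SHAPE-modified sample minus that of an untreated control, divided by the rate in a denatured control. The function names and counts are hypothetical, this is not ShapeMapper's interface, and any further normalization is omitted.

```python
def mutation_rate(mutations, depth):
    """Mutations per read at one nucleotide; NaN when there is no coverage."""
    return mutations / depth if depth > 0 else float("nan")


def raw_reactivity(modified, untreated, denatured):
    """Background-subtracted, denatured-normalized reactivity for one nucleotide.

    Each argument is a (mutation_count, read_depth) pair.
    Assumed form: (rate_modified - rate_untreated) / rate_denatured.
    """
    rate_mod = mutation_rate(*modified)
    rate_unt = mutation_rate(*untreated)
    rate_den = mutation_rate(*denatured)
    return (rate_mod - rate_unt) / rate_den


# Hypothetical counts at a single nucleotide, 1000 reads per sample.
print(raw_reactivity(modified=(50, 1000), untreated=(5, 1000), denatured=(90, 1000)))
```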
A deep intronic mutation in the SLC12A3 gene leads to Gitelman syndrome.
Nozu, Kandai; Iijima, Kazumoto; Nozu, Yoshimi; Ikegami, Ei; Imai, Takehide; Fu, Xue Jun; Kaito, Hiroshi; Nakanishi, Koichi; Yoshikawa, Norishige; Matsuo, Masafumi
2009-11-01
Many mutations have been detected in the SLC12A3 gene of Gitelman syndrome (GS, OMIM 263800) patients. In previous studies, only one mutant allele was detected in approximately 20 to 41% of patients with GS; however, the exact reason for the nonidentification has not been established. In this study, we used RT-PCR using mRNA to investigate for the first time transcript abnormalities caused by deep intronic mutation. Direct sequencing analysis of leukocyte DNA identified one base insertion in exon 6 (c.818_819insG), but no mutation was detected in another allele. We analyzed RNA extracted from leukocytes and urine sediments and detected unknown sequence containing 238bp between exons 13 and 14. The genomic DNA analysis of intron 13 revealed a single-base substitution (c.1670-191C>T) that creates a new donor splice site within the intron resulting in the inclusion of a novel cryptic exon in mRNA. This is the first report of creation of a splice site by a deep intronic single-nucleotide change in GS and the first report to detect the onset mechanism in a patient with GS and missing mutation in one allele. This molecular onset mechanism may partly explain the poor success rate of mutation detection in both alleles of patients with GS.
Wang, Fangquan; Li, Wenqi; Zhu, Jinyan; Fan, Fangjun; Wang, Jun; Zhong, Weigong; Wang, Ming-Bo; Liu, Qing; Zhu, Qian-Hao; Zhou, Tong; Lan, Ying; Zhou, Yijun; Yang, Jie
2016-05-11
Rice black-streaked dwarf virus (RBSDV) belongs to the genus Fijivirus in the family Reoviridae and causes severe yield loss in rice-producing areas in Asia. RNA silencing, as a natural defence mechanism against plant viruses, has been successfully exploited for engineering virus resistance in plants, including rice. In this study, we generated transgenic rice lines harbouring a hairpin RNA (hpRNA) construct targeting four RBSDV genes, S1, S2, S6 and S10, encoding the RNA-dependent RNA polymerase, the putative core protein, the RNA silencing suppressor and the outer capsid protein, respectively. Both field nursery and artificial inoculation assays of three generations of the transgenic lines showed that they had strong resistance to RBSDV infection. The RBSDV resistance in the segregating transgenic populations correlated perfectly with the presence of the hpRNA transgene. Furthermore, the hpRNA transgene was expressed in the highly resistant transgenic lines, giving rise to abundant levels of 21-24 nt small interfering RNA (siRNA). Small RNA deep sequencing of the RBSDV-resistant transgenic lines detected siRNAs from all four viral gene sequences in the hpRNA transgene, indicating that the whole chimeric fusion sequence can be efficiently processed by Dicer into siRNAs. Taken together, our results suggest that long hpRNA targeting multiple viral genes can be used to generate stable and durable virus resistance in rice, as well as other plant species.
2013-01-01
Background Deep sequencing of viruses isolated from infected hosts is an efficient way to measure population-genetic variation and can reveal patterns of dispersal and natural selection. In this study, we mined existing Illumina sequence reads to investigate single-nucleotide polymorphisms (SNPs) within two RNA viruses of the Western honey bee (Apis mellifera), deformed wing virus (DWV) and Israel acute paralysis virus (IAPV). All viral RNA was extracted from North American samples of honey bees or, in one case, the ectoparasitic mite Varroa destructor. Results Coverage depth was generally lower for IAPV than DWV, and marked gaps in coverage occurred in several narrow regions (< 50 bp) of IAPV. These coverage gaps occurred across sequencing runs and were virtually unchanged when reads were re-mapped with greater permissiveness (up to 8% divergence), suggesting a recurrent sequencing artifact rather than strain divergence. Consensus sequences of DWV for each sample showed little phylogenetic divergence, low nucleotide diversity, and strongly negative values of Fu and Li’s D statistic, suggesting a recent population bottleneck and/or purifying selection. The Kakugo strain of DWV fell outside of all other DWV sequences at 100% bootstrap support. IAPV consensus sequences supported the existence of multiple clades as had been previously reported, and Fu and Li’s D was closer to neutral expectation overall, although a sliding-window analysis identified a significantly positive D within the protease region, suggesting selection maintains diversity in that region. Within-sample mean diversity was comparable between the two viruses on average, although for both viruses there was substantial variation among samples in mean diversity at third codon positions and in the number of high-diversity sites. 
FST values were bimodal for DWV, likely reflecting neutral divergence in two low-diversity populations, whereas IAPV had several sites that were strong outliers with very low FST. Conclusions This initial survey of genetic variation within honey bee RNA viruses suggests future directions for studies examining the underlying causes of population-genetic structure in these economically important pathogens. PMID:23497218
Cornman, Robert Scott; Boncristiani, Humberto; Dainat, Benjamin; Chen, Yanping; vanEngelsdorp, Dennis; Weaver, Daniel; Evans, Jay D
2013-03-07
Deep sequencing of viruses isolated from infected hosts is an efficient way to measure population-genetic variation and can reveal patterns of dispersal and natural selection. In this study, we mined existing Illumina sequence reads to investigate single-nucleotide polymorphisms (SNPs) within two RNA viruses of the Western honey bee (Apis mellifera), deformed wing virus (DWV) and Israel acute paralysis virus (IAPV). All viral RNA was extracted from North American samples of honey bees or, in one case, the ectoparasitic mite Varroa destructor. Coverage depth was generally lower for IAPV than DWV, and marked gaps in coverage occurred in several narrow regions (< 50 bp) of IAPV. These coverage gaps occurred across sequencing runs and were virtually unchanged when reads were re-mapped with greater permissiveness (up to 8% divergence), suggesting a recurrent sequencing artifact rather than strain divergence. Consensus sequences of DWV for each sample showed little phylogenetic divergence, low nucleotide diversity, and strongly negative values of Fu and Li's D statistic, suggesting a recent population bottleneck and/or purifying selection. The Kakugo strain of DWV fell outside of all other DWV sequences at 100% bootstrap support. IAPV consensus sequences supported the existence of multiple clades as had been previously reported, and Fu and Li's D was closer to neutral expectation overall, although a sliding-window analysis identified a significantly positive D within the protease region, suggesting selection maintains diversity in that region. Within-sample mean diversity was comparable between the two viruses on average, although for both viruses there was substantial variation among samples in mean diversity at third codon positions and in the number of high-diversity sites. FST values were bimodal for DWV, likely reflecting neutral divergence in two low-diversity populations, whereas IAPV had several sites that were strong outliers with very low FST. 
This initial survey of genetic variation within honey bee RNA viruses suggests future directions for studies examining the underlying causes of population-genetic structure in these economically important pathogens.
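For readers unfamiliar with the diversity measures mentioned above, the following is a minimal sketch of per-site nucleotide diversity computed from allele counts at one genome position. It is an illustration only: the function name and input format are hypothetical, and the abstract does not say which estimator the authors used. The sketch uses the standard unbiased per-site heterozygosity.

```python
def site_diversity(allele_counts):
    """Unbiased nucleotide diversity at a single site.

    `allele_counts` maps each observed base to its read count, e.g.
    {"A": 90, "G": 10}. Returns n/(n-1) * (1 - sum(p_i^2)), or 0.0
    when fewer than two reads cover the site.
    """
    n = sum(allele_counts.values())
    if n < 2:
        return 0.0
    homozygosity = sum((c / n) ** 2 for c in allele_counts.values())
    return n / (n - 1) * (1.0 - homozygosity)
```

Averaging this value over sites, or over third codon positions only, gives the kind of within-sample mean diversity the abstract compares between the two viruses.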
Short intronic repeat sequences facilitate circular RNA production
Liang, Dongming
2014-01-01
Recent deep sequencing studies have revealed thousands of circular noncoding RNAs generated from protein-coding genes. These RNAs are produced when the precursor messenger RNA (pre-mRNA) splicing machinery “backsplices” and covalently joins, for example, the two ends of a single exon. However, the mechanism by which the spliceosome selects only certain exons to circularize is largely unknown. Using extensive mutagenesis of expression plasmids, we show that miniature introns containing the splice sites along with short (∼30- to 40-nucleotide) inverted repeats, such as Alu elements, are sufficient to allow the intervening exons to circularize in cells. The intronic repeats must base-pair to one another, thereby bringing the splice sites into close proximity to each other. More than simple thermodynamics is clearly at play, however, as not all repeats support circularization, and increasing the stability of the hairpin between the repeats can sometimes inhibit circular RNA biogenesis. The intronic repeats and exonic sequences must collaborate with one another, and a functional 3′ end processing signal is required, suggesting that circularization may occur post-transcriptionally. These results suggest detailed and generalizable models that explain how the splicing machinery determines whether to produce a circular noncoding RNA or a linear mRNA. PMID:25281217
Lim, Chun Shen; Brown, Chris M
2017-01-01
Structured RNA elements may control virus replication, transcription and translation, and their distinct features are being exploited by novel antiviral strategies. Viral RNA elements continue to be discovered using combinations of experimental and computational analyses. However, the wealth of sequence data, notably from deep viral RNA sequencing, viromes, and metagenomes, necessitates computational approaches being used as an essential discovery tool. In this review, we describe practical approaches being used to discover functional RNA elements in viral genomes. In addition to success stories in new and emerging viruses, these approaches have revealed some surprising new features of well-studied viruses e.g., human immunodeficiency virus, hepatitis C virus, influenza, and dengue viruses. Some notable discoveries were facilitated by new comparative analyses of diverse viral genome alignments. Importantly, comparative approaches for finding RNA elements embedded in coding and non-coding regions differ. With the exponential growth of computer power we have progressed from stem-loop prediction on single sequences to cutting edge 3D prediction, and from command line to user friendly web interfaces. Despite these advances, many powerful, user friendly prediction tools and resources are underutilized by the virology community.
Lim, Chun Shen; Brown, Chris M.
2018-01-01
Structured RNA elements may control virus replication, transcription and translation, and their distinct features are being exploited by novel antiviral strategies. Viral RNA elements continue to be discovered using combinations of experimental and computational analyses. However, the wealth of sequence data, notably from deep viral RNA sequencing, viromes, and metagenomes, necessitates computational approaches being used as an essential discovery tool. In this review, we describe practical approaches being used to discover functional RNA elements in viral genomes. In addition to success stories in new and emerging viruses, these approaches have revealed some surprising new features of well-studied viruses e.g., human immunodeficiency virus, hepatitis C virus, influenza, and dengue viruses. Some notable discoveries were facilitated by new comparative analyses of diverse viral genome alignments. Importantly, comparative approaches for finding RNA elements embedded in coding and non-coding regions differ. With the exponential growth of computer power we have progressed from stem-loop prediction on single sequences to cutting edge 3D prediction, and from command line to user friendly web interfaces. Despite these advances, many powerful, user friendly prediction tools and resources are underutilized by the virology community. PMID:29354101
Dilley, Kari A; Voorhies, Alexander A; Luthra, Priya; Puri, Vinita; Stockwell, Timothy B; Lorenzi, Hernan; Basler, Christopher F; Shabman, Reed S
2017-01-01
Ebola virus and Marburg virus are members of the Filoviridae family and causative agents of hemorrhagic fever with high fatality rates in humans. Filovirus virulence is partially attributed to the VP35 protein, a well-characterized inhibitor of the RIG-I-like receptor pathway that triggers the antiviral interferon (IFN) response. Prior work demonstrates the ability of VP35 to block potent RIG-I activators, such as Sendai virus (SeV), and this IFN-antagonist activity is directly correlated with its ability to bind RNA. Several structural studies demonstrate that VP35 binds short synthetic dsRNAs; yet, there are no data that identify viral immunostimulatory RNAs (isRNA) or host RNAs bound to VP35 in cells. Utilizing a SeV infection model, we demonstrate that both viral isRNA and host RNAs are bound to Ebola and Marburg VP35s in cells. By deep sequencing the purified VP35-bound RNA, we identified the SeV copy-back defective interfering (DI) RNA, previously identified as a robust RIG-I activator, as the isRNA bound by multiple filovirus VP35 proteins, including the VP35 protein from the West African outbreak strain (Makona EBOV). Moreover, RNAs isolated from a VP35 RNA-binding mutant were not immunostimulatory and did not include the SeV DI RNA. Strikingly, an analysis of host RNAs bound by wild-type, but not mutant, VP35 revealed that select host RNAs are preferentially bound by VP35 in cell culture. Taken together, these data support a model in which VP35 sequesters isRNA in virus-infected cells to avert RIG-I-like receptor (RLR) activation.
Voorhies, Alexander A.; Luthra, Priya; Puri, Vinita; Stockwell, Timothy B.; Lorenzi, Hernan; Basler, Christopher F.; Shabman, Reed S.
2017-01-01
Ebola virus and Marburg virus are members of the Filoviridae family and causative agents of hemorrhagic fever with high fatality rates in humans. Filovirus virulence is partially attributed to the VP35 protein, a well-characterized inhibitor of the RIG-I-like receptor pathway that triggers the antiviral interferon (IFN) response. Prior work demonstrates the ability of VP35 to block potent RIG-I activators, such as Sendai virus (SeV), and this IFN-antagonist activity is directly correlated with its ability to bind RNA. Several structural studies demonstrate that VP35 binds short synthetic dsRNAs; yet, there are no data that identify viral immunostimulatory RNAs (isRNA) or host RNAs bound to VP35 in cells. Utilizing a SeV infection model, we demonstrate that both viral isRNA and host RNAs are bound to Ebola and Marburg VP35s in cells. By deep sequencing the purified VP35-bound RNA, we identified the SeV copy-back defective interfering (DI) RNA, previously identified as a robust RIG-I activator, as the isRNA bound by multiple filovirus VP35 proteins, including the VP35 protein from the West African outbreak strain (Makona EBOV). Moreover, RNAs isolated from a VP35 RNA-binding mutant were not immunostimulatory and did not include the SeV DI RNA. Strikingly, an analysis of host RNAs bound by wild-type, but not mutant, VP35 revealed that select host RNAs are preferentially bound by VP35 in cell culture. Taken together, these data support a model in which VP35 sequesters isRNA in virus-infected cells to avert RIG-I-like receptor (RLR) activation. PMID:28636653
Comparative analysis of the small RNA transcriptomes of Pinus contorta and Oryza sativa
Morin, Ryan D.; Aksay, Gozde; Dolgosheina, Elena; Ebhardt, H. Alexander; Magrini, Vincent; Mardis, Elaine R.; Sahinalp, S. Cenk; Unrau, Peter J.
2008-01-01
The diversity of microRNAs and small-interfering RNAs has been extensively explored within angiosperms by focusing on a few key organisms such as Oryza sativa and Arabidopsis thaliana. A deeper division of the plants is defined by the radiation of the angiosperms and gymnosperms, with the latter comprising the commercially important conifers. The conifers are expected to provide important information regarding the evolution of highly conserved small regulatory RNAs. Deep sequencing provides the means to characterize and quantitatively profile small RNAs in understudied organisms such as these. Pyrosequencing of small RNAs from O. sativa revealed, as expected, ∼21- and ∼24-nt RNAs. The former contained known microRNAs, and the latter largely comprised intergenic-derived sequences likely representing heterochromatin siRNAs. In contrast, sequences from Pinus contorta were dominated by 21-nt small RNAs. Using a novel sequence-based clustering algorithm, we identified sequences belonging to 18 highly conserved microRNA families in P. contorta as well as numerous clusters of conserved small RNAs of unknown function. Using multiple methods, including expressed sequence folding and machine learning algorithms, we found a further 53 candidate novel microRNA families, 51 appearing specific to the P. contorta library. In addition, alignment of small RNA sequences to the O. sativa genome revealed six perfectly conserved classes of small RNA that included chloroplast transcripts and specific types of genomic repeats. The conservation of microRNAs and other small RNAs between the conifers and the angiosperms indicates that important RNA silencing processes were highly developed in the earliest spermatophytes. Genomic mapping of all sequences to the O. sativa genome can be viewed at http://microrna.bcgsc.ca/cgi-bin/gbrowse/rice_build_3/. PMID:18323537
Whole-Genome Characterization of Prunus necrotic ringspot virus Infecting Sweet Cherry in China.
Wang, Jiawei; Zhai, Ying; Zhu, Dongzi; Liu, Weizhen; Pappu, Hanu R; Liu, Qingzhong
2018-03-01
Prunus necrotic ringspot virus (PNRSV) causes yield loss in most cultivated stone fruits, including sweet cherry. Using a small RNA deep-sequencing approach combined with end-genome sequence cloning, we identified the complete genomes of all three PNRSV strands from PNRSV-infected sweet cherry trees and compared them with those of two previously reported isolates. Copyright © 2018 Wang et al.
Deep Sequencing of RNA from Ancient Maize Kernels
Rasmussen, Morten; Cappellini, Enrico; Romero-Navarro, J. Alberto; Wales, Nathan; Alquezar-Planas, David E.; Penfield, Steven; Brown, Terence A.; Vielle-Calzada, Jean-Philippe; Montiel, Rafael; Jørgensen, Tina; Odegaard, Nancy; Jacobs, Michael; Arriaza, Bernardo; Higham, Thomas F. G.; Ramsey, Christopher Bronk; Willerslev, Eske; Gilbert, M. Thomas P.
2013-01-01
The characterization of biomolecules from ancient samples can shed otherwise unobtainable insights into the past. Despite the fundamental role of transcriptomal change in evolution, the potential of ancient RNA remains unexploited – perhaps due to dogma associated with the fragility of RNA. We hypothesize that seeds offer a plausible refuge for long-term RNA survival, due to the fundamental role of RNA during seed germination. Using RNA-Seq on cDNA synthesized from nucleic acid extracts, we validate this hypothesis through demonstration of partial transcriptomal recovery from two sources of ancient maize kernels. The results suggest that ancient seed transcriptomics may offer a powerful new tool with which to study plant domestication. PMID:23326310
Singh, Savita; Zheng, Yun; Jagadeeswaran, Guru; Ebron, Jey Sabith; Sikand, Kavleen; Gupta, Sanjay; Sunker, Ramanjulu; Shukla, Girish C
2016-02-28
Complex epithelial and stromal cell interactions are required during the development and progression of prostate cancer. Regulatory small non-coding microRNAs (miRNAs) participate in the spatiotemporal regulation of messenger RNA (mRNA) and regulation of translation affecting a large number of genes involved in prostate carcinogenesis. In this study, through deep-sequencing of size fractionated small RNA libraries we profiled the miRNAs of prostate epithelial (PrEC) and stromal (PrSC) cells. Over 50 million reads were obtained for PrEC, of which 860,468 were unique sequences. Similarly, nearly 76 million reads were obtained for PrSC, of which over 1 million were unique reads. Many expressed miRNAs from both broadly conserved and poorly conserved miRNA families were identified. Sixteen highly expressed miRNAs whose expression differed significantly between PrSC and PrEC were further analyzed in silico. ConsensusPathDB showed the target genes of these miRNAs were significantly involved in adherens junction, cell adhesion, EGFR, TGF-β and androgen signaling. Expression of the let-7 family of tumor-suppressor miRNAs was highly pervasive in both PrEC and PrSC cells. In addition, we have also identified several miRNAs that are unique to PrEC or PrSC cells and their predicted putative targets are a group of transcription factors. This study provides perspective on the miRNA expression in PrEC and PrSC, and reveals a global trend in miRNA interactome. We conclude that the most abundant miRNAs are potential regulators of development and differentiation of the prostate gland by targeting a set of growth factors. Additionally, high-level expression of most members of the let-7 miRNA family suggests their role in the fine tuning of the growth and proliferation of prostate epithelial and stromal cells. Copyright © 2015 Elsevier Ireland Ltd. All rights reserved.
Eldem, Vahap; Çelikkol Akçay, Ufuk; Ozhuner, Esma; Bakır, Yakup; Uranbey, Serkan; Unver, Turgay
2012-01-01
Peach (Prunus persica L.) is one of the most important fresh fruits worldwide. Since fruit growth largely depends on adequate water supply, drought stress is considered as the most important abiotic stress limiting fleshy fruit production and quality in peach. Plant responses to drought stress are regulated at both the transcriptional and post-transcriptional levels. As post-transcriptional gene regulators, microRNAs (miRNAs) are small (19–25 nucleotides in length), endogenous, non-coding RNAs. Recent studies indicate that miRNAs are involved in plant responses to drought. Therefore, Illumina deep sequencing technology was used for genome-wide identification of miRNAs and their expression profile in response to drought in peach. In this study, four sRNA libraries were constructed from leaf control (LC), leaf stress (LS), root control (RC) and root stress (RS) samples. We identified a total of 531, 471, 535 and 487 known mature miRNAs in LC, LS, RC and RS libraries, respectively. The expression level of 262 (104 up-regulated, 158 down-regulated) of the 453 miRNAs changed significantly in leaf tissue, whereas 368 (221 up-regulated, 147 down-regulated) of the 465 miRNAs had expression levels that changed significantly in root tissue upon drought stress. Additionally, a total of 197, 221, 238 and 265 novel miRNA precursor candidates were identified from LC, LS, RC and RS libraries, respectively. Target transcripts (137 for LC, 133 for LS, 148 for RC and 153 for RS) generated significant Gene Ontology (GO) terms related to DNA binding and catalytic activities. Genome-wide miRNA expression analysis of peach by a deep sequencing approach helped to expand our understanding of miRNA function in response to drought stress in peach and Rosaceae. A set of differentially expressed miRNAs could pave the way for developing new strategies to alleviate the adverse effects of drought stress on plant growth and development. PMID:23227166
Subburaj, Saminathan; Chung, Sung Jin; Lee, Choongil; Ryu, Seuk-Min; Kim, Duk Hyoung; Kim, Jin-Soo; Bae, Sangsu; Lee, Geung-Joo
2016-07-01
Site-directed mutagenesis of nitrate reductase genes using direct delivery of purified Cas9 protein preassembled with guide RNA produces mutations efficiently in Petunia × hybrida protoplast system. The clustered, regularly interspaced, short palindromic repeat (CRISPR)-CRISPR associated endonuclease 9 (CRISPR/Cas9) system has been recently announced as a powerful molecular breeding tool for site-directed mutagenesis in higher plants. Here, we report a site-directed mutagenesis method targeting Petunia nitrate reductase (NR) gene locus. This method could create mutations efficiently using direct delivery of purified Cas9 protein and single guide RNA (sgRNA) into protoplast cells. After transient introduction of RNA-guided endonuclease (RGEN) ribonucleoproteins (RNPs) with different sgRNAs targeting NR genes, mutagenesis at the targeted loci was detected by T7E1 assay and confirmed by targeted deep sequencing. T7E1 assay showed that RGEN RNPs induced site-specific mutations at frequencies ranging from 2.4 to 21 % at four different sites (NR1, 2, 4 and 6) in the PhNR gene locus with average mutation efficiency of 14.9 ± 2.2 %. Targeted deep DNA sequencing revealed mutation rates of 5.3-17.8 % with average mutation rate of 11.5 ± 2 % at the same NR gene target sites in DNA fragments of analyzed protoplast transfectants. Further analysis from targeted deep sequencing showed that the average ratio of deletion to insertion produced collectively by the four NR-RGEN target sites (NR1, 2, 4, and 6) was about 63:37. Our results demonstrated that direct delivery of RGEN RNPs into protoplast cells of Petunia can be exploited as an efficient tool for site-directed mutagenesis of genes or genome editing in plant systems.
Åsman, Anna K M; Vetukuri, Ramesh R; Jahan, Sultana N; Fogelqvist, Johan; Corcoran, Pádraic; Avrova, Anna O; Whisson, Stephen C; Dixelius, Christina
2014-12-10
The oomycete Phytophthora infestans possesses active RNA silencing pathways, which presumably enable this plant pathogen to control the large numbers of transposable elements present in its 240 Mb genome. Small RNAs (sRNAs), central molecules in RNA silencing, are known to also play key roles in this organism, notably in regulation of critical effector genes needed for infection of its potato host. To identify additional classes of sRNAs in oomycetes, we mapped deep sequencing reads to transfer RNAs (tRNAs) thereby revealing the presence of 19-40 nt tRNA-derived RNA fragments (tRFs). Northern blot analysis identified abundant tRFs corresponding to half tRNA molecules. Some tRFs accumulated differentially during infection, as seen by examining sRNAs sequenced from P. infestans-potato interaction libraries. The putative connection between tRF biogenesis and the canonical RNA silencing pathways was investigated by employing hairpin RNA-mediated RNAi to silence the genes encoding P. infestans Argonaute (PiAgo) and Dicer (PiDcl) endoribonucleases. By sRNA sequencing we show that tRF accumulation is PiDcl1-independent, while Northern hybridizations detected reduced levels of specific tRNA-derived species in the PiAgo1 knockdown line. Our findings extend the sRNA diversity in oomycetes to include fragments derived from non-protein-coding RNA transcripts and identify tRFs with elevated levels during infection of potato by P. infestans.
Fingerprints of Modified RNA Bases from Deep Sequencing Profiles.
Kietrys, Anna M; Velema, Willem A; Kool, Eric T
2017-11-29
Posttranscriptional modifications of RNA bases are not only found in many noncoding RNAs but have also recently been identified in coding (messenger) RNAs as well. They require complex and laborious methods to locate, and many still lack methods for localized detection. Here we test the ability of next-generation sequencing (NGS) to detect and distinguish between ten modified bases in synthetic RNAs. We compare ultradeep sequencing patterns of modified bases, including miscoding, insertions and deletions (indels), and truncations, to unmodified bases in the same contexts. The data show widely varied responses to modification, ranging from no response, to high levels of mutations, insertions, deletions, and truncations. The patterns are distinct for several of the modifications, and suggest the future use of ultradeep sequencing as a fingerprinting strategy for locating and identifying modifications in cellular RNAs.
Sachsenröder, Jana; Twardziok, Sven; Hammerl, Jens A; Janczyk, Pawel; Wrede, Paul; Hertwig, Stefan; Johne, Reimar
2012-01-01
Animal faeces comprise a community of many different microorganisms including bacteria and viruses. Only scarce information is available about the diversity of viruses present in the faeces of pigs. Here we describe a protocol, which was optimized for the purification of the total fraction of viral particles from pig faeces. The genomes of the purified DNA and RNA viruses were simultaneously amplified by PCR and subjected to deep sequencing followed by bioinformatic analyses. The efficiency of the method was monitored using a process control consisting of three bacteriophages (T4, M13 and MS2) with different morphology and genome types. Defined amounts of the bacteriophages were added to the sample and their abundance was assessed by quantitative PCR during the preparation procedure. The procedure was applied to a pooled faecal sample of five pigs. From this sample, 69,613 sequence reads were generated. All of the added bacteriophages were identified by sequence analysis of the reads. In total, 7.7% of the reads showed significant sequence identities with published viral sequences. They mainly originated from bacteriophages (73.9%) and mammalian viruses (23.9%); 0.8% of the sequences showed identities to plant viruses. The most abundant detected porcine viruses were kobuvirus, rotavirus C, astrovirus, enterovirus B, sapovirus and picobirnavirus. In addition, sequences with identities to the chimpanzee stool-associated circular ssDNA virus were identified. Whole genome analysis indicates that this virus, tentatively designated as pig stool-associated circular ssDNA virus (PigSCV), represents a novel pig virus. The established protocol enables the simultaneous detection of DNA and RNA viruses in pig faeces including the identification of so far unknown viruses. It may be applied in studies investigating aetiology, epidemiology and ecology of diseases. 
The implemented process control serves as quality control, ensures comparability of the method and may be used for further method optimization.
Uncovering microRNA-mediated response to SO2 stress in Arabidopsis thaliana by deep sequencing.
Li, Lihong; Xue, Meizhao; Yi, Huilan
2016-10-05
Sulfur dioxide (SO2) is a major air pollutant and has significant impacts on plants. MicroRNAs (miRNAs) are a class of gene expression regulators that play important roles in response to environmental stresses. In this study, deep sequencing was used for genome-wide identification of miRNAs and their expression profiles in response to SO2 stress in Arabidopsis thaliana shoots. A total of 27 conserved miRNAs and 5 novel miRNAs were found to be differentially expressed under SO2 stress. qRT-PCR analysis showed mostly negative correlation between miRNA accumulation and target gene mRNA abundance, suggesting regulatory roles of these miRNAs during SO2 exposure. The target genes of SO2-responsive miRNAs encode transcription factors and proteins that regulate auxin signaling and stress response, and the miRNA-mediated suppression of these genes could improve plant resistance to SO2 stress. Promoter sequence analysis of genes encoding SO2-responsive miRNAs showed that stress-responsive and phytohormone-related cis-regulatory elements occurred frequently, providing additional evidence of the involvement of miRNAs in adaptation to SO2 stress. This study represents a comprehensive expression profiling of SO2-responsive miRNAs in Arabidopsis and broadens our perspective on the ubiquitous regulatory roles of miRNAs under stress conditions. Copyright © 2016 Elsevier B.V. All rights reserved.
Oral Microbiome of Deep and Shallow Dental Pockets In Chronic Periodontitis
Ge, Xiuchun; Rodriguez, Rafael; Trinh, My; Gunsolley, John; Xu, Ping
2013-01-01
We examined the subgingival bacterial biodiversity in untreated chronic periodontitis patients by sequencing 16S rRNA genes. The primary purpose of the study was to compare the oral microbiome in deep (diseased) and shallow (healthy) sites. A secondary purpose was to evaluate the influences of smoking, race and dental caries on this relationship. A total of 88 subjects from two clinics were recruited. Paired subgingival plaque samples were taken from each subject, one from a probing site depth >5 mm (deep site) and the other from a probing site depth ≤3 mm (shallow site). A universal primer set was designed to amplify the V4–V6 region for oral microbial 16S rRNA sequences. Differences in genera and species attributable to deep and shallow sites were determined by statistical analysis using a two-part model and false discovery rate. Fifty-one of 170 genera and 200 of 746 species differed significantly in abundance between shallow and deep sites. Besides previously identified periodontal disease-associated bacterial species, additional species were found markedly changed in diseased sites. Cluster analysis revealed that the microbiome difference between deep and shallow sites was influenced by patient-level effects such as clinic location, race and smoking. The differences between clinic locations may be influenced by racial distribution, in that all of the African American subjects were seen at the same clinic. Our results suggest that the microbiome influences both caries and periodontal disease and that these influences are independent. PMID:23762384
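The false discovery rate step mentioned above can be illustrated with a short sketch. The abstract does not name the procedure, so the Benjamini-Hochberg step-up method used here is an assumption, as are the function name and the 0.05 threshold. The p-values would come from whatever per-taxon test is applied, such as the two-part model.

```python
def benjamini_hochberg(pvalues, alpha=0.05):
    """Return one True/False flag per p-value: True means the taxon is
    called significant at false discovery rate `alpha`.

    Step-up rule: find the largest rank k (1-based, ascending p-values)
    with p_(k) <= alpha * k / m, then reject the k smallest p-values.
    """
    m = len(pvalues)
    order = sorted(range(m), key=lambda i: pvalues[i])
    k = 0
    for rank, idx in enumerate(order, start=1):
        if pvalues[idx] <= alpha * rank / m:
            k = rank
    rejected = [False] * m
    for idx in order[:k]:
        rejected[idx] = True
    return rejected
```

With hundreds of species tested at once, as here, a correction of this kind keeps the expected share of false positives among the reported taxa near the chosen level.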
High-Throughput Sequencing of RNA Silencing-Associated Small RNAs in Olive (Olea europaea L.)
Donaire, Livia; Pedrola, Laia; de la Rosa, Raúl; Llave, César
2011-01-01
Small RNAs (sRNAs) of 20 to 25 nucleotides (nt) in length maintain genome integrity and control gene expression in a multitude of developmental and physiological processes. Although RNA silencing has been primarily studied in model plants, the advent of high-throughput sequencing technologies has enabled profiling of the sRNA component of more than 40 plant species. Here, we used deep sequencing and molecular methods to report the first inventory of sRNAs in olive (Olea europaea L.). sRNA libraries prepared from juvenile and adult shoots revealed that the 24-nt class dominates the sRNA transcriptome and atypically accumulates to levels never seen in other plant species, suggesting an active role of heterochromatin silencing in the maintenance and integrity of its large genome. A total of 18 known miRNA families were identified in the libraries. In addition, 5 other sRNAs derived from potential hairpin-like precursors remain plausible miRNA candidates. RNA blots confirmed miRNA expression and suggested tissue- and/or developmental-specific expression patterns. Target mRNAs of conserved miRNAs were computationally predicted among the olive cDNA collection and experimentally validated through endonucleolytic cleavage assays. Finally, we used expression data to uncover genetic components of the miR156, miR172 and miR390/TAS3-derived trans-acting small interfering RNA (tasiRNA) regulatory nodes, suggesting that these interactive networks controlling developmental transitions are fully operational in olive. PMID:22140484
Xiong, Changyan; Li, Xuejiao; Liu, Juanli; Zhao, Xin; Xu, Shungao; Huang, Xinxiang
2018-01-01
Antisense RNAs from complementary strands of protein coding genes regulate the expression of genes involved in many cellular processes. Using deep sequencing analysis of the Salmonella enterica serovar Typhi (S. Typhi) transcriptome, a novel antisense RNA encoded on the strand complementary to the rpoH gene was revealed. In this study, the molecular features of this antisense RNA were assessed using northern blotting and rapid amplification of cDNA ends. The 3,508 nt sequence of RNA was identified as the antisense RNA of the rpoH gene and was named ArpH. ArpH was found to attenuate the invasion of HeLa cells by S. Typhi by regulating the expression of SPI-1 genes. In an rpoH mutant strain, the invasive capacity of S. Typhi was increased, whereas overexpression of ArpH positively regulates rpoH mRNA levels. Results of this study suggest that the cis-encoded antisense RNA ArpH is likely to affect the invasive capacity of S. Typhi by regulating the expression of rpoH.
Dadzie, Isaac; Xu, Shungao; Ni, Bin; Zhang, Xiaolei; Zhang, Haifang; Sheng, Xiumei; Xu, Huaxi; Huang, Xinxiang
2013-01-01
Antisense RNAs that originate from the complementary strand of protein coding genes are involved in the regulation of gene expression in all domains of life. In bacteria, some of these antisense RNAs are transcriptional noise, whereas others play a vital role in adapting the cell to changing environmental conditions. By deep sequencing analysis of the transcriptome of Salmonella enterica serovar Typhi, a partial RNA sequence encoded in-cis to the dnaA gene was revealed. Northern blot and RACE analysis confirmed the transcription of this antisense RNA, which was expressed mostly in the stationary phase of bacterial growth and also under iron limitation and osmotic stress. Pulse expression analysis showed that overexpression of the antisense RNA resulted in a significant increase in the mRNA levels of dnaA, which will ultimately enhance their translation. Our findings reveal that the antisense RNA of dnaA is transcribed not merely as a by-product of the cell's transcription machinery but plays a vital role in the stability of dnaA mRNA. PMID:23637809
HIV-1 RNAs are Not Part of the Argonaute 2 Associated RNA Interference Pathway in Macrophages.
Vongrad, Valentina; Imig, Jochen; Mohammadi, Pejman; Kishore, Shivendra; Jaskiewicz, Lukasz; Hall, Jonathan; Günthard, Huldrych F; Beerenwinkel, Niko; Metzner, Karin J
2015-01-01
MiRNAs and other small noncoding RNAs (sncRNAs) are key players in post-transcriptional gene regulation. HIV-1-derived sncRNAs have been described in HIV-1 infected cells, but their biological functions remain to be elucidated. Here, we asked whether viral sncRNAs play a role in the RNA interference (RNAi) pathway or whether viral mRNAs are targeted by cellular miRNAs in human monocyte-derived macrophages (MDMs). The incorporation of viral sncRNAs and/or their target RNAs into the RNA-induced silencing complex (RISC) was investigated using photoactivatable ribonucleoside-induced cross-linking and immunoprecipitation (PAR-CLIP) as well as high-throughput sequencing of RNA isolated by cross-linking immunoprecipitation (HITS-CLIP), which capture Argonaute2-bound miRNAs and their target RNAs. HIV-1 infected MDMs were chosen as target cells, as they have previously been shown to express HIV-1 sncRNAs. In addition, we applied small RNA deep sequencing to study differential cellular miRNA expression in HIV-1 infected versus non-infected MDMs. PAR-CLIP and HITS-CLIP data demonstrated the absence of HIV-1 RNAs in Ago2-RISC, although the presence of a multitude of HIV-1 sncRNAs in HIV-1 infected MDMs was confirmed by small RNA sequencing. Small RNA sequencing revealed that 1.4% of all sncRNAs were of HIV-1 origin. However, neither HIV-1-derived sncRNAs nor putative HIV-1 target sequences incorporated into Ago2-RISC were identified, suggesting that HIV-1 sncRNAs are not involved in the canonical RNAi pathway and that HIV-1 is not targeted by this pathway in HIV-1 infected macrophages.
Analysis of miRNA expression profiles in melatonin-exposed GC-1 spg cell line.
Zhu, Xiaoling; Chen, Shuxiong; Jiang, Yanwen; Xu, Ying; Zhao, Yun; Chen, Lu; Li, Chunjin; Zhou, Xu
2018-02-05
Melatonin is an endocrine neurohormone secreted by pinealocytes in the pineal gland. It exerts diverse physiological effects, such as circadian rhythm regulation and antioxidant activity. However, the functional importance of melatonin in spermatogenesis regulation remains unclear. The objectives of this study were to: (1) detect the effect of melatonin on miRNA expression profiles in GC-1 spg cells by miRNA deep sequencing (DeepSeq) and (2) define melatonin-affected miRNA-mRNA interactions and associated biological processes using bioinformatics analysis. GC-1 spg cells were cultured with melatonin (10⁻⁷ M) for 24 h. DeepSeq data were validated using quantitative real-time reverse transcription polymerase chain reaction analysis (qRT-PCR). A total of 176 miRNAs were found to be significantly differentially expressed between the two groups (fold change of >2 or <0.5 and FDR<0.05). Among these, 171 were up-regulated, and 5 were down-regulated. Ontology analysis of biological processes of the predicted targets indicated a variety of biological functions. Pathway analysis indicated that the predicted targets were involved in cancers, apoptosis and signaling pathways, such as VEGF, TNF, Ras and Notch. The results indicated that melatonin could regulate the expression of miRNAs to exert its physiological effects in GC-1 spg cells. These results should be useful for investigating the biological function of miRNAs regulated by melatonin in spermatogenesis and testicular germ cell tumors. Copyright © 2017 Elsevier B.V. All rights reserved.
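The selection criteria quoted in the abstract above (fold change >2 or <0.5 and FDR<0.05) can be sketched as a simple filter. This is only an illustration of the thresholds, not the authors' pipeline: the function name, the record layout and the example miRNA values are hypothetical, and the fold change is assumed to be expressed as treated over control.

```python
def classify_mirna(fold_change, fdr, fc_up=2.0, fc_down=0.5, fdr_cutoff=0.05):
    """Label one miRNA as 'up', 'down' or 'ns' (not significant).

    Assumes fold_change = treated / control and that fdr is an
    already-adjusted p-value; both conventions are assumptions.
    """
    if fdr >= fdr_cutoff:
        return "ns"
    if fold_change > fc_up:
        return "up"
    if fold_change < fc_down:
        return "down"
    return "ns"


# Hypothetical records: (miRNA name, fold change, FDR)
records = [
    ("miR-example-1", 3.1, 0.01),
    ("miR-example-2", 0.4, 0.03),
    ("miR-example-3", 1.5, 0.001),
    ("miR-example-4", 4.0, 0.20),
]

calls = {name: classify_mirna(fc, fdr) for name, fc, fdr in records}
print(calls)
```

Both conditions have to hold at once: a large fold change with a high FDR, or a low FDR with a modest fold change, is reported as not significant.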
Genetic characterization of novel putative rhabdovirus and dsRNA virus from Japanese persimmon.
Ito, Takao; Suzaki, Koichi; Nakano, Masaaki
2013-08-01
Deep-sequencing analysis of nucleic acids from leaf tissue of Japanese persimmon trees exhibiting fruit apex disorder in some fruits detected two molecules that were graft transmitted to healthy seedlings. One of the complete genomes consisted of 13 467 nt and encoded six genes similar to those of plant rhabdoviruses. The virus formed a distinct cluster in the genus Cytorhabdovirus with lettuce necrotic yellows virus, lettuce yellow mottle virus and strawberry crinkle virus in a phylogenetic tree based on the L protein (RNA-dependent RNA polymerase, RdRp). The other consisted of 7475 nt and shared a genome organization similar to those of some insect and fungal viruses having dsRNA genomes. In a phylogenetic tree using the RdRp sequence of several unassigned dsRNA viruses, the virus formed a possible new genus cluster with two insect viruses, Circulifer tenellus virus 1 and Spissistilus festinus virus 1, and one plant virus, cucurbit yellows-associated virus.
Staufen1 senses overall transcript secondary structure to regulate translation
Ricci, Emiliano P; Kucukural, Alper; Cenik, Can; Mercier, Blandine C; Singh, Guramrit; Heyer, Erin E; Ashar-Patel, Ami; Peng, Lingtao; Moore, Melissa J
2015-01-01
Human Staufen1 (Stau1) is a double-stranded RNA (dsRNA)-binding protein implicated in multiple post-transcriptional gene-regulatory processes. Here we combined RNA immunoprecipitation in tandem (RIPiT) with RNase footprinting, formaldehyde cross-linking, sonication-mediated RNA fragmentation and deep sequencing to map Staufen1-binding sites transcriptome wide. We find that Stau1 binds complex secondary structures containing multiple short helices, many of which are formed by inverted Alu elements in annotated 3′ untranslated regions (UTRs) or in ‘strongly distal’ 3′ UTRs. Stau1 also interacts with actively translating ribosomes and with mRNA coding sequences (CDSs) and 3′ UTRs in proportion to their GC content and propensity to form internal secondary structure. On mRNAs with high CDS GC content, higher Stau1 levels lead to greater ribosome densities, thus suggesting a general role for Stau1 in modulating translation elongation through structured CDS regions. Our results also indicate that Stau1 regulates translation of transcription-regulatory proteins. PMID:24336223
High-throughput determination of RNA structure by proximity ligation.
Ramani, Vijay; Qiu, Ruolan; Shendure, Jay
2015-09-01
We present an unbiased method to globally resolve RNA structures through pairwise contact measurements between interacting regions. RNA proximity ligation (RPL) uses proximity ligation of native RNA followed by deep sequencing to yield chimeric reads with ligation junctions in the vicinity of structurally proximate bases. We apply RPL in both baker's yeast (Saccharomyces cerevisiae) and human cells and generate contact probability maps for ribosomal and other abundant RNAs, including yeast snoRNAs, the RNA subunit of the signal recognition particle and the yeast U2 spliceosomal RNA homolog. RPL measurements correlate with established secondary structures for these RNA molecules, including stem-loop structures and long-range pseudoknots. We anticipate that RPL will complement the current repertoire of computational and experimental approaches in enabling the high-throughput determination of secondary and tertiary RNA structures.
Wang, Ruijia; Nambiar, Ram; Zheng, Dinghai
2018-01-01
PolyA_DB is a database cataloging cleavage and polyadenylation sites (PASs) in several genomes. Previous versions were based mainly on expressed sequence tags (ESTs), which were limited in number and could lead to inaccurate PAS identification due to the presence of internal A-rich sequences in transcripts. Here, we present an updated version of the database based solely on deep sequencing data. First, PASs are mapped by the 3′ region extraction and deep sequencing (3′READS) method, ensuring unequivocal PAS identification. Second, a large volume of data based on diverse biological samples increases PAS coverage by 3.5-fold over the EST-based version and provides PAS usage information. Third, strand-specific RNA-seq data are used to extend annotated 3′ ends of genes to obtain more thorough annotations of alternative polyadenylation (APA) sites. Fourth, conservation information of PASs across mammals sheds light on the significance of APA sites. The database (URL: http://www.polya-db.org/v3) currently holds PASs in human, mouse, rat and chicken, and has links to the UCSC genome browser for further visualization and for integration with other genomic data. PMID:29069441
Using Poisson mixed-effects model to quantify transcript-level gene expression in RNA-Seq.
Hu, Ming; Zhu, Yu; Taylor, Jeremy M G; Liu, Jun S; Qin, Zhaohui S
2012-01-01
RNA sequencing (RNA-Seq) is a powerful new technology for mapping and quantifying transcriptomes using ultra high-throughput next-generation sequencing technologies. Using deep sequencing, gene expression levels of all transcripts including novel ones can be quantified digitally. Although extremely promising, the massive amounts of data generated by RNA-Seq, substantial biases and uncertainty in short read alignment pose challenges for data analysis. In particular, large base-specific variation and between-base dependence make simple approaches, such as those that use averaging to normalize RNA-Seq data and quantify gene expressions, ineffective. In this study, we propose a Poisson mixed-effects (POME) model to characterize base-level read coverage within each transcript. The underlying expression level is included as a key parameter in this model. Since the proposed model is capable of incorporating base-specific variation as well as between-base dependence that affect read coverage profile throughout the transcript, it can lead to improved quantification of the true underlying expression level. POME can be freely downloaded at http://www.stat.purdue.edu/~yuzhu/pome.html. Contact: yuzhu@purdue.edu; zhaohui.qin@emory.edu. Supplementary data are available at Bioinformatics online.
Chemoresistance Evolution in Triple-Negative Breast Cancer Delineated by Single-Cell Sequencing.
Kim, Charissa; Gao, Ruli; Sei, Emi; Brandt, Rachel; Hartman, Johan; Hatschek, Thomas; Crosetto, Nicola; Foukakis, Theodoros; Navin, Nicholas E
2018-05-03
Triple-negative breast cancer (TNBC) is an aggressive subtype that frequently develops resistance to chemotherapy. An unresolved question is whether resistance is caused by the selection of rare pre-existing clones or alternatively through the acquisition of new genomic aberrations. To investigate this question, we applied single-cell DNA and RNA sequencing in addition to bulk exome sequencing to profile longitudinal samples from 20 TNBC patients during neoadjuvant chemotherapy (NAC). Deep-exome sequencing identified 10 patients in which NAC led to clonal extinction and 10 patients in which clones persisted after treatment. In 8 patients, we performed a more detailed study using single-cell DNA sequencing to analyze 900 cells and single-cell RNA sequencing to analyze 6,862 cells. Our data showed that resistant genotypes were pre-existing and adaptively selected by NAC, while transcriptional profiles were acquired by reprogramming in response to chemotherapy in TNBC patients. Copyright © 2018 Elsevier Inc. All rights reserved.
tRNA-Derived Small RNA: A Novel Regulatory Small Non-Coding RNA.
Li, Siqi; Xu, Zhengping; Sheng, Jinghao
2018-05-10
Deep analysis of next-generation sequencing data unveils numerous small non-coding RNAs with distinct functions. Recently, fragments derived from tRNA, named as tRNA-derived small RNA (tsRNA), have attracted broad attention. There are mainly two types of tsRNAs, including tRNA-derived stress-induced RNA (tiRNA) and tRNA-derived fragment (tRF), which differ in the cleavage position of the precursor or mature tRNA transcript. Emerging evidence has shown that tsRNAs are not merely tRNA degradation debris but have been recognized to play regulatory roles in many specific physiological and pathological processes. In this review, we summarize the biogeneses of various tsRNAs, present the emerging concepts regarding functions and mechanisms of action of tsRNAs, highlight the potential application of tsRNAs in human diseases, and put forward the current problems and future research directions.
MicroRNA-like RNAs from the same miRNA precursors play a role in cassava chilling responses.
Zeng, Changying; Xia, Jing; Chen, Xin; Zhou, Yufei; Peng, Ming; Zhang, Weixiong
2017-12-07
MicroRNAs (miRNAs) are known to play important roles in various cellular processes and stress responses. MiRNAs can be identified by analyzing reads from high-throughput deep sequencing. Reads that align to miRNA precursors but do not correspond to canonical miRNAs were initially considered sequencing noise and excluded from further analysis. Here we report a small-RNA species of phased and half-phased miRNA-like RNAs, different from canonical miRNAs, derived from cassava miRNA precursors and detected under four distinct chilling conditions. They can form abundant multiple small RNAs arranged along precursors in a tandem and phased or half-phased fashion. Some of these miRNA-like RNAs were experimentally confirmed by re-amplification and re-sequencing, and have a similar qRT-PCR detection ratio to their cognate canonical miRNAs. The target genes of those phased and half-phased miRNA-like RNAs function in cell growth and metabolism and play roles in protein kinase activity. Half-phased miR171d.3 was confirmed to have cleavage activity on its target gene P-glycoprotein 11, a broad substrate efflux pump across cellular membranes, which is thought to provide protection for tropical cassava during a sharp temperature decrease. Our results show that the RNAs from miRNA precursors are miRNA-like small RNAs that are viable negative gene regulators and may have potential functions in cassava chilling responses.
Recurrent chimeric RNAs enriched in human prostate cancer identified by deep sequencing
Kannan, Kalpana; Wang, Liguo; Wang, Jianghua; Ittmann, Michael M.; Li, Wei; Yen, Laising
2011-01-01
Transcription-induced chimeric RNAs, possessing sequences from different genes, are expected to increase the proteomic diversity through chimeric proteins or altered regulation. Despite their importance, few studies have focused on chimeric RNAs, especially regarding their presence and roles in human cancers. By deep sequencing the transcriptome of 20 human prostate cancer and 10 matched benign prostate tissues, we obtained 1.3 billion sequence reads, which led to the identification of 2,369 chimeric RNA candidates. Chimeric RNAs occurred at significantly higher frequency in cancer than in matched benign samples. Experimental investigation of a selected set of 46 candidates led to the confirmation of 32 chimeric RNAs, of which 27 were highly recurrent and previously undescribed in prostate cancer. Importantly, a subset of these chimeras was present in prostate cancer cell lines, but not detectable in primary human prostate epithelium cells, implying their associations with cancer. These chimeras contain discernible 5′ and 3′ splice sites at the RNA junction, indicating that their formation is mediated by splicing. Their presence is also largely independent of the expression of parental genes, suggesting that other factors are involved in their production and regulation. One chimera, TMEM79-SMG5, is highly differentially expressed in human cancer samples and is therefore a potential biomarker. The prevalence of chimeric RNAs may allow the limited number of human genes to encode a substantially larger number of RNAs and proteins, forming an additional layer of cellular complexity. Together, our results suggest that chimeric RNAs are widespread, and increased chimeric RNA events could represent a unique class of molecular alteration in cancer. PMID:21571633
Speth, Daan R; Lagkouvardos, Ilias; Wang, Yong; Qian, Pei-Yuan; Dutilh, Bas E; Jetten, Mike S M
2017-07-01
Several recent studies have indicated that members of the phylum Planctomycetes are abundantly present at the brine-seawater interface (BSI) above multiple brine pools in the Red Sea. Planctomycetes include bacteria capable of anaerobic ammonium oxidation (anammox). Here, we investigated the possibility of anammox at BSI sites using metagenomic shotgun sequencing of DNA obtained from the BSI above the Discovery Deep brine pool. Analysis of sequencing reads matching the 16S rRNA and hzsA genes confirmed presence of anammox bacteria of the genus Scalindua. Phylogenetic analysis of the 16S rRNA gene indicated that this Scalindua sp. belongs to a distinct group, separate from the anammox bacteria in the seawater column, that contains mostly sequences retrieved from high-salt environments. Using coverage- and composition-based binning, we extracted and assembled the draft genome of the dominant anammox bacterium. Comparative genomic analysis indicated that this Scalindua species uses compatible solutes for osmoadaptation, in contrast to other marine anammox bacteria that likely use a salt-in strategy. We propose the name Candidatus Scalindua rubra for this novel species, alluding to its discovery in the Red Sea.
Short intronic repeat sequences facilitate circular RNA production.
Liang, Dongming; Wilusz, Jeremy E
2014-10-15
Recent deep sequencing studies have revealed thousands of circular noncoding RNAs generated from protein-coding genes. These RNAs are produced when the precursor messenger RNA (pre-mRNA) splicing machinery "backsplices" and covalently joins, for example, the two ends of a single exon. However, the mechanism by which the spliceosome selects only certain exons to circularize is largely unknown. Using extensive mutagenesis of expression plasmids, we show that miniature introns containing the splice sites along with short (∼ 30- to 40-nucleotide) inverted repeats, such as Alu elements, are sufficient to allow the intervening exons to circularize in cells. The intronic repeats must base-pair to one another, thereby bringing the splice sites into close proximity to each other. More than simple thermodynamics is clearly at play, however, as not all repeats support circularization, and increasing the stability of the hairpin between the repeats can sometimes inhibit circular RNA biogenesis. The intronic repeats and exonic sequences must collaborate with one another, and a functional 3' end processing signal is required, suggesting that circularization may occur post-transcriptionally. These results suggest detailed and generalizable models that explain how the splicing machinery determines whether to produce a circular noncoding RNA or a linear mRNA. © 2014 Liang and Wilusz; Published by Cold Spring Harbor Laboratory Press.
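The requirement that the two intronic repeats base-pair with one another can be illustrated with a small reverse-complement check. This is a minimal sketch under stated assumptions, not the authors' analysis: the function names and example sequences are hypothetical, sequences are written in the DNA alphabet (A, C, G, T), only Watson-Crick pairs are counted (no G-U wobble, no gaps), and both repeats are assumed to have the same length. As the abstract notes, pairing alone does not guarantee circularization.

```python
# Translation table for Watson-Crick complements in the DNA alphabet.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def paired_fraction(upstream, downstream):
    """Fraction of positions at which two equal-length repeats can pair.

    Both sequences are given 5'->3'. When the RNA folds back, base i of
    the upstream repeat faces base (n - 1 - i) of the downstream repeat,
    so we compare the upstream repeat with the reverse complement of the
    downstream one.
    """
    rc = reverse_complement(downstream)
    n = min(len(upstream), len(rc))
    if n == 0:
        return 0.0
    matches = sum(a == b for a, b in zip(upstream.upper(), rc))
    return matches / n


# Hypothetical 10-nt repeats flanking an exon.
upstream_repeat = "GGATCCAAGT"
downstream_repeat = reverse_complement(upstream_repeat)  # perfect inverted repeat
print(downstream_repeat, paired_fraction(upstream_repeat, downstream_repeat))
```

A score near 1.0 marks a candidate inverted repeat pair; a direct (non-inverted) repeat of the same sequence generally scores much lower.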
ASR5 is involved in the regulation of miRNA expression in rice.
Neto, Lauro Bücker; Arenhart, Rafael Augusto; de Oliveira, Luiz Felipe Valter; de Lima, Júlio Cesar; Bodanese-Zanettini, Maria Helena; Margis, Rogerio; Margis-Pinheiro, Márcia
2015-11-01
The work describes an ASR knockdown transcriptomic analysis by deep sequencing of rice root seedlings and the transactivation of ASR cis-acting elements in the upstream region of a MIR gene. MicroRNAs are key regulators of gene expression that guide post-transcriptional control of plant development and responses to environmental stresses. ASR (ABA, Stress and Ripening) proteins are plant-specific transcription factors with key roles in different biological processes. In rice, ASR proteins have been suggested to participate in the regulation of stress response genes. This work describes the transcriptomic analysis by deep sequencing two libraries, comparing miRNA abundance from the roots of transgenic ASR5 knockdown rice seedlings with that of the roots of wild-type non-transformed rice seedlings. Members of 59 miRNA families were detected, and 276 mature miRNAs were identified. Our analysis detected 112 miRNAs that were differentially expressed between the two libraries. A predicted inverse correlation between miR167abc and its target gene (LOC_Os07g29820) was confirmed using RT-qPCR. Protoplast transactivation assays showed that ASR5 is able to recognize binding sites upstream of the MIR167a gene and drive its expression in vivo. Together, our data establish a comparative study of miRNAome profiles and is the first study to suggest the involvement of ASR proteins in miRNA gene regulation.
Microbial Characterization of Qatari Barchan Sand Dunes
Chatziefthimiou, Aspassia D.; Nguyen, Hanh; Richer, Renee; Louge, Michel; Sultan, Ali A.; Schloss, Patrick; Hay, Anthony G.
2016-01-01
This study represents the first characterization of sand microbiota in migrating barchan sand dunes. Bacterial communities were studied through direct counts and cultivation, as well as 16S rRNA gene and metagenomic sequence analysis to gain an understanding of microbial abundance, diversity, and potential metabolic capabilities. Direct on-grain cell counts gave an average of 5.3 ± 0.4 × 10⁵ cells g⁻¹ of sand. Cultured isolates (N = 64) selected for 16S rRNA gene sequencing belonged to the phyla Actinobacteria (58%), Firmicutes (27%) and Proteobacteria (15%). Deep-sequencing of 16S rRNA gene amplicons from 18 dunes demonstrated a high relative abundance of Proteobacteria, particularly enteric bacteria, and a dune-specific pattern of bacterial community composition that correlated with dune size. Shotgun metagenome sequences of two representative dunes were analyzed and found to have similar relative bacterial abundance, though the relative abundances of eukaryotic, viral and enterobacterial sequences were greater in sand from the dune closer to a camel-pen. Functional analysis revealed patterns similar to those observed in desert soils; however, the increased relative abundance of genes encoding sporulation and dormancy is consistent with the dune microbiome being well-adapted to the exceptionally hyper-arid Qatari desert. PMID:27655399
smRNAome profiling to identify conserved and novel microRNAs in Stevia rebaudiana Bertoni.
Mandhan, Vibha; Kaur, Jagdeep; Singh, Kashmir
2012-11-01
MicroRNAs (miRNAs) constitute a family of small RNAs (sRNAs) that regulate gene expression and play an important role in plant development, metabolism, signal transduction and stress response. Extensive studies on miRNAs have been performed in different plants such as Arabidopsis thaliana and Oryza sativa, and the volume of the miRNA database, miRBase, has been increasing steadily. Stevia rebaudiana Bertoni is an important perennial herb which accumulates high concentrations of diterpene steviol glycosides, which contribute to its high indexed sweetening property with no calorific value. Several studies have been carried out to understand the molecular mechanism involved in the biosynthesis of these glycosides; however, information about miRNAs has been lacking in S. rebaudiana. Deep sequencing of small RNAs combined with transcriptomic data is a powerful tool for identifying conserved and novel miRNAs irrespective of the availability of genome sequence data. To identify miRNAs in S. rebaudiana, an sRNA library was constructed and sequenced using the Illumina Genome Analyzer II. A total of 30,472,534 reads representing 2,509,190 distinct sequences were obtained from the sRNA library. Based on sequence similarity, we identified 100 miRNAs belonging to 34 highly conserved families. We also identified 12 novel miRNAs whose precursors were potentially generated from stevia EST and nucleotide sequences. None of the novel sequences has been described previously in other plant species. Putative target genes were predicted for most conserved and novel miRNAs. The predicted targets are mainly mRNAs encoding enzymes regulating essential plant metabolic and signaling pathways. This study led to the identification of 34 highly conserved miRNA families and 12 novel potential miRNAs, indicating that specific miRNAs exist in stevia species. Our results provide information on stevia miRNAs and their targets, building a foundation for future studies to understand their roles in key stevia traits. PMID:23116282
Ultra Deep Sequencing of Listeria monocytogenes sRNA Transcriptome Revealed New Antisense RNAs
Behrens, Sebastian; Widder, Stefanie; Mannala, Gopala Krishna; Qing, Xiaoxing; Madhugiri, Ramakanth; Kefer, Nathalie; Mraheil, Mobarak Abu; Rattei, Thomas; Hain, Torsten
2014-01-01
Listeria monocytogenes, a gram-positive pathogen and the causative agent of listeriosis, has become a widely used model organism for intracellular infections. Recent studies have identified small non-coding RNAs (sRNAs) as important factors for regulating gene expression and pathogenicity of L. monocytogenes. Increased speed and reduced costs of high throughput sequencing (HTS) techniques have made RNA sequencing (RNA-Seq) the state-of-the-art method to study bacterial transcriptomes. We created a large transcriptome dataset of L. monocytogenes containing a total of 21 million reads, using the SOLiD sequencing technology. The dataset contained cDNA sequences generated from L. monocytogenes RNA collected under intracellular and extracellular conditions and was additionally size-fractionated into three size ranges: <40 nt, 40–150 nt and >150 nt. We report here the identification of nine new sRNA candidates of L. monocytogenes and a reevaluation of known sRNAs of L. monocytogenes EGD-e. Automatic comparison to known sRNAs revealed a high recovery rate of 55%, which was increased to 90% by manual revision of the data. Moreover, thorough classification of known sRNAs shed further light on their possible biological functions. Interestingly, among the newly identified sRNA candidates are antisense RNAs (asRNAs) associated with the housekeeping genes purA, fumC and pgi and potentially their regulation, emphasizing the significance of sRNAs for metabolic adaptation in L. monocytogenes. PMID:24498259
USDA-ARS's Scientific Manuscript database
The glassy-winged sharpshooter (GWSS) is an invasive insect species that transmits Xylella fastidiosa, the bacterium causing Pierce’s disease of grapevine and other leaf scorch diseases. X. fastidiosa has been shown to colonize the anterior foregut (cibarium and precibarium) of sharpshooters, where ...
USDA-ARS's Scientific Manuscript database
Volatile short-chain fatty acids (SCFAs, acetate, propionate, and butyrate), especially butyrate, alter cell differentiation, proliferation, motility, and in particular, induce cell cycle arrest and apoptosis through its histone deacetylase (HDAC) inhibition activity. Butyrate is a great inducer of ...
Cross-species inference of long non-coding RNAs greatly expands the ruminant transcriptome.
Bush, Stephen J; Muriuki, Charity; McCulloch, Mary E B; Farquhar, Iseabail L; Clark, Emily L; Hume, David A
2018-04-24
mRNA-like long non-coding RNAs (lncRNAs) are a significant component of mammalian transcriptomes, although most are expressed only at low levels, with high tissue-specificity and/or at specific developmental stages. Thus, in many cases lncRNA detection by RNA-sequencing (RNA-seq) is compromised by stochastic sampling. To account for this and create a catalogue of ruminant lncRNAs, we compared de novo assembled lncRNAs derived from large RNA-seq datasets in transcriptional atlas projects for sheep and goats with previous lncRNAs assembled in cattle and human. We then combined the novel lncRNAs with the sheep transcriptional atlas to identify co-regulated sets of protein-coding and non-coding loci. Few lncRNAs could be reproducibly assembled from a single dataset, even with deep sequencing of the same tissues from multiple animals. Furthermore, there was little sequence overlap between lncRNAs that were assembled from pooled RNA-seq data. We combined positional conservation (synteny) with cross-species mapping of candidate lncRNAs to identify a consensus set of ruminant lncRNAs and then used the RNA-seq data to demonstrate detectable and reproducible expression in each species. In sheep, 20 to 30% of lncRNAs were located close to protein-coding genes with which they are strongly co-expressed, which is consistent with the evolutionary origin of some ncRNAs in enhancer sequences. Nevertheless, most of the lncRNAs are not co-expressed with neighbouring protein-coding genes. Alongside substantially expanding the ruminant lncRNA repertoire, the outcomes of our analysis demonstrate that stochastic sampling can be partly overcome by combining RNA-seq datasets from related species. This has practical implications for the future discovery of lncRNAs in other species.
Zhang, Shuai; Qin, Chunxia; Cao, Guoqiong; Xin, Wenfeng; Feng, Chengqiang; Zhang, Wensheng
2016-08-02
Long noncoding RNAs (lncRNAs) may play an important role in Alzheimer's disease (AD) pathogenesis. However, despite considerable research in this area, a comprehensive and systematic understanding of lncRNAs in AD is still lacking. RNA sequencing provides a predictive tool with clear advantages over other methods, including microarrays. In this study, we identified lncRNAs in 7-month-old mouse brain through deep RNA sequencing using the senescence-accelerated mouse prone 8 (SAMP8) and senescence-accelerated mouse resistant 1 (SAMR1) models. A total of 599,985,802 clean reads and 23,334 lncRNA transcripts were obtained. We then identified 97 significantly upregulated and 114 significantly downregulated lncRNA transcripts from all cases in SAMP8 mice relative to SAMR1 mice. Gene ontology (GO) and Kyoto Encyclopedia of Genes and Genomes analyses revealed that these significantly dysregulated lncRNAs were involved in regulating the development of AD from various angles, such as the nerve growth factor term (GO: 1990089), the mitogen-activated protein kinase signaling pathway, and the AD pathway. Furthermore, the most probable AD-associated lncRNAs were predicted and listed in detail. Our study provides a systematic dissection of lncRNA profiling in SAMP8 mouse brain and should accelerate the development of lncRNA biomarkers in AD. These attractive biomarkers could provide significant insights into AD therapy in the future.
NASA Astrophysics Data System (ADS)
Bar Or, I.; Ben-Dov, E.; Kushmaro, A.; Eckert, W.; Sivan, O.
2014-06-01
The microbial methane oxidation process (methanotrophy) is the primary control on the emission of the greenhouse gas methane (CH4) to the atmosphere. In terrestrial environments, aerobic methanotrophic bacteria are mainly responsible for oxidizing the methane. In marine sediments the coupling of the anaerobic oxidation of methane (AOM) with sulfate reduction, often by a consortium of anaerobic methanotrophic archaea (ANME) and sulfate reducing bacteria, was found to consume almost all the upward diffusing methane. Recently, we showed geochemical evidence for AOM driven by iron reduction in Lake Kinneret (LK) (Israel) deep sediments and suggested that this process can be an important global methane sink. The goal of the present study was to link the geochemical gradients found in the porewater (chemical and isotope profiles) with possible changes in microbial community structure. Specifically, we examined the possible shift in the microbial community in the deep iron-driven AOM zone and its similarity to known sulfate driven AOM populations. Screening of archaeal 16S rRNA gene sequences revealed Thaumarchaeota and Euryarchaeota as the dominant phyla in the sediment. Thaumarchaeota, which belongs to the family of copper containing membrane-bound monooxygenases, increased with depth while Euryarchaeota decreased. This may indicate the involvement of Thaumarchaeota, which were discovered to be ammonia oxidizers but whose activity could also be linked to methane, in AOM in the deep sediment. ANME sequences were not found in the clone libraries, suggesting that iron-driven AOM is not through sulfate. Bacterial 16S rRNA sequences displayed shifts in community diversity with depth. Proteobacteria and Chloroflexi increased with depth, which could be connected with their different dissimilatory anaerobic processes. The observed changes in microbial community structure suggest possible direct and indirect mechanisms for iron-driven AOM in deep sediments.
Yi, Hai-Cheng; You, Zhu-Hong; Huang, De-Shuang; Li, Xiao; Jiang, Tong-Hai; Li, Li-Ping
2018-06-01
The interactions between non-coding RNAs (ncRNAs) and proteins play an important role in many biological processes, and the biological functions of ncRNAs are primarily achieved by binding with a variety of proteins. High-throughput biological techniques are used to identify protein molecules bound to specific ncRNAs, but they are usually expensive and time consuming. Deep learning provides a powerful solution to computationally predict RNA-protein interactions. In this work, we propose the RPI-SAN model, which uses a deep-learning stacked auto-encoder network to mine the hidden high-level features from RNA and protein sequences and feeds them into a random forest (RF) model to predict ncRNA binding proteins. Stacked assembling is further used to improve the accuracy of the proposed method. Four benchmark datasets, including RPI2241, RPI488, RPI1807, and NPInter v2.0, were employed for the unbiased evaluation of five established prediction tools: RPI-Pred, IPMiner, RPISeq-RF, lncPro, and RPI-SAN. The experimental results show that our RPI-SAN model achieves much better performance than other methods, with accuracies of 90.77%, 89.7%, 96.1%, and 99.33%, respectively. It is anticipated that RPI-SAN can be used as an effective computational tool for future biomedical research and can accurately predict potential interacting ncRNA-protein pairs, which provides reliable guidance for biological research. Copyright © 2018 The Author(s). Published by Elsevier Inc. All rights reserved.
Loher, Phillipe; Telonis, Aristeidis G.; Rigoutsos, Isidore
2017-01-01
Transfer RNA fragments (tRFs) are an established class of constitutive regulatory molecules that arise from precursor and mature tRNAs. RNA deep sequencing (RNA-seq) has greatly facilitated the study of tRFs. However, the repeat nature of the tRNA templates and the idiosyncrasies of tRNA sequences necessitate the development and use of methodologies that differ markedly from those used to analyze RNA-seq data when studying microRNAs (miRNAs) or messenger RNAs (mRNAs). Here we present MINTmap (for MItochondrial and Nuclear TRF mapping), a method and a software package that was developed specifically for the quick, deterministic and exhaustive identification of tRFs in short RNA-seq datasets. In addition to identifying them, MINTmap is able to unambiguously calculate and report both raw and normalized abundances for the discovered tRFs. Furthermore, to ensure specificity, MINTmap identifies the subset of discovered tRFs that could be originating outside of tRNA space and flags them as candidate false positives. Our comparative analysis shows that MINTmap exhibits superior sensitivity and specificity to other available methods while also being exceptionally fast. The MINTmap codes are available through https://github.com/TJU-CMC-Org/MINTmap/ under an open source GNU GPL v3.0 license. PMID:28220888
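As a rough illustration of the "raw and normalized abundances" mentioned in this abstract, the sketch below counts reads that exactly match a lookup set of fragment sequences and converts the counts to reads per million. It is not MINTmap's code: the function name `trf_abundances`, the toy sequences, and the choice of total read count as the normalization denominator are all assumptions made for the example.

```python
from collections import Counter


def trf_abundances(reads, known_trfs):
    """Count reads that exactly match a known fragment sequence and
    report raw counts plus reads-per-million (RPM) values.

    Hypothetical helper: the RPM denominator (all reads in the dataset)
    is an assumption, not necessarily the normalization MINTmap uses.
    """
    raw = Counter(read for read in reads if read in known_trfs)
    total = len(reads)
    return {
        seq: {"raw": count, "rpm": count * 1_000_000 / total}
        for seq, count in raw.items()
    }


# Toy input: made-up sequences, not real tRFs.
known = {"GCATTGGTGGTTCAGTGG", "TCCCTGGTGGTCTAGTGG"}
reads = (["GCATTGGTGGTTCAGTGG"] * 3
         + ["TCCCTGGTGGTCTAGTGG"]
         + ["ACGTACGTACGT"] * 6)

for seq, values in sorted(trf_abundances(reads, known).items()):
    print(seq, values["raw"], values["rpm"])
```

Exact lookup against a fixed table keeps the result deterministic, which is the property the abstract emphasizes; handling of candidate false positives is left out of this sketch.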
Zywicki, Marek; Bakowska-Zywicka, Kamilla; Polacek, Norbert
2012-05-01
The exploration of the non-protein-coding RNA (ncRNA) transcriptome is currently focused on profiling of microRNA expression and detection of novel ncRNA transcription units. However, recent studies suggest that RNA processing can be a multi-layer process leading to the generation of ncRNAs of diverse functions from a single primary transcript. To date, no methodology has been presented to distinguish stable functional RNA species from rapidly degraded side products of nucleases. Thus the correct assessment of widespread RNA processing events is one of the major obstacles in transcriptome research. Here, we present a novel automated computational pipeline, named APART, providing a complete workflow for the reliable detection of RNA processing products from next-generation-sequencing data. The major features include efficient handling of non-unique reads, detection of novel stable ncRNA transcripts and processing products and annotation of known transcripts based on multiple sources of information. To disclose the potential of APART, we have analyzed a cDNA library derived from small ribosome-associated RNAs in Saccharomyces cerevisiae. By employing the APART pipeline, we were able to detect and confirm by independent experimental methods multiple novel stable RNA molecules differentially processed from well-known ncRNAs, like rRNAs, tRNAs or snoRNAs, in a stress-dependent manner.
Avsec, Žiga; Cheng, Jun; Gagneur, Julien
2018-01-01
Motivation: Regulatory sequences are not solely defined by their nucleic acid sequence but also by their relative distances to genomic landmarks such as transcription start site, exon boundaries or polyadenylation site. Deep learning has become the approach of choice for modeling regulatory sequences because of its strength to learn complex sequence features. However, modeling relative distances to genomic landmarks in deep neural networks has not been addressed. Results: Here we developed spline transformation, a neural network module based on splines to flexibly and robustly model distances. Modeling distances to various genomic landmarks with spline transformations significantly increased state-of-the-art prediction accuracy of in vivo RNA-binding protein binding sites for 120 out of 123 proteins. We also developed a deep neural network for human splice branchpoint based on spline transformations that outperformed the current best, already distance-based, machine learning model. Compared to piecewise linear transformation, as obtained by composition of rectified linear units, spline transformation yields higher prediction accuracy as well as faster and more robust training. As spline transformation can be applied to further quantities beyond distances, such as methylation or conservation, we foresee it as a versatile component in the genomics deep learning toolbox. Availability and implementation: Spline transformation is implemented as a Keras layer in the CONCISE python package: https://github.com/gagneurlab/concise. Analysis code is available at https://github.com/gagneurlab/Manuscript_Avsec_Bioinformatics_2017. Contact: avsec@in.tum.de or gagneur@in.tum.de. Supplementary information: Supplementary data are available at Bioinformatics online. PMID:29155928
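The idea of modeling a distance through a spline can be sketched without any deep learning framework: expand the scalar distance into B-spline basis values, then take a weighted sum in which the weights would be the trainable parameters. The code below is a minimal pure-Python sketch under stated assumptions and does not reproduce the CONCISE Keras layer: the function names, the clamped knot vector, the log scaling of distances, and the `max_distance` parameter are all chosen here for illustration.

```python
import math


def bspline_basis(x, knots, degree):
    """Evaluate all B-spline basis functions of the given degree at x
    with the Cox-de Boor recursion. Intervals are half-open, so x must
    be strictly below the last knot."""
    # Degree 0: indicator function of each knot interval.
    basis = [1.0 if knots[i] <= x < knots[i + 1] else 0.0
             for i in range(len(knots) - 1)]
    for d in range(1, degree + 1):
        nxt = []
        for i in range(len(knots) - d - 1):
            left = right = 0.0
            # Skip terms whose knot span is zero (repeated knots).
            if knots[i + d] != knots[i]:
                left = (x - knots[i]) / (knots[i + d] - knots[i]) * basis[i]
            if knots[i + d + 1] != knots[i + 1]:
                right = ((knots[i + d + 1] - x)
                         / (knots[i + d + 1] - knots[i + 1]) * basis[i + 1])
            nxt.append(left + right)
        basis = nxt
    return basis


def spline_transform(distance, weights, knots, degree=3, max_distance=10_000):
    """Map a non-negative distance to a smooth scalar feature.

    Assumed preprocessing: distances up to max_distance are log-scaled
    into [0, 1). In a network, `weights` would be learned.
    """
    x = math.log1p(distance) / math.log1p(max_distance + 1)
    return sum(w * b for w, b in zip(weights, bspline_basis(x, knots, degree)))


# Clamped cubic knot vector on [0, 1] with one interior knot: 5 basis functions.
knots = [0, 0, 0, 0, 0.5, 1, 1, 1, 1]
weights = [0.0, 0.5, 1.0, 0.5, 0.0]  # hypothetical values standing in for learned ones
print(spline_transform(250, weights, knots))
```

Because the basis functions sum to one inside the knot range, setting all weights to the same value yields a constant output, which is a convenient sanity check for the recursion.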
Jensen, Sigmund; Neufeld, Josh D; Birkeland, Nils-Kåre; Hovland, Martin; Murrell, John Colin
2008-11-01
Deep-water coral reefs are seafloor environments with diverse biological communities surrounded by cold permanent darkness. Sources of energy and carbon for the nourishment of these reefs are presently unclear. We investigated one aspect of the food web using DNA stable-isotope probing (DNA-SIP). Sediment from beneath a Lophelia pertusa reef off the coast of Norway was incubated until assimilation of 5 micromol 13CH4 g(-1) wet weight occurred. Extracted DNA was separated into 'light' and 'heavy' fractions for analysis of labelling. Bacterial community fingerprinting of PCR-amplified 16S rRNA gene fragments revealed two predominant 13C-specific bands. Sequencing of these bands indicated that carbon from 13CH4 had been assimilated by a Methylomicrobium and an uncultivated member of the Gammaproteobacteria. Cloning and sequencing of 16S rRNA genes from the heavy DNA, in addition to genes encoding particulate methane monooxygenase and methanol dehydrogenase, all linked Methylomicrobium with methane metabolism. Putative cross-feeders were affiliated with Methylophaga (Gammaproteobacteria), Hyphomicrobium (Alphaproteobacteria) and previously unrecognized methylotrophs of the Gammaproteobacteria, Alphaproteobacteria, Deferribacteres and Bacteroidetes. This first marine methane SIP study provides evidence for the presence of methylotrophs that participate in sediment food webs associated with deep-water coral reefs.
Multi-disciplinary methods to define RNA-protein interactions and regulatory networks.
Ascano, Manuel; Gerstberger, Stefanie; Tuschl, Thomas
2013-02-01
The advent of high-throughput technologies including deep-sequencing and protein mass spectrometry is facilitating the acquisition of large and precise data sets toward the definition of post-transcriptional regulatory networks. While early studies that investigated specific RNA-protein interactions in isolation laid the foundation for our understanding of the existence of molecular machines to assemble and process RNAs, there is a more recent appreciation of the importance of individual RNA-protein interactions that contribute to post-transcriptional gene regulation. The multitude of RNA-binding proteins (RBPs) and their many RNA targets has only been captured experimentally in recent times. In this review, we will examine current multidisciplinary approaches toward elucidating RNA-protein networks and their regulation. Copyright © 2013 Elsevier Ltd. All rights reserved.
Micropathogen Community Analysis in Hyalomma rufipes via High-Throughput Sequencing of Small RNAs
Luo, Jin; Liu, Min-Xuan; Ren, Qiao-Yun; Chen, Ze; Tian, Zhan-Cheng; Hao, Jia-Wei; Wu, Feng; Liu, Xiao-Cui; Luo, Jian-Xun; Yin, Hong; Wang, Hui; Liu, Guang-Yuan
2017-01-01
Ticks are important vectors in the transmission of a broad range of micropathogens to vertebrates, including humans. Because of the role of ticks in disease transmission, identifying and characterizing the micropathogen profiles of tick populations have become increasingly important. The objective of this study was to survey the micropathogens of Hyalomma rufipes ticks. Illumina HiSeq2000 technology was utilized to perform deep sequencing of small RNAs (sRNAs) extracted from field-collected H. rufipes ticks in Gansu Province, China. The resultant sRNA library data revealed that the surveyed tick populations produced reads that were homologous to St. Croix River Virus (SCRV) sequences. We also observed many reads that were homologous to microbial and/or pathogenic isolates, including bacteria, protozoa, and fungi. As part of this analysis, a phylogenetic tree was constructed to display the relationships among the homologous sequences that were identified. The study offered a unique opportunity to gain insight into the micropathogens of H. rufipes ticks. The effective control of arthropod vectors in the future will require knowledge of the micropathogen composition of vectors harboring infectious agents. Understanding the ecological factors that regulate vector propagation in association with the prevalence and persistence of micropathogen lineages is also imperative. These interactions may affect the evolution of micropathogen lineages, especially if the micropathogens rely on the vector or host for dispersal. The sRNA deep-sequencing approach used in this analysis provides an intuitive method to survey micropathogen prevalence in ticks and other vector species. PMID:28861401
Rapidly evolving homing CRISPR barcodes
Kalhor, Reza; Mali, Prashant; Church, George M.
2017-01-01
We present here an approach for engineering evolving DNA barcodes in living cells. The methodology entails using a homing guide RNA (hgRNA) scaffold that directs the Cas9-hgRNA complex to target the DNA locus of the hgRNA itself. We show that this homing CRISPR-Cas9 system acts as an expressed genetic barcode that diversifies its sequence and that the rate of diversification can be controlled in cultured cells. We further evaluate these barcodes in cell populations and show the barcode RNAs can be assayed as single molecules in situ. This integrated approach will have wide ranging applications, such as in deep lineage tracing, cellular barcoding, molecular recording, dissecting cancer biology, and connectome mapping. PMID:27918539
Soreq, Lilach; Guffanti, Alessandro; Salomonis, Nathan; Simchovitz, Alon; Israel, Zvi; Bergman, Hagai; Soreq, Hermona
2014-01-01
The continuously prolonged human lifespan is accompanied by an increase in the incidence of neurodegenerative diseases, calling for the development of inexpensive blood-based diagnostics. Analyzing blood cell transcripts by RNA-Seq is a robust means of identifying novel biomarkers and is rapidly becoming commonplace. However, there is a lack of tools to discover novel exons, junctions and splicing events and to precisely and sensitively assess differential splicing through RNA-Seq data analysis and across RNA-Seq platforms. Here, we present a new and comprehensive computational workflow for whole-transcriptome RNA-Seq analysis, using an updated version of the software AltAnalyze, to identify both known and novel high-confidence alternative splicing events, and to integrate them with both protein-domains and microRNA binding annotations. We applied the novel workflow on RNA-Seq data from Parkinson's disease (PD) patients' leukocytes pre- and post- Deep Brain Stimulation (DBS) treatment and compared to healthy controls. Disease-mediated changes included decreased usage of alternative promoters and N-termini, 5′-end variations and mutually-exclusive exons. The PD regulated FUS and HNRNP A/B included prion-like domains regulated regions. We also present here a workflow to identify and analyze long non-coding RNAs (lncRNAs) via RNA-Seq data. We identified reduced lncRNA expression and selective PD-induced changes in 13 of over 6,000 detected leukocyte lncRNAs, four of which were inversely altered post-DBS. These included the U1 spliceosomal lncRNA and RP11-462G22.1, each entailing sequence complementarity to numerous microRNAs. Analysis of RNA-Seq from brains of PD patients and unaffected controls revealed over 7,000 brain-expressed lncRNAs, of which 3,495 were co-expressed in the leukocytes including U1, which showed both leukocyte and brain increases. Furthermore, qRT-PCR validations confirmed these co-increases in PD leukocytes and two brain regions, the amygdala and substantia-nigra, compared to controls. This novel workflow allows deep multi-level inspection of RNA-Seq datasets and provides a comprehensive new resource for understanding disease transcriptome modifications in PD and other neurodegenerative diseases. PMID:24651478
Leda, Ana Rachel; Hunter, James; Oliveira, Ursula Castro; Azevedo, Inacio Junqueira; Sucupira, Maria Cecilia Araripe; Diaz, Ricardo Sobhie
2018-04-19
The presence of minority transmitted drug resistance mutations was assessed using ultra-deep sequencing and correlated with disease progression among recently HIV-1-infected individuals from Brazil. Samples at baseline during recent infection and 1 year after the establishment of the infection were analysed. Viral RNA and proviral DNA from 25 individuals were subjected to ultra-deep sequencing of the reverse transcriptase and protease regions of HIV-1. Viral strains carrying transmitted drug resistance mutations were detected in 9 out of the 25 patients, for all major antiretroviral classes, ranging from one to five mutations per patient. Ultra-deep sequencing detected strains with frequencies as low as 1.6% and only strains with frequencies >20% were detected by population plasma sequencing (three patients). Transmitted drug resistance strains with frequencies <14.8% did not persist upon established infection. The presence of transmitted drug resistance mutations was negatively correlated with the viral load and with CD4+ T cell count decay. Transmitted drug resistance mutations representing small percentages of the viral population do not persist during infection because they are negatively selected in the first year after HIV-1 seroconversion.
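The frequency figures in this abstract can be tied together with a small worked example: a variant's frequency is its supporting read count divided by the coverage at that position, and the reported values (strains seen down to 1.6% by ultra-deep sequencing, only those above 20% seen by population sequencing) can serve as illustrative cutoffs. The function below is a hypothetical sketch; in particular, 1.6% is the lowest frequency the study reports, not a stated detection limit, so treating it as a threshold is an assumption.

```python
def classify_variant(variant_reads, total_reads,
                     deep_floor=0.016, population_floor=0.20):
    """Return (frequency, label) for a variant at one position.

    Hypothetical cutoffs taken from the figures quoted in the abstract:
    population sequencing picked up only strains above 20%, and the
    lowest frequency reported by ultra-deep sequencing was 1.6%.
    """
    frequency = variant_reads / total_reads
    if frequency > population_floor:
        return frequency, "detectable by population sequencing"
    if frequency >= deep_floor:
        return frequency, "ultra-deep sequencing only"
    return frequency, "below the lowest reported frequency"


# Made-up read counts at a coverage of 10,000 reads.
for reads in (3000, 500, 50):
    print(reads, classify_variant(reads, 10_000))
```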
Clyde, Karen; Glaunsinger, Britt A.
2011-01-01
One characteristic of lytic infection with gammaherpesviruses, including Kaposi's sarcoma-associated herpesvirus (KSHV), Epstein-Barr virus (EBV) and murine herpesvirus 68 (MHV68), is the dramatic suppression of cellular gene expression in a process known as host shutoff. The alkaline exonuclease proteins (KSHV SOX, MHV-68 muSOX and EBV BGLF5) have been shown to induce shutoff by destabilizing cellular mRNAs. Here we extend previous analyses of cellular mRNA abundance during lytic infection to characterize the effects of SOX and muSOX, in the absence of other viral genes, utilizing deep sequencing technology (RNA-seq). Consistent with previous observations during lytic infection, the majority of transcripts are downregulated in cells expressing either SOX or muSOX, with muSOX acting as a more potent shutoff factor than SOX. Moreover, most cellular messages fall into the same expression class in both SOX- and muSOX-expressing cells, indicating that both factors target similar pools of mRNAs. More abundant mRNAs are more efficiently downregulated, suggesting a concentration effect in transcript targeting. However, even among highly expressed genes there are mRNAs that escape host shutoff. Further characterization of select escapees reveals multiple mechanisms by which cellular genes can evade downregulation. While some mRNAs are directly refractory to SOX, the steady state levels of others remain unchanged, presumably as a consequence of downstream effects on mRNA biogenesis. Collectively, these studies lay the framework for dissecting the mechanisms underlying the susceptibility of mRNA to destruction during lytic gammaherpesvirus infection. PMID:21573023
Bending, Gary D.; Lincoln, Suzanne D.; Sørensen, Sebastian R.; Morgan, J. Alun W.; Aamand, Jens; Walker, Allan
2003-01-01
Substantial spatial variability in the degradation rate of the phenyl-urea herbicide isoproturon (IPU) [3-(4-isopropylphenyl)-1,1-dimethylurea] has been shown to occur within agricultural fields, with implications for the longevity of the compound in the soil, and its movement to ground- and surface water. The microbial mechanisms underlying such spatial variability in degradation rate were investigated at Deep Slade field in Warwickshire, United Kingdom. Most-probable-number analysis showed that rapid degradation of IPU was associated with proliferation of IPU-degrading organisms. Slow degradation of IPU was linked to either a delay in the proliferation of IPU-degrading organisms or apparent cometabolic degradation. Using enrichment techniques, an IPU-degrading bacterial culture (designated strain F35) was isolated from fast-degrading soil, and partial 16S rRNA sequencing placed it within the Sphingomonas group. Denaturing gradient gel electrophoresis (DGGE) of PCR-amplified bacterial community 16S rRNA revealed two bands that increased in intensity in soil during growth-linked metabolism of IPU, and sequencing of the excised bands showed high sequence homology to the Sphingomonas group. However, while F35 was not closely related to either DGGE band, one of the DGGE bands showed 100% partial 16S rRNA sequence homology to an IPU-degrading Sphingomonas sp. (strain SRS2) isolated from Deep Slade field in an earlier study. Experiments with strains SRS2 and F35 in soil and liquid culture showed that the isolates had a narrow pH optimum (7 to 7.5) for metabolism of IPU. The pH requirements of IPU-degrading strains of Sphingomonas spp. could largely account for the spatial variation of IPU degradation rates across the field. PMID:12571001
Pan, Xiaoyong; Shen, Hong-Bin
2017-02-28
RNAs play key roles in cells through their interactions with proteins known as RNA-binding proteins (RBPs), and RBP binding motifs are crucial to understanding the post-transcriptional regulation of RNAs. How the RBPs correctly recognize the target RNAs and why they bind specific positions is still far from clear. Machine learning-based algorithms are widely acknowledged to be capable of speeding up this process. Although many automatic tools have been developed to predict the RNA-protein binding sites from the rapidly growing multi-resource data, e.g. sequence, structure, their domain specific features and formats have posed significant computational challenges. One of the current difficulties is that the cross-source shared common knowledge is at a higher abstraction level beyond the observed data, resulting in a low efficiency of direct integration of observed data across domains. The other difficulty is how to interpret the prediction results. Existing approaches tend to terminate after outputting the potential discrete binding sites on the sequences, but how to assemble them into meaningful binding motifs is a topic worthy of further investigation. In view of these challenges, we propose a deep learning-based framework (iDeep) that uses a novel hybrid convolutional neural network and deep belief network to predict the RBP interaction sites and motifs on RNAs. This new protocol transforms the original observed data into a high-level abstraction feature space using multiple layers of learning blocks, where the shared representations across different domains are integrated. To validate our iDeep method, we performed experiments on 31 large-scale CLIP-seq datasets, and our results show that by integrating multiple sources of data, the average AUC can be improved by 8% compared to the best single-source-based predictor; and through cross-domain knowledge integration at an abstraction level, it outperforms the state-of-the-art predictors by 6%. Besides the overall enhanced prediction performance, the convolutional neural network module embedded in iDeep is also able to automatically capture the interpretable binding motifs for RBPs. Large-scale experiments demonstrate that these mined binding motifs agree well with the experimentally verified results, suggesting iDeep is a promising approach in real-world applications. The iDeep framework not only achieves better performance than the state-of-the-art predictors but also readily captures interpretable binding motifs. iDeep is available at http://www.csbio.sjtu.edu.cn/bioinf/iDeep.
Xiao, Chuan-Le; Mai, Zhi-Biao; Lian, Xin-Lei; Zhong, Jia-Yong; Jin, Jing-Jie; He, Qing-Yu; Zhang, Gong
2014-01-01
Correct and bias-free interpretation of the deep sequencing data is inevitably dependent on the complete mapping of all mappable reads to the reference sequence, especially for quantitative RNA-seq applications. Seed-based algorithms are generally slow but robust, while Burrows-Wheeler Transform (BWT) based algorithms are fast but less robust. To have both advantages, we developed an algorithm, FANSe2, with an iterative mapping strategy based on the statistics of real-world sequencing error distribution to substantially accelerate the mapping without compromising the accuracy. Its sensitivity and accuracy are higher than the BWT-based algorithms in the tests using both prokaryotic and eukaryotic sequencing datasets. The gene identification results of FANSe2 are experimentally validated, while the previous algorithms have false positives and false negatives. FANSe2 showed remarkably better consistency with the microarray than most other algorithms in terms of gene expression quantifications. We implemented a scalable and almost maintenance-free parallelization method that can utilize the computational power of multiple office computers, a novel feature not present in any other mainstream algorithm. With three normal office computers, we demonstrated that FANSe2 mapped an RNA-seq dataset generated from an entire Illumina HiSeq 2000 flowcell (8 lanes, 608 M reads) to masked human genome within 4.1 hours with higher sensitivity than Bowtie/Bowtie2. FANSe2 thus provides robust accuracy, full indel sensitivity, fast speed, versatile compatibility and economical computational utilization, making it a useful and practical tool for deep sequencing applications. FANSe2 is freely available at http://bioinformatics.jnu.edu.cn/software/fanse2/.
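For readers unfamiliar with the seed-based family of mappers that this abstract contrasts with BWT-based ones, a generic seed-and-verify scheme can be sketched in a few lines. This is not the FANSe2 algorithm and omits its iterative strategy and indel handling: the function names, the non-overlapping seed layout, and the substitution-only mismatch check are assumptions made for the example.

```python
from collections import defaultdict


def build_seed_index(reference, k):
    """Index every k-mer of the reference by its start positions."""
    index = defaultdict(list)
    for pos in range(len(reference) - k + 1):
        index[reference[pos:pos + k]].append(pos)
    return index


def map_read(read, reference, index, k, max_mismatches=2):
    """Return sorted reference start positions where the read aligns
    with at most max_mismatches substitutions (no indels).

    Generic seed-and-verify sketch: non-overlapping seeds propose
    candidate positions, and each candidate is checked base by base.
    """
    hits = set()
    for offset in range(0, len(read) - k + 1, k):
        for pos in index.get(read[offset:offset + k], ()):
            start = pos - offset
            if start < 0 or start + len(read) > len(reference):
                continue
            window = reference[start:start + len(read)]
            mismatches = sum(a != b for a, b in zip(read, window))
            if mismatches <= max_mismatches:
                hits.add(start)
    return sorted(hits)


# Toy reference and a 12-nt read copied from position 6 with one substitution.
reference = "TTGACCAGTACGGATCCATGCAAGT"
read = reference[6:11] + "T" + reference[12:18]
index = build_seed_index(reference, k=4)
print(map_read(read, reference, index, k=4))
```

The read is still placed despite the substitution because at least one of its seeds remains error-free, which is the source of the robustness attributed to seed-based mapping; the cost is the extra verification work per candidate.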
The bipartite mitochondrial genome of Ruizia karukerae (Rhigonematomorpha, Nematoda).
Kim, Taeho; Kern, Elizabeth; Park, Chungoo; Nadler, Steven A; Bae, Yeon Jae; Park, Joong-Ki
2018-05-10
Mitochondrial genes and whole mitochondrial genome sequences are widely used as molecular markers in studying population genetics and resolving both deep and shallow nodes in phylogenetics. In animals the mitochondrial genome is generally composed of a single chromosome, but mystifying exceptions sometimes occur. We determined the complete mitochondrial genome of the millipede-parasitic nematode Ruizia karukerae and found its mitochondrial genome consists of two circular chromosomes, which is highly unusual in bilateral animals. Chromosome I is 7,659 bp and includes six protein-coding genes, two rRNA genes and nine tRNA genes. Chromosome II comprises 7,647 bp, with seven protein-coding genes and 16 tRNA genes. Interestingly, both chromosomes share a 1,010 bp sequence containing duplicate copies of cox2 and three tRNA genes (trnD, trnG and trnH), and the nucleotide sequences between the duplicated homologous gene copies are nearly identical, suggesting a possible recent genesis for this bipartite mitochondrial genome. Given that little is known about the formation, maintenance or evolution of abnormal mitochondrial genome structures, R. karukerae mtDNA may provide an important early glimpse into this process.
Jeong, Dong-Hoon; Park, Sunhee; Zhai, Jixian; Gurazada, Sai Guna Ranjan; De Paoli, Emanuele; Meyers, Blake C.; Green, Pamela J.
2011-01-01
Small RNAs have a variety of important roles in plant development, stress responses, and other processes. They exert their influence by guiding mRNA cleavage, translational repression, and chromatin modification. To identify previously unknown rice (Oryza sativa) microRNAs (miRNAs) and those regulated by environmental stress, 62 small RNA libraries were constructed from rice plants and used for deep sequencing with Illumina technology. The libraries represent several tissues from control plants and plants subjected to different environmental stress treatments. More than 94 million genome-matched reads were obtained, resulting in more than 16 million distinct small RNA sequences. This allowed an evaluation of ~400 annotated miRNAs with current criteria and the finding that among these, ~150 had small interfering RNA–like characteristics. Seventy-six new miRNAs were found, and miRNAs regulated in response to water stress, nutrient stress, or temperature stress were identified. Among the new examples of miRNA regulation were members of the same miRNA family that were differentially regulated in different organs and had distinct sequences. Some of these distinct family members result in differential target cleavage and provide new insight about how an agriculturally important rice phenotype could be regulated in the panicle. This high-resolution analysis of rice miRNAs should be relevant to plant miRNAs in general, particularly in the Poaceae. PMID:22158467
Copeland, Alex; Gu, Wei; Yasawong, Montri; Lapidus, Alla; Lucas, Susan; Deshpande, Shweta; Pagani, Ioanna; Tapia, Roxanne; Cheng, Jan-Fang; Goodwin, Lynne A.; Pitluck, Sam; Liolios, Konstantinos; Ivanova, Natalia; Mavromatis, Konstantinos; Mikhailova, Natalia; Pati, Amrita; Chen, Amy; Palaniappan, Krishna; Land, Miriam; Pan, Chongle; Brambilla, Evelyne-Marie; Rohde, Manfred; Tindall, Brian J.; Sikorski, Johannes; Göker, Markus; Detter, John C.; Bristow, James; Eisen, Jonathan A.; Markowitz, Victor; Hugenholtz, Philip; Kyrpides, Nikos C.; Klenk, Hans-Peter; Woyke, Tanja
2012-01-01
Marinithermus hydrothermalis Sako et al. 2003 is the type species of the monotypic genus Marinithermus. M. hydrothermalis T1T was the first isolate within the phylum “Thermus-Deinococcus” to exhibit optimal growth under a salinity equivalent to that of sea water and to have an absolute requirement for NaCl for growth. M. hydrothermalis T1T is of interest because it may provide a new insight into the ecological significance of the aerobic, thermophilic decomposers in the circulation of organic compounds in deep-sea hydrothermal vent ecosystems. This is the first completed genome sequence of a member of the genus Marinithermus and the seventh sequence from the family Thermaceae. Here we describe the features of this organism, together with the complete genome sequence and annotation. The 2,269,167 bp long genome with its 2,251 protein-coding and 59 RNA genes is a part of the Genomic Encyclopedia of Bacteria and Archaea project. PMID:22675595
Small RNA Analysis in Sindbis Virus Infected Human HEK293 Cells
Dalmay, Tamas; Powell, Penny P.
2013-01-01
Introduction In contrast to the defence mechanism of RNA interference (RNAi) in plants and invertebrates, its role in the innate response to virus infection of mammals is a matter of debate. Since RNAi has a well-established role in controlling infection of the alphavirus Sindbis virus (SINV) in insects, we have used this virus to investigate the role of RNAi in SINV infection of human cells. Results SINV AR339 and TR339-GFP were adapted to grow in HEK293 cells. Deep sequencing of small RNAs (sRNAs) early in SINV infection (4 and 6 hpi) showed low abundance (0.8%) of viral sRNAs (vsRNAs), with no size-, sequence- or location-specific patterns characteristic of Dicer products, nor did they possess any discernible pattern attributable to a specific RNAi biogenesis pathway. This was supported by multiple variants for each sequence, and lack of hot spots along the viral genome sequence. The abundance of the best defined vsRNAs was below the limit of Northern blot detection. The adaptation of the virus to HEK293 cells showed few sequence changes compared to the reference; however, a SNP in the E1 gene with a preference from G to C was found. Deep sequencing results showed little variation of expression of cellular microRNAs (miRNAs) at 4 and 6 hpi compared to uninfected cells. Twelve miRNAs exhibiting some minor differential expression by sequencing showed no difference in expression by Northern blot analysis. Conclusions We show that, unlike SINV infection of invertebrates, generation of Dicer-dependent vsRNAs and change in expression of cellular miRNAs were not detected as part of the human response to SINV. PMID:24391886
Comparative metagenomics of bathypelagic plankton and bottom sediment from the Sea of Marmara
Quaiser, Achim; Zivanovic, Yvan; Moreira, David; López-García, Purificación
2011-01-01
To extend comparative metagenomic analyses of the deep-sea, we produced metagenomic data by direct 454 pyrosequencing from bathypelagic plankton (1000 m depth) and bottom sediment of the Sea of Marmara, the gateway between the Eastern Mediterranean and the Black Seas. Data from small subunit ribosomal RNA (SSU rRNA) gene libraries and direct pyrosequencing of the same samples indicated that Gamma- and Alpha-proteobacteria, followed by Bacteroidetes, dominated the bacterial fraction in Marmara deep-sea plankton, whereas Planctomycetes, Delta- and Gamma-proteobacteria were the most abundant groups in high bacterial-diversity sediment. Group I Crenarchaeota/Thaumarchaeota dominated the archaeal plankton fraction, although group II and III Euryarchaeota were also present. Eukaryotes were highly diverse in SSU rRNA gene libraries, with group I (Duboscquellida) and II (Syndiniales) alveolates and Radiozoa dominating plankton, and Opisthokonta and Alveolates, sediment. However, eukaryotic sequences were scarce in pyrosequence data. Archaeal amo genes were abundant in plankton, suggesting that Marmara planktonic Thaumarchaeota are ammonia oxidizers. Genes involved in sulfate reduction, carbon monoxide oxidation, anammox and sulfatases were over-represented in sediment. Genome recruitment analyses showed that Alteromonas macleodii ‘surface ecotype', Pelagibacter ubique and Nitrosopumilus maritimus were highly represented in 1000 m-deep plankton. A comparative analysis of Marmara metagenomes with ALOHA deep-sea and surface plankton, whale carcasses, Peru subsurface sediment and soil metagenomes clustered deep-sea Marmara plankton with deep-ALOHA plankton and whale carcasses, likely because of the suboxic conditions in the deep Marmara water column. The Marmara sediment clustered with the soil metagenome, highlighting the common ecological role of both types of microbial communities in the degradation of organic matter and the completion of biogeochemical cycles. PMID:20668488
RNA Structural Analysis by Evolving SHAPE Chemistry
Spitale, Robert C.; Flynn, Ryan A.; Torre, Eduardo A.; Kool, Eric T.; Chang, Howard Y.
2017-01-01
RNA is central to the flow of biological information. From transcription to splicing, RNA localization, translation, and decay, RNA is intimately involved in regulating every step of the gene expression program, and is thus essential for health and understanding disease. RNA has the unique ability to base-pair with itself and other nucleic acids to form complex structures. Hence the information content in RNA is not simply its linear sequence of bases, but is also encoded in complex folding of RNA molecules. A general chemical functionality that all RNAs have is a 2’-hydroxyl group in the ribose ring, and the reactivity of the 2'-hydroxyl in RNA is gated by local nucleotide flexibility. In other words, the 2'-hydroxyl is reactive at single-stranded and conformationally flexible positions but is unreactive at nucleotides constrained by base pairing. Recent efforts have been focused on developing reagents that modify RNA as a function of RNA 2’ hydroxyl group flexibility. Such RNA structure probing techniques can be read out by primer extension in experiments termed RNA SHAPE (Selective 2’ Hydroxyl Acylation and Primer Extension). Herein we describe the efforts devoted to the design and utilization of SHAPE probes for characterizing RNA structure. We also describe current technological advances that are being used to utilize SHAPE chemistry with deep sequencing to probe many RNAs in parallel. The merger of chemistry with genomics is sure to open the door to genome-wide exploration of RNA structure and function. PMID:25132067
Sensitivity of Small RNA-Based Detection of Plant Viruses.
Santala, Johanna; Valkonen, Jari P T
2018-01-01
Plants recognize unrelated viruses by the antiviral defense system called RNA interference (RNAi). RNAi processes double-stranded viral RNA into small RNAs (sRNAs) of 21-24 nucleotides, the reassembly of which into longer strands in silico allows virus identification by comparison with the sequences available in databases. The aim of this study was to compare the virus detection sensitivity of sRNA-based virus diagnosis with the established virus species-specific polymerase chain reaction (PCR) approach. Viruses propagated in tobacco plants included three engineered, infectious clones of Potato virus A (PVA), each carrying a different marker gene, and an infectious clone of Potato virus Y (PVY). Total RNA (containing sRNA) was isolated and subjected to reverse-transcription real-time PCR (RT-RT-PCR) and sRNA deep-sequencing at different concentrations. RNA extracted from various crop plants was included in the reactions to normalize RNA concentrations. Targeted detection of selected viruses showed a similar threshold for the sRNA and reverse-transcription quantitative PCR (RT-qPCR) analyses. The detection limit for PVY and PVA by RT-qPCR in this study was 3 and 1.5 fg of viral RNA, respectively, in 50 ng of total RNA per PCR reaction. When knowledge was available about the viruses likely present in the samples, sRNA-based virus detection was 10 times more sensitive than RT-RT-PCR. The advantage of sRNA analysis is the detection of all tested viruses without the need for virus-specific primers or probes.
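The detection limits quoted above can be restated as mass fractions of the total RNA input. The following is a minimal sketch of that unit arithmetic only; the helper name `mass_fraction` and the constants are illustrative and are not taken from the study.

```python
# Unit arithmetic on the detection limits quoted in the abstract.
# The helper and variable names are illustrative, not from the study.
FG = 1e-15  # grams per femtogram
NG = 1e-9   # grams per nanogram


def mass_fraction(viral_fg, total_ng):
    """Return viral RNA mass as a fraction of the total RNA mass."""
    return (viral_fg * FG) / (total_ng * NG)


# RT-qPCR detection limits (fg viral RNA) in 50 ng total RNA per reaction.
limits_fg = {"PVY": 3.0, "PVA": 1.5}
total_rna_ng = 50.0

fractions = {
    virus: mass_fraction(fg, total_rna_ng) for virus, fg in limits_fg.items()
}

for virus, frac in fractions.items():
    print(f"{virus}: {frac:.1e} of total RNA by mass")
```

Expressed this way, both limits correspond to viral RNA making up well under one millionth of the input RNA by mass.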
Detecting and characterizing circular RNAs
Jeck, William R.; Sharpless, Norman E.
2014-01-01
Circular RNA transcripts were first identified in the early 1990s but knowledge of these species has remained limited, as their study has been difficult through traditional methods of RNA analysis. Now, novel bioinformatic approaches coupled with biochemical enrichment strategies and deep sequencing have allowed comprehensive studies of circular RNA species. Recent studies have revealed thousands of endogenous circular RNAs (circRNAs) in mammalian cells, some of which are highly abundant and evolutionarily conserved. Evidence is emerging that some circRNAs might regulate microRNA (miRNA) function, and roles in transcriptional control have also been suggested. Therefore, study of this class of non-coding RNAs has potential implications for therapeutic and research applications. We believe the key future challenge to the field will be to understand the regulation and function of these unusual molecules. PMID:24811520
Genome-wide characterization of microRNA in foxtail millet (Setaria italica)
2013-01-01
Background MicroRNAs (miRNAs) are a class of short non-coding, endogenous RNAs that play key roles in many biological processes in both animals and plants. Although many miRNAs have been identified in a large number of organisms, the miRNAs in foxtail millet (Setaria italica) have, until now, been poorly understood. Results In this study, two replicate small RNA libraries from foxtail millet shoots were sequenced, and 40 million reads representing over 10 million unique sequences were generated. We identified 43 known miRNAs, 172 novel miRNAs and 2 mirtron precursor candidates in foxtail millet. Some miRNA*s of the known and novel miRNAs were detected as well. Further, eight novel miRNAs were validated by stem-loop RT-PCR. Potential targets of the foxtail millet miRNAs were predicted based on our strict criteria. Of the predicted target genes, 79% (351) had functional annotations in InterPro and GO analyses, indicating the targets of the miRNAs were involved in a wide range of regulatory functions and some specific biological processes. A total of 69 pairs of syntenic miRNA precursors that were conserved between foxtail millet and sorghum were found. Additionally, stem-loop RT-PCR was conducted to confirm the tissue-specific expression of some miRNAs in the four tissues identified by deep-sequencing. Conclusions We predicted, for the first time, 215 miRNAs and 447 miRNA targets in foxtail millet at a genome-wide level. The precursors, expression levels, miRNA* sequences, target functions, conservation, and evolution of miRNAs we identified were investigated. Some of the novel foxtail millet miRNAs and miRNA targets were validated experimentally. PMID:24330712
Genome-wide characterization of microRNA in foxtail millet (Setaria italica).
Yi, Fei; Xie, Shaojun; Liu, Yuwei; Qi, Xin; Yu, Jingjuan
2013-12-13
MicroRNAs (miRNAs) are a class of short non-coding, endogenous RNAs that play key roles in many biological processes in both animals and plants. Although many miRNAs have been identified in a large number of organisms, the miRNAs in foxtail millet (Setaria italica) have, until now, been poorly understood. In this study, two replicate small RNA libraries from foxtail millet shoots were sequenced, and 40 million reads representing over 10 million unique sequences were generated. We identified 43 known miRNAs, 172 novel miRNAs and 2 mirtron precursor candidates in foxtail millet. Some miRNA*s of the known and novel miRNAs were detected as well. Further, eight novel miRNAs were validated by stem-loop RT-PCR. Potential targets of the foxtail millet miRNAs were predicted based on our strict criteria. Of the predicted target genes, 79% (351) had functional annotations in InterPro and GO analyses, indicating the targets of the miRNAs were involved in a wide range of regulatory functions and some specific biological processes. A total of 69 pairs of syntenic miRNA precursors that were conserved between foxtail millet and sorghum were found. Additionally, stem-loop RT-PCR was conducted to confirm the tissue-specific expression of some miRNAs in the four tissues identified by deep-sequencing. We predicted, for the first time, 215 miRNAs and 447 miRNA targets in foxtail millet at a genome-wide level. The precursors, expression levels, miRNA* sequences, target functions, conservation, and evolution of miRNAs we identified were investigated. Some of the novel foxtail millet miRNAs and miRNA targets were validated experimentally.
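The counts reported in this abstract can be cross-checked with simple arithmetic. The sketch below assumes that the 79% figure is taken relative to the 447 predicted targets, which the abstract implies but does not state outright; the variable names are illustrative.

```python
# Cross-check of the counts reported in the foxtail millet abstract.
# Assumption: the 79% refers to 351 annotated targets out of 447 predicted.
known_mirnas = 43
novel_mirnas = 172
total_mirnas = known_mirnas + novel_mirnas

predicted_targets = 447
annotated_targets = 351
annotated_pct = 100 * annotated_targets / predicted_targets

print(f"total miRNAs: {total_mirnas}")
print(f"annotated targets: {annotated_pct:.1f}% of predicted targets")
```

Under that assumption the known and novel miRNAs sum to the 215 quoted in the abstract, and the annotated share rounds to the reported 79%.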
Sakai, Hiroaki; Kanamori, Hiroyuki; Arai-Kichise, Yuko; Shibata-Hatta, Mari; Ebana, Kaworu; Oono, Youko; Kurita, Kanako; Fujisawa, Hiroko; Katagiri, Satoshi; Mukai, Yoshiyuki; Hamada, Masao; Itoh, Takeshi; Matsumoto, Takashi; Katayose, Yuichi; Wakasa, Kyo; Yano, Masahiro; Wu, Jianzhong
2014-01-01
Having a deep genetic structure evolved during its domestication and adaptation, the Asian cultivated rice (Oryza sativa) displays considerable physiological and morphological variations. Here, we describe deep whole-genome sequencing of the aus rice cultivar Kasalath by using the advanced next-generation sequencing (NGS) technologies to gain a better understanding of the sequence and structural changes among highly differentiated cultivars. The de novo assembled Kasalath sequences represented 91.1% (330.55 Mb) of the genome and contained 35 139 expressed loci annotated by RNA-Seq analysis. We detected 2 787 250 single-nucleotide polymorphisms (SNPs) and 7393 large insertion/deletion (indel) sites (>100 bp) between Kasalath and Nipponbare, and 2 216 251 SNPs and 3780 large indels between Kasalath and 93-11. Extensive comparison of the gene contents among these cultivars revealed similar rates of gene gain and loss. We detected at least 7.39 Mb of inserted sequences and 40.75 Mb of unmapped sequences in the Kasalath genome in comparison with the Nipponbare reference genome. Mapping of the publicly available NGS short reads from 50 rice accessions proved the necessity and the value of using the Kasalath whole-genome sequence as an additional reference to capture the sequence polymorphisms that cannot be discovered by using the Nipponbare sequence alone. PMID:24578372
Uniform, optimal signal processing of mapped deep-sequencing data.
Kumar, Vibhor; Muratani, Masafumi; Rayan, Nirmala Arul; Kraus, Petra; Lufkin, Thomas; Ng, Huck Hui; Prabhakar, Shyam
2013-07-01
Despite their apparent diversity, many problems in the analysis of high-throughput sequencing data are merely special cases of two general problems, signal detection and signal estimation. Here we adapt formally optimal solutions from signal processing theory to analyze signals of DNA sequence reads mapped to a genome. We describe DFilter, a detection algorithm that identifies regulatory features in ChIP-seq, DNase-seq and FAIRE-seq data more accurately than assay-specific algorithms. We also describe EFilter, an estimation algorithm that accurately predicts mRNA levels from as few as 1-2 histone profiles (R ∼0.9). Notably, the presence of regulatory motifs in promoters correlates more with histone modifications than with mRNA levels, suggesting that histone profiles are more predictive of cis-regulatory mechanisms. We show by applying DFilter and EFilter to embryonic forebrain ChIP-seq data that regulatory protein identification and functional annotation are feasible despite tissue heterogeneity. The mathematical formalism underlying our tools facilitates integrative analysis of data from virtually any sequencing-based functional profile.
Blazejak, Anna; Schippers, Axel
2010-05-01
Sequences of members of the bacterial candidate division JS-1 and the classes Anaerolineae and Caldilineae of the phylum Chloroflexi are frequently found in 16S rRNA gene clone libraries obtained from marine sediments. Using a newly designed quantitative, real-time PCR assay, these bacterial groups were jointly quantified in samples from near-surface and deeply buried marine sediments from the Peru margin, the Black Sea, and a forearc basin off the island of Sumatra. In near-surface sediments, sequences of the JS-1 as well as Anaerolineae- and Caldilineae-related Bacteria were quantified with significantly lower 16S rRNA gene copy numbers than the sequences of total Bacteria. In contrast, in deeply buried sediments below approximately 1 m depth, similar quantities of the 16S rRNA gene copies of these specific groups and Bacteria were found. This finding indicates that JS-1 and Anaerolineae- and Caldilineae-related Bacteria might dominate the bacterial community in deeply buried marine sediments and thus seem to play an important ecological role in the deep biosphere.
Baril, Patrick; Ezzine, Safia; Pichon, Chantal
2015-01-01
MicroRNAs (miRNAs) are a class of small non-coding RNAs that regulate gene expression by binding mRNA targets via sequence complementarity, inducing translational repression and/or mRNA degradation. A current challenge in the field of miRNA biology is to understand the functionality of miRNAs under physiopathological conditions. Recent evidence indicates that miRNA expression is more complex than simple regulation at the transcriptional level. MiRNAs undergo complex post-transcriptional regulation, such as miRNA processing, editing, accumulation and recycling within P-bodies. They are dynamically regulated and have a well-orchestrated spatiotemporal localization pattern. Real-time and spatiotemporal changes in miRNA expression are difficult to evaluate and are often underestimated. Therefore, important information connecting miRNA expression and function can be lost. Conventional miRNA profiling methods such as Northern blot, real-time PCR, microarray, in situ hybridization and deep sequencing continue to contribute to our knowledge of miRNA biology. However, these methods can seldom shed light on the spatiotemporal organization and function of miRNAs in real time. Non-invasive molecular imaging methods have the potential to address these issues and are thus attracting increasing attention. This paper reviews the state of the art in methods used to detect miRNAs and discusses their contribution to the emerging field of miRNA biology and therapy. PMID:25749473
Baril, Patrick; Ezzine, Safia; Pichon, Chantal
2015-03-04
MicroRNAs (miRNAs) are a class of small non-coding RNAs that regulate gene expression by binding mRNA targets via sequence complementarity, inducing translational repression and/or mRNA degradation. A current challenge in the field of miRNA biology is to understand the functionality of miRNAs under physiopathological conditions. Recent evidence indicates that miRNA expression is more complex than simple regulation at the transcriptional level. MiRNAs undergo complex post-transcriptional regulation, such as miRNA processing, editing, accumulation and recycling within P-bodies. They are dynamically regulated and have a well-orchestrated spatiotemporal localization pattern. Real-time and spatiotemporal changes in miRNA expression are difficult to evaluate and are often underestimated. Therefore, important information connecting miRNA expression and function can be lost. Conventional miRNA profiling methods such as Northern blot, real-time PCR, microarray, in situ hybridization and deep sequencing continue to contribute to our knowledge of miRNA biology. However, these methods can seldom shed light on the spatiotemporal organization and function of miRNAs in real time. Non-invasive molecular imaging methods have the potential to address these issues and are thus attracting increasing attention. This paper reviews the state of the art in methods used to detect miRNAs and discusses their contribution to the emerging field of miRNA biology and therapy.
Chakraborty, Sandeep; Britton, Monica; Martínez-García, P J; Dandekar, Abhaya M
2016-03-01
Deep RNA-Seq profiling, a revolutionary method used for quantifying transcriptional levels, often includes non-specific transcripts from other co-existing organisms in spite of stringent protocols. Using the recently published walnut genome sequence as a filter, we present a broad analysis of the RNA-Seq derived transcriptome profiles obtained from twenty different tissues to extract the biodiversity and possible plant-microbe interactions in the walnut ecosystem in California. Since the residual nature of the transcripts being analyzed does not provide sufficient information to identify the exact strain, inferences made are constrained to the genus level. The presence of the pathogenic oomycete Phytophthora was detected in the root through the presence of a glyceraldehyde-3-phosphate dehydrogenase. Cryptococcus, the causal agent of cryptococcosis, was found in the catkins and vegetative buds, corroborating previous work indicating that the plant surface supported the sexual cycle of this human pathogen. The RNA-Seq profile revealed several species of the endophytic nitrogen-fixing Actinobacteria. Another bacterial species implicated in aerobic biodegradation of methyl tert-butyl ether (Methylibium petroleiphilum) was also found in the root. RNA encoding proteins from the pea aphid was found in the leaves and vegetative buds, while a serine protease from mosquito with significant homology to a female reproductive tract protease from Drosophila mojavensis in the vegetative bud suggests egg-laying activities. The comprehensive analysis of the RNA-Seq data presented here also unraveled detailed, tissue-specific information on ~400 transcripts encoded by the largest family of resistance (R) genes (NBS-LRR), which possibly rationalizes the resistance of the specific walnut plant to the pathogens detected. Thus, we elucidate the biodiversity and possible plant-microbe interactions in several walnut (Juglans regia) tissues in California using deep RNA-Seq profiling.
Fungal diversity in deep-sea sediments associated with asphalt seeps at the Sao Paulo Plateau
NASA Astrophysics Data System (ADS)
Nagano, Yuriko; Miura, Toshiko; Nishi, Shinro; Lima, Andre O.; Nakayama, Cristina; Pellizari, Vivian H.; Fujikura, Katsunori
2017-12-01
We investigated the fungal diversity in a total of 20 deep-sea sediment samples (of which 14 samples were associated with natural asphalt seeps and 6 samples were not associated) collected from two different sites at the Sao Paulo Plateau off Brazil by Ion Torrent PGM sequencing targeting the ITS region of ribosomal RNA. Our results suggest that diverse fungi (113 operational taxonomic units (OTUs) based on clustering at 97% sequence similarity, assigned to 9 classes and 31 genera) are present in deep-sea sediment samples collected at the Sao Paulo Plateau, dominated by Ascomycota (74.3%), followed by Basidiomycota (11.5%), unidentified fungi (7.1%), and sequences with no affiliation to any organisms in the public database (7.1%). However, it was revealed that only three species, namely Penicillium sp., Cadophora malorum and Rhodosporidium diobovatum, were dominant, with the majority of OTUs remaining a minor community. Unexpectedly, there was no significant difference in major fungal community structure between the asphalt seep and non-asphalt seep sites, despite the presence of mass hydrocarbon deposits and the high abundance of macroorganisms surrounding the asphalt seeps. However, there were some differences in the minor fungal communities, with possible asphalt-degrading fungi present specifically in the asphalt seep sites. In contrast, some differences were found between the two different sampling sites. Classification of OTUs revealed that only 47 (41.6%) fungal OTUs exhibited >97% sequence similarity to pre-existing ITS sequences in public databases, indicating that a majority of deep-sea inhabiting fungal taxa still remain undescribed. Although our knowledge of fungi and their role in deep-sea environments is still limited, this study increases our understanding of fungal diversity and community structure in deep-sea environments.
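The reported composition can be turned back into approximate OTU counts. The sketch below assumes the percentages are shares of the 113 OTUs, which the abstract does not state explicitly; the implied counts are therefore an inference, not figures from the study.

```python
# Back-of-the-envelope reconstruction of OTU counts from reported percentages.
# Assumption: percentages are shares of the 113 OTUs (not of sequence reads).
total_otus = 113
reported_pct = {
    "Ascomycota": 74.3,
    "Basidiomycota": 11.5,
    "unidentified fungi": 7.1,
    "no database affiliation": 7.1,
}

# Nearest whole-number OTU count implied by each percentage.
implied_counts = {
    group: round(pct * total_otus / 100) for group, pct in reported_pct.items()
}

# Share of OTUs with >97% similarity to existing ITS sequences (47 reported).
matched_pct = round(100 * 47 / total_otus, 1)

for group, count in implied_counts.items():
    print(f"{group}: ~{count} OTUs")
print(f"implied total: {sum(implied_counts.values())}")
print(f"OTUs matching public ITS sequences: {matched_pct}%")
```

Under this assumption the implied counts add up to the full set of 113 OTUs, and 47 of 113 reproduces the reported 41.6%, which is consistent with the percentages describing OTUs.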
Insilico profiling of microRNAs in Korean ginseng (Panax ginseng Meyer)
Mathiyalagan, Ramya; Subramaniyam, Sathiyamoorthy; Natarajan, Sathishkumar; Kim, Yeon Ju; Sun, Myung Suk; Kim, Se Young; Kim, Yu-Jin; Yang, Deok Chun
2013-01-01
MicroRNAs (miRNAs) are a class of recently discovered non-coding small RNA molecules, on average approximately 21 nucleotides in length, which play numerous important roles in gene regulation in various organisms. The miRNA database (release 18) has 18,226 miRNAs, which have been deposited from different species. Although miRNAs have been identified and validated in many plant species, no studies have been reported on discovering miRNAs in Panax ginseng Meyer, which is a traditionally known medicinal plant in oriental medicine, also known as Korean ginseng. It has triterpene ginseng saponins called ginsenosides, which are responsible for its various pharmacological activities. Predicting conserved miRNAs by homology-based analysis with available expressed sequence tag (EST) sequences can be powerful if the species lacks whole genome sequence information. In this study, using an EST-based computational approach, 69 conserved miRNAs belonging to 44 miRNA families were identified in Korean ginseng. The digital gene expression patterns of predicted conserved miRNAs were analyzed by deep sequencing using small RNA sequences of flower buds, leaves, and lateral roots. We found that many of the identified miRNAs showed tissue-specific expression. Using the in silico method, 346 potential targets were identified for the predicted 69 conserved miRNAs by searching the ginseng EST database, and the predicted targets were mainly involved in secondary metabolic processes, responses to biotic and abiotic stress, and transcription regulator activities, as well as a variety of other metabolic processes. PMID:23717176
Chen, Zongxiang; Li, Fuli; Yang, Songnan; Dong, Yibo; Yuan, Qianhua; Wang, Feng; Li, Weimin; Jiang, Ying; Jia, Shirong; Pei, Xinwu
2013-01-01
MicroRNAs (miRNAs) are a class of non-coding RNAs involved in post-transcriptional control of gene expression, via degradation and/or translational inhibition. Six hundred sixty-one rice miRNAs are known that are important in plant development. However, flowering-related miRNAs have not been characterized in Oryza rufipogon Griff. This work was approved by the Guangdong wild rice protection supervision department. We analyzed flowering-related miRNAs in O. rufipogon using high-throughput sequencing (deep sequencing) to understand the changes that occurred during rice domestication, and to elucidate their functions in flowering. Three O. rufipogon sRNA libraries, two vegetative stage (CWR-V1 and CWR-V2) and one flowering stage (CWR-F2), were sequenced using Illumina deep sequencing. A total of 20,156,098, 21,531,511 and 20,995,942 high quality sRNA reads were obtained from CWR-V1, CWR-V2 and CWR-F2, respectively, of which 3,448,185, 4,265,048 and 2,833,527 reads matched known miRNAs. We identified 512 known rice miRNAs in 214 miRNA families and predicted 290 new miRNAs. Targeted functional annotation, GO and KEGG pathway analyses predicted that 187 miRNAs regulate expression of flowering-related genes. Differential expression analysis of flowering-related miRNAs showed that expression of 95 miRNAs varied significantly between the libraries; 66 are flowering-related miRNAs (among them oru-miR97, oru-miR117, oru-miR135 and oru-miR137), and 17 are early-flowering-related miRNAs (among them osa-miR160f, osa-miR164d, osa-miR167d, osa-miR169a, osa-miR172b and oru-miR4) induced during the floral transition. Real-time PCR revealed the same expression patterns as deep sequencing. Cleavage of miRNA targets was confirmed in vivo by 5'-RACE, and the targets were negatively regulated by miRNAs. This is the first investigation of flowering miRNAs in wild rice. The result indicates that variation in miRNAs occurred during rice domestication and lays a foundation for further study of phase change and flowering in O. rufipogon. Complicated regulatory networks mediated by multiple miRNAs regulate the expression of flowering genes that control the induction of flowering.
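The read counts given for the three O. rufipogon libraries allow the share of reads matching known miRNAs to be computed directly. This is a minimal sketch using only the numbers quoted in the abstract; the dictionary layout and names are illustrative.

```python
# Share of high-quality sRNA reads that matched known miRNAs, per library.
# Counts are those quoted in the abstract; the data layout is illustrative.
libraries = {
    # name: (high-quality reads, reads matching known miRNAs)
    "CWR-V1": (20_156_098, 3_448_185),
    "CWR-V2": (21_531_511, 4_265_048),
    "CWR-F2": (20_995_942, 2_833_527),
}

matched_pct = {
    name: 100 * matched / total for name, (total, matched) in libraries.items()
}

for name, pct in matched_pct.items():
    print(f"{name}: {pct:.1f}% of reads matched known miRNAs")
```

On these figures the flowering-stage library (CWR-F2) has the lowest share of reads matching known miRNAs of the three libraries.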
RISC RNA sequencing for context-specific identification of in vivo miR targets
Matkovich, Scot J; Van Booven, Derek J; Eschenbacher, William H; Dorn, Gerald W
2010-01-01
Rationale MicroRNAs (miRs) are expanding our understanding of cardiac disease and have the potential to transform cardiovascular therapeutics. One miR can target hundreds of individual mRNAs, but existing methodologies are not sufficient to accurately and comprehensively identify these mRNA targets in vivo. Objective To develop methods permitting identification of in vivo miR targets in an unbiased manner, using massively parallel sequencing of mouse cardiac transcriptomes in combination with sequencing of mRNA associated with mouse cardiac RNA-induced silencing complexes (RISCs). Methods and Results We optimized techniques for expression profiling small amounts of RNA without introducing amplification bias, and applied this to anti-Argonaute 2 immunoprecipitated RISCs (RISC-Seq) from mouse hearts. By comparing RNA-sequencing results of cardiac RISC and transcriptome from the same individual hearts, we defined 1,645 mRNAs consistently targeted to mouse cardiac RISCs. We employed this approach in hearts overexpressing miRs from Myh6 promoter-driven precursors (programmed RISC-Seq) to identify 209 in vivo targets of miR-133a and 81 in vivo targets of miR-499. Consistent with the fact that miR-133a and miR-499 have widely differing ‘seed’ sequences and belong to different miR families, only 6 targets were common to miR-133a- and miR-499-programmed hearts. Conclusions RISC-sequencing is a highly sensitive method for general RISC profiling and individual miR target identification in biological context, and is applicable to any tissue and any disease state. Summary MicroRNAs (miRs) are key regulators of mRNA translation in health and disease. While bioinformatic predictions suggest that a single miR may target hundreds of mRNAs, the number of experimentally verified targets of miRs is low. To enable comprehensive, unbiased examination of miR targets, we have performed deep RNA sequencing of cardiac transcriptomes in parallel with cardiac RNA-induced silencing complex (RISC)-associated RNAs (the RISCome), called RISC sequencing. We developed methods that did not require cross-linking of RNAs to RISCs or amplification of mRNA prior to sequencing, making it possible to rapidly perform RISC sequencing from intact tissue while avoiding amplification bias. Comparison of RISCome with transcriptome expression defined the degree of RISC enrichment for each mRNA. The majority of the mRNAs enriched in wild-type cardiac RISComes compared to transcriptomes were bioinformatically predicted to be targets of at least 1 of 139 cardiac-expressed miRs. Programming cardiomyocyte RISCs via transgenic overexpression in adult hearts of miR-133a or miR-499, two miRs that contain entirely different ‘seed’ sequences, elicited differing profiles of RISC-targeted mRNAs. Thus, RISC sequencing represents a highly sensitive method for general RISC profiling and individual miR target identification in biological context. PMID:21030712
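The idea of defining RISC enrichment by comparing RISCome and transcriptome expression for each mRNA can be illustrated with a toy calculation. The sketch below is not the authors' pipeline: the counts-per-million normalization, the pseudocount, the function names and the gene counts are all assumptions chosen for illustration.

```python
import math

# Toy illustration of per-mRNA RISC enrichment as a log2 ratio of normalized
# abundance. Normalization, pseudocount and counts are assumptions, not the
# published method.


def counts_per_million(counts):
    """Scale raw read counts so that each library sums to one million."""
    total = sum(counts.values())
    return {gene: c * 1e6 / total for gene, c in counts.items()}


def risc_enrichment(risc_counts, transcriptome_counts, pseudocount=1.0):
    """Return log2(RISCome CPM / transcriptome CPM) for each mRNA."""
    risc_cpm = counts_per_million(risc_counts)
    tx_cpm = counts_per_million(transcriptome_counts)
    return {
        gene: math.log2(
            (risc_cpm.get(gene, 0.0) + pseudocount) / (tx_cpm[gene] + pseudocount)
        )
        for gene in tx_cpm
    }


# Hypothetical read counts from the same heart sample.
transcriptome = {"geneA": 500, "geneB": 300, "geneC": 200}
riscome = {"geneA": 100, "geneB": 300, "geneC": 100}

enrichment = risc_enrichment(riscome, transcriptome)
for gene, value in sorted(enrichment.items(), key=lambda kv: -kv[1]):
    print(f"{gene}: log2 enrichment = {value:+.2f}")
```

In this toy data the hypothetical geneB is over-represented in the RISC fraction relative to its transcriptome level, geneA is under-represented, and geneC is neutral; a real analysis would also need replicates and a statistical threshold.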
2013-01-01
Background MicroRNAs (miRNAs) are an abundant class of endogenous small RNA molecules that downregulate gene expression at the posttranscriptional level. They play important roles in multiple biological processes by regulating genes that control developmental timing, growth, stem cell division and apoptosis by binding to the mRNA of target genes. Despite the position Atlantic salmon (Salmo salar) has as an economically important domesticated animal, there has been little research on miRNAs in this species. Knowledge about miRNAs and their target genes may be used to control health and to improve performance of economically important traits. However, before their biological function can be unravelled they must be identified and annotated. The aims of this study were to identify and characterize miRNA genes in Atlantic salmon by deep sequencing analysis of small RNA libraries from nine different tissues. Results A total of 180 distinct mature miRNAs belonging to 106 families of evolutionarily conserved miRNAs, and 13 distinct novel mature miRNAs were discovered and characterized. The mature miRNAs corresponded to 521 putative precursor sequences located at unique genome locations. About 40% of these precursors were part of gene clusters, and the majority of the Salmo salar gene clusters discovered were conserved across species. Comparison of expression levels in samples from different tissues applying DESeq indicated that there were tissue-specific expression differences in three conserved and one novel miRNA. Ssa-miR 736 was detected in heart tissue only, while two other clustered miRNAs (ssa-miR 212 and 132) seem to be at a higher expression level in brain tissue. These observations correlate well with their expected functions as regulators of signal pathways in cardiac and neuronal cells, respectively. Ssa-miR 8163 is one of the novel miRNAs discovered and its function remains unknown. 
However, differential expression analysis using DESeq suggests that this miRNA is enriched in liver tissue and the precursor was mapped to intron 7 of the transferrin gene. Conclusions The identification and annotation of evolutionarily conserved and novel Salmo salar miRNAs as well as the characterization of miRNA gene clusters provide biological knowledge that will greatly facilitate further functional studies on miRNAs in this species. PMID:23865519
USDA-ARS's Scientific Manuscript database
Heilongjiang province is one of the most important potato production areas in China. Frequent outbreaks of virus and viroid diseases in production fields have significantly decreased potato yield and quality. However, we still do not have a clear understanding on the composition and genetic diversit...
USDA-ARS's Scientific Manuscript database
Three commercial broiler breeds were fed from hatch with a diet supplemented with Capsicum and Curcuma longa oleoresins, and co-infected with Eimeria maxima and Clostridium perfringens to induce necrotic enteritis (NE). Pyrotag deep sequencing of bacterial 16S rRNA showed that gut microbiota compos...
Zhou, Yi; Yu, Fan; Gao, Yun; Luo, Yongju; Tang, Zhanyang; Guo, Zhongbao; Guo, Enyan; Gan, Xi; Zhang, Ming; Zhang, Yaping
2014-01-01
MicroRNAs (miRNAs) are endogenous non-coding small RNAs which play important roles in the regulation of gene expression by cleaving or inhibiting the translation of target gene transcripts. Among them, some specific miRNAs show regulatory activities in gonad development via translational control. In order to further understand the role of miRNA-mediated posttranscriptional regulation in Nile tilapia (Oreochromis niloticus) ovary and testis, two small RNA libraries of Nile tilapia were sequenced by Solexa small RNA deep sequencing methods. A total of 9,731,431 and 8,880,497 raw reads, representing 5,407,800 and 4,396,281 unique sequences, were obtained from the sexually mature ovaries and testes, respectively. After comparing the small RNA sequences with the Rfam database, 1,432,210 reads in ovaries and 984,146 reads in testes were matched to the genome sequence of Nile tilapia. Bioinformatic analysis identified 764 mature miRNAs, 209 miRNA-5p and 202 miRNA-3p in the two libraries, of which 525 known miRNAs are expressed in both the ovary and testis of Nile tilapia. Compared with the testis, the miR-727, miR-129 and miR-29 families were highly expressed in tilapia ovary. Additionally, the miR-132, miR-212, miR-33a and miR-135b families showed significantly higher expression in testis than in ovary. Furthermore, the expression patterns of the miRNAs were analyzed in different developmental stages of the gonad. The results showed that different expression patterns were observed during development of testis and ovary. In addition, the identification and characterization of differentially expressed miRNAs in the ovaries and testis of Nile tilapia provides important information on the role of miRNA in the regulation of ovarian and testicular development and function. These data will be helpful to facilitate studies on the regulation of miRNAs during teleost reproduction. PMID:24466258
Single-Cell Sequencing for Drug Discovery and Drug Development.
Wu, Hongjin; Wang, Charles; Wu, Shixiu
2017-01-01
Next-generation sequencing (NGS), particularly single-cell sequencing, has revolutionized the scale and scope of genomic and biomedical research. Recent technological advances in NGS and single-cell studies have made deep whole-genome (DNA-seq), whole-epigenome and whole-transcriptome sequencing (RNA-seq) at the single-cell level feasible. NGS at the single-cell level expands our view of genome, epigenome and transcriptome and allows the genome, epigenome and transcriptome of any organism to be explored without a priori assumptions and with unprecedented throughput. And it does so with single-nucleotide resolution. NGS is also a very powerful tool for drug discovery and drug development. In this review, we describe the current state of single-cell sequencing techniques, which can provide a new, more powerful and precise approach for analyzing effects of drugs on treated cells and tissues. Our review discusses single-cell whole genome/exome sequencing (scWGS/scWES), single-cell transcriptome sequencing (scRNA-seq), single-cell bisulfite sequencing (scBS), and multiple omics of single-cell sequencing. We also highlight the advantages and challenges of each of these approaches. Finally, we describe, elaborate and speculate on the potential applications of single-cell sequencing for drug discovery and drug development. Copyright © Bentham Science Publishers.
Yang, Huan; Zhang, Ying; Vallandingham, Jim; Li, Hau; Florens, Laurence; Mak, Ho Yi
2012-01-01
The molecular mechanisms for target mRNA degradation in Caenorhabditis elegans undergoing RNAi are not fully understood. Using a combination of genetic, proteomic, and biochemical approaches, we report a divergent RDE-10/RDE-11 complex that is required for RNAi in C. elegans. Genetic analysis indicates that the RDE-10/RDE-11 complex acts in parallel to nuclear RNAi. Association of the complex with target mRNA is dependent on RDE-1 but not RRF-1, suggesting that target mRNA recognition depends on primary but not secondary siRNA. Furthermore, RDE-11 is required for mRNA degradation subsequent to target engagement. Deep sequencing reveals a fivefold decrease in secondary siRNA abundance in rde-10 and rde-11 mutant animals, while primary siRNA and microRNA biogenesis is normal. Therefore, the RDE-10/RDE-11 complex is critical for amplifying the exogenous RNAi response. Our work uncovers an essential output of the RNAi pathway in C. elegans. PMID:22508728
Yang, Huan; Zhang, Ying; Vallandingham, Jim; Li, Hua; Li, Hau; Florens, Laurence; Mak, Ho Yi
2012-04-15
The molecular mechanisms for target mRNA degradation in Caenorhabditis elegans undergoing RNAi are not fully understood. Using a combination of genetic, proteomic, and biochemical approaches, we report a divergent RDE-10/RDE-11 complex that is required for RNAi in C. elegans. Genetic analysis indicates that the RDE-10/RDE-11 complex acts in parallel to nuclear RNAi. Association of the complex with target mRNA is dependent on RDE-1 but not RRF-1, suggesting that target mRNA recognition depends on primary but not secondary siRNA. Furthermore, RDE-11 is required for mRNA degradation subsequent to target engagement. Deep sequencing reveals a fivefold decrease in secondary siRNA abundance in rde-10 and rde-11 mutant animals, while primary siRNA and microRNA biogenesis is normal. Therefore, the RDE-10/RDE-11 complex is critical for amplifying the exogenous RNAi response. Our work uncovers an essential output of the RNAi pathway in C. elegans.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Shi, CY; Yang, H; Wei, CL
Tea is one of the most popular non-alcoholic beverages worldwide. However, the tea plant, Camellia sinensis, is difficult to culture in vitro, to transform, and has a large genome, rendering little genomic information available. Recent advances in large-scale RNA sequencing (RNA-seq) provide a fast, cost-effective, and reliable approach to generate large expression datasets for functional genomic analysis, which is especially suitable for non-model species with un-sequenced genomes. Using high-throughput Illumina RNA-seq, the transcriptome from poly (A)+ RNA of C. sinensis was analyzed at an unprecedented depth (2.59 gigabase pairs). Approximate 34.5 million reads were obtained, trimmed, and assembled into 127,094 unigenes, with an average length of 355 bp and an N50 of 506 bp, which consisted of 788 contig clusters and 126,306 singletons. This number of unigenes was 10-fold higher than existing C. sinensis sequences deposited in GenBank (as of August 2010). Sequence similarity analyses against six public databases (Uniprot, NR and COGs at NCBI, Pfam, InterPro and KEGG) found 55,088 unigenes that could be annotated with gene descriptions, conserved protein domains, or gene ontology terms. Some of the unigenes were assigned to putative metabolic pathways. Targeted searches using these annotations identified the majority of genes associated with several primary metabolic pathways and natural product pathways that are important to tea quality, such as flavonoid, theanine and caffeine biosynthesis pathways. Novel candidate genes of these secondary pathways were discovered. Comparisons with four previously prepared cDNA libraries revealed that this transcriptome dataset has both a high degree of consistency with previous EST data and an approximate 20 times increase in coverage. Thirteen unigenes related to theanine and flavonoid synthesis were validated. 
Their expression patterns in different organs of the tea plant were analyzed by RT-PCR and quantitative real time PCR (qRT-PCR). An extensive transcriptome dataset has been obtained from the deep sequencing of tea plant. The coverage of the transcriptome is comprehensive enough to discover all known genes of several major metabolic pathways. This transcriptome dataset can serve as an important public information platform for gene expression, genomics, and functional genomic studies in C. sinensis.
2011-01-01
Background Tea is one of the most popular non-alcoholic beverages worldwide. However, the tea plant, Camellia sinensis, is difficult to culture in vitro, to transform, and has a large genome, rendering little genomic information available. Recent advances in large-scale RNA sequencing (RNA-seq) provide a fast, cost-effective, and reliable approach to generate large expression datasets for functional genomic analysis, which is especially suitable for non-model species with un-sequenced genomes. Results Using high-throughput Illumina RNA-seq, the transcriptome from poly (A)+ RNA of C. sinensis was analyzed at an unprecedented depth (2.59 gigabase pairs). Approximate 34.5 million reads were obtained, trimmed, and assembled into 127,094 unigenes, with an average length of 355 bp and an N50 of 506 bp, which consisted of 788 contig clusters and 126,306 singletons. This number of unigenes was 10-fold higher than existing C. sinensis sequences deposited in GenBank (as of August 2010). Sequence similarity analyses against six public databases (Uniprot, NR and COGs at NCBI, Pfam, InterPro and KEGG) found 55,088 unigenes that could be annotated with gene descriptions, conserved protein domains, or gene ontology terms. Some of the unigenes were assigned to putative metabolic pathways. Targeted searches using these annotations identified the majority of genes associated with several primary metabolic pathways and natural product pathways that are important to tea quality, such as flavonoid, theanine and caffeine biosynthesis pathways. Novel candidate genes of these secondary pathways were discovered. Comparisons with four previously prepared cDNA libraries revealed that this transcriptome dataset has both a high degree of consistency with previous EST data and an approximate 20 times increase in coverage. Thirteen unigenes related to theanine and flavonoid synthesis were validated. 
Their expression patterns in different organs of the tea plant were analyzed by RT-PCR and quantitative real time PCR (qRT-PCR). Conclusions An extensive transcriptome dataset has been obtained from the deep sequencing of tea plant. The coverage of the transcriptome is comprehensive enough to discover all known genes of several major metabolic pathways. This transcriptome dataset can serve as an important public information platform for gene expression, genomics, and functional genomic studies in C. sinensis. PMID:21356090
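The assembly statistics quoted above (an average unigene length of 355 bp against an N50 of 506 bp) can be clarified with a short sketch of how N50 is commonly computed. The definition used here is the usual one (the length L such that sequences of length at least L contain at least half of all assembled bases); the function name and the toy lengths are illustrative assumptions, not data from the study.

```python
def n50(lengths):
    """Return the N50 of a collection of sequence lengths.

    Sequences are sorted from longest to shortest and their lengths are
    accumulated; the N50 is the length at which the running total first
    reaches half of the total assembled bases.
    """
    if not lengths:
        raise ValueError("lengths must be non-empty")
    ordered = sorted(lengths, reverse=True)
    half = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half:
            return length


# Toy unigene lengths in bp (hypothetical values, for illustration only).
toy_lengths = [100, 200, 300, 400, 500]
toy_mean = sum(toy_lengths) / len(toy_lengths)  # 300.0
toy_n50 = n50(toy_lengths)                      # 400
```

Because N50 is weighted by bases rather than by sequence count, it typically exceeds the mean length when an assembly contains many short singletons, which is consistent with the gap between 355 bp and 506 bp reported in the abstract.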
Long, Rui-Cai; Li, Ming-Na; Kang, Jun-Mei; Zhang, Tie-Jun; Sun, Yan; Yang, Qing-Chuan
2015-05-01
Small 21- to 24-nucleotide (nt) ribonucleic acids (RNAs), notably microRNAs (miRNAs), are emerging as a posttranscriptional regulation mechanism. Salt stress is one of the primary abiotic stresses that cause crop losses worldwide. In saline lands, root growth and function of plants are determined by the action of environmental salt stress through specific genes that adapt root development to the restrictive condition. To elucidate the role of miRNAs in salt stress regulation in Medicago, we used a high-throughput sequencing approach to analyze four small RNA libraries from roots of Zhongmu-1 (Medicago sativa) and Jemalong A17 (Medicago truncatula), which were treated with 300 mM NaCl for 0 and 8 h. Each library generated about 20 million short sequences and contained predominantly small RNAs of 24-nt length, followed by 21-nt and 22-nt small RNAs. Using sequence analysis, we identified 385 conserved miRNAs from 96 families, along with 68 novel candidate miRNAs. Of the 68 predicted novel miRNAs, 15 were identified to have miRNA*. Statistical analysis of the abundance of sequencing reads revealed specific miRNAs showing contrasting expression patterns between M. sativa and M. truncatula roots, as well as between roots treated for 0 and 8 h. The expression of 10 conserved and novel miRNAs was also quantified by quantitative real-time reverse transcription polymerase chain reaction (qRT-PCR). The miRNA precursor and target genes were predicted by bioinformatics analysis. We concluded that the salt stress related conserved and novel miRNAs may have a large variety of target mRNAs, some of which might play key roles in salt stress regulation of Medicago. © 2014 Scandinavian Plant Physiology Society.
Effects of hydrostatic pressure on yeasts isolated from deep-sea hydrothermal vents.
Burgaud, Gaëtan; Hué, Nguyen Thi Minh; Arzur, Danielle; Coton, Monika; Perrier-Cornet, Jean-Marie; Jebbar, Mohamed; Barbier, Georges
2015-11-01
Hydrostatic pressure plays a significant role in the distribution of life in the biosphere. Knowledge of deep-sea piezotolerant and (hyper)piezophilic bacteria and archaea diversity has been well documented, along with their specific adaptations to cope with high hydrostatic pressure (HHP). Recent investigations of deep-sea microbial community compositions have shown unexpected micro-eukaryotic communities, mainly dominated by fungi. Molecular methods such as next-generation sequencing have been used for SSU rRNA gene sequencing to reveal fungal taxa. Currently, a difficult but fascinating challenge for marine mycologists is to create deep-sea marine fungus culture collections and assess their ability to cope with pressure. Indeed, although there is no universal genetic marker for piezoresistance, physiological analyses provide concrete relevant data for estimating their adaptations and understanding the role of fungal communities in the abyss. The present study investigated morphological and physiological responses of fungi to HHP using a collection of deep-sea yeasts as a model. The aim was to determine whether deep-sea yeasts were able to tolerate different HHP and if they were metabolically active. Here we report an unexpected taxonomic-based dichotomic response to pressure with piezosensitive ascomycetes and piezotolerant basidiomycetes, and distinct morphological switches triggered by pressure for certain strains. Copyright © 2015 Institut Pasteur. Published by Elsevier Masson SAS. All rights reserved.
NASA Astrophysics Data System (ADS)
Wu, Yue-Hong; Liao, Li; Wang, Chun-Sheng; Ma, Wei-Lin; Meng, Fan-Xu; Wu, Min; Xu, Xue-Wei
2013-09-01
Deep-sea polymetallic nodules, rich in metals such as Fe, Mn, and Ni, are potential resources for future exploitation. Early culturing and microscopy studies suggest that polymetallic nodules are at least partially biogenic. To understand the microbial communities in this environment, we compared microbial community composition and diversity inside nodules and in the surrounding sediments. Three sampling sites in the Pacific Ocean containing polymetallic nodules were used for culture-independent investigations of microbial diversity. A total of 1013 near full-length bacterial 16S rRNA gene sequences and 640 archaeal 16S rRNA gene sequences with ~650 bp from nodules and the surrounding sediments were analyzed. Bacteria showed higher diversity than archaea. Interestingly, sediments contained more diverse bacterial communities than nodules, while the opposite was detected for archaea. Bacterial communities tend to be mostly unique to sediments or nodules, with only 13.3% of sequences shared. The most abundant bacterial groups detected only in nodules were Pseudoalteromonas and Alteromonas, which were predicted to play a role in building matrix outside cells to induce or control mineralization. However, archaeal communities were mostly shared between sediments and nodules, including the most abundant OTU containing 290 sequences from marine group I Thaumarchaeota. PcoA analysis indicated that microhabitat (i.e., nodule or sediment) seemed to be a major factor influencing microbial community composition, rather than sampling locations or distances between locations.
Li, Ruixue; Chen, Dandan; Wang, Taichu; Wan, Yizhen; Li, Rongfang; Fang, Rongjun; Wang, Yuting; Hu, Fei; Zhou, Hong; Li, Long; Zhao, Weiguo
2017-01-01
MicroRNAs (miRNAs) play important regulatory roles by targeting mRNAs for cleavage or translational repression. Identification of miRNA targets is essential to better understanding the roles of miRNAs. miRNA targets have not been well characterized in mulberry (Morus alba). To anatomize miRNA guided gene regulation under drought stress, transcriptome-wide high throughput degradome sequencing was used in this study to directly detect drought stress responsive miRNA targets in mulberry. A drought library (DL) and a contrast library (CL) were constructed to capture the cleaved mRNAs for sequencing. In CL, 409 target genes of 30 conserved miRNA families and 990 target genes of 199 novel miRNAs were identified. In DL, 373 target genes of 30 conserved miRNA families and 950 target genes of 195 novel miRNAs were identified. Of the conserved miRNA families in DL, mno-miR156, mno-miR172, and mno-miR396 had the highest number of targets with 54, 52 and 41 transcripts, respectively, indicating that these three miRNA families and their target genes might play important functions in response to drought stress in mulberry. Additionally, we found that many of the target genes were transcription factors. By analyzing the miRNA-target molecular network, we found that the DL independent networks consisted of 838 miRNA-mRNA pairs (63.34%). The expression patterns of 11 target genes and 12 correspondent miRNAs were detected using qRT-PCR. Six miRNA targets were further verified by RNA ligase-mediated 5' rapid amplification of cDNA ends (RLM-5' RACE). Gene Ontology (GO) annotations and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway analysis revealed that these target transcripts were implicated in a broad range of biological processes and various metabolic pathways. This is the first study to comprehensively characterize target genes and their associated miRNAs in response to drought stress by degradome sequencing in mulberry. 
This study provides a framework for understanding the molecular mechanisms of drought resistance in mulberry.
NASA Astrophysics Data System (ADS)
Wang, Y.; Xia, Y.; Dong, H.; Dong, X.; Yang, K.; Dong, Z.; Huang, L.
2005-12-01
Microbial communities in the deep drill cores from the Chinese Continent Scientific Drilling were analyzed with culture-independent and dependent techniques. Genomic DNA was extracted from two metamorphic rocks: S1 from 430 and S13 from 1033 meters below the ground surface. The 16S rRNA gene was amplified by polymerase chain reaction (PCR) followed by cloning and sequencing. The total cell number was counted using 4',6-diamidino-2-phenylindole (DAPI) staining and the biomass of two specific bacteria was quantified using real-time PCR. Enrichment was set up for a rock from 3911 meters below the surface in medium for autotrophic methanogens (i.e., CO2 + H2). The total cell number in S13 was 1.0 × 10^4 cells per gram of rock. 16S rRNA gene analysis indicated that low G + C Gram positive sequences were dominant (50 percent of all 54 clones sequenced) followed by the alpha-, beta-, and gamma-Proteobacteria. Within the low G + C Gram positive bacteria, most clone sequences were similar to species of Bacillus from various natural environments (deserts, rivers etc.). Within the Proteobacteria, our clone sequences were similar to species of Acinetobacter, Acidovorax, and Aeromonas. The real-time PCR results showed that the biomass of two particular clone sequences (CCSD1305, similar to Aeromonas caviae, and CCSD1307, similar to Acidovorax facilis) was 95 and 1258 cells/g, respectively. A bacterial isolate was obtained from the 3911-m rock in methanogenic medium. It was Gram negative with no flagella, immobile, and facultatively anaerobic, and grew optimally at 65 °C. Phylogenetic analysis indicated that it was closely related to the genus Bacillus. Physiological tests further revealed that it was a strain of Bacillus caldotenax.
Härtl, Katja; Kalinowski, Gregor; Hoffmann, Thomas; Preuss, Anja; Schwab, Wilfried
2017-05-01
RNA interference (RNAi) has been exploited as a reverse genetic tool for functional genomics in the nonmodel species strawberry (Fragaria × ananassa) since 2006. Here, we analysed for the first time different but overlapping nucleotide sections (>200 nt) of two endogenous genes, FaCHS (chalcone synthase) and FaOMT (O-methyltransferase), as inducer sequences and a transitive vector system to compare their gene silencing efficiencies. In total, ten vectors were assembled each containing the nucleotide sequence of one fragment in sense and corresponding antisense orientation separated by an intron (inverted hairpin construct, ihp). All sequence fragments along the full lengths of both target genes resulted in a significant down-regulation of the respective gene expression and related metabolite levels. Quantitative PCR data and successful application of a transitive vector system coinciding with a phenotypic change suggested propagation of the silencing signal. The spreading of the signal in strawberry fruit in the 3' direction was shown for the first time by the detection of secondary small interfering RNAs (siRNAs) outside of the primary targets by deep sequencing. Down-regulation of endogenes by the transitive method was less effective than silencing by ihp constructs probably because the numbers of primary siRNAs exceeded the quantity of secondary siRNAs by three orders of magnitude. Besides, we observed consistent hotspots of primary and secondary siRNA formation along the target sequence which fall within a distance of less than 200 nt. Thus, ihp vectors seem to be superior over the transitive vector system for functional genomics in strawberry fruit. © 2016 The Authors. Plant Biotechnology Journal published by Society for Experimental Biology and The Association of Applied Biologists and John Wiley & Sons Ltd.
Aromatic-degrading Sphingomonas isolates from the deep subsurface
DOE Office of Scientific and Technical Information (OSTI.GOV)
Fredrickson, J.K.; Romine, M.F.; Balkwill, D.L.
An obligately aerobic chemoheterotrophic bacterium (strain F199) previously isolated from Southeast Coastal Plain subsurface sediments and shown to degrade toluene, naphthalene, and other aromatic compounds was characterized by analysis of its 16S rRNA nucleotide base sequence and cellular lipid composition. Strain F199 contained 2-OH14:0 and 18:1ω7c as the predominant cellular fatty acids and sphingolipids that are characteristic of the genus Sphingomonas. Phylogenetic analysis of its 16S rRNA sequence indicated that F199 was most closely related to Sphingomonas capsulata among the bacteria currently in the Ribosomal Database. Five additional isolates from deep Southeast Coastal Plain sediments were determined by 16S rRNA sequence analysis to be closely related to F199. These strains also contained characteristic sphingolipids. Four of these five strains could also grow on a broad range of aromatic compounds and could mineralize [14C]toluene and [14C]naphthalene. S. capsulata (ATCC 14666), Sphingomonas paucimobilis (ATCC 29837), and one of the subsurface isolates were unable to grow on any of the aromatic compounds or mineralize toluene or naphthalene. These results indicate that bacteria within the genus Sphingomonas are present in Southeast Coastal Plain subsurface sediments and that the capacity for degrading a broad range of substituted aromatic compounds appears to be common among Sphingomonas species from this environment. 41 refs., 2 figs., 5 tabs.
iSS-PC: Identifying Splicing Sites via Physical-Chemical Properties Using Deep Sparse Auto-Encoder.
Xu, Zhao-Chun; Wang, Peng; Qiu, Wang-Ren; Xiao, Xuan
2017-08-15
Gene splicing is one of the most significant biological processes in eukaryotic gene expression, such as RNA splicing, which can cause a pre-mRNA to produce one or more mature messenger RNAs containing the coded information with multiple biological functions. Thus, identifying splicing sites in DNA/RNA sequences is significant for both bio-medical research and the discovery of new drugs. However, relying only on experimental techniques is expensive and time consuming, so new computational methods are needed. To identify the splice donor sites and splice acceptor sites accurately and quickly, a deep sparse auto-encoder model with two hidden layers, called iSS-PC, was constructed based on minimum error law, in which we incorporated twelve physical-chemical properties of the dinucleotides within DNA into PseDNC to formulate given sequence samples via a battery of cross-covariance and auto-covariance transformations. In this paper, five-fold cross-validation test results based on the same benchmark data-sets indicated that the new predictor remarkably outperformed the existing prediction methods in this field. Furthermore, it is expected that many other related problems can also be studied by this approach. To implement classification accurately and quickly, an easy-to-use web-server for identifying splicing sites has been established for free access at: http://www.jci-bioinfo.cn/iSS-PC.
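The auto-covariance transformation mentioned above can be sketched for a single dinucleotide property. The sketch uses the common textbook form of the lag-based auto-covariance (mean-centred products of property values at dinucleotide positions i and i + lag, averaged over the available pairs). The property table is a made-up placeholder, not one of the twelve physical-chemical properties used by iSS-PC, and the function name is hypothetical.

```python
from itertools import product

# All 16 dinucleotides in a fixed order: AA, AC, AG, AT, CA, ...
DINUCLEOTIDES = ["".join(pair) for pair in product("ACGT", repeat=2)]

# Placeholder property values in [0, 1]; NOT real physical-chemical data.
TOY_PROPERTY = {d: i / 15 for i, d in enumerate(DINUCLEOTIDES)}


def auto_covariance(sequence, prop, lag):
    """Auto-covariance of one dinucleotide property at a given lag.

    The sequence is read as overlapping dinucleotides, each mapped to its
    property value; the result is the mean of (v[i] - mean) * (v[i + lag] - mean).
    """
    values = [prop[sequence[i:i + 2]] for i in range(len(sequence) - 1)]
    n = len(values)
    if lag < 1 or lag >= n:
        raise ValueError("lag must satisfy 1 <= lag < number of dinucleotides")
    mean = sum(values) / n
    total = sum(
        (values[i] - mean) * (values[i + lag] - mean) for i in range(n - lag)
    )
    return total / (n - lag)


# An alternating sequence is anti-correlated at lag 1 and correlated at lag 2.
ac_lag1 = auto_covariance("ACACACAC", TOY_PROPERTY, 1)
ac_lag2 = auto_covariance("ACACACAC", TOY_PROPERTY, 2)
```

Computing this value for each property and each lag up to some maximum gives a fixed-length feature vector regardless of sequence length, which is what allows such features to be fed to a classifier like an auto-encoder; cross-covariance follows the same pattern with two different properties.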
Krishna, Srikar; Nair, Aparna; Cheedipudi, Sirisha; Poduval, Deepak; Dhawan, Jyotsna; Palakodeti, Dasaradhi; Ghanekar, Yashoda
2013-01-07
Small non-coding RNAs such as miRNAs, piRNAs and endo-siRNAs fine-tune gene expression through post-transcriptional regulation, modulating important processes in development, differentiation, homeostasis and regeneration. Using deep sequencing, we have profiled small non-coding RNAs in Hydra magnipapillata and investigated changes in small RNA expression pattern during head regeneration. Our results reveal a unique repertoire of small RNAs in hydra. We have identified 126 miRNA loci; 123 of these miRNAs are unique to hydra. Less than 50% are conserved across two different strains of Hydra vulgaris tested in this study, indicating a highly diverse nature of hydra miRNAs in contrast to bilaterian miRNAs. We also identified siRNAs derived from precursors with perfect stem-loop structure and that arise from inverted repeats. piRNAs were the most abundant small RNAs in hydra, mapping to transposable elements, the annotated transcriptome and unique non-coding regions on the genome. piRNAs that map to transposable elements and the annotated transcriptome display a ping-pong signature. Further, we have identified several miRNAs and piRNAs whose expression is regulated during hydra head regeneration. Our study defines different classes of small RNAs in this cnidarian model system, which may play a role in orchestrating gene expression essential for hydra regeneration.
Krishna, Srikar; Nair, Aparna; Cheedipudi, Sirisha; Poduval, Deepak; Dhawan, Jyotsna; Palakodeti, Dasaradhi; Ghanekar, Yashoda
2013-01-01
Small non-coding RNAs such as miRNAs, piRNAs and endo-siRNAs fine-tune gene expression through post-transcriptional regulation, modulating important processes in development, differentiation, homeostasis and regeneration. Using deep sequencing, we have profiled small non-coding RNAs in Hydra magnipapillata and investigated changes in small RNA expression pattern during head regeneration. Our results reveal a unique repertoire of small RNAs in hydra. We have identified 126 miRNA loci; 123 of these miRNAs are unique to hydra. Less than 50% are conserved across two different strains of Hydra vulgaris tested in this study, indicating a highly diverse nature of hydra miRNAs in contrast to bilaterian miRNAs. We also identified siRNAs derived from precursors with perfect stem–loop structure and that arise from inverted repeats. piRNAs were the most abundant small RNAs in hydra, mapping to transposable elements, the annotated transcriptome and unique non-coding regions on the genome. piRNAs that map to transposable elements and the annotated transcriptome display a ping–pong signature. Further, we have identified several miRNAs and piRNAs whose expression is regulated during hydra head regeneration. Our study defines different classes of small RNAs in this cnidarian model system, which may play a role in orchestrating gene expression essential for hydra regeneration. PMID:23166307
Baker, Brett J; Lesniewski, Ryan A; Dick, Gregory J
2012-12-01
Ammonia-oxidizing Archaea (AOA) are among the most abundant microorganisms in the oceans and have crucial roles in biogeochemical cycling of nitrogen and carbon. To better understand AOA inhabiting the deep sea, we obtained community genomic and transcriptomic data from ammonium-rich hydrothermal plumes in the Guaymas Basin (GB) and from surrounding deep waters of the Gulf of California. Among the most abundant and active lineages in the sequence data were marine group I (MGI) Archaea related to the cultured autotrophic ammonia-oxidizer, Nitrosopumilus maritimus. Assembly of MGI genomic fragments yielded 2.9 Mb of sequence containing seven 16S rRNA genes (95.4-98.4% similar to N. maritimus), including two near-complete genomes and several lower-abundance variants. Equal copy numbers of MGI 16S rRNA genes and ammonia monooxygenase genes and transcription of ammonia oxidation genes indicates that all of these genotypes actively oxidize ammonia. De novo genomic assembly revealed the functional potential of MGI populations and enhanced interpretation of metatranscriptomic data. Physiological distinction from N. maritimus is evident in the transcription of novel genes, including genes for urea utilization, suggesting an alternative source of ammonia. We were also able to determine which genotypes are most active in the plume. Transcripts involved in nitrification were more prominent in the plume and were among the most abundant transcripts in the community. These unique data sets reveal populations of deep-sea AOA thriving in the ammonium-rich GB that are related to surface types, but with key genomic and physiological differences.
Gao, Chao; Wang, Pengfei; Zhao, Shuzhen; Zhao, Chuanzhi; Xia, Han; Hou, Lei; Ju, Zheng; Zhang, Ye; Li, Changsheng; Wang, Xingjun
2017-03-02
Peanut is a typical geocarpic plant whose embryogenesis and pod development are complex processes involving many gene regulatory pathways and controlled by appropriate hormone levels. MicroRNAs (miRNAs) are small non-coding RNAs that play indispensable roles in post-transcriptional gene regulation. Recently, identification and characterization of peanut miRNAs have been described. However, whether miRNAs participate in the regulation of peanut embryogenesis and pod development has yet to be explored. In this study, small RNA and degradome libraries from early peanut pods at different developmental stages were constructed and sequenced. A total of 70 known and 24 novel miRNA families were discovered. Among them, 16 miRNA families were legume-specific and 12 families were peanut-specific. 30 known and 10 novel miRNA families were differentially expressed during pod development. In addition, 115 target genes were identified for 47 miRNA families by degradome sequencing. Several new targets that might be specific to peanut were found and further validated by RNA ligase-mediated rapid amplification of 5' cDNA ends (RLM 5'-RACE). Furthermore, we performed profiling analysis of intact and total transcripts of several target genes, demonstrating that SPL (miR156/157), NAC (miR164), PPRP (miR167 and miR1088), AP2 (miR172) and GRF (miR396) are actively modulated during early pod development. Large numbers of miRNAs and their related target genes were identified through deep sequencing. These findings provide new information on miRNA-mediated regulatory pathways in peanut pod, which will contribute to a comprehensive understanding of the molecular mechanisms that govern peanut embryo and early pod development.
Global impact of RNA splicing on transcriptome remodeling in the heart.
Gao, Chen; Wang, Yibin
2012-08-01
In the eukaryotic transcriptome, both the numbers of genes and different RNA species produced by each gene contribute to the overall complexity. These RNA species are generated by the utilization of different transcriptional initiation or termination sites, or more commonly, from different messenger RNA (mRNA) splicing events. Among the 30,000+ genes in human genome, it is estimated that more than 95% of them can generate more than one gene product via alternative RNA splicing. The protein products generated from different RNA splicing variants can have different intracellular localization, activity, or tissue-distribution. Therefore, alternative RNA splicing is an important molecular process that contributes to the overall complexity of the genome and the functional specificity and diversity among different cell types. In this review, we will discuss current efforts to unravel the full complexity of the cardiac transcriptome using a deep-sequencing approach, and highlight the potential of this technology to uncover the global impact of RNA splicing on the transcriptome during development and diseases of the heart.
Discovery and small RNA profile of Pecan mosaic-associated virus, a novel potyvirus of pecan trees.
Su, Xiu; Fu, Shuai; Qian, Yajuan; Zhang, Liqin; Xu, Yi; Zhou, Xueping
2016-05-26
A novel potyvirus was discovered in pecan (Carya illinoensis) showing leaf mosaic symptoms through the use of deep sequencing of small RNAs. The complete genome of this virus was determined to comprise 9,310 nucleotides (nt), and shared 24.0% to 58.9% nucleotide similarity with those of other Potyviridae viruses. The genome was deduced to encode a single open reading frame (polyprotein) on the plus strand. Phylogenetic analysis based on the whole genome sequence and coat protein amino acid sequence showed that this virus is most closely related to Lettuce mosaic virus. Using electron microscopy, the typical Potyvirus filamentous particles were identified in infected pecan leaves with mosaic symptoms. Our results clearly show that this virus is a new member of the genus Potyvirus in the family Potyviridae. The virus is tentatively named Pecan mosaic-associated virus (PMaV). Additionally, profiling of the PMaV-derived small RNA (PMaV-sRNA) showed that the most abundant PMaV-sRNAs were 21-nt in length. There are several hotspots for small RNA production along the PMaV genome; two 21-nt PMaV-sRNAs starting at 811 nt and 610 nt of the minus-strand genome were highly repeated.
Discovery and small RNA profile of Pecan mosaic-associated virus, a novel potyvirus of pecan trees
Su, Xiu; Fu, Shuai; Qian, Yajuan; Zhang, Liqin; Xu, Yi; Zhou, Xueping
2016-01-01
A novel potyvirus was discovered in pecan (Carya illinoensis) showing leaf mosaic symptoms through the use of deep sequencing of small RNAs. The complete genome of this virus was determined to comprise 9,310 nucleotides (nt), and shared 24.0% to 58.9% nucleotide similarity with those of other Potyviridae viruses. The genome was deduced to encode a single open reading frame (polyprotein) on the plus strand. Phylogenetic analysis based on the whole genome sequence and coat protein amino acid sequence showed that this virus is most closely related to Lettuce mosaic virus. Using electron microscopy, the typical Potyvirus filamentous particles were identified in infected pecan leaves with mosaic symptoms. Our results clearly show that this virus is a new member of the genus Potyvirus in the family Potyviridae. The virus is tentatively named Pecan mosaic-associated virus (PMaV). Additionally, profiling of the PMaV-derived small RNA (PMaV-sRNA) showed that the most abundant PMaV-sRNAs were 21-nt in length. There are several hotspots for small RNA production along the PMaV genome; two 21-nt PMaV-sRNAs starting at 811 nt and 610 nt of the minus-strand genome were highly repeated. PMID:27226228
Wang, Zhengjia; Huang, Ruiming; Sun, Zhichao; Zhang, Tong; Huang, Jianqin
2017-05-01
MicroRNAs (miRNAs) are important regulators of plant development and fruit formation. Mature embryos of hickory (Carya cathayensis Sarg.) nuts contain more than 70% oil (comprising 90% unsaturated fatty acids), along with a substantial amount of oleic acid. To understand the roles of miRNAs involved in oil and oleic acid production during hickory embryogenesis, three small RNA libraries from different stages of embryogenesis were constructed. Deep sequencing of these three libraries identified 95 conserved miRNAs with 19 miRNA*s, 7 novel miRNAs (as well as their corresponding miRNA*s), and 26 potentially novel miRNAs. The analysis identified 15 miRNAs involved in oil and oleic acid production that are differentially expressed during embryogenesis in hickory. Among them, nine miRNA sequences, including eight conserved and one novel, were confirmed by qRT-PCR. In addition, 145 target genes of the novel miRNAs were predicted using a bioinformatic approach. Our results provide a framework for better understanding the roles of miRNAs during embryogenesis in hickory.
Conservation of small RNA pathways in platypus
Murchison, Elizabeth P.; Kheradpour, Pouya; Sachidanandam, Ravi; Smith, Carly; Hodges, Emily; Xuan, Zhenyu; Kellis, Manolis; Grützner, Frank; Stark, Alexander; Hannon, Gregory J.
2008-01-01
Small RNA pathways play evolutionarily conserved roles in gene regulation and defense from parasitic nucleic acids. The character and expression patterns of small RNAs show conservation throughout animal lineages, but specific animal clades also show variations on these recurring themes, including species-specific small RNAs. The monotremes, with only platypus and four species of echidna as extant members, represent the basal branch of the mammalian lineage. Here, we examine the small RNA pathways of monotremes by deep sequencing of six platypus and echidna tissues. We find that highly conserved microRNA species display their signature tissue-specific expression patterns. In addition, we find a large rapidly evolving cluster of microRNAs on platypus chromosome X1, which is unique to monotremes. Platypus and echidna testes contain a robust Piwi-interacting (piRNA) system, which appears to be participating in ongoing transposon defense. PMID:18463306
Conservation of small RNA pathways in platypus.
Murchison, Elizabeth P; Kheradpour, Pouya; Sachidanandam, Ravi; Smith, Carly; Hodges, Emily; Xuan, Zhenyu; Kellis, Manolis; Grützner, Frank; Stark, Alexander; Hannon, Gregory J
2008-06-01
Small RNA pathways play evolutionarily conserved roles in gene regulation and defense from parasitic nucleic acids. The character and expression patterns of small RNAs show conservation throughout animal lineages, but specific animal clades also show variations on these recurring themes, including species-specific small RNAs. The monotremes, with only platypus and four species of echidna as extant members, represent the basal branch of the mammalian lineage. Here, we examine the small RNA pathways of monotremes by deep sequencing of six platypus and echidna tissues. We find that highly conserved microRNA species display their signature tissue-specific expression patterns. In addition, we find a large rapidly evolving cluster of microRNAs on platypus chromosome X1, which is unique to monotremes. Platypus and echidna testes contain a robust Piwi-interacting (piRNA) system, which appears to be participating in ongoing transposon defense.
Parkes, R John; Sellek, Gerard; Webster, Gordon; Martin, Derek; Anders, Erik; Weightman, Andrew J; Sass, Henrik
2009-01-01
Deep subseafloor sediments may contain depressurization-sensitive, anaerobic, piezophilic prokaryotes. To test this we developed the DeepIsoBUG system, which when coupled with the HYACINTH pressure-retaining drilling and core storage system and the PRESS core cutting and processing system, enables deep sediments to be handled without depressurization (up to 25 MPa) and anaerobic prokaryotic enrichments and isolation to be conducted up to 100 MPa. Here, we describe the system and its first use with subsurface gas hydrate sediments from the Indian Continental Shelf, Cascadia Margin and Gulf of Mexico. Generally, highest cell concentrations in enrichments occurred close to in situ pressures (14 MPa) in a variety of media, although growth continued up to at least 80 MPa. Predominant sequences in enrichments were Carnobacterium, Clostridium, Marinilactibacillus and Pseudomonas, plus Acetobacterium and Bacteroidetes in Indian samples, largely independent of media and pressures. Related 16S rRNA gene sequences for all of these Bacteria have been detected in deep, subsurface environments, although isolated strains were piezotolerant, being able to grow at atmospheric pressure. Only the Clostridium and Acetobacterium were obligate anaerobes. No Archaea were enriched. It may be that these sediment samples were not deep enough (total depth 1126–1527 m) to obtain obligate piezophiles. PMID:19694787
Some New Windows into Terrestrial Deep Subsurface Microbial Ecosystems
NASA Astrophysics Data System (ADS)
Moser, D. P.
2011-12-01
Over the past several years, our group has surveyed the microbial ecology and biogeochemistry of a range of fracture rock subsurface ecosystems via deep mine boreholes in South Africa, the United States, and Canada; and boreholes from surface or deeply-sourced natural springs of the U.S. Great Basin. Collectively, these mostly unexplored habitats represent a wide range of geologic provinces, host rock types, aquatic chemistries, and the vast potential for biogeographic isolation. Thus, patterns of microbial diversity are of interest from the perspective of filling a fundamental knowledge gap; and while not necessarily expected, the detection of closely related microorganisms from geographically isolated settings would be noteworthy. Across these sample sets, microbial communities were invariably very low in biomass (e.g. 10e3 - 10e4 cells per mL) and dominated by deeply-branching bacterial lineages, particularly from the phyla Firmicutes and Nitrospira. In several cases, the Firmicutes have shown very close phylogenetic affiliations to lineages detected at divergent locations. For example, one abundant lineage from a new artesian well drilled into the Furnace Creek Fault of Death Valley, CA bears a very close phylogenetic relatedness to environmental DNA sequences (SSU rRNA gene) detected in one of the world's deepest mines (Tau Tona of South Africa) and what was North America's deepest gold mine (Homestake of South Dakota). Several radioactive wells from the Nevada National Security Site have produced rRNA gene sequences very close (e.g. greater than 99% identity) to that of Desulforudis audaxviator, a rarely detected microorganism thought to subsist as a single species ecosystem on the products of radiochemical reactions in deep crustal rocks from the South African Witwatersrand Basin. 
These sequences, along with more distantly related sequences from the marine subsurface (ridge flank basalt and mud volcanoes) and groundwater in Europe, hint at a role in certain hydrogen-rich subsurface settings for this group. Likewise, patterns of archaeal diversity across many of our Great Basin sites suggest shared deep lineages, particularly with the phylum, Thaumarchaeota. Here we will explore the possible significance of these patterns of diversity and discuss future research plans involving high throughput molecular techniques.
A landscape of circular RNA expression in the human heart.
Tan, Wilson L W; Lim, Benson T S; Anene-Nzelu, Chukwuemeka G O; Ackers-Johnson, Matthew; Dashi, Albert; See, Kelvin; Tiang, Zenia; Lee, Dominic Paul; Chua, Wee Woon; Luu, Tuan D A; Li, Peter Y Q; Richards, Arthur Mark; Foo, Roger S Y
2017-03-01
Circular RNA (circRNA) is a newly validated class of single-stranded RNA, ubiquitously expressed in mammalian tissues and possessing key functions including acting as microRNA sponges and as transcriptional regulators by binding to RNA-binding proteins. While independent studies confirm the expression of circRNA in various tissue types, genome-wide circRNA expression in the heart has yet to be described in detail. We performed deep RNA-sequencing on ribosomal-depleted RNA isolated from 12 human hearts, 25 mouse hearts and across a 28-day differentiation time-course of human embryonic stem cell-derived cardiomyocytes. Using purpose-designed bioinformatics tools, we uncovered a total of 15 318 and 3017 cardiac circRNA within human and mouse, respectively. Their abundance generally correlates with the abundance of their cognate linear RNA, but selected circRNAs exist at disproportionately higher abundance. Top highly expressed circRNA corresponded to key cardiac genes including Titin (TTN), RYR2, and DMD. The most abundant cardiac-expressed circRNA is a cytoplasmic localized single-exon circSLC8A1-1. The longest human transcript TTN alone generates up to 415 different exonic circRNA isoforms, the majority (83%) of which originate from the I-band domain. Finally, we confirmed the expression of selected cardiac circRNA by RT-PCR, Sanger sequencing and single molecule RNA-fluorescence in situ hybridization. Our data provide a detailed circRNA expression landscape in hearts. There is a high abundance of specific cardiac-expressed circRNA. These findings open up a new avenue for future investigation into this emerging class of RNA.
Genome-wide mapping of alternative splicing in Arabidopsis thaliana
Filichkin, Sergei A.; Priest, Henry D.; Givan, Scott A.; Shen, Rongkun; Bryant, Douglas W.; Fox, Samuel E.; Wong, Weng-Keen; Mockler, Todd C.
2010-01-01
Alternative splicing can enhance transcriptome plasticity and proteome diversity. In plants, alternative splicing can be manifested at different developmental stages, and is frequently associated with specific tissue types or environmental conditions such as abiotic stress. We mapped the Arabidopsis transcriptome at single-base resolution using the Illumina platform for ultrahigh-throughput RNA sequencing (RNA-seq). Deep transcriptome sequencing confirmed a majority of annotated introns and identified thousands of novel alternatively spliced mRNA isoforms. Our analysis suggests that at least ∼42% of intron-containing genes in Arabidopsis are alternatively spliced; this is significantly higher than previous estimates based on cDNA/expressed sequence tag sequencing. Random validation confirmed that novel splice isoforms empirically predicted by RNA-seq can be detected in vivo. Novel introns detected by RNA-seq were substantially enriched in nonconsensus terminal dinucleotide splice signals. Alternative isoforms with premature termination codons (PTCs) comprised the majority of alternatively spliced transcripts. Using an example of an essential circadian clock gene, we show that intron retention can generate relatively abundant PTC+ isoforms and that this specific event is highly conserved among diverse plant species. Alternatively spliced PTC+ isoforms can be potentially targeted for degradation by the nonsense mediated mRNA decay (NMD) surveillance machinery or regulate the level of functional transcripts by the mechanism of regulated unproductive splicing and translation (RUST). We demonstrate that the relative ratios of the PTC+ and reference isoforms for several key regulatory genes can be considerably shifted under abiotic stress treatments. Taken together, our results suggest that like in animals, NMD and RUST may be widespread in plants and may play important roles in regulating gene expression. PMID:19858364
DNMT1-interacting RNAs block gene specific DNA methylation
Di Ruscio, Annalisa; Ebralidze, Alexander K.; Benoukraf, Touati; Amabile, Giovanni; Goff, Loyal A.; Terragni, Joylon; Figueroa, Maria Eugenia; De Figureido Pontes, Lorena Lobo; Alberich-Jorda, Meritxell; Zhang, Pu; Wu, Mengchu; D’Alò, Francesco; Melnick, Ari; Leone, Giuseppe; Ebralidze, Konstantin K.; Pradhan, Sriharsa; Rinn, John L.; Tenen, Daniel G.
2013-01-01
DNA methylation was described almost a century ago. However, the rules governing its establishment and maintenance remain elusive. Here, we present data demonstrating that active transcription regulates levels of genomic methylation. We identified a novel RNA arising from the CEBPA gene locus critical in regulating the local DNA methylation profile. This RNA binds to DNMT1 and prevents CEBPA gene locus methylation. Deep sequencing of transcripts associated with DNMT1 combined with genome-scale methylation and expression profiling extended the generality of this finding to numerous gene loci. Collectively, these results delineate the nature of DNMT1-RNA interactions and suggest strategies for gene selective demethylation of therapeutic targets in disease. PMID:24107992
Zhang, Yanjie; Sun, Jin; Li, Xinzheng; Qiu, Jian-Wen
2016-01-01
We reported a nearly complete mitochondrial genome (mitogenome) from the glass sponge Lophophysema eversa, the second mitogenome in the order Amphidiscosida and the ninth in the class Hexactinellida. It is 20,651 base pairs in length and contains 39 genes including 13 protein-coding genes, 2 ribosomal RNA subunit genes and 24 tRNA genes. The gene content and order of L. eversa are identical to those of Tabachnickia sp., the other species with a sequenced mitogenome in Amphidiscosida, except with two additional tRNAs and three tRNA translocations. The cob gene has a +1 translational frameshift. These results will contribute to a better understanding of the phylogeny of glass sponges.
Vd’ačný, Peter; Bourland, William A.; Orsi, William; Epstein, Slava S.; Foissner, Wilhelm
2012-01-01
The class Litostomatea is a highly diverse ciliate taxon comprising hundreds of free-living and endocommensal species. However, their traditional morphology-based classification conflicts with 18S rRNA gene phylogenies indicating (1) a deep bifurcation of the Litostomatea into Rhynchostomatia and Haptoria + Trichostomatia, and (2) body polarization and simplification of the oral apparatus as main evolutionary trends in the Litostomatea. To test whether 18S rRNA molecules provide a suitable proxy for litostomatean evolutionary history, we used eighteen new ITS1-5.8S rRNA-ITS2 region sequences from various free-living litostomatean orders. These single- and multiple-locus analyses are in agreement with previous 18S rRNA gene phylogenies, supporting that both 18S rRNA gene and ITS region sequences are effective tools for resolving phylogenetic relationships among the litostomateans. Despite insertions, deletions and mutational saturations in the ITS region, the present study shows that ITS1 and ITS2 molecules can be used to infer phylogenetic relationships not only at species level but also at higher taxonomic ranks when their secondary structure information is utilized to aid alignment. PMID:22789763
Vd'ačný, Peter; Bourland, William A; Orsi, William; Epstein, Slava S; Foissner, Wilhelm
2012-11-01
The class Litostomatea is a highly diverse ciliate taxon comprising hundreds of free-living and endocommensal species. However, their traditional morphology-based classification conflicts with 18S rRNA gene phylogenies indicating (1) a deep bifurcation of the Litostomatea into Rhynchostomatia and Haptoria+Trichostomatia, and (2) body polarization and simplification of the oral apparatus as main evolutionary trends in the Litostomatea. To test whether 18S rRNA molecules provide a suitable proxy for litostomatean evolutionary history, we used eighteen new ITS1-5.8S rRNA-ITS2 region sequences from various free-living litostomatean orders. These single- and multiple-locus analyses are in agreement with previous 18S rRNA gene phylogenies, supporting that both 18S rRNA gene and ITS region sequences are effective tools for resolving phylogenetic relationships among the litostomateans. Despite insertions, deletions and mutational saturations in the ITS region, the present study shows that ITS1 and ITS2 molecules can be used to infer phylogenetic relationships not only at species level but also at higher taxonomic ranks when their secondary structure information is utilized to aid alignment.
Wu, Dong-Dong; Ye, Ling-Qun; Li, Yan; Sun, Yan-Bo; Shao, Yi; Chen, Chunyan; Zhu, Zhu; Zhong, Li; Wang, Lu; Irwin, David M; Zhang, Yong E; Zhang, Ya-Ping
2015-08-01
Next-generation RNA sequencing has been successfully used for identification of transcript assembly, evaluation of gene expression levels, and detection of post-transcriptional modifications. Despite these large-scale studies, additional comprehensive RNA-seq data from different subregions of the human brain are required to fully evaluate the evolutionary patterns experienced by the human brain transcriptome. Here, we provide a total of 6.5 billion RNA-seq reads from different subregions of the human brain. A significant correlation was observed between the levels of alternative splicing and RNA editing, which might be explained by a competition between the molecular machineries responsible for the splicing and editing of RNA. Young human protein-coding genes demonstrate biased expression to the neocortical and non-neocortical regions during evolution on the lineage leading to humans. We also found that a significantly greater number of young human protein-coding genes are expressed in the putamen, a tissue that was also observed to have the highest level of RNA-editing activity. The putamen, which previously received little attention, plays an important role in cognitive ability, and our data suggest a potential contribution of the putamen to human evolution.
Powell, J. Elijah; Ratnayeke, Nalin; Moran, Nancy A.
2017-01-01
High throughput rRNA amplicon surveys of bacterial communities provide a rapid snapshot of taxonomic composition. But strains with nearly identical rRNA sequences often differ in gene repertoires and metabolic capabilities. To assess strain-level variation within Snodgrassella alvi, a gut symbiont of corbiculate bees, we performed deep sequencing on amplicons of a single copy coding gene (minD) as well as the 16S rDNA V4 region. We surveyed honey bees (Apis mellifera) sampled globally and 12 bumble bee species (Bombus) sampled from two regions of the USA. The minD analyses reveal that S. alvi contains far more strain diversity than is evident from 16S rDNA analysis. Many taxa inferred on the basis of 16S rDNA are shared between A. mellifera and Bombus species, but taxa inferred on the basis of minD are never shared and often are restricted to particular Bombus species. Clustering based on minD revealed that gut communities often reflect host species and geographic location. Both minD and 16S rDNA analyses indicate that strain diversity is higher in A. mellifera than in Bombus species. The minD locus flanks a 16S gene, enabling development of strain-specific 16S fluorescent probes to illuminate the spatial relationship of strains within the bee gut. PMID:27482856
Gutierrez, Tony; Biddle, Jennifer F; Teske, Andreas; Aitken, Michael D
2015-01-01
Marine hydrocarbon-degrading bacteria perform a fundamental role in the biodegradation of crude oil and its petrochemical derivatives in coastal and open ocean environments. However, there is a paucity of knowledge on the diversity and function of these organisms in deep-sea sediment. Here we used stable-isotope probing (SIP), a valuable tool to link the phylogeny and function of targeted microbial groups, to investigate polycyclic aromatic hydrocarbon (PAH)-degrading bacteria under aerobic conditions in sediments from Guaymas Basin with uniformly labeled [(13)C]-phenanthrene (PHE). The dominant sequences in clone libraries constructed from (13)C-enriched bacterial DNA (from PHE enrichments) were identified to belong to the genus Cycloclasticus. We used quantitative PCR primers targeting the 16S rRNA gene of the SIP-identified Cycloclasticus to determine their abundance in sediment incubations amended with unlabeled PHE and showed substantial increases in gene abundance during the experiments. We also isolated a strain, BG-2, representing the SIP-identified Cycloclasticus sequence (99.9% 16S rRNA gene sequence identity), and used this strain to provide direct evidence of PHE degradation and mineralization. In addition, we isolated Halomonas, Thalassospira, and Lutibacterium sp. with demonstrable PHE-degrading capacity from Guaymas Basin sediment. This study demonstrates the value of coupling SIP with cultivation methods to identify and expand on the known diversity of PAH-degrading bacteria in the deep-sea.
Gutierrez, Tony; Biddle, Jennifer F.; Teske, Andreas; Aitken, Michael D.
2015-01-01
Marine hydrocarbon-degrading bacteria perform a fundamental role in the biodegradation of crude oil and its petrochemical derivatives in coastal and open ocean environments. However, there is a paucity of knowledge on the diversity and function of these organisms in deep-sea sediment. Here we used stable-isotope probing (SIP), a valuable tool to link the phylogeny and function of targeted microbial groups, to investigate polycyclic aromatic hydrocarbon (PAH)-degrading bacteria under aerobic conditions in sediments from Guaymas Basin with uniformly labeled [13C]-phenanthrene (PHE). The dominant sequences in clone libraries constructed from 13C-enriched bacterial DNA (from PHE enrichments) were identified to belong to the genus Cycloclasticus. We used quantitative PCR primers targeting the 16S rRNA gene of the SIP-identified Cycloclasticus to determine their abundance in sediment incubations amended with unlabeled PHE and showed substantial increases in gene abundance during the experiments. We also isolated a strain, BG-2, representing the SIP-identified Cycloclasticus sequence (99.9% 16S rRNA gene sequence identity), and used this strain to provide direct evidence of PHE degradation and mineralization. In addition, we isolated Halomonas, Thalassospira, and Lutibacterium sp. with demonstrable PHE-degrading capacity from Guaymas Basin sediment. This study demonstrates the value of coupling SIP with cultivation methods to identify and expand on the known diversity of PAH-degrading bacteria in the deep-sea. PMID:26217326
Jensen, Sigmund; Lynch, Michael D J; Ray, Jessica L; Neufeld, Josh D; Hovland, Martin
2015-10-01
Deep-sea coral reefs do not receive sunlight and depend on plankton. Little is known about the plankton composition at such reefs, even though they constitute habitats for many invertebrates and fish. We investigated plankton communities from three reefs at 260-350 m depth at hydrocarbon fields off the mid-Norwegian coast using a combination of cultivation and small subunit (SSU) rRNA gene and transcript sequencing. Eight-month incubations of a reef water sample with minimal medium, supplemented with carbon dioxide and gaseous alkanes at in situ-like conditions, enabled isolation of mostly Alphaproteobacteria (Sulfitobacter, Loktanella), Gammaproteobacteria (Colwellia) and Flavobacteria (Polaribacter). The relative abundance of isolates in the original sample ranged from ∼ 0.01% to 0.80%. Comparisons of bacterial SSU sequences from filtered plankton of reef and non-reef control samples indicated high abundance and metabolic activity of primarily Alphaproteobacteria (SAR11 Ia), Gammaproteobacteria (ARCTIC96BD-19), but also of Deltaproteobacteria (Nitrospina, SAR324). Eukaryote SSU sequences indicated metabolically active microalgae and animals, including codfish, at the reef sites. The plankton community composition varied between reefs and differed between DNA and RNA assessments. Over 5000 operational taxonomic units were detected, some indicators of reef sites (e.g. Flavobacteria, Cercozoa, Demospongiae) and some more active at reef sites (e.g. Gammaproteobacteria, Ciliophora, Copepoda).
USDA-ARS's Scientific Manuscript database
Verticillium dahliae is a soil-borne fungus that causes vascular wilt diseases in a wide range of plant hosts. V. dahliae produces multicelled, melanized resting bodies, also known as microsclerotia (MS) that can survive for years in the soil. Thus, MS formation marks an important event in the disea...
A phylogenetic analysis of Aquifex pyrophilus
NASA Technical Reports Server (NTRS)
Burggraf, S.; Olsen, G. J.; Stetter, K. O.; Woese, C. R.
1992-01-01
The 16S rRNA of the bacterium Aquifex pyrophilus, a microaerophilic, oxygen-reducing hyperthermophile, has been sequenced directly from the PCR-amplified gene. Phylogenetic analyses show the Aq. pyrophilus lineage to be probably the deepest (earliest) in the (eu)bacterial tree. The addition of this deep branching to the bacterial tree further supports the argument that the Bacteria are of thermophilic ancestry.
USDA-ARS's Scientific Manuscript database
Eight fermentative bacterial strains were isolated from mixed enrichment cultures of a composite soil sample collected at 1.34 km depth from the former Homestake gold mine in Lead, SD, USA. Phylogenetic analysis of their 16S rRNA gene sequences revealed that these isolates were affiliated with the p...
Triadó-Margarit, Xavier; Casamayor, Emilio O
2015-12-01
Diversity of small protists was studied in sulfidic and anoxic (euxinic) stratified karstic lakes and coastal lagoons by 18S rRNA gene analyses. We hypothesized a major sulfide effect, reducing protist diversity and richness with only a few specialized populations adapted to deal with low-redox conditions and high-sulfide concentrations. However, genetic fingerprinting suggested that ecological diversity in anoxic and sulfurous waters was similar to that in upper oxygen-rich water compartments, with specific populations inhabiting euxinic waters. Many of them agreed with genera previously identified by microscopic observations, but also new and unexpected groups were detected. Most of the sequences matched a rich assemblage of Ciliophora (i.e., Coleps, Prorodon, Plagiopyla, Strombidium, Metopus, Vorticella and Caenomorpha, among others) and algae (mainly Cryptomonadales). Unidentified Cercozoa, Fungi, Stramenopiles and Discoba were recurrently found. The lack of GenBank counterparts was higher in deep hypolimnetic waters and appeared differentially allocated in the different taxa, being higher within Discoba and lower in Cryptophyceae. A larger number of populations than expected were specifically detected in the deep sulfurous waters, with unknown ecological interactions and metabolic capabilities.
Lysobacter spongiicola sp. nov., isolated from a deep-sea sponge.
Romanenko, Lyudmila A; Uchino, Masataka; Tanaka, Naoto; Frolova, Galina M; Mikhailov, Valery V
2008-02-01
An aerobic, Gram-negative bacterium, strain KMM 329(T), was isolated from a deep-sea sponge specimen from the Philippine Sea and subjected to a polyphasic taxonomic investigation. Comparative 16S rRNA gene sequence analysis showed that strain KMM 329(T) clustered with the species of the genus Lysobacter. The highest level of 16S rRNA gene sequence similarity (97.0 %) was found with respect to Lysobacter concretionis KCTC 12205(T); lower values (96.4-95.2 %) were obtained with respect to the other recognized Lysobacter species. The value for DNA-DNA relatedness between strain KMM 329(T) and L. concretionis KCTC 12205(T) was 47 %. Branched fatty acids 16 : 0 iso, 15 : 0 iso, 11 : 0 iso 3-OH and 17 : 1 iso were found to be predominant. Strain KMM 329(T) had a DNA G+C content of 69.0 mol%. On the basis of the phenotypic, chemotaxonomic, DNA-DNA hybridization and phylogenetic data, strain KMM 329(T) represents a novel species of the genus Lysobacter, for which the name Lysobacter spongiicola sp. nov. is proposed. The type strain is KMM 329(T) (=NRIC 0728(T) =JCM 14760(T)).
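The two sequence-derived quantities quoted in this abstract, G+C content and pairwise 16S rRNA gene similarity, can be illustrated with a minimal sketch. It is not the authors' procedure: G+C content in mol% is usually determined experimentally, and the similarity values were presumably computed with dedicated alignment software. The function names and the rule of skipping gap columns are assumptions made for illustration.

```python
def gc_content(seq):
    """Return the percentage of G and C among unambiguous bases (A, C, G, T)."""
    seq = seq.upper()
    gc = sum(1 for base in seq if base in "GC")
    acgt = sum(1 for base in seq if base in "ACGT")
    return 100.0 * gc / acgt if acgt else 0.0


def percent_identity(aligned_a, aligned_b):
    """Percent identity of two pre-aligned, equal-length sequences.

    Columns containing a gap ('-') in either sequence are skipped,
    which is an assumed convention; other tools count gaps differently.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("sequences must be aligned to the same length")
    compared = matches = 0
    for x, y in zip(aligned_a.upper(), aligned_b.upper()):
        if x == "-" or y == "-":
            continue
        compared += 1
        matches += x == y
    return 100.0 * matches / compared if compared else 0.0


gc_example = gc_content("GGCCAT")                      # toy sequence
identity_example = percent_identity("ACGT-A", "ACGAAA")  # toy alignment
```

Real 16S rRNA comparisons operate on alignments of roughly 1,500 positions, so the toy strings here only show the arithmetic.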
MicroRNA repertoire for functional genome research in tilapia identified by deep sequencing.
Yan, Biao; Wang, Zhen-Hua; Zhu, Chang-Dong; Guo, Jin-Tao; Zhao, Jin-Liang
2014-08-01
The Nile tilapia (Oreochromis niloticus; Cichlidae) is an economically important species in aquaculture and occupies a prominent position in the aquaculture industry. MicroRNAs (miRNAs) are a class of noncoding RNAs that post-transcriptionally regulate gene expression involved in diverse biological and metabolic processes. To increase the repertoire of miRNAs characterized in tilapia, we used the Illumina/Solexa sequencing technology to sequence a small RNA library using a pooled RNA sample isolated from the different developmental stages of tilapia. Bioinformatic analyses suggest that 197 conserved and 27 novel miRNAs are expressed in tilapia. Sequence alignments indicate that all tested miRNAs and miRNAs* are highly conserved across many species. In addition, we characterized the tissue expression patterns of five miRNAs using real-time quantitative PCR. We found that miR-1/206, miR-7/9, and miR-122 are abundantly expressed in muscle, brain, and liver, respectively, implying a potential role in the regulation of tissue differentiation or the maintenance of tissue identity. Overall, our results expand the number of tilapia miRNAs, and the discovery of miRNAs in the tilapia genome contributes to a better understanding of the role of miRNAs in regulating diverse biological processes.
Phylogenetic diversity and position of the genus Campylobacter
NASA Technical Reports Server (NTRS)
Lau, P. P.; DeBrunner-Vossbrinck, B.; Dunn, B.; Miotto, K.; MacDonnell, M. T.; Rollins, D. M.; Pillidge, C. J.; Hespell, R. B.; Colwell, R. R.; Sogin, M. L.;
1987-01-01
RNA sequence analysis has been used to examine the phylogenetic position and structure of the genus Campylobacter. A complete 5S rRNA sequence was determined for two strains of Campylobacter jejuni and extensive partial sequences of the 16S rRNA were obtained for several strains of C. jejuni and Wolinella succinogenes. In addition, limited partial sequence data were obtained from the 16S rRNAs of isolates of C. coli, C. laridis, C. fetus, C. fecalis, and C. pyloridis. It was found that W. succinogenes is specifically related to, but not included in, the genus Campylobacter as presently constituted. Within the genus significant diversity was noted. C. jejuni, C. coli and C. laridis are very closely related but the other species are distinctly different from one another. C. pyloridis is without question the most divergent of the Campylobacter isolates examined here and is sufficiently distinct to warrant inclusion in a separate genus. In terms of overall position in bacterial phylogeny, the Campylobacter/Wolinella cluster represents a deep branching most probably located within an expanded version of the Division containing the purple photosynthetic bacteria and their relatives. The Campylobacter/Wolinella cluster is not specifically includable in either the alpha, beta or gamma subdivisions of the purple bacteria.
Genetic diversity among pandemic 2009 influenza viruses isolated from a transmission chain
2013-01-01
Background Influenza viruses such as swine-origin influenza A(H1N1) virus (A(H1N1)pdm09) generate genetic diversity due to the high error rate of their RNA polymerase, often resulting in mixed genotype populations (intra-host variants) within a single infection. This variation helps influenza to rapidly respond to selection pressures, such as those imposed by the immunological host response and antiviral therapy. We have applied deep sequencing to characterize influenza intra-host variation in a transmission chain consisting of three cases due to oseltamivir-sensitive viruses, and one derived oseltamivir-resistant case. Methods Following detection of the A(H1N1)pdm09 infections, we deep-sequenced the complete NA gene from two of the oseltamivir-sensitive virus-infected cases, and all eight gene segments of the viruses causing the remaining two cases. Results No evidence for the resistance-causing mutation (resulting in NA H275Y substitution) was observed in the oseltamivir-sensitive cases. Furthermore, deep sequencing revealed a subpopulation of oseltamivir-sensitive viruses in the case carrying resistant viruses. We detected higher levels of intra-host variation in the case carrying oseltamivir-resistant viruses than in those infected with oseltamivir-sensitive viruses. Conclusions Oseltamivir-resistance was only detected after prophylaxis with oseltamivir, suggesting that the mutation was selected for as a result of antiviral intervention. The persisting oseltamivir-sensitive virus population in the case carrying resistant viruses suggests either that a small proportion survive the treatment, or that the oseltamivir-sensitive virus rapidly re-establishes itself in the virus population after the bottleneck. Moreover, the increased intra-host variation in the oseltamivir-resistant case is consistent with the hypothesis that the population diversity of an RNA virus can increase rapidly following a population bottleneck. PMID:23587185
The sponge microbiome project.
Moitinho-Silva, Lucas; Nielsen, Shaun; Amir, Amnon; Gonzalez, Antonio; Ackermann, Gail L; Cerrano, Carlo; Astudillo-Garcia, Carmen; Easson, Cole; Sipkema, Detmer; Liu, Fang; Steinert, Georg; Kotoulas, Giorgos; McCormack, Grace P; Feng, Guofang; Bell, James J; Vicente, Jan; Björk, Johannes R; Montoya, Jose M; Olson, Julie B; Reveillaud, Julie; Steindler, Laura; Pineda, Mari-Carmen; Marra, Maria V; Ilan, Micha; Taylor, Michael W; Polymenakou, Paraskevi; Erwin, Patrick M; Schupp, Peter J; Simister, Rachel L; Knight, Rob; Thacker, Robert W; Costa, Rodrigo; Hill, Russell T; Lopez-Legentil, Susanna; Dailianis, Thanos; Ravasi, Timothy; Hentschel, Ute; Li, Zhiyong; Webster, Nicole S; Thomas, Torsten
2017-10-01
Marine sponges (phylum Porifera) are a diverse, phylogenetically deep-branching clade known for forming intimate partnerships with complex communities of microorganisms. To date, 16S rRNA gene sequencing studies have largely utilised different extraction and amplification methodologies to target the microbial communities of a limited number of sponge species, severely limiting comparative analyses of sponge microbial diversity and structure. Here, we provide an extensive and standardised dataset that will facilitate sponge microbiome comparisons across large spatial, temporal, and environmental scales. Samples from marine sponges (n = 3569 specimens), seawater (n = 370), marine sediments (n = 65) and other environments (n = 29) were collected from different locations across the globe. This dataset incorporates at least 268 different sponge species, including several yet unidentified taxa. The V4 region of the 16S rRNA gene was amplified and sequenced from extracted DNA using standardised procedures. Raw sequences (total of 1.1 billion sequences) were processed and clustered with (i) a standard protocol using QIIME closed-reference picking resulting in 39 543 operational taxonomic units (OTU) at 97% sequence identity, (ii) a de novo clustering using Mothur resulting in 518 246 OTUs, and (iii) a new high-resolution Deblur protocol resulting in 83 908 unique bacterial sequences. Abundance tables, representative sequences, taxonomic classifications, and metadata are provided. This dataset represents a comprehensive resource of sponge-associated microbial communities based on 16S rRNA gene sequences that can be used to address overarching hypotheses regarding host-associated prokaryotes, including host specificity, convergent evolution, environmental drivers of microbiome structure, and the sponge-associated rare biosphere. © The Authors 2017. Published by Oxford University Press.
DeepMirTar: a deep-learning approach for predicting human miRNA targets.
Wen, Ming; Cong, Peisheng; Zhang, Zhimin; Lu, Hongmei; Li, Tonghua
2018-06-01
MicroRNAs (miRNAs) are small noncoding RNAs that function in RNA silencing and post-transcriptional regulation of gene expression by targeting messenger RNAs (mRNAs). Because the underlying mechanisms associated with miRNA binding to mRNA are not fully understood, a major challenge of miRNA studies involves the identification of miRNA-target sites on mRNA. In silico prediction of miRNA-target sites can expedite costly and time-consuming experimental work by providing the most promising miRNA-target-site candidates. In this study, we report the design and implementation of DeepMirTar, a deep-learning-based approach for accurately predicting human miRNA targets at the site level. The predicted miRNA-target sites are those having a canonical or non-canonical seed, and three kinds of features (high-level expert-designed, low-level expert-designed, and raw-data-level) were used to represent the miRNA-target site. Comparison with other state-of-the-art machine-learning methods and existing miRNA-target-prediction tools indicated that DeepMirTar improved overall predictive performance. DeepMirTar is freely available at https://github.com/Bjoux2/DeepMirTar_SdA. Contact: lith@tongji.edu.cn, hongmeilu@csu.edu.cn. Supplementary data are available at Bioinformatics online.
Ordóñez-Baquera, Perla Lucía; González-Rodríguez, Everardo; Aguado-Santacruz, Gerardo Armando; Rascón-Cruz, Quintín; Conesa, Ana; Moreno-Brito, Verónica; Echavarria, Raquel; Dominguez-Viveros, Joel
2017-02-01
MicroRNAs (miRNAs) are small non-coding RNA molecules that regulate signal transduction, development, metabolism, and stress responses in plants through post-transcriptional degradation and/or translational repression of target mRNAs. Several studies have addressed the role of miRNAs in model plant species, but miRNA expression and function in economically important forage crops, such as Bouteloua gracilis (Poaceae), a high-quality and drought-resistant grass distributed in semiarid regions of the United States and northern Mexico, remain unknown. We applied high-throughput sequencing technology and bioinformatics analysis and identified 31 conserved miRNA families and 53 novel putative miRNAs with different abundance of reads in chlorophyllic cell cultures derived from B. gracilis. Some conserved miRNA families were highly abundant and possessed predicted targets involved in metabolism, plant growth and development, and stress responses. We also predicted additional identified novel miRNAs with specific targets, including B. gracilis ESTs, which were detected under drought stress conditions. Here we report 31 conserved miRNA families and 53 putative novel miRNAs in B. gracilis. Our results suggest the presence of regulatory miRNAs involved in modulating physiological and stress responses in this grass species. Copyright © 2016 Elsevier Ltd. All rights reserved.
Ferragut, Fátima; Vega, Celina G; Mauroy, Axel; Conceição-Neto, Nádia; Zeller, Mark; Heylen, Elisabeth; Uriarte, Enrique Louge; Bilbao, Gladys; Bok, Marina; Matthijnssens, Jelle; Thiry, Etienne; Badaracco, Alejandra; Parreño, Viviana
2016-06-01
Bovine noroviruses are enteric pathogens detected in fecal samples of both diarrheic and non-diarrheic calves from several countries worldwide. However, epidemiological information regarding bovine noroviruses is still lacking for many important cattle producing countries from South America. In this study, three bovine norovirus genogroup III sequences were determined by conventional RT-PCR and Sanger sequencing in feces from diarrheic dairy calves from Argentina (B4836, B4848, and B4881, all collected in 2012). Phylogenetic studies based on a partial coding region for the RNA-dependent RNA polymerase (RdRp, 503 nucleotides) of these three samples suggested that two of them (B4836 and B4881) belong to genotype 2 (GIII.2) while the third one (B4848) was more closely related to genotype 1 (GIII.1) strains. By deep sequencing, the capsid region from two of these strains could be determined. This confirmed the circulation of genotype 1 (B4848) together with the presence of another sequence (B4881) sharing its highest genetic relatedness with genotype 1, but sufficiently distant to constitute a new genotype. This latter strain was shown in silico to be a recombinant: phylogenetic divergence was detected between its RNA-dependent RNA polymerase coding sequence (genotype GIII.2) and its capsid protein coding sequence (genotype GIII.1 or a potential norovirus genotype). According to this data, this strain could be the second genotype GIII.2_GIII.1 bovine norovirus recombinant described in literature worldwide. Further analysis suggested that this strain could even be a potential norovirus GIII genotype, tentatively named GIII.4. The data provides important epidemiological and evolutionary information on bovine noroviruses circulating in South America. Copyright © 2016. Published by Elsevier B.V.
Deep sequencing-based analysis of the anaerobic stimulon in Neisseria gonorrhoeae
2011-01-01
Background Maintenance of an anaerobic denitrification system in the obligate human pathogen, Neisseria gonorrhoeae, suggests that an anaerobic lifestyle may be important during the course of infection. Furthermore, mounting evidence suggests that reduction of host-produced nitric oxide has several immunomodulatory effects on the host. However, at this point there have been no studies analyzing the complete gonococcal transcriptome response to anaerobiosis. Here we performed deep sequencing to compare the gonococcal transcriptomes of aerobically and anaerobically grown cells. Using the information derived from this sequencing, we discuss the implications of the robust transcriptional response to anaerobic growth. Results We determined that 198 chromosomal genes were differentially expressed (~10% of the genome) in response to anaerobic conditions. We also observed a large induction of genes encoded within the cryptic plasmid, pJD1. Validation of RNA-seq data using translational-lacZ fusions or RT-PCR demonstrated the RNA-seq results to be very reproducible. Surprisingly, many genes of prophage origin were induced anaerobically, as well as several transcriptional regulators previously unknown to be involved in anaerobic growth. We also confirmed expression and regulation of a small RNA, likely a functional equivalent of fnrS in the Enterobacteriaceae family. We also determined that many genes found to be responsive to anaerobiosis have also been shown to be responsive to iron and/or oxidative stress. Conclusions Gonococci will be subject to many forms of environmental stress, including oxygen-limitation, during the course of infection. Here we determined that the anaerobic stimulon in gonococci was larger than previous studies would suggest. Many new targets for future research have been uncovered, and the results derived from this study may have helped to elucidate factors or mechanisms of virulence that may have otherwise been overlooked. PMID:21251255
Dawn of the in vivo RNA structurome and interactome.
Kwok, Chun Kit
2016-10-15
RNA is one of the most fascinating biomolecules in living systems given its structural versatility to fold into elaborate architectures for important biological functions such as gene regulation, catalysis, and information storage. Knowledge of RNA structures and interactions can provide deep insights into their functional roles in vivo. For decades, RNA structural studies have been conducted on a transcript-by-transcript basis. The advent of next-generation sequencing (NGS) has enabled the development of transcriptome-wide structural probing methods to profile the global landscape of RNA structures and interactions, also known as the RNA structurome and interactome, which transformed our understanding of the RNA structure-function relationship on a transcriptomic scale. In this review, molecular tools and NGS methods used for RNA structure probing are presented, novel insights uncovered by RNA structurome and interactome studies are highlighted, and perspectives on current challenges and potential future directions are discussed. A more complete understanding of the RNA structures and interactions in vivo will help illuminate the novel roles of RNA in gene regulation, development, and diseases. © 2016 The Author(s); published by Portland Press Limited on behalf of the Biochemical Society.
Lopez-Fernandez, Margarita; Cherkouk, Andrea; Vilchez-Vargas, Ramiro; Jauregui, Ruy; Pieper, Dietmar; Boon, Nico; Sanchez-Castro, Ivan; Merroun, Mohamed L
2015-11-01
The long-term disposal of radioactive wastes in a deep geological repository is the accepted international solution for the treatment and management of these special residues. The microbial community of the selected host rocks and engineered barriers for the deep geological repository may affect the performance and the safety of the radioactive waste disposal. In this work, the bacterial population of bentonite formations of Almeria (Spain), selected as a reference material for bentonite-engineered barriers in the disposal of radioactive wastes, was studied. 16S ribosomal RNA (rRNA) gene-based approaches were used to study the bacterial community of the bentonite samples by traditional clone libraries and Illumina sequencing. Using both techniques, the bacterial diversity analysis revealed similar results, with phylotypes belonging to 14 different bacterial phyla: Acidobacteria, Actinobacteria, Armatimonadetes, Bacteroidetes, Chloroflexi, Cyanobacteria, Deinococcus-Thermus, Firmicutes, Gemmatimonadetes, Planctomycetes, Proteobacteria, Nitrospirae, Verrucomicrobia and an unknown phylum. The dominant groups of the community were represented by Proteobacteria and Bacteroidetes. A high diversity was found in three of the studied samples. However, two samples were less diverse and dominated by Betaproteobacteria.
Rudolph, C; Wanner, G; Huber, R
2001-05-01
We report the identification of novel archaea living in close association with bacteria in the cold (approximately 10 degrees C) sulfurous marsh water of the Sippenauer Moor near Regensburg, Bavaria, Germany. These microorganisms form a characteristic, macroscopically visible structure, morphologically comparable to a string of pearls. Tiny, whitish globules (the pearls; diameter, about 0.5 to 3.0 mm) are connected to each other by thin, white-colored threads. Fluorescent in situ hybridization (FISH) studies have revealed that the outer part of the pearls is mainly composed of bacteria, with a filamentous bacterium predominating. Internally, archaeal cocci are the predominant microorganisms, with up to 10^7 cells estimated to be present in a single pearl. The archaea appear to be embedded in a polymer of unknown chemical composition. According to FISH and 16S rRNA gene sequence analysis, the archaea are affiliated with the euryarchaeal kingdom. The new euryarchaeal sequence represents a deep phylogenetic branch within the 16S rRNA tree and does not show extensive similarity to any cultivated archaea or to 16S rRNA gene sequences from environmental samples.
Filteau, Marie; Lagacé, Luc; LaPointe, Gisèle; Roy, Denis
2010-04-01
An arbitrary primed community PCR fingerprinting technique based on capillary electrophoresis was developed to study maple sap microbial community characteristics among 19 production sites in Québec over the tapping season. Presumptive fragment identification was made with corresponding fingerprint profiles of bacterial isolate cultures. Maple sap microbial communities were subsequently compared using a representative subset of 13 16S rRNA gene clone libraries followed by gene sequence analysis. Results from both methods indicated that all maple sap production sites and flow periods shared common microbiota members, but distinctive features also existed. Changes over the season in relative abundance of predominant populations showed evidence of a common pattern. Pseudomonas (64%) and Rahnella (8%) were the most abundantly and frequently represented genera of the 2239 sequences analyzed. Janthinobacterium, Leuconostoc, Lactococcus, Weissella, Epilithonimonas and Sphingomonas were revealed as occasional contaminants in maple sap. Maple sap microbiota showed a low level of deep diversity along with a high variation of similar 16S rRNA gene sequences within the Pseudomonas genus. Predominance of Pseudomonas is suggested as a typical feature of maple sap microbiota across geographical regions, production sites, and sap flow periods.
Poly(A)-tag deep sequencing data processing to extract poly(A) sites.
Wu, Xiaohui; Ji, Guoli; Li, Qingshun Quinn
2015-01-01
Polyadenylation [poly(A)] is an essential posttranscriptional processing step in the maturation of eukaryotic mRNA. The advent of next-generation sequencing (NGS) technology has offered feasible means to generate large-scale data and new opportunities for intensive study of polyadenylation, particularly deep sequencing of the transcriptome targeting the junction of 3'-UTR and the poly(A) tail of the transcript. To take advantage of this unprecedented amount of data, we present an automated workflow to identify polyadenylation sites by integrating NGS data cleaning, processing, mapping, normalizing, and clustering. In this pipeline, a series of Perl scripts are seamlessly integrated to iteratively map the single- or paired-end sequences to the reference genome. After mapping, the poly(A) tags (PATs) at the same genome coordinate are grouped into one cleavage site, and the internal priming artifacts removed. Then the ambiguous region is introduced to parse the genome annotation for cleavage site clustering. Finally, cleavage sites within a close range of 24 nucleotides and from different samples can be clustered into poly(A) clusters. This procedure could be used to identify thousands of reliable poly(A) clusters from millions of NGS sequences in different tissues or treatments.
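The grouping and clustering steps described in this abstract can be sketched as follows. This is a minimal illustration and not the authors' Perl pipeline: the function names, the (chromosome, strand, position) tuple layout, and the rule that a site joins a cluster when it lies within 24 nt of the previous site are all assumptions. Mapping, internal-priming removal, and annotation parsing are left out.

```python
from collections import Counter


def group_pats(pats):
    """Collapse poly(A) tags (PATs) at the same coordinate into cleavage sites.

    `pats` is an iterable of (chrom, strand, position) tuples, a hypothetical
    layout. Returns a Counter mapping each site to its number of PATs.
    """
    return Counter(pats)


def cluster_sites(sites, max_gap=24):
    """Chain cleavage sites on the same chromosome and strand into clusters.

    A site extends the current cluster when it is at most `max_gap` nt from
    the previous site (an assumed rule); otherwise it opens a new cluster.
    """
    clusters = []
    for chrom, strand, pos in sorted(sites):
        last = clusters[-1] if clusters else None
        same_locus = (last is not None and last["chrom"] == chrom
                      and last["strand"] == strand
                      and pos - last["end"] <= max_gap)
        if same_locus:
            last["end"] = pos
            last["count"] += sites[(chrom, strand, pos)]
        else:
            clusters.append({"chrom": chrom, "strand": strand,
                             "start": pos, "end": pos,
                             "count": sites[(chrom, strand, pos)]})
    return clusters


# Toy input: five PATs, two of which share a coordinate.
pats = [("chr1", "+", 100), ("chr1", "+", 100), ("chr1", "+", 110),
        ("chr1", "+", 200), ("chr1", "-", 105)]
sites = group_pats(pats)
clusters = cluster_sites(sites)
```

Pooling PATs from several samples before calling `group_pats` would correspond to clustering sites "from different samples"; per-sample counts would then need to be tracked separately.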
2010-01-01
Background Systematic research on fish immunogenetics is indispensable in understanding the origin and evolution of immune systems. This has long been a challenging task because of the limited number of deep sequencing technologies and genome backgrounds of non-model fish available. The newly developed Solexa/Illumina RNA-seq and Digital gene expression (DGE) are high-throughput sequencing approaches and are powerful tools for genomic studies at the transcriptome level. This study reports the transcriptome profiling analysis of bacteria-challenged Lateolabrax japonicus using RNA-seq and DGE in an attempt to gain insights into the immunogenetics of marine fish. Results RNA-seq analysis generated 169,950 non-redundant consensus sequences, among which 48,987 functional transcripts with complete or various length encoding regions were identified. More than 52% of these transcripts are possibly involved in approximately 219 known metabolic or signalling pathways, while 2,673 transcripts were associated with immune-relevant genes. In addition, approximately 8% of the transcripts appeared to be fish-specific genes that have never been described before. DGE analysis revealed that the host transcriptome profile of Vibrio harveyi-challenged L. japonicus is considerably altered, as indicated by the significant up- or down-regulation of 1,224 strong infection-responsive transcripts. Results indicated an overall conservation of the components and transcriptome alterations underlying innate and adaptive immunity in fish and other vertebrate models. Analysis suggested the acquisition of numerous fish-specific immune system components during early vertebrate evolution. Conclusion This study provided a global survey of host defence gene activities against bacterial challenge in a non-model marine fish. 
Results can contribute to the in-depth study of candidate genes in marine fish immunity, and help improve current understanding of host-pathogen interactions and evolutionary history of immunogenetics from fish to mammals. PMID:20707909
Villacreses, Javier; Rojas-Herrera, Marcelo; Sánchez, Carolina; Hewstone, Nicole; Undurraga, Soledad F.; Alzate, Juan F.; Manque, Patricio; Maracaja-Coutinho, Vinicius; Polanco, Victor
2015-01-01
Here, we report the genome sequence and evidence for transcriptional activity of a virus-like element in the native Chilean berry tree Aristotelia chilensis. We propose to name the endogenous sequence Aristotelia chilensis Virus 1 (AcV1). High-throughput sequencing of the genome of this tree uncovered an endogenous viral element, with a size of 7122 bp, corresponding to the complete genome of AcV1. Its sequence contains three open reading frames (ORFs): ORFs 1 and 2 share 66%–73% amino acid similarity with members of the Caulimoviridae virus family, especially the Petunia vein clearing virus (PVCV), Petuvirus genus. ORF1 encodes a movement protein (MP); ORF2 a Reverse Transcriptase (RT) and a Ribonuclease H (RNase H) domain; and ORF3 showed no amino acid sequence similarity with any other known virus proteins. Analogous to other known endogenous pararetrovirus sequences (EPRVs), AcV1 is integrated in the genome of Maqui Berry and showed low viral transcriptional activity, which was detected by deep sequencing technology (DNA and RNA-seq). Phylogenetic analysis of AcV1 and other pararetroviruses revealed a closer resemblance with Petuvirus. Overall, our data suggest that AcV1 could be a new member of the Caulimoviridae family, genus Petuvirus, and the first evidence of this kind of virus in a fruit plant. PMID:25855242
Qu, Wen; Cingolani, Pablo; Zeeberg, Barry R; Ruden, Douglas M
2017-01-01
Deep sequencing of cDNAs made from spliced mRNAs indicates that most coding genes in many animals and plants have pre-mRNA transcripts that are alternatively spliced. In pre-mRNAs, in addition to invariant exons that are present in almost all mature mRNA products, there are at least 6 additional types of exons, such as exons from alternative promoters or with alternative polyA sites, mutually exclusive exons, skipped exons, or exons with alternative 5' or 3' splice sites. Our bioinformatics-based hypothesis is that, in analogy to the genetic code, there is an "alternative-splicing code" in introns and flanking exon sequences that directs alternative splicing of many of the 36 types of introns. In humans, we identified 42 different consensus sequences that are each present in at least 100 human introns. 37 of the 42 top consensus sequences are significantly enriched or depleted in at least one of the 36 types of introns. We further supported our hypothesis by showing that 96 out of 96 analyzed human disease mutations that affect RNA splicing, and change alternative splicing from one class to another, can be partially explained by a mutation altering a consensus sequence from one type of intron to that of another type of intron. Some of the alternative splicing consensus sequences, and presumably their small-RNA or protein targets, are evolutionarily conserved from 50 plant to animal species. We also noticed that the set of introns within a gene usually share the same splicing codes, thus arguing that one sub-type of spliceosome might process all (or most) of the introns in a given gene. Our work sheds new light on a possible mechanism for generating the tremendous diversity in protein structure by alternative splicing of pre-mRNAs.
Kasai, Megumi; Matsumura, Hideo; Yoshida, Kentaro; Terauchi, Ryohei; Taneda, Akito; Kanazawa, Akira
2013-01-30
Introduction of a transgene that transcribes RNA homologous to an endogenous gene in the plant genome can induce silencing of both genes, a phenomenon termed cosuppression. Cosuppression was first discovered in transgenic petunia plants transformed with the CHS-A gene encoding chalcone synthase, in which nonpigmented sectors in flowers or completely white flowers are produced. Some of the flower-color patterns observed in transgenic petunias having CHS-A cosuppression resemble those in existing nontransgenic varieties. Although the mechanism by which white sectors are generated in nontransgenic petunia is known to be due to RNA silencing of the CHS-A gene as in cosuppression, whether the same trigger(s) and/or pattern of RNA degradation are involved in these phenomena has not been known. Here, we addressed this question using deep-sequencing and bioinformatic analyses of small RNAs. We analyzed short interfering RNAs (siRNAs) produced in nonpigmented sectors of petal tissues in transgenic petunia plants that have CHS-A cosuppression and a nontransgenic petunia variety Red Star, that has naturally occurring CHS-A RNA silencing. In both silencing systems, 21-nt and 22-nt siRNAs were the most and the second-most abundant size classes, respectively. CHS-A siRNA production was confined to exon 2, indicating that RNA degradation through the RNA silencing pathway occurred in this exon. Common siRNAs were detected in cosuppression and naturally occurring RNA silencing, and their ranks based on the number of siRNAs in these plants were correlated with each other. Noticeably, highly abundant siRNAs were common in these systems. Phased siRNAs were detected in multiple phases at multiple sites, and some of the ends of the regions that produced phased siRNAs were conserved. 
The features of siRNA production found to be common to cosuppression and naturally occurring silencing of the CHS-A gene indicate mechanistic similarities between these silencing systems especially in the biosynthetic processes of siRNAs including cleavage of CHS-A transcripts and subsequent production of secondary siRNAs in exon 2. The data also suggest that these events occurred at multiple sites, which can be a feature of these silencing phenomena.
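The size-class ranking reported in this abstract (21-nt most abundant, 22-nt second) amounts to a read-length tally over small-RNA sequences. A minimal sketch, assuming the reads are already adapter-trimmed strings mapped to CHS-A; the function name and the 18 to 30 nt window are illustrative assumptions, not details from the study.

```python
from collections import Counter


def size_class_ranking(reads, min_len=18, max_len=30):
    """Rank small-RNA size classes by abundance.

    `reads` is an iterable of trimmed read sequences. Reads outside the
    assumed `min_len`..`max_len` window are ignored. Returns a list of
    (length, count) pairs, most abundant first.
    """
    counts = Counter(len(read) for read in reads
                     if min_len <= len(read) <= max_len)
    return counts.most_common()


# Toy reads: three 21-nt, two 22-nt, one 24-nt, and one 5-nt fragment.
reads = ["A" * 21, "C" * 21, "G" * 21, "A" * 22, "T" * 22, "A" * 24, "ACGTA"]
ranking = size_class_ranking(reads)
```

Comparing the rankings of individual siRNA sequences between two samples, as the abstract describes, would use the same tally keyed on the sequence rather than its length.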
2011-01-01
Background Readthrough fusions across adjacent genes in the genome, or transcription-induced chimeras (TICs), have been estimated using expressed sequence tag (EST) libraries to involve 4-6% of all genes. Deep transcriptional sequencing (RNA-Seq) now makes it possible to study the occurrence and expression levels of TICs in individual samples across the genome. Methods We performed single-end RNA-Seq on three human prostate adenocarcinoma samples and their corresponding normal tissues, as well as brain and universal reference samples. We developed two bioinformatics methods to specifically identify TIC events: a targeted alignment method using artificial exon-exon junctions within 200,000 bp from adjacent genes, and genomic alignment allowing splicing within individual reads. We performed further experimental verification and characterization of selected TIC and fusion events using quantitative RT-PCR and comparative genomic hybridization microarrays. Results Targeted alignment against artificial exon-exon junctions yielded 339 distinct TIC events, including 32 gene pairs with multiple isoforms. The false discovery rate was estimated to be 1.5%. Spliced alignment to the genome was less sensitive, finding only 18% of those found by targeted alignment in 33-nt reads and 59% of those in 50-nt reads. However, spliced alignment revealed 30 cases of TICs with intervening exons, in addition to distant inversions, scrambled genes, and translocations. Our findings increase the catalog of observed TIC gene pairs by 66%. We verified 6 of 6 predicted TICs in all prostate samples, and 2 of 5 predicted novel distant gene fusions, both private events among 54 prostate tumor samples tested. 
Expression of TICs correlates with that of the upstream gene, which can explain the prostate-specific pattern of some TIC events and the restriction of the SLC45A3-ELK4 e4-e2 TIC to ERG-negative prostate samples, as confirmed in 20 matched prostate tumor and normal samples and 9 lung cancer cell lines. Conclusions Deep transcriptional sequencing and analysis with targeted and spliced alignment methods can effectively identify TIC events across the genome in individual tissues. Prostate and reference samples exhibit a wide range of TIC events, involving more genes than estimated previously using ESTs. Tissue specificity of TIC events is correlated with expression patterns of the upstream gene. Some TIC events, such as MSMB-NCOA4, may play functional roles in cancer. PMID:21261984
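The targeted alignment step described in this abstract relies on a library of artificial exon-exon junctions between adjacent genes. The following is a minimal sketch of how such a library might be built, not the authors' implementation. The 200,000 bp window and the 33-nt read length come from the abstract; the function name `junction_library`, the `(start, end, sequence)` tuple layout, and the choice of `read_len - 1` flanking bases are assumptions made for illustration.

```python
from itertools import product

def junction_library(upstream_exons, downstream_exons, read_len=33, max_gap=200_000):
    """Build artificial exon-exon junction sequences for two adjacent genes.

    Each exon is a hypothetical (start, end, sequence) tuple on the same
    chromosome and strand. Only exon pairs whose genomic gap is positive and
    at most max_gap are joined.
    """
    flank = read_len - 1  # assumed: a read covers at least 1 nt on each side
    junctions = {}
    for (_, u_end, u_seq), (d_start, _, d_seq) in product(upstream_exons, downstream_exons):
        gap = d_start - u_end
        if 0 < gap <= max_gap:
            # Join the 3' end of the upstream exon to the 5' end of the downstream exon.
            junctions[(u_end, d_start)] = u_seq[-flank:] + d_seq[:flank]
    return junctions

# Toy coordinates and sequences, invented purely for illustration.
upstream = [(100, 110, "AAAAACCCCC")]
downstream = [(500, 510, "GGGGGTTTTT"), (300_500, 300_510, "TTTTT")]
library = junction_library(upstream, downstream, read_len=4)
```

Reads would then be aligned against the sequences in `library` with any short-read aligner; the second toy exon is skipped because it lies more than 200,000 bp downstream.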
Zygotic amplification of secondary piRNAs during silkworm embryogenesis
Kawaoka, Shinpei; Arai, Yuji; Kadota, Koji; Suzuki, Yutaka; Hara, Kahori; Sugano, Sumio; Shimizu, Kentaro; Tomari, Yukihide; Shimada, Toru; Katsuma, Susumu
2011-01-01
PIWI-interacting RNAs (piRNAs) are 23–30-nucleotide-long small RNAs that act as sequence-specific silencers of transposable elements in animal gonads. In flies, genetics and deep sequencing data have led to a hypothesis for piRNA biogenesis called the ping-pong cycle, where antisense primary piRNAs initiate an amplification loop to generate sense secondary piRNAs. However, to date, the process of the ping-pong cycle has never been monitored at work. Here, by large-scale profiling of piRNAs from silkworm ovary and embryos of different developmental stages, we demonstrate that maternally inherited antisense-biased piRNAs trigger acute amplification of secondary sense piRNA production in zygotes, at a time coinciding with zygotic transcription of sense transposon mRNAs. These results provide on-site evidence for the ping-pong cycle. PMID:21628432
Vaché, Christel; Besnard, Thomas; le Berre, Pauline; García-García, Gema; Baux, David; Larrieu, Lise; Abadie, Caroline; Blanchet, Catherine; Bolz, Hanno Jörn; Millan, Jose; Hamel, Christian; Malcolm, Sue; Claustres, Mireille; Roux, Anne-Françoise
2012-01-01
USH2A sequencing in three affected members of a large family, referred for the recessive USH2 syndrome, identified a single pathogenic alteration in one of them and a different mutation in the two affected nieces. As the patients carried a common USH2A haplotype, they likely shared a mutation not found by standard sequencing techniques. Analysis of RNA from nasal cells in one affected individual identified an additional pseudoexon (PE) resulting from a deep intronic mutation. This was confirmed by minigene assay. This is the first example in Usher syndrome (USH) with a mutation causing activation of a PE. The finding of this alteration in eight other individuals of mixed European origin emphasizes the importance of including RNA analysis in a comprehensive diagnostic service. Finally, this mutation, which would not have been found by whole-exome sequencing, could offer, for the first time in USH, the possibility of therapeutic correction by antisense oligonucleotides (AONs). © 2011 Wiley Periodicals, Inc.
ADAR2 induces reproducible changes in sequence and abundance of mature microRNAs in the mouse brain
Vesely, Cornelia; Tauber, Stefanie; Sedlazeck, Fritz J.; Tajaddod, Mansoureh; von Haeseler, Arndt; Jantsch, Michael F.
2014-01-01
Adenosine deaminases that act on RNA (ADARs) deaminate adenosines to inosines in double-stranded RNAs including miRNA precursors. A to I editing is widespread and required for normal life. By comparing deep sequencing data of brain miRNAs from wild-type and ADAR2 deficient mouse strains, we detect editing sites and altered miRNA processing at high sensitivity. We detect 48 novel editing events in miRNAs. Some editing events reach frequencies of up to 80%. About half of all editing events depend on ADAR2 while some miRNAs are preferentially edited by ADAR1. Sixty-four percent of all editing events are located within the seed region of mature miRNAs. For the highly edited miR-3099, we experimentally prove retargeting of the edited miRNA to novel 3′ UTRs. We show further that an abundant editing event in miR-497 promotes processing by Drosha of the corresponding pri-miRNA. We also detect reproducible changes in the abundance of specific miRNAs in ADAR2-deficient mice that occur independent of adjacent A to I editing events. This indicates that ADAR2 binding but not editing of miRNA precursors may influence their processing. Correlating with changes in miRNA abundance we find misregulation of putative targets of these miRNAs in the presence or absence of ADAR2. PMID:25260591
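Because inosine is read as guanosine during sequencing, an A-to-I editing frequency can be estimated as the share of reads showing G at a reference-A position, and the wild-type and ADAR2-deficient values can then be compared. The sketch below illustrates that idea only; the read counts, the 0.5 threshold, and the seed definition (positions 2 to 8 of the mature miRNA) are assumptions, not values taken from the study.

```python
def editing_frequency(base_counts):
    """Fraction of reads with G at a reference-A position (inosine reads as G)."""
    a = base_counts.get("A", 0)
    g = base_counts.get("G", 0)
    total = a + g
    return g / total if total else 0.0

def in_seed(position, seed=(2, 8)):
    """True if a 1-based position in the mature miRNA lies in the assumed seed region."""
    return seed[0] <= position <= seed[1]

# Invented toy counts for one site in each genotype.
wild_type = {"A": 20, "G": 80}
adar2_deficient = {"A": 95, "G": 5}

wt_freq = editing_frequency(wild_type)
ko_freq = editing_frequency(adar2_deficient)
# Arbitrary illustrative cutoff: call the site ADAR2-dependent if editing drops sharply.
adar2_dependent = (wt_freq - ko_freq) > 0.5
```

A site whose editing persists in the ADAR2-deficient sample would, by the same logic, be a candidate for ADAR1-mediated editing.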
Bacterial community diversity of the deep-sea octocoral Paramuricea placomus.
Kellogg, Christina A; Ross, Steve W; Brooke, Sandra D
2016-01-01
Compared to tropical corals, much less is known about deep-sea coral biology and ecology. Although the microbial communities of some deep-sea corals have been described, this is the first study to characterize the bacterial community associated with the deep-sea octocoral, Paramuricea placomus. Samples from five colonies of P. placomus were collected from Baltimore Canyon (379-382 m depth) in the Atlantic Ocean off the east coast of the United States of America. DNA was extracted from the coral samples and 16S rRNA gene amplicons were pyrosequenced using V4-V5 primers. Three samples sequenced deeply (>4,000 sequences each) and were further analyzed. The dominant microbial phylum was Proteobacteria, but other major phyla included Firmicutes and Planctomycetes. A conserved community of bacterial taxa held in common across the three P. placomus colonies was identified, comprising 68-90% of the total bacterial community depending on the coral individual. The bacterial community of P. placomus does not appear to include the genus Endozoicomonas, which has been found previously to be the dominant bacterial associate in several temperate and tropical gorgonians. Inferred functionality suggests the possibility of nitrogen cycling by the core bacterial community.
Next-generation libraries for robust RNA interference-based genome-wide screens
Kampmann, Martin; Horlbeck, Max A.; Chen, Yuwen; Tsai, Jordan C.; Bassik, Michael C.; Gilbert, Luke A.; Villalta, Jacqueline E.; Kwon, S. Chul; Chang, Hyeshik; Kim, V. Narry; Weissman, Jonathan S.
2015-01-01
Genetic screening based on loss-of-function phenotypes is a powerful discovery tool in biology. Although the recent development of clustered regularly interspaced short palindromic repeats (CRISPR)-based screening approaches in mammalian cell culture has enormous potential, RNA interference (RNAi)-based screening remains the method of choice in several biological contexts. We previously demonstrated that ultracomplex pooled short-hairpin RNA (shRNA) libraries can largely overcome the problem of RNAi off-target effects in genome-wide screens. Here, we systematically optimize several aspects of our shRNA library, including the promoter and microRNA context for shRNA expression, selection of guide strands, and features relevant for postscreen sample preparation for deep sequencing. We present next-generation high-complexity libraries targeting human and mouse protein-coding genes, which we grouped into 12 sublibraries based on biological function. A pilot screen suggests that our next-generation RNAi library performs comparably to current CRISPR interference (CRISPRi)-based approaches and can yield complementary results with high sensitivity and high specificity. PMID:26080438
Sokol, Martin; Jessen, Karen Margrethe; Pedersen, Finn Skou
2016-01-01
Several studies have shown that human endogenous retroviruses and endogenous retrovirus-like repeats (here collectively HERVs) impose direct regulation on human genes through enhancer and promoter motifs present in their long terminal repeats (LTRs). Although chimeric transcription, in which novel gene isoforms containing retroviral and human sequence are transcribed from viral promoters, is commonly associated with disease, regulation by HERVs is beneficial in other settings; for example, in human testis chimeric isoforms of TP63 induced by an ERV9 LTR protect the male germ line upon DNA damage by inducing apoptosis, whereas in the human globin locus the γ- and β-globin switch during normal hematopoiesis is mediated by complex interactions of an ERV9 LTR and surrounding human sequence. The advent of deep sequencing or next-generation sequencing (NGS) has revolutionized the way researchers solve important scientific questions and develop novel hypotheses in relation to human genome regulation. We recently applied next-generation paired-end RNA-sequencing (RNA-seq) together with chromatin immunoprecipitation with sequencing (ChIP-seq) to examine ERV9 chimeric transcription in human reference cell lines from Encyclopedia of DNA Elements (ENCODE). This led to the discovery of advanced regulation mechanisms by ERV9s and other HERVs across numerous human loci including transcription of large gene-unannotated genomic regions, as well as cooperative regulation by multiple HERVs and non-LTR repeats such as Alu elements. In this article, well-established examples of human gene regulation by HERVs are reviewed followed by a description of paired-end RNA-seq, and its application in identifying chimeric transcription genome-wide. Based on integrative analyses of RNA-seq and ChIP-seq data, we then present novel examples of regulation by ERV9s of tumor suppressor genes CADM2 and SEMA3A, as well as transcription of an unannotated region. 
Taken together, this article highlights the high suitability of contemporary sequencing methods in future analyses of human biology in relation to evolutionarily acquired retroviruses in the human genome. © 2016 APMIS. Published by John Wiley & Sons Ltd.
Su, Andreas A. H.; Tripp, Vanessa; Randau, Lennart
2013-01-01
The methanogenic archaeon Methanopyrus kandleri grows near the upper temperature limit for life. Genome analyses revealed strategies to adapt to these harsh conditions and elucidated a unique transfer RNA (tRNA) C-to-U editing mechanism at base 8 for 30 different tRNA species. Here, RNA-Seq deep sequencing methodology was combined with computational analyses to characterize the small RNome of this hyperthermophilic organism and to obtain insights into the RNA metabolism at extreme temperatures. A total of 132 small RNAs were identified that guide RNA modifications, which are expected to stabilize structured RNA molecules. The C/D box guide RNAs were shown to exist as circular RNA molecules. In addition, clustered regularly interspaced short palindromic repeats RNA processing and potential regulatory RNAs were identified. Finally, the identification of tRNA precursors before and after the unique C8-to-U8 editing activity enabled the determination of the order of tRNA processing events with termini truncation preceding intron removal. This order of tRNA maturation follows the compartmentalized tRNA processing order found in Eukaryotes and suggests its conservation during evolution. PMID:23620296
Dong, Yibo; Yuan, Qianhua; Wang, Feng; Li, Weimin; Jiang, Ying; Jia, Shirong; Pei, XinWu
2013-01-01
Background MicroRNAs (miRNAs) are a class of non-coding RNAs involved in post-transcriptional control of gene expression, via degradation and/or translational inhibition. Six hundred sixty-one rice miRNAs that are important in plant development are known. However, flowering-related miRNAs have not been characterized in Oryza rufipogon Griff. The study was approved by the supervision department of Guangdong wild rice protection. We analyzed flowering-related miRNAs in O. rufipogon using high-throughput sequencing (deep sequencing) to understand the changes that occurred during rice domestication, and to elucidate their functions in flowering. Results Three O. rufipogon sRNA libraries, two vegetative stage (CWR-V1 and CWR-V2) and one flowering stage (CWR-F2), were sequenced using Illumina deep sequencing. A total of 20,156,098, 21,531,511 and 20,995,942 high quality sRNA reads were obtained from CWR-V1, CWR-V2 and CWR-F2, respectively, of which 3,448,185, 4,265,048 and 2,833,527 reads matched known miRNAs. We identified 512 known rice miRNAs in 214 miRNA families and predicted 290 new miRNAs. Targeted functional annotation, GO and KEGG pathway analyses predicted that 187 miRNAs regulate expression of flowering-related genes. Differential expression analysis of flowering-related miRNAs showed that expression of 95 miRNAs varied significantly between the libraries; 66 are flowering-related miRNAs, such as oru-miR97, oru-miR117, oru-miR135 and oru-miR137, and 17 are early-flowering-related miRNAs, including osa-miR160f, osa-miR164d, osa-miR167d, osa-miR169a, osa-miR172b and oru-miR4, induced during the floral transition. Real-time PCR revealed the same expression patterns as deep sequencing. miRNA targets were confirmed for cleavage by 5′-RACE in vivo, and were negatively regulated by miRNAs. Conclusions This is the first investigation of flowering miRNAs in wild rice. 
The results indicate that variation in miRNAs occurred during rice domestication and lay a foundation for further study of phase change and flowering in O. rufipogon. Complicated regulatory networks mediated by multiple miRNAs regulate the expression of flowering genes that control the induction of flowering. PMID:24386120
Unique microbial community in drilling fluids from Chinese continental scientific drilling
Zhang, Gengxin; Dong, Hailiang; Jiang, Hongchen; Xu, Zhiqin; Eberl, Dennis D.
2006-01-01
Circulating drilling fluid is often regarded as a contamination source in investigations of subsurface microbiology. However, it also provides an opportunity to sample geological fluids at depth and to study contained microbial communities. During our study of deep subsurface microbiology of the Chinese Continental Scientific Deep drilling project, we collected 6 drilling fluid samples from a borehole from 2290 to 3350 m below the land surface. Microbial communities in these samples were characterized with cultivation-dependent and -independent techniques. Characterization of 16S rRNA genes indicated that the bacterial clone sequences related to Firmicutes became progressively dominant with increasing depth. Most sequences were related to anaerobic, thermophilic, halophilic or alkaliphilic bacteria. These habitats were consistent with the measured geochemical characteristics of the drilling fluids that have incorporated geological fluids and partly reflected the in-situ conditions. Several clone types were closely related to Thermoanaerobacter ethanolicus, Caldicellulosiruptor lactoaceticus, and Anaerobranca gottschalkii, an anaerobic metal-reducer, an extreme thermophile, and an anaerobic chemoorganotroph, respectively, with an optimal growth temperature of 50–68°C. Seven anaerobic, thermophilic Fe(III)-reducing bacterial isolates were obtained and they were capable of reducing iron oxide and clay minerals to produce siderite, vivianite, and illite. The archaeal diversity was low. Most archaeal sequences were not related to any known cultivated species, but rather to environmental clone sequences recovered from subsurface environments. We infer that the detected microbes were derived from geological fluids at depth and their growth habitats reflected the deep subsurface conditions. These findings have important implications for microbial survival and their ecological functions in the deep subsurface.
Bashir, Ali; Bansal, Vikas; Bafna, Vineet
2010-06-18
Massively parallel DNA sequencing technologies have enabled the sequencing of several individual human genomes. These technologies are also being used in novel ways for mRNA expression profiling, genome-wide discovery of transcription-factor binding sites, small RNA discovery, etc. The multitude of sequencing platforms, each with its unique characteristics, poses a number of design challenges regarding the technology to be used and the depth of sequencing required for a particular sequencing application. Here we describe a number of analytical and empirical results to address design questions for two applications: detection of structural variations from paired-end sequencing and estimating mRNA transcript abundance. For structural variation, our results provide explicit trade-offs between the detection and resolution of rearrangement breakpoints, and the optimal mix of paired-read insert lengths. Specifically, we prove that optimal detection and resolution of breakpoints is achieved using a mix of exactly two insert library lengths. Furthermore, we derive explicit formulae to determine these insert length combinations, enabling a 15% improvement in breakpoint detection at the same experimental cost. On empirical short read data, these predictions show good concordance with Illumina 200 bp and 2 Kbp insert length libraries. For transcriptome sequencing, we determine the sequencing depth needed to detect rare transcripts from a small pilot study. With only 1 million reads, we derive corrections that enable almost perfect prediction of the underlying expression probability distribution, and use this to predict the sequencing depth required to detect low expressed genes with greater than 95% probability. Together, our results form a generic framework for many design considerations related to high-throughput sequencing. 
We provide software tools (http://bix.ucsd.edu/projects/NGS-DesignTools) to derive platform-independent guidelines for designing sequencing experiments (amount of sequencing, choice of insert length, mix of libraries) for novel applications of next-generation sequencing.
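The depth question for rare transcripts can be illustrated with a simple independent-sampling model, which is an assumption here and not the corrected estimator the abstract describes. If each read comes from a given transcript with probability p, the chance of seeing at least one read in N reads is 1 - (1 - p)^N, so the depth needed for a target probability t is N >= ln(1 - t) / ln(1 - p). The function names below are hypothetical.

```python
import math

def detection_probability(p, n_reads):
    """Probability of sampling a transcript at least once in n_reads reads."""
    return 1.0 - (1.0 - p) ** n_reads

def reads_needed(p, target=0.95):
    """Smallest read count giving at least `target` detection probability."""
    # log1p(-p) computes ln(1 - p) accurately when p is very small.
    return math.ceil(math.log(1.0 - target) / math.log1p(-p))

# Example: a transcript expected to contribute one read per million.
depth = reads_needed(1e-6)
```

In practice p is unknown and must be estimated from pilot data, which is the part the abstract's corrections address.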
Detection of microRNAs in color space.
Marco, Antonio; Griffiths-Jones, Sam
2012-02-01
Deep sequencing provides inexpensive opportunities to characterize the transcriptional diversity of known genomes. The AB SOLiD technology generates millions of short sequencing reads in color-space; that is, the raw data is a sequence of colors, where each color represents 2 nt and each nucleotide is represented by two consecutive colors. This strategy is purported to have several advantages, including increased ability to distinguish sequencing errors from polymorphisms. Several programs have been developed to map short reads to genomes in color space. However, a number of previously unexplored technical issues arise when using SOLiD technology to characterize microRNAs. Here we explore these technical difficulties. First, since the sequenced reads are longer than the biological sequences, every read is expected to contain linker fragments. The color-calling error rate increases toward the 3′ end of the read such that recognizing the linker sequence for removal becomes problematic. Second, mapping in color space may lead to the loss of the first nucleotide of each read. We propose a sequential trimming and mapping approach to map small RNAs. Using our strategy, we reanalyze three published insect small RNA deep sequencing datasets and characterize 22 new microRNAs. A bash shell script to perform the sequential trimming and mapping procedure, called SeqTrimMap, is available at: http://www.mirbase.org/tools/seqtrimmap/. Contact: antonio.marco@manchester.ac.uk. Supplementary data are available at Bioinformatics online.
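The two ideas in this abstract, two-base color encoding and sequential trimming before mapping, can be sketched as follows. The encoding uses the standard SOLiD scheme in which a color is the XOR of the 2-bit codes of adjacent bases (A=0, C=1, G=2, T=3). The trimming function is only an assumed reading of "sequential trimming and mapping": it shortens the read from the 3′ end one position at a time and uses exact string search in nucleotide space as a stand-in for a real color-space mapper. It is not the SeqTrimMap script, and all names and the `min_len` default are hypothetical.

```python
CODE = {"A": 0, "C": 1, "G": 2, "T": 3}
BASE = "ACGT"

def encode(seq):
    """Translate a nucleotide sequence into colors, one per adjacent base pair."""
    return "".join(str(CODE[a] ^ CODE[b]) for a, b in zip(seq, seq[1:]))

def decode(first_base, colors):
    """Recover nucleotides from colors, given a known first base."""
    bases = [first_base]
    for color in colors:
        bases.append(BASE[CODE[bases[-1]] ^ int(color)])
    return "".join(bases)

def sequential_trim_map(read, reference, min_len=18):
    """Trim the 3' end step by step until the remaining prefix is found in the reference.

    Returns (position, mapped_length), or None if nothing maps at min_len or longer.
    """
    for length in range(len(read), min_len - 1, -1):
        position = reference.find(read[:length])
        if position != -1:
            return position, length
    return None

colors = encode("ATGC")
reference = "CCCC" + "ACGT" * 5 + "CCCC"
read_with_linker = "ACGT" * 5 + "TTTGG"  # toy insert followed by a toy linker
hit = sequential_trim_map(read_with_linker, reference)
```

Decoding needs a known first base, and a single miscalled color changes every base decoded after it, which is why linker recognition near the error-prone 3′ end is difficult.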
Branton, William G.; Ellestad, Kristofor K.; Maingat, Ferdinand; Wheatley, B. Matt; Rud, Erling; Warren, René L.; Holt, Robert A.; Surette, Michael G.; Power, Christopher
2013-01-01
The brain is assumed to be a sterile organ in the absence of disease although the impact of immune disruption is uncertain in terms of brain microbial diversity or quantity. To investigate microbial diversity and quantity in the brain, the profile of infectious agents was examined in pathologically normal and abnormal brains from persons with HIV/AIDS [HIV] (n = 12), other disease controls [ODC] (n = 14) and in cerebral surgical resections for epilepsy [SURG] (n = 6). Deep sequencing of cerebral white matter-derived RNA from the HIV (n = 4) and ODC (n = 4) patients and SURG (n = 2) groups revealed bacterially-encoded 16S RNA sequences in all brain specimens with α-proteobacteria representing over 70% of bacterial sequences while the other 30% of bacterial classes varied widely. Bacterial rRNA was detected in white matter glial cells by in situ hybridization and peptidoglycan immunoreactivity was also localized principally in glia in human brains. Analyses of amplified bacterial 16S rRNA sequences disclosed that Proteobacteria was the principal bacterial phylum in all human brain samples with similar bacterial rRNA quantities in HIV and ODC groups despite increased host neuroimmune responses in the HIV group. Exogenous viruses including bacteriophage and human herpes viruses-4, -5 and -6 were detected variably in autopsied brains from both clinical groups. Brains from SIV- and SHIV-infected macaques displayed a profile of bacterial phyla also dominated by Proteobacteria but bacterial sequences were not detected in experimentally FIV-infected cat or RAG1−/− mouse brains. Intracerebral implantation of human brain homogenates into RAG1−/− mice revealed a preponderance of α-proteobacteria 16S RNA sequences in the brains of recipient mice at 7 weeks post-implantation, which was abrogated by prior heat-treatment of the brain homogenate. 
Thus, α-proteobacteria represented the major bacterial component of the primate brain’s microbiome regardless of underlying immune status, which could be transferred into naïve hosts leading to microbial persistence in the brain. PMID:23355888
Paparini, Andrea; Gofton, Alexander; Yang, Rongchang; White, Nicole; Bunce, Michael; Ryan, Una M
2015-01-01
Cryptosporidium is an important enteric pathogen that infects a wide range of humans and animals. Rapid and reliable detection and characterisation methods are essential for understanding the transmission dynamics of the parasite. Sanger sequencing, and high-throughput sequencing (HTS) on an Ion Torrent platform, were compared with each other for their sensitivity and accuracy in detecting and characterising 25 Cryptosporidium-positive human and animal faecal samples. Ion Torrent reads (n = 123,857) were obtained at both 18S rRNA and actin loci for 21 of the 25 samples. Of these, one isolate at the actin locus (Cattle 05) and three at the 18S rRNA locus (HTS 10, HTS 11 and HTS 12) suffered PCR drop-out (i.e. PCR failures) when using fusion-tagged PCR. Sanger sequences were obtained for both loci for 23 of the 25 samples and showed good agreement with Ion Torrent-based genotyping. Two samples, both from pythons (SK 02 and SK 05), produced mixed 18S and actin chromatograms by Sanger sequencing but were clearly identified by Ion Torrent sequencing as C. muris. One isolate (SK 03) was typed as C. muris by Sanger sequencing but was identified as a mixed C. muris and C. tyzzeri infection by HTS. 18S rRNA Type B sequences were identified in 4/6 C. parvum isolates when deep sequenced but were undetected in Sanger sequencing. Sanger was cheaper than Ion Torrent when sequencing a small number of samples, but when larger numbers of samples were considered (n = 60), the costs were comparable. Fusion-tagged amplicon-based approaches are a powerful way of approaching mixtures, the only drawback being the loss of PCR efficiency on low-template samples when using primers coupled to MID tags and adaptors. Taken together, these data show that HTS has excellent potential for revealing the "true" composition of species/types in a Cryptosporidium infection, but that HTS workflows need to be carefully developed to ensure sensitivity, accuracy and contamination are controlled. 
Copyright © 2015 Elsevier Inc. All rights reserved.
A deep learning framework for modeling structural features of RNA-binding protein targets
Zhang, Sai; Zhou, Jingtian; Hu, Hailin; Gong, Haipeng; Chen, Ligong; Cheng, Chao; Zeng, Jianyang
2016-01-01
RNA-binding proteins (RBPs) play important roles in the post-transcriptional control of RNAs. Identifying RBP binding sites and characterizing RBP binding preferences are key steps toward understanding the basic mechanisms of post-transcriptional gene regulation. Though numerous computational methods have been developed for modeling RBP binding preferences, discovering a complete structural representation of the RBP targets by integrating their available structural features in all three dimensions is still a challenging task. In this paper, we develop a general and flexible deep learning framework for modeling structural binding preferences and predicting binding sites of RBPs, which takes (predicted) RNA tertiary structural information into account for the first time. Our framework constructs a unified representation that characterizes the structural specificities of RBP targets in all three dimensions, which can be further used to predict novel candidate binding sites and discover potential binding motifs. Through testing on real CLIP-seq datasets, we have demonstrated that our deep learning framework can automatically extract effective hidden structural features from the encoded raw sequence and structural profiles, and predict accurate RBP binding sites. In addition, we have conducted the first study to show that integrating the additional RNA tertiary structural features can improve the model performance in predicting RBP binding sites, especially for the polypyrimidine tract-binding protein (PTB), which also provides new evidence to support the view that RBPs may have specific tertiary structural binding preferences. In particular, the tests on the internal ribosome entry site (IRES) segments yield satisfactory results with experimental support from the literature and further demonstrate the necessity of incorporating RNA tertiary structural information into the prediction model. 
The source code of our approach can be found in https://github.com/thucombio/deepnet-rbp. PMID:26467480
Porath, Hagit T.; Barak, Michal; Pinto, Yishay; Wachtel, Chaim; Zilberberg, Alona; Lerer-Goldshtein, Tali; Efroni, Sol; Levanon, Erez Y.; Appelbaum, Lior
2015-01-01
Fragile X syndrome (FXS) is the most frequent inherited form of mental retardation. The cause for this X-linked disorder is the silencing of the fragile X mental retardation 1 (fmr1) gene and the absence of the fragile X mental retardation protein (Fmrp). The RNA-binding protein Fmrp represses protein translation, particularly in synapses. In Drosophila, Fmrp interacts with the adenosine deaminase acting on RNA (Adar) enzymes. Adar enzymes convert adenosine to inosine (A-to-I) and modify the sequence of RNA transcripts. Utilizing the fmr1 zebrafish mutant (fmr1-/-), we studied Fmrp-dependent neuronal circuit formation, behavior, and Adar-mediated RNA editing. By combining behavior analyses and live imaging of single axons and synapses, we showed hyperlocomotor activity, as well as increased axonal branching and synaptic density, in fmr1-/- larvae. We identified thousands of clustered RNA editing sites in the zebrafish transcriptome and showed that Fmrp biochemically interacts with the Adar2a protein. The expression levels of the adar genes and Adar2 protein increased in fmr1-/- zebrafish. Microfluidic-based multiplex PCR coupled with deep sequencing showed a mild increase in A-to-I RNA editing levels in evolutionarily conserved neuronal and synaptic Adar-targets in fmr1-/- larvae. These findings suggest that loss of Fmrp results in increased Adar-mediated RNA editing activity on target-specific RNAs, which, in turn, might alter neuronal circuit formation and behavior in FXS. PMID:26637167
Piezophilic Bacteria Isolated from Sediment of the Shimokita Coalbed, Japan
NASA Astrophysics Data System (ADS)
Fang, J.; Kato, C.; Hori, T.; Morono, Y.; Inagaki, F.
2013-12-01
The Earth is a cold planet as well as a pressurized planet, hosting both the surface biosphere and the deep biosphere. Pressure ranges over four orders of magnitude in the surface biosphere and probably more in the deep biosphere. Pressure is an important thermodynamic property of the deep biosphere that affects microbial physiology and biochemistry. Bacteria that require high-pressure conditions for optimal growth are called piezophilic bacteria. Subseafloor marine sediments are one of the most extensive microbial habitats on Earth. Marine sediments cover more than two-thirds of the Earth's surface, and represent a major part of the deep biosphere. Owing to its vast size and intimate connection with the surface biosphere, particularly the oceans, the deep biosphere has enormous potential for influencing global-scale biogeochemical processes, including energy, climate, carbon and nutrient cycles. Therefore, studying piezophilic bacteria of the deep biosphere has important implications in increasing our understanding of global biogeochemical cycles, the interactions between the biosphere and the geosphere, and the evolution of life. Sediment samples were obtained during IODP Expedition 337, from 1498 meters below sea floor (mbsf) (Sample 6R-3), 1951~1999 mbsf (19R-1~25R-3; coalbed mix), and 2406 mbsf (29R-7). The samples were mixed with MB2216 growth medium and cultivated under anaerobic conditions at 35 MPa (megapascal) pressure. Growth temperatures were adjusted to in situ environmental conditions: 35°C for 6R-3, 45°C for 19R-1~25R-3, and 55°C for 29R-7. The cultivation was performed three times, for 30 days each time. Microbial cells were obtained and the total DNA was extracted. At the same time, isolation of microbes was also performed under anaerobic conditions. Microbial communities in the coalbed sediment were analyzed by cloning, sequencing, and terminal restriction fragment length polymorphism (t-RFLP) of 16S ribosomal RNA genes. 
From the partial 16S rRNA gene sequences, we have identified abundant Alkalibacterium sp. in 6R-3 and 29R-7 at the first HP cultivation. We also identified Haloactibacillus sp. in 6R-3 and Anoxybacillus related sp. in 19R-1~25R-3 at the third HP cultivation. These microorganisms are likely piezophiles and play an important role in degradation of sedimentary organic matter and production of microbial metabolites sustaining the deep microbial ecosystem in the Shimokita Coalbed. The complete 16S sequencing and isolation of piezophiles are now ongoing.
Zheng, Ling-Ling; Xu, Wei-Lin; Liu, Shun; Sun, Wen-Ju; Li, Jun-Hao; Wu, Jie; Yang, Jian-Hua; Qu, Liang-Hu
2016-07-08
tRNA-derived small RNA fragments (tRFs) are one class of small non-coding RNAs derived from transfer RNAs (tRNAs). tRFs play important roles in cellular processes and are involved in multiple cancers. High-throughput small RNA (sRNA) sequencing experiments can detect all the cellular expressed sRNAs, including tRFs. However, distinguishing genuine tRFs from RNA fragments generated by random degradation remains a major challenge. In this study, we developed an integrated web-based computing system, tRF2Cancer, to accurately identify tRFs from sRNA deep-sequencing data and evaluate their expression in multiple cancers. The binomial test was introduced to evaluate whether reads from a small RNA-seq data set represent tRFs or degraded fragments. A classification method was then used to annotate the types of tRFs based on their sites of origin in pre-tRNA or mature tRNA. We applied the pipeline to analyze 10 991 data sets from 32 types of cancers and identified thousands of expressed tRFs. A tool called 'tRFinCancer' was developed to help users inspect the expression of tRFs across different types of cancers. Another tool called 'tRFBrowser' shows both the sites of origin and the distribution of chemical modification sites in tRFs on their source tRNA. The tRF2Cancer web server is available at http://rna.sysu.edu.cn/tRFfinder/. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.
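The abstract does not spell out how the binomial test is set up, so the sketch below shows one plausible form under stated assumptions: if fragments arose from random degradation, read ends would fall uniformly over the L positions of a tRNA, so a given position would receive each read with probability 1/L. A one-sided binomial tail then asks how surprising it is to see k of n reads at one position. The null model, the function names, and the example numbers are all hypothetical.

```python
import math

def binomial_tail(k, n, p):
    """P(X >= k) for X ~ Binomial(n, p)."""
    return sum(math.comb(n, i) * p**i * (1 - p) ** (n - i) for i in range(k, n + 1))

def position_p_value(reads_at_position, total_reads, trna_length):
    """One-sided p-value for read enrichment at one position under uniform degradation."""
    return binomial_tail(reads_at_position, total_reads, 1.0 / trna_length)

# Invented example: 40 of 100 reads share one 5' end on a 75-nt tRNA.
p_value = position_p_value(40, 100, 75)
```

A very small p-value would argue against random degradation at that position; a real pipeline would also need multiple-testing correction across positions.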
Zhang, Pengpeng; Xu, Haixia; Li, Rui; Wu, Wei; Chao, Zhe; Li, Cencen; Xia, Wei; Wang, Lei; Yang, Jinzeng; Xu, Yongjie
2018-06-01
Myoblast differentiation is a highly complex process that is regulated by proteins as well as by non-coding RNAs. Circular RNAs have been identified as an emerging new class of non-coding RNA in the modulation of skeletal muscle development, whereas their expression profiles and functional regulation in myoblast differentiation remain unknown. In the present study, we performed deep RNA-sequencing of C2C12 myoblasts during cell differentiation and uncovered 37,751 unique circular RNAs derived from 6943 hosting genes. The ensuing qRT-PCR and RNA fluorescence in situ hybridization verification were carried out to confirm the RNA-sequencing results. An unbiased analysis demonstrated dynamic circular RNA expression changes in the process of myoblast differentiation, and the circular RNA abundances were independent from their cognate linear RNAs. Gene ontology analysis showed that many down-regulated circular RNAs were exclusive to cell division and the cell cycle, whereas up-regulated circular RNAs were related to the cell development process. Furthermore, interaction networks of circular RNA-microRNA were constructed. Several microRNAs well-known for myoblast regulation, such as miR-133, miR-24 and miR-23a, were in this network. In summary, this study showed that circular RNA expression dynamics changed during myoblast differentiation. Circular RNAs play a role in regulating the myoblast cell cycle and development by acting as microRNA binding sites to facilitate their regulation of gene expression during myoblast differentiation. These findings open a new avenue for future investigation of this emerging RNA class in skeletal muscle growth and development. Copyright © 2018 Elsevier Ltd. All rights reserved.
Jain, Mukesh; Chevala, V V S Narayana; Garg, Rohini
2014-11-01
MicroRNAs (miRNAs) are essential components of complex gene regulatory networks that orchestrate plant development. Although several genomic resources have been developed for the legume crop chickpea, miRNAs have not been discovered until now. For genome-wide discovery of miRNAs in chickpea (Cicer arietinum), we sequenced the small RNA content from seven major tissues/organs employing Illumina technology. About 154 million reads were generated, which represented more than 20 million distinct small RNA sequences. We identified a total of 440 conserved miRNAs in chickpea based on sequence similarity with known miRNAs in other plants. In addition, 178 novel miRNAs were identified using a miRDeep pipeline with plant-specific scoring. Some of the conserved and novel miRNAs with significant sequence similarity were grouped into families. The chickpea miRNAs targeted a wide range of mRNAs involved in diverse cellular processes, including transcriptional regulation (transcription factors), protein modification and turnover, signal transduction, and metabolism. Our analysis revealed several miRNAs with differential spatial expression. Many of the chickpea miRNAs were expressed in a tissue-specific manner. The conserved and differential expression of members of the same miRNA family in different tissues was also observed. Some of the same family members were predicted to target different chickpea mRNAs, which suggested the specificity and complexity of miRNA-mediated developmental regulation. This study, for the first time, reveals a comprehensive set of conserved and novel miRNAs along with their expression patterns and putative targets in chickpea, and provides a framework for understanding regulation of developmental processes in legumes. © The Author 2014. Published by Oxford University Press on behalf of the Society for Experimental Biology.
Zhang, Yan-Qiong; Chen, Dong-Liang; Tian, Hai-Feng; Zhang, Bao-Hong; Wen, Jian-Fan
2009-10-01
Using a combined computational program, we identified 50 potential microRNAs (miRNAs) in Giardia lamblia, one of the most primitive unicellular eukaryotes. These miRNAs are unique to G. lamblia and no homologues have been found in other organisms; miRNAs, currently known in other species, were not found in G. lamblia. This suggests that miRNA biogenesis and miRNA-mediated gene regulation pathway may evolve independently, especially in evolutionarily distant lineages. A majority (43) of the predicted miRNAs are located at one single locus; however, some miRNAs have two or more copies in the genome. Among the 58 miRNA genes, 28 are located in the intergenic regions whereas 30 are present in the anti-sense strands of the protein-coding sequences. Five predicted miRNAs are expressed in G. lamblia trophozoite cells evidenced by expressed sequence tags or RT-PCR. Thirty-seven identified miRNAs may target 50 protein-coding genes, including seven variant-specific surface proteins (VSPs). Our findings provide a clue that miRNA-mediated gene regulation may exist in the early stage of eukaryotic evolution, suggesting that it is an important regulation system ubiquitous in eukaryotes.
Liu, Maoyan; Liu, Xiangning; Li, Xun; Zhang, Deyong; Dai, Liangyin; Tang, Qianjun
2016-03-01
The genome sequence of pepper vein yellows virus (PeVYV) (PeVYV-HN, accession number KP326573), isolated from pepper plants (Capsicum annuum L.) grown at the Hunan Vegetables Institute (Changsha, Hunan, China), was determined by deep sequencing of small RNAs. The PeVYV-HN genome consists of 6244 nucleotides, contains six open reading frames (ORFs), and is similar to that of an isolate (AB594828) from Japan. Its genomic organization is similar to that of members of the genus Polerovirus. Sequence analysis revealed that PeVYV-HN shared 92% sequence identity with the Japanese PeVYV genome at both the nucleotide and amino acid levels. Evolutionary analysis based on the coat protein (CP), movement protein (MP), and RNA-dependent RNA polymerase (RdRP) showed that PeVYV could be divided into two major lineages corresponding to their geographical origins. The Asian isolates have a higher population expansion frequency than the African isolates. Negative selection and genetic drift (founder effect) were found to be the potential drivers of the molecular evolution of PeVYV. Moreover, recombination was not the distinct cause of PeVYV evolution. This is the first report of a complete genomic sequence of PeVYV in China.
Evaluation of non-coding variation in GLUT1 deficiency.
Liu, Yu-Chi; Lee, Jia Wei Audrey; Bellows, Susannah T; Damiano, John A; Mullen, Saul A; Berkovic, Samuel F; Bahlo, Melanie; Scheffer, Ingrid E; Hildebrand, Michael S
2016-12-01
Loss-of-function mutations in SLC2A1, encoding glucose transporter-1 (GLUT-1), lead to dysfunction of glucose transport across the blood-brain barrier. Ten percent of cases with hypoglycorrhachia (fasting cerebrospinal fluid [CSF] glucose <2.2mmol/L) do not have mutations. We hypothesized that GLUT1 deficiency could be due to non-coding SLC2A1 variants. We performed whole exome sequencing of one proband with a GLUT1 phenotype and hypoglycorrhachia negative for SLC2A1 sequencing and copy number variants. We studied a further 55 patients with different epilepsies and low CSF glucose who did not have exonic mutations or copy number variants. We sequenced non-coding promoter and intronic regions. We performed mRNA studies for the recurrent intronic variant. The proband had a de novo splice site mutation five base pairs from the intron-exon boundary. Three of 55 patients had deep intronic SLC2A1 variants, including a recurrent variant in two. The recurrent variant produced less SLC2A1 mRNA transcript. Fasting CSF glucose levels show an age-dependent correlation, which makes the definition of hypoglycorrhachia challenging. Low CSF glucose levels may be associated with pathogenic SLC2A1 mutations including deep intronic SLC2A1 variants. Extending genetic screening to non-coding regions will enable diagnosis of more patients with GLUT1 deficiency, allowing implementation of the ketogenic diet to improve outcomes. © 2016 Mac Keith Press.
Harrison, B. K.; Bailey, J. V.
2013-12-01
Sediment horizons represent a significant - but not permanent - barrier to microbial transport. Cells commonly attach to mineral surfaces in unconsolidated sediments. However, by taxis, growth, or passive migration under advecting fluids, some portion of the microbial community may transgress sedimentary boundaries. Few studies have attempted to constrain such transport of community signatures in the marine subsurface and its potential impact on biogeography. Integrated Ocean Drilling Program (IODP) Expedition 337 off the Shimokita Peninsula recovered sediments over a greater than 1km interval representing a gradual decrease of terrestrial influence, from tidal to continental shelf depositional settings. This sequence represents a key opportunity to link subsurface microbial communities to lithological variability and investigate the permanence of community signatures characteristic of distinct depositional regimes. The phylogenetic connectivity between marine and terrestrially-influenced deposits may demonstrate to what degree sediments offer a substantial barrier to cell transport in the subsurface. Previous work has demonstrated that the Actinobacterial phylum is broadly distributed in marine sediments (Maldonado et al., 2005), present and active in the deep subsurface (Orsi et al., 2013), and that marine and terrestrial lineages may potentially be distinguished by 16S rRNA gene sequencing (e.g. Prieto-Davó et al., 2013). We report on Actinobacteria-specific 16S rRNA gene diversity recovered between 1370 and 2642 mbsf with high-throughput sequencing using the Illumina MiSeq platform, as well as selective assembly and analysis of environmental clone libraries.
Wilson, M R; Zimmermann, L L; Crawford, E D; Sample, H A; Soni, P R; Baker, A N; Khan, L M; DeRisi, J L
2017-03-01
Solid organ transplant patients are vulnerable to suffering neurologic complications from a wide array of viral infections and can be sentinels in the population who are first to get serious complications from emerging infections like the recent waves of arboviruses, including West Nile virus, Chikungunya virus, Zika virus, and Dengue virus. The diverse and rapidly changing landscape of possible causes of viral encephalitis poses great challenges for traditional candidate-based infectious disease diagnostics that already fail to identify a causative pathogen in approximately 50% of encephalitis cases. We present the case of a 14-year-old girl on immunosuppression for a renal transplant who presented with acute meningoencephalitis. Traditional diagnostics failed to identify an etiology. RNA extracted from her cerebrospinal fluid was subjected to unbiased metagenomic deep sequencing, enhanced with the use of a Cas9-based technique for host depletion. This analysis identified West Nile virus (WNV). Convalescent serum serologies subsequently confirmed WNV seroconversion. These results support a clear clinical role for metagenomic deep sequencing in the setting of suspected viral encephalitis, especially in the context of the high-risk transplant patient population. © 2016 The Authors. American Journal of Transplantation published by Wiley Periodicals, Inc. on behalf of American Society of Transplant Surgeons.
Lee, Myunggyo; Lee, Kyubum; Yu, Namhee; Jang, Insu; Choi, Ikjung; Kim, Pora; Jang, Ye Eun; Kim, Byounggun; Kim, Sunkyu; Lee, Byungwook; Kang, Jaewoo; Lee, Sanghyuk
2017-01-04
Fusion genes are an important class of therapeutic targets and prognostic markers in cancer. ChimerDB is a comprehensive database of fusion genes encompassing analysis of deep sequencing data and manual curation. In this update, the database coverage was enhanced considerably by adding two new modules: The Cancer Genome Atlas (TCGA) RNA-Seq analysis and PubMed abstract mining. ChimerDB 3.0 is composed of three modules: ChimerKB, ChimerPub and ChimerSeq. ChimerKB is a knowledgebase of 1066 manually curated fusion genes compiled from public resources of fusion genes with experimental evidence. ChimerPub includes 2767 fusion genes obtained from text mining of PubMed abstracts. The ChimerSeq module is designed to archive fusion candidates from deep sequencing data. Importantly, we have analyzed RNA-Seq data of the TCGA project covering 4569 patients in 23 cancer types using two reliable programs, FusionScan and TopHat-Fusion. The new user interface supports diverse search options and graphic representation of fusion gene structure. ChimerDB 3.0 is available at http://ercsb.ewha.ac.kr/fusiongene/. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.
Ou-Yang, Fangqian; Luo, Qing-Jun; Zhang, Yue; Richardson, Casey R.; Jiang, Yingwen; Rock, Christopher D.
2013-01-01
microRNAs (miRNAs) are a class of small RNAs (sRNAs) of ~21 nucleotides (nt) in length processed from foldback hairpins by DICER-LIKE1 (DCL1) or DCL4. They regulate the expression of target mRNAs by base pairing through RNA-Induced Silencing Complex (RISC). In the RISC, ARGONAUTE1 (AGO1) is the key protein that cleaves miRNA targets at position ten of a miRNA:target duplex. The authenticity of many annotated rice miRNA hairpins is under debate because of their homology to repeat sequences. Some of them, like miR1884b, have been removed from the current release of miRBase based on incomplete information. In this study, we investigated the association of transposable element (TE)-derived miRNAs with typical miRNA pathways (DCL1/4- and AGO1-dependent) using publicly available deep sequencing datasets. Seven miRNA hairpins with 13 unique sRNAs were specifically enriched in AGO1 immunoprecipitation samples and relatively reduced in DCL1/4 knockdown genotypes. Interestingly, these species are ~21-nt long, instead of 24-nt as annotated in miRBase and the literature. Their expression profiles meet current criteria for functional annotation of miRNAs. In addition, diagnostic cleavage tags were found in degradome datasets for predicted target mRNAs. Most of these miRNA hairpins share significant homology with miniature inverted-repeat transposable elements (MITEs), one type of abundant DNA transposons in rice. Finally, the root-specific production of a 24 nt miRNA-like sRNA was confirmed by RNA blot for a novel EST that maps to the 3'-UTR of a candidate pseudogene showing extensive sequence homology to miR1884b hairpin. Our data are consistent with the hypothesis that TEs can serve as a driving force for the evolution of some MIRNAs, where co-opting of DICER-LIKE1/4 processing and integration into AGO1 could exapt transcribed TE-associated hairpins into typical miRNA pathways. PMID:23420033
Genome analysis of the platypus reveals unique signatures of evolution.
Warren, Wesley C; Hillier, LaDeana W; Marshall Graves, Jennifer A; Birney, Ewan; Ponting, Chris P; Grützner, Frank; Belov, Katherine; Miller, Webb; Clarke, Laura; Chinwalla, Asif T; Yang, Shiaw-Pyng; Heger, Andreas; Locke, Devin P; Miethke, Pat; Waters, Paul D; Veyrunes, Frédéric; Fulton, Lucinda; Fulton, Bob; Graves, Tina; Wallis, John; Puente, Xose S; López-Otín, Carlos; Ordóñez, Gonzalo R; Eichler, Evan E; Chen, Lin; Cheng, Ze; Deakin, Janine E; Alsop, Amber; Thompson, Katherine; Kirby, Patrick; Papenfuss, Anthony T; Wakefield, Matthew J; Olender, Tsviya; Lancet, Doron; Huttley, Gavin A; Smit, Arian F A; Pask, Andrew; Temple-Smith, Peter; Batzer, Mark A; Walker, Jerilyn A; Konkel, Miriam K; Harris, Robert S; Whittington, Camilla M; Wong, Emily S W; Gemmell, Neil J; Buschiazzo, Emmanuel; Vargas Jentzsch, Iris M; Merkel, Angelika; Schmitz, Juergen; Zemann, Anja; Churakov, Gennady; Kriegs, Jan Ole; Brosius, Juergen; Murchison, Elizabeth P; Sachidanandam, Ravi; Smith, Carly; Hannon, Gregory J; Tsend-Ayush, Enkhjargal; McMillan, Daniel; Attenborough, Rosalind; Rens, Willem; Ferguson-Smith, Malcolm; Lefèvre, Christophe M; Sharp, Julie A; Nicholas, Kevin R; Ray, David A; Kube, Michael; Reinhardt, Richard; Pringle, Thomas H; Taylor, James; Jones, Russell C; Nixon, Brett; Dacheux, Jean-Louis; Niwa, Hitoshi; Sekita, Yoko; Huang, Xiaoqiu; Stark, Alexander; Kheradpour, Pouya; Kellis, Manolis; Flicek, Paul; Chen, Yuan; Webber, Caleb; Hardison, Ross; Nelson, Joanne; Hallsworth-Pepin, Kym; Delehaunty, Kim; Markovic, Chris; Minx, Pat; Feng, Yucheng; Kremitzki, Colin; Mitreva, Makedonka; Glasscock, Jarret; Wylie, Todd; Wohldmann, Patricia; Thiru, Prathapan; Nhan, Michael N; Pohl, Craig S; Smith, Scott M; Hou, Shunfeng; Nefedov, Mikhail; de Jong, Pieter J; Renfree, Marilyn B; Mardis, Elaine R; Wilson, Richard K
2008-05-08
We present a draft genome sequence of the platypus, Ornithorhynchus anatinus. This monotreme exhibits a fascinating combination of reptilian and mammalian characters. For example, platypuses have a coat of fur adapted to an aquatic lifestyle; platypus females lactate, yet lay eggs; and males are equipped with venom similar to that of reptiles. Analysis of the first monotreme genome aligned these features with genetic innovations. We find that reptile and platypus venom proteins have been co-opted independently from the same gene families; milk protein genes are conserved despite platypuses laying eggs; and immune gene family expansions are directly related to platypus biology. Expansions of protein, non-protein-coding RNA and microRNA families, as well as repeat elements, are identified. Sequencing of this genome now provides a valuable resource for deep mammalian comparative analyses, as well as for monotreme biology and conservation. PMID:18464734
Sheik, Cody S.; Reese, Brandi Kiel; Twing, Katrina I.; Sylvan, Jason B.; Grim, Sharon L.; Schrenk, Matthew O.; Sogin, Mitchell L.; Colwell, Frederick S.
2018-01-01
Earth’s subsurface environment is one of the largest, yet least studied, biomes on Earth, and many questions remain regarding what microorganisms are indigenous to the subsurface. Through the activity of the Census of Deep Life (CoDL) and the Deep Carbon Observatory, an open access 16S ribosomal RNA gene sequence database from diverse subsurface environments has been compiled. However, due to low quantities of biomass in the deep subsurface, the potential for incorporation of contaminants from reagents used during sample collection, processing, and/or sequencing is high. Thus, to understand the ecology of subsurface microorganisms (i.e., the distribution, richness, or survival), it is necessary to minimize, identify, and remove contaminant sequences that will skew the relative abundances of all taxa in the sample. In this meta-analysis, we identify putative contaminants associated with the CoDL dataset, recommend best practices for removing contaminants from samples, and propose a series of best practices for subsurface microbiology sampling. The most abundant putative contaminant genera observed, independent of evenness across samples, were Propionibacterium, Aquabacterium, Ralstonia, and Acinetobacter, while the top five most frequently observed genera were Pseudomonas, Propionibacterium, Acinetobacter, Ralstonia, and Sphingomonas. The majority of the most frequently observed genera (high evenness) were associated with reagent or potential human contamination. Additionally, in DNA extraction blanks, we observed potential archaeal contaminants, including methanogens, which have not been discussed in previous contamination studies. Such contaminants would directly affect the interpretation of subsurface molecular studies, as methanogenesis is an important subsurface biogeochemical process. 
Utilizing previously identified contaminant genera, we found that ∼27% of the total dataset were identified as contaminant sequences that likely originate from DNA extraction and DNA cleanup methods. Thus, controls must be taken at every step of the collection and processing procedure when working with low biomass environments such as, but not limited to, portions of Earth’s deep subsurface. Taken together, we stress that the CoDL dataset is an incredible resource for the broader research community interested in subsurface life, and steps to remove contamination derived sequences must be taken prior to using this dataset. PMID:29780369
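The filtering step described above (dropping reads assigned to previously identified contaminant genera and reporting what share of the data they made up) can be sketched roughly as follows. The function name `remove_contaminants`, the per-genus count table, and the example counts are hypothetical; the genus names come from the abstract, and the CoDL pipeline itself is not reproduced here.

```python
def remove_contaminants(genus_counts, contaminant_genera):
    """Drop contaminant genera and report the fraction of reads removed.

    genus_counts: dict mapping genus name -> read count (assumed input shape).
    contaminant_genera: set of genus names treated as contaminants.
    """
    kept = {g: c for g, c in genus_counts.items() if g not in contaminant_genera}
    total = sum(genus_counts.values())
    removed = total - sum(kept.values())
    fraction_removed = removed / total if total else 0.0
    return kept, fraction_removed


# Hypothetical read counts for one sample; "Thermus" stands in for a genus
# that is not on the contaminant list.
counts = {"Pseudomonas": 20, "Ralstonia": 7, "Thermus": 73}
contaminants = {"Pseudomonas", "Propionibacterium", "Acinetobacter",
                "Ralstonia", "Sphingomonas", "Aquabacterium"}
kept, fraction = remove_contaminants(counts, contaminants)
print(kept, round(fraction, 2))
```

Relative abundances should be recomputed from `kept` after filtering, since removing contaminant reads changes every remaining taxon's share of the sample.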
Li, Yongqiang; Deng, Congliang; Bian, Yong; Zhao, Xiaoli; Zhou, Qi
2017-04-01
Apple stem grooving virus (ASGV), apple chlorotic leaf spot virus (ACLSV), and prunus necrotic ringspot virus (PNRSV) were identified in a crab apple tree by small RNA deep sequencing. The complete genome sequence of ACLSV isolate BJ (ACLSV-BJ) was 7554 nucleotides and shared 67.0%-83.0% nucleotide sequence identity with other ACLSV isolates. A phylogenetic tree based on the complete genome sequence of all available ACLSV isolates showed that ACLSV-BJ clustered with the isolates SY01 from hawthorn, MO5 from apple, and JB, KMS and YH from pear. The complete nucleotide sequence of ASGV-BJ was 6509 nucleotides (nt) long and shared 78.2%-80.7% nucleotide sequence identity with other isolates. ASGV-BJ and the isolate ASGV_kfp clustered together in the phylogenetic tree as an independent clade. Recombination analysis showed that isolate ASGV-BJ was a naturally occurring recombinant.
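The pairwise nucleotide identities quoted above depend on a convention for handling alignment gaps, which the abstract does not give. As a minimal sketch, assuming the two sequences are already aligned to equal length and that columns gapped in both are ignored while columns gapped in only one count as mismatches, the percentage could be computed like this (`percent_identity` is an illustrative name, not a tool used by the authors):

```python
def percent_identity(aligned_a, aligned_b):
    """Percent identity between two pre-aligned sequences ('-' marks a gap).

    Assumed convention: columns gapped in both sequences are skipped;
    columns gapped in only one sequence count as mismatches.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("sequences must be aligned to the same length")
    compared = matches = 0
    for x, y in zip(aligned_a.upper(), aligned_b.upper()):
        if x == "-" and y == "-":
            continue  # shared gap column carries no information
        compared += 1
        if x == y:
            matches += 1
    if compared == 0:
        raise ValueError("no comparable columns")
    return 100.0 * matches / compared


# Toy alignment: 4 of 5 compared columns match.
print(percent_identity("ACGT-A", "ACGA-A"))
```

Other conventions (for example, excluding every gapped column from the denominator) give slightly different percentages, so identities from different tools are not always directly comparable.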
2010-01-01
Background: Bathymodiolus azoricus is a deep-sea hydrothermal vent mussel found in association with large faunal communities living in chemosynthetic environments at the bottom of the sea floor near the Azores Islands. Investigation of the exceptional physiological reactions that vent mussels have adopted in their habitat, including responses to environmental microbes, remains a difficult challenge for deep-sea biologists. In an attempt to reveal genes potentially involved in deep-sea mussel innate immunity, we carried out a high-throughput sequence analysis of the transcriptome of freshly collected B. azoricus, using gill tissue as the primary source of immune transcripts given its strategic role in filtering potentially infectious waterborne microorganisms. Additionally, a substantial EST data set was produced, from which a comprehensive collection of genes coding for putative proteins was organized in a dedicated database, "DeepSeaVent", the first deep-sea vent animal transcriptome database based on 454 pyrosequencing technology. Results: A normalized cDNA library from gill tissue was sequenced in a full 454 GS-FLX run, producing 778,996 sequencing reads. Assembly of the high quality reads resulted in 75,407 contigs of which 3,071 were singletons. A total of 39,425 transcripts were conceptually translated into amino acid sequences, of which 22,023 matched known proteins in the NCBI non-redundant protein database, 15,839 revealed conserved protein domains through InterPro functional classification and 9,584 were assigned Gene Ontology terms. Queries conducted within the database enabled the identification of genes putatively involved in immune and inflammatory reactions which had not been previously evidenced in the vent mussel. Their physical counterpart was confirmed by semi-quantitative Reverse-Transcription Polymerase Chain Reaction (RT-PCR) and their RNA transcription level by quantitative PCR (qPCR) experiments. 
Conclusions: We have established the first tissue transcriptional analysis of a deep-sea hydrothermal vent animal and generated a searchable catalog of genes that provides a direct method of identifying and retrieving vast numbers of novel coding sequences, which can be applied in gene expression profiling experiments from a non-conventional model organism. This provides the most comprehensive sequence resource for identifying novel genes currently available for a deep-sea vent organism, in particular, genes putatively involved in immune and inflammatory reactions in vent mussels. The characterization of the B. azoricus transcriptome will facilitate research into biological processes underlying physiological adaptations to hydrothermal vent environments and will provide a basis for expanding our understanding of genes putatively involved in adaptation processes during post-capture long-term acclimatization experiments, at "sea-level" conditions, using B. azoricus as a model organism. PMID:20937131
MicroRNA and Transcription Factor: Key Players in Plant Regulatory Network.
Samad, Abdul F A; Sajad, Muhammad; Nazaruddin, Nazaruddin; Fauzi, Izzat A; Murad, Abdul M A; Zainal, Zamri; Ismail, Ismanizan
2017-01-01
Recent achievements in the study of plant microRNAs (miRNAs), a large class of small non-coding RNAs, are very exciting. A wide array of techniques involving forward genetics, molecular cloning, bioinformatic analysis, and the latest technology, deep sequencing, have greatly advanced miRNA discovery. A tiny miRNA sequence has the ability to target single or multiple mRNAs. Most miRNA targets are transcription factors (TFs), which are of paramount importance in regulating plant growth and development. Various families of TFs, which regulate a range of regulatory networks, may assist plants to grow under normal and stress environmental conditions. This review focuses on the regulatory relationships between miRNAs and different families of TFs, such as NF-Y, MYB, AP2, TCP, WRKY, NAC, GRF, and SPL. For instance, NF-Y plays an important role in drought tolerance and flower development; MYB is involved in signal transduction and biosynthesis of secondary metabolites; AP2 regulates floral development and nodule formation; TCP directs leaf development and growth hormone signaling; WRKY has known roles in multiple stress tolerances; NAC regulates lateral root formation; GRF is involved in root growth and in flower and seed development; and SPL regulates the plant transition from juvenile to adult. We also studied the relation between miRNAs and TFs by consolidating the research findings from different plant species, which will help plant scientists in understanding the mechanism of action and interaction between these regulators in plant growth and development under normal and stress environmental conditions.
Guo, Chuanyu; Cui, Huachun; Ni, Songwei; Yan, Yang; Qin, Qiwei
2015-10-01
microRNAs (miRNAs) are an evolutionarily conserved class of non-coding RNA molecules that participate in various biological processes. The use of high-throughput screening strategies has greatly promoted the investigation and profiling of miRNAs in diverse species. In recent years, grouper (Epinephelus spp.) aquaculture has been severely affected by iridoviral diseases. However, knowledge regarding the host immune responses to viral infection, especially the miRNA-mediated immune regulatory roles, is rather limited. In this study, using a Solexa deep sequencing approach, we identified 116 grouper miRNAs from grouper spleen-derived cells (GS). As expected, these miRNAs shared high sequence similarity with miRNAs identified in zebrafish (Danio rerio), pufferfish (Fugu rubripes), and other higher vertebrates. In the process of Singapore grouper iridovirus (SGIV) infection, 45 and 43 miRNAs with altered expression (>1.5-fold) were identified by miRNA microarray assays in grouper spleen tissues and GS cells, respectively. Furthermore, target prediction revealed 189 putative targets of these grouper miRNAs. Copyright © 2015 Elsevier Ltd. All rights reserved.
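The ">1.5-fold" criterion above can be illustrated with a short sketch. It assumes that "altered" covers both directions (up by more than 1.5-fold or down by more than 1.5-fold) and that expression values are positive linear-scale intensities; the abstract confirms neither point, and the miRNA names and values below are hypothetical.

```python
def altered_mirnas(control, infected, threshold=1.5):
    """Return {miRNA: infected/control ratio} for miRNAs changed beyond threshold.

    Assumption: a change counts in either direction, i.e. the ratio is
    above `threshold` or below 1/threshold.
    """
    hits = {}
    for name, base in control.items():
        value = infected.get(name)
        if value is None or base <= 0 or value <= 0:
            continue  # skip miRNAs without a usable measurement in both samples
        ratio = value / base
        if ratio > threshold or ratio < 1.0 / threshold:
            hits[name] = ratio
    return hits


# Hypothetical microarray intensities before and after infection.
control = {"miR-a": 100.0, "miR-b": 100.0, "miR-c": 100.0}
infected = {"miR-a": 200.0, "miR-b": 120.0, "miR-c": 50.0}
print(altered_mirnas(control, infected))
```

In practice a fold-change cutoff is usually combined with a statistical test across replicates, which this sketch leaves out.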
Revised annotation of Plutella xylostella microRNAs and their genome-wide target identification.
Etebari, K; Asgari, S
2016-12-01
The diamondback moth, Plutella xylostella, is the most devastating pest of brassica crops worldwide. Although 128 mature microRNAs (miRNAs) have been annotated from this species in miRBase, there is a need to extend and correct the current P. xylostella miRNA repertoire as a result of its recently improved genome assembly and more available small RNA sequence data. We used our new ultra-deep sequence data and bioinformatics to re-annotate the P. xylostella genome for high confidence miRNAs with the correct 5p and 3p arm features. Furthermore, all the P. xylostella annotated genes were also screened to identify potential miRNA binding sites using three target-predicting algorithms. In total, 203 mature miRNAs were annotated, including 33 novel miRNAs. We identified 7691 highly confident binding sites for 160 pxy-miRNAs. The data provided here will facilitate future studies involving functional analyses of P. xylostella miRNAs as a platform to introduce novel approaches for sustainable management of this destructive pest. © 2016 The Royal Entomological Society.
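The abstract above does not name the three target-predicting algorithms it used. As a rough illustration of the general idea behind many such tools, the following minimal sketch searches a transcript for sites complementary to a miRNA seed (nucleotides 2-8). The sequences and function names are hypothetical, and real predictors add further criteria (for example site context or binding energy) that are not modeled here.

```python
# Illustrative seed-match search; NOT the method used in the study above.
# All sequences below are toy examples chosen for the demonstration.

def reverse_complement(rna):
    """Return the reverse complement of an RNA string (A/U/G/C only)."""
    pairs = {"A": "U", "U": "A", "G": "C", "C": "G"}
    return "".join(pairs[base] for base in reversed(rna))


def seed_sites(mirna, target):
    """Return 0-based start positions in `target` that pair with the
    miRNA seed, taken here as nucleotides 2-8 of the mature miRNA."""
    seed = mirna[1:8]                # nucleotides 2-8 (a 7-mer)
    site = reverse_complement(seed)  # sequence expected on the target
    hits = []
    pos = target.find(site)
    while pos != -1:
        hits.append(pos)
        pos = target.find(site, pos + 1)  # allow overlapping matches
    return hits


if __name__ == "__main__":
    toy_mirna = "UGAGGUAGUAGGUUGUAUAGUU"
    toy_target = "AAACUACCUCAGGGCUACCUCUU"
    print(seed_sites(toy_mirna, toy_target))
```

A perfect seed match is only a candidate site; a pipeline combining several algorithms, as the abstract describes, would typically keep sites supported by more than one predictor.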
Ugras, Stacy; Brill, Elliott; Jacobsen, Anders; Hafner, Markus; Socci, Nicholas D.; DeCarolis, Penelope L.; Khanin, Raya; O'Connor, Rachael; Mihailovic, Aleksandra; Taylor, Barry S.; Sheridan, Robert; Gimble, Jeffrey M.; Viale, Agnes; Crago, Aimee; Antonescu, Cristina R.; Sander, Chris; Tuschl, Thomas; Singer, Samuel
2011-01-01
Liposarcoma remains the most common mesenchymal cancer, with a mortality rate of 60% among patients with this disease. To address the present lack of therapeutic options, we embarked upon a study of microRNA (miRNA) expression alterations associated with liposarcomagenesis with the goal of exploiting differentially expressed miRNAs and the gene products they regulate as potential therapeutic targets. MicroRNA expression was profiled in samples of normal adipose tissue, well-differentiated liposarcoma, and dedifferentiated liposarcoma by both deep sequencing of small RNA libraries and hybridization-based Agilent microarrays. The expression profiles discriminated liposarcoma from normal adipose tissue and well-differentiated from dedifferentiated disease. We defined over 40 miRNAs that were dysregulated in dedifferentiated liposarcomas in both the sequencing and the microarray analysis. The upregulated miRNAs included two cancer-associated species (miR-21, miR-26a), and the downregulated miRNAs included two species that were highly abundant in adipose tissue (miR-143, miR-145). Restoring miR-143 expression in dedifferentiated liposarcoma cells inhibited proliferation, induced apoptosis, and decreased expression of BCL2, TOP2A, PRC1, and PLK1. The downregulation of PRC1 and its docking partner PLK1 suggests that miR-143 inhibits cytokinesis in these cells. In support of this idea, treatment with a PLK1 inhibitor potently induced G2/M growth arrest and apoptosis in liposarcoma cells. Taken together, our findings suggest that miR-143 re-expression vectors or selective agents directed at miR-143 or its targets may have therapeutic value in dedifferentiated liposarcoma. PMID:21693658
Metatranscriptomic analyses of honey bee colonies
Tozkar, Cansu Ö.; Kence, Meral; Kence, Aykut; Huang, Qiang; Evans, Jay D.
2015-01-01
Honey bees face numerous biotic threats from viruses to bacteria, fungi, protists, and mites. Here we describe a thorough analysis of microbes harbored by worker honey bees collected from field colonies in geographically distinct regions of Turkey. Turkey is one of the world's most important centers of apiculture, harboring five subspecies of Apis mellifera L., approximately 20% of the honey bee subspecies in the world. We use deep Illumina-based RNA sequencing to capture RNA species for the honey bee and a sampling of all non-endogenous species carried by bees. After trimming and mapping these reads to the honey bee genome, approximately 10% of the sequences (9–10 million reads per library) remained. These were then mapped to a curated set of public sequences containing ca. sixty megabase-pairs of sequence representing known microbial species associated with honey bees. Levels of key honey bee pathogens were confirmed using quantitative PCR screens. We contrast microbial matches across different sites in Turkey, showing new country recordings of Lake Sinai virus, two Spiroplasma bacterium species, symbionts (Candidatus Schmidhempelia bombi, Frischella perrara, Snodgrassella alvi, Gilliamella apicola, Lactobacillus spp.), neogregarines, and a trypanosome species. By using metagenomic analysis, this study also reveals deep molecular evidence for the presence of bacterial pathogens (Melissococcus plutonius, Paenibacillus larvae), Varroa destructor-1 virus, Sacbrood virus, and fungi. Despite this effort we did not detect KBV, SBPV, Tobacco ringspot virus, VdMLV (Varroa Macula-like virus), Acarapis spp., Tropilaelaps spp. and Apocephalus (phorid fly). We discuss possible impacts of management practices and honey bee subspecies on microbial retinues. The described workflow and curated microbial database will be generally useful for microbial surveys of healthy and declining honey bees. PMID:25852743
Brooks, Matthew J.; Rajasimha, Harsha K.; Roger, Jerome E.
2011-01-01
Purpose: Next-generation sequencing (NGS) has revolutionized systems-based analysis of cellular pathways. The goals of this study are to compare NGS-derived retinal transcriptome profiling (RNA-seq) to microarray and quantitative reverse transcription polymerase chain reaction (qRT–PCR) methods and to evaluate protocols for optimal high-throughput data analysis. Methods: Retinal mRNA profiles of 21-day-old wild-type (WT) and neural retina leucine zipper knockout (Nrl−/−) mice were generated by deep sequencing, in triplicate, using Illumina GAIIx. The sequence reads that passed quality filters were analyzed at the transcript isoform level with two methods: Burrows–Wheeler Aligner (BWA) followed by ANOVA (ANOVA) and TopHat followed by Cufflinks. qRT–PCR validation was performed using TaqMan and SYBR Green assays. Results: Using an optimized data analysis workflow, we mapped about 30 million sequence reads per sample to the mouse genome (build mm9) and identified 16,014 transcripts in the retinas of WT and Nrl−/− mice with BWA workflow and 34,115 transcripts with TopHat workflow. RNA-seq data confirmed stable expression of 25 known housekeeping genes, and 12 of these were validated with qRT–PCR. RNA-seq data had a linear relationship with qRT–PCR for more than four orders of magnitude and a goodness of fit (R²) of 0.8798. Approximately 10% of the transcripts showed differential expression between the WT and Nrl−/− retina, with a fold change ≥1.5 and p value <0.05. Altered expression of 25 genes was confirmed with qRT–PCR, demonstrating the high degree of sensitivity of the RNA-seq method. Hierarchical clustering of differentially expressed genes uncovered several as yet uncharacterized genes that may contribute to retinal function. Data analysis with BWA and TopHat workflows revealed a significant overlap yet provided complementary insights in transcriptome profiling.
Conclusions: Our study represents the first detailed analysis of retinal transcriptomes, with biologic replicates, generated by RNA-seq technology. The optimized data analysis workflows reported here should provide a framework for comparative investigations of expression profiles. Our results show that NGS offers a comprehensive and more accurate quantitative and qualitative evaluation of mRNA content within a cell or tissue. We conclude that RNA-seq based transcriptome characterization would expedite genetic network analyses and permit the dissection of complex biologic functions. PMID:22162623
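The differential-expression criterion quoted in the abstract above (fold change ≥1.5 and p value <0.05) can be illustrated with a small filter. This is only a sketch under stated assumptions: the function name and the expression values are hypothetical, the fold change is treated symmetrically (up or down), and the p value is taken as given rather than computed, since the abstract does not detail those steps.

```python
# Hypothetical filter mirroring the thresholds quoted in the abstract:
# fold change >= 1.5 (in either direction) and p value < 0.05.

def is_differential(mean_wt, mean_ko, p_value, min_fold=1.5, alpha=0.05):
    """Return True when a transcript passes both thresholds.

    mean_wt / mean_ko are mean expression levels in the two groups;
    non-positive means are rejected because no ratio can be formed.
    """
    if mean_wt <= 0 or mean_ko <= 0:
        return False
    fold = max(mean_wt / mean_ko, mean_ko / mean_wt)  # direction-agnostic
    return fold >= min_fold and p_value < alpha


if __name__ == "__main__":
    # (transcript id, WT mean, knockout mean, p value): made-up numbers
    toy_table = [
        ("tx_a", 10.0, 30.0, 0.001),
        ("tx_b", 10.0, 12.0, 0.010),
        ("tx_c", 10.0, 30.0, 0.200),
    ]
    passing = [t[0] for t in toy_table if is_differential(t[1], t[2], t[3])]
    print(passing)
```

In practice the p values would come from the statistical test applied to the replicate measurements, and a multiple-testing correction is often considered when thousands of transcripts are screened.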
Kondo, Hideki; Hisano, Sakae; Chiba, Sotaro; Maruyama, Kazuyuki; Andika, Ida Bagus; Toyoda, Kazuhiro; Fujimori, Fumihiro; Suzuki, Nobuhiro
2016-02-02
The identification of mycoviruses contributes greatly to understanding of the diversity and evolutionary aspects of viruses. Powdery mildew fungi are important and widely studied obligate phytopathogenic agents, but there has been no report on mycoviruses infecting these fungi. In this study, we used a deep sequencing approach to analyze the double-stranded RNA (dsRNA) segments isolated from field-collected samples of powdery mildew fungus-infected red clover plants in Japan. Database searches identified the presence of at least ten totivirus (genus Totivirus)-like sequences, termed red clover powdery mildew-associated totiviruses (RPaTVs). The majority of these sequences shared moderate amino acid sequence identity with each other (<44%) and with other known totiviruses (<59%). Nine of these identified sequences (RPaTV1a, 1b and 2-8) resembled the genome of the prototype totivirus, Saccharomyces cerevisiae virus-L-A (ScV-L-A) in that they contained two overlapping open reading frames (ORFs) encoding a putative coat protein (CP) and an RNA dependent RNA polymerase (RdRp), while one sequence (RPaTV9) showed similarity to another totivirus, Ustilago maydis virus H1 (UmV-H1) that encodes a single polyprotein (CP-RdRp fusion). Similar to yeast totiviruses, each ScV-L-A-like RPaTV contains a -1 ribosomal frameshift site downstream of a predicted pseudoknot structure in the overlapping region of these ORFs, suggesting that the RdRp is translated as a CP-RdRp fusion. Moreover, several ScV-L-A-like sequences were also found by searches of the transcriptome shotgun assembly (TSA) libraries from rust fungi, plants and insects. Phylogenetic analyses show that nine ScV-L-A-like RPaTVs along with ScV-L-A-like sequences derived from TSA libraries are clustered with most established members of the genus Totivirus, while one RPaTV forms a new distinct clade with UmV-H1, possibly establishing an additional genus in the family. 
Taken together, our results indicate the presence of diverse, novel totiviruses in the powdery mildew fungus populations infecting red clover plants in the field. Copyright © 2015 Elsevier B.V. All rights reserved.
Subsurface Microbial Diversity in Deep-Granitic-Fracture Water in Colorado
Sahl, Jason W.; Schmidt, Raleigh; Swanner, Elizabeth D.; Mandernack, Kevin W.; Templeton, Alexis S.; Kieft, Thomas L.; Smith, Richard L.; Sanford, William E.; Callaghan, Robert L.; Mitton, Jeffry B.; Spear, John R.
2008-01-01
A microbial community analysis using 16S rRNA gene sequencing was performed on borehole water and a granite rock core from Henderson Mine, a >1,000-meter-deep molybdenum mine near Empire, CO. Chemical analysis of borehole water at two separate depths (1,044 m and 1,004 m below the mine entrance) suggests that a sharp chemical gradient exists, likely from the mixing of two distinct subsurface fluids, one metal rich and one relatively dilute; this has created unique niches for microorganisms. The microbial community analyzed from filtered, oxic borehole water indicated an abundance of sequences from iron-oxidizing bacteria (Gallionella spp.) and was compared to the community from the same borehole after 2 weeks of being plugged with an expandable packer. Statistical analyses with UniFrac revealed a significant shift in community structure following the addition of the packer. Phospholipid fatty acid (PLFA) analysis suggested that Nitrosomonadales dominated the oxic borehole, while PLFAs indicative of anaerobic bacteria were most abundant in the samples from the plugged borehole. Microbial sequences were represented primarily by Firmicutes, Proteobacteria, and a lineage of sequences which did not group with any identified bacterial division; phylogenetic analyses confirmed the presence of a novel candidate division. This “Henderson candidate division” dominated the clone libraries from the dilute anoxic fluids. Sequences obtained from the granitic rock core (1,740 m below the surface) were represented by the divisions Proteobacteria (primarily the family Ralstoniaceae) and Firmicutes. Sequences grouping within Ralstoniaceae were also found in the clone libraries from metal-rich fluids yet were absent in more dilute fluids. Lineage-specific comparisons, combined with phylogenetic statistical analyses, show that geochemical variance has an important effect on microbial community structure in deep, subsurface systems. PMID:17981950
Becker, Annemarie H.; Oh, Eugene; Weissman, Jonathan S.; Kramer, Günter; Bukau, Bernd
2014-01-01
A plethora of factors is involved in the maturation of newly synthesized proteins, including chaperones, membrane targeting factors, and enzymes. Many factors act cotranslationally through association with ribosome-nascent chain complexes (RNCs), but their target specificities and modes of action remain poorly understood. We developed selective ribosome profiling (SeRP) to identify substrate pools and points of RNC engagement of these factors. SeRP is based on sequencing mRNA fragments covered by translating ribosomes (general ribosome profiling, RP), combined with a procedure to selectively isolate RNCs whose nascent polypeptides are associated with the factor of interest. Factor–RNC interactions are stabilized by crosslinking, the resulting factor–RNC adducts are then nuclease-treated to generate monosomes, and affinity-purified. The ribosome-extracted mRNA footprints are converted to DNA libraries for deep sequencing. The protocol is specified for general RP and SeRP in bacteria. It was first applied to the chaperone trigger factor and is readily adaptable to other cotranslationally acting factors, including eukaryotic factors. Factor–RNC purification and sequencing library preparation takes 7–8 days, sequencing and data analysis can be completed in 5–6 days. PMID:24136347
Röthig, Till; Yum, Lauren K.; Kremb, Stephan G.; Roik, Anna; Voolstra, Christian R.
2017-01-01
Microbes associated with deep-sea corals remain poorly studied. The lack of symbiotic algae suggests that associated microbes may play a fundamental role in maintaining a viable coral host via acquisition and recycling of nutrients. Here we employed 16S rRNA gene sequencing to study bacterial communities of three deep-sea scleractinian corals from the Red Sea, Dendrophyllia sp., Eguchipsammia fistula, and Rhizotrochus typus. We found diverse, species-specific microbiomes, distinct from the surrounding seawater. Microbiomes were comprised of few abundant bacteria, which constituted the majority of sequences (up to 58% depending on the coral species). In addition, we found a high diversity of rare bacteria (taxa at <1% abundance comprised >90% of all bacteria). Interestingly, we identified anaerobic bacteria, potentially providing metabolic functions at low oxygen conditions, as well as bacteria harboring the potential to degrade crude oil components. Considering the presence of oil and gas fields in the Red Sea, these bacteria may unlock this carbon source for the coral host. In conclusion, the prevailing environmental conditions of the deep Red Sea (>20 °C, <2 mg oxygen L−1) may require distinct functional adaptations, and our data suggest that bacterial communities may contribute to coral functioning in this challenging environment. PMID:28303925
NASA Astrophysics Data System (ADS)
Zhang, Likui; Kang, Manyu; Xu, Jiajun; Xu, Jian; Shuai, Yinjie; Zhou, Xiaojian; Yang, Zhihui; Ma, Kesen
2016-05-01
Active deep-sea hydrothermal vents harbor abundant thermophilic and hyperthermophilic microorganisms. However, microbial communities in inactive hydrothermal vents have not been well documented. Here, we investigated bacterial and archaeal communities in two deep-sea sediments (named TVG4 and TVG11) collected from inactive hydrothermal vents on the Southwest Indian Ridge using high-throughput sequencing on the Illumina MiSeq2500 platform. Based on the V4 region of the 16S rRNA gene, sequence analysis showed that bacterial communities in the two samples were dominated by Proteobacteria, followed by Bacteroidetes, Actinobacteria and Firmicutes. Furthermore, archaeal communities in the two samples were dominated by Thaumarchaeota and Euryarchaeota. Comparative analysis showed that (i) TVG4 displayed higher bacterial richness and lower archaeal richness than TVG11; (ii) the two samples diverged more in archaeal communities than in bacterial communities. Bacteria and archaea that are potentially associated with nitrogen, sulfur, metal and methane cycling were detected in the two samples. Overall, we provide the first comparative picture of bacterial and archaeal communities and reveal their potential ecological roles in the deep-sea environments of inactive hydrothermal vents on the Southwest Indian Ridge, augmenting knowledge of microbial communities in inactive hydrothermal vents.
Nakagawa, Tatsunori; Ishibashi, Jun-Ichiro; Maruyama, Akihiko; Yamanaka, Toshiro; Morimoto, Yusuke; Kimura, Hiroyuki; Urabe, Tetsuro; Fukui, Manabu
2004-01-01
This study describes the occurrence of unique dissimilatory sulfite reductase (DSR) genes at a depth of 1,380 m from the deep-sea hydrothermal vent field at the Suiyo Seamount, Izu-Bonin Arc, Western Pacific, Japan. The DSR genes were obtained from microbes that grew in a catheter-type in situ growth chamber deployed for 3 days on a vent and from the effluent water of drilled holes at 5 degrees C and natural vent fluids at 7 degrees C. DSR clones SUIYOdsr-A and SUIYOdsr-B were not closely related to cultivated species or environmental clones. Moreover, samples of microbial communities were examined by PCR-denaturing gradient gel electrophoresis (DGGE) analysis of the 16S rRNA gene. The sequence analysis of 16S rRNA gene fragments obtained from the vent catheter after a 3-day incubation revealed the occurrence of bacterial DGGE bands affiliated with the Aquificae and gamma- and epsilon-Proteobacteria as well as the occurrence of archaeal phylotypes affiliated with the Thermococcales and of a unique archaeon sequence that clustered with "Nanoarchaeota." The DGGE bands obtained from drilled holes and natural vent fluids from 7 to 300 degrees C were affiliated with the delta-Proteobacteria, genus Thiomicrospira, and Pelodictyon. The dominant DGGE bands retrieved from the effluent water of casing pipes at 3 and 4 degrees C were closely related to phylotypes obtained from the Arctic Ocean. Our results suggest the presence of microorganisms corresponding to a unique DSR lineage not detected previously from other geothermal environments.
Transcriptomic investigation of meat tenderness in two Italian cattle breeds.
Bongiorni, S; Gruber, C E M; Bueno, S; Chillemi, G; Ferrè, F; Failla, S; Moioli, B; Valentini, A
2016-06-01
Our objectives for this study were to understand the biological basis of meat tenderness and to provide an overview of the gene expression profiles related to meat quality as a tool for selection. Through deep mRNA sequencing, we analyzed gene expression in muscle tissues of two Italian cattle breeds: Maremmana and Chianina. We uncovered several differentially expressed genes that encode for proteins belonging to a family of tripartite motif proteins, which are involved in growth, cell differentiation and apoptosis, such as TRIM45, or play an essential role in regulating skeletal muscle differentiation and the regeneration of adult skeletal muscle, such as TRIM32. Other differentially expressed genes (SCN2B, SLC9A7 and KCNK3) emphasize the involvement of potassium-sodium pumps in tender meat. By mapping splice junctions in RNA-Seq reads, we found significant differences in gene isoform expression levels. The PRKAG3 gene, which is involved in the regulation of energy metabolism, showed four isoforms that were differentially expressed. This distinct pattern of PRKAG3 gene expression could indicate impaired glycogen storage in skeletal muscle, and consequently, this gene very likely has a role in the tenderization process. Furthermore, with this deep RNA-sequencing, we captured a high number of expressed SNPs, for example, we found 1462 homozygous SNPs showing the alternative allele with a 100% frequency when comparing tender and tough meat. SNPs were then classified into categories by their position and also by their effect on gene coding (174 non-synonymous polymorphisms) based on the available UMD_3.1 annotations. © 2016 Stichting International Foundation for Animal Genetics.
Wang, Yong; Lan, Qingkuo; Zhao, Xin; Xu, Wentao; Li, Feiwu; Wang, Qinying; Chen, Rui
2016-01-01
MicroRNAs (miRNAs) have been widely demonstrated to play fundamental roles in gene regulation in most eukaryotes. To date, there has been no study describing the miRNA composition in genetically modified organisms (GMOs). In this study, small RNAs from dry seeds of two GM soybean lines and their parental cultivars were investigated using deep sequencing technology and bioinformatic approaches. As a result, several differentially expressed gma-miRNAs were found between the GM and non-GM soybeans. Meanwhile, more differentially expressed gma-miRNAs were identified between distantly related non-GM soybeans, indicating that the miRNA components of soybean seeds varied among different soybean lines, including the GM and non-GM soybeans, and that the extent of difference might be related to their genetic relationship. Additionally, fourteen novel gma-miRNA candidates were predicted in soybean seeds, including a potential bidirectionally transcribed miRNA family with two genomic loci (gma-miR-N1). Our findings provide the first useful data on miRNA composition in edible GM crops and also provide valuable information for soybean miRNA research.
Complete genome sequence of Sulfurimonas autotrophica type strain (OK10T)
Sikorski, Johannes; Munk, Christine; Lapidus, Alla; Ngatchou Djao, Olivier Duplex; Lucas, Susan; Glavina Del Rio, Tijana; Nolan, Matt; Tice, Hope; Han, Cliff; Cheng, Jan-Fang; Tapia, Roxanne; Goodwin, Lynne; Pitluck, Sam; Liolios, Konstantinos; Ivanova, Natalia; Mavromatis, Konstantinos; Mikhailova, Natalia; Pati, Amrita; Sims, David; Meincke, Linda; Brettin, Thomas; Detter, John C.; Chen, Amy; Palaniappan, Krishna; Land, Miriam; Hauser, Loren; Chang, Yun-Juan; Jeffries, Cynthia D.; Rohde, Manfred; Lang, Elke; Spring, Stefan; Göker, Markus; Woyke, Tanja; Bristow, James; Eisen, Jonathan A.; Markowitz, Victor; Hugenholtz, Philip; Kyrpides, Nikos C.; Klenk, Hans-Peter
2010-01-01
Sulfurimonas autotrophica Inagaki et al. 2003 is the type species of the genus Sulfurimonas. This genus is of interest because of its significant contribution to the global sulfur cycle as it oxidizes sulfur compounds to sulfate and by its apparent habitation of deep-sea hydrothermal and marine sulfidic environments as potential ecological niche. Here we describe the features of this organism, together with the complete genome sequence and annotation. This is the second complete genome sequence of the genus Sulfurimonas and the 15th genome in the family Helicobacteraceae. The 2,153,198 bp long genome with its 2,165 protein-coding and 55 RNA genes is part of the Genomic Encyclopedia of Bacteria and Archaea project. PMID:21304749
A deep learning-based multi-model ensemble method for cancer prediction.
Xiao, Yawen; Wu, Jun; Lin, Zongli; Zhao, Xiaodong
2018-01-01
Cancer is a complex worldwide health problem associated with high mortality. With the rapid development of the high-throughput sequencing technology and the application of various machine learning methods that have emerged in recent years, progress in cancer prediction has been increasingly made based on gene expression, providing insight into effective and accurate treatment decision making. Thus, developing machine learning methods, which can successfully distinguish cancer patients from healthy persons, is of great current interest. However, among the classification methods applied to cancer prediction so far, no one method outperforms all the others. In this paper, we demonstrate a new strategy, which applies deep learning to an ensemble approach that incorporates multiple different machine learning models. We supply informative gene data selected by differential gene expression analysis to five different classification models. Then, a deep learning method is employed to ensemble the outputs of the five classifiers. The proposed deep learning-based multi-model ensemble method was tested on three public RNA-seq data sets of three kinds of cancers, Lung Adenocarcinoma, Stomach Adenocarcinoma and Breast Invasive Carcinoma. The test results indicate that it increases the prediction accuracy of cancer for all the tested RNA-seq data sets as compared to using a single classifier or the majority voting algorithm. By taking full advantage of different classifiers, the proposed deep learning-based multi-model ensemble method is shown to be accurate and effective for cancer prediction. Copyright © 2017 Elsevier B.V. All rights reserved.
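The contrast drawn in the abstract above, between majority voting and a learned combination of the base classifiers' outputs, can be sketched in a few lines. The abstract gives no architecture details, so this example substitutes a single logistic unit trained by gradient descent for the deep network; the data, function names, and hyperparameters are all hypothetical and serve only to show the stacking idea.

```python
# Stacking sketch: a learned combiner versus majority voting.
# A single logistic unit stands in for the deep network of the paper;
# all data and names here are illustrative assumptions.
import math


def majority_vote(votes):
    """Return 1 if more than half of the base classifiers predict 1."""
    return int(sum(votes) * 2 > len(votes))


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def train_combiner(rows, labels, lr=0.5, epochs=500):
    """Fit weights and bias on base-classifier outputs by batch
    gradient descent on the logistic loss."""
    n_feat = len(rows[0])
    w, b = [0.0] * n_feat, 0.0
    for _ in range(epochs):
        grad_w, grad_b = [0.0] * n_feat, 0.0
        for x, y in zip(rows, labels):
            err = sigmoid(b + sum(wi * xi for wi, xi in zip(w, x))) - y
            grad_b += err
            for j, xj in enumerate(x):
                grad_w[j] += err * xj
        w = [wi - lr * g / len(rows) for wi, g in zip(w, grad_w)]
        b -= lr * grad_b / len(rows)
    return w, b


def combine(w, b, votes):
    """Probability of class 1 according to the learned combiner."""
    return sigmoid(b + sum(wi * xi for wi, xi in zip(w, votes)))


if __name__ == "__main__":
    # Each row holds the 0/1 outputs of five base classifiers (toy data).
    rows = [[1, 0, 0, 1, 0], [1, 1, 1, 0, 1], [0, 1, 1, 0, 1],
            [0, 0, 0, 1, 0], [1, 1, 0, 0, 0], [0, 0, 1, 1, 1]]
    labels = [1, 1, 0, 0, 1, 0]
    w, b = train_combiner(rows, labels)
    for x, y in zip(rows, labels):
        print(y, majority_vote(x), round(combine(w, b, x), 3))
```

Majority voting weights every classifier equally, whereas a trained combiner can learn to trust the more reliable ones; in a real setting the combiner would be fitted on held-out predictions to avoid overfitting to the base classifiers' training data.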
Edgar, Robyn; Veerapaneni, Ram S.; D’Elia, Tom; Morris, Paul F.; Rogers, Scott O.
2013-01-01
Lake Vostok, the 7th largest (by volume) and 4th deepest lake on Earth, is covered by more than 3,700 m of ice, making it the largest subglacial lake known. The combination of cold, heat (from possible hydrothermal activity), pressure (from the overriding glacier), limited nutrients and complete darkness presents extreme challenges to life. Here, we report metagenomic/metatranscriptomic sequence analyses from four accretion ice sections from the Vostok 5G ice core. Two sections accreted in the vicinity of an embayment on the southwestern end of the lake, and the other two represented part of the southern main basin. We obtained 3,507 unique gene sequences from concentrates of 500 ml of 0.22 µm-filtered accretion ice meltwater. Taxonomic classifications (to genus and/or species) were possible for 1,623 of the sequences. Species determinations in combination with mRNA gene sequence results allowed deduction of the metabolic pathways represented in the accretion ice and, by extension, in the lake. Approximately 94% of the sequences were from Bacteria and 6% were from Eukarya. Only two sequences were from Archaea. In general, the taxa were similar to organisms previously described from lakes, brackish water, marine environments, soil, glaciers, ice, lake sediments, deep-sea sediments, deep-sea thermal vents, animals and plants. Sequences from aerobic, anaerobic, psychrophilic, thermophilic, halophilic, alkaliphilic, acidophilic, desiccation-resistant, autotrophic and heterotrophic organisms were present, including a number from multicellular eukaryotes. PMID:23843994
Shtarkman, Yury M; Koçer, Zeynep A; Edgar, Robyn; Veerapaneni, Ram S; D'Elia, Tom; Morris, Paul F; Rogers, Scott O
2013-01-01
Lake Vostok, the 7th largest (by volume) and 4th deepest lake on Earth, is covered by more than 3,700 m of ice, making it the largest subglacial lake known. The combination of cold, heat (from possible hydrothermal activity), pressure (from the overriding glacier), limited nutrients and complete darkness presents extreme challenges to life. Here, we report metagenomic/metatranscriptomic sequence analyses from four accretion ice sections from the Vostok 5G ice core. Two sections accreted in the vicinity of an embayment on the southwestern end of the lake, and the other two represented part of the southern main basin. We obtained 3,507 unique gene sequences from concentrates of 500 ml of 0.22 µm-filtered accretion ice meltwater. Taxonomic classifications (to genus and/or species) were possible for 1,623 of the sequences. Species determinations in combination with mRNA gene sequence results allowed deduction of the metabolic pathways represented in the accretion ice and, by extension, in the lake. Approximately 94% of the sequences were from Bacteria and 6% were from Eukarya. Only two sequences were from Archaea. In general, the taxa were similar to organisms previously described from lakes, brackish water, marine environments, soil, glaciers, ice, lake sediments, deep-sea sediments, deep-sea thermal vents, animals and plants. Sequences from aerobic, anaerobic, psychrophilic, thermophilic, halophilic, alkaliphilic, acidophilic, desiccation-resistant, autotrophic and heterotrophic organisms were present, including a number from multicellular eukaryotes.
2011-01-01
Background: Small RNA (sRNA) regulatory pathways (SRRPs) are important to anti-viral defence in mosquitoes. To identify critical features of the virus infection process in Dengue serotype 2 (DENV2)-infected Ae. aegypti, we deep-sequenced small non-coding RNAs. Triplicate biological replicates were used so that rigorous statistical metrics could be applied. Results: In addition to virus-derived siRNAs (20-23 nts) previously reported for other arbovirus-infected mosquitoes, we show that PIWI pathway sRNAs (piRNAs) (24-30 nts) and unusually small RNAs (usRNAs) (13-19 nts) are produced in DENV-infected mosquitoes. We demonstrate that a major catalytic enzyme of the siRNA pathway, Argonaute 2 (Ago2), co-migrates with a ~1 megadalton complex in adults prior to bloodfeeding. sRNAs were cloned and sequenced from Ago2 immunoprecipitations. Viral sRNA patterns change over the course of infection. Host sRNAs were mapped to the published aedine transcriptome and subjected to analysis using edgeR (Bioconductor). We found that sRNA profiles are altered early in DENV2 infection, and mRNA targets from mitochondrial, transcription/translation, and transport functional categories are affected. Moreover, small non-coding RNAs (ncRNAs), such as tRNAs, spliceosomal U RNAs, and snoRNAs are highly enriched in DENV-infected samples at 2 and 4 dpi. Conclusions: These data implicate the PIWI pathway in anti-viral defense. Changes to host sRNA profiles indicate that specific cellular processes are affected during DENV infection, such as mitochondrial function and ncRNA levels. Together, these data provide important progress in understanding the DENV2 infection process in Ae. aegypti. PMID:21356105
Denef, Vincent J.; Fujimoto, Masanori; Berry, Michelle A.; ...
2016-04-29
Relative abundance profiles of bacterial populations measured by sequencing DNA or RNA of marker genes can widely differ. These differences, made apparent when calculating ribosomal RNA:DNA ratios, have been interpreted as variable activities of bacterial populations. However, inconsistent correlations between ribosomal RNA:DNA ratios and metabolic activity or growth rates have led to a more conservative interpretation of this metric as the cellular protein synthesis potential (PSP). Little is known, particularly in freshwater systems, about how PSP varies for specific taxa across temporal and spatial environmental gradients and how conserved PSP is across bacterial phylogeny. Here, we generated 16S rRNA gene sequencing data using simultaneously extracted DNA and RNA from fractionated (free-living and particulate) water samples taken seasonally along a eutrophic freshwater estuary to oligotrophic pelagic transect in Lake Michigan. In contrast to previous reports, we observed frequent clustering of DNA and RNA data from the same sample. Analysis of the overlap in taxa detected at the RNA and DNA level indicated that microbial dormancy may be more common in the estuary, the particulate fraction, and during the stratified period. Across spatiotemporal gradients, PSP was often conserved at the phylum and class levels. PSPs for specific taxa were more similar across habitats in spring than in summer and fall. This was most notable for PSPs of the same taxa when located in the free-living or particulate fractions, but also when contrasting surface to deep, and estuary to Lake Michigan communities. Our results show that community composition assessed by RNA and DNA measurements are more similar than previously assumed in freshwater systems. Furthermore, the similarity between RNA and DNA measurements and taxa-specific PSPs that drive community-level similarities are conditional on spatiotemporal factors.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Denef, Vincent J.; Fujimoto, Masanori; Berry, Michelle A.
Relative abundance profiles of bacterial populations measured by sequencing DNA or RNA of marker genes can widely differ. These differences, made apparent when calculating ribosomal RNA:DNA ratios, have been interpreted as variable activities of bacterial populations. However, inconsistent correlations between ribosomal RNA:DNA ratios and metabolic activity or growth rates have led to a more conservative interpretation of this metric as the cellular protein synthesis potential (PSP). Little is known, particularly in freshwater systems, about how PSP varies for specific taxa across temporal and spatial environmental gradients and how conserved PSP is across bacterial phylogeny. Here, we generated 16S rRNA gene sequencing data using simultaneously extracted DNA and RNA from fractionated (free-living and particulate) water samples taken seasonally along a eutrophic freshwater estuary to oligotrophic pelagic transect in Lake Michigan. In contrast to previous reports, we observed frequent clustering of DNA and RNA data from the same sample. Analysis of the overlap in taxa detected at the RNA and DNA level indicated that microbial dormancy may be more common in the estuary, the particulate fraction, and during the stratified period. Across spatiotemporal gradients, PSP was often conserved at the phylum and class levels. PSPs for specific taxa were more similar across habitats in spring than in summer and fall. This was most notable for PSPs of the same taxa when located in the free-living or particulate fractions, but also when contrasting surface to deep, and estuary to Lake Michigan communities. Our results show that community composition assessed by RNA and DNA measurements are more similar than previously assumed in freshwater systems. Furthermore, the similarity between RNA and DNA measurements and taxa-specific PSPs that drive community-level similarities are conditional on spatiotemporal factors.
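As a rough illustration of the ribosomal RNA:DNA ratio discussed in this abstract, the sketch below divides each taxon's relative abundance in the RNA library by its relative abundance in the DNA library, and lists taxa seen only at the DNA level (the kind of overlap the abstract uses as a dormancy indicator). The function names, the taxon labels and the read counts are hypothetical, and the study's actual normalization may differ.

```python
def relative_abundance(counts):
    """Convert raw read counts per taxon into fractions of the library total."""
    total = sum(counts.values())
    return {taxon: n / total for taxon, n in counts.items()}

def rrna_dna_ratios(rna_counts, dna_counts):
    """Return RNA:DNA relative-abundance ratios for taxa detected in both libraries."""
    rna = relative_abundance(rna_counts)
    dna = relative_abundance(dna_counts)
    return {
        taxon: rna[taxon] / dna[taxon]
        for taxon in rna
        if rna[taxon] > 0 and dna.get(taxon, 0) > 0
    }

def dna_only_taxa(rna_counts, dna_counts):
    """List taxa detected at the DNA level but absent from the RNA library."""
    return sorted(t for t, n in dna_counts.items() if n > 0 and rna_counts.get(t, 0) == 0)

# Hypothetical read counts for one sample.
dna_counts = {"taxon_A": 50, "taxon_B": 30, "taxon_C": 20}
rna_counts = {"taxon_A": 25, "taxon_B": 75}

ratios = rrna_dna_ratios(rna_counts, dna_counts)
inactive = dna_only_taxa(rna_counts, dna_counts)
```

Here taxon_A ends up with a ratio below 1 and taxon_B with a ratio above 1, while taxon_C appears only in the DNA library.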
Kadri, Sabah; Hinman, Veronica F.; Benos, Panayiotis V.
2011-01-01
microRNAs (miRNAs) are small (20–23 nt), non-coding single stranded RNA molecules that act as post-transcriptional regulators of mRNA gene expression. They have been implicated in regulation of developmental processes in diverse organisms. The echinoderms, Strongylocentrotus purpuratus (sea urchin) and Patiria miniata (sea star) are excellent model organisms for studying development with well-characterized transcriptional networks. However, to date, nothing is known about the role of miRNAs during development in these organisms, except that the genes that are involved in the miRNA biogenesis pathway are expressed during their developmental stages. In this paper, we used Illumina Genome Analyzer (Illumina, Inc.) to sequence small RNA libraries in mixed stage population of embryos from one to three days after fertilization of sea urchin and sea star (total of 22,670,000 reads). Analysis of these data revealed the miRNA populations in these two species. We found that 47 and 38 known miRNAs are expressed in sea urchin and sea star, respectively, during early development (32 in common). We also found 13 potentially novel miRNAs in the sea urchin embryonic library. miRNA expression is generally conserved between the two species during development, but 7 miRNAs are highly expressed in only one species. We expect that our two datasets will be a valuable resource for everyone working in the field of developmental biology and the regulatory networks that affect it. The computational pipeline to analyze Illumina reads is available at http://www.benoslab.pitt.edu/services.html. PMID:22216218
Perspectives on the mechanism of transcriptional regulation by long non-coding RNAs.
Roberts, Thomas C; Morris, Kevin V; Weinberg, Marc S
2014-01-01
Long non-coding RNAs (lncRNAs) are increasingly being recognized as epigenetic regulators of gene transcription. The diversity and complexity of lncRNA genes means that they exert their regulatory effects by a variety of mechanisms. Although there is still much to be learned about the mechanism of lncRNA function, general principles are starting to emerge. In particular, the application of high throughput (deep) sequencing methodologies has greatly advanced our understanding of lncRNA gene function. lncRNAs function as adaptors that link specific chromatin loci with chromatin-remodeling complexes and transcription factors. lncRNAs can act in cis or trans to guide epigenetic-modifier complexes to distinct genomic sites, or act as scaffolds which recruit multiple proteins simultaneously, thereby coordinating their activities. In this review we discuss the genomic organization of lncRNAs, the importance of RNA secondary structure to lncRNA functionality, the multitude of ways in which they interact with the genome, and what evolutionary conservation tells us about their function.
Pontvianne, Frédéric; Carpentier, Marie-Christine; Durut, Nathalie; Pavlištová, Veronika; Jaške, Karin; Schořová, Šárka; Parrinello, Hugues; Rohmer, Marine; Pikaard, Craig S; Fojtová, Miloslava; Fajkus, Jiří; Saez-Vasquez, Julio
2017-01-01
The nucleolus is the site of ribosomal RNA (rRNA) gene transcription, rRNA processing and ribosome biogenesis. However, the nucleolus also plays additional roles in the cell. We isolated nucleoli by Fluorescence Activated Cell Sorting (FACS) and identified Nucleolus-Associated Chromatin Domains (NADs) by deep sequencing, comparing wild-type plants and null mutants for the nucleolar protein, NUCLEOLIN 1 (NUC1). NADs are primarily genomic regions with heterochromatic signatures and include transposable elements (TEs), sub-telomeric regions and mostly inactive protein-coding genes. However, NADs also include active ribosomal RNA genes, and the entire short arm of chromosome 4 adjacent to them. In nuc1 null mutants, which alter rRNA gene expression and overall nucleolar structure, NADs are altered, telomere association with the nucleolus is decreased and telomeres become shorter. Collectively, our studies reveal roles for NUC1 and the nucleolus in the spatial organization of chromosomes as well as telomere maintenance. PMID:27477271
Matrin 3 binds and stabilizes mRNA.
Salton, Maayan; Elkon, Ran; Borodina, Tatiana; Davydov, Aleksey; Yaspo, Marie-Laure; Halperin, Eran; Shiloh, Yosef
2011-01-01
Matrin 3 (MATR3) is a highly conserved, inner nuclear matrix protein with two zinc finger domains and two RNA recognition motifs (RRM), whose function is largely unknown. Recently we found MATR3 to be phosphorylated by the protein kinase ATM, which activates the cellular response to double strand breaks in the DNA. Here, we show that MATR3 interacts in an RNA-dependent manner with several proteins with established roles in RNA processing, and maintains its interaction with RNA via its RRM2 domain. Deep sequencing of the bound RNA (RIP-seq) identified several small noncoding RNA species. Using microarray analysis to explore MATR3's role in transcription, we identified 77 transcripts whose amounts depended on the presence of MATR3. We validated this finding with nine transcripts which were also bound to the MATR3 complex. Finally, we demonstrated the importance of MATR3 for maintaining the stability of several of these mRNA species and conclude that it has a role in mRNA stabilization. The data suggest that the cellular level of MATR3, known to be highly regulated, modulates the stability of a group of gene transcripts.
Matrin 3 Binds and Stabilizes mRNA
Salton, Maayan; Elkon, Ran; Borodina, Tatiana; Davydov, Aleksey; Yaspo, Marie-Laure; Halperin, Eran; Shiloh, Yosef
2011-01-01
Matrin 3 (MATR3) is a highly conserved, inner nuclear matrix protein with two zinc finger domains and two RNA recognition motifs (RRM), whose function is largely unknown. Recently we found MATR3 to be phosphorylated by the protein kinase ATM, which activates the cellular response to double strand breaks in the DNA. Here, we show that MATR3 interacts in an RNA-dependent manner with several proteins with established roles in RNA processing, and maintains its interaction with RNA via its RRM2 domain. Deep sequencing of the bound RNA (RIP-seq) identified several small noncoding RNA species. Using microarray analysis to explore MATR3′s role in transcription, we identified 77 transcripts whose amounts depended on the presence of MATR3. We validated this finding with nine transcripts which were also bound to the MATR3 complex. Finally, we demonstrated the importance of MATR3 for maintaining the stability of several of these mRNA species and conclude that it has a role in mRNA stabilization. The data suggest that the cellular level of MATR3, known to be highly regulated, modulates the stability of a group of gene transcripts. PMID:21858232
Shen, Yingjia; Venu, R.C.; Nobuta, Kan; Wu, Xiaohui; Notibala, Varun; Demirci, Caghan; Meyers, Blake C.; Wang, Guo-Liang; Ji, Guoli; Li, Qingshun Q.
2011-01-01
Polyadenylation sites mark the ends of mRNA transcripts. Alternative polyadenylation (APA) may alter sequence elements and/or the coding capacity of transcripts, a mechanism that has been demonstrated to regulate gene expression and transcriptome diversity. To study the role of APA in transcriptome dynamics, we analyzed a large-scale data set of RNA “tags” that signify poly(A) sites and expression levels of mRNA. These tags were derived from a wide range of tissues and developmental stages that were mutated or exposed to environmental treatments, and generated using digital gene expression (DGE)–based protocols of the massively parallel signature sequencing (MPSS-DGE) and the Illumina sequencing-by-synthesis (SBS-DGE) sequencing platforms. The data offer a global view of APA and how it contributes to transcriptome dynamics. Upon analysis of these data, we found that ∼60% of Arabidopsis genes have multiple poly(A) sites. Likewise, ∼47% and 82% of rice genes use APA, supported by MPSS-DGE and SBS-DGE tags, respectively. In both species, ∼49%–66% of APA events were mapped upstream of annotated stop codons. Interestingly, 10% of the transcriptomes are made up of APA transcripts that are differentially distributed among developmental stages and in tissues responding to environmental stresses, providing an additional level of transcriptome dynamics. Examples of pollen-specific APA switching and salicylic acid treatment-specific APA clearly demonstrated such dynamics. The significance of these APAs is more evident in the 3034 genes that have conserved APA events between rice and Arabidopsis. PMID:21813626
Asha, Srinivasan; Soniya, E V
2017-02-01
Small RNAs derived from ribosomal RNAs (srRNAs) are rarely explored in the high-throughput data of plant systems. Here, we analyzed srRNAs from the deep-sequenced small RNA libraries of Piper nigrum, a unique magnoliid plant. The 5' end of the putative long form of 5.8S rRNA (5.8S L rRNA) was identified as the site for biogenesis of highly abundant srRNAs that are unique among the Piperaceae family of plants. A subsequent comparative analysis of the ninety-seven sRNAomes of diverse plants successfully uncovered the abundant existence and precise cleavage of unique rRF signature small RNAs upstream of a novel 5' consensus sequence of the 5.8S rRNA. The major cleavage process mapped identically among the different tissues of the same plant. The differential expression and cleavage of 5'5.8S srRNAs in Phytophthora capsici infected P. nigrum tissues indicated the critical biological functions of these srRNAs during stress response. The non-canonical short hairpin precursor structure, the association with Argonaute proteins, and the potential targets of 5'5.8S srRNAs reinforced their regulatory role in the RNAi pathway in plants. In addition, this novel lineage specific small RNAs may have tremendous biological potential in the taxonomic profiling of plants.
Asha, Srinivasan; Soniya, E. V.
2017-01-01
Small RNAs derived from ribosomal RNAs (srRNAs) are rarely explored in the high-throughput data of plant systems. Here, we analyzed srRNAs from the deep-sequenced small RNA libraries of Piper nigrum, a unique magnoliid plant. The 5′ end of the putative long form of 5.8S rRNA (5.8SLrRNA) was identified as the site for biogenesis of highly abundant srRNAs that are unique among the Piperaceae family of plants. A subsequent comparative analysis of the ninety-seven sRNAomes of diverse plants successfully uncovered the abundant existence and precise cleavage of unique rRF signature small RNAs upstream of a novel 5′ consensus sequence of the 5.8S rRNA. The major cleavage process mapped identically among the different tissues of the same plant. The differential expression and cleavage of 5′5.8S srRNAs in Phytophthora capsici infected P. nigrum tissues indicated the critical biological functions of these srRNAs during stress response. The non-canonical short hairpin precursor structure, the association with Argonaute proteins, and the potential targets of 5′5.8S srRNAs reinforced their regulatory role in the RNAi pathway in plants. In addition, this novel lineage specific small RNAs may have tremendous biological potential in the taxonomic profiling of plants. PMID:28145468
Koskey, Amber M.; Fisher, Jenny C.; Traudt, Mary F.; Newton, Ryan J.
2014-01-01
Gulls are prevalent in beach environments and can be a major source of fecal contamination. Gulls have been shown to harbor a high abundance of fecal indicator bacteria (FIB), such as Escherichia coli and enterococci, which can be readily detected as part of routine beach monitoring. Despite the ubiquitous presence of gull fecal material in beach environments, the associated microbial community is relatively poorly characterized. We generated comprehensive microbial community profiles of gull fecal samples using Roche 454 and Illumina MiSeq platforms to investigate the composition and variability of the gull fecal microbial community and to measure the proportion of FIB. Enterococcaceae and Enterobacteriaceae were the two most abundant families in our gull samples. Sequence comparisons between short-read data and nearly full-length 16S rRNA gene clones generated from the same samples revealed Catellicoccus marimammalium as the most numerous taxon among all samples. The identification of bacteria from gull fecal pellets cultured on membrane-Enterococcus indoxyl-β-d-glucoside (mEI) plates showed that the dominant sequences recovered in our sequence libraries did not represent organisms culturable on mEI. Based on 16S rRNA gene sequencing of gull fecal isolates cultured on mEI plates, 98.8% were identified as Enterococcus spp., 1.2% were identified as Streptococcus spp., and none were identified as C. marimammalium. Illumina deep sequencing indicated that gull fecal samples harbor significantly higher proportions of C. marimammalium 16S rRNA gene sequences (>50-fold) relative to typical mEI culturable Enterococcus spp. C. marimammalium therefore can be confidently utilized as a genetic marker to identify gull fecal pollution in the beach environment. PMID:24242244
Takai, Ken; Abe, Mariko; Miyazaki, Masayuki; Koide, Osamu; Nunoura, Takuro; Imachi, Hiroyuki; Inagaki, Fumio; Kobayashi, Tohru
2013-05-01
A facultatively anaerobic organoheterotroph, designated JAM-BA0302(T), was isolated from a deep subseafloor sediment at a depth of 247.1 m below the seafloor off the Shimokita Peninsula of Japan in the north-western Pacific Ocean (Site C9001, water depth 1180 m). Cells of strain JAM-BA0302(T) showed gliding motility and were thin, long rods with peritrichous fimbriae-like structures. Growth occurred at 4-37 °C (optimum 30 °C; doubling time 8 h), at pH 5.4-8.3 (optimum pH 7.5) and with 5-60 g NaCl l(-1) (optimum 20-25 g l(-1)). The isolate utilized proteinaceous substrates such as yeast extract, tryptone, casein and Casamino acids with O2 respiration or fermentation. Strain JAM-BA0302(T) was a piezotolerant bacterium that could grow at pressures as high as 25 MPa under aerobic conditions and 10 MPa under anaerobic conditions. The G+C content of the genomic DNA was 43.2 mol%. Phylogenetic analysis based on 16S rRNA gene sequences indicated that strain JAM-BA0302(T) was most closely related to yet-undescribed strains recently isolated from various marine sedimentary environments (>99.6 % 16S rRNA gene sequence similarity) and was moderately related to Sunxiuqinia elliptica DQHS-4(T), isolated from a sea cucumber farm sediment (95.5 % 16S rRNA gene sequence similarity) within the Bacteroidetes. The phylogenetic analysis suggested that the isolate should belong to the genus Sunxiuqinia. However, low DNA-DNA relatedness (<11 %) and many physiological and molecular properties differentiated the isolate from those previously described. We propose here a novel species of the genus Sunxiuqinia, with the name Sunxiuqinia faeciviva sp. nov. The type strain is JAM-BA0302(T) (= JCM 15547(T) = NCIMB 14481(T)).
Mahato, Ajay Kumar; Sharma, Nimisha; Singh, Akshay; Srivastav, Manish; Jaiprakash; Singh, Sanjay Kumar; Singh, Anand Kumar; Sharma, Tilak Raj; Singh, Nagendra Kumar
2016-01-01
Mango (Mangifera indica L.) is called "king of fruits" due to its sweetness, richness of taste, diversity, large production volume and variety of end uses. Despite its huge economic importance, genomic resources in mango are scarce and the genetics of useful horticultural traits are poorly understood. Here we generated deep coverage leaf RNA sequence data for mango parental varieties 'Neelam', 'Dashehari' and their hybrid 'Amrapali' using next generation sequencing technologies. De-novo sequence assembly generated 27,528, 20,771 and 35,182 transcripts for the three genotypes, respectively. The transcripts were further assembled into a non-redundant set of 70,057 unigenes that were used for SSR and SNP identification and annotation. A total of 5,465 SSR loci were identified in 4,912 unigenes, with 288 type I SSR (n ≥ 20 bp). One hundred type I SSR markers were randomly selected, of which 43 yielded PCR amplicons of expected size in the first round of validation and were designated as validated genic-SSR markers. Further, 22,306 SNPs were identified by aligning high quality sequence reads of the three mango varieties to the reference unigene set, revealing significantly enhanced SNP heterozygosity in the hybrid Amrapali. The present study on leaf RNA sequencing of mango varieties and their hybrid provides a useful genomic resource for genetic improvement of mango.
Mahato, Ajay Kumar; Sharma, Nimisha; Singh, Akshay; Srivastav, Manish; Jaiprakash; Singh, Sanjay Kumar; Singh, Anand Kumar; Sharma, Tilak Raj; Singh, Nagendra Kumar
2016-01-01
Mango (Mangifera indica L.) is called “king of fruits” due to its sweetness, richness of taste, diversity, large production volume and variety of end uses. Despite its huge economic importance, genomic resources in mango are scarce and the genetics of useful horticultural traits are poorly understood. Here we generated deep coverage leaf RNA sequence data for mango parental varieties ‘Neelam’, ‘Dashehari’ and their hybrid ‘Amrapali’ using next generation sequencing technologies. De-novo sequence assembly generated 27,528, 20,771 and 35,182 transcripts for the three genotypes, respectively. The transcripts were further assembled into a non-redundant set of 70,057 unigenes that were used for SSR and SNP identification and annotation. A total of 5,465 SSR loci were identified in 4,912 unigenes, with 288 type I SSR (n ≥ 20 bp). One hundred type I SSR markers were randomly selected, of which 43 yielded PCR amplicons of expected size in the first round of validation and were designated as validated genic-SSR markers. Further, 22,306 SNPs were identified by aligning high quality sequence reads of the three mango varieties to the reference unigene set, revealing significantly enhanced SNP heterozygosity in the hybrid Amrapali. The present study on leaf RNA sequencing of mango varieties and their hybrid provides a useful genomic resource for genetic improvement of mango. PMID:27736892
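The SSR identification step mentioned in this abstract can be sketched with a regular-expression scan. This is only an illustration: the abstract does not name the tool or thresholds used, so the minimum repeat counts below are assumed values in the style of common SSR finders, and the example sequence is made up. Only the type I cutoff (repeat tract of at least 20 bp) comes from the abstract.

```python
import re

# Assumed minimum repeat counts per motif length (not taken from the abstract).
MIN_REPEATS = {1: 10, 2: 6, 3: 5, 4: 5, 5: 5, 6: 5}
TYPE_I_MIN_BP = 20  # type I SSR: tract length >= 20 bp, as stated in the abstract

def is_repeat_of_shorter_unit(motif):
    """True if the motif is itself a tandem repeat of a shorter unit (e.g. AGAG = AG x 2)."""
    for k in range(1, len(motif)):
        if len(motif) % k == 0 and motif[:k] * (len(motif) // k) == motif:
            return True
    return False

def find_ssrs(seq):
    """Return perfect SSR tracts found in seq, sorted by start position."""
    seq = seq.upper()
    hits = []
    for k, min_rep in MIN_REPEATS.items():
        # A k-mer followed by at least (min_rep - 1) further copies of itself.
        pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (k, min_rep - 1))
        for m in pattern.finditer(seq):
            motif = m.group(1)
            if is_repeat_of_shorter_unit(motif):
                continue  # already reported under the shorter motif length
            length = m.end() - m.start()
            hits.append({
                "start": m.start(),
                "motif": motif,
                "repeats": length // k,
                "length_bp": length,
                "type_I": length >= TYPE_I_MIN_BP,
            })
    return sorted(hits, key=lambda h: h["start"])

# Made-up unigene fragment: a 24 bp (AG)12 tract and a 15 bp (ATC)5 tract.
example = "TTGC" + "AG" * 12 + "CCTA" + "ATC" * 5 + "GGT"
ssrs = find_ssrs(example)
```

In this example the dinucleotide tract qualifies as type I and the trinucleotide tract does not. Imperfect and compound repeats are ignored by this sketch.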
Task 1.5 Genomic Shift and Drift Trends of Emerging Pathogens
DOE Office of Scientific and Technical Information (OSTI.GOV)
Borucki, M
2010-01-05
The Lawrence Livermore National Laboratory (LLNL) Bioinformatics group has recently taken on a role in DTRA's Transformation Medical Technologies Initiative (TMTI). The high-level goal of TMTI is to accelerate the development of broad-spectrum countermeasures. To achieve those goals, TMTI has a near term need to conduct analyses of genomic shift and drift trends of emerging pathogens, with a focused eye on select agent pathogens, as well as antibiotic and virulence markers. Most emerging human pathogens are zoonotic viruses with a genome composed of RNA. The high mutation rate of the replication enzymes of RNA viruses contributes to sequence drift and provides one mechanism for these viruses to adapt to diverse hosts (interspecies transmission events) and cause new human and zoonotic diseases. Additionally, new viral pathogens frequently emerge due to genetic shift (recombination and segment reassortment) which allows for dramatic genotypic and phenotypic changes to occur rapidly. Bacterial pathogens also evolve via genetic drift and shift, although sequence drift generally occurs at a much slower rate for bacteria as compared to RNA viruses. However, genetic shift such as lateral gene transfer and inter- and intragenomic recombination enables bacteria to rapidly acquire new mechanisms of survival and antibiotic resistance. New technologies such as rapid whole genome sequencing of bacterial genomes, ultra-deep sequencing of RNA virus populations, metagenomic studies of environments rich in antibiotic resistance genes, and the use of microarrays for the detection and characterization of emerging pathogens provide mechanisms to address the challenges posed by the rapid emergence of pathogens. Bioinformatic algorithms that enable efficient analysis of the massive amounts of data generated by these technologies, as well as computational modeling of protein structures and evolutionary processes, need to be developed to allow the technology to fulfill its potential.
Liu, Xiaochuan; Freitas, Jaime; Zheng, Dinghai; Oliveira, Marta S; Hoque, Mainul; Martins, Torcato; Henriques, Telmo; Tian, Bin; Moreira, Alexandra
2017-12-01
Alternative polyadenylation (APA) is a mechanism that generates multiple mRNA isoforms with different 3'UTRs and/or coding sequences from a single gene. Here, using 3' region extraction and deep sequencing (3'READS), we have systematically mapped cleavage and polyadenylation sites (PASs) in Drosophila melanogaster, expanding the total repertoire of PASs previously identified for the species, especially those located in A-rich genomic sequences. Cis-element analysis revealed distinct sequence motifs around fly PASs when compared to mammalian ones, including the greater enrichment of upstream UAUA elements and the less prominent presence of downstream UGUG elements. We found that over 75% of mRNA genes in Drosophila melanogaster undergo APA. The head tissue tends to use distal PASs when compared to the body, leading to preferential expression of APA isoforms with long 3'UTRs as well as with distal terminal exons. The distance between the APA sites and intron location of PAS are important parameters for APA difference between body and head, suggesting distinct PAS selection contexts. APA analysis of the RpII215 C4 mutant strain, which harbors a mutant RNA polymerase II (RNAPII) with a slower elongation rate, revealed that a 50% decrease in transcriptional elongation rate leads to a mild trend of more usage of proximal, weaker PASs, both in 3'UTRs and in introns, consistent with the "first come, first served" model of APA regulation. However, this trend was not observed in the head, suggesting a different regulatory context in neuronal cells. Together, our data expand the PAS collection for Drosophila melanogaster and reveal a tissue-specific effect of APA regulation by RNAPII elongation rate. © 2017 Liu et al.; Published by Cold Spring Harbor Laboratory Press for the RNA Society.
Liu, Jun-Ying; Fan, Hui-Yan; Wang, Ying; Zhang, Yong-Liang; Li, Da-Wei; Yu, Jia-Lin; Han, Cheng-Gui
2017-01-01
Plant microRNAs (miRNAs) are a class of non-coding RNAs that play important roles in plant development, defense, and symptom development. Here, 547 known miRNAs representing 129 miRNA families, and 282 potential novel miRNAs were identified in Beta macrocarpa using small RNA deep sequencing. A phylogenetic analysis was performed, and 8 Beta lineage-specific miRNAs were identified. Through a differential expression analysis, miRNAs associated with Beet necrotic yellow vein virus (BNYVV) infection were identified and confirmed using a microarray analysis and stem-loop RT-qPCR. In total, 103 known miRNAs representing 38 miRNA families, and 45 potential novel miRNAs were differentially regulated, with at least a two-fold change, in BNYVV-infected plants compared with the mock-inoculated control. Targets of these differentially expressed miRNAs were also predicted by degradome sequencing. These differentially expressed miRNAs were involved in hormone biosynthesis and signal transduction pathways, and enhanced axillary bud development and plant defenses. This work is the first to describe miRNAs of the plant genus Beta and may offer a reference for miRNA research in other species in the genus. It provides valuable information on the pathogenicity mechanisms of BNYVV.
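The "at least a two-fold change" criterion in this abstract can be illustrated with a small filter on log2 fold changes. This is a sketch under stated assumptions: the miRNA names and normalized counts are hypothetical, the pseudocount of 1 is an arbitrary choice to avoid division by zero, and the study's actual pipeline (normalization, replicates, statistical testing) is not described in the abstract and is not reproduced here.

```python
import math

def log2_fold_change(infected, mock, pseudocount=1.0):
    """log2 ratio of infected to mock expression, with a pseudocount to avoid log(0)."""
    return math.log2((infected + pseudocount) / (mock + pseudocount))

def two_fold_changed(infected_counts, mock_counts, pseudocount=1.0):
    """Return miRNAs whose expression differs by at least two-fold (|log2FC| >= 1)."""
    changed = {}
    for mirna in sorted(infected_counts.keys() & mock_counts.keys()):
        lfc = log2_fold_change(infected_counts[mirna], mock_counts[mirna], pseudocount)
        if abs(lfc) >= 1.0:
            changed[mirna] = lfc
    return changed

# Hypothetical normalized read counts.
infected_counts = {"miR-x1": 399, "miR-x2": 99, "miR-x3": 24}
mock_counts = {"miR-x1": 99, "miR-x2": 99, "miR-x3": 99}

changed = two_fold_changed(infected_counts, mock_counts)
```

With these numbers, miR-x1 is up four-fold, miR-x3 is down four-fold, and miR-x2 is unchanged and therefore filtered out.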
Identification and comparative analysis of drought-associated microRNAs in two cowpea genotypes.
Barrera-Figueroa, Blanca E; Gao, Lei; Diop, Ndeye N; Wu, Zhigang; Ehlers, Jeffrey D; Roberts, Philip A; Close, Timothy J; Zhu, Jian-Kang; Liu, Renyi
2011-09-17
Cowpea (Vigna unguiculata) is an important crop in arid and semi-arid regions and is a good model for studying drought tolerance. MicroRNAs (miRNAs) are known to play critical roles in plant stress responses, but drought-associated miRNAs have not been identified in cowpea. In addition, it is not understood how miRNAs might contribute to different capacities of drought tolerance in different cowpea genotypes. We generated deep sequencing small RNA reads from two cowpea genotypes (CB46, drought-sensitive, and IT93K503-1, drought-tolerant) grown under well-watered and drought stress conditions. We mapped small RNA reads to cowpea genomic sequences and identified 157 miRNA genes that belong to 89 families. Among 44 drought-associated miRNAs, 30 were upregulated under drought conditions and 14 were downregulated. Although miRNA expression was in general consistent between the two genotypes, we found that nine miRNAs were predominantly or exclusively expressed in one of the two genotypes and that 11 miRNAs were drought-regulated in only one genotype, but not the other. These results suggest that miRNAs may play important roles in drought tolerance in cowpea and may be a key factor in determining the level of drought tolerance in different cowpea genotypes.
Molecular characterization of emaraviruses associated with Pigeonpea sterility mosaic disease.
Kumar, Surender; Subbarao, B L; Hallan, Vipin
2017-09-19
Sterility Mosaic Disease (SMD) of pigeonpea (Cajanus cajan (L.) Millspaugh) is a complex disease due to various factors including the presence of a mixed infection. Comparison of dsRNA profile and small RNA (sRNA) deep sequencing analysis of samples from three locations revealed the presence of Pigeonpea sterility mosaic virus-I and II (PPSMV-I and II) from Chevella and only PPSMV-II from Bengaluru and Coimbatore. The PPSMV-I genome consisted of four RNAs, whereas that of PPSMV-II encompassed six. The two viruses have modest sequence homology between their corresponding RNA 1-4 encoding RdRp, glycoprotein precursor, nucleocapsid and movement proteins and the corresponding orthologs of other emaraviruses. However, PPSMV-II is more closely related to Fig mosaic virus (FMV) than to PPSMV-I. An ELISA-based detection methodology was standardized to identify each of these two viruses uniquely. Mite inoculation of sub-isolate Chevella sometimes resulted in few to many pigeonpea plants containing PPSMV-I alone. The study shows that (i) the N-terminal region of RdRp (SRD-1) of both viruses contains a "cap-snatching" endonuclease domain and a 13-amino-acid cap-binding site at the C-terminus, essential for viral cap-dependent transcription similar to the members of the Bunyaviridae family and (ii) P4 is the movement protein and may belong to the '30 K superfamily' of MPs.
NASA Astrophysics Data System (ADS)
Dong, Y.; Cann, I.; Mackie, R.; Price, N.; Flynn, T. M.; Sanford, R.; Miller, P.; Chia, N.; Kumar, C. G.; Kim, P.; Sivaguru, M.; Fouke, B. W.
2010-12-01
Knowledge of the composition, structure and activity of microbial communities that live in deeply buried sedimentary rocks is fundamental to the future of subsurface biosphere stewardship as it relates to hydrocarbon exploration and extraction, carbon sequestration, gas storage and groundwater management. However, the study of indigenous subsurface microorganisms has been limited by the technical challenges of collecting deep formation water samples that have not been heavily contaminated by the mud used to drill the wells. To address this issue, a “clean-sampling method” deploying the newly developed Schlumberger Quicksilver MDT probe was used to collect a subsurface sample at a depth of 1.79 km (5872 ft) from an exploratory well within Cambrian-age sandstones in the Illinois Basin. This yielded a formation water sample that was determined to have less than 4% drilling mud contamination based on tracking changes in the aqueous geochemistry of the formation water during ~3 hours of pumping at depth prior to sample collection. A suite of microscopy and culture-independent molecular analyses were completed using the DNA extracted from microbial cells in the formation water, which included 454 amplicon pyrosequencing that targeted the V1-V3 hypervariable region of bacterial 16S rRNA gene sequences. Results demonstrated an extremely low diversity microbial community living in formation water at 1.79 km-depth. More than 95 % of the total V1-V3 pyrosequencing reads (n=11574) obtained from the formation water were affiliated with a halophilic γ-proteobacterium and most closely related to the genus Halomonas. In contrast, about 3 % of the V1-V3 sequences in the drilling mud library (n=13044) were classified as genus Halomonas but were distinctly different and distantly related to the formation water Halomonas detected at 1.79 km-depth. 
These results were consistent with those obtained using a suite of other molecular screens (e.g., Terminal-Restriction Fragment Length Polymorphism (T-RFLP) and the initial full length 16S rRNA amplicon libraries) and bioinformatic analyses (e.g., 16S rRNA and Open Reading Frame (ORF) calls established from the 454 metagenomic community analyses). Functional pathway modeling is underway to evaluate the adaptation of this indigenous microbial community to the hydrologic and geologic history of the deep subsurface environment of the Illinois Basin.
GRID-seq reveals the global RNA-chromatin interactome
Li, Xiao; Zhou, Bing; Chen, Liang; Gou, Lan-Tao; Li, Hairi; Fu, Xiang-Dong
2017-01-01
Higher eukaryotic genomes are bound by a large number of coding and non-coding RNAs, but approaches to comprehensively map the identity and binding sites of these RNAs are lacking. Here we report a method to in situ capture global RNA interactions with DNA by deep sequencing (GRID-seq), which enables the comprehensive identification of the entire repertoire of chromatin-interacting RNAs and their respective binding sites. In human, mouse and Drosophila cells, we detected a large set of tissue-specific coding and non-coding RNAs that are bound to active promoters and enhancers, especially super-enhancers. Assuming that most mRNA-chromatin interactions indicate the physical proximity of a promoter and an enhancer, we constructed a three-dimensional global connectivity map of promoters and enhancers, revealing transcription activity-linked genomic interactions in the nucleus. PMID:28922346
High-Resolution Analysis of Coronavirus Gene Expression by RNA Sequencing and Ribosome Profiling
Jones, Joshua D.; Chung, Betty Y.-W.; Siddell, Stuart G.; Brierley, Ian
2016-01-01
Members of the family Coronaviridae have the largest genomes of all RNA viruses, typically in the region of 30 kilobases. Several coronaviruses, such as Severe acute respiratory syndrome-related coronavirus (SARS-CoV) and Middle East respiratory syndrome-related coronavirus (MERS-CoV), are of medical importance, with high mortality rates and, in the case of SARS-CoV, significant pandemic potential. Other coronaviruses, such as Porcine epidemic diarrhea virus and Avian coronavirus, are important livestock pathogens. Ribosome profiling is a technique which exploits the capacity of the translating ribosome to protect around 30 nucleotides of mRNA from ribonuclease digestion. Ribosome-protected mRNA fragments are purified, subjected to deep sequencing and mapped back to the transcriptome to give a global “snap-shot” of translation. Parallel RNA sequencing allows normalization by transcript abundance. Here we apply ribosome profiling to cells infected with Murine coronavirus, mouse hepatitis virus, strain A59 (MHV-A59), a model coronavirus in the same genus as SARS-CoV and MERS-CoV. The data obtained allowed us to study the kinetics of virus transcription and translation with exquisite precision. We studied the timecourse of positive and negative-sense genomic and subgenomic viral RNA production and the relative translation efficiencies of the different virus ORFs. Virus mRNAs were not found to be translated more efficiently than host mRNAs; rather, virus translation dominates host translation at later time points due to high levels of virus transcripts. Triplet phasing of the profiling data allowed precise determination of translated reading frames and revealed several translated short open reading frames upstream of, or embedded within, known virus protein-coding regions. Ribosome pause sites were identified in the virus replicase polyprotein pp1a ORF and investigated experimentally. 
Contrary to expectations, ribosomes were not found to pause at the ribosomal frameshift site. To our knowledge this is the first application of ribosome profiling to an RNA virus. PMID:26919232
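The triplet phasing described in this abstract can be sketched as a simple frame tally: because translating ribosomes advance one codon at a time, footprint positions should concentrate in one of the three reading frames. This is a minimal illustration under stated assumptions, not the authors' pipeline: the positions are hypothetical, and real analyses first apply a read-length-dependent offset from the 5' end of each read, which is omitted here.

```python
from collections import Counter


def frame_fractions(read_starts, cds_start):
    """Fraction of read positions falling in frames 0, 1 and 2 relative to cds_start."""
    counts = Counter((pos - cds_start) % 3 for pos in read_starts)
    total = sum(counts.values())
    if total == 0:
        return (0.0, 0.0, 0.0)
    return tuple(counts.get(frame, 0) / total for frame in range(3))


# Hypothetical footprint positions on a transcript whose ORF starts at position 100
positions = [100, 103, 106, 109, 112, 115, 118, 121, 101, 105]
fractions = frame_fractions(positions, cds_start=100)
```

A strong bias toward one frame suggests genuine translation in that frame, whereas a roughly even split across the three frames would be expected for non-ribosomal background.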
Computational RNomics of Drosophilids
Rose, Dominic; Hackermüller, Jörg; Washietl, Stefan; Reiche, Kristin; Hertel, Jana; Findeiß, Sven; Stadler, Peter F; Prohaska, Sonja J
2007-01-01
Background Recent experimental and computational studies have provided overwhelming evidence for a plethora of diverse transcripts that are unrelated to protein-coding genes. One subclass consists of those RNAs that require distinctive secondary structure motifs to exert their biological function and hence exhibit distinctive patterns of sequence conservation characteristic for positive selection on RNA secondary structure. The deep-sequencing of 12 drosophilid species coordinated by the NHGRI provides an ideal data set for comparative computational approaches to determine those genomic loci that code for evolutionarily conserved RNA motifs. This class of loci includes the majority of the known small ncRNAs as well as structured RNA motifs in mRNAs. We report here on a genome-wide survey using RNAz. Results We obtain 16 000 high-quality predictions among which we recover the majority of the known ncRNAs. Taking a pessimistically estimated false discovery rate of 40% into account, this implies that at least some ten thousand loci in the Drosophila genome show the hallmarks of stabilizing selection acting on RNA structure, and hence are most likely functional at the RNA level. A subset of RNAz predictions overlapping with TRF1 and BRF binding sites [Isogai et al., EMBO J. 26: 79–89 (2007)], which are plausible candidates of Pol III transcripts, have been studied in more detail. Among these sequences we identify several "clusters" of ncRNA candidates with striking structural similarities. Conclusion The statistical evaluation of the RNAz predictions in comparison with a similar analysis of vertebrate genomes [Washietl et al., Nat. Biotech. 23: 1383–1390 (2005)] shows that qualitatively similar fractions of structured RNAs are found in introns, UTRs, and intergenic regions. 
The intergenic RNA structures, however, are concentrated much more closely around known protein-coding loci, suggesting that flies have a significantly smaller complement of independent structured ncRNAs compared to mammals. PMID:17996037
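The step from 16 000 predictions and a 40% false discovery rate to "some ten thousand loci" in this abstract is simple arithmetic: the expected number of true positives is the prediction count times one minus the FDR. The sketch below only restates that calculation; the function name is illustrative and not taken from RNAz.

```python
def expected_true_positives(n_predictions, false_discovery_rate):
    """Expected number of correct predictions given an estimated FDR."""
    if not 0.0 <= false_discovery_rate <= 1.0:
        raise ValueError("false_discovery_rate must lie between 0 and 1")
    return n_predictions * (1.0 - false_discovery_rate)


# Figures quoted in the abstract: 16 000 predictions, pessimistic FDR of 40%
estimate = expected_true_positives(16000, 0.40)
```

With these inputs the estimate comes to about 9 600 loci, which is on the order of the ten thousand quoted.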
MicroRNA and Transcription Factor: Key Players in Plant Regulatory Network
Samad, Abdul F. A.; Sajad, Muhammad; Nazaruddin, Nazaruddin; Fauzi, Izzat A.; Murad, Abdul M. A.; Zainal, Zamri; Ismail, Ismanizan
2017-01-01
Recent achievements in research on plant microRNAs (miRNAs), a large class of small non-coding RNAs, are very exciting. A wide array of techniques, including forward genetics, molecular cloning, bioinformatic analysis, and the latest technology, deep sequencing, has greatly advanced miRNA discovery. A tiny miRNA sequence has the ability to target single or multiple mRNAs. Most miRNA targets are transcription factors (TFs), which are of paramount importance in regulating plant growth and development. Various families of TFs, which regulate a range of regulatory networks, may assist plants to grow under normal and stress environmental conditions. This review focuses on the regulatory relationships between miRNAs and different families of TFs such as NF-Y, MYB, AP2, TCP, WRKY, NAC, GRF, and SPL. For instance, NF-Y plays an important role in drought tolerance and flower development, MYB is involved in signal transduction and biosynthesis of secondary metabolites, AP2 regulates floral development and nodule formation, and TCP directs leaf development and growth hormone signaling. WRKY has known roles in multiple stress tolerances, NAC regulates lateral root formation, GRF is involved in root growth and in flower and seed development, and SPL regulates the plant transition from juvenile to adult. We also studied the relation between miRNAs and TFs by consolidating the research findings from different plant species, which will help plant scientists in understanding the mechanism of action and interaction between these regulators in plant growth and development under normal and stress environmental conditions. PMID:28446918
Kinoshita, Natsuko; Wang, Huan; Kasahara, Hiroyuki; Liu, Jun; MacPherson, Cameron; Machida, Yasunori; Kamiya, Yuji; Hannah, Matthew A.; Chua, Nam-Hai
2012-01-01
The functions of microRNAs and their target mRNAs in Arabidopsis thaliana development have been widely documented; however, roles of stress-responsive microRNAs and their targets are not as well understood. Using small RNA deep sequencing and ATH1 microarrays to profile mRNAs, we identified IAA-Ala Resistant3 (IAR3) as a new target of miR167a. As expected, IAR3 mRNA was cleaved at the miR167a complementary site and under high osmotic stress miR167a levels decreased, whereas IAR3 mRNA levels increased. IAR3 hydrolyzes an inactive form of auxin (indole-3-acetic acid [IAA]-alanine) and releases bioactive auxin (IAA), a central phytohormone for root development. In contrast with the wild type, iar3 mutants accumulated reduced IAA levels and did not display high osmotic stress–induced root architecture changes. Transgenic plants expressing a cleavage-resistant form of IAR3 mRNA accumulated high levels of IAR3 mRNAs and showed increased lateral root development compared with transgenic plants expressing wild-type IAR3. Expression of an inducible noncoding RNA to sequester miR167a by target mimicry led to an increase in IAR3 mRNA levels, further confirming the inverse relationship between the two partners. Sequence comparison revealed the miR167 target site on IAR3 mRNA is conserved in evolutionarily distant plant species. Finally, we showed that IAR3 is required for drought tolerance. PMID:22960911
Recurrent and functional regulatory mutations in breast cancer.
Rheinbay, Esther; Parasuraman, Prasanna; Grimsby, Jonna; Tiao, Grace; Engreitz, Jesse M; Kim, Jaegil; Lawrence, Michael S; Taylor-Weiner, Amaro; Rodriguez-Cuevas, Sergio; Rosenberg, Mara; Hess, Julian; Stewart, Chip; Maruvka, Yosef E; Stojanov, Petar; Cortes, Maria L; Seepo, Sara; Cibulskis, Carrie; Tracy, Adam; Pugh, Trevor J; Lee, Jesse; Zheng, Zongli; Ellisen, Leif W; Iafrate, A John; Boehm, Jesse S; Gabriel, Stacey B; Meyerson, Matthew; Golub, Todd R; Baselga, Jose; Hidalgo-Miranda, Alfredo; Shioda, Toshi; Bernards, Andre; Lander, Eric S; Getz, Gad
2017-07-06
Genomic analysis of tumours has led to the identification of hundreds of cancer genes on the basis of the presence of mutations in protein-coding regions. By contrast, much less is known about cancer-causing mutations in non-coding regions. Here we perform deep sequencing in 360 primary breast cancers and develop computational methods to identify significantly mutated promoters. Clear signals are found in the promoters of three genes. FOXA1, a known driver of hormone-receptor positive breast cancer, harbours a mutational hotspot in its promoter leading to overexpression through increased E2F binding. RMRP and NEAT1, two non-coding RNA genes, carry mutations that affect protein binding to their promoters and alter expression levels. Our study shows that promoter regions harbour recurrent mutations in cancer with functional consequences and that the mutations occur at similar frequencies as in coding regions. Power analyses indicate that more such regions remain to be discovered through deep sequencing of adequately sized cohorts of patients.
Archaeal Diversity in Waters from Deep South African Gold Mines
Takai, Ken; Moser, Duane P.; DeFlaun, Mary; Onstott, Tullis C.; Fredrickson, James K.
2001-01-01
A culture-independent molecular analysis of archaeal communities in waters collected from deep South African gold mines was performed using a PCR-mediated terminal restriction fragment length polymorphism (T-RFLP) analysis of rRNA genes (rDNA) in conjunction with a sequencing analysis of archaeal rDNA clone libraries. The water samples used represented various environments, including deep fissure water, mine service water, and water from an overlying dolomite aquifer. T-RFLP analysis revealed that the ribotype distribution of archaea varied with the source of water. The archaeal communities in the deep gold mine environments exhibited great phylogenetic diversity; the majority of the members were most closely related to uncultivated species. Some archaeal rDNA clones obtained from mine service water and dolomite aquifer water samples were most closely related to environmental rDNA clones from surface soil (soil clones) and marine environments (marine group I [MGI]). Other clones exhibited intermediate phylogenetic affiliation between soil clones and MGI in the Crenarchaeota. Fissure water samples, derived from active or dormant geothermal environments, yielded archaeal sequences that exhibited novel phylogeny, including a novel lineage of Euryarchaeota. These results suggest that deep South African gold mines harbor novel archaeal communities distinct from those observed in other environments. Based on the phylogenetic analysis of archaeal strains and rDNA clones, including the newly discovered archaeal rDNA clones, the evolutionary relationship and the phylogenetic organization of the domain Archaea are reevaluated. PMID:11722932
Analysis of microRNA profile of Anopheles sinensis by deep sequencing and bioinformatic approaches.
Feng, Xinyu; Zhou, Xiaojian; Zhou, Shuisen; Wang, Jingwen; Hu, Wei
2018-03-12
MicroRNAs (miRNAs) are small non-coding RNAs widely identified in many mosquitoes. They are reported to play important roles in development, differentiation and innate immunity. However, miRNAs in Anopheles sinensis, one of the Chinese malaria mosquitoes, remain largely unknown. We investigated the global miRNA expression profile of An. sinensis using Illumina HiSeq 2000 sequencing. In parallel, we applied a bioinformatic approach to identify potential miRNAs in An. sinensis. The miRNA profiles identified by the two approaches were compared and analyzed. The selected miRNAs from the sequencing result and the bioinformatic approach were confirmed with qRT-PCR. Moreover, target prediction, GO annotation and pathway analysis were carried out to understand the role of miRNAs in An. sinensis. We identified 49 conserved miRNAs and 12 novel miRNAs by next-generation high-throughput sequencing technology. In contrast, 43 miRNAs were predicted by the bioinformatic approach, of which two were assigned as novel. Comparative analysis of miRNA profiles by the two approaches showed that 21 miRNAs were shared between them. Twelve novel miRNAs did not match any known miRNAs of any organism, indicating that they are possibly species-specific. Forty miRNAs were found in many mosquito species, indicating that these miRNAs are evolutionarily conserved and may have critical roles in the process of life. Both the selected known and novel miRNAs (asi-miR-281, asi-miR-184, asi-miR-14, asi-miR-nov5, asi-miR-nov4, asi-miR-9383, and asi-miR-2a) could be detected by quantitative real-time PCR (qRT-PCR) in the sequenced sample, and the expression patterns of these miRNAs measured by qRT-PCR were in concordance with the original miRNA sequencing data. The predicted targets for the known and the novel miRNAs covered many important biological roles and pathways, indicating the diversity of miRNA functions. We also found 21 conserved miRNAs and eight counterparts of target immune pathway genes in An. 
sinensis based on the analysis of An. gambiae. Our results provide the first lead to the elucidation of the miRNA profile in An. sinensis. Unveiling the roles of mosquito miRNAs will undoubtedly lead to a better understanding of mosquito biology and mosquito-pathogen interactions. This work lays the foundation for the further functional study of An. sinensis miRNAs and will facilitate their application in vector control.
Environmental surveillance of viruses by tangential flow filtration and metagenomic reconstruction.
Furtak, Vyacheslav; Roivainen, Merja; Mirochnichenko, Olga; Zagorodnyaya, Tatiana; Laassri, Majid; Zaidi, Sohail Z; Rehman, Lubna; Alam, Muhammad M; Chizhikov, Vladimir; Chumakov, Konstantin
2016-04-14
An approach is proposed for environmental surveillance of poliovirus by concentrating sewage samples with tangential flow filtration (TFF) followed by deep sequencing of viral RNA. Subsequent to testing the method with samples from Finland, samples from Pakistan, a country endemic for poliovirus, were investigated. Genomic sequencing was either performed directly, for unbiased identification of viruses regardless of their ability to grow in cell cultures, or after virus enrichment by cell culture or immunoprecipitation. Bioinformatics enabled separation and determination of individual consensus sequences. Overall, deep sequencing of the entire viral population identified polioviruses, non-polio enteroviruses, and other viruses. In Pakistani sewage samples, adeno-associated virus, unable to replicate autonomously in cell cultures, was the most abundant human virus. The presence of recombinants of wild polioviruses of serotype 1 (WPV1) was also inferred, whereby currently circulating WPV1 of south-Asian (SOAS) lineage comprised two sub-lineages depending on their non-capsid region origin. Complete genome analyses additionally identified point mutants and intertypic recombinants between attenuated Sabin strains in the Pakistani samples, and in one Finnish sample. The approach could allow rapid environmental surveillance of viruses causing human infections. It creates a permanent digital repository of the entire virome potentially useful for retrospective screening of future discovered viruses.
Egge, Elianne S; Eikrem, Wenche; Edvardsen, Bente
2015-01-01
Microalgae in the division Haptophyta may be difficult to identify to species by microscopy because they are small and fragile. Here, we used high-throughput sequencing to explore the diversity of haptophytes in outer Oslofjorden, Skagerrak, and supplemented this with electron microscopy. Nano- and picoplanktonic subsurface samples were collected monthly for 2 yr, and the haptophytes were targeted by amplification of RNA/cDNA with Haptophyta-specific 18S ribosomal DNA V4 primers. Pyrosequencing revealed higher species richness of haptophytes than previously observed in the Skagerrak by microscopy. From ca. 400,000 reads we obtained 156 haptophyte operational taxonomic units (OTUs) after rigorous filtering and 99.5% clustering. The majority (84%) of the OTUs matched environmental sequences not linked to a morphological species, most of which were affiliated with the order Prymnesiales. Phylogenetic analyses including Oslofjorden OTUs and available cultured and environmental haptophyte sequences showed that several of the OTUs matched sequences forming deep-branching lineages, potentially representing novel haptophyte classes. Pyrosequencing also retrieved cultured species not previously reported by microscopy in the Skagerrak. Electron microscopy revealed species not yet genetically characterised and some potentially novel taxa. This study contributes to linking genotype to phenotype within this ubiquitous and ecologically important protist group, and reveals great, unknown diversity. PMID:25099994
Vallvé-Juanico, Júlia; Suárez-Salvador, Elena; Castellví, Josep; Ballesteros, Agustín; Taylor, Hugh S; Gil-Moreno, Antonio; Santamaria, Xavier
2017-11-01
Objective To characterize leucine-rich repeat containing G protein-coupled receptor 5-positive (LGR5+) cells from the endometrium of women with endometriosis. Design Prospective experimental study. Setting University hospital/fertility clinic. Patient(s) Twenty-seven women with endometriosis who underwent surgery and 12 healthy egg donors, together comprising 39 endometrial samples. Intervention(s) Uterine aspirates obtained using a Cornier Pipelle. Main Outcome Measure(s) Immunofluorescence in formalin-fixed paraffin-embedded tissue from mice and healthy and pathologic human endometrium using antibodies against LGR5, E-cadherin, and cytokeratin, and epithelial and stromal LGR5+ cells isolated from healthy and pathologic human eutopic endometrium by fluorescence-activated cell sorting and transcriptomic characterization by RNA high sequencing. Result(s) Immunofluorescence showed that LGR5+ cells colocalized with epithelial markers in the stroma of the endometrium only in endometriotic patients. The results from RNA high sequencing of LGR5+ cells from epithelium and stroma did not show any statistically significant differences between them. The LGR5+ versus LGR5- cells in pathologic endometrium showed 394 differentially expressed genes. The LGR5+ cells in deep-infiltrating endometriosis expressed inflammatory markers not present in the other types of the disease. Conclusion(s) Our results revealed the presence of aberrantly located LGR5+ cells coexpressing epithelial markers in the stromal compartment of women with endometriosis. These cells have a statistically significantly different expression profile in deep-infiltrating endometriosis in comparison with other types of endometriosis, independent of the menstrual cycle phase. Further studies are needed to elucidate their role and influence in reproductive outcomes. Copyright © 2017. Published by Elsevier Inc.
Diverse molecular signatures for ribosomally ‘active’ Perkinsea in marine sediments
2014-01-01
Background Perkinsea are a parasitic lineage within the eukaryotic superphylum Alveolata. Recent studies making use of environmental small sub-unit ribosomal RNA gene (SSU rDNA) sequencing methodologies have detected a significant diversity and abundance of Perkinsea-like phylotypes in freshwater environments. In contrast only a few Perkinsea environmental sequences have been retrieved from marine samples and only two groups of Perkinsea have been cultured and morphologically described and these are parasites of marine molluscs or marine protists. These two marine groups form separate and distantly related phylogenetic clusters, composed of closely related lineages on SSU rDNA trees. Here, we test the hypothesis that Perkinsea are a hitherto under-sampled group in marine environments. Using 454 diversity ‘tag’ sequencing we investigate the diversity and distribution of these protists in marine sediments and water column samples taken from the Deep Chlorophyll Maximum (DCM) and sub-surface using both DNA and RNA as the source template and sampling four European offshore locations. Results We detected the presence of 265 sequences branching with known Perkinsea, the majority of them recovered from marine sediments. Moreover, 27% of these sequences were sampled from RNA derived cDNA libraries. Phylogenetic analyses classify a large proportion of these sequences into 38 cluster groups (including 30 novel marine cluster groups), which share less than 97% sequence similarity suggesting this diversity encompasses a range of biologically and ecologically distinct organisms. Conclusions These results demonstrate that the Perkinsea lineage is considerably more diverse than previously detected in marine environments. This wide diversity of Perkinsea-like protists is largely retrieved in marine sediment with a significant proportion detected in RNA derived libraries suggesting this diversity represents ribosomally ‘active’ and intact cells. 
Given the phylogenetic range of hosts infected by known Perkinsea parasites, these data suggest that Perkinsea either play a significant but hitherto unrecognized role as parasites in marine sediments and/or members of this group are present in the marine sediment possibly as part of the ‘seed bank’ microbial community. PMID:24779375
Feliu, Neus; Kohonen, Pekka; Ji, Jie; Zhang, Yuning; Karlsson, Hanna L; Palmberg, Lena; Nyström, Andreas; Fadeel, Bengt
2015-01-27
Gene expression profiling has developed rapidly in recent years with the advent of deep sequencing technologies such as RNA sequencing (RNA Seq) and could be harnessed to predict and define mechanisms of toxicity of chemicals and nanomaterials. However, the full potential of these technologies in (nano)toxicology is yet to be realized. Here, we show that systems biology approaches can uncover mechanisms underlying cellular responses to nanomaterials. Using RNA Seq and computational approaches, we found that cationic poly(amidoamine) dendrimers (PAMAM-NH2) are capable of triggering down-regulation of cell-cycle-related genes in primary human bronchial epithelial cells at doses that do not elicit acute cytotoxicity, as demonstrated using conventional cell viability assays, while gene transcription was not affected by neutral PAMAM-OH dendrimers. The PAMAMs were internalized in an active manner by lung cells and localized mainly in lysosomes; amine-terminated dendrimers were internalized more efficiently when compared to the hydroxyl-terminated dendrimers. Upstream regulator analysis implicated NF-κB as a putative transcriptional regulator, and subsequent cell-based assays confirmed that PAMAM-NH2 caused NF-κB-dependent cell cycle arrest. However, PAMAM-NH2 did not affect cell cycle progression in the human A549 adenocarcinoma cell line. These results demonstrate the feasibility of applying systems biology approaches to predict cellular responses to nanomaterials and highlight the importance of using relevant (primary) cell models.
Acebo, Paloma; Martin-Galiano, Antonio J.; Navarro, Sara; Zaballos, Ángel; Amblar, Mónica
2012-01-01
Streptococcus pneumoniae is the main etiological agent of community-acquired pneumonia and a major cause of mortality and morbidity among children and the elderly. Genome sequencing of several pneumococcal strains revealed valuable information about the potential proteins and genetic diversity of this prevalent human pathogen. However, little is known about its transcriptional regulation and its small regulatory noncoding RNAs. In this study, we performed deep sequencing of the S. pneumoniae TIGR4 strain RNome to identify small regulatory RNA candidates expressed in this pathogen. We discovered 1047 potential small RNAs including intragenic, 5′- and/or 3′-overlapping RNAs and 88 small RNAs encoded in intergenic regions. With this approach, we recovered many of the previously identified intergenic small RNAs and identified 68 novel candidates, most of which are conserved in both sequence and genomic context in other S. pneumoniae strains. We confirmed the independent expression of 17 intergenic small RNAs and predicted putative mRNA targets for six of them using bioinformatics tools. Preliminary results suggest that one of these six is a key player in the regulation of competence development. This study is the biggest catalog of small noncoding RNAs reported to date in S. pneumoniae and provides a highly complete view of the small RNA network in this pathogen. PMID:22274957
Urios, Laurent; Intertaglia, Laurent; Magot, Michel
2013-01-01
A Gram-negative bacterium, designated TF5-37.2-LB10(T), was isolated from subsurface water of the Toarcian geological layer of Tournemire, France. Cells were non-motile straight rods that formed cream to light pink colonies on 10-fold diluted LB agar. Strain TF5-37.2-LB10(T) contained menaquinone 7 and its major fatty acids were iso-C(15 : 0), summed feature 3 (iso-C(15 : 0) 2-OH and/or C(16 : 1)ω7c), iso-C(17 : 0) 3-OH and iso-C(17 : 1)ω9c. The G+C content of the genomic DNA was 46 mol%. Phylogenetic analysis of the 16S rRNA gene sequence placed strain TF5-37.2-LB10(T) within the genus Pedobacter, family Sphingobacteriaceae. Pedobacter composti TR6-06(T) and Pedobacter oryzae DSM 19973(T) were the closest phylogenetic relatives (93.5 and 93.3 % 16S rRNA gene sequence similarity, respectively). On the basis of 16S rRNA gene sequence comparison and physiological and biochemical characteristics, strain TF5-37.2-LB10(T) represents a novel species of the genus Pedobacter, for which the name Pedobacter tournemirensis sp. nov. is proposed. The type strain is TF5-37.2-LB10(T) (= DSM 23085(T) = CIP 110085(T) = MOLA 820(T)).
Bigot, Diane; Atyame, Célestine M; Weill, Mylène; Justy, Fabienne
2018-01-01
In the global context of arboviral emergence, deep sequencing unlocks the discovery of new mosquito-borne viruses. Mosquitoes of the species Culex pipiens, C. torrentium, and C. hortensis were sampled from 22 locations worldwide for transcriptomic analyses. A virus discovery pipeline was used to analyze the dataset of 0.7 billion reads comprising 22 individual transcriptomes. Two closely related 6.8 kb viral genomes were identified in C. pipiens and named Culex pipiens associated Tunisia virus (CpATV) strains Ayed and Jedaida. The CpATV genome contained four ORFs. ORF1 possessed helicase and RNA-dependent RNA polymerase (RdRp) domains related to new viral sequences recently found mainly in dipterans. ORF2 and ORF4 contained a capsid protein domain showing strong homology with Virgaviridae plant viruses. ORF3 displayed similarities with a eukaryotic Rhoptry domain and a merozoite surface protein (MSP7) domain only found in mosquito-transmitted Plasmodium, suggesting possible interactions between CpATV and vertebrate cells. Estimation of a strong purifying selection exerted on each ORF and the presence of a polymorphism maintained in the coding region of ORF3 suggested that both CpATV sequences are genuine functional viruses. CpATV is part of an entirely new and highly diversified group of viruses recently found in insects, and bears the genomic hallmarks of a new viral family. PMID:29340209
Santamaria, Monica; Fosso, Bruno; Licciulli, Flavio; Balech, Bachir; Larini, Ilaria; Grillo, Giorgio; De Caro, Giorgio; Liuni, Sabino
2018-01-01
A holistic understanding of environmental communities is the new challenge of metagenomics. Accordingly, the amplicon-based or metabarcoding approach, largely applied to investigate bacterial microbiomes, is moving to the eukaryotic world too. Indeed, the analysis of metabarcoding data may provide a comprehensive assessment of both bacterial and eukaryotic composition in a variety of environments, including the human body. In this respect, whereas hypervariable regions of the 16S rRNA are the de facto standard barcode for bacteria, the Internal Transcribed Spacer 1 (ITS1) of the ribosomal RNA gene cluster has shown a high potential in discriminating eukaryotes at deep taxonomic levels. As metabarcoding data analysis relies on the availability of a well-curated barcode reference resource, a comprehensive collection of ITS1 sequences supplied with robust taxonomies is highly needed. To address this issue, we created ITSoneDB (available at http://itsonedb.cloud.ba.infn.it/), which in its current version hosts 985 240 ITS1 sequences spanning over 134 000 eukaryotic species. Each ITS1 is mapped on the NCBI reference taxonomy with its start and end positions precisely annotated. ITSoneDB has been developed in agreement with the FAIR guidelines by enabling users to query and download its content through a simple web interface and to access relevant metadata by cross-linking to the European Nucleotide Archive. PMID:29036529
Tsutsui, Kenta; Sato, Tomomi
2018-06-15
Actinoporins are pore-forming proteins found in sea anemones. Although we now have a large collection of data on actinoporins, our knowledge is based heavily on those identified in shallow-water anemones. Because the deep sea differs considerably from shallow waters in hydrostatic pressure, temperature, and prey composition, deep-sea actinoporins may have evolved in unique ways. This study, therefore, aimed to obtain new actinoporins from the deep-sea anemone Cribrinopsis japonica (Actiniaria, Actiniidae). An actinoporin-like sequence was identified from the previously established C. japonica RNA-Seq database, and the complete length (663 bp) of the deep-sea actinoporin gene, Cjtox I, was obtained. In addition, a similar gene, Cjtox II (666 bp), was also identified from RNA of the actinopharynx. CJTOX I and CJTOX II were similar in their primary structures, but CJTOX I lacked one residue in the middle of the protein. There was also a difference in gene expression in live animals, where only Cjtox I was expressed in tentacles of C. japonica. In heterologous expression, where the BL21 (DE3) strain was retransformed with a plasmid containing either the Cjtox I or the Cjtox II gene, the supernatants of both cell lysates showed hemolytic activity on equine erythrocytes. Preincubation of the supernatants with sphingomyelin reduced the activity, implying that CJTOX I and II target sphingomyelin, as other actinoporins do. Because of their structural similarity to known actinoporins and their sphingomyelin-inhibitable hemolytic activity, both CJTOX I and II were concluded to be new actinoporins, identified for the first time from a deep-sea anemone. Copyright © 2018 Elsevier Ltd. All rights reserved.
Kalyzhnaya, O V; Itskovich, V B
2014-07-01
The diversity of bacteria associated with the deep-water sponge Baikalospongia intermedia was evaluated by sequence analysis of 16S rRNA genes from two sponge samples collected in Lake Baikal from depths of 550 and 1204 m. A total of 64 operational taxonomic units, belonging to nine bacterial phyla, Proteobacteria (classes Alphaproteobacteria, Betaproteobacteria, Gammaproteobacteria, and Deltaproteobacteria), Actinobacteria, Planctomycetes, Chloroflexi, Verrucomicrobia, Acidobacteria, Chlorobi, and Nitrospirae, including candidate phylum WS5, were identified. Phylogenetic analysis showed that the examined communities contained phylotypes exhibiting homology to uncultured bacteria from different lake ecosystems, freshwater sediments, soil and geological formations. Moreover, a number of phylotypes were related to psychrophilic, methane-oxidizing, sulfate-reducing bacteria, and to microorganisms resistant to the influence of heavy metals. It seems likely that the unusual habitation conditions of deep-water sponges contribute to the taxonomic diversity of associated bacteria and have an influence on the presence of functionally important microorganisms in bacterial communities.
Smalheiser, Neil R; Lugli, Giovanni; Thimmapuram, Jyothi; Cook, Edwin H; Larson, John
2011-01-01
We previously proposed that endogenous siRNAs may regulate synaptic plasticity and long-term gene expression in the mammalian brain. Here, a hippocampal-dependent task was employed in which adult mice were trained to execute a nose-poke in a port containing one of two simultaneously present odors in order to obtain a reward. Mice demonstrating olfactory discrimination training were compared to pseudo-training and nose-poke control groups; size-selected hippocampal RNA was subjected to Illumina deep sequencing. Sequences that aligned uniquely and exactly to the genome without uncertain nucleotide assignments, within exons or introns of MGI annotated genes, were examined further. The data confirm that small RNAs having features of endogenous siRNAs are expressed in brain; that many of them derive from genes that regulate synaptic plasticity (and have been implicated in neuropsychiatric diseases); and that hairpin-derived endo-siRNAs and the 20- to 23-nt size class of small RNAs show a significant increase during an early stage of training. The most abundant putative siRNAs arose from an intronic inverted repeat within the SynGAP1 locus; this inverted repeat was a substrate for dicer in vitro, and SynGAP1 siRNA was specifically associated with Argonaute proteins in vivo. Unexpectedly, a dramatic increase with training (more than 100-fold) was observed for a class of 25- to 30-nt small RNAs derived from specific sites within snoRNAs and abundant noncoding RNAs (Y1 RNA, RNA component of mitochondrial RNAse P, 28S rRNA, and 18S rRNA). Further studies are warranted to characterize the role(s) played by endogenous siRNAs and noncoding RNA-derived small RNAs in learning and memory.
Mason, Olivia U; Hazen, Terry C; Borglin, Sharon; Chain, Patrick S G; Dubinsky, Eric A; Fortney, Julian L; Han, James; Holman, Hoi-Ying N; Hultman, Jenni; Lamendella, Regina; Mackelprang, Rachel; Malfatti, Stephanie; Tom, Lauren M; Tringe, Susannah G; Woyke, Tanja; Zhou, Jizhong; Rubin, Edward M; Jansson, Janet K
2012-09-01
The Deepwater Horizon oil spill in the Gulf of Mexico resulted in a deep-sea hydrocarbon plume that caused a shift in the indigenous microbial community composition with unknown ecological consequences. Early in the spill history, a bloom of uncultured, thus uncharacterized, members of the Oceanospirillales was previously detected, but their role in oil disposition was unknown. Here our aim was to determine the functional role of the Oceanospirillales and other active members of the indigenous microbial community using deep sequencing of community DNA and RNA, as well as single-cell genomics. Shotgun metagenomic and metatranscriptomic sequencing revealed that genes for motility, chemotaxis and aliphatic hydrocarbon degradation were significantly enriched and expressed in the hydrocarbon plume samples compared with uncontaminated seawater collected from plume depth. In contrast, although genes coding for degradation of more recalcitrant compounds, such as benzene, toluene, ethylbenzene, total xylenes and polycyclic aromatic hydrocarbons, were identified in the metagenomes, they were expressed at low levels, or not at all based on analysis of the metatranscriptomes. Isolation and sequencing of two Oceanospirillales single cells revealed that both cells possessed genes coding for n-alkane and cycloalkane degradation. Specifically, the near-complete pathway for cyclohexane oxidation in the Oceanospirillales single cells was elucidated and supported by both metagenome and metatranscriptome data. The draft genome also included genes for chemotaxis, motility and nutrient acquisition strategies that were also identified in the metagenomes and metatranscriptomes. These data point towards a rapid response of members of the Oceanospirillales to aliphatic hydrocarbons in the deep sea.
Chen, Yi-Guang; Li, Wen-Jun; Cui, Xiao-Long; Jiang, Cheng-Lin; Xu, Li-Hua
2006-10-01
A facultatively alkaliphilic actinomycete, strain YIM 90022, was isolated from hypersaline alkaline soil in Qinghai province, China. An almost-complete 16S rRNA gene sequence (1500 bp) for strain YIM 90022 was obtained. Phylogenetic analysis based on 16S rRNA gene sequences showed that strain YIM 90022 was closely related to four members of the genus Nocardiopsis with 16S rRNA gene sequence similarity values of 98.8% (N. exhalans DSM 44407T), 98.5% (N. prasina DSM 43845T), 98.4% (N. metallicus DSM 44598T) and 97.8% (N. listeri DSM 40297T), but represented a distinct phylogenetic lineage. Repetitive element sequence-based PCR (rep-PCR) genomic fingerprinting was evaluated on strain YIM 90022 and its closest relatives to investigate their genetic relatedness. The analysis of the rep-PCR genomic fingerprints showed that strain YIM 90022 was distinguishable from its closest relatives. The polyphasic taxonomic data presented in this study, including its morphology, physiological and biochemical characteristics, chemotaxonomy, 16S rRNA gene sequence-based phylogenetic analysis and rep-PCR genomic fingerprinting, supported the view that strain YIM 90022 represented a potential new species of the genus Nocardiopsis. The fermentation broth of strain YIM 90022 strongly inhibited growth of cell series of gastric cancer, lung cancer, mammary cancer, melanoma cancer, renal cancer and uterus cancer. Strain YIM 90022 grew well on most tested media, producing exuberant vegetative hyphae and aerial hyphae. The vegetative hyphae are long and fragmented. Light yellow to deep brown diffusible pigments were produced on ISP 2, ISP 3 and ISP 6. Growth of the strain occurred in the pH range 6.0-12.0, with an optimum at pH 8.5. The NaCl tolerance range was 0-15% (w/v). Cell walls contain meso-diaminopimelic acid and have no diagnostic sugars. Polar lipids are phosphatidylcholine, phosphatidylglycerol, diphosphatidylglycerol and phosphatidylmethylethanolamine. Major menaquinones are MK-10 (H4, H6). The DNA G + C content is 71.5 mol %.
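The G + C content quoted in taxonomic descriptions such as the one above is a simple base-composition ratio. As a minimal sketch only (the study does not state how its 71.5 mol % value was measured, and the function name `gc_content_percent` is a hypothetical helper), the ratio can be computed from a sequence as follows:

```python
def gc_content_percent(seq):
    """Return the G+C share of a nucleotide sequence as a percentage.

    Hypothetical helper for illustration; ambiguous symbols such as N
    are ignored rather than counted in the denominator.
    """
    bases = [b for b in seq.upper() if b in "ACGTU"]
    if not bases:
        raise ValueError("sequence contains no unambiguous bases")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


if __name__ == "__main__":
    # Toy sequence, not data from the study.
    print(gc_content_percent("GGGCCCAT"))
```

Ignoring ambiguous bases is a design choice; counting them in the denominator would bias the estimate downward for low-quality assemblies.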
Exploring Connectivity in Sequence Space of Functional RNA
NASA Technical Reports Server (NTRS)
Wei, Chenyu; Pohorille, Andrzej; Popovic, Milena; Ditzler, Mark
2017-01-01
Emergence of replicable genetic molecules was one of the marking points in the origin of life, evolution of which can be conceptualized as a walk through the space of all possible sequences. A theoretical concept of fitness landscape helps to understand evolutionary processes through assigning a value of fitness to each genotype. Then, evolution of a phenotype is viewed as a series of consecutive, single-point mutations. Natural selection biases evolution toward peaks of high fitness and away from valleys of low fitness, whereas neutral drift occurs in the sequence space without direction as mutations are introduced at random. Large networks of neutral or near-neutral mutations on a fitness landscape, especially for sufficiently long genomes, are possible or even inevitable. Their detection in experiments, however, has been elusive. Although a few near-neutral evolutionary pathways have been found, recent experimental evidence indicates landscapes consist of largely isolated islands. The generality of these results, however, is not clear, as the genome length or the fraction of functional molecules in the genotypic space might have been insufficient for the emergence of large, neutral networks. Thorough investigation of the structure of the fitness landscape is essential to understand the mechanisms of evolution of early genomes. RNA molecules are commonly assumed to play the pivotal role in the origin of genetic systems. They are widely believed to be early, if not the earliest, genetic and catalytic molecules, with abundant biochemical activities as aptamers and ribozymes, i.e. RNA molecules capable, respectively, of binding small molecules or catalyzing chemical reactions. Here, we present results of our recent studies on the structure of the sequence space of RNA ligase ribozymes selected through in vitro evolution. Several hundred thousand sequences, active to different degrees, were obtained by way of deep sequencing. Analysis of these sequences revealed several large clusters defined such that every sequence in a cluster can be reached from any other sequence in the same cluster through a series of single point mutations. Sequences in a single cluster appear to adopt more than one secondary structure. The mechanism of refolding within a single cluster was examined. To shed light on possible evolutionary paths in the space of ribozymes, the connectivity between clusters was investigated. The effect of length of RNA molecules on the structure of the fitness landscape and possible evolutionary paths was examined by way of comparing functional sequences of 20 and 80 nucleobases in length. It was found that sequences of different lengths shared secondary structure motifs that were presumed responsible for catalytic activity, with increasing complexity and global structural rearrangements emerging in longer molecules.
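The cluster definition used in this abstract (every sequence reachable from every other through a chain of single point mutations) corresponds to connected components of a graph whose edges join sequences at Hamming distance 1. The following is a minimal sketch under stated assumptions, not the authors' pipeline: it treats only substitutions as point mutations, and the function name `single_mutation_clusters` is hypothetical.

```python
from collections import defaultdict


def single_mutation_clusters(sequences):
    """Group sequences into clusters linked by chains of single substitutions.

    Illustrative sketch: insertions and deletions are not considered, so
    sequences of different lengths never share a cluster.
    """
    seqs = list(dict.fromkeys(sequences))  # drop duplicates, keep order
    parent = {s: s for s in seqs}

    def find(s):
        # Union-find lookup with path halving.
        while parent[s] != s:
            parent[s] = parent[parent[s]]
            s = parent[s]
        return s

    def union(a, b):
        ra, rb = find(a), find(b)
        if ra != rb:
            parent[rb] = ra

    # Two distinct sequences differ at exactly one position if and only if
    # they coincide once that position is masked with a wildcard.
    buckets = defaultdict(list)
    for s in seqs:
        for i in range(len(s)):
            buckets[(i, s[:i] + "*" + s[i + 1:])].append(s)
    for members in buckets.values():
        for other in members[1:]:
            union(members[0], other)

    clusters = defaultdict(list)
    for s in seqs:
        clusters[find(s)].append(s)
    # Largest clusters first; ties broken alphabetically for stable output.
    return sorted((sorted(c) for c in clusters.values()),
                  key=lambda c: (-len(c), c))


if __name__ == "__main__":
    # Toy sequences, not data from the study.
    toy = ["AAAA", "AAAC", "AACC", "GGGG", "GGGU"]
    for cluster in single_mutation_clusters(toy):
        print(cluster)
```

Masking one position at a time avoids comparing all pairs of sequences, which matters when the input holds hundreds of thousands of reads.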
Nakagawa, Tatsunori; Ishibashi, Jun-Ichiro; Maruyama, Akihiko; Yamanaka, Toshiro; Morimoto, Yusuke; Kimura, Hiroyuki; Urabe, Tetsuro; Fukui, Manabu
2004-01-01
This study describes the occurrence of unique dissimilatory sulfite reductase (DSR) genes at a depth of 1,380 m from the deep-sea hydrothermal vent field at the Suiyo Seamount, Izu-Bonin Arc, Western Pacific, Japan. The DSR genes were obtained from microbes that grew in a catheter-type in situ growth chamber deployed for 3 days on a vent and from the effluent water of drilled holes at 5°C and natural vent fluids at 7°C. DSR clones SUIYOdsr-A and SUIYOdsr-B were not closely related to cultivated species or environmental clones. Moreover, samples of microbial communities were examined by PCR-denaturing gradient gel electrophoresis (DGGE) analysis of the 16S rRNA gene. The sequence analysis of 16S rRNA gene fragments obtained from the vent catheter after a 3-day incubation revealed the occurrence of bacterial DGGE bands affiliated with the Aquificae and γ- and ɛ-Proteobacteria as well as the occurrence of archaeal phylotypes affiliated with the Thermococcales and of a unique archaeon sequence that clustered with “Nanoarchaeota.” The DGGE bands obtained from drilled holes and natural vent fluids from 7 to 300°C were affiliated with the δ-Proteobacteria, genus Thiomicrospira, and Pelodictyon. The dominant DGGE bands retrieved from the effluent water of casing pipes at 3 and 4°C were closely related to phylotypes obtained from the Arctic Ocean. Our results suggest the presence of microorganisms corresponding to a unique DSR lineage not detected previously from other geothermal environments. PMID:14711668
Kim, Ah Ran; Alam, Md Jobaidul; Yoon, Tae-ho; Lee, Soo Rin; Park, Hyun; Kim, Doo-Nam; An, Doo-Hae; Lee, Jae-Bong; Lee, Chung Il
2016-01-01
Adiponectin (AdipoQ) and its receptors (AdipoRs) are strongly related to growth and development of skeletal muscle, as well as glucose and lipid metabolism in vertebrates. Herein we report the identification of the first full-length cDNA encoding an AdipoR homolog (Liv-AdipoR) from the decapod crustacean Litopenaeus vannamei using a combination of next generation sequencing (NGS) technology and bioinformatics analysis. The full-length Liv-AdipoR (1,245 bp) encoded a protein that exhibited the canonical seven transmembrane domains (7TMs) and the inversed topology that characterize members of the progestin and adipoQ receptor (PAQR) family. Based on the obtained sequence information, only a single orthologous AdipoR gene appears to exist in arthropods, whereas two paralogs, AdipoR1 and AdipoR2, have evolved in vertebrates. Transcriptional analysis suggested that the single Liv-AdipoR gene appears to serve the functions of two mammalian AdipoRs. At 72 h after injection of 50 pmol Liv-AdipoR dsRNA (340 bp) into L. vannamei thoracic muscle and deep abdominal muscle, transcription levels of Liv-AdipoR decreased by 93% and 97%, respectively. This confirmed optimal conditions for RNAi of Liv-AdipoR. Knockdown of Liv-AdipoR resulted in significant changes in the plasma levels of ammonia, 3-methylhistidine, and ornithine, but not plasma glucose, suggesting that Liv-AdipoR is important for maintaining muscle fibers. The chronic effect of Liv-AdipoR dsRNA injection was increased mortality. Transcriptomic analysis showed that 804 contigs were upregulated and 212 contigs were downregulated by the knockdown of Liv-AdipoR in deep abdominal muscle. The significantly upregulated genes were categorized as four main functional groups: RNA-editing and transcriptional regulators, molecular chaperones, metabolic regulators, and channel proteins. PMID:27478708
Identification of Small RNAs in Desulfovibrio vulgaris Hildenborough
DOE Office of Scientific and Technical Information (OSTI.GOV)
Burns, Andrew; Joachimiak, Marcin; Deutschbauer, Adam
2010-05-17
Desulfovibrio vulgaris is an anaerobic sulfate-reducing bacterium capable of facilitating the removal of toxic metals such as uranium from contaminated sites via reduction. As such, it is essential to understand the intricate regulatory cascades involved in how D. vulgaris and its relatives respond to stressors in such sites. One approach is the identification and analysis of small non-coding RNAs (sRNAs), molecules ranging in size from 20-200 nucleotides that predominantly affect gene regulation by binding to complementary mRNA in an anti-sense fashion and therefore provide an immediate regulatory response. To identify sRNAs in D. vulgaris, a bacterium that does not possess an annotated hfq gene, RNA was pooled from stationary and exponential phases, nitrate exposure, and biofilm conditions. The subsequent RNA was size fractionated, modified, and converted to cDNA for high throughput transcriptomic deep sequencing. A computational approach to identify sRNAs via the alignment of seven separate Desulfovibrio genomes was also performed. From the deep sequencing analysis, 2,296 reads between 20 and 250 nt were identified with expression above genome background. Analysis of those reads limited the number of candidates to ~87 intergenic, while ~140 appeared to be antisense to annotated open reading frames (ORFs). Further BLAST analysis of the intergenic candidates and other Desulfovibrio genomes indicated that eight candidates were likely portions of ORFs not previously annotated in the D. vulgaris genome. Comparison of the intergenic and antisense data sets to the bioinformatically predicted candidates resulted in ~54 common candidates. Current approaches using Northern analysis and qRT-PCR are being used to verify expression of the candidates and to further develop the role these sRNAs play in D. vulgaris regulation.
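Two steps described in this abstract, keeping reads of 20 to 250 nt and asking whether a read is antisense to an annotated transcript, can be illustrated with plain string operations. This is a hedged sketch only: the study's actual read-mapping and filtering tools are not named in the abstract, the function names are hypothetical, and exact substring matching stands in for a real aligner.

```python
# Translation table for RNA complementarity (A-U, G-C), both cases.
_COMPLEMENT = str.maketrans("ACGUacgu", "UGCAugca")


def reverse_complement_rna(seq):
    """Return the reverse complement of an RNA sequence."""
    return seq.translate(_COMPLEMENT)[::-1]


def size_select(reads, min_len=20, max_len=250):
    """Keep reads whose length falls in the inclusive size window.

    The 20-250 nt defaults mirror the window quoted in the abstract.
    """
    return [r for r in reads if min_len <= len(r) <= max_len]


def is_antisense_to(read, mrna):
    """True if the read is perfectly complementary to part of the mRNA.

    Simplification for illustration: no mismatches or gaps are allowed.
    """
    return reverse_complement_rna(read) in mrna


if __name__ == "__main__":
    # Toy sequences, not data from the study.
    mrna = "CCAUGCCC"
    print(reverse_complement_rna("AUGC"))
    print(is_antisense_to("GCAU", mrna))
```

A read that passes `is_antisense_to` would base-pair with the transcript, which is the property the abstract attributes to antisense sRNA candidates.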
Antisense Transcription Is Pervasive but Rarely Conserved in Enteric Bacteria
Raghavan, Rahul; Sloan, Daniel B.; Ochman, Howard
2012-01-01
Noncoding RNAs, including antisense RNAs (asRNAs) that originate from the complementary strand of protein-coding genes, are involved in the regulation of gene expression in all domains of life. Recent application of deep-sequencing technologies has revealed that the transcription of asRNAs occurs genome-wide in bacteria. Although the role of the vast majority of asRNAs remains unknown, it is often assumed that their presence implies important regulatory functions, similar to those of other noncoding RNAs. Alternatively, many antisense transcripts may be produced by chance transcription events from promoter-like sequences that result from the degenerate nature of bacterial transcription factor binding sites. To investigate the biological relevance of antisense transcripts, we compared genome-wide patterns of asRNA expression in closely related enteric bacteria, Escherichia coli and Salmonella enterica serovar Typhimurium, by performing strand-specific transcriptome sequencing. Although antisense transcripts are abundant in both species, less than 3% of asRNAs are expressed at high levels in both species, and only about 14% appear to be conserved among species. And unlike the promoters of protein-coding genes, asRNA promoters show no evidence of sequence conservation between, or even within, species. Our findings suggest that many or even most bacterial asRNAs are nonadaptive by-products of the cell’s transcription machinery. PMID:22872780
Madrigal, Pedro
2017-03-01
Computational evaluation of variability across DNA or RNA sequencing datasets is a crucial step in genomic science, as it allows both evaluation of the reproducibility of biological or technical replicates and comparison of different datasets to identify their potential correlations. Here we present fCCAC, an application of functional canonical correlation analysis to assess covariance of nucleic acid sequencing datasets such as chromatin immunoprecipitation followed by deep sequencing (ChIP-seq). We show how this method differs from other measures of correlation, and exemplify how it can reveal shared covariance between histone modifications and DNA binding proteins, such as the relationship between the H3K4me3 chromatin mark and its epigenetic writers and readers. An R/Bioconductor package is available at http://bioconductor.org/packages/fCCAC/. Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press.
Chen, Hui; Adam Arsovski, Andrej; Yu, Kangfu; Wang, Aiming
2017-04-01
Rsv1, a single dominant resistance locus in soybean, confers extreme resistance to the majority of Soybean mosaic virus (SMV) strains, but is susceptible to the G7 strain. In Rsv1-genotype soybean, G7 infection provokes a lethal systemic hypersensitive response (LSHR), a delayed host defence response. The Rsv1-mediated LSHR signalling pathway remains largely unknown. In this study, we employed a genome-wide investigation to gain an insight into the molecular interplay between SMV G7 and Rsv1-genotype soybean. Small RNA (sRNA), degradome and transcriptome sequencing analyses were used to identify differentially expressed genes (DEGs) and microRNAs (DEMs) in response to G7 infection. A number of DEGs, DEMs and microRNA targets, and the interaction network of DEMs and their target mRNAs responsive to G7 infection, were identified. Knock-down of one of the identified DEGs, the eukaryotic translation initiation factor 5A (eIF5A), diminished the LSHR and enhanced viral accumulation, suggesting the essential role of eIF5A in the G7-induced, Rsv1-mediated LSHR signalling pathway. This work provides an in-depth genome-wide analysis of high-throughput sequencing data, and identifies multiple genes and microRNA signatures that are associated with the Rsv1-mediated LSHR. © 2016 HER MAJESTY THE QUEEN IN RIGHT OF CANADA MOLECULAR PLANT PATHOLOGY © 2016 BSPP AND JOHN WILEY & SONS LTD.
Barrera-Figueroa, Blanca E; Gao, Lei; Wu, Zhigang; Zhou, Xuefeng; Zhu, Jianhua; Jin, Hailing; Liu, Renyi; Zhu, Jian-Kang
2012-08-03
MicroRNAs (miRNAs) are small RNA molecules that play important regulatory roles in plant development and stress responses. Identification of stress-regulated miRNAs is crucial for understanding how plants respond to environmental stimuli. Abiotic stresses are one of the major factors that limit crop growth and yield. Whereas abiotic stress-regulated miRNAs have been identified in vegetative tissues in several plants, they are not well studied in reproductive tissues such as inflorescences. We used Illumina deep sequencing technology to sequence four small RNA libraries that were constructed from the inflorescences of rice plants that were grown under control condition and drought, cold, or salt stress. We identified 227 miRNAs that belong to 127 families, including 70 miRNAs that are not present in the miRBase. We validated 62 miRNAs (including 10 novel miRNAs) using published small RNA expression data in DCL1, DCL3, and RDR2 RNAi lines and confirmed 210 targets from 86 miRNAs using published degradome data. By comparing the expression levels of miRNAs, we identified 18, 15, and 10 miRNAs that were regulated by drought, cold and salt stress conditions, respectively. In addition, we identified 80 candidate miRNAs that originated from transposable elements or repeats, especially miniature inverted-repeat elements (MITEs). We discovered novel miRNAs and stress-regulated miRNAs that may play critical roles in stress response in rice inflorescences. Transposable elements or repeats, especially MITEs, are rich sources for miRNA origination.
Suzuki, Harukazu; Forrest, Alistair R R; van Nimwegen, Erik; Daub, Carsten O; Balwierz, Piotr J; Irvine, Katharine M; Lassmann, Timo; Ravasi, Timothy; Hasegawa, Yuki; de Hoon, Michiel J L; Katayama, Shintaro; Schroder, Kate; Carninci, Piero; Tomaru, Yasuhiro; Kanamori-Katayama, Mutsumi; Kubosaki, Atsutaka; Akalin, Altuna; Ando, Yoshinari; Arner, Erik; Asada, Maki; Asahara, Hiroshi; Bailey, Timothy; Bajic, Vladimir B; Bauer, Denis; Beckhouse, Anthony G; Bertin, Nicolas; Björkegren, Johan; Brombacher, Frank; Bulger, Erika; Chalk, Alistair M; Chiba, Joe; Cloonan, Nicole; Dawe, Adam; Dostie, Josee; Engström, Pär G; Essack, Magbubah; Faulkner, Geoffrey J; Fink, J Lynn; Fredman, David; Fujimori, Ko; Furuno, Masaaki; Gojobori, Takashi; Gough, Julian; Grimmond, Sean M; Gustafsson, Mika; Hashimoto, Megumi; Hashimoto, Takehiro; Hatakeyama, Mariko; Heinzel, Susanne; Hide, Winston; Hofmann, Oliver; Hörnquist, Michael; Huminiecki, Lukasz; Ikeo, Kazuho; Imamoto, Naoko; Inoue, Satoshi; Inoue, Yusuke; Ishihara, Ryoko; Iwayanagi, Takao; Jacobsen, Anders; Kaur, Mandeep; Kawaji, Hideya; Kerr, Markus C; Kimura, Ryuichiro; Kimura, Syuhei; Kimura, Yasumasa; Kitano, Hiroaki; Koga, Hisashi; Kojima, Toshio; Kondo, Shinji; Konno, Takeshi; Krogh, Anders; Kruger, Adele; Kumar, Ajit; Lenhard, Boris; Lennartsson, Andreas; Lindow, Morten; Lizio, Marina; Macpherson, Cameron; Maeda, Norihiro; Maher, Christopher A; Maqungo, Monique; Mar, Jessica; Matigian, Nicholas A; Matsuda, Hideo; Mattick, John S; Meier, Stuart; Miyamoto, Sei; Miyamoto-Sato, Etsuko; Nakabayashi, Kazuhiko; Nakachi, Yutaka; Nakano, Mika; Nygaard, Sanne; Okayama, Toshitsugu; Okazaki, Yasushi; Okuda-Yabukami, Haruka; Orlando, Valerio; Otomo, Jun; Pachkov, Mikhail; Petrovsky, Nikolai; Plessy, Charles; Quackenbush, John; Radovanovic, Aleksandar; Rehli, Michael; Saito, Rintaro; Sandelin, Albin; Schmeier, Sebastian; Schönbach, Christian; Schwartz, Ariel S; Semple, Colin A; Sera, Miho; Severin, Jessica; Shirahige, Katsuhiko; Simons, Cas; 
St Laurent, George; Suzuki, Masanori; Suzuki, Takahiro; Sweet, Matthew J; Taft, Ryan J; Takeda, Shizu; Takenaka, Yoichi; Tan, Kai; Taylor, Martin S; Teasdale, Rohan D; Tegnér, Jesper; Teichmann, Sarah; Valen, Eivind; Wahlestedt, Claes; Waki, Kazunori; Waterhouse, Andrew; Wells, Christine A; Winther, Ole; Wu, Linda; Yamaguchi, Kazumi; Yanagawa, Hiroshi; Yasuda, Jun; Zavolan, Mihaela; Hume, David A; Arakawa, Takahiro; Fukuda, Shiro; Imamura, Kengo; Kai, Chikatoshi; Kaiho, Ai; Kawashima, Tsugumi; Kawazu, Chika; Kitazume, Yayoi; Kojima, Miki; Miura, Hisashi; Murakami, Kayoko; Murata, Mitsuyoshi; Ninomiya, Noriko; Nishiyori, Hiromi; Noma, Shohei; Ogawa, Chihiro; Sano, Takuma; Simon, Christophe; Tagami, Michihira; Takahashi, Yukari; Kawai, Jun; Hayashizaki, Yoshihide
2009-05-01
Using deep sequencing (deepCAGE), the FANTOM4 study measured the genome-wide dynamics of transcription-start-site usage in the human monocytic cell line THP-1 throughout a time course of growth arrest and differentiation. Modeling the expression dynamics in terms of predicted cis-regulatory sites, we identified the key transcription regulators, their time-dependent activities and target genes. Systematic siRNA knockdown of 52 transcription factors confirmed the roles of individual factors in the regulatory network. Our results indicate that cellular states are constrained by complex networks involving both positive and negative regulatory interactions among substantial numbers of transcription factors and that no single transcription factor is both necessary and sufficient to drive the differentiation process.
Saldaña-Meyer, Ricardo; González-Buendía, Edgar; Guerrero, Georgina; Narendra, Varun; Bonasio, Roberto; Recillas-Targa, Félix; Reinberg, Danny
2014-01-01
The multifunctional CCCTC-binding factor (CTCF) protein exhibits a broad range of functions, including that of insulator and higher-order chromatin organizer. We found that CTCF comprises a previously unrecognized region that is necessary and sufficient to bind RNA (RNA-binding region [RBR]) and is distinct from its DNA-binding domain. Depletion of cellular CTCF led to a decrease in not only levels of p53 mRNA, as expected, but also those of Wrap53 RNA, an antisense transcript originated from the p53 locus. PAR-CLIP-seq (photoactivatable ribonucleoside-enhanced cross-linking and immunoprecipitation [PAR-CLIP] combined with deep sequencing) analyses indicate that CTCF binds a multitude of transcripts genome-wide as well as to Wrap53 RNA. Apart from its established role at the p53 promoter, CTCF regulates p53 expression through its physical interaction with Wrap53 RNA. Cells harboring a CTCF mutant in its RBR exhibit a defective p53 response to DNA damage. Moreover, the RBR facilitates CTCF multimerization in an RNA-dependent manner, which may bear directly on its role in establishing higher-order chromatin structures in vivo. PMID:24696455
MicroRNA-944 Affects Cell Growth by Targeting EPHA7 in Non-Small Cell Lung Cancer.
Liu, Minxia; Zhou, Kecheng; Cao, Yi
2016-09-26
MicroRNAs (miRNAs) have critical roles in lung tumorigenesis and development. To determine aberrantly expressed miRNAs involved in non-small cell lung cancer (NSCLC) and investigate their pathophysiological functions and mechanisms, we first carried out small RNA deep sequencing in NSCLC cell lines (EPLC-32M1, A549 and 801D) and a human immortalized cell line, 16HBE. We then studied miRNA function by assessing cell proliferation and apoptosis. cDNA microarray, luciferase reporter assay and miRNA transfection were used to investigate the interaction between the miRNA and its target gene. miR-944 was significantly down-regulated in NSCLC and had many putative targets. Moreover, the forced expression of miR-944 significantly inhibited the proliferation of NSCLC cells in vitro. By integrating mRNA expression data and miR-944-target prediction, we disclosed that EPHA7 was a potential target of miR-944, which was further verified by luciferase reporter assay and microRNA transfection. Our data indicated that miR-944 targets EPHA7 in NSCLC and regulates NSCLC cell proliferation, which may offer a new mechanism underlying the development and progression of NSCLC.
Glasner, Heidelinde; Riml, Christian; Micura, Ronald; Breuker, Kathrin
2017-07-27
Nucleobase methylations are ubiquitous posttranscriptional modifications of ribonucleic acids (RNA) that can substantially increase the structural diversity of RNA in a highly dynamic fashion with implications for gene expression and human disease. However, high throughput, deep sequencing does not generally provide information on posttranscriptional modifications (PTMs). A promising alternative approach for the characterization of PTMs, i.e. their identification, localization, and relative quantitation, is top-down mass spectrometry (MS). In this study, we have investigated how specific nucleobase methylations affect RNA ionization in electrospray ionization (ESI), and backbone cleavage in collisionally activated dissociation (CAD) and electron detachment dissociation (EDD). For this purpose, we have developed two new approaches for the characterization of RNA methylations in mixtures of either isomers of RNA or nonisomeric RNA forms. Fragment ions from dissociation experiments were analyzed to identify the modification type, to localize the modification sites, and to reveal the site-specific, relative extent of modification for each site. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.
Genome-wide discovery of novel and conserved microRNAs in white shrimp (Litopenaeus vannamei).
Xi, Qian-Yun; Xiong, Yuan-Yan; Wang, Yuan-Mei; Cheng, Xiao; Qi, Qi-En; Shu, Gang; Wang, Song-Bo; Wang, Li-Na; Gao, Ping; Zhu, Xiao-Tong; Jiang, Qing-Yan; Zhang, Yong-Liang; Liu, Li
2015-01-01
In recent years, a large number of conserved and species-specific microRNAs (miRNAs) have been identified in species that are economically important but lack a full genome sequence. In this study, Solexa deep sequencing and cross-species miRNA microarray were used to detect miRNAs in white shrimp. We identified 239 conserved miRNAs, 14 miRNA* sequences and 20 novel miRNAs by bioinformatics analysis from 7,561,406 high-quality reads representing 325,370 distinct sequences. All 20 novel miRNAs were species-specific in white shrimp, with no homologs in other species. Using the conserved miRNAs from the miRBase database as a query set to search for homologs from shrimp expressed sequence tags (ESTs), 32 conserved computationally predicted miRNAs were discovered in shrimp. In addition, using microarray analysis in shrimp fed with Panax ginseng polysaccharide complex, 151 conserved miRNAs were identified, 18 of which were significantly up-regulated, while 49 were significantly down-regulated. qRT-PCR analysis was also performed for nine miRNAs in three shrimp tissues: muscle, gill and hepatopancreas. The results showed that the expression of these miRNAs is tissue specific. Combining the results of the three methods, we detected 20 novel and 394 conserved miRNAs. Verification with quantitative reverse transcription PCR (qRT-PCR) and Northern blot showed that the data are highly reliable. The study provides the first comprehensive specific miRNA profile of white shrimp, which includes useful information for future investigations into the function of miRNAs in regulation of shrimp development and immunology.
Senatore, Adriano; Edirisinghe, Neranjan; Katz, Paul S.
2015-01-01
Background The sea slug Tritonia diomedea (Mollusca, Gastropoda, Nudibranchia) has a simple and highly accessible nervous system, making it useful for studying neuronal and synaptic mechanisms underlying behavior. Although many important contributions have been made using Tritonia, until now, a lack of genetic information has impeded exploration at the molecular level. Results We performed Illumina sequencing of central nervous system mRNAs from Tritonia, generating 133.1 million 100 base pair, paired-end reads. De novo reconstruction of the RNA-Seq data yielded a total of 185,546 contigs, which partitioned into 123,154 non-redundant gene clusters (unigenes). BLAST comparison with RefSeq and Swiss-Prot protein databases, as well as mRNA data from other invertebrates (gastropod molluscs: Aplysia californica, Lymnaea stagnalis and Biomphalaria glabrata; cnidarian: Nematostella vectensis) revealed that up to 76,292 unigenes in the Tritonia transcriptome have putative homologues in other databases, 18,246 of which are below a more stringent E-value cut-off of 1 × 10^-6. In silico prediction of secreted proteins from the Tritonia transcriptome shotgun assembly (TSA) produced a database of 579 unique sequences of secreted proteins, which also exhibited markedly higher expression levels compared to other genes in the TSA. Conclusions Our efforts greatly expand the gene sequences available for Tritonia diomedea. We were able to extract full length protein sequences for most queried genes, including those involved in electrical excitability, synaptic vesicle release and neurotransmission, thus confirming that the transcriptome will serve as a useful tool for probing the molecular correlates of behavior in this species. We also generated a neurosecretome database that will serve as a useful tool for probing peptidergic signalling systems in the Tritonia brain. PMID:25719197
Kondo, Hideki; Hisano, Sakae; Chiba, Sotaro; Maruyama, Kazuyuki; Andika, Ida Bagus; Toyoda, Kazuhiro; Fujimori, Fumihiro; Suzuki, Nobuhiro
2016-07-02
The identification of mycoviruses contributes greatly to understanding of the diversity and evolutionary aspects of viruses. Powdery mildew fungi are important and widely studied obligate phytopathogenic agents, but there has been no report on mycoviruses infecting these fungi. In this study, we used a deep sequencing approach to analyze the double-stranded RNA (dsRNA) segments isolated from field-collected samples of powdery mildew fungus-infected red clover plants in Japan. Database searches identified the presence of at least ten totivirus (genus Totivirus)-like sequences, termed red clover powdery mildew-associated totiviruses (RPaTVs). The majority of these sequences shared moderate amino acid sequence identity with each other (<44%) and with other known totiviruses (<59%). Nine of these identified sequences (RPaTV1a, 1b and 2-8) resembled the genome of the prototype totivirus, Saccharomyces cerevisiae virus-L-A (ScV-L-A) in that they contained two overlapping open reading frames (ORFs) encoding a putative coat protein (CP) and an RNA dependent RNA polymerase (RdRp), while one sequence (RPaTV9) showed similarity to another totivirus, Ustilago maydis virus H1 (UmV-H1) that encodes a single polyprotein (CP-RdRp fusion). Similar to yeast totiviruses, each ScV-L-A-like RPaTV contains a -1 ribosomal frameshift site downstream of a predicted pseudoknot structure in the overlapping region of these ORFs, suggesting that the RdRp is translated as a CP-RdRp fusion. Moreover, several ScV-L-A-like sequences were also found by searches of the transcriptome shotgun assembly (TSA) libraries from rust fungi, plants and insects. Phylogenetic analyses show that nine ScV-L-A-like RPaTVs along with ScV-L-A-like sequences derived from TSA libraries are clustered with most established members of the genus Totivirus, while one RPaTV forms a new distinct clade with UmV-H1, possibly establishing an additional genus in the family. 
Taken together, our results indicate the presence of diverse, novel totiviruses in the powdery mildew fungus populations infecting red clover plants in the field. Copyright © 2015 Elsevier B.V. All rights reserved.
Characterization of viral siRNA populations in honey bee colony collapse disorder.
Chejanovsky, Nor; Ophir, Ron; Schwager, Michal Sharabi; Slabezki, Yossi; Grossman, Smadar; Cox-Foster, Diana
2014-04-01
Colony Collapse Disorder (CCD), a special case of collapse of honey bee colonies, has resulted in significant losses for beekeepers. CCD-colonies show an abundance of pathogens, which suggests that they have a weakened immune system. Since honey bee viruses are major players in colony collapse, and given the important role of RNA interference (RNAi) in combating viral infections, we investigated whether CCD-colonies elicit an RNAi response. Deep-sequencing analysis of samples from CCD-colonies from the US and Israel revealed abundant small interfering RNAs (siRNA) of 21-22 nucleotides perfectly matching the Israeli acute paralysis virus (IAPV), Kashmir virus and Deformed wing virus genomes. Israeli colonies showed high titers of IAPV and a conserved RNAi pattern of matching to the viral genome. The same pattern was observed in samples from colonies experimentally infected with IAPV. Our results suggest that CCD-colonies mount an siRNA response that is specific against the predominant viruses associated with colony losses. Copyright © 2014 Elsevier Inc. All rights reserved.
SARS-CoV-Encoded Small RNAs Contribute to Infection-Associated Lung Pathology.
Morales, Lucía; Oliveros, Juan Carlos; Fernandez-Delgado, Raúl; tenOever, Benjamin Robert; Enjuanes, Luis; Sola, Isabel
2017-03-08
Severe acute respiratory syndrome coronavirus (SARS-CoV) causes lethal disease in humans, which is characterized by exacerbated inflammatory response and extensive lung pathology. To address the relevance of small non-coding RNAs in SARS-CoV pathology, we deep sequenced RNAs from the lungs of infected mice and discovered three 18-22 nt small viral RNAs (svRNAs). The three svRNAs were derived from the nsp3 (svRNA-nsp3.1 and -nsp3.2) and N (svRNA-N) genomic regions of SARS-CoV. Biogenesis of CoV svRNAs was RNase III, cell type, and host species independent, but it was dependent on the extent of viral replication. Antagomir-mediated inhibition of svRNA-N significantly reduced in vivo lung pathology and pro-inflammatory cytokine expression. Taken together, these data indicate that svRNAs contribute to SARS-CoV pathogenesis and highlight the potential of svRNA-N antagomirs as antivirals. Copyright © 2017 Elsevier Inc. All rights reserved.
Transcription start site associated RNAs (TSSaRNAs) are ubiquitous in all domains of life.
Zaramela, Livia S; Vêncio, Ricardo Z N; ten-Caten, Felipe; Baliga, Nitin S; Koide, Tie
2014-01-01
A plethora of non-coding RNAs has been discovered using high-resolution transcriptomics tools, indicating that transcriptional and post-transcriptional regulation is much more complex than previously appreciated. Small RNAs associated with transcription start sites of annotated coding regions (TSSaRNAs) are pervasive in both eukaryotes and bacteria. Here, we provide evidence for existence of TSSaRNAs in several archaeal transcriptomes including: Halobacterium salinarum, Pyrococcus furiosus, Methanococcus maripaludis, and Sulfolobus solfataricus. We validated TSSaRNAs from the model archaeon Halobacterium salinarum NRC-1 by deep sequencing two independent small-RNA enriched (RNA-seq) and a primary-transcript enriched (dRNA-seq) strand-specific libraries. We identified 652 transcripts, of which 179 were shown to be primary transcripts (∼7% of the annotated genome). Distinct growth-associated expression patterns between TSSaRNAs and their cognate genes were observed, indicating a possible role in environmental responses that may result from RNA polymerase with varying pausing rhythms. This work shows that TSSaRNAs are ubiquitous across all domains of life.
Zhang, Dingxiao; Park, Daechan; Zhong, Yi; Lu, Yue; Rycaj, Kiera; Gong, Shuai; Chen, Xin; Liu, Xin; Chao, Hsueh-Ping; Whitney, Pamela; Calhoun-Davis, Tammy; Takata, Yoko; Shen, Jianjun; Iyer, Vishwanath R.; Tang, Dean G.
2016-01-01
The prostate gland mainly contains basal and luminal cells constructed as a pseudostratified epithelium. Annotation of prostate epithelial transcriptomes provides a foundation for discoveries that can impact disease understanding and treatment. Here we describe a genome-wide transcriptome analysis of human benign prostatic basal and luminal epithelial populations using deep RNA sequencing. Through molecular and biological characterizations, we show that the differential gene-expression profiles account for their distinct functional properties. Strikingly, basal cells preferentially express gene categories associated with stem cells, neurogenesis and ribosomal RNA (rRNA) biogenesis. Consistent with this profile, basal cells functionally exhibit intrinsic stem-like and neurogenic properties with enhanced rRNA transcription activity. Of clinical relevance, the basal cell gene-expression profile is enriched in advanced, anaplastic, castration-resistant and metastatic prostate cancers. Therefore, we link the cell-type-specific gene signatures to aggressive subtypes of prostate cancer and identify gene signatures associated with adverse clinical features. PMID:26924072
MicroRNA-based biotechnology for plant improvement.
Zhang, Baohong; Wang, Qinglian
2015-01-01
MicroRNAs (miRNAs) are an extensive class of newly discovered endogenous small RNAs, which negatively regulate gene expression at the post-transcriptional level. With the application of next-generation deep sequencing and advanced bioinformatics, miRNA-related research has been extended to non-model plant species, and the number of identified miRNAs has increased dramatically in recent years. miRNAs play a critical role in almost all biological and metabolic processes, and provide a unique strategy for plant improvement. Here, we first briefly review the discovery, history, and biogenesis of miRNAs, then focus on the application of miRNAs to plant breeding and on future directions. Increased plant biomass through control of plant development and phase change has been one achievement of miRNA-based biotechnology; plant tolerance to abiotic and biotic stress has also been significantly enhanced by regulating the expression of an individual miRNA. Both endogenous and artificial miRNAs may serve as important tools for plant improvement. © 2014 Wiley Periodicals, Inc.
Canella, Donatella; Bernasconi, David; Gilardi, Federica; Le Martelot, Gwendal; Migliavacca, Eugenia; Praz, Viviane; Cousin, Pascal; Delorenzi, Mauro; Hernandez, Nouria; Deplancke, Bart; Desvergne, Béatrice; Guex, Nicolas; Herr, Winship; Naef, Felix; Rougemont, Jacques; Schibler, Ueli; Andersin, Teemu; Gos, Pascal; Lammers, Fabienne; Raghav, Sunil; Fabbretti, Roberto; Fortier, Arnaud; Long, Li; Vlegel, Volker; Xenarios, Ioannis; David, Fabrice; Jarosz, Yohan; Kuznetsov, Dmitry; Liechti, Robin; Martin, Olivier; Ross, Frederick; Sinclair, Lucas; Cajan, Julia; Krier, Irina; Leleu, Marion; Molina, Nacho; Naldi, Aurélien; Rey, Guillaume; Symul, Laura
2012-01-01
The genomic loci occupied by RNA polymerase (RNAP) III have been characterized in human culture cells by genome-wide chromatin immunoprecipitations, followed by deep sequencing (ChIP-seq). These studies have shown that only ∼40% of the annotated 622 human tRNA genes and pseudogenes are occupied by RNAP-III, and that these genes are often in open chromatin regions rich in active RNAP-II transcription units. We have used ChIP-seq to characterize RNAP-III-occupied loci in a differentiated tissue, the mouse liver. Our studies define the mouse liver RNAP-III-occupied loci including a conserved mammalian interspersed repeat (MIR) as a potential regulator of an RNAP-III subunit-encoding gene. They reveal that synteny relationships can be established between a number of human and mouse RNAP-III genes, and that the expression levels of these genes are significantly linked. They establish that variations within the A and B promoter boxes, as well as the strength of the terminator sequence, can strongly affect RNAP-III occupancy of tRNA genes. They reveal correlations with various genomic features that explain the observed variation of 81% of tRNA scores. In mouse liver, loci represented in the NCBI37/mm9 genome assembly that are clearly occupied by RNAP-III comprise 50 Rn5s (5S RNA) genes, 14 known non-tRNA RNAP-III genes, nine Rn4.5s (4.5S RNA) genes, and 29 SINEs. Moreover, out of the 433 annotated tRNA genes, half are occupied by RNAP-III. Transfer RNA gene expression levels reflect both an underlying genomic organization conserved in dividing human culture cells and resting mouse liver cells, and the particular promoter and terminator strengths of individual genes. PMID:22287103
Genome-wide assessment of differential translations with ribosome profiling data.
Xiao, Zhengtao; Zou, Qin; Liu, Yu; Yang, Xuerui
2016-04-04
The closely regulated process of mRNA translation is crucial for precise control of protein abundance and quality. Ribosome profiling, a combination of ribosome foot-printing and RNA deep sequencing, has been used in a large variety of studies to quantify genome-wide mRNA translation. Here, we developed Xtail, an analysis pipeline tailored for ribosome profiling data that comprehensively and accurately identifies differentially translated genes in pairwise comparisons. Applied to simulated and real datasets, Xtail exhibits high sensitivity with minimal false-positive rates, outperforming existing methods in the accuracy of quantifying differential translations. With published ribosome profiling datasets, Xtail not only reveals differentially translated genes that make biological sense, but also uncovers new events of differential translation in human cancer cells on mTOR signalling perturbation and in human primary macrophages on interferon gamma (IFN-γ) treatment. This demonstrates the value of Xtail in providing novel insights into the molecular mechanisms that involve translational dysregulations.
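As a rough illustration of what "differential translation" measures, the sketch below computes the log2 change in translational efficiency (ribosome footprint counts divided by mRNA counts) between two conditions. This is not Xtail's statistical model, which the abstract does not describe; the function name, the pseudocount, and the example counts are assumptions made for illustration only.

```python
from math import log2


def log2_te_change(rpf_a, mrna_a, rpf_b, mrna_b, pseudocount=1.0):
    """Return log2(TE_b / TE_a) for one gene.

    TE (translational efficiency) is taken here as ribosome-protected
    fragment (RPF) counts divided by mRNA counts. The pseudocount is a
    hypothetical choice to avoid division by zero for unexpressed genes.
    """
    te_a = (rpf_a + pseudocount) / (mrna_a + pseudocount)
    te_b = (rpf_b + pseudocount) / (mrna_b + pseudocount)
    return log2(te_b / te_a)


# Made-up counts: mRNA level is unchanged, footprint counts roughly double,
# so the gene is translated more efficiently in condition B.
change = log2_te_change(rpf_a=99, mrna_a=99, rpf_b=199, mrna_b=99)
print(f"log2 TE change: {change:.2f}")
```

A positive value means more footprints per transcript in the second condition. A real analysis would also need normalization across libraries and a significance test over replicates, which this sketch omits.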
Poudel, Saroj; Aryal, Niranjan; Lu, Chaofu
Camelina sativa is an annual oilseed crop that is under intensive development for renewable resources of biofuels and industrial oils. MicroRNAs, or miRNAs, are endogenously encoded small RNAs that play key roles in diverse plant biological processes. Here, we conducted deep sequencing on small RNA libraries prepared from camelina leaves, flower buds and two stages of developing seeds corresponding to initial and peak storage products accumulation. Computational analyses identified 207 known miRNAs belonging to 63 families, as well as 5 novel miRNAs. These miRNAs, especially members of the miRNA families, varied greatly in different tissues and developmental stages. The predicted miRNA target genes are involved in a broad range of physiological functions including lipid metabolism. This report is the first step toward elucidating roles of miRNAs in C. sativa and will provide additional tools to improve this oilseed crop for biofuels and biomaterials.
Apple miRNAs and tasiRNAs with novel regulatory networks
2012-01-01
Background MicroRNAs (miRNAs) and their regulatory functions have been extensively characterized in model species but whether apple has evolved similar or unique regulatory features remains unknown. Results We performed deep small RNA-seq and identified 23 conserved, 10 less-conserved and 42 apple-specific miRNAs or families with distinct expression patterns. The identified miRNAs target 118 genes representing a wide range of enzymatic and regulatory activities. Apple also conserves two TAS gene families with similar but unique trans-acting small interfering RNA (tasiRNA) biogenesis profiles and target specificities. Importantly, we found that miR159, miR828 and miR858 can collectively target up to 81 MYB genes potentially involved in diverse aspects of plant growth and development. These miRNA target sites are differentially conserved among MYBs, which is largely influenced by the location and conservation of the encoded amino acid residues in MYB factors. Finally, we found that 10 of the 19 miR828-targeted MYBs undergo small interfering RNA (siRNA) biogenesis at the 3' cleaved, highly divergent transcript regions, generating over 100 sequence-distinct siRNAs that potentially target over 70 diverse genes as confirmed by degradome analysis. Conclusions Our work identified and characterized apple miRNAs, their expression patterns, targets and regulatory functions. We also discovered that three miRNAs and the ensuing siRNAs exploit both conserved and divergent sequence features of MYB genes to initiate distinct regulatory networks targeting a multitude of genes inside and outside the MYB family. PMID:22704043
Jenkins, Adam M; Waterhouse, Robert M; Muskavitch, Marc A T
2015-04-23
Long non-coding RNAs (lncRNAs) have been defined as mRNA-like transcripts longer than 200 nucleotides that lack significant protein-coding potential, and many of them constitute scaffolds for ribonucleoprotein complexes with critical roles in epigenetic regulation. Various lncRNAs have been implicated in the modulation of chromatin structure, transcriptional and post-transcriptional gene regulation, and regulation of genomic stability in mammals, Caenorhabditis elegans, and Drosophila melanogaster. The purpose of this study is to identify the lncRNA landscape in the malaria vector An. gambiae and assess the evolutionary conservation of lncRNAs and their secondary structures across the Anopheles genus. Using deep RNA sequencing of multiple Anopheles gambiae life stages, we have identified 2,949 lncRNAs and more than 300 previously unannotated putative protein-coding genes. The lncRNAs exhibit differential expression profiles across life stages and adult genders. We find that across the genus Anopheles, lncRNAs display much lower sequence conservation than protein-coding genes. Additionally, we find that lncRNA secondary structure is highly conserved within the Gambiae complex, but diverges rapidly across the rest of the genus Anopheles. This study offers one of the first lncRNA secondary structure analyses in vector insects. Our description of lncRNAs in An. gambiae offers the most comprehensive genome-wide insights to date into lncRNAs in this vector mosquito, and defines a set of potential targets for the development of vector-based interventions that may further curb the human malaria burden in disease-endemic countries.
Genomic approaches for the elucidation of genes and gene networks underlying cardiovascular traits.
Adriaens, M E; Bezzina, C R
2018-06-22
Genome-wide association studies have shed light on the association between natural genetic variation and cardiovascular traits. However, linking a locus associated with a cardiovascular trait to a candidate gene or set of candidate genes for prioritization in follow-up mechanistic studies is anything but straightforward. Genomic technologies based on next-generation sequencing now offer multiple opportunities to dissect the gene regulatory networks underlying genetic cardiovascular trait associations, thereby aiding in the identification of candidate genes at unprecedented scale. RNA sequencing in particular becomes a powerful tool when combined with genotyping to identify loci that modulate transcript abundance, known as expression quantitative trait loci (eQTL), or loci that modulate transcript splicing, known as splicing quantitative trait loci (sQTL). Additionally, the allele-specific resolution of RNA-sequencing technology enables estimation of allelic imbalance, a state where the two alleles of a gene are expressed at a ratio differing from the expected 1:1 ratio. When multiple high-throughput approaches are combined with deep phenotyping in a single study, a comprehensive elucidation of the relationship between genotype and phenotype comes into view, an approach known as systems genetics. In this review, we cover key applications of systems genetics in the broad cardiovascular field.
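The notion of allelic imbalance described above can be made concrete with a small sketch. Assuming allele-specific read counts at a single heterozygous site, an exact two-sided binomial test against the expected 1:1 ratio is one simple way to flag imbalance. The review does not prescribe this test; the function names and read counts below are hypothetical, and the pure-Python arithmetic is only intended for modest read depths.

```python
from math import comb


def allelic_ratio(ref_count, alt_count):
    """Fraction of reads that carry the reference allele."""
    return ref_count / (ref_count + alt_count)


def binomial_two_sided_p(ref_count, alt_count, p=0.5):
    """Exact two-sided binomial p-value for deviation from proportion p.

    Sums the probabilities of all outcomes that are no more likely than
    the observed reference-allele count (the usual exact-test definition).
    """
    n = ref_count + alt_count
    if n == 0:
        raise ValueError("at least one read is required")

    def pmf(k):
        return comb(n, k) * p**k * (1 - p) ** (n - k)

    observed = pmf(ref_count)
    # Small relative tolerance so ties are not lost to rounding.
    total = sum(pmf(k) for k in range(n + 1) if pmf(k) <= observed * (1 + 1e-9))
    return min(1.0, total)


# Made-up counts at one heterozygous site: 30 reference reads, 10 alternate.
print(allelic_ratio(30, 10))
print(binomial_two_sided_p(30, 10))
```

In practice, reference-mapping bias and overdispersion between replicates usually call for more elaborate models (for example beta-binomial ones) than this plain binomial sketch.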
Transposable elements in TDP-43-mediated neurodegenerative disorders.
Li, Wanhe; Jin, Ying; Prazak, Lisa; Hammell, Molly; Dubnau, Josh
2012-01-01
Elevated expression of specific transposable elements (TEs) has been observed in several neurodegenerative disorders. TEs also can be active during normal neurogenesis. First, by mining a series of deep sequencing datasets of protein-RNA interactions and of gene expression profiles, we uncovered extensive binding of TE transcripts to TDP-43, an RNA-binding protein central to amyotrophic lateral sclerosis (ALS) and frontotemporal lobar degeneration (FTLD). Second, we found that association between TDP-43 and many of its TE targets is reduced in FTLD patients. Third, we discovered that a large fraction of the TEs to which TDP-43 binds become de-repressed in mouse TDP-43 disease models. We propose the hypothesis that TE mis-regulation contributes to TDP-43 related neurodegenerative diseases.
Oasis: online analysis of small RNA deep sequencing data.
Capece, Vincenzo; Garcia Vizcaino, Julio C; Vidal, Ramon; Rahman, Raza-Ur; Pena Centeno, Tonatiuh; Shomroni, Orr; Suberviola, Irantzu; Fischer, Andre; Bonn, Stefan
2015-07-01
Oasis is a web application that allows for the fast and flexible online analysis of small-RNA-seq (sRNA-seq) data. It was designed for the end user in the lab, providing an easy-to-use web frontend including video tutorials, demo data and best practice step-by-step guidelines on how to analyze sRNA-seq data. Oasis' exclusive selling points are a differential expression module that allows for the multivariate analysis of samples, a classification module for robust biomarker detection and an advanced programming interface that supports the batch submission of jobs. Both modules include the analysis of novel miRNAs, miRNA targets and functional analyses including GO and pathway enrichment. Oasis generates downloadable interactive web reports for easy visualization, exploration and analysis of data on a local system. Finally, Oasis' modular workflow enables the rapid (re-)analysis of data. Oasis is implemented in Python, R, Java, PHP, C++ and JavaScript. It is freely available at http://oasis.dzne.de. Contact: stefan.bonn@dzne.de. Supplementary data are available at Bioinformatics online. © The Author 2015. Published by Oxford University Press.
mirEX: a platform for comparative exploration of plant pri-miRNA expression data.
Bielewicz, Dawid; Dolata, Jakub; Zielezinski, Andrzej; Alaba, Sylwia; Szarzynska, Bogna; Szczesniak, Michal W; Jarmolowski, Artur; Szweykowska-Kulinska, Zofia; Karlowski, Wojciech M
2012-01-01
mirEX is a comprehensive platform for comparative analysis of primary microRNA expression data. RT-qPCR-based gene expression profiles are stored in a universal and expandable database scheme and wrapped by an intuitive user-friendly interface. A new way of accessing gene expression data in mirEX includes a simple mouse operated querying system and dynamic graphs for data mining analyses. In contrast to other publicly available databases, the mirEX interface allows a simultaneous comparison of expression levels between various microRNA genes in diverse organs and developmental stages. Currently, mirEX integrates information about the expression profile of 190 Arabidopsis thaliana pri-miRNAs in seven different developmental stages: seeds, seedlings and various organs of mature plants. Additionally, by providing RNA structural models, publicly available deep sequencing results, experimental procedure details and careful selection of auxiliary data in the form of web links, mirEX can function as a one-stop solution for Arabidopsis microRNA information. A web-based mirEX interface can be accessed at http://bioinfo.amu.edu.pl/mirex.
Kim, Jungeun; Park, June Hyun; Lim, Chan Ju; Lim, Jae Yun; Ryu, Jee-Youn; Lee, Bong-Woo; Choi, Jae-Pil; Kim, Woong Bom; Lee, Ha Yeon; Choi, Yourim; Kim, Donghyun; Hur, Cheol-Goo; Kim, Sukweon; Noh, Yoo-Sun; Shin, Chanseok; Kwon, Suk-Yoon
2012-11-21
Roses (Rosa sp.), which belong to the family Rosaceae, are the most economically important ornamental plants, making up 30% of the floriculture market. However, despite the high demand for roses, rose breeding programs are limited in the molecular resources that could greatly enhance and speed breeding efforts. A better understanding of the genes that contribute to floral development and desired phenotypes will lead to improved rose cultivars. For this study, we analyzed rose miRNAs and the rose flower transcriptome in order to generate a database to expand upon current knowledge regarding regulation of important floral characteristics. A rose genetic database will enable comprehensive analysis of gene expression and regulation via miRNA among different Rosa cultivars. We produced more than 0.5 million reads from expressed sequences, totalling more than 110 million bp. From these, we generated 35,657, 31,434, 34,725, and 39,722 flower unigenes from Rosa hybrid: 'Vital', 'Maroussia', and 'Sympathy' and Rosa rugosa Thunb., respectively. The unigenes were assigned functional annotations, domains, metabolic pathways, Gene Ontology (GO) terms, Plant Ontology (PO) terms, and MIPS Functional Catalogue (FunCat) terms. Rose flower transcripts were compared with genes from whole genome sequences of Rosaceae members (apple, strawberry, and peach) and grape. We also produced approximately 40 million small RNA reads from flower tissue for Rosa, representing 267 unique miRNA tags. Among the identified miRNAs, 25 were novel and 242 were conserved. Statistical analyses of miRNA profiles revealed both shared and species-specific miRNAs, which presumably affect flower development and phenotypes. In this study, we constructed a Rose miRNA and transcriptome database, and we analyzed the miRNAs and transcriptome generated from the flower tissues of four Rosa cultivars. 
The database provides a comprehensive genetic resource which can be used to better understand rose flower development and to identify candidate genes for important phenotypes. PMID:23171001
Mori, Koji; Maruyama, Akihiko; Urabe, Tetsuro; Suzuki, Ken-Ichiro; Hanada, Satoshi
2008-04-01
A novel thermophilic, strictly anaerobic archaeon, designated strain Arc51T, was isolated from a rock sample collected from a deep-sea hydrothermal field in Suiyo Seamount, Izu-Bonin Arc, western Pacific Ocean. Cells of the isolate were irregular cocci with single flagella and exhibited blue-green fluorescence at 436 nm. The optimum temperature, pH and NaCl concentration for growth were 70 degrees C, pH 6.5 and 3 % (w/v), respectively. Strain Arc51T could grow on thiosulfate or sulfite as an electron acceptor in the presence of hydrogen. This strain required acetate as a carbon source for its growth, suggesting that the reductive acetyl CoA pathway for CO2 fixation was incomplete. In addition, coenzyme M (2-mercaptoethanesulfonic acid), which is a known methyl carrier in methanogenesis, was also a requirement for growth of the strain. Analysis of the 16S rRNA gene sequence revealed that the isolate was similar to members of the genus Archaeoglobus, with sequence similarities of 93.6-97.2 %; the closest relative was Archaeoglobus veneficus. Phylogenetic analyses of the dsrAB and apsA genes, encoding the alpha and beta subunits of dissimilatory sulfite reductase and the alpha subunit of adenosine-5'-phosphosulfate reductase, respectively, produced results similar to those inferred from comparisons based on the 16S rRNA gene sequence. On the basis of phenotypic and phylogenetic data, strain Arc51T represents a novel species of the genus Archaeoglobus, for which the name Archaeoglobus infectus sp. nov. is proposed. The type strain is Arc51T (=NBRC 100649T=DSM 18877T).
Itakura, Jun; Kurosaki, Masayuki; Higuchi, Mayu; Takada, Hitomi; Nakakuki, Natsuko; Itakura, Yoshie; Tamaki, Nobuharu; Yasui, Yutaka; Suzuki, Shoko; Tsuchiya, Kaoru; Nakanishi, Hiroyuki; Takahashi, Yuka; Maekawa, Shinya; Enomoto, Nobuyuki; Izumi, Namiki
2015-01-01
The presence of resistance-associated variants (RAVs) of hepatitis C virus (HCV) attenuates the efficacy of direct acting antivirals (DAAs). The objective of this study was to characterize the susceptibility of RAVs to interferon-based therapy. Direct and deep sequencing were performed to detect Y93H RAV in the NS5A region. Twenty-nine genotype 1b patients with detectable RAV at baseline were treated with a combination of simeprevir, pegylated interferon and ribavirin. The longitudinal changes in the proportion of Y93H RAV during therapy and at breakthrough or relapse were determined. By direct sequencing, Y93H RAV became undetectable or decreased in proportion at an early time point during therapy (within 7 days), when HCV RNA was still detectable, in 57% of patients with both the Y93H variant and wild type virus at baseline. By deep sequencing, the proportion of Y93H RAV against Y93 wild type was 52.7% (5.8%-97.4%) at baseline, which significantly decreased to 29.7% (0.16%-98.3%) within 7 days of initiation of treatment (p = 0.023). The proportion of Y93H RAV was reduced in 21 of 29 cases (72.4%) and a marked reduction of more than 10% was observed in 14 cases (48.7%). HCV RNA reduction was significantly greater for Y93H RAV (-3.65±1.3 logIU/mL/day) than the Y93 wild type (-3.35±1.0 logIU/mL/day) (p<0.001). Y93H RAV is more susceptible to interferon-based therapy than the Y93 wild type.
Fan, Huiyan; Sun, Haiwen; Wang, Ying; Zhang, Yongliang; Wang, Xianbing; Li, Dawei; Yu, Jialin; Han, Chenggui
2014-01-01
Beet necrotic yellow vein virus (BNYVV) encodes either four or five plus-sense single stranded RNAs and is the causal agent of sugar beet rhizomania disease, which is widely distributed in most regions of the world. BNYVV can also infect Nicotiana benthamiana systemically, and causes severe curling and stunting symptoms in the presence of RNA4 or mild symptoms in the absence of RNA4. Confocal laser scanning microscopy (CLSM) analyses showed that the RNA4-encoded p31 protein fused to the red fluorescent protein (RFP) accumulated mainly in the nuclei of N. benthamiana epidermal cells. This suggested that severe RNA4-induced symptoms might result from p31-dependent modifications of the transcriptome. Therefore, we used next-generation sequencing technologies to analyze the transcriptome profile of N. benthamiana in response to infection with different isolates of BNYVV. Comparisons of the transcriptomes of mock, BN3 (RNAs 1+2+3), and BN34 (RNAs 1+2+3+4) infected plants identified 3,016 differentially expressed transcripts, which provided a list of candidate genes that are potentially elicited in response to virus infection. Our data indicate that modifications in the expression of genes involved in RNA silencing, ubiquitin-proteasome pathway, cellulose synthesis, and metabolism of the plant hormone gibberellin may contribute to the severe symptoms induced by RNA4 from BNYVV. These results expand our understanding of the genetic architecture of N. benthamiana as well as provide valuable clues to identify genes potentially involved in resistance to BNYVV infection. Our global survey of gene expression changes in infected plants reveals new insights into the complicated molecular mechanisms underlying symptom development, and aids research into new strategies to protect crops against viruses.
Identification and verification of potential piRNAs from domesticated yak testis.
Gong, Jishang; Zhang, Quanwei; Wang, Qi; Ma, Youji; Du, Jiaxiang; Zhang, Yong; Zhao, Xingxu
2018-02-01
PIWI-interacting RNAs (piRNA) are small non-coding RNA molecules expressed in animal germ cells that interact with PIWI family proteins to form RNA-protein complexes involved in epigenetic and post-transcriptional gene silencing of retrotransposons and other genetic elements in germ line cells, including reproductive stem cell self-sustainment, differentiation, meiosis and spermatogenesis. In the present study, we performed high-throughput sequencing of piRNAs in testis samples from yaks in different stages of sexual maturity. Deep sequencing of the small RNAs (18-40 nt in length) yielded 4,900,538 unique reads from a total of 53,035,635 reads. We identified yak small RNAs (18-30 nt) and performed functional characterization. Yak small RNAs showed a bimodal length distribution, with two peaks at 22 nt and >28 nt. More than 80% of the 3,106,033 putative piRNAs were mapped to 4637 piRNA-producing genomic clusters using RPKM. 6388 candidate piRNAs were identified from clean reads and the annotations were compared with the yak reference genome repeat region. Integrated network analysis suggested that some differentially expressed genes were involved in spermatogenesis through ECM-receptor interaction and PI3K-Akt signaling pathways. Our data provide novel insights into the molecular expression and regulation similarities and diversities in spermatogenesis and testicular development in yaks at different stages of sexual maturity. © 2018 The authors.
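The abstract does not describe how the RPKM values behind the cluster mapping were computed. As a minimal sketch, assuming the standard definition (reads per kilobase of region per million mapped reads), the normalization might look like the following. The function name and the example cluster are hypothetical; only the library size of 53,035,635 reads is taken from the abstract, and treating that total as the number of mapped reads is itself an assumption.

```python
def rpkm(read_count: int, region_length_bp: int, total_mapped_reads: int) -> float:
    """Reads per kilobase of region per million mapped reads (standard definition)."""
    if region_length_bp <= 0 or total_mapped_reads <= 0:
        raise ValueError("region length and total mapped reads must be positive")
    # 1e9 = 1e3 (bp per kilobase) * 1e6 (reads per million)
    return read_count * 1e9 / (region_length_bp * total_mapped_reads)


# Library size quoted in the abstract; assumed here to equal the mapped-read total.
TOTAL_READS = 53_035_635

# Hypothetical cluster: 12,000 reads falling in a 5 kb genomic region.
example = rpkm(12_000, 5_000, TOTAL_READS)
print(f"example cluster RPKM: {example:.2f}")
```

Normalizing by both region length and library size is what makes clusters of different sizes, and samples of different sequencing depth, comparable.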
Complex and dynamic landscape of RNA polyadenylation revealed by PAS-Seq
Shepard, Peter J.; Choi, Eun-A; Lu, Jente; Flanagan, Lisa A.; Hertel, Klemens J.; Shi, Yongsheng
2011-01-01
Alternative polyadenylation (APA) of mRNAs has emerged as an important mechanism for post-transcriptional gene regulation in higher eukaryotes. Although microarrays have recently been used to characterize APA globally, they have a number of serious limitations that prevent comprehensive and highly quantitative analysis. To better characterize APA and its regulation, we have developed a deep sequencing-based method called Poly(A) Site Sequencing (PAS-Seq) for quantitatively profiling RNA polyadenylation at the transcriptome level. PAS-Seq not only accurately and comprehensively identifies poly(A) junctions in mRNAs and noncoding RNAs, but also provides quantitative information on the relative abundance of polyadenylated RNAs. PAS-Seq analyses of human and mouse transcriptomes showed that 40%–50% of all expressed genes produce alternatively polyadenylated mRNAs. Furthermore, our study detected evolutionarily conserved polyadenylation of histone mRNAs and revealed novel features of mitochondrial RNA polyadenylation. Finally, PAS-Seq analyses of mouse embryonic stem (ES) cells, neural stem/progenitor (NSP) cells, and neurons not only identified more poly(A) sites than were found in the entire mouse EST database, but also detected significant changes in the global APA profile that lead to lengthening of 3′ untranslated regions (UTR) in many mRNAs during stem cell differentiation. Together, our PAS-Seq analyses revealed a complex landscape of RNA polyadenylation in mammalian cells and the dynamic regulation of APA during stem cell differentiation. PMID:21343387
Joseph, S; Schmidt, L M; Danquah, W B; Timper, P; Mekete, T
2017-02-01
To generate single spore lines from a population of Pasteuria penetrans, a bacterial parasite of root-knot nematode (RKN), isolated from Florida, and to examine the genotypic variation and virulence characteristics that exist within the population. Six single spore lines (SSP), 16SSP, 17SSP, 18SSP, 25SSP, 26SSP and 30SSP were generated. Genetic variability was evaluated by comparing single-nucleotide polymorphisms (SNPs) in six protein-coding genes and the 16S rRNA gene. An average of one SNP was observed for every 69 bp in the 16S rRNA, whereas no SNPs were observed in the protein-coding sequences. Hierarchical cluster analysis of 16S rRNA sequences placed the clones into three distinct clades. Bio-efficacy analysis revealed significant heterogeneity in the level of virulence and host specificity between the individual clones. The SNP markers developed for the 5' hypervariable region of the 16S rRNA gene may be useful in biotype differentiation within a population of P. penetrans. This study demonstrates an efficient method for generating single spore lines of P. penetrans and gives deep insight into the genetic heterogeneity and varying levels of virulence that exist within a population parasitizing a specific Meloidogyne sp. host. The results also suggest that the application of generalist spore lines in nematode management may achieve broad RKN control. © 2016 The Society for Applied Microbiology.
Deep sequencing of evolving pathogen populations: applications, errors, and bioinformatic solutions
2014-01-01
Deep sequencing harnesses the high throughput nature of next generation sequencing technologies to generate population samples, treating information contained in individual reads as meaningful. Here, we review applications of deep sequencing to pathogen evolution. Pioneering deep sequencing studies from the virology literature are discussed, such as whole genome Roche-454 sequencing analyses of the dynamics of the rapidly mutating pathogens hepatitis C virus and HIV. Extension of the deep sequencing approach to bacterial populations is then discussed, including the impacts of emerging sequencing technologies. While it is clear that deep sequencing has unprecedented potential for assessing the genetic structure and evolutionary history of pathogen populations, bioinformatic challenges remain. We summarise current approaches to overcoming these challenges, in particular methods for detecting low frequency variants in the context of sequencing error and reconstructing individual haplotypes from short reads. PMID:24428920
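The review names low-frequency variant detection under sequencing error as a challenge without prescribing a specific method in this abstract. One common idea, sketched here as an assumption rather than as the review's own approach, is to ask how probable the observed number of variant reads would be if every one of them were a sequencing error, using a binomial tail probability. The per-base error rate, the significance threshold, and the function names are all illustrative.

```python
from math import exp, lgamma, log


def log_binom_pmf(i: int, n: int, p: float) -> float:
    """Log of the Binomial(n, p) probability mass at i, computed in log space."""
    return (lgamma(n + 1) - lgamma(i + 1) - lgamma(n - i + 1)
            + i * log(p) + (n - i) * log(1.0 - p))


def binomial_tail(k: int, n: int, p: float) -> float:
    """P(X >= k) for X ~ Binomial(n, p), with 0 < p < 1."""
    return min(1.0, sum(exp(log_binom_pmf(i, n, p)) for i in range(k, n + 1)))


def is_candidate_variant(variant_reads: int, depth: int,
                         error_rate: float = 0.01, alpha: float = 1e-6) -> bool:
    """Flag a site when error alone is an unlikely explanation for the variant reads.

    error_rate and alpha are hypothetical defaults, not values from the review.
    """
    return binomial_tail(variant_reads, depth, error_rate) < alpha


# At 1000x depth and a 1% error rate, about 10 erroneous reads are expected.
print(is_candidate_variant(10, 1000))   # count compatible with error alone
print(is_candidate_variant(50, 1000))   # count far above the error expectation
```

Working in log space avoids overflow in the binomial coefficients at high sequencing depth. A real pipeline would also need to account for base qualities, strand bias, and multiple testing across sites.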
NASA Astrophysics Data System (ADS)
Flot, J.-F.; Licuanan, W. Y.; Nakano, Y.; Payri, C.; Cruaud, C.; Tillier, S.
2008-12-01
The taxonomy of corals of the genus Seriatopora has not previously been studied using molecular sequence markers. As a first step toward a re-evaluation of species boundaries in this genus, mitochondrial sequence variability was analyzed in 51 samples collected from Okinawa, New Caledonia, and the Philippines. Four clusters of sequences were detected that showed little concordance with species currently recognized on a morphological basis. The most likely explanation is that the skeletal characters used for species identification are highly variable (polymorphic or phenotypically plastic); alternative explanations include introgression/hybridization, or deep coalescence and the retention of ancestral mitochondrial polymorphisms. In all individuals sequenced, two copies of trnW were found on either side of the atp8 gene near the putative D-loop, a novel mitochondrial gene arrangement that may have arisen from a duplication of the trnW-atp8 region followed by a deletion of one atp8.
Urbarova, Ilona; Karlsen, Bård Ove; Okkenhaug, Siri; Seternes, Ole Morten; Johansen, Steinar D.; Emblem, Åse
2012-01-01
Marine bioprospecting is the search for new marine bioactive compounds and large-scale screening in extracts represents the traditional approach. Here, we report an alternative complementary protocol, called digital marine bioprospecting, based on deep sequencing of transcriptomes. We sequenced the transcriptomes from the adult polyp stage of two cold-water sea anemones, Bolocera tuediae and Hormathia digitata. We generated approximately 1.1 million quality-filtered sequencing reads by 454 pyrosequencing, which were assembled into approximately 120,000 contigs and 220,000 single reads. Based on annotation and gene ontology analysis we profiled the expressed mRNA transcripts according to known biological processes. As a proof-of-concept we identified polypeptide toxins with a potential blocking activity on sodium and potassium voltage-gated channels from digital transcriptome libraries. PMID:23170083
Lopez-Gomollon, Sara; Mohorianu, Irina; Szittya, Gyorgy; Moulton, Vincent; Dalmay, Tamas
2012-12-01
MicroRNAs negatively regulate the accumulation of mRNAs; therefore, when they are expressed in the same cells, their expression profiles show an inverse correlation. We previously described one positively correlated miRNA/target pair, but it is not known how widespread this phenomenon is. Here, we investigated the correlation between the expression profiles of differentially expressed miRNAs and their targets during tomato fruit development using deep sequencing, Northern blot and RT-qPCR. We found an equal number of positively and negatively correlated miRNA/target pairs, indicating that positive correlation is more frequent than previously thought. We also found that the correlation between microRNA and target expression profiles can vary between mRNAs belonging to the same gene family and even for the same target mRNA at different developmental stages. Since microRNAs always negatively regulate their targets, the high number of positively correlated microRNA/target pairs suggests that mutual exclusion could be as widespread as temporal regulation. The change of correlation during development suggests that the type of regulatory circuit directed by a microRNA can change over time and can be different for individual gene family members. Our results also highlight potential problems for expression profiling-based microRNA target identification/validation.
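The abstract does not say which correlation measure was applied. A minimal sketch, assuming Pearson correlation across matched developmental time points, shows how a miRNA/target pair could be labeled as positively or negatively correlated. The expression values are invented for illustration and are not data from the study.

```python
from math import sqrt


def pearson(x: list[float], y: list[float]) -> float:
    """Pearson correlation coefficient between two equally long profiles."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("profiles must have the same length (at least 2)")
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for a constant profile")
    return cov / sqrt(var_x * var_y)


# Hypothetical normalized expression at four developmental stages.
mirna = [1.0, 2.0, 4.0, 8.0]
target_a = [9.0, 7.0, 4.0, 1.0]   # falls as the miRNA rises
target_b = [2.0, 3.0, 5.0, 9.0]   # rises together with the miRNA

for name, profile in (("target_a", target_a), ("target_b", target_b)):
    r = pearson(mirna, profile)
    label = "positive" if r > 0 else "negative"
    print(f"{name}: r = {r:.2f} ({label} correlation)")
```

With only a handful of stages, the sign of r is easy to read but its significance is weak, which is one reason profile correlation alone is a fragile basis for target validation.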
Active fungi amidst a marine subsurface RNA paleome
NASA Astrophysics Data System (ADS)
Orsi, W.; Biddle, J.; Edgcomb, V.
2012-12-01
The deep marine subsurface is a vast habitat for microbial life where cells may live on geologic timescales. Since extracellular DNA in sediments may be preserved on long timescales, ribosomal RNA (rRNA) is suggested to be a proxy for the active fraction of a microbial community in the subsurface. During an investigation of eukaryotic 18S rRNA signatures by amplicon pyrosequencing, metazoan, plant, and diatom rRNA signatures were recovered from marine sediments up to 2.7 million years old, suggesting that rRNA may be much more stable than previously considered in the marine subsurface. This finding confirms the concept of a paleome, extending it to include rRNA. Within the same dataset, unique profiles of fungi were found across a range of marine subsurface provinces exhibiting statistically significant correlations with total organic carbon (TOC), sulfide, and dissolved inorganic carbon (DIC). Sequences from metazoans, plants and diatoms showed different correlation patterns, consistent with a depth-controlled paleome. The fungal correlations with geochemistry allow the inference that some fungi are active and adapted for survival in the marine subsurface. A metatranscriptomic analysis of fungal derived mRNA confirms that fungi are metabolically active and utilize a range of organic and inorganic substrates in the marine subsurface.
Wu, Xiaofen; Pedersen, Karsten; Edlund, Johanna; Eriksson, Lena; Åström, Mats; Andersson, Anders F; Bertilsson, Stefan; Dopson, Mark
2017-03-23
Deep terrestrial biosphere waters are separated from the light-driven surface by the time required to percolate to the subsurface. Despite biofilms being the dominant form of microbial life in many natural environments, they have received little attention in the oligotrophic and anaerobic waters found in deep bedrock fractures. This study is the first to use community DNA sequencing to describe biofilm formation under in situ conditions in the deep terrestrial biosphere. In this study, flow cells were attached to boreholes containing either "modern marine" or "old saline" waters of different origin and degree of isolation from the light-driven surface of the earth. Using 16S rRNA gene sequencing, we showed that planktonic and attached populations were dissimilar while gene frequencies in the metagenomes suggested that hydrogen-fed, carbon dioxide- and nitrogen-fixing populations were responsible for biofilm formation across the two aquifers. Metagenome analyses further suggested that only a subset of the populations were able to attach and produce an extracellular polysaccharide matrix. Initial biofilm formation is thus likely to be mediated by a few bacterial populations which were similar to Epsilonproteobacteria, Deltaproteobacteria, Betaproteobacteria, Verrucomicrobia, and unclassified bacteria. Populations potentially capable of attaching to a surface and to produce extracellular polysaccharide matrix for attachment were identified in the terrestrial deep biosphere. Our results suggest that the biofilm populations were taxonomically distinct from the planktonic community and were enriched in populations with a chemolithoautotrophic and diazotrophic metabolism coupling hydrogen oxidation to energy conservation under oligotrophic conditions.
Proteogenomic database construction driven from large scale RNA-seq data.
Woo, Sunghee; Cha, Seong Won; Merrihew, Gennifer; He, Yupeng; Castellana, Natalie; Guest, Clark; MacCoss, Michael; Bafna, Vineet
2014-01-03
The advent of inexpensive RNA-seq technologies and other deep sequencing technologies for RNA has the promise to radically improve genomic annotation, providing information on transcribed regions and splicing events in a variety of cellular conditions. Using MS-based proteogenomics, many of these events can be confirmed directly at the protein level. However, the integration of large amounts of redundant RNA-seq data and mass spectrometry data poses a challenging problem. Our paper addresses this by construction of a compact database that contains all useful information expressed in RNA-seq reads. Applying our method to cumulative C. elegans data reduced 496.2 GB of aligned RNA-seq SAM files to 410 MB of splice graph database written in FASTA format. This corresponds to 1000× compression of data size, without loss of sensitivity. We performed a proteogenomics study using the custom data set, using a completely automated pipeline, and identified a total of 4044 novel events, including 215 novel genes, 808 novel exons, 12 alternative splicings, 618 gene-boundary corrections, 245 exon-boundary changes, 938 frame shifts, 1166 reverse strands, and 42 translated UTRs. Our results highlight the usefulness of transcript + proteomic integration for improved genome annotations.
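The quoted compression figure can be checked against the two file sizes given in the abstract. This small calculation assumes decimal units (1 GB = 1000 MB), which the abstract does not specify; under that assumption the ratio comes out somewhat above 1200×, consistent with the rounded order of magnitude of 1000× stated by the authors.

```python
def compression_ratio(original_gb: float, compressed_mb: float,
                      mb_per_gb: float = 1000.0) -> float:
    """Ratio of original to compressed size; mb_per_gb is an assumed unit convention."""
    if compressed_mb <= 0:
        raise ValueError("compressed size must be positive")
    return original_gb * mb_per_gb / compressed_mb


# Sizes from the abstract: 496.2 GB of SAM files, 410 MB of splice graph FASTA.
ratio = compression_ratio(496.2, 410.0)
print(f"{ratio:.0f}x")  # prints "1210x"
```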
Photobacterium kishitanii sp. nov., a luminous marine bacterium symbiotic with deep-sea fishes.
Ast, Jennifer C; Cleenwerck, Ilse; Engelbeen, Katrien; Urbanczyk, Henryk; Thompson, Fabiano L; De Vos, Paul; Dunlap, Paul V
2007-09-01
Six representatives of a luminous bacterium commonly found in association with deep, cold-dwelling marine fishes were isolated from the light organs and skin of different fish species. These bacteria were Gram-negative, catalase-positive, and weakly oxidase-positive or oxidase-negative. Morphologically, cells of these strains were coccoid or coccoid-rods, occurring singly or in pairs, and motile by means of polar flagellation. After growth on seawater-based agar medium at 22 degrees C for 18 h, colonies were small, round and white, with an intense cerulean blue luminescence. Analysis of 16S rRNA gene sequence similarity placed these bacteria in the genus Photobacterium. Phylogenetic analysis based on seven housekeeping gene sequences (16S rRNA gene, gapA, gyrB, pyrH, recA, rpoA and rpoD), seven gene sequences of the lux operon (luxC, luxD, luxA, luxB, luxF, luxE and luxG) and four gene sequences of the rib operon (ribE, ribB, ribH and ribA), resolved the six strains as members of the genus Photobacterium and as a clade distinct from other species of Photobacterium. These strains were most closely related to Photobacterium phosphoreum and Photobacterium iliopiscarium. DNA-DNA hybridization values between the designated type strain, Photobacterium kishitanii pjapo.1.1(T), and P. phosphoreum LMG 4233(T), P. iliopiscarium LMG 19543(T) and Photobacterium indicum LMG 22857(T) were 51, 43 and 19 %, respectively. In AFLP analysis, the six strains clustered together, forming a group distinct from other analysed species. The fatty acid C(17 : 0) cyclo was present in these bacteria, but not in P. phosphoreum, P. iliopiscarium or P. indicum. A combination of biochemical tests (arginine dihydrolase and lysine decarboxylase) differentiates these strains from P. phosphoreum and P. indicum. The DNA G+C content of P. kishitanii pjapo.1.1(T) is 40.2 %, and the genome size is approximately 4.2 Mbp, in the form of two circular chromosomes. 
These strains represent a novel species, for which the name Photobacterium kishitanii sp. nov. is proposed. The type strain, pjapo.1.1(T) (=ATCC BAA-1194(T)=LMG 23890(T)), is a luminous symbiont isolated from the light organ of the deep-water fish Physiculus japonicus.
A korarchaeal genome reveals insights into the evolution of the Archaea
DOE Office of Scientific and Technical Information (OSTI.GOV)
Anderson, Iain J; Elkins, James G.; Podar, Mircea
2008-06-05
The candidate division Korarchaeota comprises a group of uncultivated microorganisms that, by their small subunit rRNA phylogeny, may have diverged early from the major archaeal phyla Crenarchaeota and Euryarchaeota. Here, we report the initial characterization of a member of the Korarchaeota with the proposed name, "Candidatus Korarchaeum cryptofilum," which exhibits an ultrathin filamentous morphology. To investigate possible ancestral relationships between deep-branching Korarchaeota and other phyla, we used whole-genome shotgun sequencing to construct a complete composite korarchaeal genome from enriched cells. The genome was assembled into a single contig 1.59 Mb in length with a G + C content of 49%. Of the 1,617 predicted protein-coding genes, 1,382 (85%) could be assigned to a revised set of archaeal Clusters of Orthologous Groups (COGs). The predicted gene functions suggest that the organism relies on a simple mode of peptide fermentation for carbon and energy and lacks the ability to synthesize de novo purines, CoA, and several other cofactors. Phylogenetic analyses based on conserved single genes and concatenated protein sequences positioned the korarchaeote as a deep archaeal lineage with an apparent affinity to the Crenarchaeota. However, the predicted gene content revealed that several conserved cellular systems, such as cell division, DNA replication, and tRNA maturation, resemble the counterparts in the Euryarchaeota. In light of the known composition of archaeal genomes, the Korarchaeota might have retained a set of cellular features that represents the ancestral archaeal form.
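The G + C content reported for the genome is the fraction of bases that are guanine or cytosine. A minimal sketch of that calculation follows; ignoring ambiguous bases such as N is an assumption of this example, and the toy sequence is illustrative rather than korarchaeal data.

```python
def gc_content(sequence: str) -> float:
    """Fraction of unambiguous bases (A, C, G, T) that are G or C."""
    bases = [b for b in sequence.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no unambiguous bases")
    return sum(b in "GC" for b in bases) / len(bases)


# Toy sequence: 4 of its 8 unambiguous bases are G or C.
toy = "ATGCGCTA"
print(f"G + C content: {gc_content(toy):.0%}")
```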
A Korarchael Genome Reveals Insights into the Evolution of the Archaea
DOE Office of Scientific and Technical Information (OSTI.GOV)
Lapidus, Alla; Elkins, James G.; Podar, Mircea
2008-01-07
The candidate division Korarchaeota comprises a group of uncultivated microorganisms that, by their small subunit rRNA phylogeny, may have diverged early from the major archaeal phyla Crenarchaeota and Euryarchaeota. Here, we report the initial characterization of a member of the Korarchaeota with the proposed name, "Candidatus Korarchaeum cryptofilum," which exhibits an ultrathin filamentous morphology. To investigate possible ancestral relationships between deep-branching Korarchaeota and other phyla, we used whole-genome shotgun sequencing to construct a complete composite korarchaeal genome from enriched cells. The genome was assembled into a single contig 1.59 Mb in length with a G + C content of 49%. Of the 1,617 predicted protein-coding genes, 1,382 (85%) could be assigned to a revised set of archaeal Clusters of Orthologous Groups (COGs). The predicted gene functions suggest that the organism relies on a simple mode of peptide fermentation for carbon and energy and lacks the ability to synthesize de novo purines, CoA, and several other cofactors. Phylogenetic analyses based on conserved single genes and concatenated protein sequences positioned the korarchaeote as a deep archaeal lineage with an apparent affinity to the Crenarchaeota. However, the predicted gene content revealed that several conserved cellular systems, such as cell division, DNA replication, and tRNA maturation, resemble the counterparts in the Euryarchaeota. In light of the known composition of archaeal genomes, the Korarchaeota might have retained a set of cellular features that represents the ancestral archaeal form.
Tarn, Jonathan; Peoples, Logan M.; Hardy, Kevin; Cameron, James; Bartlett, Douglas H.
2016-01-01
Relatively few studies have described the microbial populations present in ultra-deep hadal environments, largely as a result of difficulties associated with sampling. Here we report Illumina-tag V6 16S rRNA sequence-based analyses of the free-living and particle-associated microbial communities recovered from locations within two of the deepest hadal sites on Earth, the Challenger Deep (10,918 meters below surface-mbs) and the Sirena Deep (10,667 mbs) within the Mariana Trench, as well as one control site (Ulithi Atoll, 761 mbs). Seawater samples were collected using an autonomous lander positioned ~1 m above the seafloor. The bacterial populations within the Mariana Trench bottom water samples were dissimilar to other deep-sea microbial communities, though with overlap with those of diffuse flow hydrothermal vents and deep-subsurface locations. Distinct particle-associated and free-living bacterial communities were found to exist. The hadal bacterial populations were also markedly different from one another, indicating the likelihood of different chemical conditions at the two sites. In contrast to the bacteria, the hadal archaeal communities were more similar to other less deep datasets and to each other due to an abundance of cosmopolitan deep-sea taxa. The hadal communities were enriched in 34 bacterial and 4 archaeal operational taxonomic units (OTUs) including members of the Gammaproteobacteria, Epsilonproteobacteria, Marinimicrobia, Cyanobacteria, Deltaproteobacteria, Gemmatimonadetes, Atribacteria, Spirochaetes, and Euryarchaeota. Sequences matching cultivated piezophiles were notably enriched in the Challenger Deep, especially within the particle-associated fraction, and were found in higher abundances than in other hadal studies, where they were either far less prevalent or missing. 
Our results indicate the importance of heterotrophy, sulfur-cycling, and methane and hydrogen utilization within the bottom waters of the deeper regions of the Mariana Trench, and highlight novel community features of these extreme habitats. PMID:27242695
2010-01-01
Background: Nematodes represent the most abundant benthic metazoa in one of the largest habitats on earth, the deep sea. Characterizing major patterns of biodiversity within this dominant group is a critical step towards understanding evolutionary patterns across this vast ecosystem. The present study has aimed to place deep-sea nematode species into a phylogenetic framework, investigate relationships between shallow water and deep-sea taxa, and elucidate phylogeographic patterns amongst the deep-sea fauna. Results: Molecular data (18S and 28S rRNA) confirm a high diversity amongst deep-sea Enoplids. There is no evidence for endemic deep-sea lineages in Maximum Likelihood or Bayesian phylogenies, and Enoplids do not cluster according to depth or geographic location. Tree topologies suggest frequent interchanges between deep-sea and shallow water habitats, as well as a mixture of early radiations and more recently derived lineages amongst deep-sea taxa. This study also provides convincing evidence of cosmopolitan marine species, recovering a subset of Oncholaimid nematodes with identical gene sequences (18S, 28S and cox1) at trans-Atlantic sample sites. Conclusions: The complex clade structures recovered within the Enoplida support a high global species richness for marine nematodes, with phylogeographic patterns suggesting the existence of closely related, globally distributed species complexes in the deep sea. True cosmopolitan species may additionally exist within this group, potentially driven by specific life history traits of Enoplids. Although this investigation aimed to intensively sample nematodes from the order Enoplida, specimens were only identified down to genus (at best) and our sampling regime focused on an infinitesimally small fraction of the deep-sea floor. Future nematode studies should incorporate an extended sample set covering a wide depth range (shelf, bathyal, and abyssal sites), utilize additional genetic loci (e.g. mtDNA) that are informative at the species level, and apply high-throughput sequencing methods to fully assay community diversity. Finally, further molecular studies are needed to determine whether phylogeographic patterns observed in Enoplids are common across other ubiquitous marine groups (e.g. Chromadorida, Monhysterida). PMID:21167065
Cultivation and diversity of fungi buried in the Baltic Sea sediments
NASA Astrophysics Data System (ADS)
Xiao, N.
2015-12-01
Molecular biological and cultivation studies have been carried out on the prokaryotic microbial community in the deep biosphere. Compared to the prokaryotic community, few attempts have been made to study the eukaryotic microbial community. Here we report a study of fungi buried in deep-subsurface sediments using both cultivation and a molecular diversity survey. Cultivation targeting fungi was performed on a sequence of sediment samples obtained from the Landsort Deep site in the Baltic Sea during IODP Expedition 347. Six culture media with different nutrient and salt concentrations were tried for fungal cultivation, and 50 fungal isolates were obtained from the sediment samples. The surface sediments were rich in fungal strains, but the deep sediments were not. Internal Transcribed Spacer (ITS) regions of RNA genes were amplified for the identification of the isolates. The isolates were classified into 11 different genera. Pseudeurotium bakeri was the dominant strain throughout the glacial and interglacial sediments. We also found different representative fungal strains in glacial and interglacial sediments, suggesting that the cultivated strains were buried from different sources. The survey of fungal diversity was done by sequencing the 18S RNA genes in the total DNA extracted from selected sediment samples. The fungal community clustered differently in the glacial and interglacial sediments. Our results revealed the presence and activity of fungi in the deep biosphere of the Baltic Sea and provided evidence of a fungal community response to climate change.
Response of Bacterial Communities to Different Detritus Compositions in Arctic Deep-Sea Sediments
Hoffmann, Katy; Hassenrück, Christiane; Salman-Carvalho, Verena; Holtappels, Moritz; Bienhold, Christina
2017-01-01
Benthic deep-sea communities are largely dependent on particle flux from surface waters. In the Arctic Ocean, environmental changes occur more rapidly than in other ocean regions, and have major effects on the export of organic matter to the deep sea. Because bacteria constitute the majority of deep-sea benthic biomass and influence global element cycles, it is important to better understand how changes in organic matter input will affect bacterial communities at the Arctic seafloor. In a multidisciplinary ex situ experiment, benthic bacterial deep-sea communities from the Long-Term Ecological Research Observatory HAUSGARTEN were supplemented with different types of habitat-related detritus (chitin, Arctic algae) and incubated for 23 days under in situ conditions. Chitin addition caused strong changes in community activity, while community structure remained similar to unfed control incubations. In contrast, the addition of phytodetritus resulted in strong changes in community composition, accompanied by increased community activity, indicating the need for adaptation in these treatments. High-throughput sequencing of the 16S rRNA gene and 16S rRNA revealed distinct taxonomic groups of potentially fast-growing, opportunistic bacteria in the different detritus treatments. Compared to the unfed control, Colwelliaceae, Psychromonadaceae, and Oceanospirillaceae increased in relative abundance in the chitin treatment, whereas Flavobacteriaceae, Marinilabiaceae, and Pseudoalteromonadaceae increased in the phytodetritus treatments. Hence, these groups may constitute indicator taxa for the different organic matter sources at this study site. In summary, differences in community structure and in the uptake and remineralization of carbon in the different treatments suggest an effect of organic matter quality on bacterial diversity as well as on carbon turnover at the seafloor, an important feedback mechanism to be considered in future climate change scenarios. PMID:28286496
Zhu, Wenhui; Liu, Shanshan; Liu, Jia; Zhou, Yan; Lin, Huancai
2018-05-01
Adherence capacity is one of the principal virulence factors of Streptococcus mutans, and adhesion virulence factors are controlled by small RNAs (sRNAs) at the post-transcriptional level in various bacteria. Here, we aimed to identify and decipher putative adhesion-related sRNAs in clinical strains of S. mutans. RNA deep-sequencing was performed to identify potential sRNAs under different adhesion conditions. The expression of sRNAs was analysed by quantitative real-time PCR (qRT-PCR), and bioinformatic methods were used to predict the functional characteristics of sRNAs. A total of 736 differentially expressed candidate sRNAs were predicted, and these included 352 sRNAs located on the antisense to mRNA (AM) and 384 sRNAs in intergenic regions (IGRs). The top 7 differentially expressed sRNAs were successfully validated by qRT-PCR in UA159, and 2 of these were further confirmed in 100 clinical isolates. Moreover, the sequences of two sRNAs were conserved in other Streptococcus species, indicating a conserved role in such closely related species. A good correlation between the expression of sRNAs and the adhesion of 100 clinical strains was observed, which, combined with GO and KEGG, provides a perspective for the comprehension of sRNA function annotation. This study revealed a multitude of novel putative adhesion-related sRNAs in S. mutans and contributed to a better understanding of information concerning the transcriptional regulation of adhesion in S. mutans.
Weber, Felix; Mylnikov, Alexander P; Jürgens, Klaus; Wylezich, Claudia
2017-03-01
The study of cultured strains has a long tradition in protistological research and has greatly contributed to establishing the morphology, taxonomy, and ecology of many protist species. However, cultivation-independent techniques, based on 18S rRNA gene sequences, have demonstrated that natural protistan assemblages mainly consist of hitherto uncultured protist lineages. This mismatch impedes the linkage of environmental diversity data with the biological features of cultured strains. Thus, novel taxa need to be obtained in culture to close this knowledge gap. In this study, traditional cultivation techniques were applied to samples from coastal surface waters and from deep oxygen-depleted waters of the Baltic Sea. Based on 18S rRNA gene sequencing, 126 monoclonal cultures of heterotrophic protists were identified. The majority of the isolated strains were affiliated with already cultured and described taxa, mainly chrysophytes and bodonids. This was likely due to "culturing bias" but also to the eutrophic nature of the Baltic Sea. Nonetheless, ~ 12% of the isolates in our culture collection showed highly divergent 18S rRNA gene sequences compared to those of known organisms and thus may represent novel taxa, either at the species level or at the genus level. Moreover, we also obtained evidence that some of the isolated taxa are ecologically relevant, under certain conditions, in the Baltic Sea. © 2016 The Author(s) Journal of Eukaryotic Microbiology © 2016 International Society of Protistologists.
Fung, Elisabeth; Hill, Kelly; Hogendoorn, Katja; Glatz, Richard V; Napier, Kathryn R; Bellgard, Matthew I; Barrero, Roberto A
2018-02-01
Bee pollination is critical for improving productivity of one third of all plants or plant products consumed by humans. The health of honey bees is in decline in many countries worldwide, and RNA viruses together with other biological, environmental and anthropogenic factors have been identified as the main causes. The rapid genetic variation of viruses represents a challenge for diagnosis. Thus, application of deep sequencing methods for detection and analysis of viruses has increased in recent years. In this study, we leverage the innate Dicer-2-mediated antiviral response against viruses to reconstruct complete viral genomes using virus-derived small interfering RNAs (vsiRNAs). Symptomatic A. mellifera larvae collected from hives free of Colony Collapse Disorder (CCD) and the parasitic Varroa mite (Varroa destructor) were used to generate more than 107 million small RNA reads. We show that de novo assembly of insect viral sequences is less fragmented using only 22 nt long vsiRNAs rather than a combination of 21-22 nt small RNAs. Our results show that A. mellifera larvae activate the RNAi immune response in the presence of Sacbrood virus (SBV). We assembled three SBV genomes from three individual larvae from different hives in a single apiary, with 1-2% nucleotide sequence variability among them. We found 3-4% variability between SBV genomes generated in this study and earlier published Australian variants, suggesting the presence of different SBV quasispecies within the country. Copyright © 2018. Published by Elsevier Inc.
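The length-based read selection described in the abstract above (assembling from 22 nt reads only, versus a 21-22 nt pool) can be illustrated with a minimal Python sketch. The function name `filter_by_length` and the toy reads are assumptions made for illustration; they are not the authors' pipeline or data, and the assembly step itself is not shown.

```python
def filter_by_length(reads, lengths):
    """Keep only reads whose length is in the allowed set."""
    allowed = set(lengths)
    return [r for r in reads if len(r) in allowed]


# Toy reads invented for illustration only (not data from the study).
reads = ["A" * 21, "C" * 22, "G" * 22, "T" * 23]

# Selection comparable to "only 22 nt long vsiRNAs".
only_22 = filter_by_length(reads, [22])

# Selection comparable to "a combination of 21-22 nt small RNAs".
pool_21_22 = filter_by_length(reads, [21, 22])

print(len(only_22), len(pool_21_22))
```

In practice the filtered reads would be passed to a de novo assembler; which assembler and parameters were used is not stated in the abstract.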
Santamaria, Monica; Fosso, Bruno; Licciulli, Flavio; Balech, Bachir; Larini, Ilaria; Grillo, Giorgio; De Caro, Giorgio; Liuni, Sabino; Pesole, Graziano
2018-01-04
A holistic understanding of environmental communities is the new challenge of metagenomics. Accordingly, the amplicon-based or metabarcoding approach, largely applied to investigate bacterial microbiomes, is moving to the eukaryotic world too. Indeed, the analysis of metabarcoding data may provide a comprehensive assessment of both bacterial and eukaryotic composition in a variety of environments, including the human body. In this respect, whereas hypervariable regions of the 16S rRNA are the de facto standard barcode for bacteria, the Internal Transcribed Spacer 1 (ITS1) of the ribosomal RNA gene cluster has shown a high potential in discriminating eukaryotes at deep taxonomic levels. As metabarcoding data analysis relies on the availability of a well-curated barcode reference resource, a comprehensive collection of ITS1 sequences supplied with robust taxonomies is highly needed. To address this issue, we created ITSoneDB (available at http://itsonedb.cloud.ba.infn.it/) which in its current version hosts 985 240 ITS1 sequences spanning over 134 000 eukaryotic species. Each ITS1 is mapped on the NCBI reference taxonomy with its start and end positions precisely annotated. ITSoneDB has been developed in accordance with the FAIR guidelines by enabling the users to query and download its content through a simple web-interface and access relevant metadata by cross-linking to the European Nucleotide Archive. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.
Li, Xinzheng
2017-07-01
This paper reviews the taxonomic and biodiversity studies of deep-sea invertebrates in the South China Sea based on the samples collected by the Chinese manned deep-sea submersible Jiaolong. To date, 6 new species have been described, including the sponges Lophophysema eversa, Saccocalyx microhexactin and Semperella jiaolongae as well as the crustaceans Uroptychus jiaolongae, Uroptychus spinulosus and Globospongicola jiaolongi; some newly recorded species from the South China Sea have also been reported. The Bathymodiolus platifrons-Shinkaia crosnieri deep-sea cold seep community has been reported by Li (2015), as has the mitochondrial genome of the glass sponge L. eversa by Zhang et al. (2016). The population structures of two dominant species, the shrimp Shinkaia crosnieri and the mussel Bathymodiolus platifrons, from the cold seep Bathymodiolus platifrons-Shinkaia crosnieri community in the South China Sea and the hydrothermal vents in the Okinawa Trough, were compared using molecular analysis. The systematic position of the shrimp genus Globospongicola was discussed based on 16S rRNA gene sequences. © 2017 International Society of Zoological Sciences, Institute of Zoology/Chinese Academy of Sciences and John Wiley & Sons Australia, Ltd.
Wei, Jun S; Kuznetsov, Igor B; Zhang, Shile; Song, Young K; Asgharzadeh, Shahab; Sindiri, Sivasish; Wen, Xinyu; Patidar, Rajesh; Nagaraj, Sushma; Walton, Ashley; Guidry Auvil, Jaime M; Gerhard, Daniela S; Yuksel, Aysen; Catchpoole, Daniel R; Hewitt, Stephen M; Sondel, Paul M; Seeger, Robert C; Maris, John M; Khan, Javed
2018-05-21
High-risk neuroblastoma is an aggressive disease. DNA sequencing studies have revealed a paucity of actionable genomic alterations and a low mutation burden, posing challenges to develop effective novel therapies. We used RNA sequencing (RNA-seq) to investigate the biology of this disease including a focus on tumor-infiltrating lymphocytes (TILs). We performed deep RNA-seq on pre-treatment diagnostic tumors from 129 high-risk and 21 low- or intermediate-risk patients with neuroblastomas. We used single-sample gene set enrichment analysis to detect gene expression signatures of TILs in tumors and examined their association with clinical and molecular parameters including patient outcome. The expression profiles of 190 additional pre-treatment diagnostic neuroblastomas, a neuroblastoma tissue microarray, and T-cell receptor (TCR) sequencing were used to validate our findings. We found that MYCN-not-amplified (MYCN-NA) tumors had significantly higher cytotoxic TIL signatures compared to MYCN-amplified (MYCN-A) tumors. A reported MYCN-activation-signature was significantly associated with poor outcome for high-risk patients with MYCN-NA tumors; however, a subgroup of these patients who had elevated activated NK cells, CD8+ T-cells, and cytolytic signatures showed improved outcome and expansion of infiltrating T-cell receptor (TCR) clones. Furthermore, we observed up-regulation of immune exhaustion marker genes, indicating an immune suppressive microenvironment in these neuroblastomas. Conclusions: This study provides evidence that RNA signatures of cytotoxic TIL are associated with the presence of activated NK-/T-cells and improved outcomes in high-risk neuroblastoma patients harboring MYCN-NA tumors. Our findings suggest that these high-risk patients with MYCN-NA neuroblastoma may benefit from additional immunotherapies incorporated into the current therapeutic strategies. Copyright ©2018, American Association for Cancer Research.
Bhattarai, Sunil; Aly, Ahmed; Garcia, Kristy; Ruiz, Diandra; Pontarelli, Fabrizio; Dharap, Ashutosh
2018-06-03
Gene expression in cerebral ischemia has been a subject of intense investigations for several years. Studies utilizing probe-based high-throughput methodologies such as microarrays have contributed significantly to our existing knowledge but lacked the capacity to dissect the transcriptome in detail. Genome-wide RNA-sequencing (RNA-seq) enables comprehensive examinations of transcriptomes for attributes such as strandedness, alternative splicing, alternative transcription start/stop sites, and sequence composition, thus providing a very detailed account of gene expression. Leveraging this capability, we conducted an in-depth, genome-wide evaluation of the protein-coding transcriptome of the adult mouse cortex after transient focal ischemia at 6, 12, or 24 h of reperfusion using RNA-seq. We identified a total of 1007 transcripts at 6 h, 1878 transcripts at 12 h, and 1618 transcripts at 24 h of reperfusion that were significantly altered as compared to sham controls. With isoform-level resolution, we identified 23 splice variants arising from 23 genes that were novel mRNA isoforms. For a subset of genes, we detected reperfusion time-point-dependent splice isoform switching, indicating an expression and/or functional switch for these genes. Finally, for 286 genes across all three reperfusion time-points, we discovered multiple, distinct, simultaneously expressed and differentially altered isoforms per gene that were generated via alternative transcription start/stop sites. Of these, 165 isoforms derived from 109 genes were novel mRNAs. Together, our data unravel the protein-coding transcriptome of the cerebral cortex at an unprecedented depth to provide several new insights into the flexibility and complexity of stroke-related gene transcription and transcript organization.
Sattar, Sampurna; Anstead, James A.; Sunkar, Ramanjulu; Thompson, Gary A.
2012-01-01
Background The regulatory role of small RNAs (sRNAs) in various biological processes is an active area of investigation; however, there has been limited information available on the role of sRNAs in plant-insect interactions. This study was designed to identify sRNAs in cotton-melon aphid (Aphis gossypii) during the Vat-mediated resistance interaction with melon (Cucumis melo). Methodology/Principal Findings The role of miRNAs was investigated in response to aphid herbivory, during both resistant and susceptible interactions. sRNA libraries made from A. gossypii tissues feeding on Vat+ and Vat− plants revealed an unexpected abundance of 27 nt long sRNA sequences in the aphids feeding on Vat+ plants. Eighty-one conserved microRNAs (miRNAs), twelve aphid-specific miRNAs, and nine novel candidate miRNAs were also identified. Plant miRNAs found in the aphid libraries were most likely ingested during phloem feeding. The presence of novel miRNAs was verified by qPCR experiments in both resistant Vat+ and susceptible Vat− interactions. The comparative analyses revealed that novel miRNAs were differentially regulated during the resistant and susceptible interactions. Gene targets predicted for the miRNAs identified in this study by in silico analyses revealed their involvement in morphogenesis and anatomical structure determination, signal transduction pathways, cell differentiation and catabolic processes. Conclusion/Significance In this study, conserved and novel miRNAs were reported in A. gossypii. Deep sequencing data showed differences in the abundance of miRNAs and piRNA-like sequences in A. gossypii. Quantitative RT-PCR revealed that A. gossypii miRNAs were differentially regulated during resistant and susceptible interactions. Aphids can also ingest plant miRNAs during phloem feeding that are stable in the insect. PMID:23173035
Du, Xinxin; Liu, Xiaobing; Zhang, Kai; Liu, Yuxiang; Cheng, Jie; Zhang, Quanqi
2018-05-16
The spotted knifejaw (Oplegnathus punctatus) is a newly emerging economical fishery species in China. Studies focused on the regulation of gonadal development and gametogenesis of spotted knifejaw are still insufficient. As key post-transcriptional regulators, miRNAs have been shown to play important roles in development and reproduction systems. In this study, small RNA deep sequencing in ovary and testis of spotted knifejaw was performed to screen miRNA expression patterns. After sequencing and bioinformatics analysis, a total of 247 conserved known miRNAs and 41 novel miRNAs were identified in spotted knifejaw gonads for the first time. In addition, 36 miRNAs were differentially expressed between testis and ovary. The putative target genes of differentially expressed (DE) miRNAs were significantly enriched in several pathways related to sexual differentiation and gonadal development, such as steroid hormone biosynthesis. Sequencing data were validated through qRT-PCR analysis of selected DE miRNAs. Dual-luciferase reporter analyses of filtered miRNA-target gene pairs confirmed that opu-miR-27b-3p targeted the piwi2 and mov10l1 3' UTRs and down-regulated their expression in spotted knifejaw. The notion that mov10l1 and piwi2 enhance germ cell proliferation and regulate gonadal development and gametogenesis suggests that opu-miR-27b-3p may attenuate this process in the gonads of spotted knifejaw. These findings provided insights into regulatory roles of gonadal miRNAs and supplied fundamental resources for further studies on miRNA-mediated post-transcriptional regulation in the reproductive system of spotted knifejaw. Copyright © 2018. Published by Elsevier Inc.
Restructuring of the Aquatic Bacterial Community by Hydric Dynamics Associated with Superstorm Sandy
Ulrich, Nikea; Rosenberger, Abigail; Brislawn, Colin; Wright, Justin; Kessler, Collin; Toole, David; Solomon, Caroline; Strutt, Steven; McClure, Erin
2016-01-01
ABSTRACT Bacterial community composition and longitudinal fluctuations were monitored in a riverine system during and after Superstorm Sandy to better characterize inter- and intracommunity responses associated with the disturbance associated with a 100-year storm event. High-throughput sequencing of the 16S rRNA gene was used to assess microbial community structure within water samples from Muddy Creek Run, a second-order stream in Huntingdon, PA, at 12 different time points during the storm event (29 October to 3 November 2012) and under seasonally matched baseline conditions. High-throughput sequencing of the 16S rRNA gene was used to track changes in bacterial community structure and divergence during and after Superstorm Sandy. Bacterial community dynamics were correlated to measured physicochemical parameters and fecal indicator bacteria (FIB) concentrations. Bioinformatics analyses of 2.1 million 16S rRNA gene sequences revealed a significant increase in bacterial diversity in samples taken during peak discharge of the storm. Beta-diversity analyses revealed longitudinal shifts in the bacterial community structure. Successional changes were observed, in which Betaproteobacteria and Gammaproteobacteria decreased in 16S rRNA gene relative abundance, while the relative abundance of members of the Firmicutes increased. Furthermore, 16S rRNA gene sequences matching pathogenic bacteria, including strains of Legionella, Campylobacter, Arcobacter, and Helicobacter, as well as bacteria of fecal origin (e.g., Bacteroides), exhibited an increase in abundance after peak discharge of the storm. This study revealed a significant restructuring of in-stream bacterial community structure associated with hydric dynamics of a storm event. IMPORTANCE In order to better understand the microbial risks associated with freshwater environments during a storm event, a more comprehensive understanding of the variations in aquatic bacterial diversity is warranted. 
This study investigated the bacterial communities during and after Superstorm Sandy to provide fine time point resolution of dynamic changes in bacterial composition. This study adds to the current literature by revealing the variation in bacterial community structure during the course of a storm. This study employed high-throughput DNA sequencing, which generated a deep analysis of inter- and intracommunity responses during a significant storm event. This study has highlighted the utility of applying high-throughput sequencing for water quality monitoring purposes, as this approach enabled a more comprehensive investigation of the bacterial community structure. Altogether, these data suggest a drastic restructuring of the stream bacterial community during a storm event and highlight the potential of high-throughput sequencing approaches for assessing the microbiological quality of our environment. PMID:27060115
Zhang, Bo; Zhang, Yan-Hong; Wang, Xin; Zhang, Hui-Xian; Lin, Qiang
2017-07-01
The deep sea is one of the most extensive ecosystems on earth. Organisms living there survive in an extremely harsh environment, and their mitochondrial energy metabolism might be a result of evolution. As one of the most important organelles, mitochondria generate energy through energy metabolism and play an important role in almost all biological activities. In this study, the mitogenome of a deep-sea sea anemone (Bolocera sp.) was sequenced and characterized. Like other metazoans, it contained 13 energy pathway protein-coding genes and two ribosomal RNAs. However, it also exhibited some unique features: just two transfer RNA genes, two group I introns, two transposon-like noncanonical open reading frames (ORFs), and a control region-like (CR-like) element. All of the mitochondrial genes were coded by the same strand (the H-strand). The genetic order and orientation were identical to those of most sequenced actiniarians. Phylogenetic analyses showed that this species was closely related to Bolocera tuediae. Positive selection analysis showed that three residues (31 L and 42 N in ATP6, 570 S in ND5) of Bolocera sp. were positively selected sites. By comparing these features with those of shallow sea anemone species, we deduced that these novel gene features may influence the activity of mitochondrial genes. This study may provide some clues regarding the adaptation of Bolocera sp. to the deep-sea environment.
Lan, DaoLiang; Xiong, XianRong; Wei, YanLi; Xu, Tong; Zhong, JinCheng; Zhi, XiangDong; Wang, Yong; Li, Jian
2014-09-01
RNA-Seq, a high-throughput (HT) sequencing technique, has been used effectively in large-scale transcriptomic studies, and is particularly useful for improving gene structure information and mining of new genes. In this study, RNA-Seq HT technology was employed to analyze the transcriptome of yak ovary. After Illumina-Solexa deep sequencing, 26826516 clean reads with a total of 4828772880 bp were obtained from the ovary library. Alignment analysis showed that 16992 yak genes mapped to the yak genome and 3734 of these genes were involved in alternative splicing. Gene structure refinement analysis showed that 7340 genes that were annotated in the yak genome could be extended at the 5' or 3' ends based on the alignments between the transcripts and the genome sequence. Novel transcript prediction analysis identified 6321 new transcripts with lengths ranging from 180 to 14884 bp, and 2267 of them were predicted to code proteins. BLAST analysis of the new transcripts showed that 1200-4933 mapped to the non-redundant (nr), nucleotide (nt) and/or SwissProt sequence databases. Comparative statistical analysis of the new mapped transcripts showed that the majority of them were similar to genes in Bos taurus (41.4%), Bos grunniens mutus (33.0%), Ovis aries (6.3%), Homo sapiens (2.8%), Mus musculus (1.6%) and other species. Functional analysis showed that these expressed genes were involved in various Gene Ontology (GO) categories and Kyoto Encyclopedia of Genes and Genomes pathways. GO analysis of the new transcripts found that the largest proportion of them was associated with reproduction. The results of this study will provide a basis for describing the normal transcriptome map of yak ovary and for future studies on yak breeding performance. Moreover, the results confirmed that RNA-Seq HT technology is highly advantageous in improving gene structure information and mining of new genes, as well as in providing valuable data to expand the yak genome information.
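As a quick consistency check on the figures quoted in the abstract above, the read and base counts can be combined into a mean number of bases per clean read, and the gene counts into an alternative-splicing fraction. This is a minimal sketch in Python; the variable names are illustrative, and the calculation assumes only the four numbers stated in the abstract.

```python
# Figures quoted in the abstract above.
clean_reads = 26_826_516
total_bases = 4_828_772_880
mapped_genes = 16_992
alt_spliced_genes = 3_734

# Mean bases per clean read (total bases divided by read count).
mean_read_length = total_bases / clean_reads

# Share of mapped genes reported as involved in alternative splicing.
alt_splicing_percent = 100 * alt_spliced_genes / mapped_genes

print(mean_read_length)                 # 180.0
print(round(alt_splicing_percent, 1))
```

The two counts are exactly consistent with 180 bases per clean read; whether that reflects single reads or read pairs is not stated in the abstract.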
Benardini, James N; Vaishampayan, Parag A; Schwendner, Petra; Swanner, Elizabeth; Fukui, Youhei; Osman, Sharif; Satomi, Masakata; Venkateswaran, Kasthuri
2011-06-01
A novel Gram-positive, motile, endospore-forming, aerobic bacterium was isolated from the NASA Phoenix Lander assembly clean room that exhibits 100 % 16S rRNA gene sequence similarity to two strains isolated from a deep subsurface environment. All strains are rod-shaped, endospore-forming bacteria, whose endospores are resistant to UV radiation up to 500 J m(-2). A polyphasic taxonomic study including traditional phenotypic tests, fatty acid analysis, 16S rRNA gene sequencing and DNA-DNA hybridization analysis was performed to characterize these novel strains. The 16S rRNA gene sequencing convincingly grouped these novel strains within the genus Paenibacillus as a separate cluster from previously described species. The similarity of 16S rRNA gene sequences among the novel strains was identical but only 98.1 to 98.5 % with their nearest neighbours Paenibacillus barengoltzii ATCC BAA-1209(T) and Paenibacillus timonensis CIP 108005(T). The menaquinone MK-7 was dominant in these novel strains as shown in other species of the genus Paenibacillus. The DNA-DNA hybridization dissociation value was <45 % with the closest related species. The novel strains had DNA G+C contents of 51.9 to 52.8 mol%. Phenotypically, the novel strains can be readily differentiated from closely related species by the absence of urease and gelatinase and the production of acids from a variety of sugars including l-arabinose. The major fatty acid was anteiso-C(15 : 0) as seen in P. barengoltzii and P. timonensis whereas the proportion of C(16 : 0) was significantly different from the closely related species. Based on phylogenetic and phenotypic results, it was concluded that these strains represent a novel species of the genus Paenibacillus, for which the name Paenibacillus phoenicis sp. nov. is proposed. The type strain is 3PO2SA(T) ( = NRRL B-59348(T) = NBRC 106274(T)).
Sahl, Jason W; Fairfield, Nathaniel; Harris, J Kirk; Wettergreen, David; Stone, William C; Spear, John R
2010-03-01
The deep phreatic thermal explorer (DEPTHX) is an autonomous underwater vehicle designed to navigate an unexplored environment, generate high-resolution three-dimensional (3-D) maps, collect biological samples based on an autonomous sampling decision, and return to its origin. In the spring of 2007, DEPTHX was deployed in Zacatón, a deep (approximately 318 m), limestone, phreatic sinkhole (cenote) in northeastern Mexico. As DEPTHX descended, it generated a 3-D map based on the processing of range data from 54 onboard sonars. The vehicle collected water column samples and wall biomat samples throughout the depth profile of the cenote. Post-expedition sample analysis via comparative analysis of 16S rRNA gene sequences revealed a wealth of microbial diversity. Traditional Sanger gene sequencing combined with a barcoded-amplicon pyrosequencing approach revealed novel, phylum-level lineages from the domains Bacteria and Archaea; in addition, several novel subphylum lineages were also identified. Overall, DEPTHX successfully navigated and mapped Zacatón, and collected biological samples based on an autonomous decision, which revealed novel microbial diversity in a previously unexplored environment.
Morris, R. M.; Rappé, M. S.; Urbach, E.; Connon, S. A.; Giovannoni, S. J.
2004-01-01
Since their initial discovery in samples from the north Atlantic Ocean, 16S rRNA genes related to the environmental gene clone cluster known as SAR202 have been recovered from pelagic freshwater, marine sediment, soil, and deep subsurface terrestrial environments. Together, these clones form a major, monophyletic subgroup of the phylum Chloroflexi. While members of this diverse group are consistently identified in the marine environment, there are currently no cultured representatives, and very little is known about their distribution or abundance in the world's oceans. In this study, published and newly identified SAR202-related 16S rRNA gene sequences were used to further resolve the phylogeny of this cluster and to design taxon-specific oligonucleotide probes for fluorescence in situ hybridization. Direct cell counts from the Bermuda Atlantic time series study site in the north Atlantic Ocean, the Hawaii ocean time series site in the central Pacific Ocean, and along the Newport hydroline in eastern Pacific coastal waters showed that SAR202 cluster cells were most abundant below the deep chlorophyll maximum and that they persisted to 3,600 m in the Atlantic Ocean and to 4,000 m in the Pacific Ocean, the deepest samples used in this study. On average, members of the SAR202 group accounted for 10.2% (±5.7%) of all DNA-containing bacterioplankton between 500 and 4,000 m. PMID:15128540
Chemosynthetic bacteria found in bivalve species from mud volcanoes of the Gulf of Cadiz.
Rodrigues, Clara F; Webster, Gordon; Cunha, Marina R; Duperron, Sébastien; Weightman, Andrew J
2010-09-01
As in other cold seeps, the dominant bivalves in mud volcanoes (MV) from the Gulf of Cadiz are macrofauna belonging to the families Solemyidae (Acharax sp., Petrasma sp.), Lucinidae (Lucinoma sp.), Thyasiridae (Thyasira vulcolutre) and Mytilidae (Bathymodiolus mauritanicus). The delta(13)C values measured in solemyid, lucinid and thyasirid specimens support the hypothesis of thiotrophic nutrition, whereas isotopic signatures of B. mauritanicus suggest methanotrophic nutrition. The indication by stable isotope analysis that chemosynthetic bacteria make a substantial contribution to the nutrition of the bivalves led us to investigate their associated bacteria and their phylogenetic relationships based on comparative 16S rRNA gene sequence analysis. PCR-denaturing gradient gel electrophoresis analysis and cloning of bacterial 16S rRNA-encoding genes confirmed the presence of sulfide-oxidizing symbionts within gill tissues of many of the studied specimens. Phylogenetic analysis of bacterial 16S rRNA gene sequences demonstrated that most bacteria were related to known sulfide-oxidizing endosymbionts found in other deep-sea chemosynthetic environments, with the co-occurrence of methane-oxidizing symbionts in Bathymodiolus specimens. This study confirms the presence of several chemosynthetic bivalves in the Gulf of Cadiz and further highlights the importance of sulfide- and methane-oxidizing symbionts in the trophic ecology of macrobenthic communities in MV.
MicroRNA analysis in mouse neuro-2a cells after pseudorabies virus infection.
Li, Yongtao; Zheng, Guanmin; Zhang, Yujuan; Yang, Xia; Liu, Hongying; Chang, Hongtao; Wang, Xinwei; Zhao, Jun; Wang, Chuanqing; Chen, Lu
2017-06-01
Pseudorabies virus (PRV), an alpha herpesvirus, can enter the mammalian nervous system, causing Aujeszky's disease. Previous studies have reported an alteration of microRNA (miRNA) expression levels during PRV infections. However, knowledge regarding the miRNA response of nervous cells to PRV infection is still limited. To address this issue, small RNA libraries from infected and uninfected mouse neuroblastoma cells were assessed after Illumina deep sequencing. A total of eight viral miRNAs were identified, and ten host miRNAs showed significantly different expression upon PRV infection. Among these, five were analyzed by stem-loop RT-qPCR, which confirmed the above data. Interestingly, these viral miRNAs were mainly found in the large latency transcript region of PRV, and were predicted to target a variety of genes, forming a complicated regulatory network. Moreover, ten cellular miRNAs were expressed differently upon PRV infection, including nine upregulated miRNAs and one downregulated miRNA. Host targets of these miRNAs obtained by bioinformatics analysis belonged to large signaling networks, mainly encompassing the calcium signaling pathway, cAMP signaling pathway, MAPK signaling pathway, and other nervous-associated pathways. These findings further highlight miRNA features in nervous cells after PRV infection and contribute to unveiling the underlying mechanisms of neurotropism as well as the neuropathogenesis of PRV.
Núñez-Hernández, Fernando; Pérez, Lester J; Vera, Gonzalo; Córdoba, Sarai; Segalés, Joaquim; Sánchez, Armand; Núñez, José I
2015-05-01
Porcine circovirus type 2 (PCV2) is a ssDNA virus causing PCV2-systemic disease (PCV2-SD), one of the most important diseases in swine. MicroRNAs (miRNAs) are a new class of small non-coding RNAs that regulate gene expression post-transcriptionally. Viral miRNAs have recently been described and the number of viral miRNAs has been increasing in the past few years. In this study, small RNA libraries were constructed from two tissues of subclinically PCV2 infected pigs to explore if PCV2 can encode viral miRNAs. The deep sequencing data revealed that PCV2 does not express miRNAs in an in vivo subclinical infection.
Romanenko, Lyudmila A; Tanaka, Naoto; Svetashev, Vassilii I; Kalinovskaya, Natalia I
2013-04-01
A novel bacterial strain Sl 79(T) was isolated from a deep surface sediment sample obtained from the Sea of Japan and investigated by phenotypic and molecular methods. The bacterium Sl 79(T) was Gram-positive, facultatively anaerobic, spore-forming, motile and able to form two different types of colonies. It contained the major menaquinone MK-7 and anteiso-C(15:0) followed by iso-C(15:0) as predominant fatty acids. Phylogenetic analysis based on 16S rRNA gene sequences revealed that strain Sl 79(T) belonged to the genus Paenibacillus, where it clustered with Paenibacillus apiarius NRRL NRS-1438(T) with a sequence similarity of 97.7 % and shared sequence similarities below 96.7 % with other validly named Paenibacillus species. Strain Sl 79(T) was found to possess a remarkable inhibitory activity against indicator microorganisms. On the basis of combined spectral analyses, strain Paenibacillus sp. Sl 79(T) was established to produce isocoumarin and novel peptide antibiotics. On the basis of the DNA-DNA relatedness, phenotypic and phylogenetic data obtained, it was concluded that strain Sl 79(T) represents a novel species, Paenibacillus profundus sp. nov., with the type strain Sl 79(T) = KMM 9420(T) = NRIC 0885(T).
Characterization by Deep Sequencing of Prunus virus T, a Novel Tepovirus Infecting Prunus Species.
Marais, Armelle; Faure, Chantal; Mustafayev, Eldar; Barone, Maria; Alioto, Daniela; Candresse, Thierry
2015-01-01
Double-stranded RNAs purified from a cherry tree collected in Italy and a plum tree collected in Azerbaijan were submitted to deep sequencing. Contigs showing weak but significant identity with various members of the family Betaflexiviridae were reconstructed. Sequence comparisons led to the conclusion that the viral isolates identified in the analyzed Prunus plants belong to the same viral species. Their genome organization is similar to that of some members of the family Betaflexiviridae, with three overlapping open reading frames (RNA polymerase, movement protein, and capsid protein). Phylogenetic analyses of the deduced encoded proteins showed a clustering with the sole member of the genus Tepovirus, Potato virus T (PVT). Given these results, the name Prunus virus T (PrVT) is proposed for the new virus. It should be considered as a new member of the genus Tepovirus, even if the level of nucleotide identity with PVT is borderline with the genus demarcation criteria for the family Betaflexiviridae. A reverse-transcription polymerase chain reaction detection assay was developed and allowed the identification of two other PrVT isolates and an estimate of 1% prevalence in the large Prunus collection screened. Due to the mixed infection status of all hosts identified to date, it was not possible to correlate the presence of PrVT with specific symptoms.
López-Carrasco, Amparo; Ballesteros, Cristina; Sentandreu, Vicente; Delgado, Sonia; Gago-Zachert, Selma; Flores, Ricardo; Sanjuán, Rafael
2017-09-01
Mutation rates vary by orders of magnitude across biological systems, being higher for simpler genomes. The simplest known genomes correspond to viroids, subviral plant replicons constituted by circular non-coding RNAs of few hundred bases. Previous work has revealed an extremely high mutation rate for chrysanthemum chlorotic mottle viroid, a chloroplast-replicating viroid. However, whether this is a general feature of viroids remains unclear. Here, we have used high-fidelity ultra-deep sequencing to determine the mutation rate in a common host (eggplant) of two viroids, each representative of one family: the chloroplastic eggplant latent viroid (ELVd, Avsunviroidae) and the nuclear potato spindle tuber viroid (PSTVd, Pospiviroidae). This revealed higher mutation frequencies in ELVd than in PSTVd, as well as marked differences in the types of mutations produced. Rates of spontaneous mutation, quantified in vivo using the lethal mutation method, ranged from 1/1000 to 1/800 for ELVd and from 1/7000 to 1/3800 for PSTVd depending on sequencing run. These results suggest that extremely high mutability is a common feature of chloroplastic viroids, whereas the mutation rates of PSTVd and potentially other nuclear viroids appear significantly lower and closer to those of some RNA viruses.
Ballesteros, Cristina; Sentandreu, Vicente; Gago-Zachert, Selma
2017-01-01
Mutation rates vary by orders of magnitude across biological systems, being higher for simpler genomes. The simplest known genomes correspond to viroids, subviral plant replicons constituted by circular non-coding RNAs of few hundred bases. Previous work has revealed an extremely high mutation rate for chrysanthemum chlorotic mottle viroid, a chloroplast-replicating viroid. However, whether this is a general feature of viroids remains unclear. Here, we have used high-fidelity ultra-deep sequencing to determine the mutation rate in a common host (eggplant) of two viroids, each representative of one family: the chloroplastic eggplant latent viroid (ELVd, Avsunviroidae) and the nuclear potato spindle tuber viroid (PSTVd, Pospiviroidae). This revealed higher mutation frequencies in ELVd than in PSTVd, as well as marked differences in the types of mutations produced. Rates of spontaneous mutation, quantified in vivo using the lethal mutation method, ranged from 1/1000 to 1/800 for ELVd and from 1/7000 to 1/3800 for PSTVd depending on sequencing run. These results suggest that extremely high mutability is a common feature of chloroplastic viroids, whereas the mutation rates of PSTVd and potentially other nuclear viroids appear significantly lower and closer to those of some RNA viruses. PMID:28910391
Qiao, Wenjie; Zarzyńska-Nowak, Aleksandra; Nerva, Luca; Kuo, Yen-Wen; Falk, Bryce W
2018-04-28
RNA silencing is a conserved antiviral defense mechanism that has been used to develop robust resistance against plant virus infections. Previous efforts have been made to develop RNA silencing-mediated resistance to criniviruses, yet none have given immunity. In this study, transgenic Nicotiana benthamiana plants harboring a hairpin construct of the Lettuce infectious yellows virus (LIYV) RdRp sequence exhibited immunity to systemic LIYV infection. Deep-sequencing analysis was performed to characterize virus-derived siRNAs (vsiRNAs) generated upon systemic LIYV infection in non-transgenic N. benthamiana plants as well as transgene-derived siRNAs (t-siRNAs) derived from the immune transgenic plants before and after LIYV inoculation. Interestingly, a similar sequence distribution pattern was obtained with t-siRNAs and vsiRNAs mapped to the transgene region in both immune and susceptible plants except a significant increase of t-siRNAs of 24 nt in length, which was consistent with small RNA northern blot results that showed the abundance of t-siRNAs of 21-, 22-, and 24- nt in length. The accumulated 24-nt sequences haven't yet been reported in transgenic plants partially resistant to criniviruses, thus may indicate their correlation with crinivirus immunity. To further test this hypothesis, we developed transgenic melon (Cucumis melo) plants immune to systemic infection of another crinivirus, Cucurbit yellow stunting disorder virus (CYSDV). As predicted, the accumulation of 24-nt t-siRNAs was detected in transgenic melon plants by northern blot. Together with our findings and previous studies on crinivirus resistance, we propose that the accumulation of 24 nt t-siRNAs is associated with crinivirus immunity in transgenic plants.
Tian, Bin; Wang, Shichen; Todd, Timothy C; Johnson, Charles D; Tang, Guiliang; Trick, Harold N
2017-08-02
The soybean cyst nematode (SCN), Heterodera glycines, is one of the most devastating diseases limiting soybean production worldwide. It is known that small RNAs, including microRNAs (miRNAs) and small interfering RNAs (siRNAs), play important roles in regulating plant growth and development, defense against pathogens, and responses to environmental changes. In order to understand the role of soybean miRNAs during SCN infection, we analyzed 24 small RNA libraries including three biological replicates from two soybean cultivars (SCN susceptible KS4607, and SCN HG Type 7 resistant KS4313N) that were grown under SCN-infested and -noninfested soil at two different time points (SCN feeding establishment and egg production). In total, 537 known and 70 putative novel miRNAs in soybean were identified from a total of 0.3 billion reads (average about 13.5 million reads for each sample) with the programs of Bowtie and miRDeep2 mapper. Differential expression analyses were carried out using edgeR to identify miRNAs involved in the soybean-SCN interaction. Comparative analysis of miRNA profiling indicated a total of 60 miRNAs belonging to 25 families that might be specifically related to cultivar responses to SCN. Quantitative RT-PCR validated similar miRNA interaction patterns as sequencing results. These findings suggest that miRNAs are likely to play key roles in soybean response to SCN. The present work could provide a framework for miRNA functional identification and the development of novel approaches for improving soybean SCN resistance in future studies.
2012-01-01
Background: microRNAs (miRNAs) have been found to play an essential role in the modulation of numerous biological processes in eukaryotes. Chlamydomonas reinhardtii is an ideal model organism for the study of many metabolic processes including responses to sulfur-deprivation. We used a deep sequencing platform to extensively profile and identify changes in miRNA expression that occurred under sulfur-replete and sulfur-deprived conditions. The aim of our research was to characterize the differential expression of Chlamydomonas miRNAs under sulfur-deprived conditions, and subsequently, the target genes of miRNAs involved in sulfur-deprivation were further predicted and analyzed. Results: By using high-throughput sequencing, we characterized the microRNA transcriptomes under sulfur-replete and sulfur-deprived conditions in Chlamydomonas reinhardtii. We predicted a total of 310 miRNAs, which included 85 known miRNAs and 225 novel miRNAs. Thirteen miRNAs were specific to the sulfur-deprived conditions. Forty-seven miRNAs showed significantly differential expression in response to sulfur-deprivation, and most were up-regulated in the small RNA libraries with sulfur-deprivation. Using a web-based integrated system (Web MicroRNAs Designer 3) and combining the former information from a transcriptome of Chlamydomonas reinhardtii, 22 miRNAs and their targets involved in metabolism regulation with sulfur-deprivation were verified. Conclusions: Our results indicate that sulfur-deprivation may have a significant influence on small RNA expression patterns, and the differential expression of miRNAs and interactions between miRNAs and their targets might further reveal the molecular mechanism responding to sulfur-deprivation in Chlamydomonas reinhardtii. PMID:22439676
DOE Office of Scientific and Technical Information (OSTI.GOV)
MacMillan, Colleen P.; Birke, Hannah; Chuah, Aaron; ...
2017-07-18
Knowledge of plant secondary cell wall (SCW) regulation and deposition is mainly based on the Arabidopsis model of a ‘typical’ lignocellulosic SCW. However, SCWs in other plants can vary from this. The SCW of mature cotton seed fibres is highly cellulosic and lacks lignification whereas xylem SCWs are lignocellulosic. We used cotton as a model to study different SCWs and the expression of the genes involved in their formation via RNA deep sequencing and chemical analysis of stem and seed fibre.
Chen, Mu-Xin; Ai, Lin; Xu, Min-Jun; Zhang, Ren-Li; Chen, Shao-Hong; Zhang, Yong-Nian; Guo, Jian; Cai, Yu-Chun; Tian, Li-Guang; Zhang, Ling-Ling; Zhu, Xing-Quan; Chen, Jia-Xu
2011-06-01
Angiostrongylus cantonensis causes eosinophilic meningitis and eosinophilic pleocytosis in humans and is of significant socio-economic importance globally. microRNAs (miRNAs) are endogenous small non-coding RNAs that play crucial roles in gene expression regulation, cellular function and defense, homeostasis and pathogenesis. They have been identified in a diverse range of organisms. The objective of this study was to determine and characterize miRNAs of female and male adults of A. cantonensis by Solexa deep sequencing. A total of 8,861,260 and 10,957,957 high quality reads with 20 and 23 conserved miRNAs were obtained in females and males, respectively. No new miRNA sequence was found. Nucleotide bias analysis showed that uracil was the prominent nucleotide, particularly at positions of 1, 10, 14, 17 and 22, approximately at the beginning, middle and the end of the conserved miRNAs. To our knowledge, this is the first report of miRNA profiles in A. cantonensis, which may represent a new platform for studying regulation of genes and their networks in A. cantonensis.
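The nucleotide bias analysis mentioned in the abstract above can be illustrated with a minimal sketch. This is not the authors' pipeline; the function name `positional_base_frequencies` and the toy sequences are assumptions made only for illustration. It tabulates, for each 1-based position, the fraction of sequences carrying each base.

```python
from collections import Counter

def positional_base_frequencies(sequences):
    """Return {position (1-based): {base: fraction}}.

    Only sequences long enough to cover a position contribute to it.
    """
    max_len = max(len(s) for s in sequences)
    table = {}
    for pos in range(max_len):
        counts = Counter(s[pos] for s in sequences if len(s) > pos)
        total = sum(counts.values())
        table[pos + 1] = {base: n / total for base, n in counts.items()}
    return table

# Made-up toy sequences (hypothetical, not real miRNAs from the study).
toy_sequences = ["UGAC", "UAAG", "UCGU"]
freqs = positional_base_frequencies(toy_sequences)

# Report the most frequent base at each position.
for position, dist in freqs.items():
    base = max(dist, key=dist.get)
    print(position, base, round(dist[base], 2))
```

In this toy input every sequence starts with U, so position 1 is entirely uracil, which is the kind of positional bias the abstract reports.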
Complete genome sequence of Brachybacterium faecium type strain (Schefferle 6-10T)
DOE Office of Scientific and Technical Information (OSTI.GOV)
Lapidus, Alla; Pukall, Rudiger; LaButti, Kurt
2009-05-20
Brachybacterium faecium Collins et al. 1988 is the type species of the genus, and is of phylogenetic interest because of its location in the Dermabacteraceae, a rather isolated family within the actinobacterial suborder Micrococcineae. B. faecium is known for its rod-coccus growth cycle and the ability to degrade uric acid. It grows aerobically or weakly anaerobically. The strain described in this report is a free-living, nonmotile, Gram-positive bacterium, originally isolated from poultry deep litter. Here we describe the features of this organism, together with the complete genome sequence, and annotation. This is the first complete genome sequence of a member of the actinobacterial family Dermabacteraceae, and the 3,614,992 bp long single replicon genome with its 3129 protein-coding and 69 RNA genes is part of the Genomic Encyclopedia of Bacteria and Archaea project.
Prediction of novel pre-microRNAs with high accuracy through boosting and SVM.
Zhang, Yuanwei; Yang, Yifan; Zhang, Huan; Jiang, Xiaohua; Xu, Bo; Xue, Yu; Cao, Yunxia; Zhai, Qian; Zhai, Yong; Xu, Mingqing; Cooke, Howard J; Shi, Qinghua
2011-05-15
High-throughput deep-sequencing technology has generated an unprecedented number of expressed short sequence reads, presenting not only an opportunity but also a challenge for prediction of novel microRNAs. To verify the existence of candidate microRNAs, we have to show that these short sequences can be processed from candidate pre-microRNAs. However, it is laborious and time consuming to verify these using existing experimental techniques. Therefore, here, we describe a new method, miRD, which is constructed using two feature selection strategies based on support vector machines (SVMs) and boosting method. It is a high-efficiency tool for novel pre-microRNA prediction with accuracy up to 94.0% among different species. miRD is implemented in PHP/PERL+MySQL+R and can be freely accessed at http://mcg.ustc.edu.cn/rpg/mird/mird.php.
Comparative Single-Cell Genomics of Chloroflexi from the Okinawa Trough Deep-Subsurface Biosphere.
Fullerton, Heather; Moyer, Craig L
2016-05-15
Chloroflexi small-subunit (SSU) rRNA gene sequences are frequently recovered from subseafloor environments, but the metabolic potential of the phylum is poorly understood. The phylum Chloroflexi is represented by isolates with diverse metabolic strategies, including anoxic phototrophy, fermentation, and reductive dehalogenation; therefore, function cannot be attributed to these organisms based solely on phylogeny. Single-cell genomics can provide metabolic insights into uncultured organisms, like the deep-subsurface Chloroflexi. Nine SSU rRNA gene sequences were identified from single-cell sorts of whole-round core material collected from the Okinawa Trough at Iheya North hydrothermal field as part of Integrated Ocean Drilling Program (IODP) expedition 331 (Deep Hot Biosphere). Previous studies of subsurface Chloroflexi single amplified genomes (SAGs) suggested heterotrophic or lithotrophic metabolisms and provided no evidence for growth by reductive dehalogenation. Our nine Chloroflexi SAGs (seven of which are from the order Anaerolineales) indicate that, in addition to genes for the Wood-Ljungdahl pathway, exogenous carbon sources can be actively transported into cells. At least one subunit for pyruvate ferredoxin oxidoreductase was found in four of the Chloroflexi SAGs. This protein can provide a link between the Wood-Ljungdahl pathway and other carbon anabolic pathways. Finally, one of the seven Anaerolineales SAGs contains a distinct reductive dehalogenase homologous (rdhA) gene. Through the use of single amplified genomes (SAGs), we have extended the metabolic potential of an understudied group of subsurface microbes, the Chloroflexi. These microbes are frequently detected in the subsurface biosphere, though their metabolic capabilities have remained elusive.
In contrast to previously examined Chloroflexi SAGs, our genomes (several are from the order Anaerolineales) were recovered from a hydrothermally driven system and therefore provide a unique window into the metabolic potential of this type of habitat. In addition, a reductive dehalogenase gene (rdhA) has been directly linked to marine subsurface Chloroflexi, suggesting that reductive dehalogenation is not limited to the class Dehalococcoidia. This discovery expands the nutrient-cycling and metabolic potential present within the deep subsurface and provides functional gene information relating to this enigmatic group.
Evolution of coding and non-coding genes in HOX clusters of a marsupial.
Yu, Hongshi; Lindsay, James; Feng, Zhi-Ping; Frankenberg, Stephen; Hu, Yanqiu; Carone, Dawn; Shaw, Geoff; Pask, Andrew J; O'Neill, Rachel; Papenfuss, Anthony T; Renfree, Marilyn B
2012-06-18
The HOX gene clusters are thought to be highly conserved amongst mammals and other vertebrates, but the long non-coding RNAs have only been studied in detail in human and mouse. The sequencing of the kangaroo genome provides an opportunity to use comparative analyses to compare the HOX clusters of a mammal with a distinct body plan to those of other mammals. Here we report a comparative analysis of HOX gene clusters between an Australian marsupial of the kangaroo family and the eutherians. There was a strikingly high level of conservation of HOX gene sequence and structure and non-protein coding genes including the microRNAs miR-196a, miR-196b, miR-10a and miR-10b and the long non-coding RNAs HOTAIR, HOTAIRM1 and HOXA11AS that play critical roles in regulating gene expression and controlling development. By microRNA deep sequencing and comparative genomic analyses, two conserved microRNAs (miR-10a and miR-10b) were identified and one new candidate microRNA with typical hairpin precursor structure that is expressed in both fibroblasts and testes was found. The prediction of microRNA target analysis showed that several known microRNA targets, such as miR-10, miR-414 and miR-464, were found in the tammar HOX clusters. In addition, several novel and putative miRNAs were identified that originated from elsewhere in the tammar genome and that target the tammar HOXB and HOXD clusters. This study confirms that the emergence of known long non-coding RNAs in the HOX clusters clearly predates the marsupial-eutherian divergence 160 Ma ago. It also identified a new potentially functional microRNA as well as conserved miRNAs. These non-coding RNAs may participate in the regulation of HOX genes to influence the body plan of this marsupial.
Evolution of coding and non-coding genes in HOX clusters of a marsupial
2012-01-01
Background: The HOX gene clusters are thought to be highly conserved amongst mammals and other vertebrates, but the long non-coding RNAs have only been studied in detail in human and mouse. The sequencing of the kangaroo genome provides an opportunity to use comparative analyses to compare the HOX clusters of a mammal with a distinct body plan to those of other mammals. Results: Here we report a comparative analysis of HOX gene clusters between an Australian marsupial of the kangaroo family and the eutherians. There was a strikingly high level of conservation of HOX gene sequence and structure and non-protein coding genes including the microRNAs miR-196a, miR-196b, miR-10a and miR-10b and the long non-coding RNAs HOTAIR, HOTAIRM1 and HOXA11AS that play critical roles in regulating gene expression and controlling development. By microRNA deep sequencing and comparative genomic analyses, two conserved microRNAs (miR-10a and miR-10b) were identified and one new candidate microRNA with typical hairpin precursor structure that is expressed in both fibroblasts and testes was found. The prediction of microRNA target analysis showed that several known microRNA targets, such as miR-10, miR-414 and miR-464, were found in the tammar HOX clusters. In addition, several novel and putative miRNAs were identified that originated from elsewhere in the tammar genome and that target the tammar HOXB and HOXD clusters. Conclusions: This study confirms that the emergence of known long non-coding RNAs in the HOX clusters clearly predates the marsupial-eutherian divergence 160 Ma ago. It also identified a new potentially functional microRNA as well as conserved miRNAs. These non-coding RNAs may participate in the regulation of HOX genes to influence the body plan of this marsupial. PMID:22708672
The double-stranded transcriptome of Escherichia coli.
Lybecker, Meghan; Zimmermann, Bob; Bilusic, Ivana; Tukhtubaeva, Nadezda; Schroeder, Renée
2014-02-25
Advances in high-throughput transcriptome analyses have revealed hundreds of antisense RNAs (asRNAs) for many bacteria, although few have been characterized, and the number of functional asRNAs remains unknown. We have developed a genome-wide high-throughput method to identify functional asRNAs in vivo. Most mechanisms of gene regulation via asRNAs require an RNA-RNA interaction with its target RNA, and we hypothesized that a functional asRNA would be found in a double strand (dsRNA), duplexed with its cognate RNA in a single cell. We developed a method of isolating dsRNAs from total RNA by immunoprecipitation with a ds-RNA specific antibody. Total RNA and immunoprecipitated dsRNA from Escherichia coli RNase III WT and mutant strains were deep-sequenced. A statistical model was applied to filter for biologically relevant dsRNA regions, which were subsequently categorized by location relative to annotated genes. A total of 316 potentially functional asRNAs were identified in the RNase III mutant strain and are encoded primarily opposite to the 5' ends of transcripts, but are also found opposite ncRNAs, gene junctions, and the 3' ends. A total of 21 sense/antisense RNA pairs identified in dsRNAs were confirmed by Northern blot analyses. Most of the RNA steady-state levels were higher or detectable only in the RNase III mutant strain. Taken together, our data indicate that a significant amount of dsRNA is formed in the cell, that RNase III degrades or processes these dsRNAs, and that dsRNA plays a major role in gene regulation in E. coli.
Samuels, Amy K; Weisrock, David W; Smith, Jeramiah J; France, Katherine J; Walker, John A; Putta, Srikrishna; Voss, S Randal
2005-04-11
We report on a study that extended mitochondrial transcript information from a recent EST project to obtain complete mitochondrial genome sequence for 5 tiger salamander complex species (Ambystoma mexicanum, A. t. tigrinum, A. andersoni, A. californiense, and A. dumerilii). We describe, for the first time, aspects of mitochondrial transcription in a representative amphibian, and then use complete mitochondrial sequence data to examine salamander phylogeny at both deep and shallow levels of evolutionary divergence. The available mitochondrial ESTs for A. mexicanum (N=2481) and A. t. tigrinum (N=1205) provided 92% and 87% coverage of the mitochondrial genome, respectively. Complete mitochondrial sequences for all species were rapidly obtained by using long distance PCR and DNA sequencing. A number of genome structural characteristics (base pair length, base composition, gene number, gene boundaries, codon usage) were highly similar among all species and to other distantly related salamanders. Overall, mitochondrial transcription in Ambystoma approximated the pattern observed in other vertebrates. We inferred from the mapping of ESTs onto mtDNA that transcription occurs from both heavy and light strand promoters and continues around the entire length of the mtDNA, followed by post-transcriptional processing. However, the observation of many short transcripts corresponding to rRNA genes indicates that transcription may often terminate prematurely to bias transcription of rRNA genes; indeed an rRNA transcription termination signal sequence was observed immediately following the 16S rRNA gene. Phylogenetic analyses of salamander family relationships consistently grouped Ambystomatidae in a clade containing Cryptobranchidae and Hynobiidae, to the exclusion of Salamandridae. This robust result suggests a novel alternative hypothesis because previous studies have consistently identified Ambystomatidae and Salamandridae as closely related taxa. 
Phylogenetic analyses of tiger salamander complex species also produced robustly supported trees. The D-loop, used in previous molecular phylogenetic studies of the complex, was found to contain a relatively low level of variation and we identified mitochondrial regions with higher rates of molecular evolution that are more useful in resolving relationships among species. Our results show the benefit of using complete genome mitochondrial information in studies of recently and rapidly diverged taxa.
Li, Yunfeng; Zhou, Zunchun; Tian, Meilin; Tian, Yi; Dong, Ying; Li, Shilei; Liu, Weidong; He, Chongbo
2017-08-01
In this study, single nucleotide polymorphisms (SNPs), microsatellites (SSRs) and differentially expressed genes (DEGs) in the oral parts, gonads, and umbrella parts of the jellyfish Rhopilema esculentum were analyzed by RNA-Seq technology. A total of 76.4 million raw reads and 72.1 million clean reads were generated from deep sequencing. Approximately 119,874 tentative unigenes and 149,239 transcripts were obtained. A total of 1,034,708 SNP markers were detected in the three tissues. For microsatellite mining, 5088 SSRs were identified from the unigene sequences. The most frequent repeat motifs were mononucleotide repeats, which accounted for 61.93%. Transcriptome comparison of the three tissues yielded a total of 8841 DEGs, of which 3560 were up-regulated and 5281 were down-regulated. This study represents the greatest sequencing effort carried out for a jellyfish and provides the first high-throughput transcriptomic resource for jellyfish.
GWIPS-viz: development of a ribo-seq genome browser
Michel, Audrey M.; Fox, Gearoid; M. Kiran, Anmol; De Bo, Christof; O’Connor, Patrick B. F.; Heaphy, Stephen M.; Mullan, James P. A.; Donohue, Claire A.; Higgins, Desmond G.; Baranov, Pavel V.
2014-01-01
We describe the development of GWIPS-viz (http://gwips.ucc.ie), an online genome browser for viewing ribosome profiling data. Ribosome profiling (ribo-seq) is a recently developed technique that provides genome-wide information on protein synthesis (GWIPS) in vivo. It is based on the deep sequencing of ribosome-protected messenger RNA (mRNA) fragments, which allows the ribosome density along all mRNA transcripts present in the cell to be quantified. Since its inception, ribo-seq has been carried out in a number of eukaryotic and prokaryotic organisms. Owing to the increasing interest in ribo-seq, there is a pertinent demand for a dedicated ribo-seq genome browser. GWIPS-viz is based on The University of California Santa Cruz (UCSC) Genome Browser. Ribo-seq tracks, coupled with mRNA-seq tracks, are currently available for several genomes: human, mouse, zebrafish, nematode, yeast, bacteria (Escherichia coli K12, Bacillus subtilis), human cytomegalovirus and bacteriophage lambda. Our objective is to continue incorporating published ribo-seq data sets so that the wider community can readily view ribosome profiling information from multiple studies without the need to carry out computational processing. PMID:24185699
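As an illustration of what "ribosome density along a transcript" means computationally, the sketch below counts how many footprints cover each nucleotide of a single transcript. It is not GWIPS-viz code; the function name, the half-open transcript coordinates, and the very short toy footprints are assumptions chosen to keep the example readable.

```python
from typing import List, Tuple


def footprint_density(footprints: List[Tuple[int, int]], transcript_length: int) -> List[int]:
    """Per-nucleotide count of footprints covering each position.

    Footprints are half-open (start, end) pairs with
    0 <= start < end <= transcript_length.
    """
    # Difference array: +1 where a footprint starts, -1 just past where it ends.
    diff = [0] * (transcript_length + 1)
    for start, end in footprints:
        diff[start] += 1
        diff[end] -= 1

    density: List[int] = []
    running = 0
    for delta in diff[:transcript_length]:
        running += delta
        density.append(running)
    return density


# Hypothetical 3-nt footprints on a 6-nt toy transcript.
toy_footprints = [(0, 3), (1, 4), (2, 5)]
print(footprint_density(toy_footprints, 6))  # [1, 2, 3, 2, 1, 0]
```

The difference-array approach keeps the cost linear in the number of footprints plus the transcript length, which is one reasonable choice when building browser-style coverage tracks.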
Pontvianne, Frédéric; Carpentier, Marie-Christine; Durut, Nathalie; Pavlištová, Veronika; Jaške, Karin; Schořová, Šárka; Parrinello, Hugues; Rohmer, Marine; Pikaard, Craig S; Fojtová, Miloslava; Fajkus, Jiří; Sáez-Vásquez, Julio
2016-08-09
The nucleolus is the site of rRNA gene transcription, rRNA processing, and ribosome biogenesis. However, the nucleolus also plays additional roles in the cell. We isolated nucleoli using fluorescence-activated cell sorting (FACS) and identified nucleolus-associated chromatin domains (NADs) by deep sequencing, comparing wild-type plants and null mutants for the nucleolar protein NUCLEOLIN 1 (NUC1). NADs are primarily genomic regions with heterochromatic signatures and include transposable elements (TEs), sub-telomeric regions, and mostly inactive protein-coding genes. However, NADs also include active rRNA genes and the entire short arm of chromosome 4 adjacent to them. In nuc1 null mutants, which alter rRNA gene expression and overall nucleolar structure, NADs are altered, telomere association with the nucleolus is decreased, and telomeres become shorter. Collectively, our studies reveal roles for NUC1 and the nucleolus in the spatial organization of chromosomes as well as telomere maintenance.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Mason, Olivia U.; Hazen, Terry C.; Borglin, Sharon
The Deepwater Horizon oil spill in the Gulf of Mexico resulted in a deep-sea hydrocarbon plume that caused a shift in the indigenous microbial community composition with unknown ecological consequences. Early in the spill history, a bloom of uncultured, thus uncharacterized, members of the Oceanospirillales was previously detected, but their role in oil disposition was unknown. Here our aim was to determine the functional role of the Oceanospirillales and other active members of the indigenous microbial community using deep sequencing of community DNA and RNA, as well as single-cell genomics. Shotgun metagenomic and metatranscriptomic sequencing revealed that genes for motility, chemotaxis and aliphatic hydrocarbon degradation were significantly enriched and expressed in the hydrocarbon plume samples compared with uncontaminated seawater collected from plume depth. In contrast, although genes coding for degradation of more recalcitrant compounds, such as benzene, toluene, ethylbenzene, total xylenes and polycyclic aromatic hydrocarbons, were identified in the metagenomes, they were expressed at low levels, or not at all based on analysis of the metatranscriptomes. Isolation and sequencing of two Oceanospirillales single cells revealed that both cells possessed genes coding for n-alkane and cycloalkane degradation. Specifically, the near-complete pathway for cyclohexane oxidation in the Oceanospirillales single cells was elucidated and supported by both metagenome and metatranscriptome data. The draft genome also included genes for chemotaxis, motility and nutrient acquisition strategies that were also identified in the metagenomes and metatranscriptomes. These data point towards a rapid response of members of the Oceanospirillales to aliphatic hydrocarbons in the deep sea.
Zcchc11 Uridylates Mature miRNAs to Enhance Neonatal IGF-1 Expression, Growth, and Survival
Kozlowski, Elyse; Matsuura, Kori Y.; Ferrari, Joseph D.; Morris, Samantha A.; Powers, John T.; Daley, George Q.; Quinton, Lee J.; Mizgerd, Joseph P.
2012-01-01
The Zcchc11 enzyme is implicated in microRNA (miRNA) regulation. It can uridylate let-7 precursors to decrease quantities of the mature miRNA in embryonic stem cell lines, suggested to mediate stem cell maintenance. It can uridylate mature miR-26 to relieve silencing activity without impacting miRNA content in cancer cell lines, suggested to mediate cytokine and growth factor expression. Broader roles of Zcchc11 in shaping or remodeling the miRNome or in directing biological or physiological processes remain entirely speculative. We generated Zcchc11-deficient mice to address these knowledge gaps. Zcchc11 deficiency had no impact on embryogenesis or fetal development, but it significantly decreased survival and growth immediately following birth, indicating a role for this enzyme in early postnatal fitness. Deep sequencing of small RNAs from neonatal livers revealed roles of this enzyme in miRNA sequence diversity. Zcchc11 deficiency diminished the lengths and terminal uridine frequencies for diverse mature miRNAs, but it had no influence on the quantities of any miRNAs. The expression of IGF-1, a liver-derived protein essential to early growth and survival, was enhanced by Zcchc11 expression in vitro, and miRNA silencing of IGF-1 was alleviated by uridylation events observed to be Zcchc11-dependent in the neonatal liver. In neonatal mice, Zcchc11 deficiency significantly decreased IGF-1 mRNA in the liver and IGF-1 protein in the blood. We conclude that the Zcchc11-mediated terminal uridylation of mature miRNAs is pervasive and physiologically significant, especially important in the neonatal period for fostering IGF-1 expression and enhancing postnatal growth and survival. We propose that the miRNA 3′ terminus is a regulatory node upon which multiple enzymes converge to direct silencing activity and tune gene expression. PMID:23209448
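The two read-level quantities reported above, miRNA length and terminal uridine frequency, can be summarized from small-RNA reads with a few lines of code. The sketch below is an illustration under stated assumptions, not the study's analysis: the function name and the toy reads are hypothetical, T is treated as U because sequencers report cDNA, and a read ending in U is counted regardless of whether that U is genome-templated, so a real analysis would first compare each read against the reference miRNA sequence.

```python
from typing import Dict, List


def terminal_u_stats(reads: List[str]) -> Dict[str, float]:
    """Mean read length and fraction of reads whose 3' terminal base is U."""
    if not reads:
        raise ValueError("no reads supplied")
    # Normalize case and treat cDNA 'T' as RNA 'U'.
    normalized = [r.upper().replace("T", "U") for r in reads]
    ends_in_u = sum(1 for r in normalized if r.endswith("U"))
    mean_length = sum(len(r) for r in normalized) / len(normalized)
    return {
        "mean_length": mean_length,
        "terminal_u_fraction": ends_in_u / len(normalized),
    }


# Hypothetical toy reads for one miRNA, not data from the study.
wild_type = ["ACGUACGUU", "ACGUACGU", "ACGUACGUUU", "ACGUACGA"]
deficient = ["ACGUACGU", "ACGUACGA", "ACGUACGA", "ACGUACGU"]

print(terminal_u_stats(wild_type))
print(terminal_u_stats(deficient))
```

In this toy data the "deficient" set has both a shorter mean length and a lower terminal-U fraction, mirroring the direction of the effect described in the abstract.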
Root of the universal tree of life based on ancient aminoacyl-tRNA synthetase gene duplications.
Brown, J R; Doolittle, W F
1995-03-28
Universal trees based on sequences of single gene homologs cannot be rooted. Iwabe et al. [Iwabe, N., Kuma, K.-I., Hasegawa, M., Osawa, S. & Miyata, T. (1989) Proc. Natl. Acad. Sci. USA 86, 9355-9359] circumvented this problem by using ancient gene duplications that predated the last common ancestor of all living things. Their separate, reciprocally rooted gene trees for elongation factors and ATPase subunits showed Bacteria (eubacteria) as branching first from the universal tree with Archaea (archaebacteria) and Eucarya (eukaryotes) as sister groups. Given its topical importance to evolutionary biology and concerns about the appropriateness of the ATPase data set, an evaluation of the universal tree root using other ancient gene duplications is essential. In this study, we derive a rooting for the universal tree using aminoacyl-tRNA synthetase genes, an extensive multigene family whose divergence likely preceded that of prokaryotes and eukaryotes. An approximately 1600-bp conserved region was sequenced from the isoleucyl-tRNA synthetases of several species representing deep evolutionary branches of eukaryotes (Nosema locustae), Bacteria (Aquifex pyrophilus and Thermotoga maritima) and Archaea (Pyrococcus furiosus and Sulfolobus acidocaldarius). In addition, a new valyl-tRNA synthetase was characterized from the protist Trichomonas vaginalis. Different phylogenetic methods were used to generate trees of isoleucyl-tRNA synthetases rooted by valyl- and leucyl-tRNA synthetases. All isoleucyl-tRNA synthetase trees showed Archaea and Eucarya as sister groups, providing strong confirmation for the universal tree rooting reported by Iwabe et al. As well, there was strong support for the monophyly (sensu Hennig) of Archaea. The valyl-tRNA synthetase gene from Tr. vaginalis clustered with other eukaryotic ValRS genes, which may have been transferred from the mitochondrial genome to the nuclear genome, suggesting that this amitochondrial trichomonad once harbored an endosymbiotic bacterium.
Intronic splicing mutations in PTCH1 cause Gorlin syndrome.
Bholah, Zaynab; Smith, Miriam J; Byers, Helen J; Miles, Emma K; Evans, D Gareth; Newman, William G
2014-09-01
Gorlin syndrome is an autosomal dominant disorder characterized by multiple early-onset basal cell carcinoma, odontogenic keratocysts and skeletal abnormalities. It is caused by heterozygous mutations in the tumour suppressor PTCH1. Routine clinical genetic testing, by Sanger sequencing and multiplex ligation-dependent probe amplification (MLPA) to confirm a clinical diagnosis of Gorlin syndrome, identifies a mutation in 60-90% of cases. We undertook RNA analysis on lymphocytes from ten individuals diagnosed with Gorlin syndrome, but without known PTCH1 mutations by exonic sequencing or MLPA. Two altered PTCH1 transcripts were identified. Genomic DNA sequence analysis identified an intron 7 mutation c.1068-10T>A, which created a strong cryptic splice acceptor site, leading to an intronic insertion of eight bases; this is predicted to create a frameshift p.(His358Alafs*12). Secondly, a deep intronic mutation c.2561-2057A>G caused an in-frame insertion of 78 intronic bases in the cDNA transcript, leading to a premature stop codon p.(Gly854fs*3). The mutations are predicted to cause loss of function of PTCH1, consistent with its tumour suppressor function. The findings indicate the importance of RNA analysis to detect intronic mutations in PTCH1 not identified by routine screening techniques.
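The difference between the two insertions comes down to whether the number of inserted bases is a multiple of three. The short sketch below makes that arithmetic explicit; the function name is hypothetical, and it only classifies the reading-frame effect of an insertion by length, without modelling the inserted sequence itself.

```python
def insertion_effect(inserted_bases: int) -> str:
    """Classify an insertion in coding sequence by its effect on the reading frame."""
    if inserted_bases <= 0:
        raise ValueError("inserted_bases must be a positive integer")
    if inserted_bases % 3 == 0:
        # Whole codons are added, so downstream codons keep their frame.
        return f"in-frame ({inserted_bases // 3} extra codons)"
    # A partial codon shifts every downstream codon.
    return "frameshift"


print(insertion_effect(8))   # frameshift
print(insertion_effect(78))  # in-frame (26 extra codons)
```

An in-frame insertion preserves the downstream reading frame, but the inserted intronic codons can themselves include a stop codon, which is consistent with the premature stop reported for the 78-base insertion.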
Miranda, Priscilla J.; McLain, Nathan K.; Hatzenpichler, Roland; Orphan, Victoria J.; Dillon, Jesse G.
2016-01-01
The shallow-sea hydrothermal vents at White Point (WP) in Palos Verdes on the southern California coast support microbial mats and provide easily accessed settings in which to study chemolithoautotrophic sulfur cycling. Previous studies have cultured sulfur-oxidizing bacteria from the WP mats; however, almost nothing is known about the in situ diversity and activity of the microorganisms in these habitats. We studied the diversity, micron-scale spatial associations and metabolic activity of the mat community via sequence analysis of 16S rRNA and aprA genes, fluorescence in situ hybridization (FISH) microscopy and sulfate reduction rate (SRR) measurements. Sequence analysis revealed a diverse group of bacteria, dominated by sulfur cycling gamma-, epsilon-, and deltaproteobacterial lineages such as Marithrix, Sulfurovum, and Desulfuromusa. FISH microscopy suggests a close physical association between sulfur-oxidizing and sulfur-reducing genotypes, while radiotracer studies showed low, but detectable, SRR. Comparative 16S rRNA gene sequence analyses indicate the WP sulfur vent microbial mat community is similar to, but distinct from, other hydrothermal vent communities representing a range of biotopes and lithologic settings. These findings suggest a complete biological sulfur cycle is operating in the WP mat ecosystem mediated by diverse bacterial lineages, with some similarity to deep-sea hydrothermal vent communities. PMID:27512390
Takahara, Hiroyuki; Dolf, Andreas; Endl, Elmar; O'Connell, Richard
2009-08-01
Generation of stage-specific cDNA libraries is a powerful approach to identify pathogen genes that are differentially expressed during plant infection. Biotrophic pathogens develop specialized infection structures inside living plant cells, but sampling the transcriptome of these structures is problematic due to the low ratio of fungal to plant RNA, and the lack of efficient methods to isolate them from infected plants. Here we established a method, based on fluorescence-activated cell sorting (FACS), to purify the intracellular biotrophic hyphae of Colletotrichum higginsianum from homogenates of infected Arabidopsis leaves. Specific selection of viable hyphae using a fluorescent vital marker provided intact RNA for cDNA library construction. Pilot-scale sequencing showed that the library was enriched with plant-induced and pathogenicity-related fungal genes, including some encoding small, soluble secreted proteins that represent candidate fungal effectors. The high purity of the hyphae (94%) prevented contamination of the library by sequences derived from host cells or other fungal cell types. RT-PCR confirmed that genes identified in the FACS-purified hyphae were also expressed in planta. The method has wide applicability for isolating the infection structures of other plant pathogens, and will facilitate cell-specific transcriptome analysis via deep sequencing and microarray hybridization, as well as proteomic analyses.
Kale, Shiv D; Ayubi, Tariq; Chung, Dawoon; Tubau-Juni, Nuria; Leber, Andrew; Dang, Ha X; Karyala, Saikumar; Hontecillas, Raquel; Lawrence, Christopher B; Cramer, Robert A; Bassaganya-Riera, Josep
2017-12-06
Incidences of invasive pulmonary aspergillosis (IPA), an infection caused predominantly by Aspergillus fumigatus, have increased due to the growing number of immunocompromised individuals. While A. fumigatus is reliant upon deficiencies in the host to facilitate invasive disease, the distinct mechanisms that govern the host-pathogen interaction remain enigmatic, particularly in the context of distinct immune modulating therapies. To gain insights into these mechanisms, RNA-Seq technology was utilized to sequence RNA derived from lungs of two clinically relevant, but immunologically distinct, murine models of IPA on days 2 and 3 post inoculation, when infection is established and active disease present. Our findings identify notable differences in host gene expression between the chemotherapeutic and steroid models at the interface of immunity and metabolism. RT-qPCR verified model-specific and nonspecific expression of 23 immune-associated genes. Deep sequencing facilitated identification of highly expressed fungal genes. We utilized sequence similarity and gene expression to categorize the A. fumigatus putative in vivo secretome. RT-qPCR suggests model-specific gene expression for nine putative fungal secreted proteins. Our analysis identifies contrasting responses by the host and fungus from day 2 to 3 between the two models. These differences may help tailor the identification, development, and deployment of host- and/or fungal-targeted therapeutics.
Ren, Xianyun; Cui, Yanting; Gao, Baoquan; Liu, Ping; Li, Jian
2016-08-01
MicroRNAs (miRNAs) are a class of endogenous small non-coding RNAs that regulate gene expression by post-transcriptional repression of mRNAs. The swimming crab Portunus trituberculatus is one of the most important crustacean species for aquaculture in China. However, to date no miRNAs have been reported to modulate growth in P. trituberculatus. To investigate miRNAs involved in the growth of this species, we constructed six small RNA libraries for big individuals (BIs) and small individuals (SIs) from a highly inbred family. Six mixed RNA pools of five tissues (eyestalk, gill, heart, hepatopancreas, and muscle) were obtained. By aligning sequencing data with those for known miRNAs, a total of 404 miRNAs, including 339 known and 65 novel miRNAs, were identified from the six libraries. MiR-100 and miR-276a-3p were among the most prominent miRNA species. We identified seven differentially expressed miRNAs between the BIs and SIs, which were validated using real-time PCR. Preliminary analyses of their putative target genes and GO and KEGG pathway analyses showed that these differentially expressed miRNAs could play important roles in global transcriptional depression and cell differentiation of P. trituberculatus. This study reveals the first miRNA profile related to the body growth of P. trituberculatus, which would be particularly useful for crab breeding programs.
Yang, Jian-Hua; Li, Jun-Hao; Jiang, Shan; Zhou, Hui; Qu, Liang-Hu
2013-01-01
Long non-coding RNAs (lncRNAs) and microRNAs (miRNAs) represent two classes of important non-coding RNAs in eukaryotes. Although these non-coding RNAs have been implicated in organismal development and in various human diseases, surprisingly little is known about their transcriptional regulation. Recent advances in chromatin immunoprecipitation with next-generation DNA sequencing (ChIP-Seq) have provided methods of detecting transcription factor binding sites (TFBSs) with unprecedented sensitivity. In this study, we describe ChIPBase (http://deepbase.sysu.edu.cn/chipbase/), a novel database that we have developed to facilitate the comprehensive annotation and discovery of transcription factor binding maps and transcriptional regulatory relationships of lncRNAs and miRNAs from ChIP-Seq data. The current release of ChIPBase includes high-throughput sequencing data that were generated by 543 ChIP-Seq experiments in diverse tissues and cell lines from six organisms. By analysing millions of TFBSs, we identified tens of thousands of TF-lncRNA and TF-miRNA regulatory relationships. Furthermore, two web-based servers were developed to annotate and discover transcriptional regulatory relationships of lncRNAs and miRNAs from ChIP-Seq data. In addition, we developed two genome browsers, deepView and genomeView, to provide integrated views of multidimensional data. Moreover, our web implementation supports diverse query types and the exploration of TFs, lncRNAs, miRNAs, gene ontologies and pathways.
Huiet, L; Feldstein, P A; Tsai, J H; Falk, B W
1993-12-01
Primer extension analyses and a PCR-based cloning strategy were used to identify and characterize 5' nucleotide sequences on the maize stripe virus (MStV) RNA4 mRNA transcripts encoding the major noncapsid protein (NCP). Direct RNA sequence analysis by primer extension showed that the NCP mRNA transcripts had 10-15 nucleotides beyond the 5' terminus of the MStV RNA4 nucleotide sequence. MStV genomic RNAs isolated from ribonucleoprotein particles (RNPs) lacked the additional 5' nucleotides. cDNA clones representing the 5' region of the mRNA transcripts were constructed, and the nucleotide sequences of the 5' regions were determined for 16 clones. Each was found to have a distinct 10-15 nucleotide sequence immediately 5' of the MStV RNA4 sequence. Eleven of 16 clones had the correct MStV RNA4 5' nucleotide sequence, while five showed minor variations at or near the 5' most MStV RNA4 nucleotide. These characteristics show strong similarities to other viral mRNA transcripts which are synthesized by cap snatching.
Perreault, Nancy N.; Andersen, Dale T.; Pollard, Wayne H.; Greer, Charles W.; Whyte, Lyle G.
2007-01-01
The springs at Gypsum Hill and Colour Peak on Axel Heiberg Island in the Canadian Arctic originate from deep salt aquifers and are among the few known examples of cold springs in thick permafrost on Earth. The springs discharge cold anoxic brines (7.5 to 15.8% salts), with a mean oxidoreduction potential of −325 mV, and contain high concentrations of sulfate and sulfide. We surveyed the microbial diversity in the sediments of seven springs by denaturing gradient gel electrophoresis (DGGE) and analyzing clone libraries of 16S rRNA genes amplified with Bacteria- and Archaea-specific primers. Dendrogram analysis of the DGGE banding patterns divided the springs into two clusters based on their geographic origin. Bacterial 16S rRNA clone sequences from the Gypsum Hill library (spring GH-4) were classified into seven phyla (Actinobacteria, Bacteroidetes, Firmicutes, Gemmatimonadetes, Proteobacteria, Spirochaetes, and Verrucomicrobia); Deltaproteobacteria and Gammaproteobacteria sequences represented half of the clone library. Sequences related to Proteobacteria (82%), Firmicutes (9%), and Bacteroidetes (6%) constituted 97% of the bacterial clone library from Colour Peak (spring CP-1). Most GH-4 archaeal clone sequences (79%) were related to the Crenarchaeota while half of the CP-1 sequences were related to orders Halobacteriales and Methanosarcinales of the Euryarchaeota. Sequences related to the sulfur-oxidizing bacterium Thiomicrospira psychrophila dominated both the GH-4 (19%) and CP-1 (45%) bacterial libraries, and 56 to 76% of the bacterial sequences were from potential sulfur-metabolizing bacteria. These results suggest that the utilization and cycling of sulfur compounds may play a major role in the energy production and maintenance of microbial communities in these unique, cold environments. PMID:17220254