Sample records for h-dbas human-transcriptome database

  1. Database Administrator

    ERIC Educational Resources Information Center

    Moore, Pam

    2010-01-01

    The Internet and electronic commerce (e-commerce) generate lots of data. Data must be stored, organized, and managed. Database administrators, or DBAs, work with database software to find ways to do this. They identify user needs, set up computer databases, and test systems. They ensure that systems perform as they should and add people to the…

  2. Generation and characterization of highly strained dibenzotetrakisdehydro[12]- and dibenzopentakisdehydro[14]annulenes.

    PubMed

    Hisaki, Ichiro; Eda, Takeshi; Sonoda, Motohiro; Niino, Hiroyuki; Sato, Tadatake; Wakabayashi, Tomonari; Tobe, Yoshito

    2005-03-04

    To generate dibenzotetrakisdehydro[12]- and dibenzopentakisdehydro[14]annulenes ([12]- and [14]DBAs) having a highly deformed triyne moiety, [4.3.2]propellatriene-anneleted dehydro[12]- and dehydro[14]annulenes were prepared as their precursors. UV irradiation of the precursors resulted in the photochemical [2 + 2] cycloreversion to generate the strained [12]- and [14]DBAs, respectively. The [12]DBA was not detected by 1H NMR spectroscopy, but it was intercepted as Diels-Alder adducts in solution, suggesting its intermediacy. Its spectroscopic characterization was successfully carried out by UV-vis spectroscopy in a 2-methyltetrahydrofuran (MTHF) glass matrix at 77 K and by FT-IR spectroscopy in an argon matrix at 20 K. On the other hand, the [14]DBA was stable enough for observation by 1H and 13C NMR spectra in solution, though it was not isolated because of the low efficiency of the cycloreversion. The [14]DBA was also characterized by interception as Diels-Alder adducts in solution and by UV-vis spectroscopy in a MTHF glass matrix at 77 K. The kinetic stabilities of the DBAs are compared with the related dehydrobenzoannulenes with respect to the topology of the pi-systems. In addition, the tropicity of the [14]DBA is discussed based on its experimental and theoretical 1H NMR chemical shifts.

  3. The top skin-associated genes: a comparative analysis of human and mouse skin transcriptomes.

    PubMed

    Gerber, Peter Arne; Buhren, Bettina Alexandra; Schrumpf, Holger; Homey, Bernhard; Zlotnik, Albert; Hevezi, Peter

    2014-06-01

    The mouse represents a key model system for the study of the physiology and biochemistry of skin. Comparison of skin between mouse and human is critical for interpretation and application of data from mouse experiments to human disease. Here, we review the current knowledge on structure and immunology of mouse and human skin. Moreover, we present a systematic comparison of human and mouse skin transcriptomes. To this end, we have recently used a genome-wide database of human gene expression to identify genes highly expressed in skin, with no, or limited expression elsewhere - human skin-associated genes (hSAGs). Analysis of our set of hSAGs allowed us to generate a comprehensive molecular characterization of healthy human skin. Here, we used a similar database to generate a list of mouse skin-associated genes (mSAGs). A comparative analysis between the top human (n=666) and mouse (n=873) skin-associated genes (SAGs) revealed a total of only 30.2% identity between the two lists. The majority of shared genes encode proteins that participate in structural and barrier functions. Analysis of the top functional annotation terms revealed an overlap for morphogenesis, cell adhesion, structure, and signal transduction. The results of this analysis, discussed in the context of published data, illustrate the diversity between the molecular make up of skin of both species and grants a probable explanation, why results generated in murine in vivo models often fail to translate into the human.

  4. ESCAPE: database for integrating high-content published data collected from human and mouse embryonic stem cells.

    PubMed

    Xu, Huilei; Baroukh, Caroline; Dannenfelser, Ruth; Chen, Edward Y; Tan, Christopher M; Kou, Yan; Kim, Yujin E; Lemischka, Ihor R; Ma'ayan, Avi

    2013-01-01

    High content studies that profile mouse and human embryonic stem cells (m/hESCs) using various genome-wide technologies such as transcriptomics and proteomics are constantly being published. However, efforts to integrate such data to obtain a global view of the molecular circuitry in m/hESCs are lagging behind. Here, we present an m/hESC-centered database called Embryonic Stem Cell Atlas from Pluripotency Evidence integrating data from many recent diverse high-throughput studies including chromatin immunoprecipitation followed by deep sequencing, genome-wide inhibitory RNA screens, gene expression microarrays or RNA-seq after knockdown (KD) or overexpression of critical factors, immunoprecipitation followed by mass spectrometry proteomics and phosphoproteomics. The database provides web-based interactive search and visualization tools that can be used to build subnetworks and to identify known and novel regulatory interactions across various regulatory layers. The web-interface also includes tools to predict the effects of combinatorial KDs by additive effects controlled by sliders, or through simulation software implemented in MATLAB. Overall, the Embryonic Stem Cell Atlas from Pluripotency Evidence database is a comprehensive resource for the stem cell systems biology community. Database URL: http://www.maayanlab.net/ESCAPE

  5. Validation of the German version of the short form of the dysfunctional beliefs and attitudes about sleep scale (DBAS-16).

    PubMed

    Lang, Christin; Brand, Serge; Holsboer-Trachsler, Edith; Pühse, Uwe; Colledge, Flora; Gerber, Markus

    2017-06-01

    Research shows that dysfunctional sleep-related cognitions play an important role in the development, maintenance and exacerbation of insomnia. This study examines the factorial validity, psychometric properties and both concurrent and predictive validity of the German version of the 16-item DBAS (dysfunctional beliefs and attitudes about sleep) scale. Data was collected in 864 vocational students from the German-speaking part of Switzerland (43% females, M age  = 17.9 years). Data collection took place twice within a 10-month interval. The students completed a German translation of the DBAS-16, the Insomnia Severity Index (ISI), the Pittsburgh Sleep Quality Index (PSQI), and provided information about their psychological functioning. Descriptive statistics, factorial validity, internal consistency, gender differences, concurrent, and predictive validity were examined. Confirmatory factor analysis supported the 4-factor structure of the DBAS-16. All factors (consequences, worry/helplessness, expectations, medication) were positively correlated and had acceptable psychometric properties. Females reported higher scores across all DBAS measures. Weak-to-moderate correlations were found between dysfunctional sleep-related beliefs, insomnia and poor sleep quality. Dysfunctional sleep-related beliefs were also associated with decreased psychological functioning, and consistently predicted insomnia and poor psychological functioning at follow-up, even after controlling for socio-demographic background and baseline levels. The present study provides support for the validity and psychometric properties of the German version of the DBAS-16. Most importantly, it corroborates the relevance of cognitive-emotional factors in the onset and maintenance of insomnia and psychological symptoms among young people.

  6. iMETHYL: an integrative database of human DNA methylation, gene expression, and genomic variation.

    PubMed

    Komaki, Shohei; Shiwa, Yuh; Furukawa, Ryohei; Hachiya, Tsuyoshi; Ohmomo, Hideki; Otomo, Ryo; Satoh, Mamoru; Hitomi, Jiro; Sobue, Kenji; Sasaki, Makoto; Shimizu, Atsushi

    2018-01-01

    We launched an integrative multi-omics database, iMETHYL (http://imethyl.iwate-megabank.org). iMETHYL provides whole-DNA methylation (~24 million autosomal CpG sites), whole-genome (~9 million single-nucleotide variants), and whole-transcriptome (>14 000 genes) data for CD4 + T-lymphocytes, monocytes, and neutrophils collected from approximately 100 subjects. These data were obtained from whole-genome bisulfite sequencing, whole-genome sequencing, and whole-transcriptome sequencing, making iMETHYL a comprehensive database.

  7. Modification of Flight and Locomotion Performances, Respiratory Metabolism, and Transcriptome Expression in the Lady Beetle Harmonia axyridis through Sublethal Pesticide Exposure

    PubMed Central

    Xiao, Da; Tan, Xiaoling; Wang, Wenjuan; Zhang, Fan; Desneux, Nicolas; Wang, Su

    2017-01-01

    Biological control is usually used in combination with chemical control for practical agricultural applications. Thus, the influence of insecticides on the natural predators used for biological control should be investigated for integrated pest management. The ladybird Harmonia axyridis is an effective predator on aphids and coccids. Beta-cypermethrin is a broad-spectrum insecticide used worldwide for controlling insect pests. H. axyridis is becoming increasingly threatened by this insecticide. Here, we investigated the effect of a sublethal dose of beta-cypermethrin on flight, locomotion, respiration, and detoxification system of H. axyridis. After exposure to beta-cypermethrin, succinic female adults flew more times, longer distances, and during longer time periods. Exposure to a sublethal dose of beta-cypermethrin also promoted an increase in walking rate, walking distance, walking duration, and also an increase in respiratory quotient and respiratory rate. To investigate the effects of beta-cypermethrin on H. axyridis detoxification system, we analyzed the transcriptome of H. axyridis adults, focusing on genes related to detoxification systems. De novo assembly generated 65,509 unigenes with a mean length of 799 bp. From these genes, 26,020 unigenes (40.91% of all unigenes) exhibited clear homology to known genes in the NCBI non-redundant database. In addition, 10,402 unigenes were annotated in the Cluster of Orthologous Groups database, 12,088 unigenes were assigned to the Gene Ontology database and 12,269 unigenes were in the Kyoto Encyclopedia of Genes and Genome (KEGG) database. Exposure to beta-cypermethrin had significant effects on the transcriptome profile of H. axyridis adult. Based on uniquely mapped reads, 3,296 unigenes were differentially expressed, 868 unigenes were up-regulated and 2,248 unigenes were down-regulated. We identified differentially-expressed unigenes related to general detoxification systems in H. axyridis. This assembled, annotated transcriptome provides a valuable genomic resource for further understanding the molecular basis of detoxification mechanisms in H. axyridis. PMID:28239355

  8. Modification of Flight and Locomotion Performances, Respiratory Metabolism, and Transcriptome Expression in the Lady Beetle Harmonia axyridis through Sublethal Pesticide Exposure.

    PubMed

    Xiao, Da; Tan, Xiaoling; Wang, Wenjuan; Zhang, Fan; Desneux, Nicolas; Wang, Su

    2017-01-01

    Biological control is usually used in combination with chemical control for practical agricultural applications. Thus, the influence of insecticides on the natural predators used for biological control should be investigated for integrated pest management. The ladybird Harmonia axyridis is an effective predator on aphids and coccids. Beta-cypermethrin is a broad-spectrum insecticide used worldwide for controlling insect pests. H. axyridis is becoming increasingly threatened by this insecticide. Here, we investigated the effect of a sublethal dose of beta-cypermethrin on flight, locomotion, respiration, and detoxification system of H. axyridis . After exposure to beta-cypermethrin, succinic female adults flew more times, longer distances, and during longer time periods. Exposure to a sublethal dose of beta-cypermethrin also promoted an increase in walking rate, walking distance, walking duration, and also an increase in respiratory quotient and respiratory rate. To investigate the effects of beta-cypermethrin on H. axyridis detoxification system, we analyzed the transcriptome of H. axyridis adults, focusing on genes related to detoxification systems. De novo assembly generated 65,509 unigenes with a mean length of 799 bp. From these genes, 26,020 unigenes (40.91% of all unigenes) exhibited clear homology to known genes in the NCBI non-redundant database. In addition, 10,402 unigenes were annotated in the Cluster of Orthologous Groups database, 12,088 unigenes were assigned to the Gene Ontology database and 12,269 unigenes were in the Kyoto Encyclopedia of Genes and Genome (KEGG) database. Exposure to beta-cypermethrin had significant effects on the transcriptome profile of H. axyridis adult. Based on uniquely mapped reads, 3,296 unigenes were differentially expressed, 868 unigenes were up-regulated and 2,248 unigenes were down-regulated. We identified differentially-expressed unigenes related to general detoxification systems in H. axyridis . This assembled, annotated transcriptome provides a valuable genomic resource for further understanding the molecular basis of detoxification mechanisms in H. axyridis .

  9. DBGC: A Database of Human Gastric Cancer

    PubMed Central

    Wang, Chao; Zhang, Jun; Cai, Mingdeng; Zhu, Zhenggang; Gu, Wenjie; Yu, Yingyan; Zhang, Xiaoyan

    2015-01-01

    The Database of Human Gastric Cancer (DBGC) is a comprehensive database that integrates various human gastric cancer-related data resources. Human gastric cancer-related transcriptomics projects, proteomics projects, mutations, biomarkers and drug-sensitive genes from different sources were collected and unified in this database. Moreover, epidemiological statistics of gastric cancer patients in China and clinicopathological information annotated with gastric cancer cases were also integrated into the DBGC. We believe that this database will greatly facilitate research regarding human gastric cancer in many fields. DBGC is freely available at http://bminfor.tongji.edu.cn/dbgc/index.do PMID:26566288

  10. Synthesis and surface activities of a novel di-hydroxyl-sulfate-betaine-type zwitterionic gemini surfactants

    NASA Astrophysics Data System (ADS)

    Geng, Xiang F.; Hu, Xing Q.; Xia, Ji J.; Jia, Xue C.

    2013-04-01

    A series of novel di-hydroxyl-sulfate-betaine-type zwitterionic gemini surfactants of 1,2-bis[N-ethyl-N-(2-hydroxyl-3-sulfopropyl)-alkylammonium] alkyl betaines (DBAs-n, where s and n represent the spacer length of 2, 4 and 6 and the hydrocarbon chain length of 8, 12, 14, 16 and 18, respectively) were synthesized by reacting alkylamine with sodium 3-chloro-2-hydroxypropanesulfonate (the alternative sulphonated agent), followed by the reactions with а,ω-dibromoalkyl and then ethyl bromide. Their adsorption and aggregation properties were investigated by means of equilibrium surface tension, dynamic light-scattering (DLS) and transmission electron microscopy (TEM). DBAs-n gemini surfactants showed excellent surface activities and packed tightly at the interface. For example, the minimum CMC value for DBAs-n series was of the order of 10-5 M and the surface tension of water can be decreased as low as 22.2 mN/m. It was also found that the aggregates of DBAs-n solutions were significantly dependent on their hydrocarbon chain lengths. The aggregates changed from vesicles to entangled fiber-like micelles as the chain length increased from dodecyl to tetradecyl.

  11. Investigation on dysfunctional beliefs and attitudes about sleep in Chinese college students.

    PubMed

    Jin, Lairun; Zhou, Jun; Peng, Hui; Ding, Shushu; Yuan, Hui

    2018-01-01

    The aims of this study were to evaluate a subset of sleep-related cognitions and to examine whether dysfunctional beliefs and attitudes about sleep were associated with sleep quality in college students. A total of 1,333 college students were enrolled in this study by randomized cluster sampling. A brief version of Dysfunctional Beliefs and Attitudes about Sleep Scale (DBAS-16) was administered to college students at several colleges. Sleep quality was also assessed using the Pittsburgh Sleep Quality Index (PSQI). The DBAS-16 scores were analyzed across different demographic variables, corresponding subscales of 7-item PSQI, and relevant sleep behavior variables. A total of 343 participants were poor sleepers, while 990 were good sleepers, as defined by PSQI. The DBAS-16 scores were lower in poor sleepers than in good sleepers (46.32 ± 7.851 vs 49.87 ± 8.349, p < 0.001), and DBAS-16 scores were lower in females and nonmedical students when compared with those in males and medical students, respectively (48.20 ± 8.711 vs 49.73 ± 7.923, p = 0.001; 48.56 ± 8.406 vs 49.88 ± 8.208, p = 0.009, respectively). The total score for sleep quality, as measured by PSQI, was negatively correlated with the DBAS-16 total score ( r = -0.197, p < 0.01). There were significant differences in PSQI scores between individuals with attitudes and those without attitudes about sleep with respect to good sleep habits ( p < 0.001), self-relaxation ( p = 0.001), physical exercise ( p < 0.001), taking sleeping pills ( p = 0.004), and taking no action ( p < 0.001). Dysfunctional beliefs about sleep are associated with sleep quality and should be discouraged, especially for females and nonmedical college students.

  12. Cognitive Expectancies for Hypnotic Use among Older Adult Veterans with Chronic Insomnia.

    PubMed

    Fung, Constance H; Martin, Jennifer L; Josephson, Karen; Fiorentino, Lavinia; Dzierzewski, Joseph M; Jouldjian, Stella; Song, Yeonsu; Rodriguez Tapia, Juan Carlos; Mitchell, Michael N; Alessi, Cathy A

    2018-01-01

    To examine relationships between cognitive expectancies about sleep and hypnotics and use of medications commonly used for insomnia (hypnotics). We analyzed baseline data from older veterans who met diagnostic criteria for insomnia and were enrolled in a trial comparing CBTI delivered by a supervised, sleep educator to an attention control condition (N = 159; 97% male, mean age 72 years). We classified individuals as hypnotic users (N = 23) vs. non-users (N = 135) based upon medication diaries. Associations between hypnotic status and Dysfunctional Beliefs and Attitudes about Sleep-16 (DBAS) total score (0-10, higher = worse) and two DBAS medication item scores (Item 1: "…better off taking a sleeping pill rather than having a poor night's sleep;" Item 2: "Medication… probably the only solution to sleeplessness"; 0-10, higher = worse) were examined in logistic regression models. Higher scores on the DBAS medication items (both odds ratios = 1.3; p-values < .001) were significantly associated with hypnotic use. DBAS-16 total score was not associated with hypnotic use. Cognitive expectancy (dysfunctional beliefs) about hypnotics was associated with hypnotic use in older adults with chronic insomnia disorder. Strategies that specifically target dysfunctional beliefs about hypnotics are needed and may impact hypnotic use in older adults.

  13. Mining a human transcriptome database for Nrf2 modulators

    EPA Science Inventory

    Nuclear factor erythroid-2 related factor 2 (Nrf2) is a key transcription factor important in the protection against oxidative stress. We developed computational procedures to enable the identification of chemical, genetic and environmental modulators of Nrf2 in a large database ...

  14. Construction of Pará rubber tree genome and multi-transcriptome database accelerates rubber researches.

    PubMed

    Makita, Yuko; Kawashima, Mika; Lau, Nyok Sean; Othman, Ahmad Sofiman; Matsui, Minami

    2018-01-19

    Natural rubber is an economically important material. Currently the Pará rubber tree, Hevea brasiliensis is the main commercial source. Little is known about rubber biosynthesis at the molecular level. Next-generation sequencing (NGS) technologies brought draft genomes of three rubber cultivars and a variety of RNA sequencing (RNA-seq) data. However, no current genome or transcriptome databases (DB) are organized by gene. A gene-oriented database is a valuable support for rubber research. Based on our original draft genome sequence of H. brasiliensis RRIM600, we constructed a rubber tree genome and transcriptome DB. Our DB provides genome information including gene functional annotations and multi-transcriptome data of RNA-seq, full-length cDNAs including PacBio Isoform sequencing (Iso-Seq), ESTs and genome wide transcription start sites (TSSs) derived from CAGE technology. Using our original and publically available RNA-seq data, we calculated co-expressed genes for identifying functionally related gene sets and/or genes regulated by the same transcription factor (TF). Users can access multi-transcriptome data through both a gene-oriented web page and a genome browser. For the gene searching system, we provide keyword search, sequence homology search and gene expression search; users can also select their expression threshold easily. The rubber genome and transcriptome DB provides rubber tree genome sequence and multi-transcriptomics data. This DB is useful for comprehensive understanding of the rubber transcriptome. This will assist both industrial and academic researchers for rubber and economically important close relatives such as R. communis, M. esculenta and J. curcas. The Rubber Transcriptome DB release 2017.03 is accessible at http://matsui-lab.riken.jp/rubber/ .

  15. Mining biological databases for candidate disease genes

    NASA Astrophysics Data System (ADS)

    Braun, Terry A.; Scheetz, Todd; Webster, Gregg L.; Casavant, Thomas L.

    2001-07-01

    The publicly-funded effort to sequence the complete nucleotide sequence of the human genome, the Human Genome Project (HGP), has currently produced more than 93% of the 3 billion nucleotides of the human genome into a preliminary `draft' format. In addition, several valuable sources of information have been developed as direct and indirect results of the HGP. These include the sequencing of model organisms (rat, mouse, fly, and others), gene discovery projects (ESTs and full-length), and new technologies such as expression analysis and resources (micro-arrays or gene chips). These resources are invaluable for the researchers identifying the functional genes of the genome that transcribe and translate into the transcriptome and proteome, both of which potentially contain orders of magnitude more complexity than the genome itself. Preliminary analyses of this data identified approximately 30,000 - 40,000 human `genes.' However, the bulk of the effort still remains -- to identify the functional and structural elements contained within the transcriptome and proteome, and to associate function in the transcriptome and proteome to genes. A fortuitous consequence of the HGP is the existence of hundreds of databases containing biological information that may contain relevant data pertaining to the identification of disease-causing genes. The task of mining these databases for information on candidate genes is a commercial application of enormous potential. We are developing a system to acquire and mine data from specific databases to aid our efforts to identify disease genes. A high speed cluster of Linux of workstations is used to analyze sequence and perform distributed sequence alignments as part of our data mining and processing. This system has been used to mine GeneMap99 sequences within specific genomic intervals to identify potential candidate disease genes associated with Bardet-Biedle Syndrome (BBS).

  16. Integration of deep transcriptome and proteome analyses reveals the components of alkaloid metabolism in opium poppy cell cultures

    PubMed Central

    2010-01-01

    Background Papaver somniferum (opium poppy) is the source for several pharmaceutical benzylisoquinoline alkaloids including morphine, the codeine and sanguinarine. In response to treatment with a fungal elicitor, the biosynthesis and accumulation of sanguinarine is induced along with other plant defense responses in opium poppy cell cultures. The transcriptional induction of alkaloid metabolism in cultured cells provides an opportunity to identify components of this process via the integration of deep transcriptome and proteome databases generated using next-generation technologies. Results A cDNA library was prepared for opium poppy cell cultures treated with a fungal elicitor for 10 h. Using 454 GS-FLX Titanium pyrosequencing, 427,369 expressed sequence tags (ESTs) with an average length of 462 bp were generated. Assembly of these sequences yielded 93,723 unigenes, of which 23,753 were assigned Gene Ontology annotations. Transcripts encoding all known sanguinarine biosynthetic enzymes were identified in the EST database, 5 of which were represented among the 50 most abundant transcripts. Liquid chromatography-tandem mass spectrometry (LC-MS/MS) of total protein extracts from cell cultures treated with a fungal elicitor for 50 h facilitated the identification of 1,004 proteins. Proteins were fractionated by one-dimensional SDS-PAGE and digested with trypsin prior to LC-MS/MS analysis. Query of an opium poppy-specific EST database substantially enhanced peptide identification. Eight out of 10 known sanguinarine biosynthetic enzymes and many relevant primary metabolic enzymes were represented in the peptide database. Conclusions The integration of deep transcriptome and proteome analyses provides an effective platform to catalogue the components of secondary metabolism, and to identify genes encoding uncharacterized enzymes. The establishment of corresponding transcript and protein databases generated by next-generation technologies in a system with a well-defined metabolite profile facilitates an improved linkage between genes, enzymes, and pathway components. The proteome database represents the most relevant alkaloid-producing enzymes, compared with the much deeper and more complete transcriptome library. The transcript database contained full-length mRNAs encoding most alkaloid biosynthetic enzymes, which is a key requirement for the functional characterization of novel gene candidates. PMID:21083930

  17. Examining maladaptive beliefs about sleep across insomnia patient groups

    PubMed Central

    Carney, Colleen E.; Edinger, Jack D.; Morin, Charles M.; Manber, Rachel; Rybarczyk, Bruce; Stepanski, Edward J.; Wright, Helen; Lack, Leon

    2009-01-01

    Objectives: Unhelpful beliefs about sleep have been linked to insomnia, and increasing one's cognitive flexibility about sleep has been linked to post-treatment sleep improvement. This study evaluated if levels of such beliefs differ across insomnia groups, and whether there are particular beliefs that differ for specific insomnia subtypes. Methods: Participants (N = 1384) were people with insomnia and good sleepers ranging from 18 to 89 years old (M = 42.6, SD = 19.4). Data from previous studies at five insomnia clinical sites were pooled to examine responses on the Dysfunctional Beliefs and Attitudes about Sleep Scale (DBAS-16) across differing insomnia groups. Results: Group analyses revealed that those from community-based insomnia clinics and those who are hypnotic-dependent generally had the highest levels of unhelpful sleep-related beliefs. With the exception of beliefs about sleep needs (wherein only community sleep clinic patients had high scores relative to good sleepers), all insomnia groups had higher scores on the DBAS-16 than good sleepers. A validity analysis suggested that a DBAS-16 index score > 3.8 was the level of unhelpful beliefs associated with clinically significant insomnia, although a slightly lower cutoff may be useful to identify an unhelpful degree of sleep-related beliefs in highly screened PI and medical patient groups. Conclusions: This study offers descriptive data for the use of the DBAS-16 across insomnia subgroups, which will help the user understand what degree of maladaptive sleep beliefs are most strongly associated with clinically significant levels of insomnia. Results also may have implications for cognitive targeting during treatment for particular insomnia groups. PMID:20004301

  18. Transcriptome Dynamics of Developing Photoreceptors in Three‐Dimensional Retina Cultures Recapitulates Temporal Sequence of Human Cone and Rod Differentiation Revealing Cell Surface Markers and Gene Networks

    PubMed Central

    Kaewkhaw, Rossukon; Kaya, Koray Dogan; Brooks, Matthew; Homma, Kohei; Zou, Jizhong; Chaitankar, Vijender; Rao, Mahendra

    2015-01-01

    Abstract The derivation of three‐dimensional (3D) stratified neural retina from pluripotent stem cells has permitted investigations of human photoreceptors. We have generated a H9 human embryonic stem cell subclone that carries a green fluorescent protein (GFP) reporter under the control of the promoter of cone‐rod homeobox (CRX), an established marker of postmitotic photoreceptor precursors. The CRXp‐GFP reporter replicates endogenous CRX expression in vitro when the H9 subclone is induced to form self‐organizing 3D retina‐like tissue. At day 37, CRX+ photoreceptors appear in the basal or middle part of neural retina and migrate to apical side by day 67. Temporal and spatial patterns of retinal cell type markers recapitulate the predicted sequence of development. Cone gene expression is concomitant with CRX, whereas rod differentiation factor neural retina leucine zipper protein (NRL) is first observed at day 67. At day 90, robust expression of NRL and its target nuclear receptor NR2E3 is evident in many CRX+ cells, while minimal S‐opsin and no rhodopsin or L/M‐opsin is present. The transcriptome profile, by RNA‐seq, of developing human photoreceptors is remarkably concordant with mRNA and immunohistochemistry data available for human fetal retina although many targets of CRX, including phototransduction genes, exhibit a significant delay in expression. We report on temporal changes in gene signatures, including expression of cell surface markers and transcription factors; these expression changes should assist in isolation of photoreceptors at distinct stages of differentiation and in delineating coexpression networks. Our studies establish the first global expression database of developing human photoreceptors, providing a reference map for functional studies in retinal cultures. Stem Cells 2015;33:3504–3518 PMID:26235913

  19. TRAM (Transcriptome Mapper): database-driven creation and analysis of transcriptome maps from multiple sources

    PubMed Central

    2011-01-01

    Background Several tools have been developed to perform global gene expression profile data analysis, to search for specific chromosomal regions whose features meet defined criteria as well as to study neighbouring gene expression. However, most of these tools are tailored for a specific use in a particular context (e.g. they are species-specific, or limited to a particular data format) and they typically accept only gene lists as input. Results TRAM (Transcriptome Mapper) is a new general tool that allows the simple generation and analysis of quantitative transcriptome maps, starting from any source listing gene expression values for a given gene set (e.g. expression microarrays), implemented as a relational database. It includes a parser able to assign univocal and updated gene symbols to gene identifiers from different data sources. Moreover, TRAM is able to perform intra-sample and inter-sample data normalization, including an original variant of quantile normalization (scaled quantile), useful to normalize data from platforms with highly different numbers of investigated genes. When in 'Map' mode, the software generates a quantitative representation of the transcriptome of a sample (or of a pool of samples) and identifies if segments of defined lengths are over/under-expressed compared to the desired threshold. When in 'Cluster' mode, the software searches for a set of over/under-expressed consecutive genes. Statistical significance for all results is calculated with respect to genes localized on the same chromosome or to all genome genes. Transcriptome maps, showing differential expression between two sample groups, relative to two different biological conditions, may be easily generated. We present the results of a biological model test, based on a meta-analysis comparison between a sample pool of human CD34+ hematopoietic progenitor cells and a sample pool of megakaryocytic cells. Biologically relevant chromosomal segments and gene clusters with differential expression during the differentiation toward megakaryocyte were identified. Conclusions TRAM is designed to create, and statistically analyze, quantitative transcriptome maps, based on gene expression data from multiple sources. The release includes FileMaker Pro database management runtime application and it is freely available at http://apollo11.isto.unibo.it/software/, along with preconfigured implementations for mapping of human, mouse and zebrafish transcriptomes. PMID:21333005

  20. Transcriptome analysis of Houttuynia cordata Thunb. by Illumina paired-end RNA sequencing and SSR marker discovery.

    PubMed

    Wei, Lin; Li, Shenghua; Liu, Shenggui; He, Anna; Wang, Dan; Wang, Jie; Tang, Yulian; Wu, Xianjin

    2014-01-01

    Houttuynia cordata Thunb. is an important traditional medical herb in China and other Asian countries, with high medicinal and economic value. However, a lack of available genomic information has become a limitation for research on this species. Thus, we carried out high-throughput transcriptomic sequencing of H. cordata to generate an enormous transcriptome sequence dataset for gene discovery and molecular marker development. Illumina paired-end sequencing technology produced over 56 million sequencing reads from H. cordata mRNA. Subsequent de novo assembly yielded 63,954 unigenes, 39,982 (62.52%) and 26,122 (40.84%) of which had significant similarity to proteins in the NCBI nonredundant protein and Swiss-Prot databases (E-value <10(-5)), respectively. Of these annotated unigenes, 30,131 and 15,363 unigenes were assigned to gene ontology categories and clusters of orthologous groups, respectively. In addition, 24,434 (38.21%) unigenes were mapped onto 128 pathways using the KEGG pathway database and 17,964 (44.93%) unigenes showed homology to Vitis vinifera (Vitaceae) genes in BLASTx analysis. Furthermore, 4,800 cDNA SSRs were identified as potential molecular markers. Fifty primer pairs were randomly selected to detect polymorphism among 30 samples of H. cordata; 43 (86%) produced fragments of expected size, suggesting that the unigenes were suitable for specific primer design and of high quality, and the SSR marker could be widely used in marker-assisted selection and molecular breeding of H. cordata in the future. This is the first application of Illumina paired-end sequencing technology to investigate the whole transcriptome of H. cordata and to assemble RNA-seq reads without a reference genome. These data should help researchers investigating the evolution and biological processes of this species. The SSR markers developed can be used for construction of high-resolution genetic linkage maps and for gene-based association analyses in H. cordata. This work will enable future functional genomic research and research into the distinctive active constituents of this genus.

  1. Insight into the transcriptome of Arthrobotrys conoides using high throughput sequencing.

    PubMed

    Ramesh, Pandit; Reena, Patel; Amitbikram, Mohapatra; Chaitanya, Joshi; Anju, Kunjadia

    2015-12-01

    Arthrobotrys conoides is a nematode-trapping fungus belonging to Orbiliales, Ascomycota group, and traps prey nematodes by means of adhesive network. Fungus has a potential to be used as a biocontrol agent against plant parasitic nematodes. In the present study, we characterized the transcriptome of A. conoides using high-throughput sequencing technology and characterized its virulence unigenes. Total 7,255 cDNA contigs with an average length of 425 bp were generated and 6184 (61.81%) transcripts were functionally annotated and characterized. Majority of unigenes were found analogous to the genes of plant pathogenic fungi. A total of 1749 transcripts were found to be orthologous with eukaryotic proteins of KOG database. Several carbohydrate active enzymes and peptidases were identified. We also analyzed classically and nonclassically secreted proteins and confirmed by BLASTP against fungal secretome database. A total of 916 contigs were analogous to 556 unique proteins of Pathogen Host Interaction (PHI) database. Further, we identified 91 unigenes homologous to the database of fungal virulence factor (DFVF). A total of 104 putative protein kinases coding transcripts were identified by BLASTP against KinBase database, which are major players in signaling pathways. This study provides a comprehensive look at the transcriptome of A. conoides and the identified unigenes might have a role in catching and killing prey nematodes by A. conoides. © 2015 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  2. Stress-related sleep vulnerability and maladaptive sleep beliefs predict insomnia at long-term follow-up.

    PubMed

    Yang, Chien-Ming; Hung, Chih-Ying; Lee, Hsin-Chien

    2014-09-15

    Vulnerability to stress-related sleep disturbances and maladaptive sleep beliefs has been proposed to be predisposing factors for insomnia. Yet previous studies addressing these factors have been cross-sectional in nature and could not be used to infer the time sequences of the association. The current study used a six-year follow-up to examine the predisposing roles of these two factors and their interactions with major life stressors in the development of insomnia. One hundred seventeen college students recruited for a survey in 2006 participated in this follow-up survey in 2012. In 2006, they completed a packet of questionnaires including the Dysfunctional Beliefs and Attitudes about Sleep Questionnaire, 10-item version (DBAS-10), the Ford Insomnia Response to Stress Test (FIRST), and the Pittsburgh Sleep Quality Index (PSQI); in 2012 they completed the Insomnia Severity Index (ISI) and the modified Life Experiences Survey (LES). Fourteen of the participants were found to suffer from insomnia as measured by the ISI. Logistic regression showed that scores on both DBAS-10 and FIRST could predict insomnia at follow-up. When the interaction of DBAS-10 and LES and that of FIRST and LES were added, both DBAS-10 and FIRST remained significant predictors, while the interaction of FIRST and LES showed a near-significant trend in predicting insomnia. The results showed that both vulnerability to stress-related sleep disturbances and maladaptive sleep beliefs are predisposing factors for insomnia. The hypothesized interaction effect between sleep vulnerability and major life stressors was found to be marginal. The maladaptive sleep beliefs, on the other hand, showed a predisposing effect independent from the influences of negative life events. © 2014 American Academy of Sleep Medicine.

  3. RNA-seq analysis of Rubus idaeus cv. Nova: transcriptome sequencing and de novo assembly for subsequent functional genomics approaches.

    PubMed

    Hyun, Tae Kyung; Lee, Sarah; Kumar, Dhinesh; Rim, Yeonggil; Kumar, Ritesh; Lee, Sang Yeol; Lee, Choong Hwan; Kim, Jae-Yean

    2014-10-01

    Using Illumina sequencing technology, we have generated the large-scale transcriptome sequencing data containing abundant information on genes involved in the metabolic pathways in R. idaeus cv. Nova fruits. Rubus idaeus (Red raspberry) is one of the important economical crops that possess numerous nutrients, micronutrients and phytochemicals with essential health benefits to human. The molecular mechanism underlying the ripening process and phytochemical biosynthesis in red raspberry is attributed to the changes in gene expression, but very limited transcriptomic and genomic information in public databases is available. To address this issue, we generated more than 51 million sequencing reads from R. idaeus cv. Nova fruit using Illumina RNA-Seq technology. After de novo assembly, we obtained 42,604 unigenes with an average length of 812 bp. At the protein level, Nova fruit transcriptome showed 77 and 68 % sequence similarities with Rubus coreanus and Fragaria versa, respectively, indicating the evolutionary relationship between them. In addition, 69 % of assembled unigenes were annotated using public databases including NCBI non-redundant, Cluster of Orthologous Groups and Gene ontology database, suggesting that our transcriptome dataset provides a valuable resource for investigating metabolic processes in red raspberry. To analyze the relationship between several novel transcripts and the amounts of metabolites such as γ-aminobutyric acid and anthocyanins, real-time PCR and target metabolite analysis were performed on two different ripening stages of Nova. This is the first attempt using Illumina sequencing platform for RNA sequencing and de novo assembly of Nova fruit without reference genome. Our data provide the most comprehensive transcriptome resource available for Rubus fruits, and will be useful for understanding the ripening process and for breeding R. idaeus cultivars with improved fruit quality.

  4. Reduction of dark-band-like metal artifacts caused by dental implant bodies using hypothetical monoenergetic imaging after dual-energy computed tomography.

    PubMed

    Tanaka, Ray; Hayashi, Takafumi; Ike, Makiko; Noto, Yoshiyuki; Goto, Tazuko K

    2013-06-01

    The aim of this study was to evaluate the usefulness of hypothetical monoenergetic images after dual-energy computed tomography (DECT) for assessment of the bone encircling dental implant bodies. Seventy-two axial images of implantation sites clipped out from image data scanned using DECT in dual-energy mode were used. Subjective assessment on reduction of dark-band-like artifacts (R-DBAs) and diagnosability of adjacent bone condition (D-ABC) in 3 sets of DECT images-a fused image set (DE120) and 2 sets of hypothetical monoenergetic images (ME100, ME190)-was performed and the results were statistically analyzed. With regards to R-DBAs and D-ABC, significant differences among DE120, ME100, and ME190 were observed. The ME100 and ME190 images revealed more artifact reduction and diagnosability than those of DE120. DECT imaging followed by hypothetical monoenergetic image construction can cause R-DBAs and increase D-ABC and may be potentially used for the evaluation of postoperative changes in the bone encircling implant bodies. Copyright © 2013 Elsevier Inc. All rights reserved.

  5. Haemophilus ducreyi Seeks Alternative Carbon Sources and Adapts to Nutrient Stress and Anaerobiosis during Experimental Infection of Human Volunteers.

    PubMed

    Gangaiah, Dharanesh; Zhang, Xinjun; Baker, Beth; Fortney, Kate R; Gao, Hongyu; Holley, Concerta L; Munson, Robert S; Liu, Yunlong; Spinola, Stanley M

    2016-05-01

    Haemophilus ducreyi causes the sexually transmitted disease chancroid in adults and cutaneous ulcers in children. In humans, H. ducreyi resides in an abscess and must adapt to a variety of stresses. Previous studies (D. Gangaiah, M. Labandeira-Rey, X. Zhang, K. R. Fortney, S. Ellinger, B. Zwickl, B. Baker, Y. Liu, D. M. Janowicz, B. P. Katz, C. A. Brautigam, R. S. Munson, Jr., E. J. Hansen, and S. M. Spinola, mBio 5:e01081-13, 2014, http://dx.doi.org/10.1128/mBio.01081-13) suggested that H. ducreyi encounters growth conditions in human lesions resembling those found in stationary phase. However, how H. ducreyi transcriptionally responds to stress during human infection is unknown. Here, we determined the H. ducreyi transcriptome in biopsy specimens of human lesions and compared it to the transcriptomes of bacteria grown to mid-log, transition, and stationary phases. Multidimensional scaling showed that the in vivo transcriptome is distinct from those of in vitro growth. Compared to the inoculum (mid-log-phase bacteria), H. ducreyi harvested from pustules differentially expressed ∼93 genes, of which 62 were upregulated. The upregulated genes encode homologs of proteins involved in nutrient transport, alternative carbon pathways (l-ascorbate utilization and metabolism), growth arrest response, heat shock response, DNA recombination, and anaerobiosis. H. ducreyi upregulated few genes (hgbA, flp-tad, and lspB-lspA2) encoding virulence determinants required for human infection. Most genes regulated by CpxRA, RpoE, Hfq, (p)ppGpp, and DksA, which control the expression of virulence determinants and adaptation to a variety of stresses, were not differentially expressed in vivo, suggesting that these systems are cycling on and off during infection. Taken together, these data suggest that the in vivo transcriptome is distinct from those of in vitro growth and that adaptation to nutrient stress and anaerobiosis is crucial for H. ducreyi survival in humans. Copyright © 2016, American Society for Microbiology. All Rights Reserved.

  6. Discovery of parvovirus-related sequences in an unexpected broad range of animals.

    PubMed

    François, S; Filloux, D; Roumagnac, P; Bigot, D; Gayral, P; Martin, D P; Froissart, R; Ogliastro, M

    2016-09-07

    Our knowledge of the genetic diversity and host ranges of viruses is fragmentary. This is particularly true for the Parvoviridae family. Genetic diversity studies of single stranded DNA viruses within this family have been largely focused on arthropod- and vertebrate-infecting species that cause diseases of humans and our domesticated animals: a focus that has biased our perception of parvovirus diversity. While metagenomics approaches could help rectify this bias, so too could transcriptomics studies. Large amounts of transcriptomic data are available for a diverse array of animal species and whenever this data has inadvertently been gathered from virus-infected individuals, it could contain detectable viral transcripts. We therefore performed a systematic search for parvovirus-related sequences (PRSs) within publicly available transcript, genome and protein databases and eleven new transcriptome datasets. This revealed 463 PRSs in the transcript databases of 118 animals. At least 41 of these PRSs are likely integrated within animal genomes in that they were also found within genomic sequence databases. Besides illuminating the ubiquity of parvoviruses, the number of parvoviral sequences discovered within public databases revealed numerous previously unknown parvovirus-host combinations; particularly in invertebrates. Our findings suggest that the host-ranges of extant parvoviruses might span the entire animal kingdom.

  7. GigaTON: an extensive publicly searchable database providing a new reference transcriptome in the pacific oyster Crassostrea gigas.

    PubMed

    Riviere, Guillaume; Klopp, Christophe; Ibouniyamine, Nabihoudine; Huvet, Arnaud; Boudry, Pierre; Favrel, Pascal

    2015-12-02

    The Pacific oyster, Crassostrea gigas, is one of the most important aquaculture shellfish resources worldwide. Important efforts have been undertaken towards a better knowledge of its genome and transcriptome, which makes now C. gigas becoming a model organism among lophotrochozoans, the under-described sister clade of ecdysozoans within protostomes. These massive sequencing efforts offer the opportunity to assemble gene expression data and make such resource accessible and exploitable for the scientific community. Therefore, we undertook this assembly into an up-to-date publicly available transcriptome database: the GigaTON (Gigas TranscriptOme pipeliNe) database. We assembled 2204 million sequences obtained from 114 publicly available RNA-seq libraries that were realized using all embryo-larval development stages, adult organs, different environmental stressors including heavy metals, temperature, salinity and exposure to air, which were mostly performed as part of the Crassostrea gigas genome project. This data was analyzed in silico and resulted into 56621 newly assembled contigs that were deposited into a publicly available database, the GigaTON database. This database also provides powerful and user-friendly request tools to browse and retrieve information about annotation, expression level, UTRs, splice and polymorphism, and gene ontology associated to all the contigs into each, and between all libraries. The GigaTON database provides a convenient, potent and versatile interface to browse, retrieve, confront and compare massive transcriptomic information in an extensive range of conditions, tissues and developmental stages in Crassostrea gigas. To our knowledge, the GigaTON database constitutes the most extensive transcriptomic database to date in marine invertebrates, thereby a new reference transcriptome in the oyster, a highly valuable resource to physiologists and evolutionary biologists.

  8. Comparison of a teratogenic transcriptome-based predictive test based on human embryonic versus inducible pluripotent stem cells.

    PubMed

    Shinde, Vaibhav; Perumal Srinivasan, Sureshkumar; Henry, Margit; Rotshteyn, Tamara; Hescheler, Jürgen; Rahnenführer, Jörg; Grinberg, Marianna; Meisig, Johannes; Blüthgen, Nils; Waldmann, Tanja; Leist, Marcel; Hengstler, Jan Georg; Sachinidis, Agapios

    2016-12-30

    Human embryonic stem cells (hESCs) partially recapitulate early embryonic three germ layer development, allowing testing of potential teratogenic hazards. Because use of hESCs is ethically debated, we investigated the potential for human induced pluripotent stem cells (hiPSCs) to replace hESCs in such tests. Three cell lines, comprising hiPSCs (foreskin and IMR90) and hESCs (H9) were differentiated for 14 days. Their transcriptome profiles were obtained on day 0 and day 14 and analyzed by comprehensive bioinformatics tools. The transcriptomes on day 14 showed that more than 70% of the "developmental genes" (regulated genes with > 2-fold change on day 14 compared to day 0) exhibited variability among cell lines. The developmental genes belonging to all three cell lines captured biological processes and KEGG pathways related to all three germ layer embryonic development. In addition, transcriptome profiles were obtained after 14 days of exposure to teratogenic valproic acid (VPA) during differentiation. Although the differentially regulated genes between treated and untreated samples showed more than 90% variability among cell lines, VPA clearly antagonized the expression of developmental genes in all cell lines: suppressing upregulated developmental genes, while inducing downregulated ones. To quantify VPA-disturbed development based on developmental genes, we estimated the "developmental potency" (D p ) and "developmental index" (D i ). Despite differences in genes deregulated by VPA, uniform D i values were obtained for all three cell lines. Given that the D i values for VPA were similar for hESCs and hiPSCs, D i can be used for robust hazard identification, irrespective of whether hESCs or hiPSCs are used in the test systems.

  9. Transcriptome analysis of duck liver and identification of differentially expressed transcripts in response to duck hepatitis A virus genotype C infection.

    PubMed

    Tang, Cheng; Lan, Daoliang; Zhang, Huanrong; Ma, Jing; Yue, Hua

    2013-01-01

    Duck is an economically important poultry and animal model for human viral hepatitis B. However, the molecular mechanisms underlying host-virus interaction remain unclear because of limited information on the duck genome. This study aims to characterize the duck normal liver transcriptome and to identify the differentially expressed transcripts at 24 h after duck hepatitis A virus genotype C (DHAV-C) infection using Illumina-Solexa sequencing. After removal of low-quality sequences and assembly, a total of 52,757 unigenes was obtained from the normal liver group. Further blast analysis showed that 18,918 unigenes successfully matched the known genes in the database. GO analysis revealed that 25,116 unigenes took part in 61 categories of biological processes, cellular components, and molecular functions. Among the 25 clusters of orthologous group categories (COG), the cluster for "General function prediction only" represented the largest group, followed by "Transcription" and "Replication, recombination, and repair." KEGG analysis showed that 17,628 unigenes were involved in 301 pathways. Through comparison of normal and infected transcriptome data, we identified 20 significantly differentially expressed unigenes, which were further confirmed by real-time polymerase chain reaction. Of the 20 unigenes, nine matched the known genes in the database, including three up-regulated genes (virus replicase polyprotein, LRRC3B, and PCK1) and six down-regulated genes (CRP, AICL-like 2, L1CAM, CYB26A1, CHAC1, and ADAM32). The remaining 11 novel unigenes that did not match any known genes in the database may provide a basis for the discovery of new transcripts associated with infection. This study provided a gene expression pattern for normal duck liver and for the previously unrecognized changes in gene transcription that are altered during DHAV-C infection. Our data revealed useful information for future studies on the duck genome and provided new insights into the molecular mechanism of host-DHAV-C interaction.

  10. Generation of a foveomacular transcriptome

    PubMed Central

    Bernstein, Steven; Wong, Paul W.

    2014-01-01

    Purpose Organizing molecular biologic data is a growing challenge since the rate of data accumulation is steadily increasing. Information relevant to a particular biologic query can be difficult to extract from the comprehensive databases currently available. We present a data collection and organization model designed to ameliorate these problems and applied it to generate an expressed sequence tag (EST)–based foveomacular transcriptome. Methods Using Perl, MySQL, EST libraries, screening, and human foveomacular gene expression as a model system, we generated a foveomacular transcriptome database enriched for molecularly relevant data. Results Using foveomacula as a gene expression model tissue, we identified and organized 6,056 genes expressed in that tissue. Of those identified genes, 3,480 had not been previously described as expressed in the foveomacula. Internal experimental controls as well as comparison of our data set to published data sets suggest we do not yet have a complete description of the foveomacula transcriptome. Conclusions We present an organizational method designed to amplify the utility of data pertinent to a specific research interest. Our method is generic enough to be applicable to a variety of conditions yet focused enough to allow for specialized study. PMID:24991187

  11. Transcriptome Analysis of Houttuynia cordata Thunb. by Illumina Paired-End RNA Sequencing and SSR Marker Discovery

    PubMed Central

    Wei, Lin; Li, Shenghua; Liu, Shenggui; He, Anna; Wang, Dan; Wang, Jie; Tang, Yulian; Wu, Xianjin

    2014-01-01

    Background Houttuynia cordata Thunb. is an important traditional medical herb in China and other Asian countries, with high medicinal and economic value. However, a lack of available genomic information has become a limitation for research on this species. Thus, we carried out high-throughput transcriptomic sequencing of H. cordata to generate an enormous transcriptome sequence dataset for gene discovery and molecular marker development. Principal Findings Illumina paired-end sequencing technology produced over 56 million sequencing reads from H. cordata mRNA. Subsequent de novo assembly yielded 63,954 unigenes, 39,982 (62.52%) and 26,122 (40.84%) of which had significant similarity to proteins in the NCBI nonredundant protein and Swiss-Prot databases (E-value <10−5), respectively. Of these annotated unigenes, 30,131 and 15,363 unigenes were assigned to gene ontology categories and clusters of orthologous groups, respectively. In addition, 24,434 (38.21%) unigenes were mapped onto 128 pathways using the KEGG pathway database and 17,964 (44.93%) unigenes showed homology to Vitis vinifera (Vitaceae) genes in BLASTx analysis. Furthermore, 4,800 cDNA SSRs were identified as potential molecular markers. Fifty primer pairs were randomly selected to detect polymorphism among 30 samples of H. cordata; 43 (86%) produced fragments of expected size, suggesting that the unigenes were suitable for specific primer design and of high quality, and the SSR marker could be widely used in marker-assisted selection and molecular breeding of H. cordata in the future. Conclusions This is the first application of Illumina paired-end sequencing technology to investigate the whole transcriptome of H. cordata and to assemble RNA-seq reads without a reference genome. These data should help researchers investigating the evolution and biological processes of this species. The SSR markers developed can be used for construction of high-resolution genetic linkage maps and for gene-based association analyses in H. cordata. This work will enable future functional genomic research and research into the distinctive active constituents of this genus. PMID:24392108

  12. Characterization and analysis of a transcriptome from the boreal spider crab Hyas araneus.

    PubMed

    Harms, Lars; Frickenhaus, Stephan; Schiffer, Melanie; Mark, Felix C; Storch, Daniela; Pörtner, Hans-Otto; Held, Christoph; Lucassen, Magnus

    2013-12-01

    Research investigating the genetic basis of physiological responses has significantly broadened our understanding of the mechanisms underlying organismic response to environmental change. However, genomic data are currently available for few taxa only, thus excluding physiological model species from this approach. In this study we report the transcriptome of the model organism Hyas araneus from Spitsbergen (Arctic). We generated 20,479 transcripts, using the 454 GS FLX sequencing technology in combination with an Illumina HiSeq sequencing approach. Annotation by Blastx revealed 7159 blast hits in the NCBI non-redundant protein database. The comparison between the spider crab H. araneus transcriptome and EST libraries of the European lobster Homarus americanus and the porcelain crab Petrolisthes cinctipes yielded 3229/2581 sequences with a significant hit, respectively. The clustering by the Markov Clustering Algorithm (MCL) revealed a common core of 1710 clusters present in all three species and 5903 unique clusters for H. araneus. The combined sequencing approaches generated transcripts that will greatly expand the limited genomic data available for crustaceans. We introduce the MCL clustering for transcriptome comparisons as a simple approach to estimate similarities between transcriptomic libraries of different size and quality and to analyze homologies within the selected group of species. In particular, we identified a large variety of reverse transcriptase (RT) sequences not only in the H. araneus transcriptome and other decapod crustaceans, but also sea urchin, supporting the hypothesis of a heritable, anti-viral immunity and the proposed viral fragment integration by host-derived RTs in marine invertebrates. © 2013.

  13. Ginseng Genome Database: an open-access platform for genomics of Panax ginseng.

    PubMed

    Jayakodi, Murukarthick; Choi, Beom-Soon; Lee, Sang-Choon; Kim, Nam-Hoon; Park, Jee Young; Jang, Woojong; Lakshmanan, Meiyappan; Mohan, Shobhana V G; Lee, Dong-Yup; Yang, Tae-Jin

    2018-04-12

    The ginseng (Panax ginseng C.A. Meyer) is a perennial herbaceous plant that has been used in traditional oriental medicine for thousands of years. Ginsenosides, which have significant pharmacological effects on human health, are the foremost bioactive constituents in this plant. Having realized the importance of this plant to humans, an integrated omics resource becomes indispensable to facilitate genomic research, molecular breeding and pharmacological study of this herb. The first draft genome sequences of P. ginseng cultivar "Chunpoong" were reported recently. Here, using the draft genome, transcriptome, and functional annotation datasets of P. ginseng, we have constructed the Ginseng Genome Database http://ginsengdb.snu.ac.kr /, the first open-access platform to provide comprehensive genomic resources of P. ginseng. The current version of this database provides the most up-to-date draft genome sequence (of approximately 3000 Mbp of scaffold sequences) along with the structural and functional annotations for 59,352 genes and digital expression of genes based on transcriptome data from different tissues, growth stages and treatments. In addition, tools for visualization and the genomic data from various analyses are provided. All data in the database were manually curated and integrated within a user-friendly query page. This database provides valuable resources for a range of research fields related to P. ginseng and other species belonging to the Apiales order as well as for plant research communities in general. Ginseng genome database can be accessed at http://ginsengdb.snu.ac.kr /.

  14. Association between stress-related sleep reactivity and cognitive processes in insomnia disorder and insomnia subgroups: preliminary results.

    PubMed

    Palagini, Laura; Faraguna, Ugo; Mauri, Mauro; Gronchi, Alessia; Morin, Charles M; Riemann, Dieter

    2016-03-01

    Stress-related sleep reactivity, sleep-related cognitions, and psychological factors play an important role in insomnia. The aim was to investigate their possible association in Insomnia Disorder, insomnia subgroups, and healthy subjects. The cross-sectional study consisted of 93 subjects who met diagnostic criteria for Insomnia Disorder according to Diagnostic and Statistical Manual of Mental Disorders, 5th Edition (DSM-5) and of 30 healthy subjects. Survey instruments included the Insomnia Severity Index (ISI), Pittsburgh Sleep Quality Index (PSQI), Ford Insomnia Response to Stress Test (FIRST), Dysfunctional Beliefs about Sleep scale (DBAS), Beck Depression Inventory (BDI), and Zung Self-Rating Anxiety Scale (SAS). Descriptive statistics, Pearson correlations, χ(2)-test, and multiple linear regression were performed. FIRST and SAS best determined the insomnia subjects vs good sleepers (FIRST χ(2) = 109.6, p <0.001, SAS χ(2) = 120.3, p <0.001). FIRST was best predicted by DBAS (p <0.001), PSQI (p <0.001), and SAS by PSQI (p <0.001), ISI (p <0.05), BDI (p <0.001). In the sleep onset subgroup FIRST was related to ISI, PSQI, and DBAS and in the combined subgroup with DBAS. In both subgroups SAS was related to PSQI, ISI, and BDI. Findings suggest potential implications: (1) among the factors that may contribute to insomnia, stress-related sleep reactivity, and psychological factors, such as anxiety symptoms, may distinguish insomnia subjects from good sleepers; (2) sleep reactivity and sleep-related cognitions seem interrelated, unhelpful beliefs may affect the stress reactivity; (3) psychological factors may influence sleep quality and the severity of insomnia; (4) these important sleep-related variables may have similar associations in insomnia subgroups; they may constitute the core factors for insomnia development and maintenance. Copyright © 2015 Elsevier B.V. All rights reserved.

  15. Comparative transcriptome analysis of shoot and root tissue of Bacopa monnieri identifies potential genes related to triterpenoid saponin biosynthesis.

    PubMed

    Jeena, Gajendra Singh; Fatima, Shahnoor; Tripathi, Pragya; Upadhyay, Swati; Shukla, Rakesh Kumar

    2017-06-28

    Bacopa monnieri commonly known as Brahmi is utilized in Ayurveda to improve memory and many other human health benefits. Bacosides enriched standardized extract of Bacopa monnieri is being marketed as a memory enhancing agent. In spite of its well known pharmacological properties it is not much studied in terms of transcripts involved in biosynthetic pathway and its regulation that controls the secondary metabolic pathway in this plant. The aim of this study was to identify the potential transcripts and provide a framework of identified transcripts involved in bacosides production through transcriptome assembly. We performed comparative transcriptome analysis of shoot and root tissue of Bacopa monnieri in two independent biological replicate and obtained 22.48 million and 22.0 million high quality processed reads in shoot and root respectively. After de novo assembly and quantitative assessment total 26,412 genes got annotated in root and 18,500 genes annotated in shoot sample. Quality of raw reads was determined by using SeqQC-V2.2. Assembled sequences were annotated using BLASTX against public database such as NR or UniProt. Searching against the KEGG pathway database indicated that 37,918 unigenes from root and 35,130 unigenes from shoot were mapped to 133 KEGG pathways. Based on the DGE data we found that most of the transcript related to CYP450s and UDP-glucosyltransferases were specifically upregulated in shoot tissue as compared to root tissue. Finally, we have selected 43 transcripts related to secondary metabolism including transcription factor families which are differentially expressed in shoot and root tissues were validated by qRT-PCR and their expression level were monitored after MeJA treatment and wounding for 1, 3 and 5 h. This study not only represents the first de novo transcriptome analysis of Bacopa monnieri but also provides information about the identification, expression and differential tissues specific distribution of transcripts related to triterpenoid sapogenin which is one of the most important pharmacologically active secondary metabolite present in Bacopa monnieri. The identified transcripts in this study will establish a foundation for future studies related to carrying out the metabolic engineering for increasing the bacosides biosynthesis and its regulation for human health benefits.

  16. ASGARD: an open-access database of annotated transcriptomes for emerging model arthropod species.

    PubMed

    Zeng, Victor; Extavour, Cassandra G

    2012-01-01

    The increased throughput and decreased cost of next-generation sequencing (NGS) have shifted the bottleneck genomic research from sequencing to annotation, analysis and accessibility. This is particularly challenging for research communities working on organisms that lack the basic infrastructure of a sequenced genome, or an efficient way to utilize whatever sequence data may be available. Here we present a new database, the Assembled Searchable Giant Arthropod Read Database (ASGARD). This database is a repository and search engine for transcriptomic data from arthropods that are of high interest to multiple research communities but currently lack sequenced genomes. We demonstrate the functionality and utility of ASGARD using de novo assembled transcriptomes from the milkweed bug Oncopeltus fasciatus, the cricket Gryllus bimaculatus and the amphipod crustacean Parhyale hawaiensis. We have annotated these transcriptomes to assign putative orthology, coding region determination, protein domain identification and Gene Ontology (GO) term annotation to all possible assembly products. ASGARD allows users to search all assemblies by orthology annotation, GO term annotation or Basic Local Alignment Search Tool. User-friendly features of ASGARD include search term auto-completion suggestions based on database content, the ability to download assembly product sequences in FASTA format, direct links to NCBI data for predicted orthologs and graphical representation of the location of protein domains and matches to similar sequences from the NCBI non-redundant database. ASGARD will be a useful repository for transcriptome data from future NGS studies on these and other emerging model arthropods, regardless of sequencing platform, assembly or annotation status. This database thus provides easy, one-stop access to multi-species annotated transcriptome information. We anticipate that this database will be useful for members of multiple research communities, including developmental biology, physiology, evolutionary biology, ecology, comparative genomics and phylogenomics. Database URL: asgard.rc.fas.harvard.edu.

  17. Defining the Human Macula Transcriptome and Candidate Retinal Disease Genes UsingEyeSAGE

    PubMed Central

    Rickman, Catherine Bowes; Ebright, Jessica N.; Zavodni, Zachary J.; Yu, Ling; Wang, Tianyuan; Daiger, Stephen P.; Wistow, Graeme; Boon, Kathy; Hauser, Michael A.

    2009-01-01

    Purpose To develop large-scale, high-throughput annotation of the human macula transcriptome and to identify and prioritize candidate genes for inherited retinal dystrophies, based on ocular-expression profiles using serial analysis of gene expression (SAGE). Methods Two human retina and two retinal pigment epithelium (RPE)/choroid SAGE libraries made from matched macula or midperipheral retina and adjacent RPE/choroid of morphologically normal 28- to 66-year-old donors and a human central retina longSAGE library made from 41- to 66-year-old donors were generated. Their transcription profiles were entered into a relational database, EyeSAGE, including microarray expression profiles of retina and publicly available normal human tissue SAGE libraries. EyeSAGE was used to identify retina- and RPE-specific and -associated genes, and candidate genes for retina and RPE disease loci. Differential and/or cell-type specific expression was validated by quantitative and single-cell RT-PCR. Results Cone photoreceptor-associated gene expression was elevated in the macula transcription profiles. Analysis of the longSAGE retina tags enhanced tag-to-gene mapping and revealed alternatively spliced genes. Analysis of candidate gene expression tables for the identified Bardet-Biedl syndrome disease gene (BBS5) in the BBS5 disease region table yielded BBS5 as the top candidate. Compelling candidates for inherited retina diseases were identified. Conclusions The EyeSAGE database, combining three different gene-profiling platforms including the authors’ multidonor-derived retina/RPE SAGE libraries and existing single-donor retina/RPE libraries, is a powerful resource for definition of the retina and RPE transcriptomes. It can be used to identify retina-specific genes, including alternatively spliced transcripts and to prioritize candidate genes within mapped retinal disease regions. PMID:16723438

  18. Defining the human macula transcriptome and candidate retinal disease genes using EyeSAGE.

    PubMed

    Bowes Rickman, Catherine; Ebright, Jessica N; Zavodni, Zachary J; Yu, Ling; Wang, Tianyuan; Daiger, Stephen P; Wistow, Graeme; Boon, Kathy; Hauser, Michael A

    2006-06-01

    To develop large-scale, high-throughput annotation of the human macula transcriptome and to identify and prioritize candidate genes for inherited retinal dystrophies, based on ocular-expression profiles using serial analysis of gene expression (SAGE). Two human retina and two retinal pigment epithelium (RPE)/choroid SAGE libraries made from matched macula or midperipheral retina and adjacent RPE/choroid of morphologically normal 28- to 66-year-old donors and a human central retina longSAGE library made from 41- to 66-year-old donors were generated. Their transcription profiles were entered into a relational database, EyeSAGE, including microarray expression profiles of retina and publicly available normal human tissue SAGE libraries. EyeSAGE was used to identify retina- and RPE-specific and -associated genes, and candidate genes for retina and RPE disease loci. Differential and/or cell-type specific expression was validated by quantitative and single-cell RT-PCR. Cone photoreceptor-associated gene expression was elevated in the macula transcription profiles. Analysis of the longSAGE retina tags enhanced tag-to-gene mapping and revealed alternatively spliced genes. Analysis of candidate gene expression tables for the identified Bardet-Biedl syndrome disease gene (BBS5) in the BBS5 disease region table yielded BBS5 as the top candidate. Compelling candidates for inherited retina diseases were identified. The EyeSAGE database, combining three different gene-profiling platforms including the authors' multidonor-derived retina/RPE SAGE libraries and existing single-donor retina/RPE libraries, is a powerful resource for definition of the retina and RPE transcriptomes. It can be used to identify retina-specific genes, including alternatively spliced transcripts and to prioritize candidate genes within mapped retinal disease regions.

  19. Novel transcriptome assembly and improved annotation of the whiteleg shrimp (Litopenaeus vannamei), a dominant crustacean in global seafood mariculture.

    PubMed

    Ghaffari, Noushin; Sanchez-Flores, Alejandro; Doan, Ryan; Garcia-Orozco, Karina D; Chen, Patricia L; Ochoa-Leyva, Adrian; Lopez-Zavala, Alonso A; Carrasco, J Salvador; Hong, Chris; Brieba, Luis G; Rudiño-Piñera, Enrique; Blood, Philip D; Sawyer, Jason E; Johnson, Charles D; Dindot, Scott V; Sotelo-Mundo, Rogerio R; Criscitiello, Michael F

    2014-11-25

    We present a new transcriptome assembly of the Pacific whiteleg shrimp (Litopenaeus vannamei), the species most farmed for human consumption. Its functional annotation, a substantial improvement over previous ones, is provided freely. RNA-Seq with Illumina HiSeq technology was used to analyze samples extracted from shrimp abdominal muscle, hepatopancreas, gills and pleopods. We used the Trinity and Trinotate software suites for transcriptome assembly and annotation, respectively. The quality of this assembly and the affiliated targeted homology searches greatly enrich the curated transcripts currently available in public databases for this species. Comparison with the model arthropod Daphnia allows some insights into defining characteristics of decapod crustaceans. This large-scale gene discovery gives the broadest depth yet to the annotated transcriptome of this important species and should be of value to ongoing genomics and immunogenetic resistance studies in this shrimp of paramount global economic importance.

  20. Title: Comparative transcriptome profiling of the human and mouse dorsal root ganglia: an RNA-seq-based resource for pain and sensory neuroscience research.

    PubMed

    Ray, Pradipta; Torck, Andrew; Quigley, Lilyana; Wangzhou, Andi; Neiman, Matthew; Rao, Chandranshu; Lam, Tiffany; Kim, Ji-Young; Kim, Tae Hoon; Zhang, Michael Q; Dussor, Gregory; Price, Theodore J

    2018-03-20

    Molecular neurobiological insight into human nervous tissues is needed to generate next generation therapeutics for neurological disorders like chronic pain. We obtained human Dorsal Root Ganglia (DRG) samples from organ donors and performed RNA-sequencing (RNA-seq) to study the human DRG (hDRG) transcriptional landscape, systematically comparing it with publicly available data from a variety of human and orthologous mouse tissues, including mouse DRG (mDRG). We characterized the hDRG transcriptional profile in terms of tissue-restricted gene co-expression patterns and putative transcriptional regulators, and formulated an information-theoretic framework to quantify DRG enrichment. Relevant gene families and pathways were also analyzed, including transcription factors (TFs), g-protein coupled receptors (GCPRs) and ion channels. Our analyses reveal a hDRG-enriched protein-coding gene set (∼140), some of which have not been described in the context of DRG or pain signaling. A majority of these show conserved enrichment in mDRG, and were mined for known drug - gene product interactions. Conserved enrichment of the vast majority of TFs suggest that the mDRG is a faithful model system for studying hDRGs, due to evolutionarily conserved regulatory programs. Comparison of hDRG and tibial nerve transcriptomes suggest trafficking of neuronal mRNA to axons in adult hDRG, and are consistent with studies of axonal transport in rodent sensory neurons. We present our work as an online, searchable repository (https://www.utdallas.edu/bbs/painneurosciencelab/sensoryomics/drgtxome), creating a resource for the community. Our analyses provide insight into DRG biology for guiding development of novel therapeutics, and a blueprint for cross-species transcriptomic analyses.

  1. The transcriptome of Legionella pneumophila-infected human monocyte-derived macrophages.

    PubMed

    Price, Christopher T D; Abu Kwaik, Yousef

    2014-01-01

    Legionella pneumophila is an intracellular bacterial pathogen that invades and replicates within alveolar macrophages through injection of ∼ 300 effector proteins by its Dot/Icm type IV translocation apparatus. The bona fide F-box protein, AnkB, is a nutritional virulence effector that triggers macrophages to generate a surplus of amino acids, which is essential for intravacuolar proliferation. Therefore, the ankB mutant represents a novel genetic tool to determine the transcriptional response of human monocyte-derived macrophages (hMDMs) to actively replicating L. pneumophila. Here, we utilized total human gene microarrays to determine the global transcriptional response of hMDMs to infection by wild type or the ankB mutant of L. pneumophila. The transcriptomes of hMDMs infected with either actively proliferating wild type or non-replicative ankB mutant bacteria were remarkably similar. The transcriptome of infected hMDMs was predominated by up-regulation of inflammatory pathways (IL-10 anti-inflammatory, interferon signaling and amphoterin signaling), anti-apoptosis, and down-regulation of protein synthesis pathways. In addition, L. pneumophila modulated diverse metabolic pathways, particularly those associated with bio-active lipid metabolism, and SLC amino acid transporters expression. Taken together, the hMDM transcriptional response to L. pneumophila is independent of intra-vacuolar replication of the bacteria and primarily involves modulation of the immune response and metabolic as well as nutritional pathways.

  2. Transcriptome analysis reveals determinant stages controlling human embryonic stem cell commitment to neuronal cells.

    PubMed

    Li, Yuanyuan; Wang, Ran; Qiao, Nan; Peng, Guangdun; Zhang, Ke; Tang, Ke; Han, Jing-Dong J; Jing, Naihe

    2017-12-01

    Proper neural commitment is essential for ensuring the appropriate development of the human brain and for preventing neurodevelopmental diseases such as autism spectrum disorders, schizophrenia, and intellectual disorders. However, the molecular mechanisms underlying the neural commitment in humans remain elusive. Here, we report the establishment of a neural differentiation system based on human embryonic stem cells (hESCs) and on comprehensive RNA sequencing analysis of transcriptome dynamics during early hESC differentiation. Using weighted gene co-expression network analysis, we reveal that the hESC neurodevelopmental trajectory has five stages: pluripotency (day 0); differentiation initiation (days 2, 4, and 6); neural commitment (days 8-10); neural progenitor cell proliferation (days 12, 14, and 16); and neuronal differentiation (days 18, 20, and 22). These stages were characterized by unique module genes, which may recapitulate the early human cortical development. Moreover, a comparison of our RNA-sequencing data with several other transcriptome profiling datasets from mice and humans indicated that Module 3 associated with the day 8-10 stage is a critical window of fate switch from the pluripotency to the neural lineage. Interestingly, at this stage, no key extrinsic signals were activated. In contrast, using CRISPR/Cas9-mediated gene knockouts, we also found that intrinsic hub transcription factors, including the schizophrenia-associated SIX3 gene and septo-optic dysplasia-related HESX1 gene, are required to program hESC neural determination. Our results improve the understanding of the mechanism of neural commitment in the human brain and may help elucidate the etiology of human mental disorders and advance therapies for managing these conditions. © 2017 by The American Society for Biochemistry and Molecular Biology, Inc.

  3. The retina/RPE proteome in chick myopia and hyperopia models: Commonalities with inherited and age-related ocular pathologies

    PubMed Central

    Riddell, Nina; Faou, Pierre; Murphy, Melanie; Giummarra, Loretta; Downs, Rachael A.; Rajapaksha, Harinda

    2017-01-01

    Purpose Microarray and RNA sequencing studies in the chick model of early optically induced refractive error have implicated thousands of genes, many of which have also been linked to ocular pathologies in humans, including age-related macular degeneration (AMD), choroidal neovascularization, glaucoma, and cataract. These findings highlight the potential relevance of the chick model to understanding both refractive error development and the progression to secondary pathological complications. The present study aimed to determine whether proteomic responses to early optical defocus in the chick share similarities with these transcriptome-level changes, particularly in terms of dysregulation of pathology-related molecular processes. Methods Chicks were assigned to a lens condition (monocular +10 D [diopters] to induce hyperopia, −10 D to induce myopia, or no lens) on post-hatch day 5. Biometric measures were collected following a further 6 h and 48 h of rearing. The retina/RPE was then removed and prepared for liquid chromatography-electrospray ionization-tandem mass spectrometry (LC-ESI-MS/MS) on an LTQ-Orbitrap Elite. Raw data were processed using MaxQuant, and differentially abundant proteins were identified using moderated t tests (fold change ≥1.5, Benjamini-Hochberg adjusted p<0.05). These differentially abundant proteins were compared with the genes and proteins implicated in previous exploratory transcriptome and proteomic studies of refractive error, as well as the genes and proteins linked to the ocular pathologies listed above for which myopia or hyperopia are risk factors. Finally, gene set enrichment analysis (GSEA) was used to assess whether gene sets from the Human Phenotype Ontology database were enriched in the lens groups relative to the no lens groups, and at the top or bottom of the protein data ranked by Spearman’s correlation with refraction at 6 and 48 h. Results Refractive errors of −2.63 D ± 0.31 D (mean ± standard error, SE) and 3.90 D ± 0.37 D were evident in the negative and positive lens groups, respectively, at 6 h. By 48 h, refractive compensation to both lens types was almost complete (negative lens −9.70 D ± 0.41 D, positive lens 7.70 D ± 0.44 D). More than 140 differentially abundant proteins were identified in each lens group relative to the no lens controls at both time points. No proteins were differentially abundant between the negative and positive lens groups at 6 h, and 13 were differentially abundant at 48 h. As there was substantial overlap in the proteins implicated across the six comparisons, a total of 390 differentially abundant proteins were identified. Sixty-five of these 390 proteins had previously been implicated in transcriptome studies of refractive error animal models, and 42 had previously been associated with AMD, choroidal neovascularization, glaucoma, and/or cataract in humans. The overlap of differentially abundant proteins with AMD-associated genes and proteins was statistically significant for all conditions (Benjamini-Hochberg adjusted p<0.05), with over-representation analysis implicating ontologies related to oxidative stress, cholesterol homeostasis, and melanin biosynthesis. GSEA identified significant enrichment of genes associated with abnormal electroretinogram, photophobia, and nyctalopia phenotypes in the proteins negatively correlated with ocular refraction across the lens groups at 6 h. The implicated proteins were primarily linked to photoreceptor dystrophies and mitochondrial disorders in humans. Conclusions Optical defocus in the chicks induces rapid changes in the abundance of many proteins in the retina/RPE that have previously been linked to inherited and age-related ocular pathologies in humans. Similar changes have been identified in a meta-analysis of chick refractive error transcriptome studies, highlighting the chick as a model for the study of optically induced stress with possible relevance to understanding the development of a range of pathological states in humans. PMID:29259393

  4. DBATE: database of alternative transcripts expression.

    PubMed

    Bianchi, Valerio; Colantoni, Alessio; Calderone, Alberto; Ausiello, Gabriele; Ferrè, Fabrizio; Helmer-Citterich, Manuela

    2013-01-01

    The use of high-throughput RNA sequencing technology (RNA-seq) allows whole transcriptome analysis, providing an unbiased and unabridged view of alternative transcript expression. Coupling splicing variant-specific expression with its functional inference is still an open and difficult issue for which we created the DataBase of Alternative Transcripts Expression (DBATE), a web-based repository storing expression values and functional annotation of alternative splicing variants. We processed 13 large RNA-seq panels from human healthy tissues and in disease conditions, reporting expression levels and functional annotations gathered and integrated from different sources for each splicing variant, using a variant-specific annotation transfer pipeline. The possibility to perform complex queries by cross-referencing different functional annotations permits the retrieval of desired subsets of splicing variant expression values that can be visualized in several ways, from simple to more informative. DBATE is intended as a novel tool to help appreciate how, and possibly why, the transcriptome expression is shaped. DATABASE URL: http://bioinformatica.uniroma2.it/DBATE/.

  5. Comorbid insomnia symptoms predict lower 6-month adherence to CPAP in US veterans with obstructive sleep apnea.

    PubMed

    Wallace, Douglas M; Sawyer, A M; Shafazand, S

    2018-03-01

    There is limited information on the association between pre-treatment insomnia symptoms and dysfunctional sleep beliefs with continuous positive airway pressure (CPAP) adherence in veterans with obstructive sleep apnea (OSA). Our aims were to describe demographic and sleep characteristics of veterans with and without comorbid insomnia and determine whether pre-treatment insomnia symptoms and dysfunctional sleep beliefs predict CPAP use after 6 months of therapy. Hispanic veterans attending the Miami VA sleep clinic were recruited and completed the insomnia severity index, the dysfunctional sleep belief and attitude scale (DBAS), and other questionnaires. Participants were asked to return after 7 days and 1 and 6 months to repeat questionnaires and for objective CPAP adherence download. Hierarchical regression models were performed to determine adjusted associations of pre-treatment insomnia symptoms and DBAS sub-scores on 6-month mean daily CPAP use. Fifty-three participants completed the 6-month follow-up visit with a mean CPAP use of 3.4 ± 1.9 h. Veterans with comorbid insomnia had lower mean daily CPAP use (168 ± 125 vs 237 ± 108 min, p = 0.04) and lower percent daily CPAP use ≥ 4 h (32 ± 32 vs 51 ± 32%, p = 0.05) compared to participants without insomnia. In adjusted analyses, pre-treatment insomnia symptoms (early, late, and aggregated nocturnal symptoms) and sleep dissatisfaction were predictive of lower CPAP use at 6 months. Pre-treatment dysfunctional sleep beliefs were not associated with CPAP adherence. Pre-treatment nocturnal insomnia symptoms and sleep dissatisfaction predicted poorer 6- month CPAP use. Insomnia treatment preceding or concurrent with CPAP initiation may eliminate a barrier to regular use.

  6. Transcriptomic Analysis of the Primary Roots of Alhagi sparsifolia in Response to Water Stress

    PubMed Central

    Pei, Xinwu; Zhang, Chao; Jia, Shirong; Li, Weimin

    2015-01-01

    Background Alhagi sparsifolia is a typical desert phreatophyte and has evolved to withstand extreme dry, cold and hot weather. While A. sparsifolia represents an ideal model to study the molecular mechanism of plant adaption to abiotic stress, no research has been done in this aspect to date. Here we took advantage of Illumina platform to survey transcriptome in primary roots of A. sparsifolia under water stress conditions in aim to facilitate the exploration of its genetic basis for drought tolerance. Methodology and Principal Findings We sequenced four primary roots samples individually collected at 0, 6, 24 and 30h from the A. sparsifolia seedlings in the course of 24h of water stress following 6h of rehydration. The resulting 38,763,230, 67,511,150, 49,259,804 and 54,744,906 clean reads were pooled and assembled into 33,255 unigenes with an average length of 1,057 bp. All-unigenes were subjected to functional annotation by searching against the public databases. Based on the established transcriptome database, we further evaluated the gene expression profiles in the four different primary roots samples, and identified numbers of differently expressed genes (DEGs) reflecting the early response to water stress (6h vs. 0h), the late response to water stress (24h vs. 0h) and the response to post water stress rehydration (30h vs. 24h). Moreover, the DEGs specifically regulated at 6, 24 and 30h were captured in order to depict the dynamic changes of gene expression during water stress and subsequent rehydration. Functional categorization of the DEGs indicated the activation of oxidoreductase system, and particularly emphasized the significance of the ‘Glutathione metabolism pathway’ in response to water stress. Conclusions This is the first description of the genetic makeup of A. sparsifolia, thus providing a substantial contribution to the sequence resources for this species. The identified DEGs offer a deep insight into the molecular mechanism of A. sparsifolia in response to water stress, and merit further investigation. PMID:25822368

  7. Reptilian Transcriptomes v2.0: An Extensive Resource for Sauropsida Genomics and Transcriptomics

    PubMed Central

    Tzika, Athanasia C.; Ullate-Agote, Asier; Grbic, Djordje; Milinkovitch, Michel C.

    2015-01-01

    Despite the availability of deep-sequencing techniques, genomic and transcriptomic data remain unevenly distributed across phylogenetic groups. For example, reptiles are poorly represented in sequence databases, hindering functional evolutionary and developmental studies in these lineages substantially more diverse than mammals. In addition, different studies use different assembly and annotation protocols, inhibiting meaningful comparisons. Here, we present the “Reptilian Transcriptomes Database 2.0,” which provides extensive annotation of transcriptomes and genomes from species covering the major reptilian lineages. To this end, we sequenced normalized complementary DNA libraries of multiple adult tissues and various embryonic stages of the leopard gecko and the corn snake and gathered published reptilian sequence data sets from representatives of the four extant orders of reptiles: Squamata (snakes and lizards), the tuatara, crocodiles, and turtles. The LANE runner 2.0 software was implemented to annotate all assemblies within a single integrated pipeline. We show that this approach increases the annotation completeness of the assembled transcriptomes/genomes. We then built large concatenated protein alignments of single-copy genes and inferred phylogenetic trees that support the positions of turtles and the tuatara as sister groups of Archosauria and Squamata, respectively. The Reptilian Transcriptomes Database 2.0 resource will be updated to include selected new data sets as they become available, thus making it a reference for differential expression studies, comparative genomics and transcriptomics, linkage mapping, molecular ecology, and phylogenomic analyses involving reptiles. The database is available at www.reptilian-transcriptomes.org and can be enquired using a wwwblast server installed at the University of Geneva. PMID:26133641

  8. Ethyl carbamate induces cell death through its effects on multiple metabolic pathways.

    PubMed

    Liu, Huichang; Cui, Bo; Xu, Yi; Hu, Chaoyang; Liu, Ying; Qu, Guorun; Li, Dawei; Wu, Yongning; Zhang, Dabing; Quan, Sheng; Shi, Jianxin

    2017-11-01

    Ethyl carbamate (EC), a multisite carcinogenic chemical causing tumors in various animal species, is probably carcinogenic to humans. However, information about the possible carcinogenic and toxicological effects of EC in humans is quite limited. Because EC is found in many dietary foods (such as fermented foods) and tobacco and its products, and exposure of humans to EC often occurs inevitably, its toxicological effects in humans need to be studied. This study was conducted to understand the metabolomic and transcriptomic changes in human hepatocellular carcinoma cells (HepG2) exposed to 100 mM EC for short term (4 h) and long term (12 h) period, respectively. The results revealed multiple influences of EC on the metabolome and transcriptome of HepG2 cells, which was exposure time-dependent and well correlated with the kinetic changes of cell viability and mortality. EC treatment affected multiple metabolic pathways, inducing oxidative stress, reducing detoxification capacity, depleting energy, decreasing reducing power, disrupting membrane integrity, and damaging DNA and protein. These metabolomic and transcriptomic biomarkers of EC on human cell metabolism identified in this study would facilitate further studies on the risk assessment and the mitigation of dietary EC. Copyright © 2017 Elsevier B.V. All rights reserved.

  9. Space Debris Alert System for Aviation

    NASA Astrophysics Data System (ADS)

    Sgobba, Tommaso

    2013-09-01

    Despite increasing efforts to accurately predict space debris re-entry, the exact time and location of re-entry is still very uncertain. Partially, this is due to a skipping effect uncontrolled spacecraft may experience as they enter the atmosphere at a shallow angle. Such effect difficult to model depends on atmospheric variations of density. When the bouncing off ends and atmospheric re-entry starts, the trajectory and the overall location of surviving fragments can be precisely predicted but the time to impact with ground, or to reach the airspace, becomes very short.Different is the case of a functional space system performing controlled re-entry. Suitable forecasts methods are available to clear air and maritime traffic from hazard areas (so-called traffic segregation).In US, following the Space Shuttle Columbia accident in 2003, a re-entry hazard areas location forecast system was putted in place for the specific case of major malfunction of a Reusable Launch Vehicles (RLV) at re-entry. The Shuttle Hazard Area to Aircraft Calculator (SHAAC) is a system based on ground equipment and software analyses and prediction tools, which require trained personnel and close coordination between the organization responsible for RLV operation (NASA for Shuttle) and the Federal Aviation Administration. The system very much relies on the operator's capability to determine that a major malfunction has occurred.This paper presents a US pending patent by the European Space Agency, which consists of a "smart fragment" using a GPS localizer together with pre- computed debris footprint area and direct broadcasting of such hazard areas.The risk for aviation from falling debris is very remote but catastrophic. Suspending flight over vast swath of airspace for every re-entering spacecraft or rocket upper stage, which is a weekly occurrence, would be extremely costly and disruptive.The Re-entry Direct Broadcasting Alert System (R- DBAS) is an original merging and evolution of the Re- entry Breakup Recorder (REBR) concept developed by The Aerospace Corporation, often called the black box of spacecraft, and of the Shuttle Hazard Area to Aircraft Calculator (SHAAC). Unlike the REBR, whichdownloads data via satellite link for later analysis, the R-DBAS is intended as a direct communication tool with the end user. As a spacecraft carrying R-DBAS re- enters into the atmosphere, it relays a message with the coordinates of the falling debris footprint area to anyone with a receiver and a display like laptop or iPad, warning them of the hazard.Much like the REBR, the R-DBAS is designed to release from its host vehicle when it experiences significant heat which melts the attachment point and closes the power circuit. Once activated, the R-DBAS determines its own location and computes the final coordinates of the preloaded debris footprint which is then broadcasted to anyone holding a receiver in the proximity of the hazard area.The R-DBAS is intended to provide precise information directly to the cockpit. An airplane would have about 5- 7 minutes to get out of the way. Being the hazard area 1,000-2000 km long but very narrow, 30 -70 km, an escape manoeuvre from the risky area can be readily performed or go on holding before crossing the hazard area.By equipping aircraft and other vulnerable systems with a simple receivers that can be attached to a common laptop, escape manoeuvres can be performed as in front of bad weather or shelter can be taken by people on ground.

  10. A GRAVITATIONAL REDSHIFT DETERMINATION OF THE MEAN MASS OF WHITE DWARFS: DBA AND DB STARS

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Falcon, Ross E.; Winget, D. E.; Montgomery, M. H.

    2012-10-01

    We measure apparent velocities (v{sub app}) of absorption lines for 36 white dwarfs (WDs) with helium-dominated atmospheres-16 DBAs and 20 DBs-using optical spectra taken for the European Southern Observatory SN Ia progenitor survey. We find a difference of 6.9 {+-} 6.9 km s{sup -1} in the average apparent velocity of the H{alpha} lines versus that of the He I 5876 A lines for our DBAs. This is a measure of the blueshift of this He line due to pressure effects. By using this as a correction, we extend the gravitational redshift method employed by Falcon et al. to use themore » apparent velocity of the He I 5876 A line and conduct the first gravitational redshift investigation of a group of WDs without visible hydrogen lines. We use biweight estimators to find an average apparent velocity, (v{sub app}){sub BI}, (and hence average gravitational redshift, (v{sub g}){sub BI}) for our WDs; from that we derive an average mass, (M){sub BI}. For the DBAs, we find (v{sub app}){sub BI} = 40.8 {+-} 4.7 km s{sup -1} and derive (M){sub BI} = 0.71{sup +0.04}{sub -0.05} M{sub Sun }. Though different from (v{sub app}) of DAs (32.57 km s{sup -1}) at the 91% confidence level and suggestive of a larger DBA mean mass than that for normal DAs derived using the same method (0.647{sup +0.013}{sub -0.014} M{sub Sun }; Falcon et al.), we do not claim this as a stringent detection. Rather, we emphasize that the difference between (v{sub app}){sub BI} of the DBAs and (v{sub app}) of normal DAs is no larger than 9.2 km s{sup -1}, at the 95% confidence level; this corresponds to roughly 0.10 M{sub Sun }. For the DBs, we find (v {sup He}{sub app}){sub BI} = 42.9 {+-} 8.49 km s{sup -1} after applying the blueshift correction and determine (M){sub BI} = 0.74{sup +0.08}{sub -0.09} M{sub Sun }. The difference between (v{sup He}{sub app}){sub BI} of the DBs and (v{sub app}) of DAs is {<=}11.5 km s{sup -1} ({approx}0.12 M{sub Sun }), at the 95% confidence level. The gravitational redshift method indicates much larger mean masses than the spectroscopic determinations of the same sample by Voss et al. Given the small sample sizes, it is possible that systematic uncertainties are skewing our results due to the potential of kinematic substructures that may not average out. We estimate this to be unlikely, but a larger sample size is necessary to rule out these systematics.« less

  11. SCPortalen: human and mouse single-cell centric database

    PubMed Central

    Noguchi, Shuhei; Böttcher, Michael; Hasegawa, Akira; Kouno, Tsukasa; Kato, Sachi; Tada, Yuhki; Ura, Hiroki; Abe, Kuniya; Shin, Jay W; Plessy, Charles; Carninci, Piero

    2018-01-01

    Abstract Published single-cell datasets are rich resources for investigators who want to address questions not originally asked by the creators of the datasets. The single-cell datasets might be obtained by different protocols and diverse analysis strategies. The main challenge in utilizing such single-cell data is how we can make the various large-scale datasets to be comparable and reusable in a different context. To challenge this issue, we developed the single-cell centric database ‘SCPortalen’ (http://single-cell.clst.riken.jp/). The current version of the database covers human and mouse single-cell transcriptomics datasets that are publicly available from the INSDC sites. The original metadata was manually curated and single-cell samples were annotated with standard ontology terms. Following that, common quality assessment procedures were conducted to check the quality of the raw sequence. Furthermore, primary data processing of the raw data followed by advanced analyses and interpretation have been performed from scratch using our pipeline. In addition to the transcriptomics data, SCPortalen provides access to single-cell image files whenever available. The target users of SCPortalen are all researchers interested in specific cell types or population heterogeneity. Through the web interface of SCPortalen users are easily able to search, explore and download the single-cell datasets of their interests. PMID:29045713

  12. Human-specific features of spatial gene expression and regulation in eight brain regions.

    PubMed

    Xu, Chuan; Li, Qian; Efimova, Olga; He, Liu; Tatsumoto, Shoji; Stepanova, Vita; Oishi, Takao; Udono, Toshifumi; Yamaguchi, Katsushi; Shigenobu, Shuji; Kakita, Akiyoshi; Nawa, Hiroyuki; Khaitovich, Philipp; Go, Yasuhiro

    2018-06-13

    Molecular maps of the human brain alone do not inform us of the features unique to humans. Yet, the identification of these features is important for understanding both the evolution and nature of human cognition. Here, we approached this question by analyzing gene expression and H3K27ac chromatin modification data collected in eight brain regions of humans, chimpanzees, gorillas, a gibbon and macaques. An analysis of spatial transcriptome trajectories across eight brain regions in four primate species revealed 1,851 genes showing human-specific transcriptome differences in one or multiple brain regions, in contrast to 240 chimpanzee-specific ones. More than half of these human-specific differences represented elevated expression of genes enriched in neuronal and astrocytic markers in the human hippocampus, while the rest were enriched in microglial markers and displayed human-specific expression in several frontal cortical regions and the cerebellum. An analysis of the predicted regulatory interactions driving these differences revealed the role of transcription factors in species-specific transcriptome changes, while epigenetic modifications were linked to spatial expression differences conserved across species. Published by Cold Spring Harbor Laboratory Press.

  13. De novo sequencing and analysis of the transcriptome during the browning of fresh-cut Luffa cylindrica 'Fusi-3' fruits.

    PubMed

    Zhu, Haisheng; Liu, Jianting; Wen, Qingfang; Chen, Mindong; Wang, Bin; Zhang, Qianrong; Xue, Zhuzheng

    2017-01-01

    Fresh-cut luffa (Luffa cylindrica) fruits commonly undergo browning. However, little is known about the molecular mechanisms regulating this process. We used the RNA-seq technique to analyze the transcriptomic changes occurring during the browning of fresh-cut fruits from luffa cultivar 'Fusi-3'. Over 90 million high-quality reads were assembled into 58,073 Unigenes, and 60.86% of these were annotated based on sequences in four public databases. We detected 35,282 Unigenes with significant hits to sequences in the NCBInr database, and 24,427 Unigenes encoded proteins with sequences that were similar to those of known proteins in the Swiss-Prot database. Additionally, 20,546 and 13,021 Unigenes were similar to existing sequences in the Eukaryotic Orthologous Groups of proteins and Kyoto Encyclopedia of Genes and Genomes databases, respectively. Furthermore, 27,301 Unigenes were differentially expressed during the browning of fresh-cut luffa fruits (i.e., after 1-6 h). Moreover, 11 genes from five gene families (i.e., PPO, PAL, POD, CAT, and SOD) identified as potentially associated with enzymatic browning as well as four WRKY transcription factors were observed to be differentially regulated in fresh-cut luffa fruits. With the assistance of rapid amplification of cDNA ends technology, we obtained the full-length sequences of the 15 Unigenes. We also confirmed these Unigenes were expressed by quantitative real-time polymerase chain reaction analysis. This study provides a comprehensive transcriptome sequence resource, and may facilitate further studies aimed at identifying genes affecting luffa fruit browning for the exploitation of the underlying mechanism.

  14. IDPT: Insights into potential intrinsically disordered proteins through transcriptomic analysis of genes for prostate carcinoma epigenetic data.

    PubMed

    Mallik, Saurav; Sen, Sagnik; Maulik, Ujjwal

    2016-07-15

    Involvement of intrinsically disordered proteins (IDPs) with various dreadful diseases like cancer is an interesting research topic. In order to gain novel insights into the regulation of IDPs, in this article, we perform a transcriptomic analysis of mRNAs (genes) for transcripts encoding IDPs on a human multi-omics prostate carcinoma dataset having both gene expression and methylation data. In this regard, firstly the genes that consist of both the expression and methylation data, and that are corresponding to the cancer-related prostate-tissue-specific disordered proteins of MobiDb database, are selected. We apply standard t-test for determining differentially expressed genes as well as differentially methylated genes. A network having these genes and their targeter miRNAs from Diana Tarbase v7.0 database and corresponding Transcription Factors from TRANSFAC and ITFP databases, is then built. Thereafter, we perform literature search, and KEGG pathway and Gene Ontology analyses using DAVID database. Finally, we report several significant potential gene-markers (with the corresponding IDPs) that have inverse relationship between differential expression and methylation patterns, and that are hub genes of the TF-miRNA-gene network. Copyright © 2016 Elsevier B.V. All rights reserved.

  15. TCW: Transcriptome Computational Workbench

    PubMed Central

    Soderlund, Carol; Nelson, William; Willer, Mark; Gang, David R.

    2013-01-01

    Background The analysis of transcriptome data involves many steps and various programs, along with organization of large amounts of data and results. Without a methodical approach for storage, analysis and query, the resulting ad hoc analysis can lead to human error, loss of data and results, inefficient use of time, and lack of verifiability, repeatability, and extensibility. Methodology The Transcriptome Computational Workbench (TCW) provides Java graphical interfaces for methodical analysis for both single and comparative transcriptome data without the use of a reference genome (e.g. for non-model organisms). The singleTCW interface steps the user through importing transcript sequences (e.g. Illumina) or assembling long sequences (e.g. Sanger, 454, transcripts), annotating the sequences, and performing differential expression analysis using published statistical programs in R. The data, metadata, and results are stored in a MySQL database. The multiTCW interface builds a comparison database by importing sequence and annotation from one or more single TCW databases, executes the ESTscan program to translate the sequences into proteins, and then incorporates one or more clusterings, where the clustering options are to execute the orthoMCL program, compute transitive closure, or import clusters. Both singleTCW and multiTCW allow extensive query and display of the results, where singleTCW displays the alignment of annotation hits to transcript sequences, and multiTCW displays multiple transcript alignments with MUSCLE or pairwise alignments. The query programs can be executed on the desktop for fastest analysis, or from the web for sharing the results. Conclusion It is now affordable to buy a multi-processor machine, and easy to install Java and MySQL. By simply downloading the TCW, the user can interactively analyze, query and view their data. The TCW allows in-depth data mining of the results, which can lead to a better understanding of the transcriptome. TCW is freely available from www.agcol.arizona.edu/software/tcw. PMID:23874959

  16. TCW: transcriptome computational workbench.

    PubMed

    Soderlund, Carol; Nelson, William; Willer, Mark; Gang, David R

    2013-01-01

    The analysis of transcriptome data involves many steps and various programs, along with organization of large amounts of data and results. Without a methodical approach for storage, analysis and query, the resulting ad hoc analysis can lead to human error, loss of data and results, inefficient use of time, and lack of verifiability, repeatability, and extensibility. The Transcriptome Computational Workbench (TCW) provides Java graphical interfaces for methodical analysis for both single and comparative transcriptome data without the use of a reference genome (e.g. for non-model organisms). The singleTCW interface steps the user through importing transcript sequences (e.g. Illumina) or assembling long sequences (e.g. Sanger, 454, transcripts), annotating the sequences, and performing differential expression analysis using published statistical programs in R. The data, metadata, and results are stored in a MySQL database. The multiTCW interface builds a comparison database by importing sequence and annotation from one or more single TCW databases, executes the ESTscan program to translate the sequences into proteins, and then incorporates one or more clusterings, where the clustering options are to execute the orthoMCL program, compute transitive closure, or import clusters. Both singleTCW and multiTCW allow extensive query and display of the results, where singleTCW displays the alignment of annotation hits to transcript sequences, and multiTCW displays multiple transcript alignments with MUSCLE or pairwise alignments. The query programs can be executed on the desktop for fastest analysis, or from the web for sharing the results. It is now affordable to buy a multi-processor machine, and easy to install Java and MySQL. By simply downloading the TCW, the user can interactively analyze, query and view their data. The TCW allows in-depth data mining of the results, which can lead to a better understanding of the transcriptome. TCW is freely available from www.agcol.arizona.edu/software/tcw.

  17. Identification and Validation of Human Missing Proteins and Peptides in Public Proteome Databases: Data Mining Strategy.

    PubMed

    Elguoshy, Amr; Hirao, Yoshitoshi; Xu, Bo; Saito, Suguru; Quadery, Ali F; Yamamoto, Keiko; Mitsui, Toshiaki; Yamamoto, Tadashi

    2017-12-01

    In an attempt to complete human proteome project (HPP), Chromosome-Centric Human Proteome Project (C-HPP) launched the journey of missing protein (MP) investigation in 2012. However, 2579 and 572 protein entries in the neXtProt (2017-1) are still considered as missing and uncertain proteins, respectively. Thus, in this study, we proposed a pipeline to analyze, identify, and validate human missing and uncertain proteins in open-access transcriptomics and proteomics databases. Analysis of RNA expression pattern for missing proteins in Human protein Atlas showed that 28% of them, such as Olfactory receptor 1I1 ( O60431 ), had no RNA expression, suggesting the necessity to consider uncommon tissues for transcriptomic and proteomic studies. Interestingly, 21% had elevated expression level in a particular tissue (tissue-enriched proteins), indicating the importance of targeting such proteins in their elevated tissues. Additionally, the analysis of RNA expression level for missing proteins showed that 95% had no or low expression level (0-10 transcripts per million), indicating that low abundance is one of the major obstacles facing the detection of missing proteins. Moreover, missing proteins are predicted to generate fewer predicted unique tryptic peptides than the identified proteins. Searching for these predicted unique tryptic peptides that correspond to missing and uncertain proteins in the experimental peptide list of open-access MS-based databases (PA, GPM) resulted in the detection of 402 missing and 19 uncertain proteins with at least two unique peptides (≥9 aa) at <(5 × 10 -4 )% FDR. Finally, matching the native spectra for the experimentally detected peptides with their SRMAtlas synthetic counterparts at three transition sources (QQQ, QTOF, QTRAP) gave us an opportunity to validate 41 missing proteins by ≥2 proteotypic peptides.

  18. Getting the most out of parasitic helminth transcriptomes using HelmDB: implications for biology and biotechnology.

    PubMed

    Mangiola, Stefano; Young, Neil D; Korhonen, Pasi; Mondal, Alinda; Scheerlinck, Jean-Pierre; Sternberg, Paul W; Cantacessi, Cinzia; Hall, Ross S; Jex, Aaron R; Gasser, Robin B

    2013-12-01

    Compounded by a massive global food shortage, many parasitic diseases have a devastating, long-term impact on animal and human health and welfare worldwide. Parasitic helminths (worms) affect the health of billions of animals. Unlocking the systems biology of these neglected pathogens will underpin the design of new and improved interventions against them. Currently, the functional annotation of genomic and transcriptomic sequence data for socio-economically important parasitic worms relies almost exclusively on comparative bioinformatic analyses using model organism- and other databases. However, many genes and gene products of parasitic helminths (often >50%) cannot be annotated using this approach, because they are specific to parasites and/or do not have identifiable homologs in other organisms for which sequence data are available. This inability to fully annotate transcriptomes and predicted proteomes is a major challenge and constrains our understanding of the biology of parasites, interactions with their hosts and of parasitism and the pathogenesis of disease on a molecular level. In the present article, we compiled transcriptomic data sets of key, socioeconomically important parasitic helminths, and constructed and validated a curated database, called HelmDB (www.helmdb.org). We demonstrate how this database can be used effectively for the improvement of functional annotation by employing data integration and clustering. Importantly, HelmDB provides a practical and user-friendly toolkit for sequence browsing and comparative analyses among divergent helminth groups (including nematodes and trematodes), and should be readily adaptable and applicable to a wide range of other organisms. This web-based, integrative database should assist 'systems biology' studies of parasitic helminths, and the discovery and prioritization of novel drug and vaccine targets. This focus provides a pathway toward developing new and improved approaches for the treatment and control of parasitic diseases, with the potential for important biotechnological outcomes. Copyright © 2012 Elsevier Inc. All rights reserved.

  19. The effect of skin fatty acids on Staphylococcus aureus.

    PubMed

    Neumann, Yvonne; Ohlsen, Knut; Donat, Stefanie; Engelmann, Susanne; Kusch, Harald; Albrecht, Dirk; Cartron, Michael; Hurd, Alexander; Foster, Simon J

    2015-03-01

    Staphylococcus aureus is a commensal of the human nose and skin. Human skin fatty acids, in particular cis-6-hexadecenoic acid (C-6-H), have high antistaphylococcal activity and can inhibit virulence determinant production. Here, we show that sub-MIC levels of C-6-H result in induction of increased resistance. The mechanism(s) of C-6-H activity was investigated by combined transcriptome and proteome analyses. Proteome analysis demonstrated a pleiotropic effect of C-6-H on virulence determinant production. In response to C-6-H, transcriptomics revealed altered expression of over 500 genes, involved in many aspects of virulence and cellular physiology. The expression of toxins (hla, hlb, hlgBC) was reduced, whereas that of host defence evasion components (cap, sspAB, katA) was increased. In particular, members of the SaeRS regulon had highly reduced expression, and the use of specific mutants revealed that the effect on toxin production is likely mediated via SaeRS.

  20. The aquatic animals' transcriptome resource for comparative functional analysis.

    PubMed

    Chou, Chih-Hung; Huang, Hsi-Yuan; Huang, Wei-Chih; Hsu, Sheng-Da; Hsiao, Chung-Der; Liu, Chia-Yu; Chen, Yu-Hung; Liu, Yu-Chen; Huang, Wei-Yun; Lee, Meng-Lin; Chen, Yi-Chang; Huang, Hsien-Da

    2018-05-09

    Aquatic animals have great economic and ecological importance. Among them, non-model organisms have been studied regarding eco-toxicity, stress biology, and environmental adaptation. Due to recent advances in next-generation sequencing techniques, large amounts of RNA-seq data for aquatic animals are publicly available. However, currently there is no comprehensive resource exist for the analysis, unification, and integration of these datasets. This study utilizes computational approaches to build a new resource of transcriptomic maps for aquatic animals. This aquatic animal transcriptome map database dbATM provides de novo assembly of transcriptome, gene annotation and comparative analysis of more than twenty aquatic organisms without draft genome. To improve the assembly quality, three computational tools (Trinity, Oases and SOAPdenovo-Trans) were employed to enhance individual transcriptome assembly, and CAP3 and CD-HIT-EST software were then used to merge these three assembled transcriptomes. In addition, functional annotation analysis provides valuable clues to gene characteristics, including full-length transcript coding regions, conserved domains, gene ontology and KEGG pathways. Furthermore, all aquatic animal genes are essential for comparative genomics tasks such as constructing homologous gene groups and blast databases and phylogenetic analysis. In conclusion, we establish a resource for non model organism aquatic animals, which is great economic and ecological importance and provide transcriptomic information including functional annotation and comparative transcriptome analysis. The database is now publically accessible through the URL http://dbATM.mbc.nctu.edu.tw/ .

  1. Proteogenomics: Integrating Next-Generation Sequencing and Mass Spectrometry to Characterize Human Proteomic Variation

    NASA Astrophysics Data System (ADS)

    Sheynkman, Gloria M.; Shortreed, Michael R.; Cesnik, Anthony J.; Smith, Lloyd M.

    2016-06-01

    Mass spectrometry-based proteomics has emerged as the leading method for detection, quantification, and characterization of proteins. Nearly all proteomic workflows rely on proteomic databases to identify peptides and proteins, but these databases typically contain a generic set of proteins that lack variations unique to a given sample, precluding their detection. Fortunately, proteogenomics enables the detection of such proteomic variations and can be defined, broadly, as the use of nucleotide sequences to generate candidate protein sequences for mass spectrometry database searching. Proteogenomics is experiencing heightened significance due to two developments: (a) advances in DNA sequencing technologies that have made complete sequencing of human genomes and transcriptomes routine, and (b) the unveiling of the tremendous complexity of the human proteome as expressed at the levels of genes, cells, tissues, individuals, and populations. We review here the field of human proteogenomics, with an emphasis on its history, current implementations, the types of proteomic variations it reveals, and several important applications.

  2. Proteogenomics: Integrating Next-Generation Sequencing and Mass Spectrometry to Characterize Human Proteomic Variation

    PubMed Central

    Sheynkman, Gloria M.; Shortreed, Michael R.; Cesnik, Anthony J.; Smith, Lloyd M.

    2016-01-01

    Mass spectrometry–based proteomics has emerged as the leading method for detection, quantification, and characterization of proteins. Nearly all proteomic workflows rely on proteomic databases to identify peptides and proteins, but these databases typically contain a generic set of proteins that lack variations unique to a given sample, precluding their detection. Fortunately, proteogenomics enables the detection of such proteomic variations and can be defined, broadly, as the use of nucleotide sequences to generate candidate protein sequences for mass spectrometry database searching. Proteogenomics is experiencing heightened significance due to two developments: (a) advances in DNA sequencing technologies that have made complete sequencing of human genomes and transcriptomes routine, and (b) the unveiling of the tremendous complexity of the human proteome as expressed at the levels of genes, cells, tissues, individuals, and populations. We review here the field of human proteogenomics, with an emphasis on its history, current implementations, the types of proteomic variations it reveals, and several important applications. PMID:27049631

  3. Identification of an HV 1 voltage-gated proton channel in insects.

    PubMed

    Chaves, Gustavo; Derst, Christian; Franzen, Arne; Mashimo, Yuta; Machida, Ryuichiro; Musset, Boris

    2016-04-01

    The voltage-gated proton channel 1 (HV 1) is an important component of the cellular proton extrusion machinery and is essential for charge compensation during the respiratory burst of phagocytes. HV 1 has been identified in a wide range of eukaryotes throughout the animal kingdom, with the exception of insects. Therefore, it has been proposed that insects do not possess an HV 1 channel. In the present study, we report the existence of an HV 1-type proton channel in insects. We searched insect transcriptome shotgun assembly (TSA) sequence databases and found putative HV 1 orthologues in various polyneopteran insects. To confirm that these putative HV 1 orthologues were functional channels, we studied the HV 1 channel of Nicoletia phytophila (NpHV 1), an insect of the Zygentoma order, in more detail. NpHV 1 comprises 239 amino acids and is 33% identical to the human voltage-gated proton channel 1. Patch clamp measurements in a heterologous expression system showed proton selectivity, as well as pH- and voltage-dependent gating. Interestingly, NpHV 1 shows slightly enhanced pH-dependent gating compared to the human channel. Mutations in the first transmembrane segment at position 66 (Asp66), the presumed selectivity filter, lead to a loss of proton-selective conduction, confirming the importance of this aspartate residue in voltage-gated proton channels. Nucleotide sequence data have been deposited in the GenBank database under accession number KT780722. © 2016 Federation of European Biochemical Societies.

  4. De novo transcriptomic analysis of hydrogen production in the green alga Chlamydomonas moewusii through RNA-Seq

    PubMed Central

    2013-01-01

    Background Microalgae can make a significant contribution towards meeting global renewable energy needs in both carbon-based and hydrogen (H2) biofuel. The development of energy-related products from algae could be accelerated with improvements in systems biology tools, and recent advances in sequencing technology provide a platform for enhanced transcriptomic analyses. However, these techniques are still heavily reliant upon available genomic sequence data. Chlamydomonas moewusii is a unicellular green alga capable of evolving molecular H2 under both dark and light anaerobic conditions, and has high hydrogenase activity that can be rapidly induced. However, to date, there is no systematic investigation of transcriptomic profiling during induction of H2 photoproduction in this organism. Results In this work, RNA-Seq was applied to investigate transcriptomic profiles during the dark anaerobic induction of H2 photoproduction. 156 million reads generated from 7 samples were then used for de novo assembly after data trimming. BlastX results against NCBI database and Blast2GO results were used to interpret the functions of the assembled 34,136 contigs, which were then used as the reference contigs for RNA-Seq analysis. Our results indicated that more contigs were differentially expressed during the period of early and higher H2 photoproduction, and fewer contigs were differentially expressed when H2-photoproduction rates decreased. In addition, C. moewusii and C. reinhardtii share core functional pathways, and transcripts for H2 photoproduction and anaerobic metabolite production were identified in both organisms. C. moewusii also possesses similar metabolic flexibility as C. reinhardtii, and the difference between C. moewusii and C. reinhardtii on hydrogenase expression and anaerobic fermentative pathways involved in redox balancing may explain their different profiles of hydrogenase activity and secreted anaerobic metabolites. Conclusions Herein, we have described a workflow using commercial software to analyze RNA-Seq data without reference genome sequence information, which can be applied to other unsequenced microorganisms. This study provided biological insights into the anaerobic fermentation and H2 photoproduction of C. moewusii, and the first transcriptomic RNA-Seq dataset of C. moewusii generated in this study also offer baseline data for further investigation (e.g. regulatory proteins related to fermentative pathway discussed in this study) of this organism as a H2-photoproduction strain. PMID:23971877

  5. Visual analytics techniques for large multi-attribute time series data

    NASA Astrophysics Data System (ADS)

    Hao, Ming C.; Dayal, Umeshwar; Keim, Daniel A.

    2008-01-01

    Time series data commonly occur when variables are monitored over time. Many real-world applications involve the comparison of long time series across multiple variables (multi-attributes). Often business people want to compare this year's monthly sales with last year's sales to make decisions. Data warehouse administrators (DBAs) want to know their daily data loading job performance. DBAs need to detect the outliers early enough to act upon them. In this paper, two new visual analytic techniques are introduced: The color cell-based Visual Time Series Line Charts and Maps highlight significant changes over time in a long time series data and the new Visual Content Query facilitates finding the contents and histories of interesting patterns and anomalies, which leads to root cause identification. We have applied both methods to two real-world applications to mine enterprise data warehouse and customer credit card fraud data to illustrate the wide applicability and usefulness of these techniques.

  6. De novo sequencing and analysis of the transcriptome during the browning of fresh-cut Luffa cylindrica 'Fusi-3' fruits

    PubMed Central

    Chen, Mindong; Wang, Bin; Zhang, Qianrong; Xue, Zhuzheng

    2017-01-01

    Fresh-cut luffa (Luffa cylindrica) fruits commonly undergo browning. However, little is known about the molecular mechanisms regulating this process. We used the RNA-seq technique to analyze the transcriptomic changes occurring during the browning of fresh-cut fruits from luffa cultivar ‘Fusi-3’. Over 90 million high-quality reads were assembled into 58,073 Unigenes, and 60.86% of these were annotated based on sequences in four public databases. We detected 35,282 Unigenes with significant hits to sequences in the NCBInr database, and 24,427 Unigenes encoded proteins with sequences that were similar to those of known proteins in the Swiss-Prot database. Additionally, 20,546 and 13,021 Unigenes were similar to existing sequences in the Eukaryotic Orthologous Groups of proteins and Kyoto Encyclopedia of Genes and Genomes databases, respectively. Furthermore, 27,301 Unigenes were differentially expressed during the browning of fresh-cut luffa fruits (i.e., after 1–6 h). Moreover, 11 genes from five gene families (i.e., PPO, PAL, POD, CAT, and SOD) identified as potentially associated with enzymatic browning as well as four WRKY transcription factors were observed to be differentially regulated in fresh-cut luffa fruits. With the assistance of rapid amplification of cDNA ends technology, we obtained the full-length sequences of the 15 Unigenes. We also confirmed these Unigenes were expressed by quantitative real-time polymerase chain reaction analysis. This study provides a comprehensive transcriptome sequence resource, and may facilitate further studies aimed at identifying genes affecting luffa fruit browning for the exploitation of the underlying mechanism. PMID:29145430

  7. OperomeDB: A Database of Condition-Specific Transcription Units in Prokaryotic Genomes.

    PubMed

    Chetal, Kashish; Janga, Sarath Chandra

    2015-01-01

    Background. In prokaryotic organisms, a substantial fraction of adjacent genes are organized into operons-codirectionally organized genes in prokaryotic genomes with the presence of a common promoter and terminator. Although several available operon databases provide information with varying levels of reliability, very few resources provide experimentally supported results. Therefore, we believe that the biological community could benefit from having a new operon prediction database with operons predicted using next-generation RNA-seq datasets. Description. We present operomeDB, a database which provides an ensemble of all the predicted operons for bacterial genomes using available RNA-sequencing datasets across a wide range of experimental conditions. Although several studies have recently confirmed that prokaryotic operon structure is dynamic with significant alterations across environmental and experimental conditions, there are no comprehensive databases for studying such variations across prokaryotic transcriptomes. Currently our database contains nine bacterial organisms and 168 transcriptomes for which we predicted operons. User interface is simple and easy to use, in terms of visualization, downloading, and querying of data. In addition, because of its ability to load custom datasets, users can also compare their datasets with publicly available transcriptomic data of an organism. Conclusion. OperomeDB as a database should not only aid experimental groups working on transcriptome analysis of specific organisms but also enable studies related to computational and comparative operomics.

  8. Comparative Transcriptome Analysis of Genes Involved in Anthocyanin Biosynthesis in the Red and Yellow Fruits of Sweet Cherry (Prunus avium L.)

    PubMed Central

    Wei, Hairong; Chen, Xin; Zong, Xiaojuan; Shu, Huairui; Gao, Dongsheng; Liu, Qingzhong

    2015-01-01

    Background Fruit color is one of the most important economic traits of the sweet cherry (Prunus avium L.). The red coloration of sweet cherry fruit is mainly attributed to anthocyanins. However, limited information is available regarding the molecular mechanisms underlying anthocyanin biosynthesis and its regulation in sweet cherry. Methodology/Principal Findings In this study, a reference transcriptome of P. avium L. was sequenced and annotated to identify the transcriptional determinants of fruit color. Normalized cDNA libraries from red and yellow fruits were sequenced using the next-generation Illumina/Solexa sequencing platform and de novo assembly. Over 66 million high-quality reads were assembled into 43,128 unigenes using a combined assembly strategy. Then a total of 22,452 unigenes were compared to public databases using homology searches, and 20,095 of these unigenes were annotated in the Nr protein database. Furthermore, transcriptome differences between the four stages of fruit ripening were analyzed using Illumina digital gene expression (DGE) profiling. Biological pathway analysis revealed that 72 unigenes were involved in anthocyanin biosynthesis. The expression patterns of unigenes encoding phenylalanine ammonia-lyase (PAL), 4-coumarate-CoA ligase (4CL), chalcone synthase (CHS), chalcone isomerase (CHI), flavanone 3-hydroxylase (F3H), flavanone 3’-hydroxylase (F3’H), dihydroflavonol 4-reductase (DFR), anthocyanidin synthase (ANS) and UDP glucose: flavonol 3-O-glucosyltransferase (UFGT) during fruit ripening differed between red and yellow fruit. In addition, we identified some transcription factor families (such as MYB, bHLH and WD40) that may control anthocyanin biosynthesis. We confirmed the altered expression levels of eighteen unigenes that encode anthocyanin biosynthetic enzymes and transcription factors using quantitative real-time PCR (qRT-PCR). Conclusions/Significance The obtained sweet cherry transcriptome and DGE profiling data provide comprehensive gene expression information that lends insights into the molecular mechanisms underlying anthocyanin biosynthesis. These results will provide a platform for further functional genomic research on this fruit crop. PMID:25799516

  9. De novo transcriptome sequencing and analysis of the cereal cyst nematode, Heterodera avenae.

    PubMed

    Kumar, Mukesh; Gantasala, Nagavara Prasad; Roychowdhury, Tanmoy; Thakur, Prasoon Kumar; Banakar, Prakash; Shukla, Rohit N; Jones, Michael G K; Rao, Uma

    2014-01-01

    The cereal cyst nematode (CCN, Heterodera avenae) is a major pest of wheat (Triticum spp) that reduces crop yields in many countries. Cyst nematodes are obligate sedentary endoparasites that reproduce by amphimixis. Here, we report the first transcriptome analysis of two stages of H. avenae. After sequencing extracted RNA from pre parasitic infective juvenile and adult stages of the life cycle, 131 million Illumina high quality paired end reads were obtained which generated 27,765 contigs with N50 of 1,028 base pairs, of which 10,452 were annotated. Comparative analyses were undertaken to evaluate H. avenae sequences with those of other plant, animal and free living nematodes to identify differences in expressed genes. There were 4,431 transcripts common to H. avenae and the free living nematode Caenorhabditis elegans, and 9,462 in common with more closely related potato cyst nematode, Globodera pallida. Annotation of H. avenae carbohydrate active enzymes (CAZy) revealed fewer glycoside hydrolases (GHs) but more glycosyl transferases (GTs) and carbohydrate esterases (CEs) when compared to M. incognita. 1,280 transcripts were found to have secretory signature, presence of signal peptide and absence of transmembrane. In a comparison of genes expressed in the pre-parasitic juvenile and feeding female stages, expression levels of 30 genes with high RPKM (reads per base per kilo million) value, were analysed by qRT-PCR which confirmed the observed differences in their levels of expression levels. In addition, we have also developed a user-friendly resource, Heterodera transcriptome database (HATdb) for public access of the data generated in this study. The new data provided on the transcriptome of H. avenae adds to the genetic resources available to study plant parasitic nematodes and provides an opportunity to seek new effectors that are specifically involved in the H. avenae-cereal host interaction.

  10. De Novo Transcriptome Sequencing and Analysis of the Cereal Cyst Nematode, Heterodera avenae

    PubMed Central

    Kumar, Mukesh; Gantasala, Nagavara Prasad; Roychowdhury, Tanmoy; Thakur, Prasoon Kumar; Banakar, Prakash; Shukla, Rohit N.; Jones, Michael G. K.; Rao, Uma

    2014-01-01

    The cereal cyst nematode (CCN, Heterodera avenae) is a major pest of wheat (Triticum spp) that reduces crop yields in many countries. Cyst nematodes are obligate sedentary endoparasites that reproduce by amphimixis. Here, we report the first transcriptome analysis of two stages of H. avenae. After sequencing extracted RNA from pre parasitic infective juvenile and adult stages of the life cycle, 131 million Illumina high quality paired end reads were obtained which generated 27,765 contigs with N50 of 1,028 base pairs, of which 10,452 were annotated. Comparative analyses were undertaken to evaluate H. avenae sequences with those of other plant, animal and free living nematodes to identify differences in expressed genes. There were 4,431 transcripts common to H. avenae and the free living nematode Caenorhabditis elegans, and 9,462 in common with more closely related potato cyst nematode, Globodera pallida. Annotation of H. avenae carbohydrate active enzymes (CAZy) revealed fewer glycoside hydrolases (GHs) but more glycosyl transferases (GTs) and carbohydrate esterases (CEs) when compared to M. incognita. 1,280 transcripts were found to have secretory signature, presence of signal peptide and absence of transmembrane. In a comparison of genes expressed in the pre-parasitic juvenile and feeding female stages, expression levels of 30 genes with high RPKM (reads per base per kilo million) value, were analysed by qRT-PCR which confirmed the observed differences in their levels of expression levels. In addition, we have also developed a user-friendly resource, Heterodera transcriptome database (HATdb) for public access of the data generated in this study. The new data provided on the transcriptome of H. avenae adds to the genetic resources available to study plant parasitic nematodes and provides an opportunity to seek new effectors that are specifically involved in the H. avenae-cereal host interaction. PMID:24802510

  11. Development of Transcriptomic Resources for Interrogating the Biosynthesis of Monoterpene Indole Alkaloids in Medicinal Plant Species

    PubMed Central

    Góngora-Castillo, Elsa; Childs, Kevin L.; Fedewa, Greg; Hamilton, John P.; Liscombe, David K.; Magallanes-Lundback, Maria; Mandadi, Kranthi K.; Nims, Ezekiel; Runguphan, Weerawat; Vaillancourt, Brieanne; Varbanova-Herde, Marina; DellaPenna, Dean; McKnight, Thomas D.; O’Connor, Sarah; Buell, C. Robin

    2012-01-01

    The natural diversity of plant metabolism has long been a source for human medicines. One group of plant-derived compounds, the monoterpene indole alkaloids (MIAs), includes well-documented therapeutic agents used in the treatment of cancer (vinblastine, vincristine, camptothecin), hypertension (reserpine, ajmalicine), malaria (quinine), and as analgesics (7-hydroxymitragynine). Our understanding of the biochemical pathways that synthesize these commercially relevant compounds is incomplete due in part to a lack of molecular, genetic, and genomic resources for the identification of the genes involved in these specialized metabolic pathways. To address these limitations, we generated large-scale transcriptome sequence and expression profiles for three species of Asterids that produce medicinally important MIAs: Camptotheca acuminata, Catharanthus roseus, and Rauvolfia serpentina. Using next generation sequencing technology, we sampled the transcriptomes of these species across a diverse set of developmental tissues, and in the case of C. roseus, in cultured cells and roots following elicitor treatment. Through an iterative assembly process, we generated robust transcriptome assemblies for all three species with a substantial number of the assembled transcripts being full or near-full length. The majority of transcripts had a related sequence in either UniRef100, the Arabidopsis thaliana predicted proteome, or the Pfam protein domain database; however, we also identified transcripts that lacked similarity with entries in either database and thereby lack a known function. Representation of known genes within the MIA biosynthetic pathway was robust. As a diverse set of tissues and treatments were surveyed, expression abundances of transcripts in the three species could be estimated to reveal transcripts associated with development and response to elicitor treatment. Together, these transcriptomes and expression abundance matrices provide a rich resource for understanding plant specialized metabolism, and promotes realization of innovative production systems for plant-derived pharmaceuticals. PMID:23300689

  12. Transcriptome Analysis and Differential Gene Expression on the Testis of Orange Mud Crab, Scylla olivacea, during Sexual Maturation

    PubMed Central

    Waiho, Khor; Fazhan, Hanafiah; Shahreza, Md Sheriff; Moh, Julia Hwei Zhong; Noorbaiduri, Shaibani; Wong, Li Lian; Sinnasamy, Saranya

    2017-01-01

    Adequate genetic information is essential for sustainable crustacean fisheries and aquaculture management. The commercially important orange mud crab, Scylla olivacea, is prevalent in Southeast Asia region and is highly sought after. Although it is a suitable aquaculture candidate, full domestication of this species is hampered by the lack of knowledge about the sexual maturation process and the molecular mechanisms behind it, especially in males. To date, data on its whole genome is yet to be reported for S. olivacea. The available transcriptome data published previously on this species focus primarily on females and the role of central nervous system in reproductive development. De novo transcriptome sequencing for the testes of S. olivacea from immature, maturing and mature stages were performed. A total of approximately 144 million high-quality reads were generated and de novo assembled into 160,569 transcripts with a total length of 142.2 Mb. Approximately 15–23% of the total assembled transcripts were annotated when compared to public protein sequence databases (i.e. UniProt database, Interpro database, Pfam database and Drosophila melanogaster protein database), and GO-categorised with GO Ontology terms. A total of 156,181 high-quality Single-Nucleotide Polymorphisms (SNPs) were mined from the transcriptome data of present study. Transcriptome comparison among the testes of different maturation stages revealed one gene (beta crystallin like gene) with the most significant differential expression—up-regulated in immature stage and down-regulated in maturing and mature stages. This was further validated by qRT-PCR. In conclusion, a comprehensive transcriptome of the testis of orange mud crabs from different maturation stages were obtained. This report provides an invaluable resource for enhancing our understanding of this species’ genome structure and biology, as expressed and controlled by their gonads. PMID:28135340

  13. Suicidal ideation and insomnia symptoms in subjects with obstructive sleep apnea syndrome.

    PubMed

    Choi, Su Jung; Joo, Eun Yeon; Lee, Young Jun; Hong, Seung Bong

    2015-09-01

    Insomnia symptoms are prevalent in subjects with obstructive sleep apnea syndrome (OSA) and are important risk factors for suicidal ideation (SI). However, the significance of SI has not been clearly demonstrated in persons with both OSA and insomnia. We aimed to investigate the prevalence of SI and its relationship with insomnia symptoms, mood, and other relevant factors. A total of 117 consecutive subjects with untreated OSA (apnea-hypopnea index ≥5/h) participated in the study. They completed questionnaires regarding SI ([BDI-II], item 9), insomnia symptoms (Insomnia Severity Index [ISI]), depressive mood (modified BDI-II [mBDI-II], which excluded items on SI and sleep disturbances), dysfunctional beliefs and attitudes about sleep (DBAS), social support, and quality of life. The overall prevalence of SI was 20.5% in subjects with OSA. A total of 32 subjects (27.4%) reported significant insomnia symptoms (ISI ≥ 15). Higher SI was associated with higher scores on ISI, DBAS, and mBDI-II and lower scores on social support and quality of life questionnaires. The severity of insomnia was positively correlated with depressive mood. The relationship between SI and insomnia severity was insignificant after adjusting for depressive symptom severity. Patients with OSA may have SI and insomnia symptoms. Collinearity was observed between sleep and mood disturbances. Yet, it is remarkable to find a significant association between OSA and SI, which are additional contributions to insomnia. This study suggests the necessity of integrated approaches to SI and related factors for the comprehensive treatment of OSA. Copyright © 2015 Elsevier B.V. All rights reserved.

  14. TranscriptomeBrowser 3.0: introducing a new compendium of molecular interactions and a new visualization tool for the study of gene regulatory networks.

    PubMed

    Lepoivre, Cyrille; Bergon, Aurélie; Lopez, Fabrice; Perumal, Narayanan B; Nguyen, Catherine; Imbert, Jean; Puthier, Denis

    2012-01-31

    Deciphering gene regulatory networks by in silico approaches is a crucial step in the study of the molecular perturbations that occur in diseases. The development of regulatory maps is a tedious process requiring the comprehensive integration of various evidences scattered over biological databases. Thus, the research community would greatly benefit from having a unified database storing known and predicted molecular interactions. Furthermore, given the intrinsic complexity of the data, the development of new tools offering integrated and meaningful visualizations of molecular interactions is necessary to help users drawing new hypotheses without being overwhelmed by the density of the subsequent graph. We extend the previously developed TranscriptomeBrowser database with a set of tables containing 1,594,978 human and mouse molecular interactions. The database includes: (i) predicted regulatory interactions (computed by scanning vertebrate alignments with a set of 1,213 position weight matrices), (ii) potential regulatory interactions inferred from systematic analysis of ChIP-seq experiments, (iii) regulatory interactions curated from the literature, (iv) predicted post-transcriptional regulation by micro-RNA, (v) protein kinase-substrate interactions and (vi) physical protein-protein interactions. In order to easily retrieve and efficiently analyze these interactions, we developed In-teractomeBrowser, a graph-based knowledge browser that comes as a plug-in for Transcriptome-Browser. The first objective of InteractomeBrowser is to provide a user-friendly tool to get new insight into any gene list by providing a context-specific display of putative regulatory and physical interactions. To achieve this, InteractomeBrowser relies on a "cell compartments-based layout" that makes use of a subset of the Gene Ontology to map gene products onto relevant cell compartments. This layout is particularly powerful for visual integration of heterogeneous biological information and is a productive avenue in generating new hypotheses. The second objective of InteractomeBrowser is to fill the gap between interaction databases and dynamic modeling. It is thus compatible with the network analysis software Cytoscape and with the Gene Interaction Network simulation software (GINsim). We provide examples underlying the benefits of this visualization tool for large gene set analysis related to thymocyte differentiation. The InteractomeBrowser plugin is a powerful tool to get quick access to a knowledge database that includes both predicted and validated molecular interactions. InteractomeBrowser is available through the TranscriptomeBrowser framework and can be found at: http://tagc.univ-mrs.fr/tbrowser/. Our database is updated on a regular basis.

  15. RNA-seq analysis and de novo transcriptome assembly of Jerusalem artichoke (Helianthus tuberosus Linne).

    PubMed

    Jung, Won Yong; Lee, Sang Sook; Kim, Chul Wook; Kim, Hyun-Soon; Min, Sung Ran; Moon, Jae Sun; Kwon, Suk-Yoon; Jeon, Jae-Heung; Cho, Hye Sun

    2014-01-01

    Jerusalem artichoke (Helianthus tuberosus L.) has long been cultivated as a vegetable and as a source of fructans (inulin) for pharmaceutical applications in diabetes and obesity prevention. However, transcriptomic and genomic data for Jerusalem artichoke remain scarce. In this study, Illumina RNA sequencing (RNA-Seq) was performed on samples from Jerusalem artichoke leaves, roots, stems and two different tuber tissues (early and late tuber development). Data were used for de novo assembly and characterization of the transcriptome. In total 206,215,632 paired-end reads were generated. These were assembled into 66,322 loci with 272,548 transcripts. Loci were annotated by querying against the NCBI non-redundant, Phytozome and UniProt databases, and 40,215 loci were homologous to existing database sequences. Gene Ontology terms were assigned to 19,848 loci, 15,434 loci were matched to 25 Clusters of Eukaryotic Orthologous Groups classifications, and 11,844 loci were classified into 142 Kyoto Encyclopedia of Genes and Genomes pathways. The assembled loci also contained 10,778 potential simple sequence repeats. The newly assembled transcriptome was used to identify loci with tissue-specific differential expression patterns. In total, 670 loci exhibited tissue-specific expression, and a subset of these were confirmed using RT-PCR and qRT-PCR. Gene expression related to inulin biosynthesis in tuber tissue was also investigated. Exsiting genetic and genomic data for H. tuberosus are scarce. The sequence resources developed in this study will enable the analysis of thousands of transcripts and will thus accelerate marker-assisted breeding studies and studies of inulin biosynthesis in Jerusalem artichoke.

  16. Identification of Novel Placentally Expressed Aspartic Proteinase in Humans

    PubMed Central

    Majewska, Marta; Lipka, Aleksandra; Panasiewicz, Grzegorz; Gowkielewicz, Marek; Jozwik, Marcin; Majewski, Mariusz Krzysztof; Szafranska, Bozena

    2017-01-01

    This study presents pioneering data concerning the human pregnancy-associated glycoprotein-Like family, identified in the genome, of the term placental transcriptome and proteome. RNA-seq allowed the identification of 1364 bp hPAG-L/pep cDNA with at least 56.5% homology with other aspartic proteinases (APs). In silico analyses revealed 388 amino acids (aa) of full-length hPAG-L polypeptide precursor, with 15 aa-signal peptide, 47 aa-blocking peptide and 326 aa-mature protein, and two Asp residues (D), specific for a catalytic cleft of the APs (VVFDTGSSNLWV91-102 and AIVDTGTSLLTG274-285). Capillary sequencing identified 9330 bp of the hPAG-L gene (Gen Bank Acc. No. KX533473), composed of nine exons and eight introns. Heterologous Western blotting revealed the presence of one dominant 60 kDa isoform of the hPAG-L amongst cellular placental proteins. Detection with anti-pPAG-P and anti-Rec pPAG2 polyclonals allowed identification of the hPAG-L proteins located within regions of chorionic villi, especially within the syncytiotrophoblast of term singleton placentas. Our novel data extend the present knowledge about the human genome, as well as placental transcriptome and proteome during term pregnancy. Presumably, this may contribute to establishing a new diagnostic tool for examination of some disturbances during human pregnancy, as well as growing interest from both scientific and clinical perspectives. PMID:28594357

  17. Identification of Novel Placentally Expressed Aspartic Proteinase in Humans.

    PubMed

    Majewska, Marta; Lipka, Aleksandra; Panasiewicz, Grzegorz; Gowkielewicz, Marek; Jozwik, Marcin; Majewski, Mariusz Krzysztof; Szafranska, Bozena

    2017-06-08

    This study presents pioneering data concerning the human pregnancy-associated glycoprotein-Like family, identified in the genome, of the term placental transcriptome and proteome. RNA-seq allowed the identification of 1364 bp hPAG-L/pep cDNA with at least 56.5% homology with other aspartic proteinases (APs). In silico analyses revealed 388 amino acids (aa) of full-length hPAG-L polypeptide precursor, with 15 aa-signal peptide, 47 aa-blocking peptide and 326 aa-mature protein, and two Asp residues (D), specific for a catalytic cleft of the APs (VVFDTGSSNLWV91-102 and AIVDTGTSLLTG274-285). Capillary sequencing identified 9330 bp of the hPAG-L gene (Gen Bank Acc. No. KX533473), composed of nine exons and eight introns. Heterologous Western blotting revealed the presence of one dominant 60 kDa isoform of the hPAG-L amongst cellular placental proteins. Detection with anti-pPAG-P and anti-Rec pPAG2 polyclonals allowed identification of the hPAG-L proteins located within regions of chorionic villi, especially within the syncytiotrophoblast of term singleton placentas. Our novel data extend the present knowledge about the human genome, as well as placental transcriptome and proteome during term pregnancy. Presumably, this may contribute to establishing a new diagnostic tool for examination of some disturbances during human pregnancy, as well as growing interest from both scientific and clinical perspectives.

  18. Alternative Splicing Profile and Sex-Preferential Gene Expression in the Female and Male Pacific Abalone Haliotis discus hannai.

    PubMed

    Kim, Mi Ae; Rhee, Jae-Sung; Kim, Tae Ha; Lee, Jung Sick; Choi, Ah-Young; Choi, Beom-Soon; Choi, Ik-Young; Sohn, Young Chang

    2017-03-09

    In order to characterize the female or male transcriptome of the Pacific abalone and further increase genomic resources, we sequenced the mRNA of full-length complementary DNA (cDNA) libraries derived from pooled tissues of female and male Haliotis discus hannai by employing the Iso-Seq protocol of the PacBio RSII platform. We successfully assembled whole full-length cDNA sequences and constructed a transcriptome database that included isoform information. After clustering, a total of 15,110 and 12,145 genes that coded for proteins were identified in female and male abalones, respectively. A total of 13,057 putative orthologs were retained from each transcriptome in abalones. Overall Gene Ontology terms and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways analyzed in each database showed a similar composition between sexes. In addition, a total of 519 and 391 isoforms were genome-widely identified with at least two isoforms from female and male transcriptome databases. We found that the number of isoforms and their alternatively spliced patterns are variable and sex-dependent. This information represents the first significant contribution to sex-preferential genomic resources of the Pacific abalone. The availability of whole female and male transcriptome database and their isoform information will be useful to improve our understanding of molecular responses and also for the analysis of population dynamics in the Pacific abalone.

  19. Alternative Splicing Profile and Sex-Preferential Gene Expression in the Female and Male Pacific Abalone Haliotis discus hannai

    PubMed Central

    Kim, Mi Ae; Rhee, Jae-Sung; Kim, Tae Ha; Lee, Jung Sick; Choi, Ah-Young; Choi, Beom-Soon; Choi, Ik-Young; Sohn, Young Chang

    2017-01-01

    In order to characterize the female or male transcriptome of the Pacific abalone and further increase genomic resources, we sequenced the mRNA of full-length complementary DNA (cDNA) libraries derived from pooled tissues of female and male Haliotis discus hannai by employing the Iso-Seq protocol of the PacBio RSII platform. We successfully assembled whole full-length cDNA sequences and constructed a transcriptome database that included isoform information. After clustering, a total of 15,110 and 12,145 genes that coded for proteins were identified in female and male abalones, respectively. A total of 13,057 putative orthologs were retained from each transcriptome in abalones. Overall Gene Ontology terms and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways analyzed in each database showed a similar composition between sexes. In addition, a total of 519 and 391 isoforms were genome-widely identified with at least two isoforms from female and male transcriptome databases. We found that the number of isoforms and their alternatively spliced patterns are variable and sex-dependent. This information represents the first significant contribution to sex-preferential genomic resources of the Pacific abalone. The availability of whole female and male transcriptome database and their isoform information will be useful to improve our understanding of molecular responses and also for the analysis of population dynamics in the Pacific abalone. PMID:28282934

  20. hSAGEing: an improved SAGE-based software for identification of human tissue-specific or common tumor markers and suppressors.

    PubMed

    Yang, Cheng-Hong; Chuang, Li-Yeh; Shih, Tsung-Mu; Chang, Hsueh-Wei

    2010-12-17

    SAGE (serial analysis of gene expression) is a powerful method of analyzing gene expression for the entire transcriptome. There are currently many well-developed SAGE tools. However, the cross-comparison of different tissues is seldom addressed, thus limiting the identification of common- and tissue-specific tumor markers. To improve the SAGE mining methods, we propose a novel function for cross-tissue comparison of SAGE data by combining the mathematical set theory and logic with a unique "multi-pool method" that analyzes multiple pools of pair-wise case controls individually. When all the settings are in "inclusion", the common SAGE tag sequences are mined. When one tissue type is in "inclusion" and the other types of tissues are not in "inclusion", the selected tissue-specific SAGE tag sequences are generated. They are displayed in tags-per-million (TPM) and fold values, as well as visually displayed in four kinds of scales in a color gradient pattern. In the fold visualization display, the top scores of the SAGE tag sequences are provided, along with cluster plots. A user-defined matrix file is designed for cross-tissue comparison by selecting libraries from publically available databases or user-defined libraries. The hSAGEing tool provides a combination of friendly cross-tissue analysis and an interface for comparing SAGE libraries for the first time. Some up- or down-regulated genes with tissue-specific or common tumor markers and suppressors are identified computationally. The tool is useful and convenient for in silico cancer transcriptomic studies and is freely available at http://bio.kuas.edu.tw/hSAGEing.

  1. NeuroTransDB: highly curated and structured transcriptomic metadata for neurodegenerative diseases.

    PubMed

    Bagewadi, Shweta; Adhikari, Subash; Dhrangadhariya, Anjani; Irin, Afroza Khanam; Ebeling, Christian; Namasivayam, Aishwarya Alex; Page, Matthew; Hofmann-Apitius, Martin; Senger, Philipp

    2015-01-01

    Neurodegenerative diseases are chronic debilitating conditions, characterized by progressive loss of neurons that represent a significant health care burden as the global elderly population continues to grow. Over the past decade, high-throughput technologies such as the Affymetrix GeneChip microarrays have provided new perspectives into the pathomechanisms underlying neurodegeneration. Public transcriptomic data repositories, namely Gene Expression Omnibus and curated ArrayExpress, enable researchers to conduct integrative meta-analysis; increasing the power to detect differentially regulated genes in disease and explore patterns of gene dysregulation across biologically related studies. The reliability of retrospective, large-scale integrative analyses depends on an appropriate combination of related datasets, in turn requiring detailed meta-annotations capturing the experimental setup. In most cases, we observe huge variation in compliance to defined standards for submitted metadata in public databases. Much of the information to complete, or refine meta-annotations are distributed in the associated publications. For example, tissue preparation or comorbidity information is frequently described in an article's supplementary tables. Several value-added databases have employed additional manual efforts to overcome this limitation. However, none of these databases explicate annotations that distinguish human and animal models in neurodegeneration context. Therefore, adopting a more specific disease focus, in combination with dedicated disease ontologies, will better empower the selection of comparable studies with refined annotations to address the research question at hand. In this article, we describe the detailed development of NeuroTransDB, a manually curated database containing metadata annotations for neurodegenerative studies. The database contains more than 20 dimensions of metadata annotations within 31 mouse, 5 rat and 45 human studies, defined in collaboration with domain disease experts. We elucidate the step-by-step guidelines used to critically prioritize studies from public archives and their metadata curation and discuss the key challenges encountered. Curated metadata for Alzheimer's disease gene expression studies are available for download. Database URL: www.scai.fraunhofer.de/NeuroTransDB.html. © The Author(s) 2015. Published by Oxford University Press.

  2. NeuroTransDB: highly curated and structured transcriptomic metadata for neurodegenerative diseases

    PubMed Central

    Bagewadi, Shweta; Adhikari, Subash; Dhrangadhariya, Anjani; Irin, Afroza Khanam; Ebeling, Christian; Namasivayam, Aishwarya Alex; Page, Matthew; Hofmann-Apitius, Martin

    2015-01-01

    Neurodegenerative diseases are chronic debilitating conditions, characterized by progressive loss of neurons that represent a significant health care burden as the global elderly population continues to grow. Over the past decade, high-throughput technologies such as the Affymetrix GeneChip microarrays have provided new perspectives into the pathomechanisms underlying neurodegeneration. Public transcriptomic data repositories, namely Gene Expression Omnibus and curated ArrayExpress, enable researchers to conduct integrative meta-analysis; increasing the power to detect differentially regulated genes in disease and explore patterns of gene dysregulation across biologically related studies. The reliability of retrospective, large-scale integrative analyses depends on an appropriate combination of related datasets, in turn requiring detailed meta-annotations capturing the experimental setup. In most cases, we observe huge variation in compliance to defined standards for submitted metadata in public databases. Much of the information to complete, or refine meta-annotations are distributed in the associated publications. For example, tissue preparation or comorbidity information is frequently described in an article’s supplementary tables. Several value-added databases have employed additional manual efforts to overcome this limitation. However, none of these databases explicate annotations that distinguish human and animal models in neurodegeneration context. Therefore, adopting a more specific disease focus, in combination with dedicated disease ontologies, will better empower the selection of comparable studies with refined annotations to address the research question at hand. In this article, we describe the detailed development of NeuroTransDB, a manually curated database containing metadata annotations for neurodegenerative studies. The database contains more than 20 dimensions of metadata annotations within 31 mouse, 5 rat and 45 human studies, defined in collaboration with domain disease experts. We elucidate the step-by-step guidelines used to critically prioritize studies from public archives and their metadata curation and discuss the key challenges encountered. Curated metadata for Alzheimer’s disease gene expression studies are available for download. Database URL: www.scai.fraunhofer.de/NeuroTransDB.html PMID:26475471

  3. Cognitive mechanisms of sleep outcomes in a randomized clinical trial of internet-based cognitive behavioral therapy for insomnia.

    PubMed

    Chow, Philip I; Ingersoll, Karen S; Thorndike, Frances P; Lord, Holly R; Gonder-Frederick, Linda; Morin, Charles M; Ritterband, Lee M

    2018-07-01

    The aim of this study was to investigate in a randomized clinical trial the role of sleep-related cognitive variables in the long-term efficacy of an online, fully automated cognitive behavioral therapy intervention for insomnia (CBT-I) (Sleep Healthy Using the Internet [SHUTi]). Three hundred and three participants (M age  = 43.3 years; SD = 11.6) were randomly assigned to SHUTi or an online patient education condition and assessed at baseline, postintervention (nine weeks after baseline), and six and 12 months after the intervention period. Cognitive variables were self-reported internal and chance sleep locus of control, dysfunctional beliefs and attitudes about sleep (DBAS), sleep specific self-efficacy, and insomnia knowledge. Primary outcomes were self-reported online ratings of insomnia severity (Insomnia Severity Index), and sleep onset latency and wake after sleep onset from online sleep diaries, collected 12 months after the intervention period. Those who received SHUTi had, at postassessment, higher levels of insomnia knowledge (95% confidence interval [CI] = 0.10-0.16) and internal sleep locus of control (95% CI = 0.04-0.55) as well as lower DBAS (95% CI = 1.52-2.39) and sleep locus of control attributed to chance (95% CI = 0.15-0.71). Insomnia knowledge, chance sleep locus of control, and DBAS mediated the relationship between condition and at least one 12-month postassessment sleep outcome. Within the SHUTi condition, changes in each cognitive variable (with the exception of internal sleep locus of control) predicted improvement in at least one sleep outcome one year later. Online CBT-I may reduce the enormous public health burden of insomnia by changing underlying cognitive variables that lead to long-term changes in sleep outcomes. Published by Elsevier B.V.

  4. Global transcriptome analysis of Huperzia serrata and identification of critical genes involved in the biosynthesis of huperzine A.

    PubMed

    Yang, Mengquan; You, Wenjing; Wu, Shiwen; Fan, Zhen; Xu, Baofu; Zhu, Mulan; Li, Xuan; Xiao, Youli

    2017-03-22

    Huperzia serrata (H. serrata) is an economically important traditional Chinese herb with the notably medicinal value. As a representative member of the Lycopodiaceae family, the H. serrata produces various types of effectively bioactive lycopodium alkaloids, especially the huperzine A (HupA) which is a promising drug for Alzheimer's disease. Despite their medicinal importance, the public genomic and transcriptomic resources are very limited and the biosynthesis of HupA is largely unknown. Previous studies on comparison of 454-ESTs from H. serrata and Phlegmariurus carinatus predicted putative genes involved in lycopodium alkaloid biosynthesis, such as lysine decarboxylase like (LDC-like) protein and some CYP450s. However, these gene annotations were not carried out with further biochemical characterizations. To understand the biosynthesis of HupA and its regulation in H. serrata, a global transcriptome analysis on H. Serrata tissues was performed. In this study, we used the Illumina Highseq4000 platform to generate a substantial RNA sequencing dataset of H. serrata. A total of 40.1 Gb clean data was generated from four different tissues: root, stem, leaf, and sporangia and assembled into 181,141 unigenes. The total length, average length, N50 and GC content of unigenes were 219,520,611 bp, 1,211 bp, 2,488 bp and 42.51%, respectively. Among them, 105,516 unigenes (58.25%) were annotated by seven public databases (NR, NT, Swiss-Prot, KEGG, COG, Interpro, GO), and 54 GO terms and 3,391 transcription factors (TFs) were functionally classified, respectively. KEGG pathway analysis revealed that 72,230 unigenes were classified into 21 functional pathways. Three types of candidate enzymes, LDC, CAO and PKS, responsible for the biosynthesis of precursors of HupA were all identified in the transcripts. Four hundred and fifty-seven CYP450 genes in H. serrata were also analyzed and compared with tissue-specific gene expression. Moreover, two key classes of CYP450 genes BBE and SLS, with 23 members in total, for modification of the lycopodium alkaloid scaffold in the late two stages of biosynthesis of HupA were further evaluated. This study is the first report of global transcriptome analysis on all tissues of H. serrata, and critical genes involved in the biosynthesis of precursors and scaffold modifications of HupA were discovered and predicted. The transcriptome data from this work not only could provide an important resource for further investigating on metabolic pathways in H. serrata, but also shed light on synthetic biology study of HupA.

  5. Novel Insights into the Transcriptome of Dirofilaria immitis

    PubMed Central

    Zhang, Zhihe; Hou, Rong; Wu, Xuhang; Yang, Deying; Zhang, Runhui; Zheng, Wanpeng; Nie, Huaming; Xie, Yue; Yan, Ning; Yang, Zhi; Wang, Chengdong; Luo, Li; Liu, Li; Gu, Xiaobin; Wang, Shuxian; Peng, Xuerong; Yang, Guangyou

    2012-01-01

    Background The heartworm Dirofilaria immitis is the causal agent of cardiopulmonary dirofilariosis in dogs and cats, and also infects a wide range of wild mammals as well as humans. One bottleneck for the design of fundamentally new intervention and management strategies against D. immitis may be the currently limited knowledge of fundamental molecular aspects of D. immitis. Methodology/Principal Findings A next-generation sequencing platform combining computational approaches was employed to assess a global view of the heartworm transcriptome. A total of 20,810 unigenes (mean length  = 1,270 bp) were assembled from 22.3 million clean reads. From these, 15,698 coding sequences (CDS) were inferred, and about 85% of the unigenes had orthologs/homologs in public databases. Comparative transcriptomic study uncovered 4,157 filarial-specific genes as well as 3,795 genes potentially involved in filarial-Wolbachia symbiosis. In addition, the potential intestine transcriptome of D. immitis (1,101 genes) was mined for the first time, which might help to discover ‘hidden antigens’. Conclusions/Significance This study provides novel insights into the transcriptome of D. immitis and sheds light on its molecular processes and survival mechanisms. Furthermore, it provides a platform to discover new vaccine candidates and potential targets for new drugs against dirofilariosis. PMID:22911833

  6. Transcriptomic analysis of flower development in wintersweet (Chimonanthus praecox).

    PubMed

    Liu, Daofeng; Sui, Shunzhao; Ma, Jing; Li, Zhineng; Guo, Yulong; Luo, Dengpan; Yang, Jianfeng; Li, Mingyang

    2014-01-01

    Wintersweet (Chimonanthus praecox) is familiar as a garden plant and woody ornamental flower. On account of its unique flowering time and strong fragrance, it has a high ornamental and economic value. Despite a long history of human cultivation, our understanding of wintersweet genetics and molecular biology remains scant, reflecting a lack of basic genomic and transcriptomic data. In this study, we assembled three cDNA libraries, from three successive stages in flower development, designated as the flower bud with displayed petal, open flower and senescing flower stages. Using the Illumina RNA-Seq method, we obtained 21,412,928, 26,950,404, 24,912,954 qualified Illumina reads, respectively, for the three successive stages. The pooled reads from all three libraries were then assembled into 106,995 transcripts, 51,793 of which were annotated in the NCBI non-redundant protein database. Of these annotated sequences, 32,649 and 21,893 transcripts were assigned to gene ontology categories and clusters of orthologous groups, respectively. We could map 15,587 transcripts onto 312 pathways using the Kyoto Encyclopedia of Genes and Genomes pathway database. Based on these transcriptomic data, we obtained a large number of candidate genes that were differentially expressed at the open flower and senescing flower stages. An analysis of differentially expressed genes involved in plant hormone signal transduction pathways indicated that although flower opening and senescence may be independent of the ethylene signaling pathway in wintersweet, salicylic acid may be involved in the regulation of flower senescence. We also succeeded in isolating key genes of floral scent biosynthesis and proposed a biosynthetic pathway for monoterpenes and sesquiterpenes in wintersweet flowers, based on the annotated sequences. This comprehensive transcriptomic analysis presents fundamental information on the genes and pathways which are involved in flower development in wintersweet. And our data provided a useful database for further research of wintersweet and other Calycanthaceae family plants.

  7. Transcriptomic Analysis of Flower Development in Wintersweet (Chimonanthus praecox)

    PubMed Central

    Liu, Daofeng; Sui, Shunzhao; Ma, Jing; Li, Zhineng; Guo, Yulong; Luo, Dengpan; Yang, Jianfeng; Li, Mingyang

    2014-01-01

    Wintersweet (Chimonanthus praecox) is familiar as a garden plant and woody ornamental flower. On account of its unique flowering time and strong fragrance, it has a high ornamental and economic value. Despite a long history of human cultivation, our understanding of wintersweet genetics and molecular biology remains scant, reflecting a lack of basic genomic and transcriptomic data. In this study, we assembled three cDNA libraries, from three successive stages in flower development, designated as the flower bud with displayed petal, open flower and senescing flower stages. Using the Illumina RNA-Seq method, we obtained 21,412,928, 26,950,404, 24,912,954 qualified Illumina reads, respectively, for the three successive stages. The pooled reads from all three libraries were then assembled into 106,995 transcripts, 51,793 of which were annotated in the NCBI non-redundant protein database. Of these annotated sequences, 32,649 and 21,893 transcripts were assigned to gene ontology categories and clusters of orthologous groups, respectively. We could map 15,587 transcripts onto 312 pathways using the Kyoto Encyclopedia of Genes and Genomes pathway database. Based on these transcriptomic data, we obtained a large number of candidate genes that were differentially expressed at the open flower and senescing flower stages. An analysis of differentially expressed genes involved in plant hormone signal transduction pathways indicated that although flower opening and senescence may be independent of the ethylene signaling pathway in wintersweet, salicylic acid may be involved in the regulation of flower senescence. We also succeeded in isolating key genes of floral scent biosynthesis and proposed a biosynthetic pathway for monoterpenes and sesquiterpenes in wintersweet flowers, based on the annotated sequences. This comprehensive transcriptomic analysis presents fundamental information on the genes and pathways which are involved in flower development in wintersweet. And our data provided a useful database for further research of wintersweet and other Calycanthaceae family plants. PMID:24489818

  8. GeNNet: an integrated platform for unifying scientific workflows and graph databases for transcriptome data analysis

    PubMed Central

    Gadelha, Luiz; Ribeiro-Alves, Marcelo; Porto, Fábio

    2017-01-01

    There are many steps in analyzing transcriptome data, from the acquisition of raw data to the selection of a subset of representative genes that explain a scientific hypothesis. The data produced can be represented as networks of interactions among genes and these may additionally be integrated with other biological databases, such as Protein-Protein Interactions, transcription factors and gene annotation. However, the results of these analyses remain fragmented, imposing difficulties, either for posterior inspection of results, or for meta-analysis by the incorporation of new related data. Integrating databases and tools into scientific workflows, orchestrating their execution, and managing the resulting data and its respective metadata are challenging tasks. Additionally, a great amount of effort is equally required to run in-silico experiments to structure and compose the information as needed for analysis. Different programs may need to be applied and different files are produced during the experiment cycle. In this context, the availability of a platform supporting experiment execution is paramount. We present GeNNet, an integrated transcriptome analysis platform that unifies scientific workflows with graph databases for selecting relevant genes according to the evaluated biological systems. It includes GeNNet-Wf, a scientific workflow that pre-loads biological data, pre-processes raw microarray data and conducts a series of analyses including normalization, differential expression inference, clusterization and gene set enrichment analysis. A user-friendly web interface, GeNNet-Web, allows for setting parameters, executing, and visualizing the results of GeNNet-Wf executions. To demonstrate the features of GeNNet, we performed case studies with data retrieved from GEO, particularly using a single-factor experiment in different analysis scenarios. As a result, we obtained differentially expressed genes for which biological functions were analyzed. The results are integrated into GeNNet-DB, a database about genes, clusters, experiments and their properties and relationships. The resulting graph database is explored with queries that demonstrate the expressiveness of this data model for reasoning about gene interaction networks. GeNNet is the first platform to integrate the analytical process of transcriptome data with graph databases. It provides a comprehensive set of tools that would otherwise be challenging for non-expert users to install and use. Developers can add new functionality to components of GeNNet. The derived data allows for testing previous hypotheses about an experiment and exploring new ones through the interactive graph database environment. It enables the analysis of different data on humans, rhesus, mice and rat coming from Affymetrix platforms. GeNNet is available as an open source platform at https://github.com/raquele/GeNNet and can be retrieved as a software container with the command docker pull quelopes/gennet. PMID:28695067

  9. GeNNet: an integrated platform for unifying scientific workflows and graph databases for transcriptome data analysis.

    PubMed

    Costa, Raquel L; Gadelha, Luiz; Ribeiro-Alves, Marcelo; Porto, Fábio

    2017-01-01

    There are many steps in analyzing transcriptome data, from the acquisition of raw data to the selection of a subset of representative genes that explain a scientific hypothesis. The data produced can be represented as networks of interactions among genes and these may additionally be integrated with other biological databases, such as Protein-Protein Interactions, transcription factors and gene annotation. However, the results of these analyses remain fragmented, imposing difficulties, either for posterior inspection of results, or for meta-analysis by the incorporation of new related data. Integrating databases and tools into scientific workflows, orchestrating their execution, and managing the resulting data and its respective metadata are challenging tasks. Additionally, a great amount of effort is equally required to run in-silico experiments to structure and compose the information as needed for analysis. Different programs may need to be applied and different files are produced during the experiment cycle. In this context, the availability of a platform supporting experiment execution is paramount. We present GeNNet, an integrated transcriptome analysis platform that unifies scientific workflows with graph databases for selecting relevant genes according to the evaluated biological systems. It includes GeNNet-Wf, a scientific workflow that pre-loads biological data, pre-processes raw microarray data and conducts a series of analyses including normalization, differential expression inference, clusterization and gene set enrichment analysis. A user-friendly web interface, GeNNet-Web, allows for setting parameters, executing, and visualizing the results of GeNNet-Wf executions. To demonstrate the features of GeNNet, we performed case studies with data retrieved from GEO, particularly using a single-factor experiment in different analysis scenarios. As a result, we obtained differentially expressed genes for which biological functions were analyzed. The results are integrated into GeNNet-DB, a database about genes, clusters, experiments and their properties and relationships. The resulting graph database is explored with queries that demonstrate the expressiveness of this data model for reasoning about gene interaction networks. GeNNet is the first platform to integrate the analytical process of transcriptome data with graph databases. It provides a comprehensive set of tools that would otherwise be challenging for non-expert users to install and use. Developers can add new functionality to components of GeNNet. The derived data allows for testing previous hypotheses about an experiment and exploring new ones through the interactive graph database environment. It enables the analysis of different data on humans, rhesus, mice and rat coming from Affymetrix platforms. GeNNet is available as an open source platform at https://github.com/raquele/GeNNet and can be retrieved as a software container with the command docker pull quelopes/gennet.

  10. Transcriptome analysis of Ruditapes philippinarum hepatopancreas provides insights into immune signaling pathways under Vibrio anguillarum infection.

    PubMed

    Ren, Yipeng; Xue, Junli; Yang, Huanhuan; Pan, Baoping; Bu, Wenjun

    2017-05-01

    The Manila clam, Ruditapes philippinarum, is one of the most economically important aquatic clams that are harvested on a large scale by the mariculture industry in China. However, increasing reports of bacterial pathogenic diseases have had a negative effect on the aquaculture industry of R. philippinarum. In the present study, the two transcriptome libraries of untreated (termed H) and challenged Vibrio anguillarum (termed HV) hepatopancreas were constructed and sequenced from Manila clam using an Illumina-based paired-end sequencing platform. In total, 75,302,886 and 66,578,976 high-quality clean reads were assembled from 101,080,746 and 99,673,538 raw data points from the two transcriptome libraries described above, respectively. Furthermore, 156,116 unigenes were generated from 210,685 transcripts, with an N50 length of 1125 bp, and from the annotated SwissProt, NR, NT, KO, GO, KOG and KEGG databases. Moreover, a total of 4071 differentially expressed unigenes (HV vs H) were detected, including 903 up-regulated and 3168 down-regulated genes. Among these differentially expressed unigenes, 226 unigenes were annotated using KEGG annotation in 16 immune-related signaling pathways, including Toll-like receptor, NF-kappa B, MAPK, NOD-like receptor, RIG-I-like receptor, and the TNF and chemokine signaling pathways. Finally, 20,341 simple sequence repeats (SSRs) and 214,430 potential single nucleotide polymorphisms (SNPs) were detected from the H and HV transcriptome libraries. In conclusion, these studies identified many candidate immune-related genes and signaling pathways and conducted a comparative analysis of the differentially expressed unigenes from Manila clam hepatopancreas in response to V. anguillarum stimulation. These data laid the foundation for studying the innate immune systems and defense mechanisms in R. philippinarum. Copyright © 2017 Elsevier Ltd. All rights reserved.

  11. Insights into the increasing virulence of the swine-origin pandemic H1N1/2009 influenza virus

    PubMed Central

    Zou, Wei; Chen, Dijun; Xiong, Min; Zhu, Jiping; Lin, Xian; Wang, Lun; Zhang, Jun; Chen, Lingling; Zhang, Hongyu; Chen, Huanchun; Chen, Ming; Jin, Meilin

    2013-01-01

    Pandemic H1N1/2009 viruses have been stabilized in swine herds, and some strains display higher pathogenicity than the human-origin isolates. In this study, high-throughput RNA sequencing (RNA-seq) is applied to explore the systemic transcriptome responses of the mouse lungs infected by swine (Jia6/10) and human (LN/09) H1N1/2009 viruses. The transcriptome data show that Jia6/10 activates stronger virus-sensing signals, such as the toll-like receptor, RIG-I like receptor and NOD-like receptor signalings, as well as a stronger NF-κB and JAK-STAT singals, which play significant roles in inducing innate immunity. Most cytokines and interferon-stimulated genes show higher expression lever in Jia/06 infected groups. Meanwhile, virus Jia6/10 activates stronger production of reactive oxygen species, which might further promote higher mutation rate of the virus genome. Collectively, our data reveal that the swine-origin pandemic H1N1/2009 virus elicits a stronger innate immune reaction and pro-oxidation stimulation, which might relate closely to the increasing pathogenicity. PMID:23549303

  12. Distribution of cellular HSV-1 receptor expression in human brain.

    PubMed

    Lathe, Richard; Haas, Juergen G

    2017-06-01

    Herpes simplex virus type 1 (HSV-1) is a neurotropic virus linked to a range of acute and chronic neurological disorders affecting distinct regions of the brain. Unusually, HSV-1 entry into cells requires the interaction of viral proteins glycoprotein D (gD) and glycoprotein B (gB) with distinct cellular receptor proteins. Several different gD and gB receptors have been identified, including TNFRSF14/HVEM and PVRL1/nectin 1 as gD receptors and PILRA, MAG, and MYH9 as gB receptors. We investigated the expression of these receptor molecules in different areas of the adult and developing human brain using online transcriptome databases. Whereas all HSV-1 receptors showed distinct expression patterns in different brain areas, the Allan Brain Atlas (ABA) reported increased expression of both gD and gB receptors in the hippocampus. Specifically, for PVRL1, TNFRFS14, and MYH9, the differential z scores for hippocampal expression, a measure of relative levels of increased expression, rose to 2.9, 2.9, and 2.5, respectively, comparable to the z score for the archetypical hippocampus-enriched mineralocorticoid receptor (NR3C2, z = 3.1). These data were confirmed at the Human Brain Transcriptome (HBT) database, but HBT data indicate that MAG expression is also enriched in hippocampus. The HBT database allowed the developmental pattern of expression to be investigated; we report that all HSV1 receptors markedly increase in expression levels between gestation and the postnatal/adult periods. These results suggest that differential receptor expression levels of several HSV-1 gD and gB receptors in the adult hippocampus are likely to underlie the susceptibility of this brain region to HSV-1 infection.

  13. Construction of an Ostrea edulis database from genomic and expressed sequence tags (ESTs) obtained from Bonamia ostreae infected haemocytes: Development of an immune-enriched oligo-microarray.

    PubMed

    Pardo, Belén G; Álvarez-Dios, José Antonio; Cao, Asunción; Ramilo, Andrea; Gómez-Tato, Antonio; Planas, Josep V; Villalba, Antonio; Martínez, Paulino

    2016-12-01

    The flat oyster, Ostrea edulis, is one of the main farmed oysters, not only in Europe but also in the United States and Canada. Bonamiosis due to the parasite Bonamia ostreae has been associated with high mortality episodes in this species. This parasite is an intracellular protozoan that infects haemocytes, the main cells involved in oyster defence. Due to the economical and ecological importance of flat oyster, genomic data are badly needed for genetic improvement of the species, but they are still very scarce. The objective of this study is to develop a sequence database, OedulisDB, with new genomic and transcriptomic resources, providing new data and convenient tools to improve our knowledge of the oyster's immune mechanisms. Transcriptomic and genomic sequences were obtained using 454 pyrosequencing and compiled into an O. edulis database, OedulisDB, consisting of two sets of 10,318 and 7159 unique sequences that represent the oyster's genome (WG) and de novo haemocyte transcriptome (HT), respectively. The flat oyster transcriptome was obtained from two strains (naïve and tolerant) challenged with B. ostreae, and from their corresponding non-challenged controls. Approximately 78.5% of 5619 HT unique sequences were successfully annotated by Blast search using public databases. A total of 984 sequences were identified as being related to immune response and several key immune genes were identified for the first time in flat oyster. Additionally, transcriptome information was used to design and validate the first oligo-microarray in flat oyster enriched with immune sequences from haemocytes. Our transcriptomic and genomic sequencing and subsequent annotation have largely increased the scarce resources available for this economically important species and have enabled us to develop an OedulisDB database and accompanying tools for gene expression analysis. This study represents the first attempt to characterize in depth the O. edulis haemocyte transcriptome in response to B. ostreae through massively sequencing and has aided to improve our knowledge of the immune mechanisms of flat oyster. The validated oligo-microarray and the establishment of a reference transcriptome will be useful for large-scale gene expression studies in this species. Copyright © 2016 Elsevier Ltd. All rights reserved.

  14. An Integrated Human/Murine Transcriptome and Pathway Approach To Identify Prenatal Treatments For Down Syndrome.

    PubMed

    Guedj, Faycal; Pennings, Jeroen LA; Massingham, Lauren J; Wick, Heather C; Siegel, Ashley E; Tantravahi, Umadevi; Bianchi, Diana W

    2016-09-02

    Anatomical and functional brain abnormalities begin during fetal life in Down syndrome (DS). We hypothesize that novel prenatal treatments can be identified by targeting signaling pathways that are consistently perturbed in cell types/tissues obtained from human fetuses with DS and mouse embryos. We analyzed transcriptome data from fetuses with trisomy 21, age and sex-matched euploid controls, and embryonic day 15.5 forebrains from Ts1Cje, Ts65Dn, and Dp16 mice. The new datasets were compared to other publicly available datasets from humans with DS. We used the human Connectivity Map (CMap) database and created a murine adaptation to identify FDA-approved drugs that can rescue affected pathways. USP16 and TTC3 were dysregulated in all affected human cells and two mouse models. DS-associated pathway abnormalities were either the result of gene dosage specific effects or the consequence of a global cell stress response with activation of compensatory mechanisms. CMap analyses identified 56 molecules with high predictive scores to rescue abnormal gene expression in both species. Our novel integrated human/murine systems biology approach identified commonly dysregulated genes and pathways. This can help to prioritize therapeutic molecules on which to further test safety and efficacy. Additional studies in human cells are ongoing prior to pre-clinical prenatal treatment in mice.

  15. Sequencing and de novo assembly of visceral mass transcriptome of the critically endangered land snail Satsuma myomphala: Annotation and SSR discovery.

    PubMed

    Kang, Se Won; Patnaik, Bharat Bhusan; Hwang, Hee-Ju; Park, So Young; Chung, Jong Min; Song, Dae Kwon; Patnaik, Hongray Howrelia; Lee, Jae Bong; Kim, Changmu; Kim, Soonok; Park, Hong Seog; Park, Seung-Hwan; Park, Young-Su; Han, Yeon Soo; Lee, Jun Sang; Lee, Yong Seok

    2017-03-01

    Satsuma myomphala is critically endangered through loss of natural habitats, predation by natural enemies, and indiscriminate collection. It is a protected species in Korea but lacks genomic resources for an understanding of varied functional processes attributable to evolutionary success under natural habitats. For assessing the genetic information of S. myomphala, we performed for the first time, de novo transcriptome sequencing and functional annotation of expressed sequences using Illumina Next-Generation Sequencing (NGS) platform and bioinformatics analysis. We identified 103,774 unigenes of which 37,959, 12,890, and 17,699 were annotated in the PANM (Protostome DB), Unigene, and COG (Clusters of Orthologous Groups) databases, respectively. In addition, 14,451 unigenes were predicted under Gene Ontology functional categories, with 4581 assigned to a single category. Furthermore, 3369 sequences with 646 having Enzyme Commission (EC) numbers were mapped to 122 pathways in the Kyoto Encyclopedia of Genes and Genomes Pathway database. The prominent protein domains included the Zinc finger (C2H2-like), Reverse Transcriptase, Thioredoxin-like fold, and RNA recognition motif domain. Many unigenes with homology to immunity, defense, and reproduction-related genes were screened in the transcriptome. We also detected 3120 putative simple sequence repeats (SSRs) encompassing dinucleotide to hexanucleotide repeat motifs from >1kb unigene sequences. A list of PCR primers of SSR loci have been identified to study the genetic polymorphisms. The transcriptome data represents a valuable resource for further investigations on the species genome structure and biology. The unigenes information and microsatellites would provide an indispensable tool for conservation of the species in natural and adaptive environments. Copyright © 2016 The Authors. Published by Elsevier Inc. All rights reserved.

  16. Construction of a medicinal leech transcriptome database and its application to the identification of leech homologs of neural and innate immune genes.

    PubMed

    Macagno, Eduardo R; Gaasterland, Terry; Edsall, Lee; Bafna, Vineet; Soares, Marcelo B; Scheetz, Todd; Casavant, Thomas; Da Silva, Corinne; Wincker, Patrick; Tasiemski, Aurélie; Salzet, Michel

    2010-06-25

    The medicinal leech, Hirudo medicinalis, is an important model system for the study of nervous system structure, function, development, regeneration and repair. It is also a unique species in being presently approved for use in medical procedures, such as clearing of pooled blood following certain surgical procedures. It is a current, and potentially also future, source of medically useful molecular factors, such as anticoagulants and antibacterial peptides, which may have evolved as a result of its parasitizing large mammals, including humans. Despite the broad focus of research on this system, little has been done at the genomic or transcriptomic levels and there is a paucity of openly available sequence data. To begin to address this problem, we constructed whole embryo and adult central nervous system (CNS) EST libraries and created a clustered sequence database of the Hirudo transcriptome that is available to the scientific community. A total of approximately 133,000 EST clones from two directionally-cloned cDNA libraries, one constructed from mRNA derived from whole embryos at several developmental stages and the other from adult CNS cords, were sequenced in one or both directions by three different groups: Genoscope (French National Sequencing Center), the University of Iowa Sequencing Facility and the DOE Joint Genome Institute. These were assembled using the phrap software package into 31,232 unique contigs and singletons, with an average length of 827 nt. The assembled transcripts were then translated in all six frames and compared to proteins in NCBI's non-redundant (NR) and to the Gene Ontology (GO) protein sequence databases, resulting in 15,565 matches to 11,236 proteins in NR and 13,935 matches to 8,073 proteins in GO. Searching the database for transcripts of genes homologous to those thought to be involved in the innate immune responses of vertebrates and other invertebrates yielded a set of nearly one hundred evolutionarily conserved sequences, representing all known pathways involved in these important functions. The sequences obtained for Hirudo transcripts represent the first major database of genes expressed in this important model system. Comparison of translated open reading frames (ORFs) with the other openly available leech datasets, the genome and transcriptome of Helobdella robusta, shows an average identity at the amino acid level of 58% in matched sequences. Interestingly, comparison with other available Lophotrochozoans shows similar high levels of amino acid identity, where sequences match, for example, 64% with Capitella capitata (a polychaete) and 56% with Aplysia californica (a mollusk), as well as 58% with Schistosoma mansoni (a platyhelminth). Phylogenetic comparisons of putative Hirudo innate immune response genes present within the Hirudo transcriptome database herein described show a strong resemblance to the corresponding mammalian genes, indicating that this important physiological response may have older origins than what has been previously proposed.

  17. hPDI: a database of experimental human protein-DNA interactions.

    PubMed

    Xie, Zhi; Hu, Shaohui; Blackshaw, Seth; Zhu, Heng; Qian, Jiang

    2010-01-15

    The human protein DNA Interactome (hPDI) database holds experimental protein-DNA interaction data for humans identified by protein microarray assays. The unique characteristics of hPDI are that it contains consensus DNA-binding sequences not only for nearly 500 human transcription factors but also for >500 unconventional DNA-binding proteins, which are completely uncharacterized previously. Users can browse, search and download a subset or the entire data via a web interface. This database is freely accessible for any academic purposes. http://bioinfo.wilmer.jhu.edu/PDI/.

  18. Draft de novo transcriptome assembly and proteome characterization of the electric lobe of Tetronarce californica: a molecular tool for the study of cholinergic neurotransmission in the electric organ.

    PubMed

    Stavrianakou, Maria; Perez, Ricardo; Wu, Cheng; Sachs, Matthew S; Aramayo, Rodolfo; Harlow, Mark

    2017-08-14

    The electric organ of Tetronarce californica (an electric ray formerly known as Torpedo californica) is a classic preparation for biochemical studies of cholinergic neurotransmission. To broaden the usefulness of this preparation, we have performed a transcriptome assembly of the presynaptic component of the electric organ (the electric lobe). We combined our assembled transcriptome with a previous transcriptome of the postsynaptic electric organ, to define a MetaProteome containing pre- and post-synaptic components of the electric organ. Sequencing yielded 102 million paired-end 100 bp reads. De novo Trinity assembly was performed at Kmer 25 (default) and Kmers 27, 29, and 31. Trinity, generated around 103,000 transcripts, and 78,000 genes per assembly. Assemblies were evaluated based on the number of bases/transcripts assembled, RSEM-EVAL scores and informational content and completeness. We found that different assemblies scored differently according to the evaluation criteria used, and that while each individual assembly contained unique information, much of the assembly information was shared by all assemblies. To generate the presynaptic transcriptome (electric lobe), while capturing all information, assemblies were first clustered and then combined with postsynaptic transcripts (electric organ) downloaded from NCBI. The completness of the resulting clustered predicted MetaProteome was rigorously evaluated by comparing its information against the predicted proteomes from Homo sapiens, Callorhinchus milli, and the Transporter Classification Database (TCDB). In summary, we obtained a MetaProteome containing 92%, 88.5%, and 66% of the expected set of ultra-conserved sequences (i.e., BUSCOs), expected to be found for Eukaryotes, Metazoa, and Vertebrata, respectively. We cross-annotated the conserved set of proteins shared between the T. californica MetaProteome and the proteomes of H. sapiens and C. milli, using the H. sapiens genome as a reference. This information was used to predict the position in human pathways of the conserved members of the T. californica MetaProteome. We found proteins not detected before in T. californica, corresponding to processes involved in synaptic vesicle biology. Finally, we identified 42 transporter proteins in TCDB that were detected by the T. californica MetaProteome (electric fish) and not selected by a control proteome consisting of the combined proteomes of 12 widely diverse non-electric fishes by Reverse-Blast-Hit Blast. Combined, the information provided here is not only a unique tool for the study of cholinergic neurotransmission, but it is also a starting point for understanding the evolution of early vertebrates.

  19. Hv 1 Proton Channels in Dinoflagellates: Not Just for Bioluminescence?

    PubMed

    Kigundu, Gabriel; Cooper, Jennifer L; Smith, Susan M E

    2018-04-26

    Bioluminescence in dinoflagellates is controlled by H V 1 proton channels. Database searches of dinoflagellate transcriptomes and genomes yielded hits with sequence features diagnostic of all confirmed H V 1, and show that H V 1 is widely distributed in the dinoflagellate phylogeny including the basal species Oxyrrhis marina. Multiple sequence alignments followed by phylogenetic analysis revealed three major subfamilies of H V 1 that do not correlate with presence of theca, autotrophy, geographic location, or bioluminescence. These data suggest that most dinoflagellates express a H V 1 which has a function separate from bioluminescence. Sequence evidence also suggests that dinoflagellates can contain more than one H V 1 gene. This article is protected by copyright. All rights reserved. This article is protected by copyright. All rights reserved.

  20. NGS Catalog: A Database of Next Generation Sequencing Studies in Humans

    PubMed Central

    Xia, Junfeng; Wang, Qingguo; Jia, Peilin; Wang, Bing; Pao, William; Zhao, Zhongming

    2015-01-01

    Next generation sequencing (NGS) technologies have been rapidly applied in biomedical and biological research since its advent only a few years ago, and they are expected to advance at an unprecedented pace in the following years. To provide the research community with a comprehensive NGS resource, we have developed the database Next Generation Sequencing Catalog (NGS Catalog, http://bioinfo.mc.vanderbilt.edu/NGS/index.html), a continually updated database that collects, curates and manages available human NGS data obtained from published literature. NGS Catalog deposits publication information of NGS studies and their mutation characteristics (SNVs, small insertions/deletions, copy number variations, and structural variants), as well as mutated genes and gene fusions detected by NGS. Other functions include user data upload, NGS general analysis pipelines, and NGS software. NGS Catalog is particularly useful for investigators who are new to NGS but would like to take advantage of these powerful technologies for their own research. Finally, based on the data deposited in NGS Catalog, we summarized features and findings from whole exome sequencing, whole genome sequencing, and transcriptome sequencing studies for human diseases or traits. PMID:22517761

  1. Epididymal genomics and the search for a male contraceptive.

    PubMed

    Turner, T T; Johnston, D S; Jelinsky, S A

    2006-05-16

    This report represents the joint efforts of three laboratories, one with a primary interest in understanding regulatory processes in the epididymal epithelium (TTT) and two with a primary interest in identifying and characterizing new contraceptive targets (DSJ and SAJ). We have developed a highly refined mouse epididymal transcriptome and have used it as a starting point for determining genes in the human epididymis, which may serve as targets for male contraceptives. Our database represents gene expression information for approximately 39,000 transcripts, of which over 17,000 are significantly expressed in at least one segment of the mouse epididymis. Over 2000 of these transcripts are up- or down-regulated by at least four-fold between at least two segments. In addition, human databases have been queried to determine expression of orthologs in the human epididymis and the specificity of their expression in the epididymis. Genes highly regulated in the human epididymis and showing high tissue specificity are potential targets for male contraceptives.

  2. Conserved and divergent rhythms of crassulacean acid metabolism-related and core clock gene expression in the cactus Opuntia ficus-indica.

    PubMed

    Mallona, Izaskun; Egea-Cortines, Marcos; Weiss, Julia

    2011-08-01

    The cactus Opuntia ficus-indica is a constitutive Crassulacean acid metabolism (CAM) species. Current knowledge of CAM metabolism suggests that the enzyme phosphoenolpyruvate carboxylase kinase (PPCK) is circadian regulated at the transcriptional level, whereas phosphoenolpyruvate carboxylase (PEPC), malate dehydrogenase (MDH), NADP-malic enzyme (NADP-ME), and pyruvate phosphate dikinase (PPDK) are posttranslationally controlled. As little transcriptomic data are available from obligate CAM plants, we created an expressed sequence tag database derived from different organs and developmental stages. Sequences were assembled, compared with sequences in the National Center for Biotechnology Information nonredundant database for identification of putative orthologs, and mapped using Kyoto Encyclopedia of Genes and Genomes Orthology and Gene Ontology. We identified genes involved in circadian regulation and CAM metabolism for transcriptomic analysis in plants grown in long days. We identified stable reference genes for quantitative polymerase chain reaction and found that OfiSAND, like its counterpart in Arabidopsis (Arabidopsis thaliana), and OfiTUB are generally appropriate standards for use in the quantification of gene expression in O. ficus-indica. Three kinds of expression profiles were found: transcripts of OfiPPCK oscillated with a 24-h periodicity; transcripts of the light-active OfiNADP-ME and OfiPPDK genes adapted to 12-h cycles, while transcript accumulation patterns of OfiPEPC and OfiMDH were arrhythmic. Expression of the circadian clock gene OfiTOC1, similar to Arabidopsis, oscillated with a 24-h periodicity, peaking at night. Expression of OfiCCA1 and OfiPRR9, unlike in Arabidopsis, adapted best to a 12-h rhythm, suggesting that circadian clock gene interactions differ from those of Arabidopsis. Our results indicate that the evolution of CAM metabolism could be the result of modified circadian regulation at both the transcriptional and posttranscriptional levels.

  3. Regulatory RNA binding proteins contribute to the transcriptome-wide splicing alterations in human cellular senescence.

    PubMed

    Dong, Qiongye; Wei, Lei; Zhang, Michael Q; Wang, Xiaowo

    2018-06-24

    Dysregulation of mRNA splicing has been observed in certain cellular senescence process. However, the common splicing alterations on the whole transcriptome shared by various types of senescence are poorly understood. In order to systematically identify senescence-associated transcriptomic changes in genome-wide scale, we collected RNA sequencing datasets of different human cell types with a variety of senescence-inducing methods from public databases and performed meta-analysis. First, we discovered that a group of RNA binding proteins were consistently down-regulated in diverse senescent samples and identified 406 senescence-associated common differential splicing events. Then, eight differentially expressed RNA binding proteins were predicted to regulate these senescence-associated splicing alterations through an enrichment analysis of their RNA binding information, including motif scanning and enhanced cross-linking immunoprecipitation data. In addition, we constructed the splicing regulatory modules that might contribute to senescence-associated biological processes. Finally, it was confirmed that knockdown of the predicted senescence-associated potential splicing regulators through shRNAs in HepG2 cell line could result in senescence-like splicing changes. Taken together, our work demonstrated a broad range of common changes in mRNA splicing switches and detected their central regulatory RNA binding proteins during senescence. These findings would help to better understand the coordinating splicing alterations in cellular senescence.

  4. H7N9 and Other Pathogenic Avian Influenza Viruses Elicit a Three-Pronged Transcriptomic Signature That Is Reminiscent of 1918 Influenza Virus and Is Associated with Lethal Outcome in Mice

    PubMed Central

    Morrison, Juliet; Josset, Laurence; Tchitchek, Nicolas; Chang, Jean; Belser, Jessica A.; Swayne, David E.; Pantin-Jackwood, Mary J.; Tumpey, Terrence M.

    2014-01-01

    ABSTRACT Modulating the host response is a promising approach to treating influenza, caused by a virus whose pathogenesis is determined in part by the reaction it elicits within the host. Though the pathogenicity of emerging H7N9 influenza virus in several animal models has been reported, these studies have not included a detailed characterization of the host response following infection. Therefore, we characterized the transcriptomic response of BALB/c mice infected with H7N9 (A/Anhui/01/2013) virus and compared it to the responses induced by H5N1 (A/Vietnam/1203/2004), H7N7 (A/Netherlands/219/2003), and pandemic 2009 H1N1 (A/Mexico/4482/2009) influenza viruses. We found that responses to the H7 subtype viruses were intermediate to those elicited by H5N1 and pdm09H1N1 early in infection but that they evolved to resemble the H5N1 response as infection progressed. H5N1, H7N7, and H7N9 viruses were pathogenic in mice, and this pathogenicity correlated with increased transcription of cytokine response genes and decreased transcription of lipid metabolism and coagulation signaling genes. This three-pronged transcriptomic signature was observed in mice infected with pathogenic H1N1 strains such as the 1918 virus, indicating that it may be predictive of pathogenicity across multiple influenza virus strains. Finally, we used host transcriptomic profiling to computationally predict drugs that reverse the host response to H7N9 infection, and we identified six FDA-approved drugs that could potentially be repurposed to treat H7N9 and other pathogenic influenza viruses. IMPORTANCE Emerging avian influenza viruses are of global concern because the human population is immunologically naive to them. Current influenza drugs target viral molecules, but the high mutation rate of influenza viruses eventually leads to the development of antiviral resistance. As the host evolves far more slowly than the virus, and influenza pathogenesis is determined in part by the host response, targeting the host response is a promising approach to treating influenza. Here we characterize the host transcriptomic response to emerging H7N9 influenza virus and compare it with the responses to H7N7, H5N1, and pdm09H1N1. All three avian viruses were pathogenic in mice and elicited a transcriptomic signature that also occurs in response to the legendary 1918 influenza virus. Our work identifies host responses that could be targeted to treat severe H7N9 influenza and identifies six FDA-approved drugs that could potentially be repurposed as H7N9 influenza therapeutics. PMID:24991006

  5. H7N9 and other pathogenic avian influenza viruses elicit a three-pronged transcriptomic signature that is reminiscent of 1918 influenza virus and is associated with lethal outcome in mice.

    PubMed

    Morrison, Juliet; Josset, Laurence; Tchitchek, Nicolas; Chang, Jean; Belser, Jessica A; Swayne, David E; Pantin-Jackwood, Mary J; Tumpey, Terrence M; Katze, Michael G

    2014-09-01

    Modulating the host response is a promising approach to treating influenza, caused by a virus whose pathogenesis is determined in part by the reaction it elicits within the host. Though the pathogenicity of emerging H7N9 influenza virus in several animal models has been reported, these studies have not included a detailed characterization of the host response following infection. Therefore, we characterized the transcriptomic response of BALB/c mice infected with H7N9 (A/Anhui/01/2013) virus and compared it to the responses induced by H5N1 (A/Vietnam/1203/2004), H7N7 (A/Netherlands/219/2003), and pandemic 2009 H1N1 (A/Mexico/4482/2009) influenza viruses. We found that responses to the H7 subtype viruses were intermediate to those elicited by H5N1 and pdm09H1N1 early in infection but that they evolved to resemble the H5N1 response as infection progressed. H5N1, H7N7, and H7N9 viruses were pathogenic in mice, and this pathogenicity correlated with increased transcription of cytokine response genes and decreased transcription of lipid metabolism and coagulation signaling genes. This three-pronged transcriptomic signature was observed in mice infected with pathogenic H1N1 strains such as the 1918 virus, indicating that it may be predictive of pathogenicity across multiple influenza virus strains. Finally, we used host transcriptomic profiling to computationally predict drugs that reverse the host response to H7N9 infection, and we identified six FDA-approved drugs that could potentially be repurposed to treat H7N9 and other pathogenic influenza viruses. Emerging avian influenza viruses are of global concern because the human population is immunologically naive to them. Current influenza drugs target viral molecules, but the high mutation rate of influenza viruses eventually leads to the development of antiviral resistance. As the host evolves far more slowly than the virus, and influenza pathogenesis is determined in part by the host response, targeting the host response is a promising approach to treating influenza. Here we characterize the host transcriptomic response to emerging H7N9 influenza virus and compare it with the responses to H7N7, H5N1, and pdm09H1N1. All three avian viruses were pathogenic in mice and elicited a transcriptomic signature that also occurs in response to the legendary 1918 influenza virus. Our work identifies host responses that could be targeted to treat severe H7N9 influenza and identifies six FDA-approved drugs that could potentially be repurposed as H7N9 influenza therapeutics. Copyright © 2014, American Society for Microbiology. All Rights Reserved.

  6. Developmental Gene Discovery in a Hemimetabolous Insect: De Novo Assembly and Annotation of a Transcriptome for the Cricket Gryllus bimaculatus

    PubMed Central

    Zeng, Victor; Ewen-Campen, Ben; Horch, Hadley W.; Roth, Siegfried; Mito, Taro; Extavour, Cassandra G.

    2013-01-01

    Most genomic resources available for insects represent the Holometabola, which are insects that undergo complete metamorphosis like beetles and flies. In contrast, the Hemimetabola (direct developing insects), representing the basal branches of the insect tree, have very few genomic resources. We have therefore created a large and publicly available transcriptome for the hemimetabolous insect Gryllus bimaculatus (cricket), a well-developed laboratory model organism whose potential for functional genetic experiments is currently limited by the absence of genomic resources. cDNA was prepared using mRNA obtained from adult ovaries containing all stages of oogenesis, and from embryo samples on each day of embryogenesis. Using 454 Titanium pyrosequencing, we sequenced over four million raw reads, and assembled them into 21,512 isotigs (predicted transcripts) and 120,805 singletons with an average coverage per base pair of 51.3. We annotated the transcriptome manually for over 400 conserved genes involved in embryonic patterning, gametogenesis, and signaling pathways. BLAST comparison of the transcriptome against the NCBI non-redundant protein database (nr) identified significant similarity to nr sequences for 55.5% of transcriptome sequences, and suggested that the transcriptome may contain 19,874 unique transcripts. For predicted transcripts without significant similarity to known sequences, we assessed their similarity to other orthopteran sequences, and determined that these transcripts contain recognizable protein domains, largely of unknown function. We created a searchable, web-based database to allow public access to all raw, assembled and annotated data. This database is to our knowledge the largest de novo assembled and annotated transcriptome resource available for any hemimetabolous insect. We therefore anticipate that these data will contribute significantly to more effective and higher-throughput deployment of molecular analysis tools in Gryllus. PMID:23671567

  7. The Transcriptome Analysis and Comparison Explorer--T-ACE: a platform-independent, graphical tool to process large RNAseq datasets of non-model organisms.

    PubMed

    Philipp, E E R; Kraemer, L; Mountfort, D; Schilhabel, M; Schreiber, S; Rosenstiel, P

    2012-03-15

    Next generation sequencing (NGS) technologies allow a rapid and cost-effective compilation of large RNA sequence datasets in model and non-model organisms. However, the storage and analysis of transcriptome information from different NGS platforms is still a significant bottleneck, leading to a delay in data dissemination and subsequent biological understanding. Especially database interfaces with transcriptome analysis modules going beyond mere read counts are missing. Here, we present the Transcriptome Analysis and Comparison Explorer (T-ACE), a tool designed for the organization and analysis of large sequence datasets, and especially suited for transcriptome projects of non-model organisms with little or no a priori sequence information. T-ACE offers a TCL-based interface, which accesses a PostgreSQL database via a php-script. Within T-ACE, information belonging to single sequences or contigs, such as annotation or read coverage, is linked to the respective sequence and immediately accessible. Sequences and assigned information can be searched via keyword- or BLAST-search. Additionally, T-ACE provides within and between transcriptome analysis modules on the level of expression, GO terms, KEGG pathways and protein domains. Results are visualized and can be easily exported for external analysis. We developed T-ACE for laboratory environments, which have only a limited amount of bioinformatics support, and for collaborative projects in which different partners work on the same dataset from different locations or platforms (Windows/Linux/MacOS). For laboratories with some experience in bioinformatics and programming, the low complexity of the database structure and open-source code provides a framework that can be customized according to the different needs of the user and transcriptome project.

  8. Transcriptome Complexity and Riboregulation in the Human Pathogen Helicobacter pylori

    PubMed Central

    Pernitzsch, Sandy R.; Sharma, Cynthia M.

    2012-01-01

    The Gram-negative Epsilonproteobacterium Helicobacter pylori is considered as one of the major human pathogens and many studies have focused on its virulence mechanisms as well as genomic diversity. In contrast, only very little is known about post-transcriptional regulation and small regulatory RNAs (sRNAs) in this spiral-shaped microaerophilic bacterium. Considering the absence of the common RNA chaperone Hfq, which is a key-player in post-transcriptional regulation in enterobacteria, H. pylori was even regarded as an organism without riboregulation. However, analysis of the H. pylori primary transcriptome using RNA-seq revealed a very complex transcriptional output from its small genome. Furthermore, the identification of a wealth of sRNAs as well as massive antisense transcription indicates that H. pylori uses riboregulation for its gene expression control. The ongoing functional characterization of sRNAs along with the identification of associated RNA binding proteins will help to understand their potential roles in Helicobacter virulence and stress response. Moreover, research on riboregulation in H. pylori will provide new insights into its virulence mechanisms and will also help to shed light on post-transcriptional regulation in other Epsilonproteobacteria, including widespread and emerging pathogens such as Campylobacter. PMID:22919606

  9. GATA2/3-TFAP2A/C transcription factor network couples human pluripotent stem cell differentiation to trophectoderm with repression of pluripotency

    PubMed Central

    Krendl, Christian; Shaposhnikov, Dmitry; Rishko, Valentyna; Ori, Chaido; Ziegenhain, Christoph; Sass, Steffen; Simon, Lukas; Müller, Nikola S.; Straub, Tobias; Brooks, Kelsey E.; Chavez, Shawn L.; Enard, Wolfgang; Theis, Fabian J.; Drukker, Micha

    2017-01-01

    To elucidate the molecular basis of BMP4-induced differentiation of human pluripotent stem cells (PSCs) toward progeny with trophectoderm characteristics, we produced transcriptome, epigenome H3K4me3, H3K27me3, and CpG methylation maps of trophoblast progenitors, purified using the surface marker APA. We combined them with the temporally resolved transcriptome of the preprogenitor phase and of single APA+ cells. This revealed a circuit of bivalent TFAP2A, TFAP2C, GATA2, and GATA3 transcription factors, coined collectively the “trophectoderm four” (TEtra), which are also present in human trophectoderm in vivo. At the onset of differentiation, the TEtra factors occupy multiple sites in epigenetically inactive placental genes and in OCT4. Functional manipulation of GATA3 and TFAP2A indicated that they directly couple trophoblast-specific gene induction with suppression of pluripotency. In accordance, knocking down GATA3 in primate embryos resulted in a failure to form trophectoderm. The discovery of the TEtra circuit indicates how trophectoderm commitment is regulated in human embryogenesis. PMID:29078328

  10. Proteogenomics Dashboard for the Human Proteome Project.

    PubMed

    Tabas-Madrid, Daniel; Alves-Cruzeiro, Joao; Segura, Victor; Guruceaga, Elizabeth; Vialas, Vital; Prieto, Gorka; García, Carlos; Corrales, Fernando J; Albar, Juan Pablo; Pascual-Montano, Alberto

    2015-09-04

    dasHPPboard is a novel proteomics-based dashboard that collects and reports the experiments produced by the Spanish Human Proteome Project consortium (SpHPP) and aims to help HPP to map the entire human proteome. We have followed the strategy of analog genomics projects like the Encyclopedia of DNA Elements (ENCODE), which provides a vast amount of data on human cell lines experiments. The dashboard includes results of shotgun and selected reaction monitoring proteomics experiments, post-translational modifications information, as well as proteogenomics studies. We have also processed the transcriptomics data from the ENCODE and Human Body Map (HBM) projects for the identification of specific gene expression patterns in different cell lines and tissues, taking special interest in those genes having little proteomic evidence available (missing proteins). Peptide databases have been built using single nucleotide variants and novel junctions derived from RNA-Seq data that can be used in search engines for sample-specific protein identifications on the same cell lines or tissues. The dasHPPboard has been designed as a tool that can be used to share and visualize a combination of proteomic and transcriptomic data, providing at the same time easy access to resources for proteogenomics analyses. The dasHPPboard can be freely accessed at: http://sphppdashboard.cnb.csic.es.

  11. Transcriptome Sequencing and Positive Selected Genes Analysis of Bombyx mandarina

    PubMed Central

    Wu, Yuqian; Long, Renwen; Liu, Chun; Xia, Qingyou

    2015-01-01

    The wild silkworm Bombyx mandarina is widely believed to be an ancestor of the domesticated silkworm, Bombyx mori. Silkworms are often used as a model for studying the mechanism of species domestication. Here, we performed transcriptome sequencing of the wild silkworm using an Illumina HiSeq2000 platform. We produced 100,004,078 high-quality reads and assembled them into 50,773 contigs with an N50 length of 1764 bp and a mean length of 941.62 bp. A total of 33,759 unigenes were identified, with 12,805 annotated in the Nr database, 8273 in the Pfam database, and 9093 in the Swiss-Prot database. Expression profile analysis found significant differential expression of 1308 unigenes between the middle silk gland (MSG) and posterior silk gland (PSG). Three sericin genes (sericin 1, sericin 2, and sericin 3) were expressed specifically in the MSG and three fibroin genes (fibroin-H, fibroin-L, and fibroin/P25) were expressed specifically in the PSG. In addition, 32,297 Single-nucleotide polymorphisms (SNPs) and 361 insertion-deletions (INDELs) were detected. Comparison with the domesticated silkworm p50/Dazao identified 5,295 orthologous genes, among which 400 might have experienced or to be experiencing positive selection by Ka/Ks analysis. These data and analyses presented here provide insights into silkworm domestication and an invaluable resource for wild silkworm genomics research. PMID:25806526

  12. Transcriptome sequencing and identification of cold tolerance genes in hardy Corylus species (C. heterophylla Fisch) floral buds.

    PubMed

    Chen, Xin; Zhang, Jin; Liu, Qingzhong; Guo, Wei; Zhao, Tiantian; Ma, Qinghua; Wang, Guixi

    2014-01-01

    The genus Corylus is an important woody species in Northeast China. Its products, hazelnuts, constitute one of the most important raw materials for the pastry and chocolate industry. However, limited genetic research has focused on Corylus because of the lack of genomic resources. The advent of high-throughput sequencing technologies provides a turning point for Corylus research. In the present study, we performed de novo transcriptome sequencing for the first time to produce a comprehensive database for the Corylus heterophylla Fisch floral buds. The C. heterophylla Fisch floral buds transcriptome was sequenced using the Illumina paired-end sequencing technology. We produced 28,930,890 raw reads and assembled them into 82,684 contigs. A total of 40,941 unigenes were identified, among which 30,549 were annotated in the NCBI Non-redundant (Nr) protein database and 18,581 were annotated in the Swiss-Prot database. Of these annotated unigenes, 25,311 and 10,514 unigenes were assigned to gene ontology (GO) categories and clusters of orthologous groups (COG), respectively. We could map 17,207 unigenes onto 128 pathways using the Kyoto Encyclopedia of Genes and Genomes Pathway (KEGG) database. Additionally, based on the transcriptome, we constructed a candidate cold tolerance gene set of C. heterophylla Fisch floral buds. The expression patterns of selected genes during four stages of cold acclimation suggested that these genes might be involved in different cold responsive stages in C. heterophylla Fisch floral buds. The transcriptome of C. heterophylla Fisch floral buds was deep sequenced, de novo assembled, and annotated, providing abundant data to better understand the C. heterophylla Fisch floral buds transcriptome. Candidate genes potentially involved in cold tolerance were identified, providing a material basis for future molecular mechanism analysis of C. heterophylla Fisch floral buds tolerant to cold stress.

  13. The Co-regulation Data Harvester: Automating gene annotation starting from a transcriptome database

    NASA Astrophysics Data System (ADS)

    Tsypin, Lev M.; Turkewitz, Aaron P.

    Identifying co-regulated genes provides a useful approach for defining pathway-specific machinery in an organism. To be efficient, this approach relies on thorough genome annotation, a process much slower than genome sequencing per se. Tetrahymena thermophila, a unicellular eukaryote, has been a useful model organism and has a fully sequenced but sparsely annotated genome. One important resource for studying this organism has been an online transcriptomic database. We have developed an automated approach to gene annotation in the context of transcriptome data in T. thermophila, called the Co-regulation Data Harvester (CDH). Beginning with a gene of interest, the CDH identifies co-regulated genes by accessing the Tetrahymena transcriptome database. It then identifies their closely related genes (orthologs) in other organisms by using reciprocal BLAST searches. Finally, it collates the annotations of those orthologs' functions, which provides the user with information to help predict the cellular role of the initial query. The CDH, which is freely available, represents a powerful new tool for analyzing cell biological pathways in Tetrahymena. Moreover, to the extent that genes and pathways are conserved between organisms, the inferences obtained via the CDH should be relevant, and can be explored, in many other systems.

  14. ATGC transcriptomics: a web-based application to integrate, explore and analyze de novo transcriptomic data.

    PubMed

    Gonzalez, Sergio; Clavijo, Bernardo; Rivarola, Máximo; Moreno, Patricio; Fernandez, Paula; Dopazo, Joaquín; Paniego, Norma

    2017-02-22

    In the last years, applications based on massively parallelized RNA sequencing (RNA-seq) have become valuable approaches for studying non-model species, e.g., without a fully sequenced genome. RNA-seq is a useful tool for detecting novel transcripts and genetic variations and for evaluating differential gene expression by digital measurements. The large and complex datasets resulting from functional genomic experiments represent a challenge in data processing, management, and analysis. This problem is especially significant for small research groups working with non-model species. We developed a web-based application, called ATGC transcriptomics, with a flexible and adaptable interface that allows users to work with new generation sequencing (NGS) transcriptomic analysis results using an ontology-driven database. This new application simplifies data exploration, visualization, and integration for a better comprehension of the results. ATGC transcriptomics provides access to non-expert computer users and small research groups to a scalable storage option and simple data integration, including database administration and management. The software is freely available under the terms of GNU public license at http://atgcinta.sourceforge.net .

  15. Genome-Scale Transcriptome Analysis in Response to Nitric Oxide in Birch Cells: Implications of the Triterpene Biosynthetic Pathway

    PubMed Central

    Zeng, Fansuo; Sun, Fengkun; Li, Leilei; Liu, Kun; Zhan, Yaguang

    2014-01-01

    Evidence supporting nitric oxide (NO) as a mediator of plant biochemistry continues to grow, but its functions at the molecular level remains poorly understood and, in some cases, controversial. To study the role of NO at the transcriptional level in Betula platyphylla cells, we conducted a genome-scale transcriptome analysis of these cells. The transcriptome of untreated birch cells and those treated by sodium nitroprusside (SNP) were analyzed using the Solexa sequencing. Data were collected by sequencing cDNA libraries of birch cells, which had a long period to adapt to the suspension culture conditions before SNP-treated cells and untreated cells were sampled. Among the 34,100 UniGenes detected, BLASTX search revealed that 20,631 genes showed significant (E-values≤10−5) sequence similarity with proteins from the NR-database. Numerous expressed sequence tags (i.e., 1374) were identified as differentially expressed between the 12 h SNP-treated cells and control cells samples: 403 up-regulated and 971 down-regulated. From this, we specifically examined a core set of NO-related transcripts. The altered expression levels of several transcripts, as determined by transcriptome analysis, was confirmed by qRT-PCR. The results of transcriptome analysis, gene expression quantification, the content of triterpenoid and activities of defensive enzymes elucidated NO has a significant effect on many processes including triterpenoid production, carbohydrate metabolism and cell wall biosynthesis. PMID:25551661

  16. Nodeomics: Pathogen Detection in Vertebrate Lymph Nodes Using Meta-Transcriptomics

    USGS Publications Warehouse

    Wittekindt, Nicola E.; Padhi, Abinash; Schuster, Stephan C.; Qi, Ji; Zhao, Fangqing; Tomsho, Lynn P.; Kasson, Lindsay R.; Packard, Michael; Cross, Paul C.; Poss, Mary

    2010-01-01

    The ongoing emergence of human infections originating from wildlife highlights the need for better knowledge of the microbial community in wildlife species where traditional diagnostic approaches are limited. Here we evaluate the microbial biota in healthy mule deer (Odocoileus hemionus) by analyses of lymph node meta-transcriptomes. cDNA libraries from five individuals and two pools of samples were prepared from retropharyngeal lymph node RNA enriched for polyadenylated RNA and sequenced using Roche-454 Life Sciences technology. Protein-coding and 16S ribosomal RNA (rRNA) sequences were taxonomically profiled using protein and rRNA specific databases. Representatives of all bacterial phyla were detected in the seven libraries based on protein-coding transcripts indicating that viable microbiota were present in lymph nodes. Residents of skin and rumen, and those ubiquitous in mule deer habitat dominated classifiable bacterial species. Based on detection of both rRNA and protein-coding transcripts, we identified two new proteobacterial species; a Helicobacter closely related to Helicobacter cetorum in the Helicobacter pylori/Helicobacter acinonychis complex and an Acinetobacter related to Acinetobacter schindleri. Among viruses, a novel gamma retrovirus and other members of the Poxviridae and Retroviridae were identified. We additionally evaluated bacterial diversity by amplicon sequencing the hypervariable V6 region of 16S rRNA and demonstrate that overall taxonomic diversity is higher with the meta-transcriptomic approach. These data provide the most complete picture to date of the microbial diversity within a wildlife host. Our research advances the use of meta-transcriptomics to study microbiota in wildlife tissues, which will facilitate detection of novel organisms with pathogenic potential to human and animals.

  17. PARRoT- a homology-based strategy to quantify and compare RNA-sequencing from non-model organisms.

    PubMed

    Gan, Ruei-Chi; Chen, Ting-Wen; Wu, Timothy H; Huang, Po-Jung; Lee, Chi-Ching; Yeh, Yuan-Ming; Chiu, Cheng-Hsun; Huang, Hsien-Da; Tang, Petrus

    2016-12-22

    Next-generation sequencing promises the de novo genomic and transcriptomic analysis of samples of interests. However, there are only a few organisms having reference genomic sequences and even fewer having well-defined or curated annotations. For transcriptome studies focusing on organisms lacking proper reference genomes, the common strategy is de novo assembly followed by functional annotation. However, things become even more complicated when multiple transcriptomes are compared. Here, we propose a new analysis strategy and quantification methods for quantifying expression level which not only generate a virtual reference from sequencing data, but also provide comparisons between transcriptomes. First, all reads from the transcriptome datasets are pooled together for de novo assembly. The assembled contigs are searched against NCBI NR databases to find potential homolog sequences. Based on the searched result, a set of virtual transcripts are generated and served as a reference transcriptome. By using the same reference, normalized quantification values including RC (read counts), eRPKM (estimated RPKM) and eTPM (estimated TPM) can be obtained that are comparable across transcriptome datasets. In order to demonstrate the feasibility of our strategy, we implement it in the web service PARRoT. PARRoT stands for Pipeline for Analyzing RNA Reads of Transcriptomes. It analyzes gene expression profiles for two transcriptome sequencing datasets. For better understanding of the biological meaning from the comparison among transcriptomes, PARRoT further provides linkage between these virtual transcripts and their potential function through showing best hits in SwissProt, NR database, assigning GO terms. Our demo datasets showed that PARRoT can analyze two paired-end transcriptomic datasets of approximately 100 million reads within just three hours. In this study, we proposed and implemented a strategy to analyze transcriptomes from non-reference organisms which offers the opportunity to quantify and compare transcriptome profiles through a homolog based virtual transcriptome reference. By using the homolog based reference, our strategy effectively avoids the problems that may cause from inconsistencies among transcriptomes. This strategy will shed lights on the field of comparative genomics for non-model organism. We have implemented PARRoT as a web service which is freely available at http://parrot.cgu.edu.tw .

  18. A Single-Cell Roadmap of Lineage Bifurcation in Human ESC Models of Embryonic Brain Development.

    PubMed

    Yao, Zizhen; Mich, John K; Ku, Sherman; Menon, Vilas; Krostag, Anne-Rachel; Martinez, Refugio A; Furchtgott, Leon; Mulholland, Heather; Bort, Susan; Fuqua, Margaret A; Gregor, Ben W; Hodge, Rebecca D; Jayabalu, Anu; May, Ryan C; Melton, Samuel; Nelson, Angelique M; Ngo, N Kiet; Shapovalova, Nadiya V; Shehata, Soraya I; Smith, Michael W; Tait, Leah J; Thompson, Carol L; Thomsen, Elliot R; Ye, Chaoyang; Glass, Ian A; Kaykas, Ajamete; Yao, Shuyuan; Phillips, John W; Grimley, Joshua S; Levi, Boaz P; Wang, Yanling; Ramanathan, Sharad

    2017-01-05

    During human brain development, multiple signaling pathways generate diverse cell types with varied regional identities. Here, we integrate single-cell RNA sequencing and clonal analyses to reveal lineage trees and molecular signals underlying early forebrain and mid/hindbrain cell differentiation from human embryonic stem cells (hESCs). Clustering single-cell transcriptomic data identified 41 distinct populations of progenitor, neuronal, and non-neural cells across our differentiation time course. Comparisons with primary mouse and human gene expression data demonstrated rostral and caudal progenitor and neuronal identities from early brain development. Bayesian analyses inferred a unified cell-type lineage tree that bifurcates between cortical and mid/hindbrain cell types. Two methods of clonal analyses confirmed these findings and further revealed the importance of Wnt/β-catenin signaling in controlling this lineage decision. Together, these findings provide a rich transcriptome-based lineage map for studying human brain development and modeling developmental disorders. Copyright © 2017 Elsevier Inc. All rights reserved.

  19. Biological pattern and transcriptomic exploration and phylogenetic analysis in the odd floral architecture tree: Helwingia willd.

    PubMed

    Sun, Cheng; Yu, Guoliang; Bao, Manzhu; Zheng, Bo; Ning, Guogui

    2014-06-27

    Odd traits in few of plant species usually implicate potential biology significances in plant evolutions. The genus Helwingia Willd, a dioecious medical shrub in Aquifoliales order, has an odd floral architecture-epiphyllous inflorescence. The potential significances and possible evolutionary origin of this specie are not well understood due to poorly available data of biological and genetic studies. In addition, the advent of genomics-based technologies has widely revolutionized plant species with unknown genomic information. Morphological and biological pattern were detailed via anatomical and pollination analyses. An RNA sequencing based transcriptomic analysis were undertaken and a high-resolution phylogenetic analysis was conducted based on single-copy genes in more than 80 species of seed plants, including H. japonica. It is verified that a potential fusion of rachis to the leaf midvein facilitates insect pollination. RNA sequencing yielded a total of 111450 unigenes; half of them had significant similarity with proteins in the public database, and 20281 unigenes were mapped to 119 pathways. Deduced from the phylogenetic analysis based on single-copy genes, the group of Helwingia is closer with Euasterids II and rather than Euasterids, congruent with previous reports using plastid sequences. The odd flower architecture make H. Willd adapt to insect pollination by hosting those insects larger than the flower in size via leave, which has little common character that other insect pollination plants hold. Further the present transcriptome greatly riches genomics information of Helwingia species and nucleus genes based phylogenetic analysis also greatly improve the resolution and robustness of phylogenetic reconstruction in H. japonica.

  20. A SAGE based approach to human glomerular endothelium: defining the transcriptome, finding a novel molecule and highlighting endothelial diversity.

    PubMed

    Sengoelge, Guerkan; Winnicki, Wolfgang; Kupczok, Anne; von Haeseler, Arndt; Schuster, Michael; Pfaller, Walter; Jennings, Paul; Weltermann, Ansgar; Blake, Sophia; Sunder-Plassmann, Gere

    2014-08-27

    Large scale transcript analysis of human glomerular microvascular endothelial cells (HGMEC) has never been accomplished. We designed this study to define the transcriptome of HGMEC and facilitate a better characterization of these endothelial cells with unique features. Serial analysis of gene expression (SAGE) was used for its unbiased approach to quantitative acquisition of transcripts. We generated a HGMEC SAGE library consisting of 68,987 transcript tags. Then taking advantage of large public databases and advanced bioinformatics we compared the HGMEC SAGE library with a SAGE library of non-cultured ex vivo human glomeruli (44,334 tags) which contained endothelial cells. The 823 tags common to both which would have the potential to be expressed in vivo were subsequently checked against 822,008 tags from 16 non-glomerular endothelial SAGE libraries. This resulted in 268 transcript tags differentially overexpressed in HGMEC compared to non-glomerular endothelia. These tags were filtered using a set of criteria: never before shown in kidney or any type of endothelial cell, absent in all nephron regions except the glomerulus, more highly expressed than statistically expected in HGMEC. Neurogranin, a direct target of thyroid hormone action which had been thought to be brain specific and never shown in endothelial cells before, fulfilled these criteria. Its expression in glomerular endothelium in vitro and in vivo was then verified by real-time-PCR, sequencing and immunohistochemistry. Our results represent an extensive molecular characterization of HGMEC beyond a mere database, underline the endothelial heterogeneity, and propose neurogranin as a potential link in the kidney-thyroid axis.

  1. Genic insights from integrated human proteomics in GeneCards.

    PubMed

    Fishilevich, Simon; Zimmerman, Shahar; Kohn, Asher; Iny Stein, Tsippi; Olender, Tsviya; Kolker, Eugene; Safran, Marilyn; Lancet, Doron

    2016-01-01

    GeneCards is a one-stop shop for searchable human gene annotations (http://www.genecards.org/). Data are automatically mined from ∼120 sources and presented in an integrated web card for every human gene. We report the application of recent advances in proteomics to enhance gene annotation and classification in GeneCards. First, we constructed the Human Integrated Protein Expression Database (HIPED), a unified database of protein abundance in human tissues, based on the publically available mass spectrometry (MS)-based proteomics sources ProteomicsDB, Multi-Omics Profiling Expression Database, Protein Abundance Across Organisms and The MaxQuant DataBase. The integrated database, residing within GeneCards, compares favourably with its individual sources, covering nearly 90% of human protein-coding genes. For gene annotation and comparisons, we first defined a protein expression vector for each gene, based on normalized abundances in 69 normal human tissues. This vector is portrayed in the GeneCards expression section as a bar graph, allowing visual inspection and comparison. These data are juxtaposed with transcriptome bar graphs. Using the protein expression vectors, we further defined a pairwise metric that helps assess expression-based pairwise proximity. This new metric for finding functional partners complements eight others, including sharing of pathways, gene ontology (GO) terms and domains, implemented in the GeneCards Suite. In parallel, we calculated proteome-based differential expression, highlighting a subset of tissues that overexpress a gene and subserving gene classification. This textual annotation allows users of VarElect, the suite's next-generation phenotyper, to more effectively discover causative disease variants. Finally, we define the protein-RNA expression ratio and correlation as yet another attribute of every gene in each tissue, adding further annotative information. The results constitute a significant enhancement of several GeneCards sections and help promote and organize the genome-wide structural and functional knowledge of the human proteome. Database URL:http://www.genecards.org/. © The Author(s) 2016. Published by Oxford University Press.

  2. Creation of a Human Secretome: A Novel Composite Library of Human Secreted Proteins: Validation Using Ovarian Cancer Gene Expression Data and a Virtual Secretome Array.

    PubMed

    Vathipadiekal, Vinod; Wang, Victoria; Wei, Wei; Waldron, Levi; Drapkin, Ronny; Gillette, Michael; Skates, Steven; Birrer, Michael

    2015-11-01

    To generate a comprehensive "Secretome" of proteins potentially found in the blood and derive a virtual Affymetrix array. To validate the utility of this database for the discovery of novel serum-based biomarkers using ovarian cancer transcriptomic data. The secretome was constructed by aggregating the data from databases of known secreted proteins, transmembrane or membrane proteins, signal peptides, G-protein coupled receptors, or proteins existing in the extracellular region, and the virtual array was generated by mapping them to Affymetrix probeset identifiers. Whole-genome microarray data from ovarian cancer, normal ovarian surface epithelium, and fallopian tube epithelium were used to identify transcripts upregulated in ovarian cancer. We established the secretome from eight public databases and a virtual array consisting of 16,521 Affymetrix U133 Plus 2.0 probesets. Using ovarian cancer transcriptomic data, we identified candidate blood-based biomarkers for ovarian cancer and performed bioinformatic validation by demonstrating rediscovery of known biomarkers including CA125 and HE4. Two novel top biomarkers (FGF18 and GPR172A) were validated in serum samples from an independent patient cohort. We present the secretome, comprising the most comprehensive resource available for protein products that are potentially found in the blood. The associated virtual array can be used to translate gene-expression data into cancer biomarker discovery. A list of blood-based biomarkers for ovarian cancer detection is reported and includes CA125 and HE4. FGF18 and GPR172A were identified and validated by ELISA as being differentially expressed in the serum of ovarian cancer patients compared with controls. ©2015 American Association for Cancer Research.

  3. RAID: a comprehensive resource for human RNA-associated (RNA–RNA/RNA–protein) interaction

    PubMed Central

    Zhang, Xiaomeng; Wu, Deng; Chen, Liqun; Li, Xiang; Yang, Jinxurong; Fan, Dandan; Dong, Tingting; Liu, Mingyue; Tan, Puwen; Xu, Jintian; Yi, Ying; Wang, Yuting; Zou, Hua; Hu, Yongfei; Fan, Kaili; Kang, Juanjuan; Huang, Yan; Miao, Zhengqiang; Bi, Miaoman; Jin, Nana; Li, Kongning; Li, Xia; Xu, Jianzhen; Wang, Dong

    2014-01-01

    Transcriptomic analyses have revealed an unexpected complexity in the eukaryote transcriptome, which includes not only protein-coding transcripts but also an expanding catalog of noncoding RNAs (ncRNAs). Diverse coding and noncoding RNAs (ncRNAs) perform functions through interaction with each other in various cellular processes. In this project, we have developed RAID (http://www.rna-society.org/raid), an RNA-associated (RNA–RNA/RNA–protein) interaction database. RAID intends to provide the scientific community with all-in-one resources for efficient browsing and extraction of the RNA-associated interactions in human. This version of RAID contains more than 6100 RNA-associated interactions obtained by manually reviewing more than 2100 published papers, including 4493 RNA–RNA interactions and 1619 RNA–protein interactions. Each entry contains detailed information on an RNA-associated interaction, including RAID ID, RNA/protein symbol, RNA/protein categories, validated method, expressing tissue, literature references (Pubmed IDs), and detailed functional description. Users can query, browse, analyze, and manipulate RNA-associated (RNA–RNA/RNA–protein) interaction. RAID provides a comprehensive resource of human RNA-associated (RNA–RNA/RNA–protein) interaction network. Furthermore, this resource will help in uncovering the generic organizing principles of cellular function network. PMID:24803509

  4. De Novo Assembly of the Donkey White Blood Cell Transcriptome and a Comparative Analysis of Phenotype-Associated Genes between Donkeys and Horses

    PubMed Central

    Xie, Feng-Yun; Feng, Yu-Long; Wang, Hong-Hui; Ma, Yun-Feng; Yang, Yang; Wang, Yin-Chao; Shen, Wei; Pan, Qing-Jie; Yin, Shen; Sun, Yu-Jiang; Ma, Jun-Yu

    2015-01-01

    Prior to the mechanization of agriculture and labor-intensive tasks, humans used donkeys (Equus africanus asinus) for farm work and packing. However, as mechanization increased, donkeys have been increasingly raised for meat, milk, and fur in China. To maintain the development of the donkey industry, breeding programs should focus on traits related to these new uses. Compared to conventional marker-assisted breeding plans, genome- and transcriptome-based selection methods are more efficient and effective. To analyze the coding genes of the donkey genome, we assembled the transcriptome of donkey white blood cells de novo. Using transcriptomic deep-sequencing data, we identified 264,714 distinct donkey unigenes and predicted 38,949 protein fragments. We annotated the donkey unigenes by BLAST searches against the non-redundant (NR) protein database. We also compared the donkey protein sequences with those of the horse (E. caballus) and wild horse (E. przewalskii), and linked the donkey protein fragments with mammalian phenotypes. As the outer ear size of donkeys and horses are obviously different, we compared the outer ear size-associated proteins in donkeys and horses. We identified three ear size-associated proteins, HIC1, PRKRA, and KMT2A, with sequence differences among the donkey, horse, and wild horse loci. Since the donkey genome sequence has not been released, the de novo assembled donkey transcriptome is helpful for preliminary investigations of donkey cultivars and for genetic improvement. PMID:26208029

  5. De Novo Assembly of the Donkey White Blood Cell Transcriptome and a Comparative Analysis of Phenotype-Associated Genes between Donkeys and Horses.

    PubMed

    Xie, Feng-Yun; Feng, Yu-Long; Wang, Hong-Hui; Ma, Yun-Feng; Yang, Yang; Wang, Yin-Chao; Shen, Wei; Pan, Qing-Jie; Yin, Shen; Sun, Yu-Jiang; Ma, Jun-Yu

    2015-01-01

    Prior to the mechanization of agriculture and labor-intensive tasks, humans used donkeys (Equus africanus asinus) for farm work and packing. However, as mechanization increased, donkeys have been increasingly raised for meat, milk, and fur in China. To maintain the development of the donkey industry, breeding programs should focus on traits related to these new uses. Compared to conventional marker-assisted breeding plans, genome- and transcriptome-based selection methods are more efficient and effective. To analyze the coding genes of the donkey genome, we assembled the transcriptome of donkey white blood cells de novo. Using transcriptomic deep-sequencing data, we identified 264,714 distinct donkey unigenes and predicted 38,949 protein fragments. We annotated the donkey unigenes by BLAST searches against the non-redundant (NR) protein database. We also compared the donkey protein sequences with those of the horse (E. caballus) and wild horse (E. przewalskii), and linked the donkey protein fragments with mammalian phenotypes. As the outer ear size of donkeys and horses are obviously different, we compared the outer ear size-associated proteins in donkeys and horses. We identified three ear size-associated proteins, HIC1, PRKRA, and KMT2A, with sequence differences among the donkey, horse, and wild horse loci. Since the donkey genome sequence has not been released, the de novo assembled donkey transcriptome is helpful for preliminary investigations of donkey cultivars and for genetic improvement.

  6. TISSUES 2.0: an integrative web resource on mammalian tissue expression

    PubMed Central

    Palasca, Oana; Santos, Alberto; Stolte, Christian; Gorodkin, Jan; Jensen, Lars Juhl

    2018-01-01

    Abstract Physiological and molecular similarities between organisms make it possible to translate findings from simpler experimental systems—model organisms—into more complex ones, such as human. This translation facilitates the understanding of biological processes under normal or disease conditions. Researchers aiming to identify the similarities and differences between organisms at the molecular level need resources collecting multi-organism tissue expression data. We have developed a database of gene–tissue associations in human, mouse, rat and pig by integrating multiple sources of evidence: transcriptomics covering all four species and proteomics (human only), manually curated and mined from the scientific literature. Through a scoring scheme, these associations are made comparable across all sources of evidence and across organisms. Furthermore, the scoring produces a confidence score assigned to each of the associations. The TISSUES database (version 2.0) is publicly accessible through a user-friendly web interface and as part of the STRING app for Cytoscape. In addition, we analyzed the agreement between datasets, across and within organisms, and identified that the agreement is mainly affected by the quality of the datasets rather than by the technologies used or organisms compared. Database URL: http://tissues.jensenlab.org/ PMID:29617745

  7. Fasting and Fast Food Diet Play an Opposite Role in Mice Brain Aging.

    PubMed

    Castrogiovanni, Paola; Li Volti, Giovanni; Sanfilippo, Cristina; Tibullo, Daniele; Galvano, Fabio; Vecchio, Michele; Avola, Roberto; Barbagallo, Ignazio; Malaguarnera, Lucia; Castorina, Sergio; Musumeci, Giuseppe; Imbesi, Rosa; Di Rosa, Michelino

    2018-01-20

    Fasting may be exploited as a possible strategy for prevention and treatment of several diseases such as diabetes, obesity, and aging. On the other hand, high-fat diet (HFD) represents a risk factor for several diseases and increased mortality. The aim of the present study was to evaluate the impact of fasting on mouse brain aging transcriptome and how HFD regulates such pathways. We used the NCBI Gene Expression Omnibus (GEO) database, in order to identify suitable microarray datasets comparing mouse brain transcriptome under fasting or HFD vs aged mouse brain transcriptome. Three microarray datasets were selected for this study, GSE24504, GSE6285, and GSE8150, and the principal molecular mechanisms involved in this process were evaluated. This analysis showed that, regardless of fasting duration, mouse brain significantly expressed 21 and 30 upregulated and downregulated genes, respectively. The involved biological processes were related to cell cycle arrest, cell death inhibition, and regulation of cellular metabolism. Comparing mouse brain transcriptome under fasting and aged conditions, we found out that the number of genes in common increased with the duration of fasting (222 genes), peaking at 72 h. In addition, mouse brain transcriptome under HFD resembles for the 30% the one of the aged mice. Furthermore, several molecular processes were found to be shared between HFD and aging. In conclusion, we suggest that fasting and HFD play an opposite role in brain transcriptome of aged mice. Therefore, an intermittent diet could represent a possible clinical strategy to counteract aging, loss of memory, and neuroinflammation. Furthermore, low-fat diet leads to the inactivation of brain degenerative processes triggered by aging.

  8. ReprOlive: a database with linked data for the olive tree (Olea europaea L.) reproductive transcriptome

    PubMed Central

    Carmona, Rosario; Zafra, Adoración; Seoane, Pedro; Castro, Antonio J.; Guerrero-Fernández, Darío; Castillo-Castillo, Trinidad; Medina-García, Ana; Cánovas, Francisco M.; Aldana-Montes, José F.; Navas-Delgado, Ismael; Alché, Juan de Dios; Claros, M. Gonzalo

    2015-01-01

    Plant reproductive transcriptomes have been analyzed in different species due to the agronomical and biotechnological importance of plant reproduction. Here we presented an olive tree reproductive transcriptome database with samples from pollen and pistil at different developmental stages, and leaf and root as control vegetative tissues http://reprolive.eez.csic.es). It was developed from 2,077,309 raw reads to 1,549 Sanger sequences. Using a pre-defined workflow based on open-source tools, sequences were pre-processed, assembled, mapped, and annotated with expression data, descriptions, GO terms, InterPro signatures, EC numbers, KEGG pathways, ORFs, and SSRs. Tentative transcripts (TTs) were also annotated with the corresponding orthologs in Arabidopsis thaliana from TAIR and RefSeq databases to enable Linked Data integration. It results in a reproductive transcriptome comprising 72,846 contigs with average length of 686 bp, of which 63,965 (87.8%) included at least one functional annotation, and 55,356 (75.9%) had an ortholog. A minimum of 23,568 different TTs was identified and 5,835 of them contain a complete ORF. The representative reproductive transcriptome can be reduced to 28,972 TTs for further gene expression studies. Partial transcriptomes from pollen, pistil, and vegetative tissues as control were also constructed. ReprOlive provides free access and download capability to these results. Retrieval mechanisms for sequences and transcript annotations are provided. Graphical localization of annotated enzymes into KEGG pathways is also possible. Finally, ReprOlive has included a semantic conceptualisation by means of a Resource Description Framework (RDF) allowing a Linked Data search for extracting the most updated information related to enzymes, interactions, allergens, structures, and reactive oxygen species. PMID:26322066

  9. Transcriptional analysis of the Escherichia coli ColV-Ia plasmid pS88 during growth in human serum and urine.

    PubMed

    Lemaître, Chloé; Bidet, Philippe; Bingen, Edouard; Bonacorsi, Stéphane

    2012-06-21

    The sequenced O45:K1:H7 Escherichia coli meningitis strain S88 harbors a large virulence plasmid. To identify possible genetic determinants of pS88 virulence, we examined the transcriptomes of 88 plasmidic ORFs corresponding to known and putative virulence genes, and 35 ORFs of unknown function. Quantification of plasmidic transcripts was obtained by quantitative real-time reverse transcription of extracted RNA, normalized on three housekeeping genes. The transcriptome of E. coli strain S88 grown in human serum and urine ex vivo were compared to that obtained during growth in Luria Bertani broth, with and without iron depletion. We also analyzed the transcriptome of a pS88-like plasmid recovered from a neonate with urinary tract infection. The transcriptome obtained after ex vivo growth in serum and urine was very similar to those obtained in iron-depleted LB broth. Genes encoding iron acquisition systems were strongly upregulated. ShiF and ORF 123, two ORFs encoding protein with hypothetical function and physically linked to aerobactin and salmochelin loci, respectively, were also highly expressed in iron-depleted conditions and may correspond to ancillary iron acquisition genes. Four ORFs were induced ex vivo, independently of the iron concentration. Other putative virulence genes such as iss, etsC, ompTp and hlyF were not upregulated in any of the conditions studied. Transcriptome analysis of the pS88-like plasmid recovered in vivo showed a similar pattern of induction but at much higher levels. We identify new pS88 genes potentially involved in the growth of E. coli meningitis strain S88 in human serum and urine.

  10. SolEST database: a "one-stop shop" approach to the study of Solanaceae transcriptomes.

    PubMed

    D'Agostino, Nunzio; Traini, Alessandra; Frusciante, Luigi; Chiusano, Maria Luisa

    2009-11-30

    Since no genome sequences of solanaceous plants have yet been completed, expressed sequence tag (EST) collections represent a reliable tool for broad sampling of Solanaceae transcriptomes, an attractive route for understanding Solanaceae genome functionality and a powerful reference for the structural annotation of emerging Solanaceae genome sequences. We describe the SolEST database http://biosrv.cab.unina.it/solestdb which integrates different EST datasets from both cultivated and wild Solanaceae species and from two species of the genus Coffea. Background as well as processed data contained in the database, extensively linked to external related resources, represent an invaluable source of information for these plant families. Two novel features differentiate SolEST from other resources: i) the option of accessing and then visualizing Solanaceae EST/TC alignments along the emerging tomato and potato genome sequences; ii) the opportunity to compare different Solanaceae assemblies generated by diverse research groups in the attempt to address a common complaint in the SOL community. Different databases have been established worldwide for collecting Solanaceae ESTs and are related in concept, content and utility to the one presented herein. However, the SolEST database has several distinguishing features that make it appealing for the research community and facilitates a "one-stop shop" for the study of Solanaceae transcriptomes.

  11. A comprehensive collection of systems biology data characterizing the host response to viral infection

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Aevermann, Brian D.; Pickett, Brett E.; Kumar, Sanjeev

    The Systems Biology for Infectious Diseases Research program was established by the U.S. National Institute of Allergy and Infectious Diseases to investigate host-pathogen interactions at a systems level. This program generated 47 transcriptomic and proteomic datasets from 30 studies that investigate in vivo and in vitro host responses to viral infections. Human pathogens in the Orthomyxoviridae and Coronaviridae families, especially pandemic H1N1 and avian H5N1 influenza A viruses and severe acute respiratory syndrome coronavirus (SARS-CoV), were investigated. Study validation was demonstrated via experimental quality control measures and meta-analysis of independent experiments performed under similar conditions. Primary assay results are archivedmore » at the GEO and PeptideAtlas public repositories, while processed statistical results together with standardized metadata are publically available at the Influenza Research Database (www.fludb.org) and the Virus Pathogen Resource (www.viprbrc.org). As a result, by comparing data from mutant versus wild-type virus and host strains, RNA versus protein differential expression, and infection with genetically similar strains, these data can be used to further investigate genetic and physiological determinants of host responses to viral infection.« less

  12. A comprehensive collection of systems biology data characterizing the host response to viral infection

    DOE PAGES

    Aevermann, Brian D.; Pickett, Brett E.; Kumar, Sanjeev; ...

    2014-10-14

    The Systems Biology for Infectious Diseases Research program was established by the U.S. National Institute of Allergy and Infectious Diseases to investigate host-pathogen interactions at a systems level. This program generated 47 transcriptomic and proteomic datasets from 30 studies that investigate in vivo and in vitro host responses to viral infections. Human pathogens in the Orthomyxoviridae and Coronaviridae families, especially pandemic H1N1 and avian H5N1 influenza A viruses and severe acute respiratory syndrome coronavirus (SARS-CoV), were investigated. Study validation was demonstrated via experimental quality control measures and meta-analysis of independent experiments performed under similar conditions. Primary assay results are archivedmore » at the GEO and PeptideAtlas public repositories, while processed statistical results together with standardized metadata are publically available at the Influenza Research Database (www.fludb.org) and the Virus Pathogen Resource (www.viprbrc.org). As a result, by comparing data from mutant versus wild-type virus and host strains, RNA versus protein differential expression, and infection with genetically similar strains, these data can be used to further investigate genetic and physiological determinants of host responses to viral infection.« less

  13. A comprehensive collection of systems biology data characterizing the host response to viral infection.

    PubMed

    Aevermann, Brian D; Pickett, Brett E; Kumar, Sanjeev; Klem, Edward B; Agnihothram, Sudhakar; Askovich, Peter S; Bankhead, Armand; Bolles, Meagen; Carter, Victoria; Chang, Jean; Clauss, Therese R W; Dash, Pradyot; Diercks, Alan H; Eisfeld, Amie J; Ellis, Amy; Fan, Shufang; Ferris, Martin T; Gralinski, Lisa E; Green, Richard R; Gritsenko, Marina A; Hatta, Masato; Heegel, Robert A; Jacobs, Jon M; Jeng, Sophia; Josset, Laurence; Kaiser, Shari M; Kelly, Sara; Law, G Lynn; Li, Chengjun; Li, Jiangning; Long, Casey; Luna, Maria L; Matzke, Melissa; McDermott, Jason; Menachery, Vineet; Metz, Thomas O; Mitchell, Hugh; Monroe, Matthew E; Navarro, Garnet; Neumann, Gabriele; Podyminogin, Rebecca L; Purvine, Samuel O; Rosenberger, Carrie M; Sanders, Catherine J; Schepmoes, Athena A; Shukla, Anil K; Sims, Amy; Sova, Pavel; Tam, Vincent C; Tchitchek, Nicolas; Thomas, Paul G; Tilton, Susan C; Totura, Allison; Wang, Jing; Webb-Robertson, Bobbie-Jo; Wen, Ji; Weiss, Jeffrey M; Yang, Feng; Yount, Boyd; Zhang, Qibin; McWeeney, Shannon; Smith, Richard D; Waters, Katrina M; Kawaoka, Yoshihiro; Baric, Ralph; Aderem, Alan; Katze, Michael G; Scheuermann, Richard H

    2014-01-01

    The Systems Biology for Infectious Diseases Research program was established by the U.S. National Institute of Allergy and Infectious Diseases to investigate host-pathogen interactions at a systems level. This program generated 47 transcriptomic and proteomic datasets from 30 studies that investigate in vivo and in vitro host responses to viral infections. Human pathogens in the Orthomyxoviridae and Coronaviridae families, especially pandemic H1N1 and avian H5N1 influenza A viruses and severe acute respiratory syndrome coronavirus (SARS-CoV), were investigated. Study validation was demonstrated via experimental quality control measures and meta-analysis of independent experiments performed under similar conditions. Primary assay results are archived at the GEO and PeptideAtlas public repositories, while processed statistical results together with standardized metadata are publically available at the Influenza Research Database (www.fludb.org) and the Virus Pathogen Resource (www.viprbrc.org). By comparing data from mutant versus wild-type virus and host strains, RNA versus protein differential expression, and infection with genetically similar strains, these data can be used to further investigate genetic and physiological determinants of host responses to viral infection.

  14. A comprehensive collection of systems biology data characterizing the host response to viral infection

    PubMed Central

    Aevermann, Brian D.; Pickett, Brett E.; Kumar, Sanjeev; Klem, Edward B.; Agnihothram, Sudhakar; Askovich, Peter S.; Bankhead, Armand; Bolles, Meagen; Carter, Victoria; Chang, Jean; Clauss, Therese R.W.; Dash, Pradyot; Diercks, Alan H.; Eisfeld, Amie J.; Ellis, Amy; Fan, Shufang; Ferris, Martin T.; Gralinski, Lisa E.; Green, Richard R.; Gritsenko, Marina A.; Hatta, Masato; Heegel, Robert A.; Jacobs, Jon M.; Jeng, Sophia; Josset, Laurence; Kaiser, Shari M.; Kelly, Sara; Law, G. Lynn; Li, Chengjun; Li, Jiangning; Long, Casey; Luna, Maria L.; Matzke, Melissa; McDermott, Jason; Menachery, Vineet; Metz, Thomas O.; Mitchell, Hugh; Monroe, Matthew E.; Navarro, Garnet; Neumann, Gabriele; Podyminogin, Rebecca L.; Purvine, Samuel O.; Rosenberger, Carrie M.; Sanders, Catherine J.; Schepmoes, Athena A.; Shukla, Anil K.; Sims, Amy; Sova, Pavel; Tam, Vincent C.; Tchitchek, Nicolas; Thomas, Paul G.; Tilton, Susan C.; Totura, Allison; Wang, Jing; Webb-Robertson, Bobbie-Jo; Wen, Ji; Weiss, Jeffrey M.; Yang, Feng; Yount, Boyd; Zhang, Qibin; McWeeney, Shannon; Smith, Richard D.; Waters, Katrina M.; Kawaoka, Yoshihiro; Baric, Ralph; Aderem, Alan; Katze, Michael G.; Scheuermann, Richard H.

    2014-01-01

    The Systems Biology for Infectious Diseases Research program was established by the U.S. National Institute of Allergy and Infectious Diseases to investigate host-pathogen interactions at a systems level. This program generated 47 transcriptomic and proteomic datasets from 30 studies that investigate in vivo and in vitro host responses to viral infections. Human pathogens in the Orthomyxoviridae and Coronaviridae families, especially pandemic H1N1 and avian H5N1 influenza A viruses and severe acute respiratory syndrome coronavirus (SARS-CoV), were investigated. Study validation was demonstrated via experimental quality control measures and meta-analysis of independent experiments performed under similar conditions. Primary assay results are archived at the GEO and PeptideAtlas public repositories, while processed statistical results together with standardized metadata are publically available at the Influenza Research Database (www.fludb.org) and the Virus Pathogen Resource (www.viprbrc.org). By comparing data from mutant versus wild-type virus and host strains, RNA versus protein differential expression, and infection with genetically similar strains, these data can be used to further investigate genetic and physiological determinants of host responses to viral infection. PMID:25977790

  15. Transcriptome analysis in cotton boll weevil (Anthonomus grandis) and RNA interference in insect pests.

    PubMed

    Firmino, Alexandre Augusto Pereira; Fonseca, Fernando Campos de Assis; de Macedo, Leonardo Lima Pepino; Coelho, Roberta Ramos; Antonino de Souza, José Dijair; Togawa, Roberto Coiti; Silva-Junior, Orzenil Bonfim; Pappas, Georgios Joannis; da Silva, Maria Cristina Mattar; Engler, Gilbert; Grossi-de-Sa, Maria Fatima

    2013-01-01

    Cotton plants are subjected to the attack of several insect pests. In Brazil, the cotton boll weevil, Anthonomus grandis, is the most important cotton pest. The use of insecticidal proteins and gene silencing by interference RNA (RNAi) as techniques for insect control are promising strategies, which has been applied in the last few years. For this insect, there are not much available molecular information on databases. Using 454-pyrosequencing methodology, the transcriptome of all developmental stages of the insect pest, A. grandis, was analyzed. The A. grandis transcriptome analysis resulted in more than 500.000 reads and a data set of high quality 20,841 contigs. After sequence assembly and annotation, around 10,600 contigs had at least one BLAST hit against NCBI non-redundant protein database and 65.7% was similar to Tribolium castaneum sequences. A comparison of A. grandis, Drosophila melanogaster and Bombyx mori protein families' data showed higher similarity to dipteran than to lepidopteran sequences. Several contigs of genes encoding proteins involved in RNAi mechanism were found. PAZ Domains sequences extracted from the transcriptome showed high similarity and conservation for the most important functional and structural motifs when compared to PAZ Domains from 5 species. Two SID-like contigs were phylogenetically analyzed and grouped with T. castaneum SID-like proteins. No RdRP gene was found. A contig matching chitin synthase 1 was mined from the transcriptome. dsRNA microinjection of a chitin synthase gene to A. grandis female adults resulted in normal oviposition of unviable eggs and malformed alive larvae that were unable to develop in artificial diet. This is the first study that characterizes the transcriptome of the coleopteran, A. grandis. A new and representative transcriptome database for this insect pest is now available. All data support the state of the art of RNAi mechanism in insects.

  16. Transcriptome Analysis in Cotton Boll Weevil (Anthonomus grandis) and RNA Interference in Insect Pests

    PubMed Central

    Coelho, Roberta Ramos; Antonino de Souza Jr, José Dijair; Togawa, Roberto Coiti; Silva-Junior, Orzenil Bonfim; Pappas-Jr, Georgios Joannis; da Silva, Maria Cristina Mattar; Engler, Gilbert; Grossi-de-Sa, Maria Fatima

    2013-01-01

    Cotton plants are subjected to the attack of several insect pests. In Brazil, the cotton boll weevil, Anthonomus grandis, is the most important cotton pest. The use of insecticidal proteins and gene silencing by interference RNA (RNAi) as techniques for insect control are promising strategies, which has been applied in the last few years. For this insect, there are not much available molecular information on databases. Using 454-pyrosequencing methodology, the transcriptome of all developmental stages of the insect pest, A. grandis, was analyzed. The A. grandis transcriptome analysis resulted in more than 500.000 reads and a data set of high quality 20,841 contigs. After sequence assembly and annotation, around 10,600 contigs had at least one BLAST hit against NCBI non-redundant protein database and 65.7% was similar to Tribolium castaneum sequences. A comparison of A. grandis, Drosophila melanogaster and Bombyx mori protein families’ data showed higher similarity to dipteran than to lepidopteran sequences. Several contigs of genes encoding proteins involved in RNAi mechanism were found. PAZ Domains sequences extracted from the transcriptome showed high similarity and conservation for the most important functional and structural motifs when compared to PAZ Domains from 5 species. Two SID-like contigs were phylogenetically analyzed and grouped with T. castaneum SID-like proteins. No RdRP gene was found. A contig matching chitin synthase 1 was mined from the transcriptome. dsRNA microinjection of a chitin synthase gene to A. grandis female adults resulted in normal oviposition of unviable eggs and malformed alive larvae that were unable to develop in artificial diet. This is the first study that characterizes the transcriptome of the coleopteran, A. grandis. A new and representative transcriptome database for this insect pest is now available. All data support the state of the art of RNAi mechanism in insects. PMID:24386449

  17. De novo assembly, characterization and functional annotation of pineapple fruit transcriptome through massively parallel sequencing.

    PubMed

    Ong, Wen Dee; Voo, Lok-Yung Christopher; Kumar, Vijay Subbiah

    2012-01-01

    Pineapple (Ananas comosus var. comosus), is an important tropical non-climacteric fruit with high commercial potential. Understanding the mechanism and processes underlying fruit ripening would enable scientists to enhance the improvement of quality traits such as, flavor, texture, appearance and fruit sweetness. Although, the pineapple is an important fruit, there is insufficient transcriptomic or genomic information that is available in public databases. Application of high throughput transcriptome sequencing to profile the pineapple fruit transcripts is therefore needed. To facilitate this, we have performed transcriptome sequencing of ripe yellow pineapple fruit flesh using Illumina technology. About 4.7 millions Illumina paired-end reads were generated and assembled using the Velvet de novo assembler. The assembly produced 28,728 unique transcripts with a mean length of approximately 200 bp. Sequence similarity search against non-redundant NCBI database identified a total of 16,932 unique transcripts (58.93%) with significant hits. Out of these, 15,507 unique transcripts were assigned to gene ontology terms. Functional annotation against Kyoto Encyclopedia of Genes and Genomes pathway database identified 13,598 unique transcripts (47.33%) which were mapped to 126 pathways. The assembly revealed many transcripts that were previously unknown. The unique transcripts derived from this work have rapidly increased of the number of the pineapple fruit mRNA transcripts as it is now available in public databases. This information can be further utilized in gene expression, genomics and other functional genomics studies in pineapple.

  18. De Novo Assembly, Characterization and Functional Annotation of Pineapple Fruit Transcriptome through Massively Parallel Sequencing

    PubMed Central

    Ong, Wen Dee; Voo, Lok-Yung Christopher; Kumar, Vijay Subbiah

    2012-01-01

    Background Pineapple (Ananas comosus var. comosus), is an important tropical non-climacteric fruit with high commercial potential. Understanding the mechanism and processes underlying fruit ripening would enable scientists to enhance the improvement of quality traits such as, flavor, texture, appearance and fruit sweetness. Although, the pineapple is an important fruit, there is insufficient transcriptomic or genomic information that is available in public databases. Application of high throughput transcriptome sequencing to profile the pineapple fruit transcripts is therefore needed. Methodology/Principal Findings To facilitate this, we have performed transcriptome sequencing of ripe yellow pineapple fruit flesh using Illumina technology. About 4.7 millions Illumina paired-end reads were generated and assembled using the Velvet de novo assembler. The assembly produced 28,728 unique transcripts with a mean length of approximately 200 bp. Sequence similarity search against non-redundant NCBI database identified a total of 16,932 unique transcripts (58.93%) with significant hits. Out of these, 15,507 unique transcripts were assigned to gene ontology terms. Functional annotation against Kyoto Encyclopedia of Genes and Genomes pathway database identified 13,598 unique transcripts (47.33%) which were mapped to 126 pathways. The assembly revealed many transcripts that were previously unknown. Conclusions The unique transcripts derived from this work have rapidly increased of the number of the pineapple fruit mRNA transcripts as it is now available in public databases. This information can be further utilized in gene expression, genomics and other functional genomics studies in pineapple. PMID:23091603

  19. Workflow and web application for annotating NCBI BioProject transcriptome data

    PubMed Central

    Vera Alvarez, Roberto; Medeiros Vidal, Newton; Garzón-Martínez, Gina A.; Barrero, Luz S.; Landsman, David

    2017-01-01

    Abstract The volume of transcriptome data is growing exponentially due to rapid improvement of experimental technologies. In response, large central resources such as those of the National Center for Biotechnology Information (NCBI) are continually adapting their computational infrastructure to accommodate this large influx of data. New and specialized databases, such as Transcriptome Shotgun Assembly Sequence Database (TSA) and Sequence Read Archive (SRA), have been created to aid the development and expansion of centralized repositories. Although the central resource databases are under continual development, they do not include automatic pipelines to increase annotation of newly deposited data. Therefore, third-party applications are required to achieve that aim. Here, we present an automatic workflow and web application for the annotation of transcriptome data. The workflow creates secondary data such as sequencing reads and BLAST alignments, which are available through the web application. They are based on freely available bioinformatics tools and scripts developed in-house. The interactive web application provides a search engine and several browser utilities. Graphical views of transcript alignments are available through SeqViewer, an embedded tool developed by NCBI for viewing biological sequence data. The web application is tightly integrated with other NCBI web applications and tools to extend the functionality of data processing and interconnectivity. We present a case study for the species Physalis peruviana with data generated from BioProject ID 67621. Database URL: http://www.ncbi.nlm.nih.gov/projects/physalis/ PMID:28605765

  20. SAMMD: Staphylococcus aureus microarray meta-database.

    PubMed

    Nagarajan, Vijayaraj; Elasri, Mohamed O

    2007-10-02

    Staphylococcus aureus is an important human pathogen, causing a wide variety of diseases ranging from superficial skin infections to severe life threatening infections. S. aureus is one of the leading causes of nosocomial infections. Its ability to resist multiple antibiotics poses a growing public health problem. In order to understand the mechanism of pathogenesis of S. aureus, several global expression profiles have been developed. These transcriptional profiles included regulatory mutants of S. aureus and growth of wild type under different growth conditions. The abundance of these profiles has generated a large amount of data without a uniform annotation system to comprehensively examine them. We report the development of the Staphylococcus aureus Microarray meta-database (SAMMD) which includes data from all the published transcriptional profiles. SAMMD is a web-accessible database that helps users to perform a variety of analysis against and within the existing transcriptional profiles. SAMMD is a relational database that uses MySQL as the back end and PHP/JavaScript/DHTML as the front end. The database is normalized and consists of five tables, which holds information about gene annotations, regulated gene lists, experimental details, references, and other details. SAMMD data is collected from the peer-reviewed published articles. Data extraction and conversion was done using perl scripts while data entry was done through phpMyAdmin tool. The database is accessible via a web interface that contains several features such as a simple search by ORF ID, gene name, gene product name, advanced search using gene lists, comparing among datasets, browsing, downloading, statistics, and help. The database is licensed under General Public License (GPL). SAMMD is hosted and available at http://www.bioinformatics.org/sammd/. Currently there are over 9500 entries for regulated genes, from 67 microarray experiments. SAMMD will help staphylococcal scientists to analyze their expression data and understand it at global level. It will also allow scientists to compare and contrast their transcriptome to that of the other published transcriptomes.

  1. SAMMD: Staphylococcus aureus Microarray Meta-Database

    PubMed Central

    Nagarajan, Vijayaraj; Elasri, Mohamed O

    2007-01-01

    Background Staphylococcus aureus is an important human pathogen, causing a wide variety of diseases ranging from superficial skin infections to severe life threatening infections. S. aureus is one of the leading causes of nosocomial infections. Its ability to resist multiple antibiotics poses a growing public health problem. In order to understand the mechanism of pathogenesis of S. aureus, several global expression profiles have been developed. These transcriptional profiles included regulatory mutants of S. aureus and growth of wild type under different growth conditions. The abundance of these profiles has generated a large amount of data without a uniform annotation system to comprehensively examine them. We report the development of the Staphylococcus aureus Microarray meta-database (SAMMD) which includes data from all the published transcriptional profiles. SAMMD is a web-accessible database that helps users to perform a variety of analysis against and within the existing transcriptional profiles. Description SAMMD is a relational database that uses MySQL as the back end and PHP/JavaScript/DHTML as the front end. The database is normalized and consists of five tables, which holds information about gene annotations, regulated gene lists, experimental details, references, and other details. SAMMD data is collected from the peer-reviewed published articles. Data extraction and conversion was done using perl scripts while data entry was done through phpMyAdmin tool. The database is accessible via a web interface that contains several features such as a simple search by ORF ID, gene name, gene product name, advanced search using gene lists, comparing among datasets, browsing, downloading, statistics, and help. The database is licensed under General Public License (GPL). Conclusion SAMMD is hosted and available at . Currently there are over 9500 entries for regulated genes, from 67 microarray experiments. SAMMD will help staphylococcal scientists to analyze their expression data and understand it at global level. It will also allow scientists to compare and contrast their transcriptome to that of the other published transcriptomes. PMID:17910768

  2. Comparative Transcriptomics Highlights the Role of the Activator Protein 1 Transcription Factor in the Host Response to Ebolavirus

    PubMed Central

    Todd, Shawn; Boyd, Victoria; Tachedjian, Mary; Klein, Reuben; Shiell, Brian; Dearnley, Megan; McAuley, Alexander J.; Woon, Amanda P.; Purcell, Anthony W.; Marsh, Glenn A.; Baker, Michelle L.

    2017-01-01

    ABSTRACT Ebolavirus and Marburgvirus comprise two genera of negative-sense single-stranded RNA viruses that cause severe hemorrhagic fevers in humans. Despite considerable research efforts, the molecular events following Ebola virus (EBOV) infection are poorly understood. With the view of identifying host factors that underpin EBOV pathogenesis, we compared the transcriptomes of EBOV-infected human, pig, and bat kidney cells using a transcriptome sequencing (RNA-seq) approach. Despite a significant difference in viral transcription/replication between the cell lines, all cells responded to EBOV infection through a robust induction of extracellular growth factors. Furthermore, a significant upregulation of activator protein 1 (AP1) transcription factor complex members FOS and JUN was observed in permissive cell lines. Functional studies focusing on human cells showed that EBOV infection induces protein expression, phosphorylation, and nuclear accumulation of JUN and, to a lesser degree, FOS. Using a luciferase-based reporter, we show that EBOV infection induces AP1 transactivation activity within human cells at 48 and 72 h postinfection. Finally, we show that JUN knockdown decreases the expression of EBOV-induced host gene expression. Taken together, our study highlights the role of AP1 in promoting the host gene expression profile that defines EBOV pathogenesis. IMPORTANCE Many questions remain about the molecular events that underpin filovirus pathophysiology. The rational design of new intervention strategies, such as postexposure therapeutics, will be significantly enhanced through an in-depth understanding of these molecular events. We believe that new insights into the molecular pathogenesis of EBOV may be possible by examining the transcriptomic response of taxonomically diverse cell lines (derived from human, pig, and bat). We first identified the responsive pathways using an RNA-seq-based transcriptomics approach. Further functional and computational analysis focusing on human cells highlighted an important role for the AP1 transcription factor in mediating the transcriptional response to EBOV infection. Our study sheds new light on how host transcription factors respond to and promote the transcriptional landscape that follows viral infection. PMID:28931675

  3. Transcriptomic characterization of the novel avian-origin influenza A (H7N9) virus: specific host response and responses intermediate between avian (H5N1 and H7N7) and human (H3N2) viruses and implications for treatment options.

    PubMed

    Josset, Laurence; Zeng, Hui; Kelly, Sara M; Tumpey, Terrence M; Katze, Michael G

    2014-02-04

    A novel avian-origin H7N9 influenza A virus (IAV) emerged in China in 2013, causing mild to lethal human respiratory infections. H7N9 originated with multiple reassortment events between avian viruses and carries genetic markers of human adaptation. Determining whether H7N9 induces a host response closer to that with human or avian IAV is important in order to better characterize this emerging virus. Here we compared the human lung epithelial cell response to infection with A/Anhui/01/13 (H7N9) or highly pathogenic avian-origin H5N1, H7N7, or human seasonal H3N2 IAV. The transcriptomic response to H7N9 was highly specific to this strain but was more similar to the response to human H3N2 than to that to other avian IAVs. H7N9 and H3N2 both elicited responses related to eicosanoid signaling and chromatin modification, whereas H7N9 specifically induced genes regulating the cell cycle and transcription. Among avian IAVs, the response to H7N9 was closest to that elicited by H5N1 virus. Host responses common to H7N9 and the other avian viruses included the lack of induction of the antigen presentation pathway and reduced proinflammatory cytokine induction compared to that with H3N2. Repression of these responses could have an important impact on the immunogenicity and virulence of H7N9 in humans. Finally, using a genome-based drug repurposing approach, we identified several drugs predicted to regulate the host response to H7N9 that may act as potential antivirals, including several kinase inhibitors, as well as FDA-approved drugs, such as troglitazone and minocycline. Importantly, we validated that minocycline inhibited H7N9 replication in vitro, suggesting that our computational approach holds promise for identifying novel antivirals. Whether H7N9 will be the next pandemic influenza virus or will persist and sporadically infect humans from its avian reservoir, similar to H5N1, is not known yet. High-throughput profiling of the host response to infection allows rapid characterization of virus-host interactions and generates many hypotheses that will accelerate understanding and responsiveness to this potential threat. We show that the cellular response to H7N9 virus is closer to that induced by H3N2 than to that induced by H5N1, reflecting the potential of this new virus for adaptation to humans. Importantly, dissecting the host response to H7N9 may guide host-directed antiviral development.

  4. De novo assembly of maritime pine transcriptome: implications for forest breeding and biotechnology.

    PubMed

    Canales, Javier; Bautista, Rocio; Label, Philippe; Gómez-Maldonado, Josefa; Lesur, Isabelle; Fernández-Pozo, Noe; Rueda-López, Marina; Guerrero-Fernández, Dario; Castro-Rodríguez, Vanessa; Benzekri, Hicham; Cañas, Rafael A; Guevara, María-Angeles; Rodrigues, Andreia; Seoane, Pedro; Teyssier, Caroline; Morel, Alexandre; Ehrenmann, François; Le Provost, Grégoire; Lalanne, Céline; Noirot, Céline; Klopp, Christophe; Reymond, Isabelle; García-Gutiérrez, Angel; Trontin, Jean-François; Lelu-Walter, Marie-Anne; Miguel, Celia; Cervera, María Teresa; Cantón, Francisco R; Plomion, Christophe; Harvengt, Luc; Avila, Concepción; Gonzalo Claros, M; Cánovas, Francisco M

    2014-04-01

    Maritime pine (Pinus pinasterAit.) is a widely distributed conifer species in Southwestern Europe and one of the most advanced models for conifer research. In the current work, comprehensive characterization of the maritime pine transcriptome was performed using a combination of two different next-generation sequencing platforms, 454 and Illumina. De novo assembly of the transcriptome provided a catalogue of 26 020 unique transcripts in maritime pine trees and a collection of 9641 full-length cDNAs. Quality of the transcriptome assembly was validated by RT-PCR amplification of selected transcripts for structural and regulatory genes. Transcription factors and enzyme-encoding transcripts were annotated. Furthermore, the available sequencing data permitted the identification of polymorphisms and the establishment of robust single nucleotide polymorphism (SNP) and simple-sequence repeat (SSR) databases for genotyping applications and integration of translational genomics in maritime pine breeding programmes. All our data are freely available at SustainpineDB, the P. pinaster expressional database. Results reported here on the maritime pine transcriptome represent a valuable resource for future basic and applied studies on this ecological and economically important pine species. © 2013 Society for Experimental Biology, Association of Applied Biologists and John Wiley & Sons Ltd.

  5. Epigenetic and genetic dissections of UV-induced global gene dysregulation in skin cells through multi-omics analyses

    PubMed Central

    Shen, Yao; Stanislauskas, Milda; Li, Gen; Zheng, Deyou; Liu, Liang

    2017-01-01

    To elucidate the complex molecular mechanisms underlying the adverse effects UV radiation (UVR) on skin homeostasis, we performed multi-omics studies to characterize UV-induced genetic and epigenetic changes. Human keratinocytes from a single donor treated with or without UVR were analyzed by RNA-seq, exome-seq, and H3K27ac ChIP-seq at 4 h and 72 h following UVR. Compared to the relatively moderate mutagenic effects of UVR, acute UV exposure induced substantial epigenomic and transcriptomic alterations, illuminating a previously underappreciated role of epigenomic and transcriptomic instability in skin pathogenesis. Integration of the multi-omics data revealed that UVR-induced transcriptional dysregulation of a subset of genes was attributable to either genetic mutations or global redistribution of H3K27ac. H3K27ac redistribution further led to the formation of distinctive super enhancers in UV-irradiated cells. Our analysis also identified several new UV target genes, including CYP24A1, GJA5, SLAMF7 and ETV1, which were frequently dysregulated in human squamous cell carcinomas, highlighting their potential as new molecular targets for prevention or treatment of UVR-induced skin cancers. Taken together, our concurrent multi-omics analyses provide new mechanistic insights into the complex molecular networks underlying UV photobiological effects, which have important implications in understanding its impact on skin homeostasis and pathogenesis. PMID:28211524

  6. Sequence comparison of prefrontal cortical brain transcriptome from a tame and an aggressive silver fox (Vulpes vulpes).

    PubMed

    Kukekova, Anna V; Johnson, Jennifer L; Teiling, Clotilde; Li, Lewyn; Oskina, Irina N; Kharlamova, Anastasiya V; Gulevich, Rimma G; Padte, Ravee; Dubreuil, Michael M; Vladimirova, Anastasiya V; Shepeleva, Darya V; Shikhevich, Svetlana G; Sun, Qi; Ponnala, Lalit; Temnykh, Svetlana V; Trut, Lyudmila N; Acland, Gregory M

    2011-10-03

    Two strains of the silver fox (Vulpes vulpes), with markedly different behavioral phenotypes, have been developed by long-term selection for behavior. Foxes from the tame strain exhibit friendly behavior towards humans, paralleling the sociability of canine puppies, whereas foxes from the aggressive strain are defensive and exhibit aggression to humans. To understand the genetic differences underlying these behavioral phenotypes fox-specific genomic resources are needed. cDNA from mRNA from pre-frontal cortex of a tame and an aggressive fox was sequenced using the Roche 454 FLX Titanium platform (> 2.5 million reads & 0.9 Gbase of tame fox sequence; >3.3 million reads & 1.2 Gbase of aggressive fox sequence). Over 80% of the fox reads were assembled into contigs. Mapping fox reads against the fox transcriptome assembly and the dog genome identified over 30,000 high confidence fox-specific SNPs. Fox transcripts for approximately 14,000 genes were identified using SwissProt and the dog RefSeq databases. An at least 2-fold expression difference between the two samples (p < 0.05) was observed for 335 genes, fewer than 3% of the total number of genes identified in the fox transcriptome. Transcriptome sequencing significantly expanded genomic resources available for the fox, a species without a sequenced genome. In a very cost efficient manner this yielded a large number of fox-specific SNP markers for genetic studies and provided significant insights into the gene expression profile of the fox pre-frontal cortex; expression differences between the two fox samples; and a catalogue of potentially important gene-specific sequence variants. This result demonstrates the utility of this approach for developing genomic resources in species with limited genomic information.

  7. Sequence comparison of prefrontal cortical brain transcriptome from a tame and an aggressive silver fox (Vulpes vulpes)

    PubMed Central

    2011-01-01

    Background Two strains of the silver fox (Vulpes vulpes), with markedly different behavioral phenotypes, have been developed by long-term selection for behavior. Foxes from the tame strain exhibit friendly behavior towards humans, paralleling the sociability of canine puppies, whereas foxes from the aggressive strain are defensive and exhibit aggression to humans. To understand the genetic differences underlying these behavioral phenotypes fox-specific genomic resources are needed. Results cDNA from mRNA from pre-frontal cortex of a tame and an aggressive fox was sequenced using the Roche 454 FLX Titanium platform (> 2.5 million reads & 0.9 Gbase of tame fox sequence; >3.3 million reads & 1.2 Gbase of aggressive fox sequence). Over 80% of the fox reads were assembled into contigs. Mapping fox reads against the fox transcriptome assembly and the dog genome identified over 30,000 high confidence fox-specific SNPs. Fox transcripts for approximately 14,000 genes were identified using SwissProt and the dog RefSeq databases. An at least 2-fold expression difference between the two samples (p < 0.05) was observed for 335 genes, fewer than 3% of the total number of genes identified in the fox transcriptome. Conclusions Transcriptome sequencing significantly expanded genomic resources available for the fox, a species without a sequenced genome. In a very cost efficient manner this yielded a large number of fox-specific SNP markers for genetic studies and provided significant insights into the gene expression profile of the fox pre-frontal cortex; expression differences between the two fox samples; and a catalogue of potentially important gene-specific sequence variants. This result demonstrates the utility of this approach for developing genomic resources in species with limited genomic information. PMID:21967120

  8. Transcriptome Analysis and Its Application in Identifying Genes Associated with Fruiting Body Development in Basidiomycete Hypsizygus marmoreus

    PubMed Central

    Chen, Hui; Zhao, Mingwen; Shi, Liang; Chen, Mingjie; Wang, Hong; Feng, Zhiyong

    2015-01-01

    To elucidate the mechanisms of fruit body development in H. marmoreus, a total of 43609521 high-quality RNA-seq reads were obtained from four developmental stages, including the mycelial knot (H-M), mycelial pigmentation (H-V), primordium (H-P) and fruiting body (H-F) stages. These reads were assembled to obtain 40568 unigenes with an average length of 1074 bp. A total of 26800 (66.06%) unigenes were annotated and analyzed with the Kyoto Encyclopedia of Genes and Genomes (KEGG), Gene Ontology (GO), and Eukaryotic Orthologous Group (KOG) databases. Differentially expressed genes (DEGs) from the four transcriptomes were analyzed. The KEGG enrichment analysis revealed that the mycelium pigmentation stage was associated with the MAPK, cAMP, and blue light signal transduction pathways. In addition, expression of the two-component system members changed with the transition from H-M to H-V, suggesting that light affected the expression of genes related to fruit body initiation in H. marmoreus. During the transition from H-V to H-P, stress signals associated with MAPK, cAMP and ROS signals might be the most important inducers. Our data suggested that nitrogen starvation might be one of the most important factors in promoting fruit body maturation, and nitrogen metabolism and mTOR signaling pathway were associated with this process. In addition, 30 genes of interest were analyzed by quantitative real-time PCR to verify their expression profiles at the four developmental stages. This study advances our understanding of the molecular mechanism of fruiting body development in H. marmoreus by identifying a wealth of new genes that may play important roles in mushroom morphogenesis. PMID:25837428

  9. Human DBR1 modulates the recycling of snRNPs to affect alternative RNA splicing and contributes to the suppression of cancer development.

    PubMed

    Han, B; Park, H K; Ching, T; Panneerselvam, J; Wang, H; Shen, Y; Zhang, J; Li, L; Che, R; Garmire, L; Fei, P

    2017-09-21

    The contribution of RNA processing to tumorigenesis is understudied. Here, we report that the human RNA debranching enzyme (hDBR1), when inappropriately regulated, induces oncogenesis by causing RNA processing defects, for example, splicing defects. We found that wild-type p53 and hypoxia-inducible factor 1 co-regulate hDBR1 expression, and insufficient hDBR1 leads to a higher rate of exon skipping. Transcriptomic sequencing confirmed the effect of hDBR1 on RNA splicing, and metabolite profiling supported the observation that neoplasm is triggered by a decrease in hDBR1 expression both in vitro and in vivo. Most importantly, when modulating the expression of hDBR1, which was found to be generally low in malignant human tissues, higher expression of hDBR1 only affected exon-skipping activity in malignant cells. Together, our findings demonstrate previously unrecognized regulation and functions of hDBR1, with immediate clinical implications regarding the regulation of hDBR1 as an effective strategy for combating human cancer.

  10. Developing Single Nucleotide Polymorphism (SNP) markers from transcriptome sequences for the identification of longan (Dimocarpus longan) germplasm

    USDA-ARS?s Scientific Manuscript database

    Longan (Dimocarpus longan Lour.) is an important tropical fruit tree crop. Accurate varietal identification is essential for germplasm management and breeding. Using longan transcriptome sequences from public databases, we developed single nucleotide polymorphism (SNP) markers; validated 60 SNPs in...

  11. Characterization of the heart transcriptome of the white shark (Carcharodon carcharias)

    PubMed Central

    2013-01-01

    Background The white shark (Carcharodon carcharias) is a globally distributed, apex predator possessing physical, physiological, and behavioral traits that have garnered it significant public attention. In addition to interest in the genetic basis of its form and function, as a representative of the oldest extant jawed vertebrate lineage, white sharks are also of conservation concern due to their small population size and threat from overfishing. Despite this, surprisingly little is known about the biology of white sharks, and genomic resources are unavailable. To address this deficit, we combined Roche-454 and Illumina sequencing technologies to characterize the first transciptome of any tissue for this species. Results From white shark heart cDNA we generated 665,399 Roche 454 reads (median length 387-bp) that were assembled into 141,626 contigs (mean length 503-bp). We also generated 78,566,588 Illumina reads, which we aligned to the 454 contigs producing 105,014 454/Illumina consensus sequences. To these, we added 3,432 non-singleton 454 contigs. By comparing these sequences to the UniProtKB/Swiss-Prot database we were able to annotate 21,019 translated open reading frames (ORFs) of ≥ 20 amino acids. Of these, 19,277 were additionally assigned Gene Ontology (GO) functional annotations. While acknowledging the limitations of our single tissue transcriptome, Fisher tests showed the white shark transcriptome to be significantly enriched for numerous metabolic GO terms compared to the zebra fish and human transcriptomes, with white shark showing more similarity to human than to zebra fish (i.e. fewer terms were significantly different). We also compared the transcriptome to other available elasmobranch sequences, for signatures of positive selection and identified several genes of putative adaptive significance on the white shark lineage. The white shark transcriptome also contained 8,404 microsatellites (dinucleotide, trinucleotide, or tetranucleotide motifs ≥ five perfect repeats). Detailed characterization of these microsatellites showed that ORFs with trinucleotide repeats, were significantly enriched for transcription regulatory roles and that trinucleotide frequency within ORFs was lower than for a wide range of taxonomic groups including other vertebrates. Conclusion The white shark heart transcriptome represents a valuable resource for future elasmobranch functional and comparative genomic studies, as well as for population and other biological studies vital for effective conservation of this globally vulnerable species. PMID:24112713

  12. Characterization of the heart transcriptome of the white shark (Carcharodon carcharias).

    PubMed

    Richards, Vincent P; Suzuki, Haruo; Stanhope, Michael J; Shivji, Mahmood S

    2013-10-11

    The white shark (Carcharodon carcharias) is a globally distributed, apex predator possessing physical, physiological, and behavioral traits that have garnered it significant public attention. In addition to interest in the genetic basis of its form and function, as a representative of the oldest extant jawed vertebrate lineage, white sharks are also of conservation concern due to their small population size and threat from overfishing. Despite this, surprisingly little is known about the biology of white sharks, and genomic resources are unavailable. To address this deficit, we combined Roche-454 and Illumina sequencing technologies to characterize the first transciptome of any tissue for this species. From white shark heart cDNA we generated 665,399 Roche 454 reads (median length 387-bp) that were assembled into 141,626 contigs (mean length 503-bp). We also generated 78,566,588 Illumina reads, which we aligned to the 454 contigs producing 105,014 454/Illumina consensus sequences. To these, we added 3,432 non-singleton 454 contigs. By comparing these sequences to the UniProtKB/Swiss-Prot database we were able to annotate 21,019 translated open reading frames (ORFs) of ≥ 20 amino acids. Of these, 19,277 were additionally assigned Gene Ontology (GO) functional annotations. While acknowledging the limitations of our single tissue transcriptome, Fisher tests showed the white shark transcriptome to be significantly enriched for numerous metabolic GO terms compared to the zebra fish and human transcriptomes, with white shark showing more similarity to human than to zebra fish (i.e. fewer terms were significantly different). We also compared the transcriptome to other available elasmobranch sequences, for signatures of positive selection and identified several genes of putative adaptive significance on the white shark lineage. The white shark transcriptome also contained 8,404 microsatellites (dinucleotide, trinucleotide, or tetranucleotide motifs ≥ five perfect repeats). Detailed characterization of these microsatellites showed that ORFs with trinucleotide repeats, were significantly enriched for transcription regulatory roles and that trinucleotide frequency within ORFs was lower than for a wide range of taxonomic groups including other vertebrates. The white shark heart transcriptome represents a valuable resource for future elasmobranch functional and comparative genomic studies, as well as for population and other biological studies vital for effective conservation of this globally vulnerable species.

  13. Genome-wide analysis of human constitutive androstane receptor (CAR) transcriptome in wild-type and CAR-knockout HepaRG cells.

    PubMed

    Li, Daochuan; Mackowiak, Bryan; Brayman, Timothy G; Mitchell, Michael; Zhang, Lei; Huang, Shiew-Mei; Wang, Hongbing

    2015-11-01

    The constitutive androstane receptor (CAR) modulates the transcription of numerous genes involving drug metabolism, energy homeostasis, and cell proliferation. Most functions of CAR however were defined from animal studies. Given the known species difference of CAR and the significant cross-talk between CAR and the pregnane X receptor (PXR), it is extremely difficult to decipher the exact role of human CAR (hCAR) in gene regulation, relying predominantly on pharmacological manipulations. Here, utilizing a newly generated hCAR-knockout (KO) HepaRG cell line, we carried out RNA-seq analysis of the global transcriptomes in wild-type (WT) and hCAR-KO HepaRG cells treated with CITCO, a selective hCAR agonist, phenobarbital (PB), a dual activator of hCAR and hPXR, or vehicle control. Real-time PCR assays in separate experiments were used to validate RNA-seq findings. Our results indicate that genes encoding drug-metabolizing enzymes are among the main clusters altered by both CITCO and PB. Specifically, CITCO significantly changed the expression of 135 genes in an hCAR-dependent manner, while PB altered the expression of 227 genes in WT cells of which 94 were simultaneously modulated in both cell lines reflecting dual effects of PB on hCAR/PXR. Notably, we found that many genes promoting cell proliferation and tumorigenesis were up-regulated in hCAR-KO cells, suggesting that hCAR may play an important role in cell growth that differs from mouse CAR. Together, our results reveal both novel and known targets of hCAR and support the role of hCAR in maintaining the homeostasis of metabolism and cell proliferation in the liver. Copyright © 2015 Elsevier Inc. All rights reserved.

  14. Desiccation tolerance in bryophytes: The dehydration and rehydration transcriptomes in the desiccation-tolerant bryophyte Bryum argenteum.

    PubMed

    Gao, Bei; Li, Xiaoshuang; Zhang, Daoyuan; Liang, Yuqing; Yang, Honglan; Chen, Moxian; Zhang, Yuanming; Zhang, Jianhua; Wood, Andrew J

    2017-08-08

    The desiccation tolerant bryophyte Bryum argenteum is an important component of desert biological soil crusts (BSCs) and is emerging as a model system for studying vegetative desiccation tolerance. Here we present and analyze the hydration-dehydration-rehydration transcriptomes in B. argenteum to establish a desiccation-tolerance transcriptomic atlas. B. argenteum gametophores representing five different hydration stages (hydrated (H0), dehydrated for 2 h (D2), 24 h (D24), then rehydrated for 2 h (R2) and 48 h (R48)), were sampled for transcriptome analyses. Illumina high throughput RNA-Seq technology was employed and generated more than 488.46 million reads. An in-house de novo transcriptome assembly optimization pipeline based on Trinity assembler was developed to obtain a reference Hydration-Dehydration-Rehydration (H-D-R) transcriptome comprising of 76,206 transcripts, with an N50 of 2,016 bp and average length of 1,222 bp. Comprehensive transcription factor (TF) annotation discovered 978 TFs in 62 families, among which 404 TFs within 40 families were differentially expressed upon dehydration-rehydration. Pfam term enrichment analysis revealed 172 protein families/domains were significantly associated with the H-D-R cycle and confirmed early rehydration (i.e. the R2 stage) as exhibiting the maximum stress-induced changes in gene expression.

  15. Conserved and Divergent Rhythms of Crassulacean Acid Metabolism-Related and Core Clock Gene Expression in the Cactus Opuntia ficus-indica1[C][W

    PubMed Central

    Mallona, Izaskun; Egea-Cortines, Marcos; Weiss, Julia

    2011-01-01

    The cactus Opuntia ficus-indica is a constitutive Crassulacean acid metabolism (CAM) species. Current knowledge of CAM metabolism suggests that the enzyme phosphoenolpyruvate carboxylase kinase (PPCK) is circadian regulated at the transcriptional level, whereas phosphoenolpyruvate carboxylase (PEPC), malate dehydrogenase (MDH), NADP-malic enzyme (NADP-ME), and pyruvate phosphate dikinase (PPDK) are posttranslationally controlled. As little transcriptomic data are available from obligate CAM plants, we created an expressed sequence tag database derived from different organs and developmental stages. Sequences were assembled, compared with sequences in the National Center for Biotechnology Information nonredundant database for identification of putative orthologs, and mapped using Kyoto Encyclopedia of Genes and Genomes Orthology and Gene Ontology. We identified genes involved in circadian regulation and CAM metabolism for transcriptomic analysis in plants grown in long days. We identified stable reference genes for quantitative polymerase chain reaction and found that OfiSAND, like its counterpart in Arabidopsis (Arabidopsis thaliana), and OfiTUB are generally appropriate standards for use in the quantification of gene expression in O. ficus-indica. Three kinds of expression profiles were found: transcripts of OfiPPCK oscillated with a 24-h periodicity; transcripts of the light-active OfiNADP-ME and OfiPPDK genes adapted to 12-h cycles, while transcript accumulation patterns of OfiPEPC and OfiMDH were arrhythmic. Expression of the circadian clock gene OfiTOC1, similar to Arabidopsis, oscillated with a 24-h periodicity, peaking at night. Expression of OfiCCA1 and OfiPRR9, unlike in Arabidopsis, adapted best to a 12-h rhythm, suggesting that circadian clock gene interactions differ from those of Arabidopsis. Our results indicate that the evolution of CAM metabolism could be the result of modified circadian regulation at both the transcriptional and posttranscriptional levels. PMID:21677095

  16. Transcriptome dynamics of human pluripotent stem cell-derived contracting cardiomyocytes using an embryoid body model with fetal bovine serum.

    PubMed

    Jung, Kwang Bo; Son, Ye Seul; Lee, Hana; Jung, Cho-Rok; Kim, Janghwan; Son, Mi-Young

    2017-07-25

    Cardiomyocyte (CM) differentiation techniques for generating adult-like mature CMs remain imperfect, and the plausible underlying mechanisms remain unclear; however, there are a number of current protocols available. Here, to explore the mechanisms controlling cardiac differentiation, we analyzed the genome-wide transcription dynamics occurring during the differentiation of human pluripotent stem cells (hPSCs) into CMs using embryoid body (EB) formation. We optimized and updated the protocol to efficiently generate contracting CMs from hPSCs by adding fetal bovine serum (FBS) as a medium supplement, which could have a significant impact on the efficiency of cardiac differentiation. To identify genes, biological processes, and pathways involved in the cardiac differentiation of hPSCs, integrative and comparative analyses of the transcriptome profiles of differentiated CMs from hPSCs and of control CMs of the adult human heart (CM-AHH) were performed using gene ontology, functional annotation clustering, and pathway analyses. Several genes commonly regulated in the differentiated CMs and CM-AHH were enriched in pathways related to cell cycle and nucleotide metabolism. Strikingly, we found that current differentiation protocols did not promote sufficient expression of genes involved in oxidative phosphorylation to differentiate CMs from hPSCs compared to the expression levels in CM-AHH. Therefore, to obtain mature CMs similar to CM-AHH, these deficient pathways in CM differentiation, such as energy-related pathways, must be augmented prior to use for in vitro and in vivo applications. This approach opens up new avenues for facilitating the utilization of hPSC-derived CMs in biomedical research, drug evaluation, and clinical applications for patients with cardiac failure.

  17. Whole Transcriptome Analysis Provides Insights into Molecular Mechanisms for Molting in Litopenaeus vannamei

    PubMed Central

    Gao, Yi; Zhang, Xiaojun; Wei, Jiankai; Sun, Xiaoqing; Yuan, Jianbo; Li, Fuhua; Xiang, Jianhai

    2015-01-01

    Molting is one of the most important biological processes in shrimp growth and development. All shrimp undergo cyclic molting periodically to shed and replace their exoskeletons. This process is essential for growth, metamorphosis, and reproduction in shrimp. However, the molecular mechanisms underlying shrimp molting remain poorly understood. In this study, we investigated global expression changes in the transcriptomes of the Pacific white shrimp, Litopenaeus vannamei, the most commonly cultured shrimp species worldwide. The transcriptome of whole L. vannamei was investigated by RNA-sequencing (RNA-seq) throughout the molting cycle, including the inter-molt (C), pre-molt (D0, D1, D2, D3, D4), and post-molt (P1 and P2) stages, and 93,756 unigenes were identified. Among these genes, we identified 5,117 genes differentially expressed (log2ratio ≥1 and FDR ≤0.001) in adjacent molt stages. The results were compared against the National Center for Biotechnology Information (NCBI) non-redundant protein/nucleotide sequence database, Swiss-Prot, PFAM database, the Gene Ontology database, and the Kyoto Encyclopedia of Genes and Genomes database in order to annotate gene descriptions, associate them with gene ontology terms, and assign them to pathways. The expression patterns for genes involved in several molecular events critical for molting, such as hormone regulation, triggering events, implementation phases, skelemin, immune responses were characterized and considered as mechanisms underlying molting in L. vannamei. Comparisons with transcriptomic analyses in other arthropods were also performed. The characterization of major transcriptional changes in genes involved in the molting cycle provides candidates for future investigation of the molecular mechanisms. The data generated in this study will serve as an important transcriptomic resource for the shrimp research community to facilitate gene and genome annotation and to characterize key molecular processes underlying shrimp development. PMID:26650402

  18. Genetic signatures of adaptation revealed from transcriptome sequencing of Arctic and red foxes.

    PubMed

    Kumar, Vikas; Kutschera, Verena E; Nilsson, Maria A; Janke, Axel

    2015-08-07

    The genus Vulpes (true foxes) comprises numerous species that inhabit a wide range of habitats and climatic conditions, including one species, the Arctic fox (Vulpes lagopus) which is adapted to the arctic region. A close relative to the Arctic fox, the red fox (Vulpes vulpes), occurs in subarctic to subtropical habitats. To study the genetic basis of their adaptations to different environments, transcriptome sequences from two Arctic foxes and one red fox individual were generated and analyzed for signatures of positive selection. In addition, the data allowed for a phylogenetic analysis and divergence time estimate between the two fox species. The de novo assembly of reads resulted in more than 160,000 contigs/transcripts per individual. Approximately 17,000 homologous genes were identified using human and the non-redundant databases. Positive selection analyses revealed several genes involved in various metabolic and molecular processes such as energy metabolism, cardiac gene regulation, apoptosis and blood coagulation to be under positive selection in foxes. Branch site tests identified four genes to be under positive selection in the Arctic fox transcriptome, two of which are fat metabolism genes. In the red fox transcriptome eight genes are under positive selection, including molecular process genes, notably genes involved in ATP metabolism. Analysis of the three transcriptomes and five Sanger re-sequenced genes in additional individuals identified a lower genetic variability within Arctic foxes compared to red foxes, which is consistent with distribution range differences and demographic responses to past climatic fluctuations. A phylogenomic analysis estimated that the Arctic and red fox lineages diverged about three million years ago. Transcriptome data are an economic way to generate genomic resources for evolutionary studies. Despite not representing an entire genome, this transcriptome analysis identified numerous genes that are relevant to arctic adaptation in foxes. Similar to polar bears, fat metabolism seems to play a central role in adaptation of Arctic foxes to the cold climate, as has been identified in the polar bear, another arctic specialist.

  19. De novo assembly and transcriptome analysis of the rubber tree (Hevea brasiliensis) and SNP markers development for rubber biosynthesis pathways.

    PubMed

    Mantello, Camila Campos; Cardoso-Silva, Claudio Benicio; da Silva, Carla Cristina; de Souza, Livia Moura; Scaloppi Junior, Erivaldo José; de Souza Gonçalves, Paulo; Vicentini, Renato; de Souza, Anete Pereira

    2014-01-01

    Hevea brasiliensis (Willd. Ex Adr. Juss.) Muell.-Arg. is the primary source of natural rubber that is native to the Amazon rainforest. The singular properties of natural rubber make it superior to and competitive with synthetic rubber for use in several applications. Here, we performed RNA sequencing (RNA-seq) of H. brasiliensis bark on the Illumina GAIIx platform, which generated 179,326,804 raw reads on the Illumina GAIIx platform. A total of 50,384 contigs that were over 400 bp in size were obtained and subjected to further analyses. A similarity search against the non-redundant (nr) protein database returned 32,018 (63%) positive BLASTx hits. The transcriptome analysis was annotated using the clusters of orthologous groups (COG), gene ontology (GO), Kyoto Encyclopedia of Genes and Genomes (KEGG), and Pfam databases. A search for putative molecular marker was performed to identify simple sequence repeats (SSRs) and single nucleotide polymorphisms (SNPs). In total, 17,927 SSRs and 404,114 SNPs were detected. Finally, we selected sequences that were identified as belonging to the mevalonate (MVA) and 2-C-methyl-D-erythritol 4-phosphate (MEP) pathways, which are involved in rubber biosynthesis, to validate the SNP markers. A total of 78 SNPs were validated in 36 genotypes of H. brasiliensis. This new dataset represents a powerful information source for rubber tree bark genes and will be an important tool for the development of microsatellites and SNP markers for use in future genetic analyses such as genetic linkage mapping, quantitative trait loci identification, investigations of linkage disequilibrium and marker-assisted selection.

  20. De Novo Assembly and Characterization of Fruit Transcriptome in Black Pepper (Piper nigrum)

    PubMed Central

    Hu, Lisong; Hao, Chaoyun; Fan, Rui; Wu, Baoduo; Tan, Lehe; Wu, Huasong

    2015-01-01

    Black pepper is one of the most popular and oldest spices in the world and valued for its pungent constituent alkaloids. Pinerine is the main bioactive compound in pepper alkaloids, which perform unique physiological functions. However, the mechanisms of piperine synthesis are poorly understood. This study is the first to describe the fruit transcriptome of black pepper by sequencing on Illumina HiSeq 2000 platform. A total of 56,281,710 raw reads were obtained and assembled. From these raw reads, 44,061 unigenes with an average length of 1,345 nt were generated. During functional annotation, 40,537 unigenes were annotated in Gene Ontology categories, Kyoto Encyclopedia of Genes and Genomes pathways, Swiss-Prot database, and Nucleotide Collection (NR/NT) database. In addition, 8,196 simple sequence repeats (SSRs) were detected. In a detailed analysis of the transcriptome, housekeeping genes for quantitative polymerase chain reaction internal control, polymorphic SSRs, and lysine/ornithine metabolism-related genes were identified. These results validated the availability of our database. Our study could provide useful data for further research on piperine synthesis in black pepper. PMID:26121657

  1. De Novo Assembly and Characterization of Fruit Transcriptome in Black Pepper (Piper nigrum).

    PubMed

    Hu, Lisong; Hao, Chaoyun; Fan, Rui; Wu, Baoduo; Tan, Lehe; Wu, Huasong

    2015-01-01

    Black pepper is one of the most popular and oldest spices in the world and valued for its pungent constituent alkaloids. Pinerine is the main bioactive compound in pepper alkaloids, which perform unique physiological functions. However, the mechanisms of piperine synthesis are poorly understood. This study is the first to describe the fruit transcriptome of black pepper by sequencing on Illumina HiSeq 2000 platform. A total of 56,281,710 raw reads were obtained and assembled. From these raw reads, 44,061 unigenes with an average length of 1,345 nt were generated. During functional annotation, 40,537 unigenes were annotated in Gene Ontology categories, Kyoto Encyclopedia of Genes and Genomes pathways, Swiss-Prot database, and Nucleotide Collection (NR/NT) database. In addition, 8,196 simple sequence repeats (SSRs) were detected. In a detailed analysis of the transcriptome, housekeeping genes for quantitative polymerase chain reaction internal control, polymorphic SSRs, and lysine/ornithine metabolism-related genes were identified. These results validated the availability of our database. Our study could provide useful data for further research on piperine synthesis in black pepper.

  2. The developmental transcriptome atlas of the spoon worm Urechis unicinctus (Echiurida: Annelida).

    PubMed

    Park, Chungoo; Han, Yong-Hee; Lee, Sung-Gwon; Ry, Kyoung-Bin; Oh, Jooseong; Kern, Elizabeth M A; Park, Joong-Ki; Cho, Sung-Jin

    2018-03-01

    Echiurida is one of the most intriguing major subgroups of annelida because, unlike most other annelids, echiurids lack metameric body segmentation as adults. For this reason, transcriptome analyses from various developmental stages of echiurid species can be of substantial value for understanding precise expression levels and the complex regulatory networks during early and larval development. A total of 914 million raw RNA-Seq reads were produced from 14 developmental stages of Urechis unicinctus and were de novo assembled into contigs spanning 63,928,225 bp with an N50 length of 2700 bp. The resulting comprehensive transcriptome database of the early developmental stages of U. unicinctus consists of 20,305 representative functional protein-coding transcripts. Approximately 66% of unigenes were assigned to superphylum-level taxa, including Lophotrochozoa (40%). The completeness of the transcriptome assembly was assessed using benchmarking universal single-copy orthologs; 75.7% of the single-copy orthologs were presented in our transcriptome database. We observed 3 distinct patterns of global transcriptome profiles from 14 developmental stages and identified 12,705 genes that showed dynamic regulation patterns during the differentiation and maturation of U. unicinctus cells. We present the first large-scale developmental transcriptome dataset of U. unicinctus and provide a general overview of the dynamics of global gene expression changes during its early developmental stages. The analysis of time-course gene expression data is a first step toward understanding the complex developmental gene regulatory networks in U. unicinctus and will furnish a valuable resource for analyzing the functions of gene repertoires in various developmental phases.

  3. Comprehensive transcriptome profiling reveals long noncoding RNA expression and alternative splicing regulation during fruit development and ripening in kiwifruit (Actinidia chinensis)

    USDA-ARS?s Scientific Manuscript database

    Genomic and transcriptomic data on kiwifruit (Actinidia chinensis) in public databases are very limited despite its nutritional and economic value. Previously, we have constructed and sequenced nine fruit RNA-Seq libraries of A. chinensis cv. 'Hongyang' at immature, mature, and postharvest ripening...

  4. Effects of insufficient sleep on circadian rhythmicity and expression amplitude of the human blood transcriptome.

    PubMed

    Möller-Levet, Carla S; Archer, Simon N; Bucca, Giselda; Laing, Emma E; Slak, Ana; Kabiljo, Renata; Lo, June C Y; Santhi, Nayantara; von Schantz, Malcolm; Smith, Colin P; Dijk, Derk-Jan

    2013-03-19

    Insufficient sleep and circadian rhythm disruption are associated with negative health outcomes, including obesity, cardiovascular disease, and cognitive impairment, but the mechanisms involved remain largely unexplored. Twenty-six participants were exposed to 1 wk of insufficient sleep (sleep-restriction condition 5.70 h, SEM = 0.03 sleep per 24 h) and 1 wk of sufficient sleep (control condition 8.50 h sleep, SEM = 0.11). Immediately following each condition, 10 whole-blood RNA samples were collected from each participant, while controlling for the effects of light, activity, and food, during a period of total sleep deprivation. Transcriptome analysis revealed that 711 genes were up- or down-regulated by insufficient sleep. Insufficient sleep also reduced the number of genes with a circadian expression profile from 1,855 to 1,481, reduced the circadian amplitude of these genes, and led to an increase in the number of genes that responded to subsequent total sleep deprivation from 122 to 856. Genes affected by insufficient sleep were associated with circadian rhythms (PER1, PER2, PER3, CRY2, CLOCK, NR1D1, NR1D2, RORA, DEC1, CSNK1E), sleep homeostasis (IL6, STAT3, KCNV2, CAMK2D), oxidative stress (PRDX2, PRDX5), and metabolism (SLC2A3, SLC2A5, GHRL, ABCA1). Biological processes affected included chromatin modification, gene-expression regulation, macromolecular metabolism, and inflammatory, immune and stress responses. Thus, insufficient sleep affects the human blood transcriptome, disrupts its circadian regulation, and intensifies the effects of acute total sleep deprivation. The identified biological processes may be involved with the negative effects of sleep loss on health, and highlight the interrelatedness of sleep homeostasis, circadian rhythmicity, and metabolism.

  5. Transcriptome alterations in zebrafish embryos after exposure to environmental estrogens and anti-androgens can reveal endocrine disruption.

    PubMed

    Schiller, Viktoria; Wichmann, Arne; Kriehuber, Ralf; Schäfers, Christoph; Fischer, Rainer; Fenske, Martina

    2013-12-01

    Exposure to environmental chemicals known as endocrine disruptors (EDs) is in many cases associated with an unpredictable hazard for wildlife and human health. The identification of endocrine disruptive properties of chemicals certain to enter the aquatic environment relies on toxicity tests with fish, assessing adverse effects on reproduction and sexual development. The demand for quick, reliable ED assays favored the use of fish embryos as alternative test organisms. We investigated the application of a transcriptomics-based assay for estrogenic and anti-androgenic chemicals with zebrafish embryos. Two reference compounds, 17α-ethinylestradiol and flutamide, were tested to evaluate the effects on development and the transcriptome after 48h-exposures. Comparison of the transcriptome response with other estrogenic and anti-androgenic compounds (genistein, bisphenol A, methylparaben, linuron, prochloraz, propanil) showed commonalities and differences in regulated pathways, enabling us to classify the estrogenic and anti-androgenic potencies. This demonstrates that different mechanism of ED can be assessed already in fish embryos. Copyright © 2013 Elsevier Inc. All rights reserved.

  6. A Novel Computational Strategy to Identify A-to-I RNA Editing Sites by RNA-Seq Data: De Novo Detection in Human Spinal Cord Tissue

    PubMed Central

    Picardi, Ernesto; Gallo, Angela; Galeano, Federica; Tomaselli, Sara; Pesole, Graziano

    2012-01-01

    RNA editing is a post-transcriptional process occurring in a wide range of organisms. In human brain, the A-to-I RNA editing, in which individual adenosine (A) bases in pre-mRNA are modified to yield inosine (I), is the most frequent event. Modulating gene expression, RNA editing is essential for cellular homeostasis. Indeed, its deregulation has been linked to several neurological and neurodegenerative diseases. To date, many RNA editing sites have been identified by next generation sequencing technologies employing massive transcriptome sequencing together with whole genome or exome sequencing. While genome and transcriptome reads are not always available for single individuals, RNA-Seq data are widespread through public databases and represent a relevant source of yet unexplored RNA editing sites. In this context, we propose a simple computational strategy to identify genomic positions enriched in novel hypothetical RNA editing events by means of a new two-steps mapping procedure requiring only RNA-Seq data and no a priori knowledge of RNA editing characteristics and genomic reads. We assessed the suitability of our procedure by confirming A-to-I candidates using conventional Sanger sequencing and performing RNA-Seq as well as whole exome sequencing of human spinal cord tissue from a single individual. PMID:22957051

  7. RAID: a comprehensive resource for human RNA-associated (RNA-RNA/RNA-protein) interaction.

    PubMed

    Zhang, Xiaomeng; Wu, Deng; Chen, Liqun; Li, Xiang; Yang, Jinxurong; Fan, Dandan; Dong, Tingting; Liu, Mingyue; Tan, Puwen; Xu, Jintian; Yi, Ying; Wang, Yuting; Zou, Hua; Hu, Yongfei; Fan, Kaili; Kang, Juanjuan; Huang, Yan; Miao, Zhengqiang; Bi, Miaoman; Jin, Nana; Li, Kongning; Li, Xia; Xu, Jianzhen; Wang, Dong

    2014-07-01

    Transcriptomic analyses have revealed an unexpected complexity in the eukaryote transcriptome, which includes not only protein-coding transcripts but also an expanding catalog of noncoding RNAs (ncRNAs). Diverse coding and noncoding RNAs (ncRNAs) perform functions through interaction with each other in various cellular processes. In this project, we have developed RAID (http://www.rna-society.org/raid), an RNA-associated (RNA-RNA/RNA-protein) interaction database. RAID intends to provide the scientific community with all-in-one resources for efficient browsing and extraction of the RNA-associated interactions in human. This version of RAID contains more than 6100 RNA-associated interactions obtained by manually reviewing more than 2100 published papers, including 4493 RNA-RNA interactions and 1619 RNA-protein interactions. Each entry contains detailed information on an RNA-associated interaction, including RAID ID, RNA/protein symbol, RNA/protein categories, validated method, expressing tissue, literature references (Pubmed IDs), and detailed functional description. Users can query, browse, analyze, and manipulate RNA-associated (RNA-RNA/RNA-protein) interaction. RAID provides a comprehensive resource of human RNA-associated (RNA-RNA/RNA-protein) interaction network. Furthermore, this resource will help in uncovering the generic organizing principles of cellular function network. © 2014 Zhang et al.; Published by Cold Spring Harbor Laboratory Press for the RNA Society.

  8. RISE: a database of RNA interactome from sequencing experiments

    PubMed Central

    Gong, Jing; Shao, Di; Xu, Kui

    2018-01-01

    Abstract We present RISE (http://rise.zhanglab.net), a database of RNA Interactome from Sequencing Experiments. RNA-RNA interactions (RRIs) are essential for RNA regulation and function. RISE provides a comprehensive collection of RRIs that mainly come from recent transcriptome-wide sequencing-based experiments like PARIS, SPLASH, LIGR-seq, and MARIO, as well as targeted studies like RIA-seq, RAP-RNA and CLASH. It also includes interactions aggregated from other primary databases and publications. The RISE database currently contains 328,811 RNA-RNA interactions mainly in human, mouse and yeast. While most existing RNA databases mainly contain interactions of miRNA targeting, notably, more than half of the RRIs in RISE are among mRNA and long non-coding RNAs. We compared different RRI datasets in RISE and found limited overlaps in interactions resolved by different techniques and in different cell lines. It may suggest technology preference and also dynamic natures of RRIs. We also analyzed the basic features of the human and mouse RRI networks and found that they tend to be scale-free, small-world, hierarchical and modular. The analysis may nominate important RNAs or RRIs for further investigation. Finally, RISE provides a Circos plot and several table views for integrative visualization, with extensive molecular and functional annotations to facilitate exploration of biological functions for any RRI of interest. PMID:29040625

  9. HeLa Nucleic Acid Contamination in The Cancer Genome Atlas Leads to the Misidentification of Human Papillomavirus 18

    PubMed Central

    Cantalupo, Paul G.; Katz, Joshua P.

    2015-01-01

    ABSTRACT We searched The Cancer Genome Atlas (TCGA) database for viruses by comparing non-human reads present in transcriptome sequencing (RNA-Seq) and whole-exome sequencing (WXS) data to viral sequence databases. Human papillomavirus 18 (HPV18) is an etiologic agent of cervical cancer, and as expected, we found robust expression of HPV18 genes in cervical cancer samples. In agreement with previous studies, we also found HPV18 transcripts in non-cervical cancer samples, including those from the colon, rectum, and normal kidney. However, in each of these cases, HPV18 gene expression was low, and single-nucleotide variants and positions of genomic alignments matched the integrated portion of HPV18 present in HeLa cells. Chimeric reads that match a known virus-cell junction of HPV18 integrated in HeLa cells were also present in some samples. We hypothesize that HPV18 sequences in these non-cervical samples are due to nucleic acid contamination from HeLa cells. This finding highlights the problems that contamination presents in computational virus detection pipelines. IMPORTANCE Viruses associated with cancer can be detected by searching tumor sequence databases. Several studies involving searches of the TCGA database have reported the presence of HPV18, a known cause of cervical cancer, in a small number of additional cancers, including those of the rectum, kidney, and colon. We have determined that the sequences related to HPV18 in non-cervical samples are due to nucleic acid contamination from HeLa cells. To our knowledge, this is the first report of the misidentification of viruses in next-generation sequencing data of tumors due to contamination with a cancer cell line. These results raise awareness of the difficulty of accurately identifying viruses in human sequence databases. PMID:25631090

  10. An integrated genomic and transcriptomic survey of mucormycosis-causing fungi

    PubMed Central

    Chibucos, Marcus C.; Soliman, Sameh; Gebremariam, Teclegiorgis; Lee, Hongkyu; Daugherty, Sean; Orvis, Joshua; Shetty, Amol C.; Crabtree, Jonathan; Hazen, Tracy H.; Etienne, Kizee A.; Kumari, Priti; O'Connor, Timothy D.; Rasko, David A.; Filler, Scott G.; Fraser, Claire M.; Lockhart, Shawn R.; Skory, Christopher D.; Ibrahim, Ashraf S.; Bruno, Vincent M.

    2016-01-01

    Mucormycosis is a life-threatening infection caused by Mucorales fungi. Here we sequence 30 fungal genomes, and perform transcriptomics with three representative Rhizopus and Mucor strains and with human airway epithelial cells during fungal invasion, to reveal key host and fungal determinants contributing to pathogenesis. Analysis of the host transcriptional response to Mucorales reveals platelet-derived growth factor receptor B (PDGFRB) signaling as part of a core response to divergent pathogenic fungi; inhibition of PDGFRB reduces Mucorales-induced damage to host cells. The unique presence of CotH invasins in all invasive Mucorales, and the correlation between CotH gene copy number and clinical prevalence, are consistent with an important role for these proteins in mucormycosis pathogenesis. Our work provides insight into the evolution of this medically and economically important group of fungi, and identifies several molecular pathways that might be exploited as potential therapeutic targets. PMID:27447865

  11. Transcriptome analysis and related databases of Lactococcus lactis.

    PubMed

    Kuipers, Oscar P; de Jong, Anne; Baerends, Richard J S; van Hijum, Sacha A F T; Zomer, Aldert L; Karsens, Harma A; den Hengst, Chris D; Kramer, Naomi E; Buist, Girbe; Kok, Jan

    2002-08-01

    Several complete genome sequences of Lactococcus lactis and their annotations will become available in the near future, next to the already published genome sequence of L. lactis ssp. lactis IL 1403. This will allow intraspecies comparative genomics studies as well as functional genomics studies aimed at a better understanding of physiological processes and regulatory networks operating in lactococci. This paper describes the initial set-up of a DNA-microarray facility in our group, to enable transcriptome analysis of various Gram-positive bacteria, including a ssp. lactis and a ssp. cremoris strain of Lactococcus lactis. Moreover a global description will be given of the hardware and software requirements for such a set-up, highlighting the crucial integration of relevant bioinformatics tools and methods. This includes the development of MolGenIS, an information system for transcriptome data storage and retrieval, and LactococCye, a metabolic pathway/genome database of Lactococcus lactis.

  12. Transcriptome sequence analysis of an ornamental plant, Ananas comosus var. bracteatus, revealed the potential unigenes involved in terpenoid and phenylpropanoid biosynthesis.

    PubMed

    Ma, Jun; Kanakala, S; He, Yehua; Zhang, Junli; Zhong, Xiaolan

    2015-01-01

    Ananas comosus var. bracteatus (Red Pineapple) is an important ornamental plant for its colorful leaves and decorative red fruits. Because of its complex genome, it is difficult to understand the molecular mechanisms involved in the growth and development. Thus high-throughput transcriptome sequencing of Ananas comosus var. bracteatus is necessary to generate large quantities of transcript sequences for the purpose of gene discovery and functional genomic studies. The Ananas comosus var. bracteatus transcriptome was sequenced by the Illumina paired-end sequencing technology. We obtained a total of 23.5 million high quality sequencing reads, 1,555,808 contigs and 41,052 unigenes. In total 41,052 unigenes of Ananas comosus var. bracteatus, 23,275 unigenes were annotated in the NCBI non-redundant protein database and 23,134 unigenes were annotated in the Swiss-Port database. Out of these, 17,748 and 8,505 unigenes were assigned to gene ontology categories and clusters of orthologous groups, respectively. Functional annotation against Kyoto Encyclopedia of Genes and Genomes Pathway database identified 5,825 unigenes which were mapped to 117 pathways. The assembly predicted many unigenes that were previously unknown. The annotated unigenes were compared against pineapple, rice, maize, Arabidopsis, and sorghum. Unigenes that did not match any of those five sequence datasets are considered to be Ananas comosus var. bracteatus unique. We predicted unigenes encoding enzymes involved in terpenoid and phenylpropanoid biosynthesis. The sequence data provide the most comprehensive transcriptomic resource currently available for Ananas comosus var. bracteatus. To our knowledge; this is the first report on the de novo transcriptome sequencing of the Ananas comosus var. bracteatus. Unigenes obtained in this study, may help improve future gene expression, genetic and genomics studies in Ananas comosus var. bracteatus.

  13. Transcriptome Sequence Analysis of an Ornamental Plant, Ananas comosus var. bracteatus, Revealed the Potential Unigenes Involved in Terpenoid and Phenylpropanoid Biosynthesis

    PubMed Central

    Ma, Jun; Kanakala, S.; He, Yehua; Zhang, Junli; Zhong, Xiaolan

    2015-01-01

    Background Ananas comosus var. bracteatus (Red Pineapple) is an important ornamental plant for its colorful leaves and decorative red fruits. Because of its complex genome, it is difficult to understand the molecular mechanisms involved in the growth and development. Thus high-throughput transcriptome sequencing of Ananas comosus var. bracteatus is necessary to generate large quantities of transcript sequences for the purpose of gene discovery and functional genomic studies. Results The Ananas comosus var. bracteatus transcriptome was sequenced by the Illumina paired-end sequencing technology. We obtained a total of 23.5 million high quality sequencing reads, 1,555,808 contigs and 41,052 unigenes. In total 41,052 unigenes of Ananas comosus var. bracteatus, 23,275 unigenes were annotated in the NCBI non-redundant protein database and 23,134 unigenes were annotated in the Swiss-Port database. Out of these, 17,748 and 8,505 unigenes were assigned to gene ontology categories and clusters of orthologous groups, respectively. Functional annotation against Kyoto Encyclopedia of Genes and Genomes Pathway database identified 5,825 unigenes which were mapped to 117 pathways. The assembly predicted many unigenes that were previously unknown. The annotated unigenes were compared against pineapple, rice, maize, Arabidopsis, and sorghum. Unigenes that did not match any of those five sequence datasets are considered to be Ananas comosus var. bracteatus unique. We predicted unigenes encoding enzymes involved in terpenoid and phenylpropanoid biosynthesis. Conclusion The sequence data provide the most comprehensive transcriptomic resource currently available for Ananas comosus var. bracteatus. To our knowledge; this is the first report on the de novo transcriptome sequencing of the Ananas comosus var. bracteatus. Unigenes obtained in this study, may help improve future gene expression, genetic and genomics studies in Ananas comosus var. bracteatus. PMID:25769053

  14. EUCANEXT: an integrated database for the exploration of genomic and transcriptomic data from Eucalyptus species

    PubMed Central

    Nascimento, Leandro Costa; Salazar, Marcela Mendes; Lepikson-Neto, Jorge; Camargo, Eduardo Leal Oliveira; Parreiras, Lucas Salera; Carazzolle, Marcelo Falsarella

    2017-01-01

    Abstract Tree species of the genus Eucalyptus are the most valuable and widely planted hardwoods in the world. Given the economic importance of Eucalyptus trees, much effort has been made towards the generation of specimens with superior forestry properties that can deliver high-quality feedstocks, customized to the industrýs needs for both cellulosic (paper) and lignocellulosic biomass production. In line with these efforts, large sets of molecular data have been generated by several scientific groups, providing invaluable information that can be applied in the development of improved specimens. In order to fully explore the potential of available datasets, the development of a public database that provides integrated access to genomic and transcriptomic data from Eucalyptus is needed. EUCANEXT is a database that analyses and integrates publicly available Eucalyptus molecular data, such as the E. grandis genome assembly and predicted genes, ESTs from several species and digital gene expression from 26 RNA-Seq libraries. The database has been implemented in a Fedora Linux machine running MySQL and Apache, while Perl CGI was used for the web interfaces. EUCANEXT provides a user-friendly web interface for easy access and analysis of publicly available molecular data from Eucalyptus species. This integrated database allows for complex searches by gene name, keyword or sequence similarity and is publicly accessible at http://www.lge.ibi.unicamp.br/eucalyptusdb. Through EUCANEXT, users can perform complex analysis to identify genes related traits of interest using RNA-Seq libraries and tools for differential expression analysis. Moreover, all the bioinformatics pipeline here described, including the database schema and PERL scripts, are readily available and can be applied to any genomic and transcriptomic project, regardless of the organism. Database URL: http://www.lge.ibi.unicamp.br/eucalyptusdb PMID:29220468

  15. Transcriptome Analysis in Sheepgrass (Leymus chinensis): A Dominant Perennial Grass of the Eurasian Steppe

    PubMed Central

    Chen, Shuangyan; Huang, Xin; Yan, Xueqing; Liang, Ye; Wang, Yuezhu; Li, Xiaofeng; Peng, Xianjun; Ma, Xingyong; Zhang, Lexin; Cai, Yueyue; Ma, Tian; Cheng, Liqin; Qi, Dongmei; Zheng, Huajun; Yang, Xiaohan; Li, Xiaoxia; Liu, Gongshe

    2013-01-01

    Background Sheepgrass [Leymus chinensis (Trin.) Tzvel.] is an important perennial forage grass across the Eurasian Steppe and is known for its adaptability to various environmental conditions. However, insufficient data resources in public databases for sheepgrass limited our understanding of the mechanism of environmental adaptations, gene discovery and molecular marker development. Results The transcriptome of sheepgrass was sequenced using Roche 454 pyrosequencing technology. We assembled 952,328 high-quality reads into 87,214 unigenes, including 32,416 contigs and 54,798 singletons. There were 15,450 contigs over 500 bp in length. BLAST searches of our database against Swiss-Prot and NCBI non-redundant protein sequences (nr) databases resulted in the annotation of 54,584 (62.6%) of the unigenes. Gene Ontology (GO) analysis assigned 89,129 GO term annotations for 17,463 unigenes. We identified 11,675 core Poaceae-specific and 12,811 putative sheepgrass-specific unigenes by BLAST searches against all plant genome and transcriptome databases. A total of 2,979 specific freezing-responsive unigenes were found from this RNAseq dataset. We identified 3,818 EST-SSRs in 3,597 unigenes, and some SSRs contained unigenes that were also candidates for freezing-response genes. Characterizations of nucleotide repeats and dominant motifs of SSRs in sheepgrass were also performed. Similarity and phylogenetic analysis indicated that sheepgrass is closely related to barley and wheat. Conclusions This research has greatly enriched sheepgrass transcriptome resources. The identified stress-related genes will help us to decipher the genetic basis of the environmental and ecological adaptations of this species and will be used to improve wheat and barley crops through hybridization or genetic transformation. The EST-SSRs reported here will be a valuable resource for future gene-phenotype studies and for the molecular breeding of sheepgrass and other Poaceae species. PMID:23861841

  16. Transcriptome analysis in sheepgrass (Leymus chinensis): a dominant perennial grass of the Eurasian Steppe.

    PubMed

    Chen, Shuangyan; Huang, Xin; Yan, Xueqing; Liang, Ye; Wang, Yuezhu; Li, Xiaofeng; Peng, Xianjun; Ma, Xingyong; Zhang, Lexin; Cai, Yueyue; Ma, Tian; Cheng, Liqin; Qi, Dongmei; Zheng, Huajun; Yang, Xiaohan; Li, Xiaoxia; Liu, Gongshe

    2013-01-01

    Sheepgrass [Leymus chinensis (Trin.) Tzvel.] is an important perennial forage grass across the Eurasian Steppe and is known for its adaptability to various environmental conditions. However, insufficient data resources in public databases for sheepgrass limited our understanding of the mechanism of environmental adaptations, gene discovery and molecular marker development. The transcriptome of sheepgrass was sequenced using Roche 454 pyrosequencing technology. We assembled 952,328 high-quality reads into 87,214 unigenes, including 32,416 contigs and 54,798 singletons. There were 15,450 contigs over 500 bp in length. BLAST searches of our database against Swiss-Prot and NCBI non-redundant protein sequences (nr) databases resulted in the annotation of 54,584 (62.6%) of the unigenes. Gene Ontology (GO) analysis assigned 89,129 GO term annotations for 17,463 unigenes. We identified 11,675 core Poaceae-specific and 12,811 putative sheepgrass-specific unigenes by BLAST searches against all plant genome and transcriptome databases. A total of 2,979 specific freezing-responsive unigenes were found from this RNAseq dataset. We identified 3,818 EST-SSRs in 3,597 unigenes, and some SSRs contained unigenes that were also candidates for freezing-response genes. Characterizations of nucleotide repeats and dominant motifs of SSRs in sheepgrass were also performed. Similarity and phylogenetic analysis indicated that sheepgrass is closely related to barley and wheat. This research has greatly enriched sheepgrass transcriptome resources. The identified stress-related genes will help us to decipher the genetic basis of the environmental and ecological adaptations of this species and will be used to improve wheat and barley crops through hybridization or genetic transformation. The EST-SSRs reported here will be a valuable resource for future gene-phenotype studies and for the molecular breeding of sheepgrass and other Poaceae species.

  17. AmpuBase: a transcriptome database for eight species of apple snails (Gastropoda: Ampullariidae).

    PubMed

    Ip, Jack C H; Mu, Huawei; Chen, Qian; Sun, Jin; Ituarte, Santiago; Heras, Horacio; Van Bocxlaer, Bert; Ganmanee, Monthon; Huang, Xin; Qiu, Jian-Wen

    2018-03-05

    Gastropoda, with approximately 80,000 living species, is the largest class of Mollusca. Among gastropods, apple snails (family Ampullariidae) are globally distributed in tropical and subtropical freshwater ecosystems and many species are ecologically and economically important. Ampullariids exhibit various morphological and physiological adaptations to their respective habitats, which make them ideal candidates for studying adaptation, population divergence, speciation, and larger-scale patterns of diversity, including the biogeography of native and invasive populations. The limited availability of genomic data, however, hinders in-depth ecological and evolutionary studies of these non-model organisms. Using Illumina Hiseq platforms, we sequenced 1220 million reads for seven species of apple snails. Together with the previously published RNA-Seq data of two apple snails, we conducted de novo transcriptome assembly of eight species that belong to five genera of Ampullariidae, two of which represent Old World lineages and the other three New World lineages. There were 20,730 to 35,828 unigenes with predicted open reading frames for the eight species, with N50 (shortest sequence length at 50% of the unigenes) ranging from 1320 to 1803 bp. 69.7% to 80.2% of these unigenes were functionally annotated by searching against NCBI's non-redundant, Gene Ontology database and the Kyoto Encyclopaedia of Genes and Genomes. With these data we developed AmpuBase, a relational database that features online BLAST functionality for DNA/protein sequences, keyword searching for unigenes/functional terms, and download functions for sequences and whole transcriptomes. In summary, we have generated comprehensive transcriptome data for multiple ampullariid genera and species, and created a publicly accessible database with a user-friendly interface to facilitate future basic and applied studies on ampullariids, and comparative molecular studies with other invertebrates.

  18. Deep mRNA Sequencing of the Tritonia diomedea Brain Transcriptome Provides Access to Gene Homologues for Neuronal Excitability, Synaptic Transmission and Peptidergic Signalling

    PubMed Central

    Senatore, Adriano; Edirisinghe, Neranjan; Katz, Paul S.

    2015-01-01

    Background The sea slug Tritonia diomedea (Mollusca, Gastropoda, Nudibranchia), has a simple and highly accessible nervous system, making it useful for studying neuronal and synaptic mechanisms underlying behavior. Although many important contributions have been made using Tritonia, until now, a lack of genetic information has impeded exploration at the molecular level. Results We performed Illumina sequencing of central nervous system mRNAs from Tritonia, generating 133.1 million 100 base pair, paired-end reads. De novo reconstruction of the RNA-Seq data yielded a total of 185,546 contigs, which partitioned into 123,154 non-redundant gene clusters (unigenes). BLAST comparison with RefSeq and Swiss-Prot protein databases, as well as mRNA data from other invertebrates (gastropod molluscs: Aplysia californica, Lymnaea stagnalis and Biomphalaria glabrata; cnidarian: Nematostella vectensis) revealed that up to 76,292 unigenes in the Tritonia transcriptome have putative homologues in other databases, 18,246 of which are below a more stringent E-value cut-off of 1x10-6. In silico prediction of secreted proteins from the Tritonia transcriptome shotgun assembly (TSA) produced a database of 579 unique sequences of secreted proteins, which also exhibited markedly higher expression levels compared to other genes in the TSA. Conclusions Our efforts greatly expand the availability of gene sequences available for Tritonia diomedea. We were able to extract full length protein sequences for most queried genes, including those involved in electrical excitability, synaptic vesicle release and neurotransmission, thus confirming that the transcriptome will serve as a useful tool for probing the molecular correlates of behavior in this species. We also generated a neurosecretome database that will serve as a useful tool for probing peptidergic signalling systems in the Tritonia brain. PMID:25719197

  19. Transcriptome Sequencing in a Tibetan Barley Landrace with High Resistance to Powdery Mildew

    PubMed Central

    Zeng, Xing-Quan; Luo, Xiao-Mei; Wang, Yu-Lin; Xu, Qi-Jun; Bai, Li-Jun; Yuan, Hong-Jun; Tashi, Nyima

    2014-01-01

    Hulless barley is an important cereal crop worldwide, especially in Tibet of China. However, this crop is usually susceptible to powdery mildew caused by Blumeria graminis f. sp. hordei. In this study, we aimed to understand the functions and pathways of genes involved in the disease resistance by transcriptome sequencing of a Tibetan barley landrace with high resistance to powdery mildew. A total of 831 significant differentially expressed genes were found in the infected seedlings, covering 19 functions. Either “cell,” “cell part,” and “extracellular region” in the cellular component category or “binding” and “catalytic” in the category of molecular function as well as “metabolic process” and “cellular process” in the biological process category together demonstrated that these functions may be involved in the resistance to powdery mildew of the hulless barley. In addition, 330 KEGG pathways were found using BLASTx with an E-value cut-off of <10−5. Among them, three pathways, namely, “photosynthesis,” “plant-pathogen interaction,” and “photosynthesis-antenna proteins” had significant matches in the database. Significant expressions of the three pathways were detected at 24 h, 48 h, and 96 h after infection, respectively. These results indicated a complex process of barley response to powdery mildew infection. PMID:25587568

  20. In Silico Functional Networks Identified in Fish Nucleated Red Blood Cells by Means of Transcriptomic and Proteomic Profiling.

    PubMed

    Puente-Marin, Sara; Nombela, Iván; Ciordia, Sergio; Mena, María Carmen; Chico, Verónica; Coll, Julio; Ortega-Villaizan, María Del Mar

    2018-04-09

    Nucleated red blood cells (RBCs) of fish have, in the last decade, been implicated in several immune-related functions, such as antiviral response, phagocytosis or cytokine-mediated signaling. RNA-sequencing (RNA-seq) and label-free shotgun proteomic analyses were carried out for in silico functional pathway profiling of rainbow trout RBCs. For RNA-seq, a de novo assembly was conducted, in order to create a transcriptome database for RBCs. For proteome profiling, we developed a proteomic method that combined: (a) fractionation into cytosolic and membrane fractions, (b) hemoglobin removal of the cytosolic fraction, (c) protein digestion, and (d) a novel step with pH reversed-phase peptide fractionation and final Liquid Chromatography Electrospray Ionization Tandem Mass Spectrometric (LC ESI-MS/MS) analysis of each fraction. Combined transcriptome- and proteome- sequencing data identified, in silico, novel and striking immune functional networks for rainbow trout nucleated RBCs, which are mainly linked to innate and adaptive immunity. Functional pathways related to regulation of hematopoietic cell differentiation, antigen presentation via major histocompatibility complex class II (MHCII), leukocyte differentiation and regulation of leukocyte activation were identified. These preliminary findings further implicate nucleated RBCs in immune function, such as antigen presentation and leukocyte activation.

  1. In Silico Functional Networks Identified in Fish Nucleated Red Blood Cells by Means of Transcriptomic and Proteomic Profiling

    PubMed Central

    Puente-Marin, Sara; Ciordia, Sergio; Mena, María Carmen; Chico, Verónica; Coll, Julio

    2018-01-01

    Nucleated red blood cells (RBCs) of fish have, in the last decade, been implicated in several immune-related functions, such as antiviral response, phagocytosis or cytokine-mediated signaling. RNA-sequencing (RNA-seq) and label-free shotgun proteomic analyses were carried out for in silico functional pathway profiling of rainbow trout RBCs. For RNA-seq, a de novo assembly was conducted, in order to create a transcriptome database for RBCs. For proteome profiling, we developed a proteomic method that combined: (a) fractionation into cytosolic and membrane fractions, (b) hemoglobin removal of the cytosolic fraction, (c) protein digestion, and (d) a novel step with pH reversed-phase peptide fractionation and final Liquid Chromatography Electrospray Ionization Tandem Mass Spectrometric (LC ESI-MS/MS) analysis of each fraction. Combined transcriptome- and proteome- sequencing data identified, in silico, novel and striking immune functional networks for rainbow trout nucleated RBCs, which are mainly linked to innate and adaptive immunity. Functional pathways related to regulation of hematopoietic cell differentiation, antigen presentation via major histocompatibility complex class II (MHCII), leukocyte differentiation and regulation of leukocyte activation were identified. These preliminary findings further implicate nucleated RBCs in immune function, such as antigen presentation and leukocyte activation. PMID:29642539

  2. Relationship between maladaptive cognitions about sleep and recovery in patients with borderline personality disorder

    PubMed Central

    Plante, David T.; Frankenburg, Frances R.; Fitzmaurice, Garrett M.; Zanarini, Mary C.

    2013-01-01

    Borderline personality disorder (BPD) has been associated with maladaptive cognitive processes including dysfunctional attitudes and a negative attribution style. Comorbid insomnia affects the course of multiple psychiatric disorders, and has been associated with absence of recovery from BPD. Because dysfunctional beliefs and attitudes are common among patients with insomnia, the purpose of this study was to evaluate the association between maladaptive sleep-related cognitions and recovery status (symptomatic remission plus good concurrent psychosocial functioning) in patients with BPD. 223 BPD patients participating in the McLean Study of Adult Development (MSAD) were administered the Dysfunctional Beliefs and Attitudes about Sleep questionnaire (DBAS-16) as part of the 16-year follow-up wave. Maladaptive sleep cognitions were compared between recovered (n=105) and non-recovered (n=118) BPD participants, in analyses that adjusted for age, sex, depression, anxiety, and primary sleep disorders. Results demonstrated non-recovered BPD patients had significantly more severe maladaptive sleep-related cognitions as measured by the overall DBAS-16 score. These results demonstrate an association between dysfunctional beliefs and attitudes about sleep and recovery status among BPD patients. Further research is warranted to evaluate treatments targeted towards maladaptive sleep-related cognitions, and their subsequent effects on the course of BPD. PMID:23972789

  3. De novo transcriptome analysis of an imminent biofuel crop, Camelina sativa L. using Illumina GAIIX sequencing platform and identification of SSR markers.

    PubMed

    Mudalkar, Shalini; Golla, Ramesh; Ghatty, Sreenivas; Reddy, Attipalli Ramachandra

    2014-01-01

    Camelina sativa L. is an emerging biofuel crop with potential applications in industry, medicine, cosmetics and human nutrition. The crop is unexploited owing to very limited availability of transcriptome and genomic data. In order to analyse the various metabolic pathways, we performed de novo assembly of the transcriptome on Illumina GAIIX platform with paired end sequencing for obtaining short reads. The sequencing output generated a FastQ file size of 2.97 GB with 10.83 million reads having a maximum read length of 101 nucleotides. The number of contigs generated was 53,854 with maximum and minimum lengths of 10,086 and 200 nucleotides respectively. These trancripts were annotated using BLAST search against the Aracyc, Swiss-Prot, TrEMBL, gene ontology and clusters of orthologous groups (KOG) databases. The genes involved in lipid metabolism were studied and the transcription factors were identified. Sequence similarity studies of Camelina with the other related organisms indicated the close relatedness of Camelina with Arabidopsis. In addition, bioinformatics analysis revealed the presence of a total of 19,379 simple sequence repeats. This is the first report on Camelina sativa L., where the transcriptome of the entire plant, including seedlings, seed, root, leaves and stem was done. Our data established an excellent resource for gene discovery and provide useful information for functional and comparative genomic studies in this promising biofuel crop.

  4. Detailed transcriptome description of the neglected cestode Taenia multiceps.

    PubMed

    Wu, Xuhang; Fu, Yan; Yang, Deying; Zhang, Runhui; Zheng, Wanpeng; Nie, Huaming; Xie, Yue; Yan, Ning; Hao, Guiying; Gu, Xiaobin; Wang, Shuxian; Peng, Xuerong; Yang, Guangyou

    2012-01-01

    The larval stage of Taenia multiceps, a global cestode, encysts in the central nervous system (CNS) of sheep and other livestock. This frequently leads to their death and huge socioeconomic losses, especially in developing countries. This parasite can also cause zoonotic infections in humans, but has been largely neglected due to a lack of diagnostic techniques and studies. Recent developments in next-generation sequencing provide an opportunity to explore the transcriptome of T. multiceps. We obtained a total of 31,282 unigenes (mean length 920 bp) using Illumina paired-end sequencing technology and a new Trinity de novo assembler without a referenced genome. Individual transcription molecules were determined by sequence-based annotations and/or domain-based annotations against public databases (Nr, UniprotKB/Swiss-Prot, COG, KEGG, UniProtKB/TrEMBL, InterPro and Pfam). We identified 26,110 (83.47%) unigenes and inferred 20,896 (66.8%) coding sequences (CDS). Further comparative transcripts analysis with other cestodes (Taenia pisiformis, Taenia solium, Echincoccus granulosus and Echincoccus multilocularis) and intestinal parasites (Trichinella spiralis, Ancylostoma caninum and Ascaris suum) showed that 5,100 common genes were shared among three Taenia tapeworms, 261 conserved genes were detected among five Taeniidae cestodes, and 109 common genes were found in four zoonotic intestinal parasites. Some of the common genes were genes required for parasite survival, involved in parasite-host interactions. In addition, we amplified two full-length CDS of unigenes from the common genes using RT-PCR. This study provides an extensive transcriptome of the adult stage of T. multiceps, and demonstrates that comparative transcriptomic investigations deserve to be further studied. This transcriptome dataset forms a substantial public information platform to achieve a fundamental understanding of the biology of T. multiceps, and helps in the identification of drug targets and parasite-host interaction studies.

  5. The quantitative proteomes of human-induced pluripotent stem cells and embryonic stem cells

    PubMed Central

    Munoz, Javier; Low, Teck Y; Kok, Yee J; Chin, Angela; Frese, Christian K; Ding, Vanessa; Choo, Andre; Heck, Albert J R

    2011-01-01

    Assessing relevant molecular differences between human-induced pluripotent stem cells (hiPSCs) and human embryonic stem cells (hESCs) is important, given that such differences may impact their potential therapeutic use. Controversy surrounds recent gene expression studies comparing hiPSCs and hESCs. Here, we present an in-depth quantitative mass spectrometry-based analysis of hESCs, two different hiPSCs and their precursor fibroblast cell lines. Our comparisons confirmed the high similarity of hESCs and hiPSCS at the proteome level as 97.8% of the proteins were found unchanged. Nevertheless, a small group of 58 proteins, mainly related to metabolism, antigen processing and cell adhesion, was found significantly differentially expressed between hiPSCs and hESCs. A comparison of the regulated proteins with previously published transcriptomic studies showed a low overlap, highlighting the emerging notion that differences between both pluripotent cell lines rather reflect experimental conditions than a recurrent molecular signature. PMID:22108792

  6. Defining the Genomic Signature of Totipotency and Pluripotency during Early Human Development

    PubMed Central

    Galan, Amparo; Diaz-Gimeno, Patricia; Poo, Maria Eugenia; Valbuena, Diana; Sanchez, Eva; Ruiz, Veronica; Dopazo, Joaquin; Montaner, David; Conesa, Ana; Simon, Carlos

    2013-01-01

    The genetic mechanisms governing human pre-implantation embryo development and the in vitro counterparts, human embryonic stem cells (hESCs), still remain incomplete. Previous global genome studies demonstrated that totipotent blastomeres from day-3 human embryos and pluripotent inner cell masses (ICMs) from blastocysts, display unique and differing transcriptomes. Nevertheless, comparative gene expression analysis has revealed that no significant differences exist between hESCs derived from blastomeres versus those obtained from ICMs, suggesting that pluripotent hESCs involve a new developmental progression. To understand early human stages evolution, we developed an undifferentiation network signature (UNS) and applied it to a differential gene expression profile between single blastomeres from day-3 embryos, ICMs and hESCs. This allowed us to establish a unique signature composed of highly interconnected genes characteristic of totipotency (61 genes), in vivo pluripotency (20 genes), and in vitro pluripotency (107 genes), and which are also proprietary according to functional analysis. This systems biology approach has led to an improved understanding of the molecular and signaling processes governing human pre-implantation embryo development, as well as enabling us to comprehend how hESCs might adapt to in vitro culture conditions. PMID:23614026

  7. UniVIO: A Multiple Omics Database with Hormonome and Transcriptome Data from Rice

    PubMed Central

    Sakurai, Tetsuya; Sakakibara, Hitoshi

    2013-01-01

    Plant hormones play important roles as signaling molecules in the regulation of growth and development by controlling the expression of downstream genes. Since the hormone signaling system represents a complex network involving functional cross-talk through the mutual regulation of signaling and metabolism, a comprehensive and integrative analysis of plant hormone concentrations and gene expression is important for a deeper understanding of hormone actions. We have developed a database named Uniformed Viewer for Integrated Omics (UniVIO: http://univio.psc.riken.jp/), which displays hormone-metabolome (hormonome) and transcriptome data in a single formatted (uniformed) heat map. At the present time, hormonome and transcriptome data obtained from 14 organ parts of rice plants at the reproductive stage and seedling shoots of three gibberellin signaling mutants are included in the database. The hormone concentration and gene expression data can be searched by substance name, probe ID, gene locus ID or gene description. A correlation search function has been implemented to enable users to obtain information of correlated substance accumulation and gene expression. In the correlation search, calculation method, range of correlation coefficient and plant samples can be selected freely. PMID:23314752

  8. Transcriptome of intraperitoneal organs of starry flounder Platichthys stellatus challenged by Edwardsiella ictaluri JCM1680

    NASA Astrophysics Data System (ADS)

    Tong, Yanli; Sun, Xiuqin; Wang, Bo; Wang, Ling; Li, Yan; Tian, Jinhu; Zheng, Fengrong; Zheng, Minggang

    2015-01-01

    Platichthys stellatus is an economically important marine bony fish species that is cultured in China on a large scale. However, very little is known about its immune-related genes. In this study, the transcriptome of the immune organs of P. stellatus that were intraperitoneally challenged with the pathogen E dwardsiella ictaluri JCM1680 is analyzed. Total RNA from four tissues (spleen, kidney, liver, and intestine) was mixed equally and then sequenced on an Illumina HiSeq 2000 platform. Overall, 28 465 813 quality reads were generated and assembled into 43 061 unigenes. Similarity searches against public protein sequence databases were used to annotate 28 291 unigenes (65.7% of the total), 368 of which were associated with immunoregulation, including 188 related to immunity response. Additionally, the transcript levels of immunity response unigenes annotated as related to tumor necrosis factor (TNF), TNF receptor, chemokine, major histocompatibility complex, and interleukin-6 were investigated in the different tissues of normal and infected P. stellatus by real-time quantitative PCR. The results confirmed that the unigenes identified in the transcriptome database were indeed expressed and up-regulated in infected P. stellatus. To our knowledge, this is the first report of the sequencing and analysis of the transcriptome of P. stellatus. These findings provide insights into the transcriptomics and immunogenetics of bony fish.

  9. A high-throughput approach to profile RNA structure.

    PubMed

    Delli Ponti, Riccardo; Marti, Stefanie; Armaos, Alexandros; Tartaglia, Gian Gaetano

    2017-03-17

    Here we introduce the Computational Recognition of Secondary Structure (CROSS) method to calculate the structural profile of an RNA sequence (single- or double-stranded state) at single-nucleotide resolution and without sequence length restrictions. We trained CROSS using data from high-throughput experiments such as Selective 2΄-Hydroxyl Acylation analyzed by Primer Extension (SHAPE; Mouse and HIV transcriptomes) and Parallel Analysis of RNA Structure (PARS; Human and Yeast transcriptomes) as well as high-quality NMR/X-ray structures (PDB database). The algorithm uses primary structure information alone to predict experimental structural profiles with >80% accuracy, showing high performances on large RNAs such as Xist (17 900 nucleotides; Area Under the ROC Curve AUC of 0.75 on dimethyl sulfate (DMS) experiments). We integrated CROSS in thermodynamics-based methods to predict secondary structure and observed an increase in their predictive power by up to 30%. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.

  10. De Novo Assembly and Transcriptome Analysis of the Rubber Tree (Hevea brasiliensis) and SNP Markers Development for Rubber Biosynthesis Pathways

    PubMed Central

    Mantello, Camila Campos; Cardoso-Silva, Claudio Benicio; da Silva, Carla Cristina; de Souza, Livia Moura; Scaloppi Junior, Erivaldo José; de Souza Gonçalves, Paulo; Vicentini, Renato; de Souza, Anete Pereira

    2014-01-01

    Hevea brasiliensis (Willd. Ex Adr. Juss.) Muell.-Arg. is the primary source of natural rubber that is native to the Amazon rainforest. The singular properties of natural rubber make it superior to and competitive with synthetic rubber for use in several applications. Here, we performed RNA sequencing (RNA-seq) of H. brasiliensis bark on the Illumina GAIIx platform, which generated 179,326,804 raw reads on the Illumina GAIIx platform. A total of 50,384 contigs that were over 400 bp in size were obtained and subjected to further analyses. A similarity search against the non-redundant (nr) protein database returned 32,018 (63%) positive BLASTx hits. The transcriptome analysis was annotated using the clusters of orthologous groups (COG), gene ontology (GO), Kyoto Encyclopedia of Genes and Genomes (KEGG), and Pfam databases. A search for putative molecular marker was performed to identify simple sequence repeats (SSRs) and single nucleotide polymorphisms (SNPs). In total, 17,927 SSRs and 404,114 SNPs were detected. Finally, we selected sequences that were identified as belonging to the mevalonate (MVA) and 2-C-methyl-D-erythritol 4-phosphate (MEP) pathways, which are involved in rubber biosynthesis, to validate the SNP markers. A total of 78 SNPs were validated in 36 genotypes of H. brasiliensis. This new dataset represents a powerful information source for rubber tree bark genes and will be an important tool for the development of microsatellites and SNP markers for use in future genetic analyses such as genetic linkage mapping, quantitative trait loci identification, investigations of linkage disequilibrium and marker-assisted selection. PMID:25048025

  11. Human organomics: a fresh approach to understanding human development using single-cell transcriptomics.

    PubMed

    Camp, J Gray; Treutlein, Barbara

    2017-05-01

    Innovative methods designed to recapitulate human organogenesis from pluripotent stem cells provide a means to explore human developmental biology. New technologies to sequence and analyze single-cell transcriptomes can deconstruct these 'organoids' into constituent parts, and reconstruct lineage trajectories during cell differentiation. In this Spotlight article we summarize the different approaches to performing single-cell transcriptomics on organoids, and discuss the opportunities and challenges of applying these techniques to generate organ-level, mechanistic models of human development and disease. Together, these technologies will move past characterization to the prediction of human developmental and disease-related phenomena. © 2017. Published by The Company of Biologists Ltd.

  12. Comparative transcriptome analysis of Gossypium hirsutum L. in response to sap sucking insects: aphid and whitefly

    PubMed Central

    2013-01-01

    Background Cotton (Gossypium hirsutum L.) is a major fiber crop that is grown worldwide; it faces extensive damage from sap-sucking insects, including aphids and whiteflies. Genome-wide transcriptome analysis was performed to understand the molecular details of interaction between Gossypium hirsutum L. and sap-sucking pests, namely Aphis gossypii (Aphid) and Bemisia tabacci (Whiteflies). Roche’s GS-Titanium was used to sequence transcriptomes of cotton infested with aphids and whiteflies for 2 h and 24 h. Results A total of 100935 contigs were produced with an average length of 529 bp after an assembly in all five selected conditions. The Blastn of the non-redundant (nr) cotton EST database resulted in the identification of 580 novel contigs in the cotton plant. It should be noted that in spite of minimal physical damage caused by the sap-sucking insects, they can change the gene expression of plants in 2 h of infestation; further change in gene expression due to whiteflies is quicker than due to aphids. The impact of the whitefly 24 h after infestation was more or less similar to that of the aphid 2 h after infestation. Aphids and whiteflies affect many genes that are regulated by various phytohormones and in response to microbial infection, indicating the involvement of complex crosstalk between these pathways. The KOBAS analysis of differentially regulated transcripts in response to aphids and whiteflies indicated that both the insects induce the metabolism of amino acids biosynthesis specially in case of whiteflies infestation at later phase. Further we also observed that expression of transcript related to photosynthesis specially carbon fixation were significantly influenced by infestation of Aphids and Whiteflies. Conclusions A comparison of different transcriptomes leads to the identification of differentially and temporally regulated transcripts in response to infestation by aphids and whiteflies. Most of these differentially expressed contigs were related to genes involved in biotic, abiotic stresses and enzymatic activities related to hydrolases, transferases, and kinases. The expression of some marker genes such as the overexpressors of cationic peroxidase 3, lipoxygenase I, TGA2, and non-specific lipase, which are involved in phytohormonal-mediated plant resistance development, was suppressed after infestation by aphids and whiteflies, indicating that insects suppressed plant resistance in order to facilitate their infestation. We also concluded that cotton shares several pathways such as phagosomes, RNA transport, and amino acid metabolism with Arabidopsis in response to the infestation by aphids and whiteflies. PMID:23577705

  13. Elucidating a molecular mechanism that the deterioration of porcine meat quality responds to increased cortisol based on transcriptome sequencing.

    PubMed

    Wan, Xuebin; Wang, Dan; Xiong, Qi; Xiang, Hong; Li, Huanan; Wang, Hongshuai; Liu, Zezhang; Niu, Hongdan; Peng, Jian; Jiang, Siwen; Chai, Jin

    2016-11-11

    Stress response is tightly linked to meat quality. The current understanding of the intrinsic mechanism of meat deterioration under stress is limited. Here, male piglets were randomly assigned to cortisol and control groups. Our results showed that when serum cortisol level was significantly increased, the meat color at 1 h postmortem, muscle bundle ratio, apoptosis rate, and gene expression levels of calcium channel and cell apoptosis including SERCA1, IP3R1, BAX, Bcl-2, and Caspase-3, were notably increased. However, the value of drip loss at 24 h postmortem and serum CK were significantly decreased. Additionally, a large number of differentially expressed genes (DEGs) in GC regulation mechanism were screened out using transcriptome sequencing technology. A total of 223 DEGs were found, including 80 up-regulated genes and 143 down-regulated genes. A total of 204 genes were enriched in GO terms, and 140 genes annotated into in KEGG database. Numerous genes were primarily involved in defense, inflammatory and wound responses. This study not only identifies important genes and signalling pathways that may affect the meat quality but also offers a reference for breeding and feeding management to provide consumers with better quality pork products.

  14. Plant genome and transcriptome annotations: from misconceptions to simple solutions

    PubMed Central

    Bolger, Marie E; Arsova, Borjana; Usadel, Björn

    2018-01-01

    Abstract Next-generation sequencing has triggered an explosion of available genomic and transcriptomic resources in the plant sciences. Although genome and transcriptome sequencing has become orders of magnitudes cheaper and more efficient, often the functional annotation process is lagging behind. This might be hampered by the lack of a comprehensive enumeration of simple-to-use tools available to the plant researcher. In this comprehensive review, we present (i) typical ontologies to be used in the plant sciences, (ii) useful databases and resources used for functional annotation, (iii) what to expect from an annotated plant genome, (iv) an automated annotation pipeline and (v) a recipe and reference chart outlining typical steps used to annotate plant genomes/transcriptomes using publicly available resources. PMID:28062412

  15. Transcriptomes of Trypanosoma brucei rhodesiense from sleeping sickness patients, rodents and culture: Effects of strain, growth conditions and RNA preparation methods

    PubMed Central

    Mulindwa, Julius; Leiss, Kevin; Ibberson, David; Kamanyi Marucha, Kevin; Helbig, Claudia; Melo do Nascimento, Larissa; Silvester, Eleanor; Matthews, Keith; Matovu, Enock; Enyaru, John

    2018-01-01

    All of our current knowledge of African trypanosome metabolism is based on results from trypanosomes grown in culture or in rodents. Drugs against sleeping sickness must however treat trypanosomes in humans. We here compare the transcriptomes of Trypanosoma brucei rhodesiense from the blood and cerebrospinal fluid of human patients with those of trypanosomes from culture and rodents. The data were aligned and analysed using new user-friendly applications designed for Kinetoplastid RNA-Seq data. The transcriptomes of trypanosomes from human blood and cerebrospinal fluid did not predict major metabolic differences that might affect drug susceptibility. Usefully, there were relatively few differences between the transcriptomes of trypanosomes from patients and those of similar trypanosomes grown in rats. Transcriptomes of monomorphic laboratory-adapted parasites grown in in vitro culture closely resembled those of the human parasites, but some differences were seen. In poly(A)-selected mRNA transcriptomes, mRNAs encoding some protein kinases and RNA-binding proteins were under-represented relative to mRNA that had not been poly(A) selected; further investigation revealed that the selection tends to result in loss of longer mRNAs. PMID:29474390

  16. Transcriptomes of Trypanosoma brucei rhodesiense from sleeping sickness patients, rodents and culture: Effects of strain, growth conditions and RNA preparation methods.

    PubMed

    Mulindwa, Julius; Leiss, Kevin; Ibberson, David; Kamanyi Marucha, Kevin; Helbig, Claudia; Melo do Nascimento, Larissa; Silvester, Eleanor; Matthews, Keith; Matovu, Enock; Enyaru, John; Clayton, Christine

    2018-02-01

    All of our current knowledge of African trypanosome metabolism is based on results from trypanosomes grown in culture or in rodents. Drugs against sleeping sickness must however treat trypanosomes in humans. We here compare the transcriptomes of Trypanosoma brucei rhodesiense from the blood and cerebrospinal fluid of human patients with those of trypanosomes from culture and rodents. The data were aligned and analysed using new user-friendly applications designed for Kinetoplastid RNA-Seq data. The transcriptomes of trypanosomes from human blood and cerebrospinal fluid did not predict major metabolic differences that might affect drug susceptibility. Usefully, there were relatively few differences between the transcriptomes of trypanosomes from patients and those of similar trypanosomes grown in rats. Transcriptomes of monomorphic laboratory-adapted parasites grown in in vitro culture closely resembled those of the human parasites, but some differences were seen. In poly(A)-selected mRNA transcriptomes, mRNAs encoding some protein kinases and RNA-binding proteins were under-represented relative to mRNA that had not been poly(A) selected; further investigation revealed that the selection tends to result in loss of longer mRNAs.

  17. Genome-wide analysis on Chlamydomonas reinhardtii reveals the impact of hydrogen peroxide on protein stress responses and overlap with other stress transcriptomes

    DOE PAGES

    Blaby, Ian K.; Blaby-Haas, Crysten E.; Pérez-Pérez, María Esther; ...

    2015-12-07

    Reactive oxygen species (ROS) are produced by and have the potential to be damaging to all aerobic organisms. In photosynthetic organisms, they are an unavoidable byproduct of electron transfer in both the chloroplast and mitochondrion. Here, in this paper, we employ the reference unicellular green alga Chlamydomonas reinhardtii to identify the effect of H 2O 2 on gene expression by monitoring the changes in the transcriptome in a time-course experiment. Comparison of transcriptomes from cells sampled immediately prior to the addition of H 2O 2 and 0.5 and 1 h subsequently revealed 1278 differentially abundant transcripts. Of those transcripts thatmore » increase in abundance, many encode proteins involved in ROS detoxification, protein degradation and stress responses, whereas among those that decrease are transcripts encoding proteins involved in photosynthesis and central carbon metabolism. In addition to these transcriptomic adjustments, we observe that addition of H 2O 2 is followed by an accumulation and oxidation of the total intracellular glutathione pool, and a decrease in photosynthetic O 2 output. Additionally, we analyze our transcriptomes in the context of changes in transcript abundance in response to singlet O 2 (O 2 *), and relate our H 2O 2-induced transcripts to a diurnal transcriptome, where we demonstrate enrichments of H 2O 2-induced transcripts early in the light phase, late in the light phase and 2 h prior to light. In conclusion, on this basis several genes that are highlighted in this work may be involved in previously undiscovered stress remediation pathways or acclimation responses.« less

  18. Production of a reference transcriptome and transcriptomic database (EdwardsiellaBase) for the lined sea anemone, Edwardsiella lineata, a parasitic cnidarian

    PubMed Central

    2014-01-01

    Background The lined sea anemone Edwardsiella lineata is an informative model system for evolutionary-developmental studies of parasitism. In this species, it is possible to compare alternate developmental pathways leading from a larva to either a free-living polyp or a vermiform parasite that inhabits the mesoglea of a ctenophore host. Additionally, E. lineata is confamilial with the model cnidarian Nematostella vectensis, providing an opportunity for comparative genomic, molecular and organismal studies. Description We generated a reference transcriptome for E. lineata via high-throughput sequencing of RNA isolated from five developmental stages (parasite; parasite-to-larva transition; larva; larva-to-adult transition; adult). The transcriptome comprises 90,440 contigs assembled from >15 billion nucleotides of DNA sequence. Using a molecular clock approach, we estimated the divergence between E. lineata and N. vectensis at 215–364 million years ago. Based on gene ontology and metabolic pathway analyses and gene family surveys (bHLH-PAS, deiodinases, Fox genes, LIM homeodomains, minicollagens, nuclear receptors, Sox genes, and Wnts), the transcriptome of E. lineata is comparable in depth and completeness to N. vectensis. Analyses of protein motifs and revealed extensive conservation between the proteins of these two edwardsiid anemones, although we show the NF-κB protein of E. lineata reflects the ancestral structure, while the NF-κB protein of N. vectensis has undergone a split that separates the DNA-binding domain from the inhibitory domain. All contigs have been deposited in a public database (EdwardsiellaBase), where they may be searched according to contig ID, gene ontology, protein family motif (Pfam), enzyme commission number, and BLAST. The alignment of the raw reads to the contigs can also be visualized via JBrowse. Conclusions The transcriptomic data and database described here provide a platform for studying the evolutionary developmental genomics of a derived parasitic life cycle. In addition, these data from E. lineata will aid in the interpretation of evolutionary novelties in gene sequence or structure that have been reported for the model cnidarian N. vectensis (e.g., the split NF-κB locus). Finally, we include custom computational tools to facilitate the annotation of a transcriptome based on high-throughput sequencing data obtained from a “non-model system.” PMID:24467778

  19. Production of a reference transcriptome and transcriptomic database (EdwardsiellaBase) for the lined sea anemone, Edwardsiella lineata, a parasitic cnidarian.

    PubMed

    Stefanik, Derek J; Lubinski, Tristan J; Granger, Brian R; Byrd, Allyson L; Reitzel, Adam M; DeFilippo, Lukas; Lorenc, Allison; Finnerty, John R

    2014-01-28

    The lined sea anemone Edwardsiella lineata is an informative model system for evolutionary-developmental studies of parasitism. In this species, it is possible to compare alternate developmental pathways leading from a larva to either a free-living polyp or a vermiform parasite that inhabits the mesoglea of a ctenophore host. Additionally, E. lineata is confamilial with the model cnidarian Nematostella vectensis, providing an opportunity for comparative genomic, molecular and organismal studies. We generated a reference transcriptome for E. lineata via high-throughput sequencing of RNA isolated from five developmental stages (parasite; parasite-to-larva transition; larva; larva-to-adult transition; adult). The transcriptome comprises 90,440 contigs assembled from >15 billion nucleotides of DNA sequence. Using a molecular clock approach, we estimated the divergence between E. lineata and N. vectensis at 215-364 million years ago. Based on gene ontology and metabolic pathway analyses and gene family surveys (bHLH-PAS, deiodinases, Fox genes, LIM homeodomains, minicollagens, nuclear receptors, Sox genes, and Wnts), the transcriptome of E. lineata is comparable in depth and completeness to N. vectensis. Analyses of protein motifs and revealed extensive conservation between the proteins of these two edwardsiid anemones, although we show the NF-κB protein of E. lineata reflects the ancestral structure, while the NF-κB protein of N. vectensis has undergone a split that separates the DNA-binding domain from the inhibitory domain. All contigs have been deposited in a public database (EdwardsiellaBase), where they may be searched according to contig ID, gene ontology, protein family motif (Pfam), enzyme commission number, and BLAST. The alignment of the raw reads to the contigs can also be visualized via JBrowse. The transcriptomic data and database described here provide a platform for studying the evolutionary developmental genomics of a derived parasitic life cycle. In addition, these data from E. lineata will aid in the interpretation of evolutionary novelties in gene sequence or structure that have been reported for the model cnidarian N. vectensis (e.g., the split NF-κB locus). Finally, we include custom computational tools to facilitate the annotation of a transcriptome based on high-throughput sequencing data obtained from a "non-model system."

  20. RNA-seq Transcriptome Analysis of Panax japonicus, and Its Comparison with Other Panax Species to Identify Potential Genes Involved in the Saponins Biosynthesis

    PubMed Central

    Rai, Amit; Yamazaki, Mami; Takahashi, Hiroki; Nakamura, Michimi; Kojoma, Mareshige; Suzuki, Hideyuki; Saito, Kazuki

    2016-01-01

    The Panax genus has been a source of natural medicine, benefitting human health over the ages, among which the Panax japonicus represents an important species. Our understanding of several key pathways and enzymes involved in the biosynthesis of ginsenosides, a pharmacologically active class of metabolites and a major chemical constituents of the rhizome extracts from the Panax species, are limited. Limited genomic information, and lack of studies on comparative transcriptomics across the Panax species have restricted our understanding of the biosynthetic mechanisms of these and many other important classes of phytochemicals. Herein, we describe Illumina based RNA sequencing analysis to characterize the transcriptome and expression profiles of genes expressed in the five tissues of P. japonicus, and its comparison with other Panax species. RNA sequencing and de novo transcriptome assembly for P. japonicus resulted in a total of 135,235 unigenes with 78,794 (58.24%) unigenes being annotated using NCBI-nr database. Transcriptome profiling, and gene ontology enrichment analysis for five tissues of P. japonicus showed that although overall processes were evenly conserved across all tissues. However, each tissue was characterized by several unique unigenes with the leaves showing the most unique unigenes among the tissues studied. A comparative analysis of the P. japonicus transcriptome assembly with publically available transcripts from other Panax species, namely, P. ginseng, P. notoginseng, and P. quinquefolius also displayed high sequence similarity across all Panax species, with P. japonicus showing highest similarity with P. ginseng. Annotation of P. japonicus transcriptome resulted in the identification of putative genes encoding all enzymes from the triterpene backbone biosynthetic pathways, and identified 24 and 48 unigenes annotated as cytochrome P450 (CYP) and glycosyltransferases (GT), respectively. These CYPs and GTs annotated unigenes were conserved across all Panax species and co-expressed with other the transcripts involved in the triterpenoid backbone biosynthesis pathways. Unigenes identified in this study represent strong candidates for being involved in the triterpenoid saponins biosynthesis, and can serve as a basis for future validation studies. PMID:27148308

  1. Differential Responses of Human Fetal Brain Neural Stem Cells to Zika Virus Infection.

    PubMed

    McGrath, Erica L; Rossi, Shannan L; Gao, Junling; Widen, Steven G; Grant, Auston C; Dunn, Tiffany J; Azar, Sasha R; Roundy, Christopher M; Xiong, Ying; Prusak, Deborah J; Loucas, Bradford D; Wood, Thomas G; Yu, Yongjia; Fernández-Salas, Ildefonso; Weaver, Scott C; Vasilakis, Nikos; Wu, Ping

    2017-03-14

    Zika virus (ZIKV) infection causes microcephaly in a subset of infants born to infected pregnant mothers. It is unknown whether human individual differences contribute to differential susceptibility of ZIKV-related neuropathology. Here, we use an Asian-lineage ZIKV strain, isolated from the 2015 Mexican outbreak (Mex1-7), to infect primary human neural stem cells (hNSCs) originally derived from three individual fetal brains. All three strains of hNSCs exhibited similar rates of Mex1-7 infection and reduced proliferation. However, Mex1-7 decreased neuronal differentiation in only two of the three stem cell strains. Correspondingly, ZIKA-mediated transcriptome alterations were similar in these two strains but significantly different from that of the third strain with no ZIKV-induced neuronal reduction. This study thus confirms that an Asian-lineage ZIKV strain infects primary hNSCs and demonstrates a cell-strain-dependent response of hNSCs to ZIKV infection. Copyright © 2017 The Authors. Published by Elsevier Inc. All rights reserved.

  2. Aquatic models, genomics and chemical risk management.

    PubMed

    Cheng, Keith C; Hinton, David E; Mattingly, Carolyn J; Planchart, Antonio

    2012-01-01

    The 5th Aquatic Animal Models for Human Disease meeting follows four previous meetings (Nairn et al., 2001; Schmale, 2004; Schmale et al., 2007; Hinton et al., 2009) in which advances in aquatic animal models for human disease research were reported, and community discussion of future direction was pursued. At this meeting, discussion at a workshop entitled Bioinformatics and Computational Biology with Web-based Resources (20 September 2010) led to an important conclusion: Aquatic model research using feral and experimental fish, in combination with web-based access to annotated anatomical atlases and toxicological databases, yields data that advance our understanding of human gene function, and can be used to facilitate environmental management and drug development. We propose here that the effects of genes and environment are best appreciated within an anatomical context - the specifically affected cells and organs in the whole animal. We envision the use of automated, whole-animal imaging at cellular resolution and computational morphometry facilitated by high-performance computing and automated entry into toxicological databases, as anchors for genetic and toxicological data, and as connectors between human and model system data. These principles should be applied to both laboratory and feral fish populations, which have been virtually irreplaceable sentinals for environmental contamination that results in human morbidity and mortality. We conclude that automation, database generation, and web-based accessibility, facilitated by genomic/transcriptomic data and high-performance and cloud computing, will potentiate the unique and potentially key roles that aquatic models play in advancing systems biology, drug development, and environmental risk management. Copyright © 2011 Elsevier Inc. All rights reserved.

  3. A New Omics Data Resource of Pleurocybella porrigens for Gene Discovery

    PubMed Central

    Dohra, Hideo; Someya, Takumi; Takano, Tomoyuki; Harada, Kiyonori; Omae, Saori; Hirai, Hirofumi; Yano, Kentaro; Kawagishi, Hirokazu

    2013-01-01

    Background Pleurocybella porrigens is a mushroom-forming fungus, which has been consumed as a traditional food in Japan. In 2004, 55 people were poisoned by eating the mushroom and 17 people among them died of acute encephalopathy. Since then, the Japanese government has been alerting Japanese people to take precautions against eating the P . porrigens mushroom. Unfortunately, despite efforts, the molecular mechanism of the encephalopathy remains elusive. The genome and transcriptome sequence data of P . porrigens and the related species, however, are not stored in the public database. To gain the omics data in P . porrigens , we sequenced genome and transcriptome of its fruiting bodies and mycelia by next generation sequencing. Methodology/Principal Findings Short read sequences of genomic DNAs and mRNAs in P . porrigens were generated by Illumina Genome Analyzer. Genome short reads were de novo assembled into scaffolds using Velvet. Comparisons of genome signatures among Agaricales showed that P . porrigens has a unique genome signature. Transcriptome sequences were assembled into contigs (unigenes). Biological functions of unigenes were predicted by Gene Ontology and KEGG pathway analyses. The majority of unigenes would be novel genes without significant counterparts in the public omics databases. Conclusions Functional analyses of unigenes present the existence of numerous novel genes in the basidiomycetes division. The results mean that the omics information such as genome, transcriptome and metabolome in basidiomycetes is short in the current databases. The large-scale omics information on P . porrigens , provided from this research, will give a new data resource for gene discovery in basidiomycetes. PMID:23936076

  4. Workflow and web application for annotating NCBI BioProject transcriptome data.

    PubMed

    Vera Alvarez, Roberto; Medeiros Vidal, Newton; Garzón-Martínez, Gina A; Barrero, Luz S; Landsman, David; Mariño-Ramírez, Leonardo

    2017-01-01

    The volume of transcriptome data is growing exponentially due to rapid improvement of experimental technologies. In response, large central resources such as those of the National Center for Biotechnology Information (NCBI) are continually adapting their computational infrastructure to accommodate this large influx of data. New and specialized databases, such as Transcriptome Shotgun Assembly Sequence Database (TSA) and Sequence Read Archive (SRA), have been created to aid the development and expansion of centralized repositories. Although the central resource databases are under continual development, they do not include automatic pipelines to increase annotation of newly deposited data. Therefore, third-party applications are required to achieve that aim. Here, we present an automatic workflow and web application for the annotation of transcriptome data. The workflow creates secondary data such as sequencing reads and BLAST alignments, which are available through the web application. They are based on freely available bioinformatics tools and scripts developed in-house. The interactive web application provides a search engine and several browser utilities. Graphical views of transcript alignments are available through SeqViewer, an embedded tool developed by NCBI for viewing biological sequence data. The web application is tightly integrated with other NCBI web applications and tools to extend the functionality of data processing and interconnectivity. We present a case study for the species Physalis peruviana with data generated from BioProject ID 67621. URL: http://www.ncbi.nlm.nih.gov/projects/physalis/. Published by Oxford University Press 2017. This work is written by US Government employees and is in the public domain in the US.

  5. Dual Transcriptomic Profiling of Host and Microbiota during Health and Disease in Pediatric Asthma.

    PubMed

    Pérez-Losada, Marcos; Castro-Nallar, Eduardo; Bendall, Matthew L; Freishtat, Robert J; Crandall, Keith A

    2015-01-01

    High-throughput sequencing (HTS) analysis of microbial communities from the respiratory airways has heavily relied on the 16S rRNA gene. Given the intrinsic limitations of this approach, airway microbiome research has focused on assessing bacterial composition during health and disease, and its variation in relation to clinical and environmental factors, or other microbiomes. Consequently, very little effort has been dedicated to describing the functional characteristics of the airway microbiota and even less to explore the microbe-host interactions. Here we present a simultaneous assessment of microbiome and host functional diversity and host-microbe interactions from the same RNA-seq experiment, while accounting for variation in clinical metadata. Transcriptomic (host) and metatranscriptomic (microbiota) sequences from the nasal epithelium of 8 asthmatics and 6 healthy controls were separated in silico and mapped to available human and NCBI-NR protein reference databases. Human genes differentially expressed in asthmatics and controls were then used to infer upstream regulators involved in immune and inflammatory responses. Concomitantly, microbial genes were mapped to metabolic databases (COG, SEED, and KEGG) to infer microbial functions differentially expressed in asthmatics and controls. Finally, multivariate analysis was applied to find associations between microbiome characteristics and host upstream regulators while accounting for clinical variation. Our study showed significant differences in the metabolism of microbiomes from asthmatic and non-asthmatic children for up to 25% of the functional properties tested. Enrichment analysis of 499 differentially expressed host genes for inflammatory and immune responses revealed 43 upstream regulators differentially activated in asthma. Microbial adhesion (virulence) and Proteobacteria abundance were significantly associated with variation in the expression of the upstream regulator IL1A; suggesting that microbiome characteristics modulate host inflammatory and immune systems during asthma.

  6. De novo transcriptome sequencing and comprehensive analysis of the heat stress response genes in the basidiomycetes fungus Ganoderma lucidum.

    PubMed

    Tan, Xiaoyan; Sun, Junshe; Ning, Huijuan; Qin, Zifang; Miao, Yuxin; Sun, Tian; Zhang, Xiuqing

    2018-06-30

    Ganoderma lucidum is a valuable basidiomycete with numerous pharmacological compounds, which is widely consumed throughout China. We previously found that the polysaccharide content of Ganoderma lucidum fruiting bodies could be significantly improved by 45.63% with treatment of 42 °C heat stress (HS) for 2 h. To further investigate genes involved in HS response and explore the mechanisms of HS regulating the carbohydrate metabolism in Ganoderma lucidum, high-throughput RNA-Seq was conducted to analyse the difference between control and heat-treated mycelia at transcriptome level. We sequenced six cDNA libraries with three from control group (mycelia cultivated at 28 °C) and three from heat-treated group (mycelia subjected to 42 °C for 2 h). A total of 99,899 transcripts were generated using Trinity method and 59,136 unigenes were annotated by seven public databases. Among them, 2790 genes were identified to be differential expressed genes (DEGs) under HS condition, which included 1991 up-regulated and 799 down-regulated. 176 DEGs were then manually classified into five main responsive-related categories according to their putative functions and possible metabolic pathways. These groups include stress resistance-related factors; protein assembly, transportation and degradation; signal transduction; carbohydrate metabolism and energy provision-related process; other related functions, suggesting that a series of metabolic pathways in Ganoderma lucidum are activated by HS and the response mechanism involves a complex molecular network which needs further study. Remarkably, 48 DEGs were found to regulate carbohydrate metabolism, both in carbohydrate hydrolysis for energy provision and polysaccharide synthesis. In summary, this comprehensive transcriptome analysis will provide enlarged resource for further investigation into the molecular mechanisms of basidiomycete under HS condition. Copyright © 2018 Elsevier B.V. All rights reserved.

  7. Transcriptome profiling of anthocyanin-related genes reveals effects of light intensity on anthocyanin biosynthesis in red leaf lettuce.

    PubMed

    Zhang, Yanzhao; Xu, Shuzhen; Cheng, Yanwei; Peng, Zhengfeng; Han, Jianming

    2018-01-01

    Red leaf lettuce ( Lactuca sativa L.) is popular due to its high anthocyanin content, but poor leaf coloring often occurs under low light intensity. In order to reveal the mechanisms of anthocyanins affected by light intensity, we compared the transcriptome of L. sativa L. var. capitata under light intensities of 40 and 100 μmol m -2 s -1 . A total of 62,111 unigenes were de novo assembled with an N50 of 1,681 bp, and 48,435 unigenes were functionally annotated in public databases. A total of 3,899 differentially expressed genes (DEGs) were detected, of which 1,377 unigenes were up-regulated and 2,552 unigenes were down-regulated in the high light samples. By Kyoto Encyclopedia of Genes and Genomes enrichment analysis, the DEGs were significantly enriched in 14 pathways. Using gene annotation and phylogenetic analysis, we identified seven anthocyanin structural genes, including CHS , CHI , F3H , F3'H , DFR , ANS , and 3GT , and two anthocyanin transport genes, GST and MATE . In terms of anthocyanin regulatory genes, five MYBs and one bHLH gene were identified. An HY5 gene was discovered, which may respond to light-signaling and regulate anthocyanin structural genes. These genes showed a log2FC of 2.7-9.0 under high irradiance, and were validated using quantitative real-time-PCR. In conclusion, our results indicated transcriptome variance in red leaf lettuce under low and high light intensity, and observed a anthocyanin biosynthesis and regulation pattern. The data should further help to unravel the molecular mechanisms of anthocyanins influenced by light intensity.

  8. Integrated Clinical, Pathologic, Virologic, and Transcriptomic Analysis of H5N1 Influenza Virus-Induced Viral Pneumonia in the Rhesus Macaque

    PubMed Central

    Shinya, Kyoko; Gao, Yuwei; Cilloniz, Cristian; Suzuki, Yasuhiro; Fujie, Masahiro; Deng, Guohua; Zhu, Qiyun; Fan, Shufang; Makino, Akiko; Muramoto, Yukiko; Fukuyama, Satoshi; Tamura, Daisuke; Noda, Takeshi; Eisfeld, Amie J.; Katze, Michael G.

    2012-01-01

    Viral pneumonia has been frequently reported during early stages of influenza virus pandemics and in many human cases of highly pathogenic avian influenza (HPAI) H5N1 virus infection. To better understand the pathogenesis of this disease, we produced nonlethal viral pneumonia in rhesus macaques by using an HPAI H5N1 virus (A/Anhui/2/2005; referred to as Anhui/2). Infected macaques were monitored for 14 days, and tissue samples were collected at 6 time points for virologic, histopathologic, and transcriptomic analyses. Anhui/2 efficiently replicated in the lung from 12 h to 3 days postinfection (p.i.) and caused temporal but severe pneumonia that began to resolve by day 14. Lung transcriptional changes were first observed at 6 h, and increased expression of vascular permeability regulators and neutrophil chemoattractants correlated with increased serum leakage and neutrophil infiltration in situ. Additional inflammatory, antiviral, and apoptotic genes were upregulated from 12 h, concurrent with viral antigen detection and increasing immune cell populations. A shift toward upregulation of acquired immunity was apparent after day 6. Expression levels of established immune cell molecular markers revealed remarkable similarity with pathological findings, indicating early and robust neutrophil infiltration, a slight delay in macrophage accumulation, and abundant late populations of T lymphocytes. We also characterized the putative mechanisms regulating a unique, pneumonia-associated biphasic fever pattern. Thus, this study is the first to use a comprehensive and integrative approach to delineate specific molecular mechanisms regulating influenza virus-induced pneumonia in nonhuman primates, an important first step toward better management of human influenza virus disease. PMID:22491448

  9. MeT-DB V2.0: elucidating context-specific functions of N6-methyl-adenosine methyltranscriptome

    PubMed Central

    Liu, Hui; Wang, Huaizhi; Wei, Zhen; Zhang, Songyao; Hua, Gang; Zhang, Shao-Wu; Zhang, Lin; Gao, Shou-Jiang

    2018-01-01

    Abstract Methyltranscriptome is an exciting new area that studies the mechanisms and functions of methylation in transcripts. A knowledge base with the systematic collection and curation of context specific transcriptome-wide methylations is critical for elucidating their biological functions as well as for developing bioinformatics tools. Since its inception in 2014, the Met-DB (Liu, H., Flores, M.A., Meng, J., Zhang, L., Zhao, X., Rao, M.K., Chen, Y. and Huang, Y. (2015) MeT-DB: a database of transcriptome methylation in mammalian cells. Nucleic Acids Res., 43, D197–D203), has become an important resource for methyltranscriptome, especially in the N6-methyl-adenosine (m6A) research community. Here, we report Met-DB v2.0, the significantly improved second version of Met-DB, which is entirely redesigned to focus more on elucidating context-specific m6A functions. Met-DB v2.0 has a major increase in context-specific m6A peaks and single-base sites predicted from 185 samples for 7 species from 26 independent studies. Moreover, it is also integrated with a new database for targets of m6A readers, erasers and writers and expanded with more collections of functional data. The redesigned Met-DB v2.0 web interface and genome browser provide more friendly, powerful, and informative ways to query and visualize the data. More importantly, MeT-DB v2.0 offers for the first time a series of tools specifically designed for understanding m6A functions. Met-DB V2.0 will be a valuable resource for m6A methyltranscriptome research. The Met-DB V2.0 database is available at http://compgenomics.utsa.edu/MeTDB/ and http://www.xjtlu.edu.cn/metdb2. PMID:29126312

  10. De Novo Transcriptome Analysis of Two Seahorse Species (Hippocampus erectus and H. mohnikei) and the Development of Molecular Markers for Population Genetics

    PubMed Central

    Lin, Qiang; Luo, Wei; Wan, Shiming; Gao, Zexia

    2016-01-01

    Seahorse conservation has been performed utilizing various strategies for many decades, and the deeper understanding of genomic information is necessary to more efficiently protect the germplasm resources of seahorse species. However, little genetic information about seahorses currently exists in the public databases. In this study, high-throughput RNA sequencing for two seahorse species, Hippocampus erectus and H. mohnikei, was carried out, and de novo assembly generated 37,506 unigenes for H. erectus and 36,113 unigenes for H. mohnikei. Among them, 17,338 (46.23%) unigenes for H. erectus and 17,900 (49.57%) for H. mohnikei were successfully annotated based on the information available from the public databases. Through comparing the unigenes of two seahorse species, 7,802 candidate orthologous genes were identified and 5,268 genes among them could be annotated. In addition, gene ontology analysis of two species was similarly performed on biological processes, cellular components, and molecular functions. Twenty-four and twenty-one unigenes in H. erectus and H. mohnikei were annotated in the biosynthesis of unsaturated fatty acids pathways, and both seahorses lacked the Δ12 and Δ15 desaturases. Total of 8,992 and 9,116 SSR loci were obtained from H. erectus and H. mohnikei unigenes, respectively. Dozens of SSR were developed and then applied to assess the population genetic diversity, as well as cross-amplified in a related species, H. trimaculatus. The HO and HE values of the tested populations for H. erectus, H. mohnikei, and H. trimaculatus were medium. These resources would facilitate the conservation of the species through a better understanding of the genomics and comparative genome analysis within the Hippocampus genus. PMID:27128031

  11. De Novo Transcriptome Analysis of Two Seahorse Species (Hippocampus erectus and H. mohnikei) and the Development of Molecular Markers for Population Genetics.

    PubMed

    Lin, Qiang; Luo, Wei; Wan, Shiming; Gao, Zexia

    2016-01-01

    Seahorse conservation has been performed utilizing various strategies for many decades, and the deeper understanding of genomic information is necessary to more efficiently protect the germplasm resources of seahorse species. However, little genetic information about seahorses currently exists in the public databases. In this study, high-throughput RNA sequencing for two seahorse species, Hippocampus erectus and H. mohnikei, was carried out, and de novo assembly generated 37,506 unigenes for H. erectus and 36,113 unigenes for H. mohnikei. Among them, 17,338 (46.23%) unigenes for H. erectus and 17,900 (49.57%) for H. mohnikei were successfully annotated based on the information available from the public databases. Through comparing the unigenes of two seahorse species, 7,802 candidate orthologous genes were identified and 5,268 genes among them could be annotated. In addition, gene ontology analysis of two species was similarly performed on biological processes, cellular components, and molecular functions. Twenty-four and twenty-one unigenes in H. erectus and H. mohnikei were annotated in the biosynthesis of unsaturated fatty acids pathways, and both seahorses lacked the Δ12 and Δ15 desaturases. Total of 8,992 and 9,116 SSR loci were obtained from H. erectus and H. mohnikei unigenes, respectively. Dozens of SSR were developed and then applied to assess the population genetic diversity, as well as cross-amplified in a related species, H. trimaculatus. The HO and HE values of the tested populations for H. erectus, H. mohnikei, and H. trimaculatus were medium. These resources would facilitate the conservation of the species through a better understanding of the genomics and comparative genome analysis within the Hippocampus genus.

  12. Profiling the venom gland transcriptomes of Costa Rican snakes by 454 pyrosequencing

    PubMed Central

    2011-01-01

    Background A long term research goal of venomics, of applied importance for improving current antivenom therapy, but also for drug discovery, is to understand the pharmacological potential of venoms. Individually or combined, proteomic and transcriptomic studies have demonstrated their feasibility to explore in depth the molecular diversity of venoms. In the absence of genome sequence, transcriptomes represent also valuable searchable databases for proteomic projects. Results The venom gland transcriptomes of 8 Costa Rican taxa from 5 genera (Crotalus, Bothrops, Atropoides, Cerrophidion, and Bothriechis) of pitvipers were investigated using high-throughput 454 pyrosequencing. 100,394 out of 330,010 masked reads produced significant hits in the available databases. 5.165,220 nucleotides (8.27%) were masked by RepeatMasker, the vast majority of which corresponding to class I (retroelements) and class II (DNA transposons) mobile elements. BLAST hits included 79,991 matches to entries of the taxonomic suborder Serpentes, of which 62,433 displayed similarity to documented venom proteins. Strong discrepancies between the transcriptome-computed and the proteome-gathered toxin compositions were obvious at first sight. Although the reasons underlaying this discrepancy are elusive, since no clear trend within or between species is apparent, the data indicate that individual mRNA species may be translationally controlled in a species-dependent manner. The minimum number of genes from each toxin family transcribed into the venom gland transcriptome of each species was calculated from multiple alignments of reads matched to a full-length reference sequence of each toxin family. Reads encoding ORF regions of Kazal-type inhibitor-like proteins were uniquely found in Bothriechis schlegelii and B. lateralis transcriptomes, suggesting a genus-specific recruitment event during the early-Middle Miocene. A transcriptome-based cladogram supports the large divergence between A. mexicanus and A. picadoi, and a closer kinship between A. mexicanus and C. godmani. Conclusions Our comparative next-generation sequencing (NGS) analysis reveals taxon-specific trends governing the formulation of the venom arsenal. Knowledge of the venom proteome provides hints on the translation efficiency of toxin-coding transcripts, contributing thereby to a more accurate interpretation of the transcriptome. The application of NGS to the analysis of snake venom transcriptomes, may represent the tool for opening the door to systems venomics. PMID:21605378

  13. Transcriptomic Analysis and the Expression of Disease-Resistant Genes in Oryza meyeriana under Native Condition

    PubMed Central

    He, Bin; Tao, Xiang; Gu, Yinghong; Wei, Changhe; Cheng, Xiaojie; Xiao, Suqin; Cheng, Zaiquan; Zhang, Yizheng

    2015-01-01

    Oryza meyeriana (O. meyeriana), with a GG genome type (2n = 24), accumulated plentiful excellent characteristics with respect to resistance to many diseases such as rice shade and blast, even immunity to bacterial blight. It is very important to know if the diseases-resistant genes exist and express in this wild rice under native conditions. However, limited genomic or transcriptomic data of O. meyeriana are currently available. In this study, we present the first comprehensive characterization of the O. meyeriana transcriptome using RNA-seq and obtained 185,323 contigs with an average length of 1,692 bp and an N50 of 2,391 bp. Through differential expression analysis, it was found that there were most tissue-specifically expressed genes in roots, and next to stems and leaves. By similarity search against protein databases, 146,450 had at least a significant alignment to existed gene models. Comparison with the Oryza sativa (japonica-type Nipponbare and indica-type 93–11) genomes revealed that 13% of the O. meyeriana contigs had not been detected in O. sativa. Many diseases-resistant genes, such as bacterial blight resistant, blast resistant, rust resistant, fusarium resistant, cyst nematode resistant and downy mildew gene, were mined from the transcriptomic database. There are two kinds of rice bacterial blight-resistant genes (Xa1 and Xa26) differentially or specifically expressed in O. meyeriana. The 4 Xa1 contigs were all only expressed in root, while three of Xa26 contigs have the highest expression level in leaves, two of Xa26 contigs have the highest expression profile in stems and one of Xa26 contigs was expressed dominantly in roots. The transcriptomic database of O. meyeriana has been constructed and many diseases-resistant genes were found to express under native condition, which provides a foundation for future discovery of a number of novel genes and provides a basis for studying the molecular mechanisms associated with disease resistance in O. meyeriana. PMID:26640944

  14. A Floral Transcriptome for Hippeastrum (Amaryllidaceae)

    USDA-ARS?s Scientific Manuscript database

    Two transcriptomes have been constructed from floral tissue of two Hippeastrum (Amaryllidaceae) species, H. brasilianum (Traub & J.L.Doran) Dutilh and H. papilio (Ravenna) Van Scheepan. The former has fragrant flowers, while flowers of the latter do not produce floral fragrance. RNA was isolated a...

  15. Comparative analysis of the early transcriptome of Brucella abortus - infected monocyte-derived macrophages from cattle naturally resistant or susceptible to brucellosis

    PubMed Central

    Rossetti, C.A.; Galindo, C.L.; Everts, R.E.; Lewin, H.A.; Garner, H.R.; Adams, L.G.

    2010-01-01

    Brucellosis is a worldwide zoonotic infectious disease that has a significant economic impact on animal production and human public health. We characterized the gene expression profile of B. abortus-infected monocyte-derived macrophages (MDMs) from naïve cattle naturally resistant (R) or susceptible (S) to brucellosis using a cDNA microarray technology. Our data indicate that 1) B. abortus induced a slightly increased genome activation in R MDMs and a down-regulated transcriptome in S MDMs, during the onset of infection, 2) R MDMs had the ability to mount a type 1 immune response against B. abortus infection which was impaired in S cells, and 3) the host cell activity was not altered after 12h post-B. abortus infection in R MDMs while the cell cycle was largely arrested in infected S MDMs at 12h p.i. These results contribute to understand of how host responses may be manipulated to prevent infection by brucellae. PMID:20932540

  16. De novo assembly of pen shell ( Atrina pectinata) transcriptome and screening of its genic microsatellites

    NASA Astrophysics Data System (ADS)

    Sun, Xiujun; Li, Dongming; Liu, Zhihong; Zhou, Liqing; Wu, Biao; Yang, Aiguo

    2017-10-01

    The pen shell ( Atrina pectinata) is a large wedge-shaped bivalve, which belongs to family Pinnidae. Due to its large and nutritious adductor muscle, it is the popular seafood with high commercial value in Asia-Pacific countries. However, limiting genomic and transcriptomic data have hampered its genetic investigations. In this study, the transcriptome of A. pectinata was deeply sequenced using Illumina pair-end sequencing technology. After assembling, a total of 127263 unigenes were obtained. Functional annotation indicated that the highest percentage of unigenes (18.60%) was annotated on GO database, followed by 18.44% on PFAM database and 17.04% on NR database. There were 270 biological pathways matched with those in KEGG database. Furthermore, a total of 23452 potential simple sequence repeats (SSRs) were identified, of them the most abundant type was mono-nucleotide repeats (12902, 55.01%), which was followed by di-nucleotide (8132, 34.68%), tri-nucleotide (2010, 8.57%), tetra-nucleotide (401, 1.71%), and penta-nucleotide (7, 0.03%) repeats. Sixty SSRs were selected for validating and developing genic SSR markers, of them 23 showed polymorphism in a cultured population with the average observed and expected heterozygosities of 0.412 and 0.579, respectively. In this study, we established the first comprehensive transcript dataset of A. pectinata genes. Our results demonstrated that RNA-Seq is a fast and cost-effective method for genic SSR development in non-model species.

  17. Molecular adaptation in the world's deepest-living animal: Insights from transcriptome sequencing of the hadal amphipod Hirondellea gigas.

    PubMed

    Lan, Yi; Sun, Jin; Tian, Renmao; Bartlett, Douglas H; Li, Runsheng; Wong, Yue Him; Zhang, Weipeng; Qiu, Jian-Wen; Xu, Ting; He, Li-Sheng; Tabata, Harry G; Qian, Pei-Yuan

    2017-07-01

    The Challenger Deep in the Mariana Trench is the deepest point in the oceans of our planet. Understanding how animals adapt to this harsh environment characterized by high hydrostatic pressure, food limitation, dark and cold is of great scientific interest. Of the animals dwelling in the Challenger Deep, amphipods have been captured using baited traps. In this study, we sequenced the transcriptome of the amphipod Hirondellea gigas collected at a depth of 10,929 m from the East Pond of the Challenger Deep. Assembly of these sequences resulted in 133,041 contigs and 22,046 translated proteins. Functional annotation of these contigs was made using the go and kegg databases. Comparison of these translated proteins with those of four shallow-water amphipods revealed 10,731 gene families, of which 5659 were single-copy orthologs. Base substitution analysis on these single-copy orthologs showed that 62 genes are positively selected in H. gigas, including genes related to β-alanine biosynthesis, energy metabolism and genetic information processing. For multiple-copy orthologous genes, gene family expansion analysis revealed that cold-inducible proteins (i.e., transcription factors II A and transcription elongation factor 1) as well as zinc finger domains are expanded in H. gigas. Overall, our results indicate that genetic adaptation to the hadal environment by H. gigas may be mediated by both gene family expansion and amino acid substitutions of specific proteins. © 2017 John Wiley & Sons Ltd.

  18. Arabidopsis Gene Family Profiler (aGFP)--user-oriented transcriptomic database with easy-to-use graphic interface.

    PubMed

    Dupl'áková, Nikoleta; Renák, David; Hovanec, Patrik; Honysová, Barbora; Twell, David; Honys, David

    2007-07-23

    Microarray technologies now belong to the standard functional genomics toolbox and have undergone massive development leading to increased genome coverage, accuracy and reliability. The number of experiments exploiting microarray technology has markedly increased in recent years. In parallel with the rapid accumulation of transcriptomic data, on-line analysis tools are being introduced to simplify their use. Global statistical data analysis methods contribute to the development of overall concepts about gene expression patterns and to query and compose working hypotheses. More recently, these applications are being supplemented with more specialized products offering visualization and specific data mining tools. We present a curated gene family-oriented gene expression database, Arabidopsis Gene Family Profiler (aGFP; http://agfp.ueb.cas.cz), which gives the user access to a large collection of normalised Affymetrix ATH1 microarray datasets. The database currently contains NASC Array and AtGenExpress transcriptomic datasets for various tissues at different developmental stages of wild type plants gathered from nearly 350 gene chips. The Arabidopsis GFP database has been designed as an easy-to-use tool for users needing an easily accessible resource for expression data of single genes, pre-defined gene families or custom gene sets, with the further possibility of keyword search. Arabidopsis Gene Family Profiler presents a user-friendly web interface using both graphic and text output. Data are stored at the MySQL server and individual queries are created in PHP script. The most distinguishable features of Arabidopsis Gene Family Profiler database are: 1) the presentation of normalized datasets (Affymetrix MAS algorithm and calculation of model-based gene-expression values based on the Perfect Match-only model); 2) the choice between two different normalization algorithms (Affymetrix MAS4 or MAS5 algorithms); 3) an intuitive interface; 4) an interactive "virtual plant" visualizing the spatial and developmental expression profiles of both gene families and individual genes. Arabidopsis GFP gives users the possibility to analyze current Arabidopsis developmental transcriptomic data starting with simple global queries that can be expanded and further refined to visualize comparative and highly selective gene expression profiles.

  19. Transcriptome-Wide Identification of Reference Genes for Expression Analysis of Soybean Responses to Drought Stress along the Day.

    PubMed

    Marcolino-Gomes, Juliana; Rodrigues, Fabiana Aparecida; Fuganti-Pagliarini, Renata; Nakayama, Thiago Jonas; Ribeiro Reis, Rafaela; Bouças Farias, Jose Renato; Harmon, Frank G; Correa Molinari, Hugo Bruno; Correa Molinari, Mayla Daiane; Nepomuceno, Alexandre

    2015-01-01

    The soybean transcriptome displays strong variation along the day in optimal growth conditions and also in response to adverse circumstances, like drought stress. However, no study conducted to date has presented suitable reference genes, with stable expression along the day, for relative gene expression quantification in combined studies on drought stress and diurnal oscillations. Recently, water deficit responses have been associated with circadian clock oscillations at the transcription level, revealing the existence of hitherto unknown processes and increasing the demand for studies on plant responses to drought stress and its oscillation during the day. We performed data mining from a transcriptome-wide background using microarrays and RNA-seq databases to select an unpublished set of candidate reference genes, specifically chosen for the normalization of gene expression in studies on soybean under both drought stress and diurnal oscillations. Experimental validation and stability analysis in soybean plants submitted to drought stress and sampled during a 24 h timecourse showed that four of these newer reference genes (FYVE, NUDIX, Golgin-84 and CYST) indeed exhibited greater expression stability than the conventionally used housekeeping genes (ELF1-β and β-actin) under these conditions. We also demonstrated the effect of using reference candidate genes with different stability values to normalize the relative expression data from a drought-inducible soybean gene (DREB5) evaluated in different periods of the day.

  20. Small RNA and transcriptome deep sequencing proffers insight into floral gene regulation in Rosa cultivars

    PubMed Central

    2012-01-01

    Background Roses (Rosa sp.), which belong to the family Rosaceae, are the most economically important ornamental plants—making up 30% of the floriculture market. However, given high demand for roses, rose breeding programs are limited in molecular resources which can greatly enhance and speed breeding efforts. A better understanding of important genes that contribute to important floral development and desired phenotypes will lead to improved rose cultivars. For this study, we analyzed rose miRNAs and the rose flower transcriptome in order to generate a database to expound upon current knowledge regarding regulation of important floral characteristics. A rose genetic database will enable comprehensive analysis of gene expression and regulation via miRNA among different Rosa cultivars. Results We produced more than 0.5 million reads from expressed sequences, totalling more than 110 million bp. From these, we generated 35,657, 31,434, 34,725, and 39,722 flower unigenes from Rosa hybrid: ‘Vital’, ‘Maroussia’, and ‘Sympathy’ and Rosa rugosa Thunb. , respectively. The unigenes were assigned functional annotations, domains, metabolic pathways, Gene Ontology (GO) terms, Plant Ontology (PO) terms, and MIPS Functional Catalogue (FunCat) terms. Rose flower transcripts were compared with genes from whole genome sequences of Rosaceae members (apple, strawberry, and peach) and grape. We also produced approximately 40 million small RNA reads from flower tissue for Rosa, representing 267 unique miRNA tags. Among identified miRNAs, 25 of them were novel and 242 of them were conserved miRNAs. Statistical analyses of miRNA profiles revealed both shared and species-specific miRNAs, which presumably effect flower development and phenotypes. Conclusions In this study, we constructed a Rose miRNA and transcriptome database, and we analyzed the miRNAs and transcriptome generated from the flower tissues of four Rosa cultivars. The database provides a comprehensive genetic resource which can be used to better understand rose flower development and to identify candidate genes for important phenotypes. PMID:23171001

  1. Small RNA and transcriptome deep sequencing proffers insight into floral gene regulation in Rosa cultivars.

    PubMed

    Kim, Jungeun; Park, June Hyun; Lim, Chan Ju; Lim, Jae Yun; Ryu, Jee-Youn; Lee, Bong-Woo; Choi, Jae-Pil; Kim, Woong Bom; Lee, Ha Yeon; Choi, Yourim; Kim, Donghyun; Hur, Cheol-Goo; Kim, Sukweon; Noh, Yoo-Sun; Shin, Chanseok; Kwon, Suk-Yoon

    2012-11-21

    Roses (Rosa sp.), which belong to the family Rosaceae, are the most economically important ornamental plants--making up 30% of the floriculture market. However, given high demand for roses, rose breeding programs are limited in molecular resources which can greatly enhance and speed breeding efforts. A better understanding of important genes that contribute to important floral development and desired phenotypes will lead to improved rose cultivars. For this study, we analyzed rose miRNAs and the rose flower transcriptome in order to generate a database to expound upon current knowledge regarding regulation of important floral characteristics. A rose genetic database will enable comprehensive analysis of gene expression and regulation via miRNA among different Rosa cultivars. We produced more than 0.5 million reads from expressed sequences, totalling more than 110 million bp. From these, we generated 35,657, 31,434, 34,725, and 39,722 flower unigenes from Rosa hybrid: 'Vital', 'Maroussia', and 'Sympathy' and Rosa rugosa Thunb., respectively. The unigenes were assigned functional annotations, domains, metabolic pathways, Gene Ontology (GO) terms, Plant Ontology (PO) terms, and MIPS Functional Catalogue (FunCat) terms. Rose flower transcripts were compared with genes from whole genome sequences of Rosaceae members (apple, strawberry, and peach) and grape. We also produced approximately 40 million small RNA reads from flower tissue for Rosa, representing 267 unique miRNA tags. Among identified miRNAs, 25 of them were novel and 242 of them were conserved miRNAs. Statistical analyses of miRNA profiles revealed both shared and species-specific miRNAs, which presumably effect flower development and phenotypes. In this study, we constructed a Rose miRNA and transcriptome database, and we analyzed the miRNAs and transcriptome generated from the flower tissues of four Rosa cultivars. The database provides a comprehensive genetic resource which can be used to better understand rose flower development and to identify candidate genes for important phenotypes.

  2. Transcriptome Analysis in Sheepgrass (Leymus chinensis). A Dominant Perennial Grass of the Eurasian Steppe

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Chen, Shuangyan; Huang, Xin; Yang, Xiaohan

    BACKGROUND: Sheepgrass [Leymus chinensis (Trin.) Tzvel.] is an important perennial forage grass across the Eurasian Steppe and is known for its adaptability to various environmental conditions. However, insufficient data resources in public databases for sheepgrass limited our understanding of the mechanism of environmental adaptations, gene discovery and molecular marker development. RESULTS: The transcriptome of sheepgrass was sequenced using Roche 454 pyrosequencing technology. We assembled 952,328 high-quality reads into 87,214 unigenes, including 32,416 contigs and 54,798 singletons. There were 15,450 contigs over 500 bp in length. BLAST searches of our database against Swiss-Prot and NCBI non-redundant protein sequences (nr) databases resultedmore » in the annotation of 54,584 (62.6%) of the unigenes. Gene Ontology (GO) analysis assigned 89,129 GO term annotations for 17,463 unigenes. We identified 11,675 core Poaceae-specific and 12,811 putative sheepgrass-specific unigenes by BLAST searches against all plant genome and transcriptome databases. A total of 2,979 specific freezing-responsive unigenes were found from this RNAseq dataset. We identified 3,818 EST-SSRs in 3,597 unigenes, and some SSRs contained unigenes that were also candidates for freezing-response genes. Characterizations of nucleotide repeats and dominant motifs of SSRs in sheepgrass were also performed. Similarity and phylogenetic analysis indicated that sheepgrass is closely related to barley and wheat. CONCLUSIONS: This research has greatly enriched sheepgrass transcriptome resources. The identified stress-related genes will help us to decipher the genetic basis of the environmental and ecological adaptations of this species and will be used to improve wheat and barley crops through hybridization or genetic transformation. The EST-SSRs reported here will be a valuable resource for future gene-phenotype studies and for the molecular breeding of sheepgrass and other Poaceae species.« less

  3. Deep insight into the Ganoderma lucidum by comprehensive analysis of its transcriptome.

    PubMed

    Yu, Guo-Jun; Wang, Man; Huang, Jie; Yin, Ya-Lin; Chen, Yi-Jie; Jiang, Shuai; Jin, Yan-Xia; Lan, Xian-Qing; Wong, Barry Hon Cheung; Liang, Yi; Sun, Hui

    2012-01-01

    Ganoderma lucidum is a basidiomycete white rot fungus and is of medicinal importance in China, Japan and other countries in the Asiatic region. To date, much research has been performed in identifying the medicinal ingredients in Ganoderma lucidum. Despite its important therapeutic effects in disease, little is known about Ganoderma lucidum at the genomic level. In order to gain a molecular understanding of this fungus, we utilized Illumina high-throughput technology to sequence and analyze the transcriptome of Ganoderma lucidum. We obtained 6,439,690 and 6,416,670 high-quality reads from the mycelium and fruiting body of Ganoderma lucidum, and these were assembled to form 18,892 and 27,408 unigenes, respectively. A similarity search was performed against the NCBI non-redundant nucleotide database and a customized database composed of five fungal genomes. 11,098 and 8, 775 unigenes were matched to the NCBI non-redundant nucleotide database and our customized database, respectively. All unigenes were subjected to annotation by Gene Ontology, Eukaryotic Orthologous Group terms and Kyoto Encyclopedia of Genes and Genomes. Differentially expressed genes from the Ganoderma lucidum mycelium and fruiting body stage were analyzed, resulting in the identification of 13 unigenes which are involved in the terpenoid backbone biosynthesis pathway. Quantitative real-time PCR was used to confirm the expression levels of these unigenes. Ganoderma lucidum was also studied for wood degrading activity and a total of 22 putative FOLymes (fungal oxidative lignin enzymes) and 120 CAZymes (carbohydrate-active enzymes) were predicted from our Ganoderma lucidum transcriptome. Our study provides comprehensive gene expression information on Ganoderma lucidum at the transcriptional level, which will form the foundation for functional genomics studies in this fungus. The use of Illumina sequencing technology has made de novo transcriptome assembly and gene expression analysis possible in species that lack full genome information.

  4. Deep Insight into the Ganoderma lucidum by Comprehensive Analysis of Its Transcriptome

    PubMed Central

    Yu, Guo-Jun; Wang, Man; Huang, Jie; Yin, Ya-Lin; Chen, Yi-Jie; Jiang, Shuai; Jin, Yan-Xia; Lan, Xian-Qing; Wong, Barry Hon Cheung; Liang, Yi; Sun, Hui

    2012-01-01

    Background Ganoderma lucidum is a basidiomycete white rot fungus and is of medicinal importance in China, Japan and other countries in the Asiatic region. To date, much research has been performed in identifying the medicinal ingredients in Ganoderma lucidum. Despite its important therapeutic effects in disease, little is known about Ganoderma lucidum at the genomic level. In order to gain a molecular understanding of this fungus, we utilized Illumina high-throughput technology to sequence and analyze the transcriptome of Ganoderma lucidum. Methodology/Principal Findings We obtained 6,439,690 and 6,416,670 high-quality reads from the mycelium and fruiting body of Ganoderma lucidum, and these were assembled to form 18,892 and 27,408 unigenes, respectively. A similarity search was performed against the NCBI non-redundant nucleotide database and a customized database composed of five fungal genomes. 11,098 and 8, 775 unigenes were matched to the NCBI non-redundant nucleotide database and our customized database, respectively. All unigenes were subjected to annotation by Gene Ontology, Eukaryotic Orthologous Group terms and Kyoto Encyclopedia of Genes and Genomes. Differentially expressed genes from the Ganoderma lucidum mycelium and fruiting body stage were analyzed, resulting in the identification of 13 unigenes which are involved in the terpenoid backbone biosynthesis pathway. Quantitative real-time PCR was used to confirm the expression levels of these unigenes. Ganoderma lucidum was also studied for wood degrading activity and a total of 22 putative FOLymes (fungal oxidative lignin enzymes) and 120 CAZymes (carbohydrate-active enzymes) were predicted from our Ganoderma lucidum transcriptome. Conclusions Our study provides comprehensive gene expression information on Ganoderma lucidum at the transcriptional level, which will form the foundation for functional genomics studies in this fungus. The use of Illumina sequencing technology has made de novo transcriptome assembly and gene expression analysis possible in species that lack full genome information. PMID:22952861

  5. Insulin immuno-neutralization in fed chickens: effects on liver and muscle transcriptome.

    PubMed

    Simon, Jean; Milenkovic, Dragan; Godet, Estelle; Cabau, Cedric; Collin, Anne; Métayer-Coustard, Sonia; Rideau, Nicole; Tesseraud, Sophie; Derouet, Michel; Crochet, Sabine; Cailleau-Audouin, Estelle; Hennequet-Antier, Christelle; Gespach, Christian; Porter, Tom E; Duclos, Michel J; Dupont, Joëlle; Cogburn, Larry A

    2012-03-01

    Chickens mimic an insulin-resistance state by exhibiting several peculiarities with regard to plasma glucose level and its control by insulin. To gain insight into the role of insulin in the control of chicken transcriptome, liver and leg muscle transcriptomes were compared in fed controls and "diabetic" chickens, at 5 h after insulin immuno-neutralization, using 20.7K-chicken oligo-microarrays. At a level of false discovery rate <0.01, 1,573 and 1,225 signals were significantly modified by insulin privation in liver and muscle, respectively. Microarray data agreed reasonably well with qRT-PCR and some protein level measurements. Differentially expressed mRNAs with human ID were classified using Biorag analysis and Ingenuity Pathway Analysis. Multiple metabolic pathways, structural proteins, transporters and proteins of intracellular trafficking, major signaling pathways, and elements of the transcriptional control machinery were largely represented in both tissues. At least 42 mRNAs have already been associated with diabetes, insulin resistance, obesity, energy expenditure, or identified as sensors of metabolism in mice or humans. The contribution of the pathways presently identified to chicken physiology (particularly those not yet related to insulin) needs to be evaluated in future studies. Other challenges include the characterization of "unknown" mRNAs and the identification of the steps or networks, which disturbed tissue transcriptome so extensively, quickly after the turning off of the insulin signal. In conclusion, pleiotropic effects of insulin in chickens are further evidenced; major pathways controlled by insulin in mammals have been conserved despite the presence of unique features of insulin signaling in chicken muscle.

  6. Proteomic profiling of developing cotton fibers from wild and domesticated Gossypium barbadense.

    PubMed

    Hu, Guanjing; Koh, Jin; Yoo, Mi-Jeong; Grupp, Kara; Chen, Sixue; Wendel, Jonathan F

    2013-10-01

    Pima cotton (Gossypium barbadense) is widely cultivated because of its long, strong seed trichomes ('fibers') used for premium textiles. These agronomically advanced fibers were derived following domestication and thousands of years of human-mediated crop improvement. To gain an insight into fiber development and evolution, we conducted comparative proteomic and transcriptomic profiling of developing fiber from an elite cultivar and a wild accession. Analyses using isobaric tag for relative and absolute quantification (iTRAQ) LC-MS/MS technology identified 1317 proteins in fiber. Of these, 205 were differentially expressed across developmental stages, and 190 showed differential expression between wild and cultivated forms, 14.4% of the proteome sampled. Human selection may have shifted the timing of developmental modules, such that some occur earlier in domesticated than in wild cotton. A novel approach was used to detect possible biased expression of homoeologous copies of proteins. Results indicate a significant partitioning of duplicate gene expression at the protein level, but an approximately equal degree of bias for each of the two constituent genomes of allopolyploid cotton. Our results demonstrate the power of complementary transcriptomic and proteomic approaches for the study of the domestication process. They also provide a rich database for mining for functional analyses of cotton improvement or evolution. © 2013 The Authors. New Phytologist © 2013 New Phytologist Trust.

  7. RNA-sequencing of the sturgeon Acipenser baeri provides insights into expression dynamics of morphogenic differentiation and developmental regulatory genes in early versus late developmental stages.

    PubMed

    Song, Wei; Jiang, Keji; Zhang, Fengying; Lin, Yu; Ma, Lingbo

    2016-08-08

    Acipenser baeri, one of the critically endangered animals on the verge of extinction, is a key species for evolutionary, developmental, physiology and conservation studies and a standout amongst the most important food products worldwide. Though the transcriptome of the early development of A. baeri has been published recently, the transcriptome changes occurring in the transition from embryonic to late stages are still unknown. The aim of this work was to analyze the transcriptomes of embryonic and post-embryonic stages of A. baeri and identify differentially expressed genes (DEGs) and their expression patterns using mRNA collected from specimens at big yolk plug, wide neural plate and 64 day old sturgeon developmental stages for RNA-Seq. The paired-end sequencing of the transcriptome of samples of A. baeri collected at two early (big yolk plug (T1, 32 h after fertilization) and wide neural plate formation (T2, 45 h after fertilization)) and one late (T22, 64 day old sturgeon) developmental stages using Illumina Hiseq2000 platform generated 64039846, 64635214 and 75293762 clean paired-end reads for T1, T2 and T22, respectively. After quality control, the sequencing reads were de novo assembled to generate a set of 149,265 unigenes with N50 value of 1277 bp. Functional annotation indicated that a substantial number of these unigenes had significant similarity with proteins in public databases. Differential expression profiling allowed the identification of 2789, 12,819 and 10,824 DEGs from the respective T1 vs. T2, T1 vs. T22 and T2 vs. T22 comparisons. High correlation of DEGs' features was recorded among early stages while significant divergences were observed when comparing the late stage with early stages. GO and KEGG enrichment analyses revealed the biological processes, cellular component, molecular functions and metabolic pathways associated with identified DEGs. The qRT-PCR performed for candidate genes in specimens confirmed the validity of the RNA-seq data. This study presents, for the first time, an extensive overview of RNA-Seq based characterization of the early and post-embryonic developmental transcriptomes of A. baeri and provided 149,265 gene sequences that will be potentially valuable for future molecular and genetic studies in A. baeri.

  8. The transcriptome of corona radiata cells from individual MІІ oocytes that after ICSI developed to embryos selected for transfer: PCOS women compared to healthy women.

    PubMed

    Wissing, Marie Louise; Sonne, Si Brask; Westergaard, David; Nguyen, Kho do; Belling, Kirstine; Høst, Thomas; Mikkelsen, Anne Lis

    2014-11-29

    Corona radiata cells (CRCs) refer to the fraction of cumulus cells just adjacent to the oocyte. The CRCs are closely connected to the oocyte throughout maturation and their gene expression profiles might reflect oocyte quality. Polycystic ovary syndrome (PCOS) is a common cause of infertility. It is controversial whether PCOS associate with diminished oocyte quality. The purpose of this study was to compare individual human CRC samples between PCOS patients and controls. All patients were stimulated by the long gonadotropin-releasing hormone (GnRH) agonist protocol. The CRC samples originated from individual oocytes developing into embryos selected for transfer. CRCs were isolated in a two-step denudation procedure, separating outer cumulus cells from the inner CRCs. Extracted RNA was amplified and transcriptome profiling was performed with Human Agilent® arrays. The transcriptomes of CRCs showed no individual genes with significant differential expression between PCOS and controls, but gene set enrichment analysis identified several cell cycle- and DNA replication pathways overexpressed in PCOS CRCs (FDR < 0.05). Five of the genes contributing to the up-regulated cell cycle pathways in the PCOS CRCs were selected for qRT-PCR validation in ten PCOS and ten control CRC samples. qRT-PCR confirmed significant up-regulation in PCOS CRCs of cell cycle progression genes HIST1H4C (FC = 2.7), UBE2C (FC = 2.6) and cell cycle related transcription factor E2F4 (FC = 2.5). The overexpression of cell cycle-related genes and cell cycle pathways in PCOS CRCs could indicate a disturbed or delayed final maturation and differentiation of the CRCs in response to the human chorionic gonadotropin (hCG) surge. However, this had no effect on the in vitro development of the corresponding embryos. Future studies are needed to clarify whether the up-regulated cell cycle pathways in PCOS CRCs have any clinical implications.

  9. The PBII gene of the human salivary proline-rich protein P-B produces another protein, Q504X8, with an opiorphin homolog, QRGPR.

    PubMed

    Saitoh, Eiichi; Sega, Takuya; Imai, Akane; Isemura, Satoko; Kato, Tetsuo; Ochiai, Akihito; Taniguchi, Masayuki

    2018-04-01

    The NCBI gene database and human-transcriptome database for alternative splicing were used to determine the expression of mRNAs for P-B (SMR3B) and variant form of P-B. The translational product from the former mRNA was identified as the protein named P-B, whereas that from the latter has not yet been elucidated. In the present study, we investigated the expression of P-B and its variant form at the protein level. To identify the variant protein of P-B, (1) cationic proteins with a higher isoelectric point in human pooled whole saliva were purified by a two dimensional liquid chromatography; (2) the peptide fragments generated from the in-solution of all proteins digested with trypsin separated and analyzed by MALDI-TOF-MS; and (3) the presence or absence of P-B in individual saliva was examined by 15% SDS-PAGE. The peptide sequences (I 37 PPPYSCTPNMNNCSR 52 , C 53 HHHHKRHHYPCNYCFCYPK 72 , R 59 HHYPCNYCFCYPK 72 and H 60 HYPCNYCFCYPK 72 ) present in the variant protein of P-B were identified. The peptide sequence (G 6 PYPPGPLAPPQPFGPGFVPPPPPPPYGPGR 36 ) in P-B (or the variant) and sequence (I 37 PPPPPAPYGPGIFPPPPPQP 57 ) in P-B were identified. The sum of the sequences identified indicated a 91.23% sequence identity for P-B and 79.76% for the variant. There were cases in which P-B existed in individual saliva, but there were cases in which it did not exist in individual saliva. The variant protein is produced by excising a non-canonical intron (CC-AC pair) from the 3'-noncoding sequence of the PBII gene. Both P-B and the variant are subject to proteolysis in the oral cavity. Copyright © 2018 Elsevier Ltd. All rights reserved.

  10. Transcriptome of interstitial cells of Cajal reveals unique and selective gene signatures

    PubMed Central

    Park, Paul J.; Fuchs, Robert; Wei, Lai; Jorgensen, Brian G.; Redelman, Doug; Ward, Sean M.; Sanders, Kenton M.

    2017-01-01

    Transcriptome-scale data can reveal essential clues into understanding the underlying molecular mechanisms behind specific cellular functions and biological processes. Transcriptomics is a continually growing field of research utilized in biomarker discovery. The transcriptomic profile of interstitial cells of Cajal (ICC), which serve as slow-wave electrical pacemakers for gastrointestinal (GI) smooth muscle, has yet to be uncovered. Using copGFP-labeled ICC mice and flow cytometry, we isolated ICC populations from the murine small intestine and colon and obtained their transcriptomes. In analyzing the transcriptome, we identified a unique set of ICC-restricted markers including transcription factors, epigenetic enzymes/regulators, growth factors, receptors, protein kinases/phosphatases, and ion channels/transporters. This analysis provides new and unique insights into the cellular and biological functions of ICC in GI physiology. Additionally, we constructed an interactive ICC genome browser (http://med.unr.edu/physio/transcriptome) based on the UCSC genome database. To our knowledge, this is the first online resource that provides a comprehensive library of all known genetic transcripts expressed in primary ICC. Our genome browser offers a new perspective into the alternative expression of genes in ICC and provides a valuable reference for future functional studies. PMID:28426719

  11. Global Analysis of Transcriptome Responses and Gene Expression Profiles to Cold Stress of Jatropha curcas L.

    PubMed Central

    Wang, Haibo; Zou, Zhurong; Wang, Shasha; Gong, Ming

    2013-01-01

    Background Jatropha curcas L., also called the Physic nut, is an oil-rich shrub with multiple uses, including biodiesel production, and is currently exploited as a renewable energy resource in many countries. Nevertheless, because of its origin from the tropical MidAmerican zone, J. curcas confers an inherent but undesirable characteristic (low cold resistance) that may seriously restrict its large-scale popularization. This adaptive flaw can be genetically improved by elucidating the mechanisms underlying plant tolerance to cold temperatures. The newly developed Illumina Hiseq™ 2000 RNA-seq and Digital Gene Expression (DGE) are deep high-throughput approaches for gene expression analysis at the transcriptome level, using which we carefully investigated the gene expression profiles in response to cold stress to gain insight into the molecular mechanisms of cold response in J. curcas. Results In total, 45,251 unigenes were obtained by assembly of clean data generated by RNA-seq analysis of the J. curcas transcriptome. A total of 33,363 and 912 complete or partial coding sequences (CDSs) were determined by protein database alignments and ESTScan prediction, respectively. Among these unigenes, more than 41.52% were involved in approximately 128 known metabolic or signaling pathways, and 4,185 were possibly associated with cold resistance. DGE analysis was used to assess the changes in gene expression when exposed to cold condition (12°C) for 12, 24, and 48 h. The results showed that 3,178 genes were significantly upregulated and 1,244 were downregulated under cold stress. These genes were then functionally annotated based on the transcriptome data from RNA-seq analysis. Conclusions This study provides a global view of transcriptome response and gene expression profiling of J. curcas in response to cold stress. The results can help improve our current understanding of the mechanisms underlying plant cold resistance and favor the screening of crucial genes for genetically enhancing cold resistance in J. curcas. PMID:24349370

  12. Global analysis of transcriptome responses and gene expression profiles to cold stress of Jatropha curcas L.

    PubMed

    Wang, Haibo; Zou, Zhurong; Wang, Shasha; Gong, Ming

    2013-01-01

    Jatropha curcas L., also called the Physic nut, is an oil-rich shrub with multiple uses, including biodiesel production, and is currently exploited as a renewable energy resource in many countries. Nevertheless, because of its origin from the tropical MidAmerican zone, J. curcas confers an inherent but undesirable characteristic (low cold resistance) that may seriously restrict its large-scale popularization. This adaptive flaw can be genetically improved by elucidating the mechanisms underlying plant tolerance to cold temperatures. The newly developed Illumina Hiseq™ 2000 RNA-seq and Digital Gene Expression (DGE) are deep high-throughput approaches for gene expression analysis at the transcriptome level, using which we carefully investigated the gene expression profiles in response to cold stress to gain insight into the molecular mechanisms of cold response in J. curcas. In total, 45,251 unigenes were obtained by assembly of clean data generated by RNA-seq analysis of the J. curcas transcriptome. A total of 33,363 and 912 complete or partial coding sequences (CDSs) were determined by protein database alignments and ESTScan prediction, respectively. Among these unigenes, more than 41.52% were involved in approximately 128 known metabolic or signaling pathways, and 4,185 were possibly associated with cold resistance. DGE analysis was used to assess the changes in gene expression when exposed to cold condition (12°C) for 12, 24, and 48 h. The results showed that 3,178 genes were significantly upregulated and 1,244 were downregulated under cold stress. These genes were then functionally annotated based on the transcriptome data from RNA-seq analysis. This study provides a global view of transcriptome response and gene expression profiling of J. curcas in response to cold stress. The results can help improve our current understanding of the mechanisms underlying plant cold resistance and favor the screening of crucial genes for genetically enhancing cold resistance in J. curcas.

  13. ELABELA Is an Endogenous Growth Factor that Sustains hESC Self-Renewal via the PI3K/AKT Pathway.

    PubMed

    Ho, Lena; Tan, Shawn Y X; Wee, Sheena; Wu, Yixuan; Tan, Sam J C; Ramakrishna, Navin B; Chng, Serene C; Nama, Srikanth; Szczerbinska, Iwona; Sczerbinska, Iwona; Chan, Yun-Shen; Avery, Stuart; Tsuneyoshi, Norihiro; Ng, Huck Hui; Gunaratne, Jayantha; Dunn, N Ray; Reversade, Bruno

    2015-10-01

    ELABELA (ELA) is a peptide hormone required for heart development that signals via the Apelin Receptor (APLNR, APJ). ELA is also abundantly secreted by human embryonic stem cells (hESCs), which do not express APLNR. Here we show that ELA signals in a paracrine fashion in hESCs to maintain self-renewal. ELA inhibition by CRISPR/Cas9-mediated deletion, shRNA, or neutralizing antibodies causes reduced hESC growth, cell death, and loss of pluripotency. Global phosphoproteomic and transcriptomic analyses of ELA-pulsed hESCs show that it activates PI3K/AKT/mTORC1 signaling required for cell survival. ELA promotes hESC cell-cycle progression and protein translation and blocks stress-induced apoptosis. INSULIN and ELA have partially overlapping functions in hESC medium, but only ELA can potentiate the TGFβ pathway to prime hESCs toward the endoderm lineage. We propose that ELA, acting through an alternate cell-surface receptor, is an endogenous secreted growth factor in human embryos and hESCs that promotes growth and pluripotency. Copyright © 2015 Elsevier Inc. All rights reserved.

  14. A Bioinformatics Approach for Integrated Transcriptomic and Proteomic Comparative Analyses of Model and Non-sequenced Anopheline Vectors of Human Malaria Parasites*

    PubMed Central

    Mohien, Ceereena Ubaida; Colquhoun, David R.; Mathias, Derrick K.; Gibbons, John G.; Armistead, Jennifer S.; Rodriguez, Maria C.; Rodriguez, Mario Henry; Edwards, Nathan J.; Hartler, Jürgen; Thallinger, Gerhard G.; Graham, David R.; Martinez-Barnetche, Jesus; Rokas, Antonis; Dinglasan, Rhoel R.

    2013-01-01

    Malaria morbidity and mortality caused by both Plasmodium falciparum and Plasmodium vivax extend well beyond the African continent, and although P. vivax causes between 80 and 300 million severe cases each year, vivax transmission remains poorly understood. Plasmodium parasites are transmitted by Anopheles mosquitoes, and the critical site of interaction between parasite and host is at the mosquito's luminal midgut brush border. Although the genome of the “model” African P. falciparum vector, Anopheles gambiae, has been sequenced, evolutionary divergence limits its utility as a reference across anophelines, especially non-sequenced P. vivax vectors such as Anopheles albimanus. Clearly, technologies and platforms that bridge this substantial scientific gap are required in order to provide public health scientists with key transcriptomic and proteomic information that could spur the development of novel interventions to combat this disease. To our knowledge, no approaches have been published that address this issue. To bolster our understanding of P. vivax–An. albimanus midgut interactions, we developed an integrated bioinformatic-hybrid RNA-Seq-LC-MS/MS approach involving An. albimanus transcriptome (15,764 contigs) and luminal midgut subproteome (9,445 proteins) assembly, which, when used with our custom Diptera protein database (685,078 sequences), facilitated a comparative proteomic analysis of the midgut brush borders of two important malaria vectors, An. gambiae and An. albimanus. PMID:23082028

  15. A bioinformatics approach for integrated transcriptomic and proteomic comparative analyses of model and non-sequenced anopheline vectors of human malaria parasites.

    PubMed

    Ubaida Mohien, Ceereena; Colquhoun, David R; Mathias, Derrick K; Gibbons, John G; Armistead, Jennifer S; Rodriguez, Maria C; Rodriguez, Mario Henry; Edwards, Nathan J; Hartler, Jürgen; Thallinger, Gerhard G; Graham, David R; Martinez-Barnetche, Jesus; Rokas, Antonis; Dinglasan, Rhoel R

    2013-01-01

    Malaria morbidity and mortality caused by both Plasmodium falciparum and Plasmodium vivax extend well beyond the African continent, and although P. vivax causes between 80 and 300 million severe cases each year, vivax transmission remains poorly understood. Plasmodium parasites are transmitted by Anopheles mosquitoes, and the critical site of interaction between parasite and host is at the mosquito's luminal midgut brush border. Although the genome of the "model" African P. falciparum vector, Anopheles gambiae, has been sequenced, evolutionary divergence limits its utility as a reference across anophelines, especially non-sequenced P. vivax vectors such as Anopheles albimanus. Clearly, technologies and platforms that bridge this substantial scientific gap are required in order to provide public health scientists with key transcriptomic and proteomic information that could spur the development of novel interventions to combat this disease. To our knowledge, no approaches have been published that address this issue. To bolster our understanding of P. vivax-An. albimanus midgut interactions, we developed an integrated bioinformatic-hybrid RNA-Seq-LC-MS/MS approach involving An. albimanus transcriptome (15,764 contigs) and luminal midgut subproteome (9,445 proteins) assembly, which, when used with our custom Diptera protein database (685,078 sequences), facilitated a comparative proteomic analysis of the midgut brush borders of two important malaria vectors, An. gambiae and An. albimanus.

  16. The salivary gland transcriptome of the eastern tree hole mosquito, Ochlerotatus triseriatus.

    PubMed

    Calvo, Eric; Sanchez-Vargas, Irma; Kotsyfakis, Michalis; Favreau, Amanda J; Barbian, Kent D; Pham, Van M; Olson, Kenneth E; Ribeiro, José M C

    2010-05-01

    Saliva of blood-sucking arthropods contains a complex mixture of peptides that affect their host's hemostasis, inflammation, and immunity. These activities can also modify the site of pathogen delivery and increase disease transmission. Saliva also induces hosts to mount an antisaliva immune response that can lead to skin allergies or even anaphylaxis. Accordingly, knowledge of the salivary repertoire, or sialome, of a mosquito is useful to provide a knowledge platform to mine for novel pharmacological activities, to develop novel vaccine targets for vector-borne diseases, and to develop epidemiological markers of vector exposure and candidate desensitization vaccines. The mosquito Ochlerotatus triseriatus is a vector of La Crosse virus and produces allergy in humans. In this work, a total of 1,575 clones randomly selected from an adult female O. triseriatus salivary gland cDNA library was sequenced and used to assemble a database that yielded 731 clusters of related sequences, 560 of which were singletons. Primer extension experiments were performed in selected clones to further extend sequence coverage, allowing for the identification of 159 protein sequences, 66 of which code for putative secreted proteins. Supplemental spreadsheets containing these data are available at http://exon.niaid.nih.gov/transcriptome/Ochlerotatus_triseriatus/S1/Ot-S1.xls and http://exon.niaid. nih.gov/transcriptome/Ochlerotatus_triseriatus/S2/Ot-S2.xls.

  17. Physiology of Pseudomonas aeruginosa in biofilms as revealed by transcriptome analysis

    PubMed Central

    2010-01-01

    Background Transcriptome analysis was applied to characterize the physiological activities of Pseudomonas aeruginosa grown for three days in drip-flow biofilm reactors. Conventional applications of transcriptional profiling often compare two paired data sets that differ in a single experimentally controlled variable. In contrast this study obtained the transcriptome of a single biofilm state, ranked transcript signals to make the priorities of the population manifest, and compared ranki ngs for a priori identified physiological marker genes between the biofilm and published data sets. Results Biofilms tolerated exposure to antibiotics, harbored steep oxygen concentration gradients, and exhibited stratified and heterogeneous spatial patterns of protein synthetic activity. Transcriptional profiling was performed and the signal intensity of each transcript was ranked to gain insight into the physiological state of the biofilm population. Similar rankings were obtained from data sets published in the GEO database http://www.ncbi.nlm.nih.gov/geo. By comparing the rank of genes selected as markers for particular physiological activities between the biofilm and comparator data sets, it was possible to infer qualitative features of the physiological state of the biofilm bacteria. These biofilms appeared, from their transcriptome, to be glucose nourished, iron replete, oxygen limited, and growing slowly or exhibiting stationary phase character. Genes associated with elaboration of type IV pili were strongly expressed in the biofilm. The biofilm population did not indicate oxidative stress, homoserine lactone mediated quorum sensing, or activation of efflux pumps. Using correlations with transcript ranks, the average specific growth rate of biofilm cells was estimated to be 0.08 h-1. Conclusions Collectively these data underscore the oxygen-limited, slow-growing nature of the biofilm population and are consistent with antimicrobial tolerance due to low metabolic activity. PMID:21083928

  18. Transcriptome analysis and identification of induced genes in the response of Harmonia axyridis to cold hardiness.

    PubMed

    Tang, Bin; Liu, Xiao-Jun; Shi, Zuo-Kun; Shen, Qi-Da; Xu, Yan-Xia; Wang, Su; Zhang, Fan; Wang, Shi-Gui

    2017-06-01

    Harmonia axyridis is an important predatory lady beetle that is a natural enemy of agricultural and forestry pests. In this research, the cold hardiness induced genes and their expression changes in H. axyridis were screened and detected by the way of the transcriptome and qualitative real-time PCR under normal and low temperatures, using high-throughput transcriptome and digital gene-expression-tag technologies. We obtained a 10Gb transcriptome and an 8Mb gene expression tag pool using Illumina deep sequencing technology and RNA-Seq analysis (accession number SRX540102). Of the 46,980 non-redundant unigenes identified, 28,037 (59.7%) were matched to known genes in GenBank, 21,604 (46.0%) in Swiss-Prot, 19,482 (41.5%) in Kyoto Encyclopedia of Genes and Genomes and 13,193 (28.1%) in Gene Ontology databases. Seventy-five percent of the unigene sequences had top matches with gene sequences from Tribolium castaneum. Results indicated that 60 genes regulated the entire cold-acclimation response, and, of these, seven genes were always up-regulated and five genes always down-regulated. Further screening revealed that six cold-resistant genes, E3 ubiquitin-protein ligase, transketolase, trehalase, serine/arginine repetitive matrix protein 2, glycerol kinase and sugar transporter SWEET1-like, play key roles in the response. Expression from a number of the differentially expressed genes was confirmed with quantitative real-time PCR (HaCS_Trans). The paper attempted to identify cold-resistance response genes, and study the potential mechanism by which cold acclimation enhances the insect's cold endurance. Information on these cold-resistance response genes will improve the development of low-temperature storage technology of natural enemy insects for future use in biological control. Copyright © 2017 Elsevier Inc. All rights reserved.

  19. Sugarcane giant borer transcriptome analysis and identification of genes related to digestion.

    PubMed

    Fonseca, Fernando Campos de Assis; Firmino, Alexandre Augusto Pereira; de Macedo, Leonardo Lima Pepino; Coelho, Roberta Ramos; de Souza Júnior, José Dijair Antonino; de Sousa Júnior, José Dijair Antonino; Silva-Junior, Orzenil Bonfim; Togawa, Roberto Coiti; Pappas, Georgios Joannis; de Góis, Luiz Avelar Brandão; da Silva, Maria Cristina Mattar; Grossi-de-Sá, Maria Fátima

    2015-01-01

    Sugarcane is a widely cultivated plant that serves primarily as a source of sugar and ethanol. Its annual yield can be significantly reduced by the action of several insect pests including the sugarcane giant borer (Telchin licus licus), a lepidopteran that presents a long life cycle and which efforts to control it using pesticides have been inefficient. Although its economical relevance, only a few DNA sequences are available for this species in the GenBank. Pyrosequencing technology was used to investigate the transcriptome of several developmental stages of the insect. To maximize transcript diversity, a pool of total RNA was extracted from whole body insects and used to construct a normalized cDNA database. Sequencing produced over 650,000 reads, which were de novo assembled to generate a reference library of 23,824 contigs. After quality score and annotation, 43% of the contigs had at least one BLAST hit against the NCBI non-redundant database, and 40% showed similarities with the lepidopteran Bombyx mori. In a further analysis, we conducted a comparison with Manduca sexta midgut sequences to identify transcripts of genes involved in digestion. Of these transcripts, many presented an expansion or depletion in gene number, compared to B. mori genome. From the sugarcane giant borer (SGB) transcriptome, a number of aminopeptidase N (APN) cDNAs were characterized based on homology to those reported as Cry toxin receptors. This is the first report that provides a large-scale EST database for the species. Transcriptome analysis will certainly be useful to identify novel developmental genes, to better understand the insect's biology and to guide the development of new strategies for insect-pest control.

  20. Mango (Mangifera indica L.) cv. Kent fruit mesocarp de novo transcriptome assembly identifies gene families important for ripening

    PubMed Central

    Dautt-Castro, Mitzuko; Ochoa-Leyva, Adrian; Contreras-Vergara, Carmen A.; Pacheco-Sanchez, Magda A.; Casas-Flores, Sergio; Sanchez-Flores, Alejandro; Kuhn, David N.; Islas-Osuna, Maria A.

    2015-01-01

    Fruit ripening is a physiological and biochemical process genetically programmed to regulate fruit quality parameters like firmness, flavor, odor and color, as well as production of ethylene in climacteric fruit. In this study, a transcriptomic analysis of mango (Mangifera indica L.) mesocarp cv. “Kent” was done to identify key genes associated with fruit ripening. Using the Illumina sequencing platform, 67,682,269 clean reads were obtained and a transcriptome of 4.8 Gb. A total of 33,142 coding sequences were predicted and after functional annotation, 25,154 protein sequences were assigned with a product according to Swiss-Prot database and 32,560 according to non-redundant database. Differential expression analysis identified 2,306 genes with significant differences in expression between mature-green and ripe mango [1,178 up-regulated and 1,128 down-regulated (FDR ≤ 0.05)]. The expression of 10 genes evaluated by both qRT-PCR and RNA-seq data was highly correlated (R = 0.97), validating the differential expression data from RNA-seq alone. Gene Ontology enrichment analysis, showed significantly represented terms associated to fruit ripening like “cell wall,” “carbohydrate catabolic process” and “starch and sucrose metabolic process” among others. Mango genes were assigned to 327 metabolic pathways according to Kyoto Encyclopedia of Genes and Genomes database, among them those involved in fruit ripening such as plant hormone signal transduction, starch and sucrose metabolism, galactose metabolism, terpenoid backbone, and carotenoid biosynthesis. This study provides a mango transcriptome that will be very helpful to identify genes for expression studies in early and late flowering mangos during fruit ripening. PMID:25741352

  1. Sugarcane Giant Borer Transcriptome Analysis and Identification of Genes Related to Digestion

    PubMed Central

    de Assis Fonseca, Fernando Campos; Firmino, Alexandre Augusto Pereira; de Macedo, Leonardo Lima Pepino; Coelho, Roberta Ramos; de Sousa Júnior, José Dijair Antonino; Silva-Junior, Orzenil Bonfim; Togawa, Roberto Coiti; Pappas, Georgios Joannis; de Góis, Luiz Avelar Brandão; da Silva, Maria Cristina Mattar; Grossi-de-Sá, Maria Fátima

    2015-01-01

    Sugarcane is a widely cultivated plant that serves primarily as a source of sugar and ethanol. Its annual yield can be significantly reduced by the action of several insect pests including the sugarcane giant borer (Telchin licus licus), a lepidopteran that presents a long life cycle and which efforts to control it using pesticides have been inefficient. Although its economical relevance, only a few DNA sequences are available for this species in the GenBank. Pyrosequencing technology was used to investigate the transcriptome of several developmental stages of the insect. To maximize transcript diversity, a pool of total RNA was extracted from whole body insects and used to construct a normalized cDNA database. Sequencing produced over 650,000 reads, which were de novo assembled to generate a reference library of 23,824 contigs. After quality score and annotation, 43% of the contigs had at least one BLAST hit against the NCBI non-redundant database, and 40% showed similarities with the lepidopteran Bombyx mori. In a further analysis, we conducted a comparison with Manduca sexta midgut sequences to identify transcripts of genes involved in digestion. Of these transcripts, many presented an expansion or depletion in gene number, compared to B. mori genome. From the sugarcane giant borer (SGB) transcriptome, a number of aminopeptidase N (APN) cDNAs were characterized based on homology to those reported as Cry toxin receptors. This is the first report that provides a large-scale EST database for the species. Transcriptome analysis will certainly be useful to identify novel developmental genes, to better understand the insect’s biology and to guide the development of new strategies for insect-pest control. PMID:25706301

  2. Mango (Mangifera indica L.) cv. Kent fruit mesocarp de novo transcriptome assembly identifies gene families important for ripening.

    PubMed

    Dautt-Castro, Mitzuko; Ochoa-Leyva, Adrian; Contreras-Vergara, Carmen A; Pacheco-Sanchez, Magda A; Casas-Flores, Sergio; Sanchez-Flores, Alejandro; Kuhn, David N; Islas-Osuna, Maria A

    2015-01-01

    Fruit ripening is a physiological and biochemical process genetically programmed to regulate fruit quality parameters like firmness, flavor, odor and color, as well as production of ethylene in climacteric fruit. In this study, a transcriptomic analysis of mango (Mangifera indica L.) mesocarp cv. "Kent" was done to identify key genes associated with fruit ripening. Using the Illumina sequencing platform, 67,682,269 clean reads were obtained and a transcriptome of 4.8 Gb. A total of 33,142 coding sequences were predicted and after functional annotation, 25,154 protein sequences were assigned with a product according to Swiss-Prot database and 32,560 according to non-redundant database. Differential expression analysis identified 2,306 genes with significant differences in expression between mature-green and ripe mango [1,178 up-regulated and 1,128 down-regulated (FDR ≤ 0.05)]. The expression of 10 genes evaluated by both qRT-PCR and RNA-seq data was highly correlated (R = 0.97), validating the differential expression data from RNA-seq alone. Gene Ontology enrichment analysis, showed significantly represented terms associated to fruit ripening like "cell wall," "carbohydrate catabolic process" and "starch and sucrose metabolic process" among others. Mango genes were assigned to 327 metabolic pathways according to Kyoto Encyclopedia of Genes and Genomes database, among them those involved in fruit ripening such as plant hormone signal transduction, starch and sucrose metabolism, galactose metabolism, terpenoid backbone, and carotenoid biosynthesis. This study provides a mango transcriptome that will be very helpful to identify genes for expression studies in early and late flowering mangos during fruit ripening.

  3. BISQUE: locus- and variant-specific conversion of genomic, transcriptomic and proteomic database identifiers.

    PubMed

    Meyer, Michael J; Geske, Philip; Yu, Haiyuan

    2016-05-15

    Biological sequence databases are integral to efforts to characterize and understand biological molecules and share biological data. However, when analyzing these data, scientists are often left holding disparate biological currency-molecular identifiers from different databases. For downstream applications that require converting the identifiers themselves, there are many resources available, but analyzing associated loci and variants can be cumbersome if data is not given in a form amenable to particular analyses. Here we present BISQUE, a web server and customizable command-line tool for converting molecular identifiers and their contained loci and variants between different database conventions. BISQUE uses a graph traversal algorithm to generalize the conversion process for residues in the human genome, genes, transcripts and proteins, allowing for conversion across classes of molecules and in all directions through an intuitive web interface and a URL-based web service. BISQUE is freely available via the web using any major web browser (http://bisque.yulab.org/). Source code is available in a public GitHub repository (https://github.com/hyulab/BISQUE). haiyuan.yu@cornell.edu Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

  4. Lactobacillus gasseri K7 modulates the blood cell transcriptome of conventional mice infected with Escherichia coli O157:H7.

    PubMed

    Sagaya, F M; Hacin, B; Tompa, G; Ihan, A; Špela, Š; Černe, M; Hurrell, R F; Matijašić, B B; Rogelj, I; Vergères, G

    2014-05-01

    As the immune cells underlying the intestinal barrier sense luminal microbial signals, blood cell transcriptomics may identify subclinical changes triggered by gut bacteria that may otherwise not be detected. We have therefore investigated how Lactobacillus gasseri K7 and enterohemorrhagic Escherichia coli O157:H7 modulate the blood cell transcriptome of mice possessing an intact microbiota. We have analysed the transcriptome of five groups of C57BL/6J mice: (i) control, (ii) inoculated with a single dose of E. coli, (iii) inoculated during 2 weeks with Lact. gasseri, (iv) co-inoculated with E. coli and Lact. gasseri, (v) inoculated with Lact. gasseri prior to E. coli infection. The transcriptome could distinguish between the five treatment groups. Gene characteristics of bacterial infection, in particular inflammation, were upregulated in the mice inoculated with E. coli. Lact. gasseri had only mild effects on the transcriptome but modified the gene expression induced by E. coli. The transcriptome differentiates mice inoculated orally with E. coli, Lact. gasseri and combinations of these two strains. These results suggest that the blood cell transcriptome can be used as a source of biomarkers to monitor the impact of probiotics in subclinical models of infectious disease. © 2014 The Society for Applied Microbiology.

  5. rnaQUAST: a quality assessment tool for de novo transcriptome assemblies.

    PubMed

    Bushmanova, Elena; Antipov, Dmitry; Lapidus, Alla; Suvorov, Vladimir; Prjibelski, Andrey D

    2016-07-15

    Ability to generate large RNA-Seq datasets created a demand for both de novo and reference-based transcriptome assemblers. However, while many transcriptome assemblers are now available, there is still no unified quality assessment tool for RNA-Seq assemblies. We present rnaQUAST-a tool for evaluating RNA-Seq assembly quality and benchmarking transcriptome assemblers using reference genome and gene database. rnaQUAST calculates various metrics that demonstrate completeness and correctness levels of the assembled transcripts, and outputs them in a user-friendly report. rnaQUAST is implemented in Python and is freely available at http://bioinf.spbau.ru/en/rnaquast ap@bioinf.spbau.ru Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

  6. Biosynthesis of the active compounds of Isatis indigotica based on transcriptome sequencing and metabolites profiling

    PubMed Central

    2013-01-01

    Backgroud Isatis indigotica is a widely used herb for the clinical treatment of colds, fever, and influenza in Traditional Chinese Medicine (TCM). Various structural classes of compounds have been identified as effective ingredients. However, little is known at genetics level about these active metabolites. In the present study, we performed de novo transcriptome sequencing for the first time to produce a comprehensive dataset of I. indigotica. Results A database of 36,367 unigenes (average length = 1,115.67 bases) was generated by performing transcriptome sequencing. Based on the gene annotation of the transcriptome, 104 unigenes were identified covering most of the catalytic steps in the general biosynthetic pathways of indole, terpenoid, and phenylpropanoid. Subsequently, the organ-specific expression patterns of the genes involved in these pathways, and their responses to methyl jasmonate (MeJA) induction, were investigated. Metabolites profile of effective phenylpropanoid showed accumulation pattern of secondary metabolites were mostly correlated with the transcription of their biosynthetic genes. According to the analysis of UDP-dependent glycosyltransferases (UGT) family, several flavonoids were indicated to exist in I. indigotica and further identified by metabolic profile using UPLC/Q-TOF. Moreover, applying transcriptome co-expression analysis, nine new, putative UGTs were suggested as flavonol glycosyltransferases and lignan glycosyltransferases. Conclusions This database provides a pool of candidate genes involved in biosynthesis of effective metabolites in I. indigotica. Furthermore, the comprehensive analysis and characterization of the significant pathways are expected to give a better insight regarding the diversity of chemical composition, synthetic characteristics, and the regulatory mechanism which operate in this medical herb. PMID:24308360

  7. Perigone Lobe Transcriptome Analysis Provides Insights into Rafflesia cantleyi Flower Development.

    PubMed

    Lee, Xin-Wei; Mat-Isa, Mohd-Noor; Mohd-Elias, Nur-Atiqah; Aizat-Juhari, Mohd Afiq; Goh, Hoe-Han; Dear, Paul H; Chow, Keng-See; Haji Adam, Jumaat; Mohamed, Rahmah; Firdaus-Raih, Mohd; Wan, Kiew-Lian

    2016-01-01

    Rafflesia is a biologically enigmatic species that is very rare in occurrence and possesses an extraordinary morphology. This parasitic plant produces a gigantic flower up to one metre in diameter with no leaves, stem or roots. However, little is known about the floral biology of this species especially at the molecular level. In an effort to address this issue, we have generated and characterised the transcriptome of the Rafflesia cantleyi flower, and performed a comparison with the transcriptome of its floral bud to predict genes that are expressed and regulated during flower development. Approximately 40 million sequencing reads were generated and assembled de novo into 18,053 transcripts with an average length of 641 bp. Of these, more than 79% of the transcripts had significant matches to annotated sequences in the public protein database. A total of 11,756 and 7,891 transcripts were assigned to Gene Ontology categories and clusters of orthologous groups respectively. In addition, 6,019 transcripts could be mapped to 129 pathways in Kyoto Encyclopaedia of Genes and Genomes Pathway database. Digital abundance analysis identified 52 transcripts with very high expression in the flower transcriptome of R. cantleyi. Subsequently, analysis of differential expression between developing flower and the floral bud revealed a set of 105 transcripts with potential role in flower development. Our work presents a deep transcriptome resource analysis for the developing flower of R. cantleyi. Genes potentially involved in the growth and development of the R. cantleyi flower were identified and provide insights into biological processes that occur during flower development.

  8. Bioinformatics analysis of transcriptome dynamics during growth in angus cattle longissimus muscle.

    PubMed

    Moisá, Sonia J; Shike, Daniel W; Graugnard, Daniel E; Rodriguez-Zas, Sandra L; Everts, Robin E; Lewin, Harris A; Faulkner, Dan B; Berger, Larry L; Loor, Juan J

    2013-01-01

    Transcriptome dynamics in the longissimus muscle (LM) of young Angus cattle were evaluated at 0, 60, 120, and 220 days from early-weaning. Bioinformatic analysis was performed using the dynamic impact approach (DIA) by means of Kyoto Encyclopedia of Genes and Genomes (KEGG) and Database for Annotation, Visualization and Integrated Discovery (DAVID) databases. Between 0 to 120 days (growing phase) most of the highly-impacted pathways (eg, ascorbate and aldarate metabolism, drug metabolism, cytochrome P450 and Retinol metabolism) were inhibited. The phase between 120 to 220 days (finishing phase) was characterized by the most striking differences with 3,784 differentially expressed genes (DEGs). Analysis of those DEGs revealed that the most impacted KEGG canonical pathway was glycosylphosphatidylinositol (GPI)-anchor biosynthesis, which was inhibited. Furthermore, inhibition of calpastatin and activation of tyrosine aminotransferase ubiquitination at 220 days promotes proteasomal degradation, while the concurrent activation of ribosomal proteins promotes protein synthesis. Therefore, the balance of these processes likely results in a steady-state of protein turnover during the finishing phase. Results underscore the importance of transcriptome dynamics in LM during growth.

  9. A rat RNA-Seq transcriptomic BodyMap across 11 organs and 4 developmental stages

    PubMed Central

    Yu, Ying; Fuscoe, James C.; Zhao, Chen; Guo, Chao; Jia, Meiwen; Qing, Tao; Bannon, Desmond I.; Lancashire, Lee; Bao, Wenjun; Du, Tingting; Luo, Heng; Su, Zhenqiang; Jones, Wendell D.; Moland, Carrie L.; Branham, William S.; Qian, Feng; Ning, Baitang; Li, Yan; Hong, Huixiao; Guo, Lei; Mei, Nan; Shi, Tieliu; Wang, Kevin Y.; Wolfinger, Russell D.; Nikolsky, Yuri; Walker, Stephen J.; Duerksen-Hughes, Penelope; Mason, Christopher E.; Tong, Weida; Thierry-Mieg, Jean; Thierry-Mieg, Danielle; Shi, Leming; Wang, Charles

    2014-01-01

    The rat has been used extensively as a model for evaluating chemical toxicities and for understanding drug mechanisms. However, its transcriptome across multiple organs, or developmental stages, has not yet been reported. Here we show, as part of the SEQC consortium efforts, a comprehensive rat transcriptomic BodyMap created by performing RNA-Seq on 320 samples from 11 organs of both sexes of juvenile, adolescent, adult and aged Fischer 344 rats. We catalogue the expression profiles of 40,064 genes, 65,167 transcripts, 31,909 alternatively spliced transcript variants and 2,367 non-coding genes/non-coding RNAs (ncRNAs) annotated in AceView. We find that organ-enriched, differentially expressed genes reflect the known organ-specific biological activities. A large number of transcripts show organ-specific, age-dependent or sex-specific differential expression patterns. We create a web-based, open-access rat BodyMap database of expression profiles with crosslinks to other widely used databases, anticipating that it will serve as a primary resource for biomedical research using the rat model. PMID:24510058

  10. No DNA damage response and negligible genome-wide transcriptional changes in human embryonic stem cells exposed to terahertz radiation

    PubMed Central

    Bogomazova, A. N.; Vassina, E. M.; Goryachkovskaya, T. N.; Popik, V. M.; Sokolov, A. S.; Kolchanov, N. A.; Lagarkova, M. A.; Kiselev, S. L.; Peltek, S. E.

    2015-01-01

    Terahertz (THz) radiation was proposed recently for use in various applications, including medical imaging and security scanners. However, there are concerns regarding the possible biological effects of non-ionising electromagnetic radiation in the THz range on cells. Human embryonic stem cells (hESCs) are extremely sensitive to environmental stimuli, and we therefore utilised this cell model to investigate the non-thermal effects of THz irradiation. We studied DNA damage and transcriptome responses in hESCs exposed to narrow-band THz radiation (2.3 THz) under strict temperature control. The transcription of approximately 1% of genes was subtly increased following THz irradiation. Functional annotation enrichment analysis of differentially expressed genes revealed 15 functional classes, which were mostly related to mitochondria. Terahertz irradiation did not induce the formation of γH2AX foci or structural chromosomal aberrations in hESCs. We did not observe any effect on the mitotic index or morphology of the hESCs following THz exposure. PMID:25582954

  11. Transcriptome Analysis and Comparison of Marmota monax and Marmota himalayana.

    PubMed

    Liu, Yanan; Wang, Baoju; Wang, Lu; Vikash, Vikash; Wang, Qin; Roggendorf, Michael; Lu, Mengji; Yang, Dongliang; Liu, Jia

    2016-01-01

    The Eastern woodchuck (Marmota monax) is a classical animal model for studying hepatitis B virus (HBV) infection and hepatocellular carcinoma (HCC) in humans. Recently, we found that Marmota himalayana, an Asian animal species closely related to Marmota monax, is susceptible to woodchuck hepatitis virus (WHV) infection and can be used as a new mammalian model for HBV infection. However, the lack of genomic sequence information of both Marmota models strongly limited their application breadth and depth. To address this major obstacle of the Marmota models, we utilized Illumina RNA-Seq technology to sequence the cDNA libraries of liver and spleen samples of two Marmota monax and four Marmota himalayana. In total, over 13 billion nucleotide bases were sequenced and approximately 1.5 billion clean reads were obtained. Following assembly, 106,496 consensus sequences of Marmota monax and 78,483 consensus sequences of Marmota himalayana were detected. For functional annotation, in total 73,603 Unigenes of Marmota monax and 78,483 Unigenes of Marmota himalayana were identified using different databases (NR, NT, Swiss-Prot, KEGG, COG, GO). The Unigenes were aligned by blastx to protein databases to decide the coding DNA sequences (CDS) and in total 41,247 CDS of Marmota monax and 34,033 CDS of Marmota himalayana were predicted. The single nucleotide polymorphisms (SNPs) and the simple sequence repeats (SSRs) were also analyzed for all Unigenes obtained. Moreover, a large-scale transcriptome comparison was performed and revealed a high similarity in transcriptome sequences between the two marmota species. Our study provides an extensive amount of novel sequence information for Marmota monax and Marmota himalayana. This information may serve as a valuable genomics resource for further molecular, developmental and comparative evolutionary studies, as well as for the identification and characterization of functional genes that are involved in WHV infection and HCC development in the woodchuck model.

  12. Transcriptome Analysis and Comparison of Marmota monax and Marmota himalayana

    PubMed Central

    Wang, Lu; Vikash, Vikash; Wang, Qin; Roggendorf, Michael; Lu, Mengji; Yang, Dongliang; Liu, Jia

    2016-01-01

    The Eastern woodchuck (Marmota monax) is a classical animal model for studying hepatitis B virus (HBV) infection and hepatocellular carcinoma (HCC) in humans. Recently, we found that Marmota himalayana, an Asian animal species closely related to Marmota monax, is susceptible to woodchuck hepatitis virus (WHV) infection and can be used as a new mammalian model for HBV infection. However, the lack of genomic sequence information of both Marmota models strongly limited their application breadth and depth. To address this major obstacle of the Marmota models, we utilized Illumina RNA-Seq technology to sequence the cDNA libraries of liver and spleen samples of two Marmota monax and four Marmota himalayana. In total, over 13 billion nucleotide bases were sequenced and approximately 1.5 billion clean reads were obtained. Following assembly, 106,496 consensus sequences of Marmota monax and 78,483 consensus sequences of Marmota himalayana were detected. For functional annotation, in total 73,603 Unigenes of Marmota monax and 78,483 Unigenes of Marmota himalayana were identified using different databases (NR, NT, Swiss-Prot, KEGG, COG, GO). The Unigenes were aligned by blastx to protein databases to decide the coding DNA sequences (CDS) and in total 41,247 CDS of Marmota monax and 34,033 CDS of Marmota himalayana were predicted. The single nucleotide polymorphisms (SNPs) and the simple sequence repeats (SSRs) were also analyzed for all Unigenes obtained. Moreover, a large-scale transcriptome comparison was performed and revealed a high similarity in transcriptome sequences between the two marmota species. Our study provides an extensive amount of novel sequence information for Marmota monax and Marmota himalayana. This information may serve as a valuable genomics resource for further molecular, developmental and comparative evolutionary studies, as well as for the identification and characterization of functional genes that are involved in WHV infection and HCC development in the woodchuck model. PMID:27806133

  13. Transcriptome analysis of mud crab (Scylla paramamosain) gills in response to Mud crab reovirus (MCRV).

    PubMed

    Liu, Shanshan; Chen, Guanxing; Xu, Haidong; Zou, Weibin; Yan, Wenrui; Wang, Qianqian; Deng, Hengwei; Zhang, Heqian; Yu, Guojiao; He, Jianguo; Weng, Shaoping

    2017-01-01

    Mud crab (Scylla paramamosain) is an economically important marine cultured species in China's coastal area. Mud crab reovirus (MCRV) is the most important pathogen of mud crab, resulting in large economic losses in crab farming. In this paper, next-generation sequencing technology and bioinformatics analysis are used to study transcriptome differences between MCRV-infected mud crab and normal control. A total of 104.3 million clean reads were obtained, including 52.7 million and 51.6 million clean reads from MCRV-infected (CA) and controlled (HA) mud crabs respectively. 81,901, 70,059 and 67,279 unigenes were gained respectively from HA reads, CA reads and HA&CA reads. A total of 32,547 unigenes from HA&CA reads called All-Unigenes were matched to at least one database among Nr, Nt, Swiss-prot, COG, GO and KEGG databases. Among these, 13,039, 20,260 and 11,866 unigenes belonged to the 3, 258 and 25 categories of GO, KEGG pathway, and COG databases, respectively. Solexa/Illumina's DGE platform was also used, and about 13,856 differentially expressed genes (DEGs), including 4444 significantly upregulated and 9412 downregulated DEGs were detected in diseased crabs compared with the control. KEGG pathway analysis revealed that DEGs were obviously enriched in the pathways related to different diseases or infections. This transcriptome analysis provided valuable information on gene functions associated with the response to MCRV in mud crab, as well as detail information for identifying novel genes in the absence of the mud crab genome database. Copyright © 2016. Published by Elsevier Ltd.

  14. Transcriptome Analysis in Venom Gland of the Predatory Giant Ant Dinoponera quadriceps: Insights into the Polypeptide Toxin Arsenal of Hymenopterans

    PubMed Central

    Chong, Cheong-Meng; Leung, Siu Wai; Prieto-da-Silva, Álvaro R. B.; Havt, Alexandre; Quinet, Yves P.; Martins, Alice M. C.; Lee, Simon M. Y.; Rádis-Baptista, Gandhi

    2014-01-01

    Background Dinoponera quadriceps is a predatory giant ant that inhabits the Neotropical region and subdues its prey (insects) with stings that deliver a toxic cocktail of molecules. Human accidents occasionally occur and cause local pain and systemic symptoms. A comprehensive study of the D. quadriceps venom gland transcriptome is required to advance our knowledge about the toxin repertoire of the giant ant venom and to understand the physiopathological basis of Hymenoptera envenomation. Results We conducted a transcriptome analysis of a cDNA library from the D. quadriceps venom gland with Sanger sequencing in combination with whole-transcriptome shotgun deep sequencing. From the cDNA library, a total of 420 independent clones were analyzed. Although the proportion of dinoponeratoxin isoform precursors was high, the first giant ant venom inhibitor cysteine-knot (ICK) toxin was found. The deep next generation sequencing yielded a total of 2,514,767 raw reads that were assembled into 18,546 contigs. A BLAST search of the assembled contigs against non-redundant and Swiss-Prot databases showed that 6,463 contigs corresponded to BLASTx hits and indicated an interesting diversity of transcripts related to venom gene expression. The majority of these venom-related sequences code for a major polypeptide core, which comprises venom allergens, lethal-like proteins and esterases, and a minor peptide framework composed of inter-specific structurally conserved cysteine-rich toxins. Both the cDNA library and deep sequencing yielded large proportions of contigs that showed no similarities with known sequences. Conclusions To our knowledge, this is the first report of the venom gland transcriptome of the New World giant ant D. quadriceps. The glandular venom system was dissected, and the toxin arsenal was revealed; this process brought to light novel sequences that included an ICK-folded toxins, allergen proteins, esterases (phospholipases and carboxylesterases), and lethal-like toxins. These findings contribute to the understanding of the ecology, behavior and venomics of hymenopterans. PMID:24498135

  15. Detailed Transcriptome Description of the Neglected Cestode Taenia multiceps

    PubMed Central

    Wu, Xuhang; Fu, Yan; Yang, Deying; Zhang, Runhui; Zheng, Wanpeng; Nie, Huaming; Xie, Yue; Yan, Ning; Hao, Guiying; Gu, Xiaobin; Wang, Shuxian; Peng, Xuerong; Yang, Guangyou

    2012-01-01

    Background The larval stage of Taenia multiceps, a global cestode, encysts in the central nervous system (CNS) of sheep and other livestock. This frequently leads to their death and huge socioeconomic losses, especially in developing countries. This parasite can also cause zoonotic infections in humans, but has been largely neglected due to a lack of diagnostic techniques and studies. Recent developments in next-generation sequencing provide an opportunity to explore the transcriptome of T. multiceps. Methodology/Principal Findings We obtained a total of 31,282 unigenes (mean length 920 bp) using Illumina paired-end sequencing technology and a new Trinity de novo assembler without a referenced genome. Individual transcription molecules were determined by sequence-based annotations and/or domain-based annotations against public databases (Nr, UniprotKB/Swiss-Prot, COG, KEGG, UniProtKB/TrEMBL, InterPro and Pfam). We identified 26,110 (83.47%) unigenes and inferred 20,896 (66.8%) coding sequences (CDS). Further comparative transcripts analysis with other cestodes (Taenia pisiformis, Taenia solium, Echincoccus granulosus and Echincoccus multilocularis) and intestinal parasites (Trichinella spiralis, Ancylostoma caninum and Ascaris suum) showed that 5,100 common genes were shared among three Taenia tapeworms, 261 conserved genes were detected among five Taeniidae cestodes, and 109 common genes were found in four zoonotic intestinal parasites. Some of the common genes were genes required for parasite survival, involved in parasite-host interactions. In addition, we amplified two full-length CDS of unigenes from the common genes using RT-PCR. Conclusions/Significance This study provides an extensive transcriptome of the adult stage of T. multiceps, and demonstrates that comparative transcriptomic investigations deserve to be further studied. This transcriptome dataset forms a substantial public information platform to achieve a fundamental understanding of the biology of T. multiceps, and helps in the identification of drug targets and parasite-host interaction studies. PMID:23049872

  16. LiverAtlas: a unique integrated knowledge database for systems-level research of liver and hepatic disease.

    PubMed

    Zhang, Yanqiong; Yang, Chunyuan; Wang, Shaochuang; Chen, Tao; Li, Mansheng; Wang, Xue; Li, Dongsheng; Wang, Kang; Ma, Jie; Wu, Songfeng; Zhang, Xueli; Zhu, Yunping; Wu, Jinsheng; He, Fuchu

    2013-09-01

    A large amount of liver-related physiological and pathological data exist in publicly available biological and bibliographic databases, which are usually far from comprehensive or integrated. Data collection, integration and mining processes pose a great challenge to scientific researchers and clinicians interested in the liver. To address these problems, we constructed LiverAtlas (http://liveratlas.hupo.org.cn), a comprehensive resource of biomedical knowledge related to the liver and various hepatic diseases by incorporating 53 databases. In the present version, LiverAtlas covers data on liver-related genomics, transcriptomics, proteomics, metabolomics and hepatic diseases. Additionally, LiverAtlas provides a wealth of manually curated information, relevant literature citations and cross-references to other databases. Importantly, an expert-confirmed Human Liver Disease Ontology, including relevant information for 227 types of hepatic disease, has been constructed and is used to annotate LiverAtlas data. Furthermore, we have demonstrated two examples of applying LiverAtlas data to identify candidate markers for hepatocellular carcinoma (HCC) at the systems level and to develop a systems biology-based classifier by combining the differential gene expression with topological features of human protein interaction networks to enhance the ability of HCC differential diagnosis. LiverAtlas is the most comprehensive liver and hepatic disease resource, which helps biologists and clinicians to analyse their data at the systems level and will contribute much to the biomarker discovery and diagnostic performance enhancement for liver diseases. © 2013 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.

  17. Fructose overfeeding in first-degree relatives of type 2 diabetic patients impacts energy metabolism and mitochondrial functions in skeletal muscle.

    PubMed

    Seyssel, Kevin; Meugnier, Emmanuelle; Lê, Kim-Anne; Durand, Christine; Disse, Emmanuel; Blond, Emilie; Pays, Laurent; Nataf, Serge; Brozek, John; Vidal, Hubert; Tappy, Luc; Laville, Martine

    2016-12-01

    The aim of the study was to assess the effects of a high-fructose diet (HFrD) on skeletal muscle transcriptomic response in healthy offspring of patients with type 2 diabetes, a subgroup of individuals prone to metabolic disorders. Ten healthy normal weight first-degree relatives of type 2 diabetic patients were submitted to a HFrD (+3.5 g fructose/kg fat-free mass per day) during 7 days. A global transcriptomic analysis was performed on skeletal muscle biopsies combined with in vitro experiments using primary myotubes. Transcriptomic analysis highlighted profound effects on fatty acid oxidation and mitochondrial pathways supporting the whole-body metabolic shift with the preferential use of carbohydrates instead of lipids. Bioinformatics tools pointed out possible transcription factors orchestrating this genomic regulation, such as PPARα and NR4A2. In vitro experiments in human myotubes suggested an indirect action of fructose in skeletal muscle, which seemed to be independent from lactate, uric acid, or nitric oxide. This study shows therefore that a large cluster of genes related to energy metabolism, mitochondrial function, and lipid oxidation was downregulated after 7 days of HFrD, thus supporting the concept that overconsumption of fructose-containing foods could contribute to metabolic deterioration in humans. © 2016 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  18. De novo transcriptomic analysis of cowpea (Vigna unguiculata L. Walp.) for genic SSR marker development.

    PubMed

    Chen, Honglin; Wang, Lixia; Liu, Xiaoyan; Hu, Liangliang; Wang, Suhua; Cheng, Xuzhen

    2017-07-11

    Cowpea [Vigna unguiculata (L.) Walp.] is one of the most important legumes in tropical and semi-arid regions. However, there is relatively little genomic information available for genetic research on and breeding of cowpea. The objectives of this study were to analyse the cowpea transcriptome and develop genic molecular markers for future genetic studies of this genus. Approximately 54 million high-quality cDNA sequence reads were obtained from cowpea based on Illumina paired-end sequencing technology and were de novo assembled to generate 47,899 unigenes with an N50 length of 1534 bp. Sequence similarity analysis revealed 36,289 unigenes (75.8%) with significant similarity to known proteins in the non-redundant (Nr) protein database, 23,471 unigenes (49.0%) with BLAST hits in the Swiss-Prot database, and 20,654 unigenes (43.1%) with high similarity in the Kyoto Encyclopedia of Genes and Genomes (KEGG) database. Further analysis identified 5560 simple sequence repeats (SSRs) as potential genic molecular markers. Validating a random set of 500 SSR markers yielded 54 polymorphic markers among 32 cowpea accessions. This transcriptomic analysis of cowpea provided a valuable set of genomic data for characterizing genes with important agronomic traits in Vigna unguiculata and a new set of genic SSR markers for further genetic studies and breeding in cowpea and related Vigna species.

  19. Next-Generation Sequencing of the Chrysanthemum nankingense (Asteraceae) Transcriptome Permits Large-Scale Unigene Assembly and SSR Marker Discovery

    PubMed Central

    Wang, Haibin; Jiang, Jiafu; Chen, Sumei; Qi, Xiangyu; Peng, Hui; Li, Pirui; Song, Aiping; Guan, Zhiyong; Fang, Weimin; Liao, Yuan; Chen, Fadi

    2013-01-01

    Background Simple sequence repeats (SSRs) are ubiquitous in eukaryotic genomes. Chrysanthemum is one of the largest genera in the Asteraceae family. Only few Chrysanthemum expressed sequence tag (EST) sequences have been acquired to date, so the number of available EST-SSR markers is very low. Methodology/Principal Findings Illumina paired-end sequencing technology produced over 53 million sequencing reads from C. nankingense mRNA. The subsequent de novo assembly yielded 70,895 unigenes, of which 45,789 (64.59%) unigenes showed similarity to the sequences in NCBI database. Out of 45,789 sequences, 107 have hits to the Chrysanthemum Nr protein database; 679 and 277 sequences have hits to the database of Helianthus and Lactuca species, respectively. MISA software identified a large number of putative EST-SSRs, allowing 1,788 primer pairs to be designed from the de novo transcriptome sequence and a further 363 from archival EST sequence. Among 100 primer pairs randomly chosen, 81 markers have amplicons and 20 are polymorphic for genotypes analysis in Chrysanthemum. The results showed that most (but not all) of the assays were transferable across species and that they exposed a significant amount of allelic diversity. Conclusions/Significance SSR markers acquired by transcriptome sequencing are potentially useful for marker-assisted breeding and genetic analysis in the genus Chrysanthemum and its related genera. PMID:23626799

  20. De novo comparative transcriptome analysis of genes involved in fruit morphology of pumpkin cultivars with extreme size difference and development of EST-SSR markers.

    PubMed

    Xanthopoulou, Aliki; Ganopoulos, Ioannis; Psomopoulos, Fotis; Manioudaki, Maria; Moysiadis, Theodoros; Kapazoglou, Aliki; Osathanunkul, Maslin; Michailidou, Sofia; Kalivas, Apostolos; Tsaftaris, Athanasios; Nianiou-Obeidat, Irini; Madesis, Panagiotis

    2017-07-30

    The genetic basis of fruit size and shape was investigated for the first time in Cucurbita species and genetic loci associated with fruit morphology have been identified. Although extensive genomic resources are available at present for tomato (Solanum lycopersicum), cucumber (Cucumis sativus), melon (Cucumis melo) and watermelon (Citrullus lanatus), genomic databases for Cucurbita species are limited. Recently, our group reported the generation of pumpkin (Cucurbita pepo) transcriptome databases from two contrasting cultivars with extreme fruit sizes. In the current study we used these databases to perform comparative transcriptome analysis in order to identify genes with potential roles in fruit morphology and fruit size. Differential Gene Expression (DGE) analysis between cv. 'Munchkin' (small-fruit) and cv. 'Big Moose' (large-fruit) revealed a variety of candidate genes associated with fruit morphology with significant differences in gene expression between the two cultivars. In addition, we have set the framework for generating EST-SSR markers, which discriminate different C. pepo cultivars and show transferability to related Cucurbitaceae species. The results of the present study will contribute to both further understanding the molecular mechanisms regulating fruit morphology and furthermore identifying the factors that determine fruit size. Moreover, they may lead to the development of molecular marker tools for selecting genotypes with desired morphological traits. Copyright © 2017. Published by Elsevier B.V.

  1. Antimycobacterial Activity: A New Pharmacological Target for Conotoxins Found in the First Reported Conotoxin from Conasprella ximenes.

    PubMed

    Figueroa-Montiel, Andrea; Bernáldez, Johanna; Jiménez, Samanta; Ueberhide, Beatrix; González, Luis Javier; Licea-Navarro, Alexei

    2018-01-23

    Mycobacterium tuberculosis is the etiological agent of tuberculosis, an airborne infectious disease that is a leading cause of human morbidity and mortality worldwide. We report here the first conotoxin that is able to inhibit the growth of M. tuberculosis at a concentration similar to that of two other drugs that are currently used in clinics. Furthermore, it is also the first conopeptide that has been isolated from the venom of Conasprella ximenes. The venom gland transcriptome of C. ximenes was sequenced to construct a database with 24,284 non-redundant transcripts. The conopeptide was purified from the venom using reverse phase high performance liquid chromatography (RP-HPLC) and was analyzed using electrospray ionization-mass spectrometry (ESI-MS/MS). No automatic identification above the identity threshold with 1% of the false discovery rate was obtained; however, a 10-amino-acid sequence tag, manually extracted from the MS/MS spectra, allowed for the identification of a conotoxin in the transcriptome database. Electron transfer higher energy collision dissociation (EThcD) fragmentation of the native conotoxin confirmed the N-terminal sequence (1-14), while LC-MS/MS analysis of the tryptic digest of the reduced and S-alkylated conotoxin confirmed the C-terminal region (15-36). The expected and experimental molecular masses corresponded, within sub-ppm mass error. The 37-mer peptide (MW 4109.69 Da), containing eight cysteine residues, was named I1_xm11a, according to the current nomenclature for this type of molecule.

  2. Sulfide Homeostasis and Nitroxyl Intersect via Formation of Reactive Sulfur Species in Staphylococcus aureus.

    PubMed

    Peng, Hui; Shen, Jiangchuan; Edmonds, Katherine A; Luebke, Justin L; Hickey, Anne K; Palmer, Lauren D; Chang, Feng-Ming James; Bruce, Kevin A; Kehl-Fie, Thomas E; Skaar, Eric P; Giedroc, David P

    2017-01-01

    Staphylococcus aureus is a commensal human pathogen and a major cause of nosocomial infections. As gaseous signaling molecules, endogenous hydrogen sulfide (H 2 S) and nitric oxide (NO·) protect S. aureus from antibiotic stress synergistically, which we propose involves the intermediacy of nitroxyl (HNO). Here, we examine the effect of exogenous sulfide and HNO on the transcriptome and the formation of low-molecular-weight (LMW) thiol persulfides of bacillithiol, cysteine, and coenzyme A as representative of reactive sulfur species (RSS) in wild-type and Δ cstR strains of S. aureus . CstR is a per- and polysulfide sensor that controls the expression of a sulfide oxidation and detoxification system. As anticipated, exogenous sulfide induces the cst operon but also indirectly represses much of the CymR regulon which controls cysteine metabolism. A zinc limitation response is also observed, linking sulfide homeostasis to zinc bioavailability. Cellular RSS levels impact the expression of a number of virulence factors, including the exotoxins, particularly apparent in the Δ cstR strain. HNO, like sulfide, induces the cst operon as well as other genes regulated by exogenous sulfide, a finding that is traced to a direct reaction of CstR with HNO and to an endogenous perturbation in cellular RSS, possibly originating from disassembly of Fe-S clusters. More broadly, HNO induces a transcriptomic response to Fe overload, Cu toxicity, and reactive oxygen species and reactive nitrogen species and shares similarity with the sigB regulon. This work reveals an H 2 S/NO· interplay in S. aureus that impacts transition metal homeostasis and virulence gene expression. IMPORTANCE Hydrogen sulfide (H 2 S) is a toxic molecule and a recently described gasotransmitter in vertebrates whose function in bacteria is not well understood. In this work, we describe the transcriptomic response of the major human pathogen Staphylococcus aureus to quantified changes in levels of cellular organic reactive sulfur species, which are effector molecules involved in H 2 S signaling. We show that nitroxyl (HNO), a recently described signaling intermediate proposed to originate from the interplay of H 2 S and nitric oxide, also induces changes in cellular sulfur speciation and transition metal homeostasis, thus linking sulfide homeostasis to an adaptive response to antimicrobial reactive nitrogen species.

  3. Sulfide Homeostasis and Nitroxyl Intersect via Formation of Reactive Sulfur Species in Staphylococcus aureus

    PubMed Central

    Peng, Hui; Shen, Jiangchuan; Edmonds, Katherine A.; Luebke, Justin L.; Hickey, Anne K.; Palmer, Lauren D.; Chang, Feng-Ming James; Bruce, Kevin A.; Kehl-Fie, Thomas E.; Skaar, Eric P.

    2017-01-01

    ABSTRACT Staphylococcus aureus is a commensal human pathogen and a major cause of nosocomial infections. As gaseous signaling molecules, endogenous hydrogen sulfide (H2S) and nitric oxide (NO·) protect S. aureus from antibiotic stress synergistically, which we propose involves the intermediacy of nitroxyl (HNO). Here, we examine the effect of exogenous sulfide and HNO on the transcriptome and the formation of low-molecular-weight (LMW) thiol persulfides of bacillithiol, cysteine, and coenzyme A as representative of reactive sulfur species (RSS) in wild-type and ΔcstR strains of S. aureus. CstR is a per- and polysulfide sensor that controls the expression of a sulfide oxidation and detoxification system. As anticipated, exogenous sulfide induces the cst operon but also indirectly represses much of the CymR regulon which controls cysteine metabolism. A zinc limitation response is also observed, linking sulfide homeostasis to zinc bioavailability. Cellular RSS levels impact the expression of a number of virulence factors, including the exotoxins, particularly apparent in the ΔcstR strain. HNO, like sulfide, induces the cst operon as well as other genes regulated by exogenous sulfide, a finding that is traced to a direct reaction of CstR with HNO and to an endogenous perturbation in cellular RSS, possibly originating from disassembly of Fe-S clusters. More broadly, HNO induces a transcriptomic response to Fe overload, Cu toxicity, and reactive oxygen species and reactive nitrogen species and shares similarity with the sigB regulon. This work reveals an H2S/NO· interplay in S. aureus that impacts transition metal homeostasis and virulence gene expression. IMPORTANCE Hydrogen sulfide (H2S) is a toxic molecule and a recently described gasotransmitter in vertebrates whose function in bacteria is not well understood. In this work, we describe the transcriptomic response of the major human pathogen Staphylococcus aureus to quantified changes in levels of cellular organic reactive sulfur species, which are effector molecules involved in H2S signaling. We show that nitroxyl (HNO), a recently described signaling intermediate proposed to originate from the interplay of H2S and nitric oxide, also induces changes in cellular sulfur speciation and transition metal homeostasis, thus linking sulfide homeostasis to an adaptive response to antimicrobial reactive nitrogen species. PMID:28656172

  4. Transcriptional profiling of rat white adipose tissue response to 2,3,7,8-tetrachlorodibenzo-ρ-dioxin

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Houlahan, Kathleen E.; Prokopec, Stephenie D.; Sun, Ren X.

    Polychlorinated dibenzodioxins are environmental contaminants commonly produced as a by-product of industrial processes. The most potent of these, 2,3,7,8-tetrachlorodibenzo-ρ-dioxin (TCDD), is highly lipophilic, leading to bioaccumulation. White adipose tissue (WAT) is a major site for energy storage, and is one of the organs in which TCDD accumulates. In laboratory animals, exposure to TCDD causes numerous metabolic abnormalities, including a wasting syndrome. We therefore investigated the molecular effects of TCDD exposure on WAT by profiling the transcriptomic response of WAT to 100 μg/kg of TCDD at 1 or 4 days in TCDD-sensitive Long-Evans (Turku/AB; L-E) rats. A comparative analysis was conductedmore » simultaneously in identically treated TCDD-resistant Han/Wistar (Kuopio; H/W) rats one day after exposure to the same dose. We sought to identify transcriptomic changes coinciding with the onset of toxicity, while gaining additional insight into later responses. More transcriptional responses to TCDD were observed at 4 days than at 1 day post-exposure, suggesting WAT shows mostly secondary responses. Two classic AHR-regulated genes, Cyp1a1 and Nqo1, were significantly induced by TCDD in both strains, while several genes involved in the immune response, including Ms4a7 and F13a1 were altered in L-E rats alone. We compared genes affected by TCDD in rat WAT and human adipose cells, and observed little overlap. Interestingly, very few genes involved in lipid metabolism exhibited altered expression levels despite the pronounced lipid mobilization from peripheral fat pads by TCDD in L-E rats. Of these genes, the lipolysis-associated Lpin1 was induced slightly over 2-fold in L-E rat WAT on day 4. - Highlights: • Exposure to TCDD causes wasting syndrome in L-E rats but not in H/W rats. • We examined the transcriptome of TCDD-treated L-E and H/W rat white adipose tissue. • L-E WAT demonstrated altered abundance of several genes involved in immune response. • Few genes had altered abundance in both L-E rat WAT and human adipocytes (hMADS). • Pmepa1 was induced in both L-E WAT and differentiated hMADS cells.« less

  5. Transcriptomic analysis of flower development in tea (Camellia sinensis (L.)).

    PubMed

    Liu, Feng; Wang, Yu; Ding, Zhaotang; Zhao, Lei; Xiao, Jun; Wang, Linjun; Ding, Shibo

    2017-10-05

    Flowering is a critical and complicated process in plant development, involving interactions of numerous endogenous and environmental factors, but little is known about the complex network regulating flower development in tea plants. In this study, de novo transcriptome assembly and gene expression analysis using Illumina sequencing technology were performed. Transcriptomic analysis assembles gene-related information involved in reproductive growth of C. sinensis. Gene Ontology (GO) analysis of the annotated unigenes revealed that the majority of sequenced genes were associated with metabolic and cellular processes, cell and cell parts, catalytic activity and binding. Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway analysis indicated that metabolic pathways, biosynthesis of secondary metabolites, and plant hormone signal transduction were enriched among the DEGs. Furthermore, 207 flowering-associated unigenes were identified from our database. Some transcription factors, such as WRKY, ERF, bHLH, MYB and MADS-box were shown to be up-regulated in floral transition, which might play the role of progression of flowering. Furthermore, 14 genes were selected for confirmation of expression levels using quantitative real-time PCR (qRT-PCR). The comprehensive transcriptomic analysis presents fundamental information on the genes and pathways which are involved in flower development in C. sinensis. Our data also provided a useful database for further research of tea and other species of plants. Copyright © 2017 Elsevier B.V. All rights reserved.

  6. Transcriptome In Vivo Analysis (TIVA) of spatially defined single cells in intact live mouse and human brain tissue

    PubMed Central

    Lovatt, Ditte; Ruble, Brittani K.; Lee, Jaehee; Dueck, Hannah; Kim, Tae Kyung; Fisher, Stephen; Francis, Chantal; Spaethling, Jennifer M.; Wolf, John A.; Grady, M. Sean; Ulyanova, Alexandra V.; Yeldell, Sean B.; Griepenburg, Julianne C.; Buckley, Peter T.; Kim, Junhyong; Sul, Jai-Yoon; Dmochowski, Ivan J.; Eberwine, James

    2014-01-01

    Transcriptome profiling is an indispensable tool in advancing the understanding of single cell biology, but depends upon methods capable of isolating mRNA at the spatial resolution of a single cell. Current capture methods lack sufficient spatial resolution to isolate mRNA from individual in vivo resident cells without damaging adjacent tissue. Because of this limitation, it has been difficult to assess the influence of the microenvironment on the transcriptome of individual neurons. Here, we engineered a Transcriptome In Vivo Analysis (TIVA)-tag, which upon photoactivation enables mRNA capture from single cells in live tissue. Using the TIVA-tag in combination with RNA-seq to analyze transcriptome variance among single dispersed cells and in vivo resident mouse and human neurons, we show that the tissue microenvironment shapes the transcriptomic landscape of individual cells. The TIVA methodology provides the first noninvasive approach for capturing mRNA from single cells in their natural microenvironment. PMID:24412976

  7. Transcriptome Analysis of PA Gain and Loss of Function Mutants.

    PubMed

    Marco, Francisco; Carrasco, Pedro

    2018-01-01

    Functional genomics has become a forefront methodology for plant science thanks to the widespread development of microarray technology. While technical difficulties associated with the process of obtaining raw expression data have been diminishing, allowing the appearance of tremendous amounts of transcriptome data in different databases, a common problem using "omic" technologies remains: the interpretation of these data and the inference of its biological meaning. In order to assist to this complex task, a wide variety of software tools have been developed. In this chapter we describe our current workflow of the application of some of these analyses. We have used it to compare the transcriptome of plants with differences in their polyamine levels.

  8. Estrogen alters the profile of the transcriptome in river snail Bellamya aeruginosa.

    PubMed

    Lei, Kun; Liu, Ruizhi; An, Li-Hui; Luo, Ying-Feng; LeBlanc, Gerald A

    2015-03-01

    We evaluated the transcriptome dynamics of the freshwater river snail Bellamya aeruginosa exposed to 17β-estradiol (E2) using the Roche/454 GS-FLX platform. In total, 41,869 unigenes, with an average length of 586 bp, representing 36,181 contigs and 5,688 singlets were obtained. Among them, 18.08, 36.85, and 25.47 % matched sequences in the GenBank non-redundant nucleic acid database, non-redundant protein database, and Swiss protein database, respectively. Annotation of the unigenes with gene ontology, and then mapping them to biological pathways, revealed large groups of genes related to growth, development, reproduction, signal transduction, and defense mechanisms. Significant differences were found in gene expression in both liver and testicular tissues between control and E2-exposed organisms. These changes in gene expression will help in understanding the molecular mechanisms of the response to physiological stress in the river snail exposed to estrogen, and will facilitate research into biological processes and underlying physiological adaptations to xenoestrogen exposure in gastropods.

  9. Transcriptomic Studies of Malaria: a Paradigm for Investigation of Systemic Host-Pathogen Interactions

    PubMed Central

    2018-01-01

    SUMMARY Transcriptomics, the analysis of genome-wide RNA expression, is a common approach to investigate host and pathogen processes in infectious diseases. Technical and bioinformatic advances have permitted increasingly thorough analyses of the association of RNA expression with fundamental biology, immunity, pathogenesis, diagnosis, and prognosis. Transcriptomic approaches can now be used to realize a previously unattainable goal, the simultaneous study of RNA expression in host and pathogen, in order to better understand their interactions. This exciting prospect is not without challenges, especially as focus moves from interactions in vitro under tightly controlled conditions to tissue- and systems-level interactions in animal models and natural and experimental infections in humans. Here we review the contribution of transcriptomic studies to the understanding of malaria, a parasitic disease which has exerted a major influence on human evolution and continues to cause a huge global burden of disease. We consider malaria a paradigm for the transcriptomic assessment of systemic host-pathogen interactions in humans, because much of the direct host-pathogen interaction occurs within the blood, a readily sampled compartment of the body. We illustrate lessons learned from transcriptomic studies of malaria and how these lessons may guide studies of host-pathogen interactions in other infectious diseases. We propose that the potential of transcriptomic studies to improve the understanding of malaria as a disease remains partly untapped because of limitations in study design rather than as a consequence of technological constraints. Further advances will require the integration of transcriptomic data with analytical approaches from other scientific disciplines, including epidemiology and mathematical modeling. PMID:29695497

  10. Transcriptomic Studies of Malaria: a Paradigm for Investigation of Systemic Host-Pathogen Interactions.

    PubMed

    Lee, Hyun Jae; Georgiadou, Athina; Otto, Thomas D; Levin, Michael; Coin, Lachlan J; Conway, David J; Cunnington, Aubrey J

    2018-06-01

    Transcriptomics, the analysis of genome-wide RNA expression, is a common approach to investigate host and pathogen processes in infectious diseases. Technical and bioinformatic advances have permitted increasingly thorough analyses of the association of RNA expression with fundamental biology, immunity, pathogenesis, diagnosis, and prognosis. Transcriptomic approaches can now be used to realize a previously unattainable goal, the simultaneous study of RNA expression in host and pathogen, in order to better understand their interactions. This exciting prospect is not without challenges, especially as focus moves from interactions in vitro under tightly controlled conditions to tissue- and systems-level interactions in animal models and natural and experimental infections in humans. Here we review the contribution of transcriptomic studies to the understanding of malaria, a parasitic disease which has exerted a major influence on human evolution and continues to cause a huge global burden of disease. We consider malaria a paradigm for the transcriptomic assessment of systemic host-pathogen interactions in humans, because much of the direct host-pathogen interaction occurs within the blood, a readily sampled compartment of the body. We illustrate lessons learned from transcriptomic studies of malaria and how these lessons may guide studies of host-pathogen interactions in other infectious diseases. We propose that the potential of transcriptomic studies to improve the understanding of malaria as a disease remains partly untapped because of limitations in study design rather than as a consequence of technological constraints. Further advances will require the integration of transcriptomic data with analytical approaches from other scientific disciplines, including epidemiology and mathematical modeling. Copyright © 2018 Lee et al.

  11. Lymphoblast-derived integration-free iPSC line AD-TREM2-3 from a 74 year-old Alzheimer's disease patient expressing the TREM2 p.R47H variant.

    PubMed

    Martins, Soraia; Yigit, Hatice; Bohndorf, Martina; Graffmann, Nina; Fiszl, Aurelian Robert; Wruck, Wasco; Sleegers, Kristel; Van Broeckhoven, Christine; Adjaye, James

    2018-06-01

    Human lymphoblast cells from a male diagnosed with Alzheimer's disease (AD) expressing the TREM2 p.R47H variant were used to generate integration-free induced pluripotent stem cells (iPSCs) by over-expressing episomal-based plasmids harbouring OCT4, SOX2, KLF4, LIN28, L-MYC and p53 shRNA. The derived iPSC line - AD-TREM2-3 was defined as pluripotent based on (i) expression of pluripotency-associated markers (ii) embryoid body-based differentiation into cell types representative of the three germ layers and (iii) the similarity between the transcriptome of the iPSC line and the human embryonic stem cell line H1 with a Pearson correlation of 0.940. Copyright © 2018. Published by Elsevier B.V.

  12. Developmental Transcriptome Analysis and Identification of Genes Involved in Larval Metamorphosis of the Razor Clam, Sinonovacula constricta.

    PubMed

    Niu, Donghong; Wang, Fei; Xie, Shumei; Sun, Fanyue; Wang, Ze; Peng, Maoxiao; Li, Jiale

    2016-04-01

    The razor clam Sinonovacula constricta is an important commercial species. The deficiency of developmental transcriptomic data is becoming the bottleneck of further researches on the mechanisms underlying settlement and metamorphosis in early development. In this study, de novo transcriptome sequencing was performed for S. constricta at different early developmental stages by using Illumina HiSeq 2000 paired-end (PE) sequencing technology. A total of 112,209,077 PE clean reads were generated. De novo assembly generated 249,795 contigs with an average length of 585 bp. Gene annotation resulted in the identification of 22,870 unigene hits against the NCBI database. Eight unique sequences related to metamorphosis were identified and analyzed using real-time PCR. The razor clam reference transcriptome would provide useful information on early developmental and metamorphosis mechanisms and could be used in the genetic breeding of shellfish.

  13. On the Spectral Evolution of Helium-atmosphere White Dwarfs Showing Traces of Hydrogen

    NASA Astrophysics Data System (ADS)

    Rolland, B.; Bergeron, P.; Fontaine, G.

    2018-04-01

    We present a detailed spectroscopic analysis of 115 helium-line (DB) and 28 cool, He-rich hydrogen-line (DA) white dwarfs based on atmosphere fits to optical spectroscopy and photometry. We find that 63% of our DB population show hydrogen lines, making them DBA stars. We also demonstrate the persistence of pure DB white dwarfs with no detectable hydrogen feature at low effective temperatures. Using state-of-the art envelope models, we next compute the total quantity of hydrogen, M H, that is contained in the outer convection zone as a function of effective temperature and atmospheric H/He ratio. We find that some (T eff, M H) pairs cannot physically exist as a homogeneously mixed structure; such a combination can only occur as stratified objects of the DA spectral type. On that basis, we show that the values of M H inferred for the bulk of the DBA stars are too large and incompatible with the convective dilution scenario. We also present evidence that the hydrogen abundances measured in DBA and cool, helium-rich white dwarfs cannot be globally accounted for by any kind of accretion mechanism onto a pure DB star. We suggest that cool, He-rich DA white dwarfs are most likely created by the convective mixing of a DA star with a thin hydrogen envelope; they are not cooled down DBAs. We finally explore several scenarios that could account for the presence of hydrogen in DBA stars.

  14. Pyrosequencing the Bemisia tabaci Transcriptome Reveals a Highly Diverse Bacterial Community and a Robust System for Insecticide Resistance

    PubMed Central

    Wu, Qing-jun; Wang, Shao-li; Yang, Xin; Yang, Ni-na; Li, Ru-mei; Jiao, Xiao-guo; Pan, Hui-peng; Liu, Bai-ming; Su, Qi; Xu, Bao-yun; Hu, Song-nian; Zhou, Xu-guo; Zhang, You-jun

    2012-01-01

    Background Bemisia tabaci (Gennadius) is a phloem-feeding insect poised to become one of the major insect pests in open field and greenhouse production systems throughout the world. The high level of resistance to insecticides is a main factor that hinders continued use of insecticides for suppression of B. tabaci. Despite its prevalence, little is known about B. tabaci at the genome level. To fill this gap, an invasive B. tabaci B biotype was subjected to pyrosequencing-based transcriptome analysis to identify genes and gene networks putatively involved in various physiological and toxicological processes. Methodology and Principal Findings Using Roche 454 pyrosequencing, 857,205 reads containing approximately 340 megabases were obtained from the B. tabaci transcriptome. De novo assembly generated 178,669 unigenes including 30,980 from insects, 17,881 from bacteria, and 129,808 from the nohit. A total of 50,835 (28.45%) unigenes showed similarity to the non-redundant database in GenBank with a cut-off E-value of 10–5. Among them, 40,611 unigenes were assigned to one or more GO terms and 6,917 unigenes were assigned to 288 known pathways. De novo metatranscriptome analysis revealed highly diverse bacterial symbionts in B. tabaci, and demonstrated the host-symbiont cooperation in amino acid production. In-depth transcriptome analysis indentified putative molecular markers, and genes potentially involved in insecticide resistance and nutrient digestion. The utility of this transcriptome was validated by a thiamethoxam resistance study, in which annotated cytochrome P450 genes were significantly overexpressed in the resistant B. tabaci in comparison to its susceptible counterparts. Conclusions This transcriptome/metatranscriptome analysis sheds light on the molecular understanding of symbiosis and insecticide resistance in an agriculturally important phloem-feeding insect pest, and lays the foundation for future functional genomics research of the B. tabaci complex. Moreover, current pyrosequencing effort greatly enriched the existing whitefly EST database, and makes RNAseq a viable option for future genomic analysis. PMID:22558125

  15. De novo assembly and characterization of the garlic (Allium sativum) bud transcriptome by Illumina sequencing.

    PubMed

    Sun, Xiudong; Zhou, Shumei; Meng, Fanlu; Liu, Shiqi

    2012-10-01

    Garlic is widely used as a spice throughout the world for the culinary value of its flavor and aroma, which are created by the chemical transformation of a series of organic sulfur compounds. To analyze the transcriptome of Allium sativum and discover the genes involved in sulfur metabolism, cDNAs derived from the total RNA of Allium sativum buds were analyzed by Illumina sequencing. Approximately 26.67 million 90 bp paired-end clean reads were achieved in two libraries. A total of 127,933 unigenes were generated by de novo assembly and were compared with the sequences in public databases. Of these, 45,286 unigenes had significant hits to the sequences in the Nr database, 29,514 showed significant similarity to known proteins in the Swiss-Prot database and, 20,706 and 21,952 unigenes had significant similarity to existing sequences in the KEGG and COG databases, respectively. Moreover, genes involved in organic sulfur biosynthesis were identified. These unigenes data will provide the foundation for research on gene expression, genomics and functional genomics in Allium sativum. Key message The obtained unigenes will provide the foundation for research on functional genomics in Allium sativum and its closely related species, and fill the gap of the existing plant EST database.

  16. Transcriptome profiling of pumpkin (Cucurbita moschata Duch.) leaves infected with powdery mildew

    PubMed Central

    Chen, Bi-Hua; Chen, Xue-Jin; Guo, Yan-Yan; Yang, He-Lian; Li, Xin-Zheng; Wang, Guang-Yin

    2018-01-01

    Cucurbit powdery mildew (PM) is one of the most severe fungal diseases, but the molecular mechanisms underlying PM resistance remain largely unknown, especially in pumpkin (Cucurbita moschata Duch.). The goal of this study was to identify gene expression differences in PM-treated plants (harvested at 24 h and 48 h after inoculation) and untreated (control) plants of inbred line “112–2” using RNA sequencing (RNA-Seq). The inbred line “112–2” has been purified over 8 consecutive generations of self-pollination and shows high resistance to PM. More than 7600 transcripts were examined in pumpkin leaves, and 3129 and 3080 differentially expressed genes (DEGs) were identified in inbred line “112–2” at 24 and 48 hours post inoculation (hpi), respectively. Based on the KEGG (Kyoto Encyclopedia of Genes and Genomes) pathway database and GO (Gene Ontology) database, a complex regulatory network for PM resistance that may involve hormone signal transduction pathways, transcription factors and defense responses was revealed at the transcription level. In addition, the expression profiles of 16 selected genes were analyzed using quantitative RT-PCR. Among these genes, the transcript levels of 6 DEGs, including bHLH87 (Basic Helix-loop-helix transcription factor), ERF014 (Ethylene response factor), WRKY21 (WRKY domain), HSF (heat stress transcription factor A), MLO3 (Mildew Locus O), and SGT1 (Suppressor of G-Two Allele of Skp1), in PM-resistant “112–2” were found to be significantly up- or down-regulated both before 9 hpi and at 24 hpi or 48 hpi; this behavior differed from that observed in the PM-susceptible material (cultivar “Jiujiangjiaoding”). The transcriptome data provide novel insights into the response of Cucurbita moschata to PM stress and are expected to be highly useful for dissecting PM defense mechanisms in this major vegetable and for improving pumpkin breeding with enhanced resistance to PM. PMID:29320569

  17. Transcriptome profiling of pumpkin (Cucurbita moschata Duch.) leaves infected with powdery mildew.

    PubMed

    Guo, Wei-Li; Chen, Bi-Hua; Chen, Xue-Jin; Guo, Yan-Yan; Yang, He-Lian; Li, Xin-Zheng; Wang, Guang-Yin

    2018-01-01

    Cucurbit powdery mildew (PM) is one of the most severe fungal diseases, but the molecular mechanisms underlying PM resistance remain largely unknown, especially in pumpkin (Cucurbita moschata Duch.). The goal of this study was to identify gene expression differences in PM-treated plants (harvested at 24 h and 48 h after inoculation) and untreated (control) plants of inbred line "112-2" using RNA sequencing (RNA-Seq). The inbred line "112-2" has been purified over 8 consecutive generations of self-pollination and shows high resistance to PM. More than 7600 transcripts were examined in pumpkin leaves, and 3129 and 3080 differentially expressed genes (DEGs) were identified in inbred line "112-2" at 24 and 48 hours post inoculation (hpi), respectively. Based on the KEGG (Kyoto Encyclopedia of Genes and Genomes) pathway database and GO (Gene Ontology) database, a complex regulatory network for PM resistance that may involve hormone signal transduction pathways, transcription factors and defense responses was revealed at the transcription level. In addition, the expression profiles of 16 selected genes were analyzed using quantitative RT-PCR. Among these genes, the transcript levels of 6 DEGs, including bHLH87 (Basic Helix-loop-helix transcription factor), ERF014 (Ethylene response factor), WRKY21 (WRKY domain), HSF (heat stress transcription factor A), MLO3 (Mildew Locus O), and SGT1 (Suppressor of G-Two Allele of Skp1), in PM-resistant "112-2" were found to be significantly up- or down-regulated both before 9 hpi and at 24 hpi or 48 hpi; this behavior differed from that observed in the PM-susceptible material (cultivar "Jiujiangjiaoding"). The transcriptome data provide novel insights into the response of Cucurbita moschata to PM stress and are expected to be highly useful for dissecting PM defense mechanisms in this major vegetable and for improving pumpkin breeding with enhanced resistance to PM.

  18. Evaluating the effect of database inflation in proteogenomic search on sensitive and reliable peptide identification.

    PubMed

    Li, Honglan; Joh, Yoon Sung; Kim, Hyunwoo; Paek, Eunok; Lee, Sang-Won; Hwang, Kyu-Baek

    2016-12-22

    Proteogenomics is a promising approach for various tasks ranging from gene annotation to cancer research. Databases for proteogenomic searches are often constructed by adding peptide sequences inferred from genomic or transcriptomic evidence to reference protein sequences. Such inflation of databases has potential of identifying novel peptides. However, it also raises concerns on sensitive and reliable peptide identification. Spurious peptides included in target databases may result in underestimated false discovery rate (FDR). On the other hand, inflation of decoy databases could decrease the sensitivity of peptide identification due to the increased number of high-scoring random hits. Although several studies have addressed these issues, widely applicable guidelines for sensitive and reliable proteogenomic search have hardly been available. To systematically evaluate the effect of database inflation in proteogenomic searches, we constructed a variety of real and simulated proteogenomic databases for yeast and human tandem mass spectrometry (MS/MS) data, respectively. Against these databases, we tested two popular database search tools with various approaches to search result validation: the target-decoy search strategy (with and without a refined scoring-metric) and a mixture model-based method. The effect of separate filtering of known and novel peptides was also examined. The results from real and simulated proteogenomic searches confirmed that separate filtering increases the sensitivity and reliability in proteogenomic search. However, no one method consistently identified the largest (or the smallest) number of novel peptides from real proteogenomic searches. We propose to use a set of search result validation methods with separate filtering, for sensitive and reliable identification of peptides in proteogenomic search.

  19. A-WINGS: an integrated genome database for Pleurocybella porrigens (Angel's wing oyster mushroom, Sugihiratake).

    PubMed

    Yamamoto, Naoki; Suzuki, Tomohiro; Kobayashi, Masaaki; Dohra, Hideo; Sasaki, Yohei; Hirai, Hirofumi; Yokoyama, Koji; Kawagishi, Hirokazu; Yano, Kentaro

    2014-12-03

    The angel's wing oyster mushroom (Pleurocybella porrigens, Sugihiratake) is a well-known delicacy. However, its potential risk in acute encephalopathy was recently revealed by a food poisoning incident. To disclose the genes underlying the accident and provide mechanistic insight, we seek to develop an information infrastructure containing omics data. In our previous work, we sequenced the genome and transcriptome using next-generation sequencing techniques. The next step in achieving our goal is to develop a web database to facilitate the efficient mining of large-scale omics data and identification of genes specifically expressed in the mushroom. This paper introduces a web database A-WINGS (http://bioinf.mind.meiji.ac.jp/a-wings/) that provides integrated genomic and transcriptomic information for the angel's wing oyster mushroom. The database contains structure and functional annotations of transcripts and gene expressions. Functional annotations contain information on homologous sequences from NCBI nr and UniProt, Gene Ontology, and KEGG Orthology. Digital gene expression profiles were derived from RNA sequencing (RNA-seq) analysis in the fruiting bodies and mycelia. The omics information stored in the database is freely accessible through interactive and graphical interfaces by search functions that include 'GO TREE VIEW' browsing, keyword searches, and BLAST searches. The A-WINGS database will accelerate omics studies on specific aspects of the angel's wing oyster mushroom and the family Tricholomataceae.

  20. Multi-tissue RNA-seq and transcriptome characterisation of the spiny dogfish shark (Squalus acanthias) provides a molecular tool for biological research and reveals new genes involved in osmoregulation.

    PubMed

    Chana-Munoz, Andres; Jendroszek, Agnieszka; Sønnichsen, Malene; Kristiansen, Rune; Jensen, Jan K; Andreasen, Peter A; Bendixen, Christian; Panitz, Frank

    2017-01-01

    The spiny dogfish shark (Squalus acanthias) is one of the most commonly used cartilaginous fishes in biological research, especially in the fields of nitrogen metabolism, ion transporters and osmoregulation. Nonetheless, transcriptomic data for this organism is scarce. In the present study, a multi-tissue RNA-seq experiment and de novo transcriptome assembly was performed in four different spiny dogfish tissues (brain, liver, kidney and ovary), providing an annotated sequence resource. The characterization of the transcriptome greatly increases the scarce sequence information for shark species. Reads were assembled with the Trinity de novo assembler both within each tissue and across all tissues combined resulting in 362,690 transcripts in the combined assembly which represent 289,515 Trinity genes. BUSCO analysis determined a level of 87% completeness for the combined transcriptome. In total, 123,110 proteins were predicted of which 78,679 and 83,164 had significant hits against the SwissProt and Uniref90 protein databases, respectively. Additionally, 61,215 proteins aligned to known protein domains, 7,208 carried a signal peptide and 15,971 possessed at least one transmembrane region. Based on the annotation, 81,582 transcripts were assigned to gene ontology terms and 42,078 belong to known clusters of orthologous groups (eggNOG). To demonstrate the value of our molecular resource, we show that the improved transcriptome data enhances the current possibilities of osmoregulation research in spiny dogfish by utilizing the novel gene and protein annotations to investigate a set of genes involved in urea synthesis and urea, ammonia and water transport, all of them crucial in osmoregulation. We describe the presence of different gene copies and isoforms of key enzymes involved in this process, including arginases and transporters of urea and ammonia, for which sequence information is currently absent in the databases for this model species. The transcriptome assemblies and the derived annotations generated in this study will support the ongoing research for this particular animal model and provides a new molecular tool to assist biological research in cartilaginous fishes.

  1. Multi-tissue RNA-seq and transcriptome characterisation of the spiny dogfish shark (Squalus acanthias) provides a molecular tool for biological research and reveals new genes involved in osmoregulation

    PubMed Central

    Chana-Munoz, Andres; Jendroszek, Agnieszka; Sønnichsen, Malene; Kristiansen, Rune; Jensen, Jan K.; Bendixen, Christian

    2017-01-01

    The spiny dogfish shark (Squalus acanthias) is one of the most commonly used cartilaginous fishes in biological research, especially in the fields of nitrogen metabolism, ion transporters and osmoregulation. Nonetheless, transcriptomic data for this organism is scarce. In the present study, a multi-tissue RNA-seq experiment and de novo transcriptome assembly was performed in four different spiny dogfish tissues (brain, liver, kidney and ovary), providing an annotated sequence resource. The characterization of the transcriptome greatly increases the scarce sequence information for shark species. Reads were assembled with the Trinity de novo assembler both within each tissue and across all tissues combined resulting in 362,690 transcripts in the combined assembly which represent 289,515 Trinity genes. BUSCO analysis determined a level of 87% completeness for the combined transcriptome. In total, 123,110 proteins were predicted of which 78,679 and 83,164 had significant hits against the SwissProt and Uniref90 protein databases, respectively. Additionally, 61,215 proteins aligned to known protein domains, 7,208 carried a signal peptide and 15,971 possessed at least one transmembrane region. Based on the annotation, 81,582 transcripts were assigned to gene ontology terms and 42,078 belong to known clusters of orthologous groups (eggNOG). To demonstrate the value of our molecular resource, we show that the improved transcriptome data enhances the current possibilities of osmoregulation research in spiny dogfish by utilizing the novel gene and protein annotations to investigate a set of genes involved in urea synthesis and urea, ammonia and water transport, all of them crucial in osmoregulation. We describe the presence of different gene copies and isoforms of key enzymes involved in this process, including arginases and transporters of urea and ammonia, for which sequence information is currently absent in the databases for this model species. The transcriptome assemblies and the derived annotations generated in this study will support the ongoing research for this particular animal model and provides a new molecular tool to assist biological research in cartilaginous fishes. PMID:28832628

  2. VCGDB: a dynamic genome database of the Chinese population

    PubMed Central

    2014-01-01

    Background The data released by the 1000 Genomes Project contain an increasing number of genome sequences from different nations and populations with a large number of genetic variations. As a result, the focus of human genome studies is changing from single and static to complex and dynamic. The currently available human reference genome (GRCh37) is based on sequencing data from 13 anonymous Caucasian volunteers, which might limit the scope of genomics, transcriptomics, epigenetics, and genome wide association studies. Description We used the massive amount of sequencing data published by the 1000 Genomes Project Consortium to construct the Virtual Chinese Genome Database (VCGDB), a dynamic genome database of the Chinese population based on the whole genome sequencing data of 194 individuals. VCGDB provides dynamic genomic information, which contains 35 million single nucleotide variations (SNVs), 0.5 million insertions/deletions (indels), and 29 million rare variations, together with genomic annotation information. VCGDB also provides a highly interactive user-friendly virtual Chinese genome browser (VCGBrowser) with functions like seamless zooming and real-time searching. In addition, we have established three population-specific consensus Chinese reference genomes that are compatible with mainstream alignment software. Conclusions VCGDB offers a feasible strategy for processing big data to keep pace with the biological data explosion by providing a robust resource for genomics studies; in particular, studies aimed at finding regions of the genome associated with diseases. PMID:24708222

  3. Genomic Organization, Transcriptomic Analysis, and Functional Characterization of Avian α- and β-Keratins in Diverse Feather Forms

    PubMed Central

    Fan, Wen-Lang; Yan, Jie; Chen, Chih-Kuan; Lai, Yu-Ting; Wu, Siao-Man; Mao, Chi-Tang; Chen, Jun-Jie; Lu, Mei-Yeh Jade; Ho, Meng-Ru; Widelitz, Randall B.; Chen, Chih-Feng; Chuong, Cheng-Ming; Li, Wen-Hsiung

    2014-01-01

    Feathers are hallmark avian integument appendages, although they were also present on theropods. They are composed of flexible corneous materials made of α- and β-keratins, but their genomic organization and their functional roles in feathers have not been well studied. First, we made an exhaustive search of α- and β-keratin genes in the new chicken genome assembly (Galgal4). Then, using transcriptomic analysis, we studied α- and β-keratin gene expression patterns in five types of feather epidermis. The expression patterns of β-keratin genes were different in different feather types, whereas those of α-keratin genes were less variable. In addition, we obtained extensive α- and β-keratin mRNA in situ hybridization data, showing that α-keratins and β-keratins are preferentially expressed in different parts of the feather components. Together, our data suggest that feather morphological and structural diversity can largely be attributed to differential combinations of α- and β-keratin genes in different intrafeather regions and/or feather types from different body parts. The expression profiles provide new insights into the evolutionary origin and diversification of feathers. Finally, functional analysis using mutant chicken keratin forms based on those found in the human α-keratin mutation database led to abnormal phenotypes. This demonstrates that the chicken can be a convenient model for studying the molecular biology of human keratin-based diseases. PMID:25152353

  4. Transcriptome and proteomic analysis of mango (Mangifera indica Linn) fruits.

    PubMed

    Wu, Hong-xia; Jia, Hui-min; Ma, Xiao-wei; Wang, Song-biao; Yao, Quan-sheng; Xu, Wen-tian; Zhou, Yi-gang; Gao, Zhong-shan; Zhan, Ru-lin

    2014-06-13

    Here we used Illumina RNA-seq technology for transcriptome sequencing of a mixed fruit sample from 'Zill' mango (Mangifera indica Linn) fruit pericarp and pulp during the development and ripening stages. RNA-seq generated 68,419,722 sequence reads that were assembled into 54,207 transcripts with a mean length of 858bp, including 26,413 clusters and 27,794 singletons. A total of 42,515(78.43%) transcripts were annotated using public protein databases, with a cut-off E-value above 10(-5), of which 35,198 and 14,619 transcripts were assigned to gene ontology terms and clusters of orthologous groups respectively. Functional annotation against the Kyoto Encyclopedia of Genes and Genomes database identified 23,741(43.79%) transcripts which were mapped to 128 pathways. These pathways revealed many previously unknown transcripts. We also applied mass spectrometry-based transcriptome data to characterize the proteome of ripe fruit. LC-MS/MS analysis of the mango fruit proteome was using tandem mass spectrometry (MS/MS) in an LTQ Orbitrap Velos (Thermo) coupled online to the HPLC. This approach enabled the identification of 7536 peptides that matched 2754 proteins. Our study provides a comprehensive sequence for a systemic view of transcriptome during mango fruit development and the most comprehensive fruit proteome to date, which are useful for further genomics research and proteomic studies. Our study provides a comprehensive sequence for a systemic view of both the transcriptome and proteome of mango fruit, and a valuable reference for further research on gene expression and protein identification. This article is part of a Special Issue entitled: Proteomics of non-model organisms. Copyright © 2014 Elsevier B.V. All rights reserved.

  5. De novo Assembly of Leaf Transcriptome in the Medicinal Plant Andrographis paniculata

    PubMed Central

    Cherukupalli, Neeraja; Divate, Mayur; Mittapelli, Suresh R.; Khareedu, Venkateswara R.; Vudem, Dashavantha R.

    2016-01-01

    Andrographis paniculata is an important medicinal plant containing various bioactive terpenoids and flavonoids. Despite its importance in herbal medicine, no ready-to-use transcript sequence information of this plant is made available in the public data base, this study mainly deals with the sequencing of RNA from A. paniculata leaf using Illumina HiSeq™ 2000 platform followed by the de novo transcriptome assembly. A total of 189.22 million high quality paired reads were generated and 1,70,724 transcripts were predicted in the primary assembly. Secondary assembly generated a transcriptome size of ~88 Mb with 83,800 clustered transcripts. Based on the similarity searches against plant non-redundant protein database, gene ontology, and eukaryotic orthologous groups, 49,363 transcripts were annotated constituting upto 58.91% of the identified unigenes. Annotation of transcripts—using kyoto encyclopedia of genes and genomes database—revealed 5606 transcripts plausibly involved in 140 pathways including biosynthesis of terpenoids and other secondary metabolites. Transcription factor analysis showed 6767 unique transcripts belonging to 97 different transcription factor families. A total number of 124 CYP450 transcripts belonging to seven divergent clans have been identified. Transcriptome revealed 146 different transcripts coding for enzymes involved in the biosynthesis of terpenoids of which 35 contained terpene synthase motifs. This study also revealed 32,341 simple sequence repeats (SSRs) in 23,168 transcripts. Assembled sequences of transcriptome of A. paniculata generated in this study are made available, for the first time, in the TSA database, which provides useful information for functional and comparative genomic analysis besides identification of key enzymes involved in the various pathways of secondary metabolism. PMID:27582746

  6. CyanOmics: an integrated database of omics for the model cyanobacterium Synechococcus sp. PCC 7002.

    PubMed

    Yang, Yaohua; Feng, Jie; Li, Tao; Ge, Feng; Zhao, Jindong

    2015-01-01

    Cyanobacteria are an important group of organisms that carry out oxygenic photosynthesis and play vital roles in both the carbon and nitrogen cycles of the Earth. The annotated genome of Synechococcus sp. PCC 7002, as an ideal model cyanobacterium, is available. A series of transcriptomic and proteomic studies of Synechococcus sp. PCC 7002 cells grown under different conditions have been reported. However, no database of such integrated omics studies has been constructed. Here we present CyanOmics, a database based on the results of Synechococcus sp. PCC 7002 omics studies. CyanOmics comprises one genomic dataset, 29 transcriptomic datasets and one proteomic dataset and should prove useful for systematic and comprehensive analysis of all those data. Powerful browsing and searching tools are integrated to help users directly access information of interest with enhanced visualization of the analytical results. Furthermore, Blast is included for sequence-based similarity searching and Cluster 3.0, as well as the R hclust function is provided for cluster analyses, to increase CyanOmics's usefulness. To the best of our knowledge, it is the first integrated omics analysis database for cyanobacteria. This database should further understanding of the transcriptional patterns, and proteomic profiling of Synechococcus sp. PCC 7002 and other cyanobacteria. Additionally, the entire database framework is applicable to any sequenced prokaryotic genome and could be applied to other integrated omics analysis projects. Database URL: http://lag.ihb.ac.cn/cyanomics. © The Author(s) 2015. Published by Oxford University Press.

  7. De novo transcriptome assembly databases for the butterfly orchid Phalaenopsis equestris

    PubMed Central

    Niu, Shan-Ce; Xu, Qing; Zhang, Guo-Qiang; Zhang, Yong-Qiang; Tsai, Wen-Chieh; Hsu, Jui-Ling; Liang, Chieh-Kai; Luo, Yi-Bo; Liu, Zhong-Jian

    2016-01-01

    Orchids are renowned for their spectacular flowers and ecological adaptations. After the sequencing of the genome of the tropical epiphytic orchid Phalaenopsis equestris, we combined Illumina HiSeq2000 for RNA-Seq and Trinity for de novo assembly to characterize the transcriptomes for 11 diverse P. equestris tissues representing the root, stem, leaf, flower buds, column, lip, petal, sepal and three developmental stages of seeds. Our aims were to contribute to a better understanding of the molecular mechanisms driving the analysed tissue characteristics and to enrich the available data for P. equestris. Here, we present three databases. The first dataset is the RNA-Seq raw reads, which can be used to execute new experiments with different analysis approaches. The other two datasets allow different types of searches for candidate homologues. The second dataset includes the sets of assembled unigenes and predicted coding sequences and proteins, enabling a sequence-based search. The third dataset consists of the annotation results of the aligned unigenes versus the Nonredundant (Nr) protein database, Kyoto Encyclopaedia of Genes and Genomes (KEGG) and Clusters of Orthologous Groups (COG) databases with low e-values, enabling a name-based search. PMID:27673730

  8. An integrative 'omics' solution to the detection of recombinant human erythropoietin and blood doping.

    PubMed

    Pitsiladis, Yannis P; Durussel, Jérôme; Rabin, Olivier

    2014-05-01

    Administration of recombinant human erythropoietin (rHumanEPO) improves sporting performance and hence is frequently subject to abuse by athletes, although rHumanEPO is prohibited by the WADA. Approaches to detect rHumanEPO doping have improved significantly in recent years but remain imperfect. A new transcriptomic-based longitudinal screening approach is being developed that has the potential to improve the analytical performance of current detection methods. In particular, studies are being funded by WADA to identify a 'molecular signature' of rHumanEPO doping and preliminary results are promising. In the first systematic study to be conducted, the expression of hundreds of genes were found to be altered by rHumanEPO with numerous gene transcripts being differentially expressed after the first injection and further transcripts profoundly upregulated during and subsequently downregulated up to 4 weeks postadministration of the drug; with the same transcriptomic pattern observed in all participants. The identification of a blood 'molecular signature' of rHumanEPO administration is the strongest evidence to date that gene biomarkers have the potential to substantially improve the analytical performance of current antidoping methods such as the Athlete Biological Passport for rHumanEPO detection. Given the early promise of transcriptomics, research using an 'omics'-based approach involving genomics, transcriptomics, proteomics and metabolomics should be intensified in order to achieve improved detection of rHumanEPO and other doping substances and methods difficult to detect such a recombinant human growth hormone and blood transfusions.

  9. Antennal Transcriptome Analysis and Comparison of Chemosensory Gene Families in Two Closely Related Noctuidae Moths, Helicoverpa armigera and H. assulta

    PubMed Central

    Zhang, Jin; Wang, Bing; Dong, Shuanglin; Cao, Depan; Dong, Junfeng; Walker, William B.; Liu, Yang; Wang, Guirong

    2015-01-01

    To better understand the olfactory mechanisms in the two lepidopteran pest model species, the Helicoverpa armigera and H. assulta, we conducted transcriptome analysis of the adult antennae using Illumina sequencing technology and compared the chemosensory genes between these two related species. Combined with the chemosensory genes we had identified previously in H. armigera by 454 sequencing, we identified 133 putative chemosensory unigenes in H. armigera including 60 odorant receptors (ORs), 19 ionotropic receptors (IRs), 34 odorant binding proteins (OBPs), 18 chemosensory proteins (CSPs), and 2 sensory neuron membrane proteins (SNMPs). Consistent with these results, 131 putative chemosensory genes including 64 ORs, 19 IRs, 29 OBPs, 17 CSPs, and 2 SNMPs were identified through male and female antennal transcriptome analysis in H. assulta. Reverse Transcription-PCR (RT-PCR) was conducted in H. assulta to examine the accuracy of the assembly and annotation of the transcriptome and the expression profile of these unigenes in different tissues. Most of the ORs, IRs and OBPs were enriched in adult antennae, while almost all the CSPs were expressed in antennae as well as legs. We compared the differences of the chemosensory genes between these two species in detail. Our work will surely provide valuable information for further functional studies of pheromones and host volatile recognition genes in these two related species. PMID:25659090

  10. Alcohol-Induced Molecular Dysregulation in Human Embryonic Stem Cell-Derived Neural Precursor Cells

    PubMed Central

    Kim, Yi Young; Roubal, Ivan; Lee, Youn Soo; Kim, Jin Seok; Hoang, Michael; Mathiyakom, Nathan; Kim, Yong

    2016-01-01

    Adverse effect of alcohol on neural function has been well documented. Especially, the teratogenic effect of alcohol on neurodevelopment during embryogenesis has been demonstrated in various models, which could be a pathologic basis for fetal alcohol spectrum disorders (FASDs). While the developmental defects from alcohol abuse during gestation have been described, the specific mechanisms by which alcohol mediates these injuries have yet to be determined. Recent studies have shown that alcohol has significant effect on molecular and cellular regulatory mechanisms in embryonic stem cell (ESC) differentiation including genes involved in neural development. To test our hypothesis that alcohol induces molecular alterations during neural differentiation we have derived neural precursor cells from pluripotent human ESCs in the presence or absence of ethanol treatment. Genome-wide transcriptomic profiling identified molecular alterations induced by ethanol exposure during neural differentiation of hESCs into neural rosettes and neural precursor cell populations. The Database for Annotation, Visualization and Integrated Discovery (DAVID) functional analysis on significantly altered genes showed potential ethanol’s effect on JAK-STAT signaling pathway, neuroactive ligand-receptor interaction, Toll-like receptor (TLR) signaling pathway, cytokine-cytokine receptor interaction and regulation of autophagy. We have further quantitatively verified ethanol-induced alterations of selected candidate genes. Among verified genes we further examined the expression of P2RX3, which is associated with nociception, a peripheral pain response. We found ethanol significantly reduced the level of P2RX3 in undifferentiated hESCs, but induced the level of P2RX3 mRNA and protein in hESC-derived NPCs. Our result suggests ethanol-induced dysregulation of P2RX3 along with alterations in molecules involved in neural activity such as neuroactive ligand-receptor interaction may be a molecular event associated with alcohol-related peripheral neuropathy of an enhanced nociceptive response. PMID:27682028

  11. Cytoplasmic Acidification and the Benzoate Transcriptome in Bacillus subtilis

    PubMed Central

    Kitko, Ryan D.; Cleeton, Rebecca L.; Armentrout, Erin I.; Lee, Grace E.; Noguchi, Ken; Berkmen, Melanie B.; Jones, Brian D.; Slonczewski, Joan L.

    2009-01-01

    Background Bacillus subtilis encounters a wide range of environmental pH. The bacteria maintain cytoplasmic pH within a narrow range. Response to acid stress is a poorly understood function of external pH and of permeant acids that conduct protons into the cytoplasm. Methods and Principal Findings Cytoplasmic acidification and the benzoate transcriptome were observed in Bacillus subtilis. Cytoplasmic pH was measured with 4-s time resolution using GFPmut3b fluorimetry. Rapid external acidification (pH 7.5 to 6.0) acidified the B. subtilis cytoplasm, followed by partial recovery. Benzoate addition up to 60 mM at external pH 7 depressed cytoplasmic pH but left a transmembrane ΔpH permitting growth; this robust adaptation to benzoate exceeds that seen in E. coli. Cytoplasmic pH was depressed by 0.3 units during growth with 30 mM benzoate. The transcriptome of benzoate-adapted cells was determined by comparing 4,095 gene expression indices following growth at pH 7, +/− 30 mM benzoate. 164 ORFs showed ≥2-fold up-regulation by benzoate (30 mM benzoate/0 mM), and 102 ORFs showed ≥2-fold down-regulation. 42% of benzoate-dependent genes are regulated up or down, respectively, at pH 6 versus pH 7; they are candidates for cytoplasmic pH response. Acid-stress genes up-regulated by benzoate included drug resistance genes (yhbI, yhcA, yuxJ, ywoGH); an oligopeptide transporter (opp); glycine catabolism (gcvPA-PB); acetate degradation (acsA); dehydrogenases (ald, fdhD, serA, yrhEFG, yjgCD); the TCA cycle (citZ, icd, mdh, sucD); and oxidative stress (OYE-family yqjM, ohrB). Base-stress genes down-regulated by benzoate included malate metabolism (maeN), sporulation control (spo0M, spo0E), and the SigW alkali shock regulon. Cytoplasmic pH could mediate alkali-shock induction of SigW. Conclusions B. subtilis maintains partial pH homeostasis during growth, and withstands high concentrations of permeant acid stress, higher than for gram-negative neutralophile E. coli. The benzoate adaptation transcriptome substantially overlaps that of external acid, contributing to a cytoplasmic pH transcriptome. PMID:20011599

  12. Human and feline adipose-derived mesenchymal stem cells have comparable phenotype, immunomodulatory functions, and transcriptome.

    PubMed

    Clark, Kaitlin C; Fierro, Fernando A; Ko, Emily Mills; Walker, Naomi J; Arzi, Boaz; Tepper, Clifford G; Dahlenburg, Heather; Cicchetto, Andrew; Kol, Amir; Marsh, Lyndsey; Murphy, William J; Fazel, Nasim; Borjesson, Dori L

    2017-03-20

    Adipose-derived mesenchymal stem cells (ASCs) are a promising cell therapy to treat inflammatory and immune-mediated diseases. Development of appropriate pre-clinical animal models is critical to determine safety and attain early efficacy data for the most promising therapeutic candidates. Naturally occurring diseases in cats already serve as valuable models to inform human clinical trials in oncologic, cardiovascular, and genetic diseases. The objective of this study was to complete a comprehensive side-by-side comparison of human and feline ASCs, with an emphasis on their immunomodulatory capacity and transcriptome. Human and feline ASCs were evaluated for phenotype, immunomodulatory profile, and transcriptome. Additionally, transwells were used to determine the role of cell-cell contact in ASC-mediated inhibition of lymphocyte proliferation in both humans and cats. Similar to human ASCs, feline ASCs were highly proliferative at low passages and fit the minimal criteria of multipotent stem cells including a compatible surface protein phenotype, osteogenic capacity, and normal karyotype. Like ASCs from all species, feline ASCs inhibited mitogen-activated lymphocyte proliferation in vitro, with or without direct ASC-lymphocyte contact. Feline ASCs mimic human ASCs in their mediator secretion pattern, including prostaglandin E2, indoleamine 2,3 dioxygenase, transforming growth factor beta, and interleukin-6, all augmented by interferon gamma secretion by lymphocytes. The transcriptome of three unactivated feline ASC lines were highly similar. Functional analysis of the most highly expressed genes highlighted processes including: 1) the regulation of apoptosis; 2) cell adhesion; 3) response to oxidative stress; and 4) regulation of cell differentiation. Finally, feline ASCs had a similar gene expression profile to noninduced human ASCs. Findings suggest that feline ASCs modulate lymphocyte proliferation using soluble mediators that mirror the human ASC secretion pattern. Uninduced feline ASCs have similar gene expression profiles to uninduced human ASCs, as revealed by transcriptome analysis. These data will help inform clinical trials using cats with naturally occurring diseases as surrogate models for human clinical trials in the regenerative medicine arena.

  13. Changes in the Transcriptome of the Human Endometrial Ishikawa Cancer Cell Line Induced by Estrogen, Progesterone, Tamoxifen, and Mifepristone (RU486) as Detected by RNA-Sequencing

    PubMed Central

    Tamm-Rosenstein, Karin; Simm, Jaak; Suhorutshenko, Marina; Salumets, Andres; Metsis, Madis

    2013-01-01

    Background Estrogen (E2) and progesterone (P4) are key players in the maturation of the human endometrium. The corresponding steroid hormone modulators, tamoxifen (TAM) and mifepristone (RU486) are widely used in breast cancer therapy and for contraception purposes, respectively. Methodology/Principal findings Gene expression profiling of the human endometrial Ishikawa cancer cell line treated with E2 and P4 for 3 h and 12 h, and TAM and RU486 for 12 h, was performed using RNA-sequencing. High levels of mRNA were detected for genes, including PSAP, ATP5G2, ATP5H, and GNB2L1 following E2 or P4 treatment. A total of 82 biomarkers for endometrial biology were identified among E2 induced genes, and 93 among P4 responsive genes. Identified biomarkers included: EZH2, MDK, MUC1, SLIT2, and IL6ST, which are genes previously associated with endometrial receptivity. Moreover, 98.8% and 98.6% of E2 and P4 responsive genes in Ishikawa cells, respectively, were also detected in two human mid-secretory endometrial biopsy samples. TAM treatment exhibited both antagonistic and agonistic effects of E2, and also regulated a subset of genes independently. The cell cycle regulator cyclin D1 (CCND1) showed significant up-regulation following treatment with TAM. RU486 did not appear to act as a pure antagonist of P4 and a functional analysis of RU486 response identified genes related to adhesion and apoptosis, including down-regulated genes associated with cell-cell contacts and adhesion as CTNND1, JUP, CDH2, IQGAP1, and COL2A1. Conclusions Significant changes in gene expression by the Ishikawa cell line were detected after treatments with E2, P4, TAM, and RU486. These transcriptome data provide valuable insight into potential biomarkers related to endometrial receptivity, and also facilitate an understanding of the molecular changes that take place in the endometrium in the early stages of breast cancer treatment and contraception usage. PMID:23874806

  14. Transcriptome Profiling of the Abdominal Skin of Larimichthys crocea in Light Stress

    NASA Astrophysics Data System (ADS)

    Han, Zhaofang; Lv, Changhuan; Xiao, Shijun; Ye, Kun; Zhang, Dongling; Tsai, Huai Jen; Wang, Zhiyong

    2018-04-01

    Large yellow croaker ( Larimichthys crocea), one of the most important marine fish species in China, can change its abdominal skin color when it is shifted from light to dark or from dark to light, providing us an opportunity of investigating the molecular responding mechanism of teleost in light stress. The gene expression profile of fish under light stress is rarely documented. In this research, the transcriptome profiles of the abdominal skin of L. crocea exposed to light or dark for 0 h, 0.5 h and 2 h were produced by next-generation sequencing (NGS). The cluster results demonstrated that stress period, rather than light intensity ( e.g., light or dark), is the major influencing factor. Differently expressed genes (DEGs) were identified between 0 h and 0.5 h groups, between 0 h and 2 h groups, between 0.5 h light and 0.5 h dark, and between 2 h light and 2 h dark, respectively. The gene ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) annotation revealed that the genes relating to immunity, energy metabolism, and cytoskeletal protein binding were significantly enriched. The detailed analysis of transcriptome profiles also revealed regular gene expression trends, indicating that the elaborate gene regulation networks underlined the molecular responses of the fish to light stress. This transcriptome analysis suggested that systematic and complicated regulatory cascades were functionally activated in response to external stress, and coloration change caused by light stress was mainly attributed to the change in the density of chromatophores for L. crocea. This study also provided valuable information for skin coloration or light stress research on other marine fish species.

  15. Reefgenomics.Org - a repository for marine genomics data.

    PubMed

    Liew, Yi Jin; Aranda, Manuel; Voolstra, Christian R

    2016-01-01

    Over the last decade, technological advancements have substantially decreased the cost and time of obtaining large amounts of sequencing data. Paired with the exponentially increased computing power, individual labs are now able to sequence genomes or transcriptomes to investigate biological questions of interest. This has led to a significant increase in available sequence data. Although the bulk of data published in articles are stored in public sequence databases, very often, only raw sequencing data are available; miscellaneous data such as assembled transcriptomes, genome annotations etc. are not easily obtainable through the same means. Here, we introduce our website (http://reefgenomics.org) that aims to centralize genomic and transcriptomic data from marine organisms. Besides providing convenient means to download sequences, we provide (where applicable) a genome browser to explore available genomic features, and a BLAST interface to search through the hosted sequences. Through the interface, multiple datasets can be queried simultaneously, allowing for the retrieval of matching sequences from organisms of interest. The minimalistic, no-frills interface reduces visual clutter, making it convenient for end-users to search and explore processed sequence data. DATABASE URL: http://reefgenomics.org. © The Author(s) 2016. Published by Oxford University Press.

  16. Transcriptome analysis in Concholepas concholepas (Gastropoda, Muricidae): mining and characterization of new genomic and molecular markers.

    PubMed

    Cárdenas, Leyla; Sánchez, Roland; Gomez, Daniela; Fuenzalida, Gonzalo; Gallardo-Escárate, Cristián; Tanguy, Arnaud

    2011-09-01

    The marine gastropod Concholepas concholepas, locally known as the "loco", is the main target species of the benthonic Chilean fisheries. Genetic and genomic tools are necessary to study the genome of this species in order to understand the molecular basis of its development, growth, and other key traits to improve the management strategies and to identify local adaptation to prevent loss of biodiversity. Here, we use pyrosequencing technologies to generate the first transcriptomic database from adult specimens of the loco. After trimming, a total of 140,756 Expressed Sequence Tag sequences were achieved. Clustering and assembly analysis identified 19,219 contigs and 105,435 singleton sequences. BlastN analysis showed a significant identity with Expressed Sequence Tags of different gastropod species available in public databases. Similarly, BlastX results showed that only 895 out of the total 124,654 had significant hits and may represent novel genes for marine gastropods. From this database, simple sequence repeat motifs were also identified and a total of 38 primer pairs were designed and tested to assess their potential as informative markers and to investigate their cross-species amplification in different related gastropod species. This dataset represents the first publicly available 454 data for a marine gastropod endemic to the southeastern Pacific coast, providing a valuable transcriptomic resource for future efforts of gene discovery and development of functional markers in other marine gastropods. Copyright © 2011 Elsevier B.V. All rights reserved.

  17. First insights into the giant panda (Ailuropoda melanoleuca) blood transcriptome: a resource for novel gene loci and immunogenetics.

    PubMed

    Du, Lianming; Li, Wujiao; Fan, Zhenxin; Shen, Fujun; Yang, Mingyu; Wang, Zili; Jian, Zuoyi; Hou, Rong; Yue, Bisong; Zhang, Xiuyue

    2015-07-01

    The giant panda (Ailuropoda melanoleuca) is one of the most famous flagship species for conservation, and its draft genome has recently been assembled. However, the transcriptome is not yet available. In this study, the blood transcriptomes of three pandas were characterized and about 160 million sequencing reads were generated using Illumina HiSeq 2000 paired-end sequencing technology. The assembly yielded 92 598 transcripts with an average length of 1626 bp and N50 length of 2842 bp. Based on a sequence similarity search against nonredundant (nr) protein database, a total of 38 522 (41.6%) transcripts were annotated. Of these annotated transcripts, 25 142 and 8272 transcripts were assigned to gene ontology terms and clusters of orthologous group, respectively. A search against the Kyoto Encyclopedia of Genes and Genomes Pathway database (KEGG) indicated that 9098 (9.83%) transcripts mapped to 324 KEGG pathways, and the best represented functional categories of pathways were signal transduction and immune system. We have also identified 23 460 microsatellites, 43 560 SNPs as well as 21 456 alternative splicing events in the assembly. Additionally, a total of 24 341 complete open reading frames (ORFs) were detected from the assembly where 1492 ORFs were found to be novel gene loci as these have not been annotated so far in any public database. © 2014 John Wiley & Sons Ltd.

  18. Tox-Database.net: a curated resource for data describing chemical triggered in vitro cardiac ion channels inhibition

    PubMed Central

    2012-01-01

    Background Drugs safety issues are now recognized as being factors generating the most reasons for drug withdrawals at various levels of development and at the post-approval stage. Among them cardiotoxicity remains the main reason, despite the substantial effort put into in vitro and in vivo testing, with the main focus put on hERG channel inhibition as the hypothesized surrogate of drug proarrhythmic potency. The large interest in the IKr current has resulted in the development of predictive tools and informative databases describing a drug's susceptibility to interactions with the hERG channel, although there are no similar, publicly available sets of data describing other ionic currents driven by the human cardiomyocyte ionic channels, which are recognized as an overlooked drug safety target. Discussion The aim of this database development and publication was to provide a scientifically useful, easily usable and clearly verifiable set of information describing not only IKr (hERG), but also other human cardiomyocyte specific ionic channels inhibition data (IKs, INa, ICa). Summary The broad range of data (chemical space and in vitro settings) and the easy to use user interface makes tox-database.net a useful tool for interested scientists. Database URL http://tox-database.net. PMID:22947121

  19. Pigmentation Is Associated with Stemness Hierarchy of Progenitor Cells Within Cultured Limbal Epithelial Cells.

    PubMed

    Liu, Lei; Nielsen, Frederik Mølgaard; Emmersen, Jeppe; Bath, Chris; Hjortdal, Jesper Østergaard; Riis, Simone; Fink, Trine; Pennisi, Cristian Pablo; Zachar, Vladimir

    2018-05-20

    Ex-vivo cultured human limbal epithelial stem/progenitor cells (hLESCs) are the main source for regenerative therapy of limbal stem cell deficiency (LSCD), which is worldwide one of the major causes of corneal blindness. Despite many stemness-associated markers have been identified within the limbal niche, the phenotype of the earliest hLESCs has not been hitherto identified. We sought to confirm or refute the use of tumor protein p63 (p63) and ATP binding cassette subfamily B member 5 (ABCB5) as surrogate markers for hLESCs early within the limbal differentiation hierarchy. Based on a robust fluorescence-activated cell sorting (FACS) and subsequent RNA isolation protocol, a comprehensive transcriptomic profile was obtained from four subpopulations of cultured hLESCs. The subpopulations were defined by co-expression of two putative stem/progenitor markers, the p63 and ABCB5, and the corneal differentiation marker cytokeratin 3 (CK3). A comparative transcriptomic analysis yielded novel data that indicated association between pigmentation and differentiation, with the p63 positive populations being the most pigmented and immature of the progenitors. In contrast, ABCB5, either alone or in co-expression patterns, identified more committed progenitor cells with less pigmentation. In conclusion, p63 is superior to ABCB5 as a marker for stemness. This article is protected by copyright. All rights reserved. © 2018 AlphaMed Press.

  20. Expression Profiling Smackdown: Human Transcriptome Array HTA 2.0 vs. RNA-Seq

    PubMed Central

    Palermo, Meghann; Driscoll, Heather; Tighe, Scott; Dragon, Julie; Bond, Jeff; Shukla, Arti; Vangala, Mahesh; Vincent, James; Hunter, Tim

    2014-01-01

    The advent of both microarray and massively parallel sequencing have revolutionized high-throughput analysis of the human transcriptome. Due to limitations in microarray technology, detecting and quantifying coding transcript isoforms, in addition to non-coding transcripts, has been challenging. As a result, RNA-Seq has been the preferred method for characterizing the full human transcriptome, until now. A new high-resolution array from Affymetrix, GeneChip Human Transcriptome Array 2.0 (HTA 2.0), has been designed to interrogate all transcript isoforms in the human transcriptome with >6 million probes targeting coding transcripts, exon-exon splice junctions, and non-coding transcripts. Here we compare expression results from GeneChip HTA 2.0 and RNA-Seq data using identical RNA extractions from three samples each of healthy human mesothelial cells in culture, LP9-C1, and healthy mesothelial cells treated with asbestos, LP9-A1. For GeneChip HTA 2.0 sample preparation, we chose to compare two target preparation methods, NuGEN Ovation Pico WTA V2 with the Encore Biotin Module versus Affymetrix's GeneChip WT PLUS with the WT Terminal Labeling Kit, on identical RNA extractions from both untreated and treated samples. These same RNA extractions were used for the RNA-Seq library preparation. All analyses were performed in Partek Genomics Suite 6.6. Expression profiles for control and asbestos-treated mesothelial cells prepared with NuGEN versus Affymetrix target preparation methods (GeneChip HTA 2.0) are compared to each other as well as to RNA-Seq results.

  1. Transcriptome analysis and de novo annotation of the critically endangered Amur sturgeon (Acipenser schrenckii).

    PubMed

    Zhang, X J; Jiang, H Y; Li, L M; Yuan, L H; Chen, J P

    2016-06-20

    The aim of this study was to provide comprehensive insights into the genetic background of sturgeon by transcriptome study. We performed a de novo assembly of the Amur sturgeon Acipenser schrenckii transcriptome using Illumina Hiseq 2000 sequencing. A total of 148,817 non-redundant unigenes with base length of approximately 121,698,536 bp and ranges from 201 to 26,789 bp were obtained. All the unigenes were classified into 3368 distinct categories and 145,449 singletons by homologous transcript cluster analysis. In all, 46,865 (31.49%) unigenes showed homologous matches with Nr database and 32,214 (21.65%) unigenes were matched to Nt database. In total, 24,862 unigenes were categorized into significantly enriched 52 function groups by GO analysis, and 38,436 unigenes were classified into 25 groups by KOG prediction, as well as 128 enriched KEGG pathways were identified by 45,598 unigenes (P < 0.05). Subsequently, a total of 19,860 SSRs markers were identified with the abundant di-nucleotide type (10,658; 53.67%) and the most AT/TA motif repeats (2689; 13.54%). A total of 1341 conserved lncRNAs were identified by a customized pipeline. Our study provides new sequence and function information for A. schrenckii, which will be the basis for further genetic studies on sturgeon species. The huge number of potential SSRs and putatively conserved lncRNAs isolated by the transcriptome also shed light on research in many fields, including the evolution, conservation management, and biological processes in sturgeon.

  2. Feasibility of the salivary transcriptome as a novel biomarker in determining disease susceptibility.

    PubMed

    Hidayat, M F H; Milne, T; Cullinan, M P; Seymour, G J

    2018-06-01

    The salivary transcriptome may present as a readily available and non-invasive source of potential biomarkers. The development of chronic periodontitis is determined by individual patient susceptibility; hence, the aim of this study was to determine the potential of the salivary transcriptome as a biomarker of disease susceptibility using chronic periodontitis as an example. Using an Oragene ® RNA kit, the total RNA was purified from the saliva of 10 patients with chronic periodontitis and 10 patients without chronic periodontitis. The quantity and quality of the total RNA was determined, and a measure of gene expression via cDNA was undertaken using the Affymetrix microarray system. The microarray profiling result was further validated by real-time quantitative polymerase chain reaction. Spectrophotometric analysis showed the total RNA purified from each participant ranged from 0.92 μg/500 μL to 62.85 μg/500 μL. There was great variability in the quantity of total RNA obtained from the 2 groups in the study with a mean of 10.21 ± 12.71 μg/500 μL for the periodontitis group and 15.97 ± 23.47 μg/500 μL for the control group. Further the RNA purity (based on the A 260 /A 280 ratio) for the majority of participants (9 periodontitis and 6 controls) were within the acceptable limits for downstream analysis (2.0 ± 0.1). The study samples, showed 2 distinct bands at 23S (3800 bp) and 16S (1500 bp) characteristic of bacterial rRNA. Preliminary microarray analysis was performed for 4 samples (P2, P6, H5 and H9). The percentage of genes present in each of the 4 samples was not consistent with about 1.8%-18.7% of genes being detected. Quantitative real-time polymerase chain reaction confirmed that the total RNA purified from each sample was mainly bacterial RNA (Uni 16S) with minimal human mRNA. This study showed that minimal amounts of human RNA were able to be isolated from the saliva of patients with periodontitis as well as controls. Further work is required to enhance the extraction process of human mRNA from saliva if the salivary transcriptome is to be used in determining individual patient susceptibility. © 2017 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.

  3. The Yak genome database: an integrative database for studying yak biology and high-altitude adaption

    PubMed Central

    2012-01-01

    Background The yak (Bos grunniens) is a long-haired bovine that lives at high altitudes and is an important source of milk, meat, fiber and fuel. The recent sequencing, assembly and annotation of its genome are expected to further our understanding of the means by which it has adapted to life at high altitudes and its ecologically important traits. Description The Yak Genome Database (YGD) is an internet-based resource that provides access to genomic sequence data and predicted functional information concerning the genes and proteins of Bos grunniens. The curated data stored in the YGD includes genome sequences, predicted genes and associated annotations, non-coding RNA sequences, transposable elements, single nucleotide variants, and three-way whole-genome alignments between human, cattle and yak. YGD offers useful searching and data mining tools, including the ability to search for genes by name or using function keywords as well as GBrowse genome browsers and/or BLAST servers, which can be used to visualize genome regions and identify similar sequences. Sequence data from the YGD can also be downloaded to perform local searches. Conclusions A new yak genome database (YGD) has been developed to facilitate studies on high-altitude adaption and bovine genomics. The database will be continuously updated to incorporate new information such as transcriptome data and population resequencing data. The YGD can be accessed at http://me.lzu.edu.cn/yak. PMID:23134687

  4. ChiTaRS-3.1-the enhanced chimeric transcripts and RNA-seq database matched with protein-protein interactions.

    PubMed

    Gorohovski, Alessandro; Tagore, Somnath; Palande, Vikrant; Malka, Assaf; Raviv-Shay, Dorith; Frenkel-Morgenstern, Milana

    2017-01-04

    Discovery of chimeric RNAs, which are produced by chromosomal translocations as well as the joining of exons from different genes by trans-splicing, has added a new level of complexity to our study and understanding of the transcriptome. The enhanced ChiTaRS-3.1 database (http://chitars.md.biu.ac.il) is designed to make widely accessible a wealth of mined data on chimeric RNAs, with easy-to-use analytical tools built-in. The database comprises 34 922: chimeric transcripts along with 11 714: cancer breakpoints. In this latest version, we have included multiple cross-references to GeneCards, iHop, PubMed, NCBI, Ensembl, OMIM, RefSeq and the Mitelman collection for every entry in the 'Full Collection'. In addition, for every chimera, we have added a predicted Chimeric Protein-Protein Interaction (ChiPPI) network, which allows for easy visualization of protein partners of both parental and fusion proteins for all human chimeras. The database contains a comprehensive annotation for 34 922: chimeric transcripts from eight organisms, and includes the manual annotation of 200 sense-antiSense (SaS) chimeras. The current improvements in the content and functionality to the ChiTaRS database make it a central resource for the study of chimeric transcripts and fusion proteins. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.

  5. BGDB: a database of bivalent genes.

    PubMed

    Li, Qingyan; Lian, Shuabin; Dai, Zhiming; Xiang, Qian; Dai, Xianhua

    2013-01-01

    Bivalent gene is a gene marked with both H3K4me3 and H3K27me3 epigenetic modification in the same area, and is proposed to play a pivotal role related to pluripotency in embryonic stem (ES) cells. Identification of these bivalent genes and understanding their functions are important for further research of lineage specification and embryo development. So far, lots of genome-wide histone modification data were generated in mouse and human ES cells. These valuable data make it possible to identify bivalent genes, but no comprehensive data repositories or analysis tools are available for bivalent genes currently. In this work, we develop BGDB, the database of bivalent genes. The database contains 6897 bivalent genes in human and mouse ES cells, which are manually collected from scientific literature. Each entry contains curated information, including genomic context, sequences, gene ontology and other relevant information. The web services of BGDB database were implemented with PHP + MySQL + JavaScript, and provide diverse query functions. Database URL: http://dailab.sysu.edu.cn/bgdb/

  6. Mice carrying a human GLUD2 gene recapitulate aspects of human transcriptome and metabolome development

    PubMed Central

    Li, Qian; Guo, Song; Jiang, Xi; Bryk, Jaroslaw; Naumann, Ronald; Enard, Wolfgang; Tomita, Masaru; Sugimoto, Masahiro; Khaitovich, Philipp; Pääbo, Svante

    2016-01-01

    Whereas all mammals have one glutamate dehydrogenase gene (GLUD1), humans and apes carry an additional gene (GLUD2), which encodes an enzyme with distinct biochemical properties. We inserted a bacterial artificial chromosome containing the human GLUD2 gene into mice and analyzed the resulting changes in the transcriptome and metabolome during postnatal brain development. Effects were most pronounced early postnatally, and predominantly genes involved in neuronal development were affected. Remarkably, the effects in the transgenic mice partially parallel the transcriptome and metabolome differences seen between humans and macaques analyzed. Notably, the introduction of GLUD2 did not affect glutamate levels in mice, consistent with observations in the primates. Instead, the metabolic effects of GLUD2 center on the tricarboxylic acid cycle, suggesting that GLUD2 affects carbon flux during early brain development, possibly supporting lipid biosynthesis. PMID:27118840

  7. Human neural crest cells display molecular and phenotypic hallmarks of stem cells

    PubMed Central

    Thomas, Sophie; Thomas, Marie; Wincker, Patrick; Babarit, Candice; Xu, Puting; Speer, Marcy C.; Munnich, Arnold; Lyonnet, Stanislas; Vekemans, Michel; Etchevers, Heather C.

    2008-01-01

    The fields of both developmental and stem cell biology explore how functionally distinct cell types arise from a self-renewing founder population. Multipotent, proliferative human neural crest cells (hNCC) develop toward the end of the first month of pregnancy. It is assumed that most differentiate after migrating throughout the organism, although in animal models neural crest stem cells reportedly persist in postnatal tissues. Molecular pathways leading over time from an invasive mesenchyme to differentiated progeny such as the dorsal root ganglion, the maxillary bone or the adrenal medulla are altered in many congenital diseases. To identify additional components of such pathways, we derived and maintained self-renewing hNCC lines from pharyngulas. We show that, unlike their animal counterparts, hNCC are able to self-renew ex vivo under feeder-free conditions. While cross species comparisons showed extensive overlap between human, mouse and avian NCC transcriptomes, some molecular cascades are only active in the human cells, correlating with phenotypic differences. Furthermore, we found that the global hNCC molecular profile is highly similar to that of pluripotent embryonic stem cells when compared with other stem cell populations or hNCC derivatives. The pluripotency markers NANOG, POU5F1 and SOX2 are also expressed by hNCC, and a small subset of transcripts can unambiguously identify hNCC among other cell types. The hNCC molecular profile is thus both unique and globally characteristic of uncommitted stem cells. PMID:18689800

  8. Genome-wide identification of expression quantitative trait loci for human telomerase.

    PubMed

    Kim, Hanseol; Ryu, Jihye; Lee, Chaeyoung

    2016-10-01

    A genome-wide association study was conducted to identify expression quantitative trait loci (eQTL) for human telomerase.We tested the genetic associations of nucleotide variants with expression of the genes encoding human telomerase reverse transcriptase (hTERT) and telomerase RNA components (TERC) in lymphoblastoid cell lines derived from 373 Europeans.Our results revealed 6 eQTLs associated with hTERT (P < 5 × 10). One eQTL (rs17755753) was located in the intron 1 of the gene encoding R-spondin-3 (RSPO3), a well-known Wnt signaling regulator. Transcriptome-wide association analysis for these eQTLs revealed their additional associations with the expression of 29 genes (P < 4.75 × 10), including prickle planar cell polarity protein 2 (PRICKLE2) gene important for the Wnt signaling pathway. This concurs with previous studies in which significant expressional relationships between hTERT and some genes (β-catenin and Wnt-3a) in the Wnt signaling pathway have been observed.This study suggested 6 novel eQTLs for hTERT and the association of hTERT with the Wnt signaling pathway. Further studies are needed to understand their underlying mechanisms to improve our understanding of the role of hTERT in cancer.

  9. The vagal ganglia transcriptome identifies candidate therapeutics for airway hyperreactivity.

    PubMed

    Reznikov, Leah R; Meyerholz, David K; Abou Alaiwa, Mahmoud H; Kuan, Shin-Ping; Liao, Yan-Shin J; Bormann, Nicholas L; Bair, Thomas B; Price, Margaret; Stoltz, David A; Welsh, Michael J

    2018-04-05

    Mainstay therapeutics are ineffective in some people with asthma, suggesting a need for additional agents. In the current study, we used vagal ganglia transcriptome profiling and connectivity mapping to identify compounds beneficial for alleviating airway hyperreactivity. As a comparison, we also utilized previously published transcriptome data from sensitized mouse lungs and human asthmatic endobronchial biopsies. All transcriptomes revealed agents beneficial for mitigating airway hyperreactivity; however, only the vagal ganglia transcriptome identified agents used clinically to treat asthma (flunisolide, isoetarine). We also tested one compound identified by vagal ganglia transcriptome profiling that had not previously been linked to asthma and found that it had bronchodilator effects in both mouse and pig airways. These data suggest that transcriptome profiling of the vagal ganglia might be a novel strategy to identify potential asthma therapeutics.

  10. Host-associated bacterial taxa from Chlorobi, Chloroflexi, GN02, Synergistetes, SR1, TM7, and WPS-2 Phyla/candidate divisions

    PubMed Central

    Camanocha, Anuj; Dewhirst, Floyd E.

    2014-01-01

    Background and objective In addition to the well-known phyla Firmicutes, Proteobacteria, Bacteroidetes, Actinobacteria, Spirochaetes, Fusobacteria, Tenericutes, and Chylamydiae, the oral microbiomes of mammals contain species from the lesser-known phyla or candidate divisions, including Synergistetes, TM7, Chlorobi, Chloroflexi, GN02, SR1, and WPS-2. The objectives of this study were to create phyla-selective 16S rDNA PCR primer pairs, create selective 16S rDNA clone libraries, identify novel oral taxa, and update canine and human oral microbiome databases. Design 16S rRNA gene sequences for members of the lesser-known phyla were downloaded from GenBank and Greengenes databases and aligned with sequences in our RNA databases. Primers with potential phylum level selectivity were designed heuristically with the goal of producing nearly full-length 16S rDNA amplicons. The specificity of primer pairs was examined by making clone libraries from PCR amplicons and determining phyla identity by BLASTN analysis. Results Phylum-selective primer pairs were identified that allowed construction of clone libraries with 96–100% specificity for each of the lesser-known phyla. From these clone libraries, seven human and two canine novel oral taxa were identified and added to their respective taxonomic databases. For each phylum, genome sequences closest to human oral taxa were identified and added to the Human Oral Microbiome Database to facilitate metagenomic, transcriptomic, and proteomic studies that involve tiling sequences to the most closely related taxon. While examining ribosomal operons in lesser-known phyla from single-cell genomes and metagenomes, we identified a novel rRNA operon order (23S-5S-16S) in three SR1 genomes and the splitting of the 23S rRNA gene by an I-CeuI-like homing endonuclease in a WPS-2 genome. Conclusions This study developed useful primer pairs for making phylum-selective 16S rRNA clone libraries. Phylum-specific libraries were shown to be useful for identifying previously unrecognized taxa in lesser-known phyla and would be useful for future environmental and host-associated studies. PMID:25317252

  11. Bio-crude transcriptomics: gene discovery and metabolic network reconstruction for the biosynthesis of the terpenome of the hydrocarbon oil-producing green alga, Botryococcus braunii race B (Showa).

    PubMed

    Molnár, István; Lopez, David; Wisecaver, Jennifer H; Devarenne, Timothy P; Weiss, Taylor L; Pellegrini, Matteo; Hackett, Jeremiah D

    2012-10-30

    Microalgae hold promise for yielding a biofuel feedstock that is sustainable, carbon-neutral, distributed, and only minimally disruptive for the production of food and feed by traditional agriculture. Amongst oleaginous eukaryotic algae, the B race of Botryococcus braunii is unique in that it produces large amounts of liquid hydrocarbons of terpenoid origin. These are comparable to fossil crude oil, and are sequestered outside the cells in a communal extracellular polymeric matrix material. Biosynthetic engineering of terpenoid bio-crude production requires identification of genes and reconstruction of metabolic pathways responsible for production of both hydrocarbons and other metabolites of the alga that compete for photosynthetic carbon and energy. A de novo assembly of 1,334,609 next-generation pyrosequencing reads form the Showa strain of the B race of B. braunii yielded a transcriptomic database of 46,422 contigs with an average length of 756 bp. Contigs were annotated with pathway, ontology, and protein domain identifiers. Manual curation allowed the reconstruction of pathways that produce terpenoid liquid hydrocarbons from primary metabolites, and pathways that divert photosynthetic carbon into tetraterpenoid carotenoids, diterpenoids, and the prenyl chains of meroterpenoid quinones and chlorophyll. Inventories of machine-assembled contigs are also presented for reconstructed pathways for the biosynthesis of competing storage compounds including triacylglycerol and starch. Regeneration of S-adenosylmethionine, and the extracellular localization of the hydrocarbon oils by active transport and possibly autophagy are also investigated. The construction of an annotated transcriptomic database, publicly available in a web-based data depository and annotation tool, provides a foundation for metabolic pathway and network reconstruction, and facilitates further omics studies in the absence of a genome sequence for the Showa strain of B. braunii, race B. Further, the transcriptome database empowers future biosynthetic engineering approaches for strain improvement and the transfer of desirable traits to heterologous hosts.

  12. De Novo Assembly, Gene Annotation, and Marker Discovery in Stored-Product Pest Liposcelis entomophila (Enderlein) Using Transcriptome Sequences

    PubMed Central

    Wei, Dan-Dan; Chen, Er-Hu; Ding, Tian-Bo; Chen, Shi-Chun; Dou, Wei; Wang, Jin-Jun

    2013-01-01

    Background As a major stored-product pest insect, Liposcelis entomophila has developed high levels of resistance to various insecticides in grain storage systems. However, the molecular mechanisms underlying resistance and environmental stress have not been characterized. To date, there is a lack of genomic information for this species. Therefore, studies aimed at profiling the L. entomophila transcriptome would provide a better understanding of the biological functions at the molecular levels. Methodology/Principal Findings We applied Illumina sequencing technology to sequence the transcriptome of L. entomophila. A total of 54,406,328 clean reads were obtained and that de novo assembled into 54,220 unigenes, with an average length of 571 bp. Through a similarity search, 33,404 (61.61%) unigenes were matched to known proteins in the NCBI non-redundant (Nr) protein database. These unigenes were further functionally annotated with gene ontology (GO), cluster of orthologous groups of proteins (COG), and Kyoto Encyclopedia of Genes and Genomes (KEGG) databases. A large number of genes potentially involved in insecticide resistance were manually curated, including 68 putative cytochrome P450 genes, 37 putative glutathione S-transferase (GST) genes, 19 putative carboxyl/cholinesterase (CCE) genes, and other 126 transcripts to contain target site sequences or encoding detoxification genes representing eight types of resistance enzymes. Furthermore, to gain insight into the molecular basis of the L. entomophila toward thermal stresses, 25 heat shock protein (Hsp) genes were identified. In addition, 1,100 SSRs and 57,757 SNPs were detected and 231 pairs of SSR primes were designed for investigating the genetic diversity in future. Conclusions/Significance We developed a comprehensive transcriptomic database for L. entomophila. These sequences and putative molecular markers would further promote our understanding of the molecular mechanisms underlying insecticide resistance or environmental stress, and will facilitate studies on population genetics for psocids, as well as providing useful information for functional genomic research in the future. PMID:24244605

  13. Bio-crude transcriptomics: Gene discovery and metabolic network reconstruction for the biosynthesis of the terpenome of the hydrocarbon oil-producing green alga, Botryococcus braunii race B (Showa)*

    PubMed Central

    2012-01-01

    Background Microalgae hold promise for yielding a biofuel feedstock that is sustainable, carbon-neutral, distributed, and only minimally disruptive for the production of food and feed by traditional agriculture. Amongst oleaginous eukaryotic algae, the B race of Botryococcus braunii is unique in that it produces large amounts of liquid hydrocarbons of terpenoid origin. These are comparable to fossil crude oil, and are sequestered outside the cells in a communal extracellular polymeric matrix material. Biosynthetic engineering of terpenoid bio-crude production requires identification of genes and reconstruction of metabolic pathways responsible for production of both hydrocarbons and other metabolites of the alga that compete for photosynthetic carbon and energy. Results A de novo assembly of 1,334,609 next-generation pyrosequencing reads form the Showa strain of the B race of B. braunii yielded a transcriptomic database of 46,422 contigs with an average length of 756 bp. Contigs were annotated with pathway, ontology, and protein domain identifiers. Manual curation allowed the reconstruction of pathways that produce terpenoid liquid hydrocarbons from primary metabolites, and pathways that divert photosynthetic carbon into tetraterpenoid carotenoids, diterpenoids, and the prenyl chains of meroterpenoid quinones and chlorophyll. Inventories of machine-assembled contigs are also presented for reconstructed pathways for the biosynthesis of competing storage compounds including triacylglycerol and starch. Regeneration of S-adenosylmethionine, and the extracellular localization of the hydrocarbon oils by active transport and possibly autophagy are also investigated. Conclusions The construction of an annotated transcriptomic database, publicly available in a web-based data depository and annotation tool, provides a foundation for metabolic pathway and network reconstruction, and facilitates further omics studies in the absence of a genome sequence for the Showa strain of B. braunii, race B. Further, the transcriptome database empowers future biosynthetic engineering approaches for strain improvement and the transfer of desirable traits to heterologous hosts. PMID:23110428

  14. De novo Assembly of the Indo-Pacific Humpback Dolphin Leucocyte Transcriptome to Identify Putative Genes Involved in the Aquatic Adaptation and Immune Response

    PubMed Central

    Xia, Jia; Yang, Lili; Chen, Jialin; Wu, Yuping; Yi, Meisheng

    2013-01-01

    Background The Indo-Pacific humpback dolphin (Sousa chinensis), a marine mammal species inhabited in the waters of Southeast Asia, South Africa and Australia, has attracted much attention because of the dramatic decline in population size in the past decades, which raises the concern of extinction. So far, this species is poorly characterized at molecular level due to little sequence information available in public databases. Recent advances in large-scale RNA sequencing provide an efficient approach to generate abundant sequences for functional genomic analyses in the species with un-sequenced genomes. Principal Findings We performed a de novo assembly of the Indo-Pacific humpback dolphin leucocyte transcriptome by Illumina sequencing. 108,751 high quality sequences from 47,840,388 paired-end reads were generated, and 48,868 and 46,587 unigenes were functionally annotated by BLAST search against the NCBI non-redundant and Swiss-Prot protein databases (E-value<10−5), respectively. In total, 16,467 unigenes were clustered into 25 functional categories by searching against the COG database, and BLAST2GO search assigned 37,976 unigenes to 61 GO terms. In addition, 36,345 unigenes were grouped into 258 KEGG pathways. We also identified 9,906 simple sequence repeats and 3,681 putative single nucleotide polymorphisms as potential molecular markers in our assembled sequences. A large number of unigenes were predicted to be involved in immune response, and many genes were predicted to be relevant to adaptive evolution and cetacean-specific traits. Conclusion This study represented the first transcriptome analysis of the Indo-Pacific humpback dolphin, an endangered species. The de novo transcriptome analysis of the unique transcripts will provide valuable sequence information for discovery of new genes, characterization of gene expression, investigation of various pathways and adaptive evolution, as well as identification of genetic markers. PMID:24015242

  15. De novo assembly of the Indo-Pacific humpback dolphin leucocyte transcriptome to identify putative genes involved in the aquatic adaptation and immune response.

    PubMed

    Gui, Duan; Jia, Kuntong; Xia, Jia; Yang, Lili; Chen, Jialin; Wu, Yuping; Yi, Meisheng

    2013-01-01

    The Indo-Pacific humpback dolphin (Sousa chinensis), a marine mammal species inhabited in the waters of Southeast Asia, South Africa and Australia, has attracted much attention because of the dramatic decline in population size in the past decades, which raises the concern of extinction. So far, this species is poorly characterized at molecular level due to little sequence information available in public databases. Recent advances in large-scale RNA sequencing provide an efficient approach to generate abundant sequences for functional genomic analyses in the species with un-sequenced genomes. We performed a de novo assembly of the Indo-Pacific humpback dolphin leucocyte transcriptome by Illumina sequencing. 108,751 high quality sequences from 47,840,388 paired-end reads were generated, and 48,868 and 46,587 unigenes were functionally annotated by BLAST search against the NCBI non-redundant and Swiss-Prot protein databases (E-value<10(-5)), respectively. In total, 16,467 unigenes were clustered into 25 functional categories by searching against the COG database, and BLAST2GO search assigned 37,976 unigenes to 61 GO terms. In addition, 36,345 unigenes were grouped into 258 KEGG pathways. We also identified 9,906 simple sequence repeats and 3,681 putative single nucleotide polymorphisms as potential molecular markers in our assembled sequences. A large number of unigenes were predicted to be involved in immune response, and many genes were predicted to be relevant to adaptive evolution and cetacean-specific traits. This study represented the first transcriptome analysis of the Indo-Pacific humpback dolphin, an endangered species. The de novo transcriptome analysis of the unique transcripts will provide valuable sequence information for discovery of new genes, characterization of gene expression, investigation of various pathways and adaptive evolution, as well as identification of genetic markers.

  16. Scanning of Transposable Elements and Analyzing Expression of Transposase Genes of Sweet Potato [Ipomoea batatas

    PubMed Central

    Tao, Xiang; Lai, Xian-Jun; Zhang, Yi-Zheng; Tan, Xue-Mei; Wang, Haiyan

    2014-01-01

    Background Transposable elements (TEs) are the most abundant genomic components in eukaryotes and affect the genome by their replications and movements to generate genetic plasticity. Sweet potato performs asexual reproduction generally and the TEs may be an important genetic factor for genome reorganization. Complete identification of TEs is essential for the study of genome evolution. However, the TEs of sweet potato are still poorly understood because of its complex hexaploid genome and difficulty in genome sequencing. The recent availability of the sweet potato transcriptome databases provides an opportunity for discovering and characterizing the expressed TEs. Methodology/Principal Findings We first established the integrated-transcriptome database by de novo assembling four published sweet potato transcriptome databases from three cultivars in China. Using sequence-similarity search and analysis, a total of 1,405 TEs including 883 retrotransposons and 522 DNA transposons were predicted and categorized. Depending on mapping sets of RNA-Seq raw short reads to the predicted TEs, we compared the quantities, classifications and expression activities of TEs inter- and intra-cultivars. Moreover, the differential expressions of TEs in seven tissues of Xushu 18 cultivar were analyzed by using Illumina digital gene expression (DGE) tag profiling. It was found that 417 TEs were expressed in one or more tissues and 107 in all seven tissues. Furthermore, the copy number of 11 transposase genes was determined to be 1–3 copies in the genome of sweet potato by Real-time PCR-based absolute quantification. Conclusions/Significance Our result provides a new method for TE searching on species with transcriptome sequences while lacking genome information. The searching, identification and expression analysis of TEs will provide useful TE information in sweet potato, which are valuable for the further studies of TE-mediated gene mutation and optimization in asexual reproduction. It contributes to elucidating the roles of TEs in genome evolution. PMID:24608103

  17. Transcriptome Profiling of Shewanella oneidensis Gene Expression following Exposure to Acidic and Alkaline pH†

    PubMed Central

    Leaphart, Adam B.; Thompson, Dorothea K.; Huang, Katherine; Alm, Eric; Wan, Xiu-Feng; Arkin, Adam; Brown, Steven D.; Wu, Liyou; Yan, Tingfen; Liu, Xueduan; Wickham, Gene S.; Zhou, Jizhong

    2006-01-01

    The molecular response of Shewanella oneidensis MR-1 to variations in extracellular pH was investigated based on genomewide gene expression profiling. Microarray analysis revealed that cells elicited both general and specific transcriptome responses when challenged with environmental acid (pH 4) or base (pH 10) conditions over a 60-min period. Global responses included the differential expression of genes functionally linked to amino acid metabolism, transcriptional regulation and signal transduction, transport, cell membrane structure, and oxidative stress protection. Response to acid stress included the elevated expression of genes encoding glycogen biosynthetic enzymes, phosphate transporters, and the RNA polymerase sigma-38 factor (rpoS), whereas the molecular response to alkaline pH was characterized by upregulation of nhaA and nhaR, which are predicted to encode an Na+/H+ antiporter and transcriptional activator, respectively, as well as sulfate transport and sulfur metabolism genes. Collectively, these results suggest that S. oneidensis modulates multiple transporters, cell envelope components, and pathways of amino acid consumption and central intermediary metabolism as part of its transcriptome response to changing external pH conditions. PMID:16452448

  18. Antimycobacterial Activity: A New Pharmacological Target for Conotoxins Found in the First Reported Conotoxin from Conasprella ximenes

    PubMed Central

    Figueroa-Montiel, Andrea; Bernáldez, Johanna; Ueberhide, Beatrix; González, Luis Javier

    2018-01-01

    Mycobacterium tuberculosis is the etiological agent of tuberculosis, an airborne infectious disease that is a leading cause of human morbidity and mortality worldwide. We report here the first conotoxin that is able to inhibit the growth of M. tuberculosis at a concentration similar to that of two other drugs that are currently used in clinics. Furthermore, it is also the first conopeptide that has been isolated from the venom of Conasprella ximenes. The venom gland transcriptome of C. ximenes was sequenced to construct a database with 24,284 non-redundant transcripts. The conopeptide was purified from the venom using reverse phase high performance liquid chromatography (RP-HPLC) and was analyzed using electrospray ionization-mass spectrometry (ESI-MS/MS). No automatic identification above the identity threshold with 1% of the false discovery rate was obtained; however, a 10-amino-acid sequence tag, manually extracted from the MS/MS spectra, allowed for the identification of a conotoxin in the transcriptome database. Electron transfer higher energy collision dissociation (EThcD) fragmentation of the native conotoxin confirmed the N-terminal sequence (1–14), while LC-MS/MS analysis of the tryptic digest of the reduced and S-alkylated conotoxin confirmed the C-terminal region (15–36). The expected and experimental molecular masses corresponded, within sub-ppm mass error. The 37-mer peptide (MW 4109.69 Da), containing eight cysteine residues, was named I1_xm11a, according to the current nomenclature for this type of molecule. PMID:29360782

  19. A combined strategy involving Sanger and 454 pyrosequencing increases genomic resources to aid in the management of reproduction, disease control and genetic selection in the turbot (Scophthalmus maximus).

    PubMed

    Ribas, Laia; Pardo, Belén G; Fernández, Carlos; Alvarez-Diós, José Antonio; Gómez-Tato, Antonio; Quiroga, María Isabel; Planas, Josep V; Sitjà-Bobadilla, Ariadna; Martínez, Paulino; Piferrer, Francesc

    2013-03-15

    Genomic resources for plant and animal species that are under exploitation primarily for human consumption are increasingly important, among other things, for understanding physiological processes and for establishing adequate genetic selection programs. Current available techniques for high-throughput sequencing have been implemented in a number of species, including fish, to obtain a proper description of the transcriptome. The objective of this study was to generate a comprehensive transcriptomic database in turbot, a highly priced farmed fish species in Europe, with potential expansion to other areas of the world, for which there are unsolved production bottlenecks, to understand better reproductive- and immune-related functions. This information is essential to implement marker assisted selection programs useful for the turbot industry. Expressed sequence tags were generated by Sanger sequencing of cDNA libraries from different immune-related tissues after several parasitic challenges. The resulting database ("Turbot 2 database") was enlarged with sequences generated from a 454 sequencing run of brain-hypophysis-gonadal axis-derived RNA obtained from turbot at different development stages. The assembly of Sanger and 454 sequences generated 52,427 consensus sequences ("Turbot 3 database"), of which 23,661 were successfully annotated. A total of 1,410 sequences were confirmed to be related to reproduction and key genes involved in sex differentiation and maturation were identified for the first time in turbot (AR, AMH, SRY-related genes, CYP19A, ZPGs, STAR FSHR, etc.). Similarly, 2,241 sequences were related to the immune system and several novel key immune genes were identified (BCL, TRAF, NCK, CD28 and TOLLIP, among others). The number of genes of many relevant reproduction- and immune-related pathways present in the database was 50-90% of the total gene count of each pathway. In addition, 1,237 microsatellites and 7,362 single nucleotide polymorphisms (SNPs) were also compiled. Further, 2,976 putative natural antisense transcripts (NATs) including microRNAs were also identified. The combined sequencing strategies employed here significantly increased the turbot genomic resources available, including 34,400 novel sequences. The generated database contains a larger number of genes relevant for reproduction- and immune-associated studies, with an excellent coverage of most genes present in many relevant physiological pathways. This database also allowed the identification of many microsatellites and SNP markers that will be very useful for population and genome screening and a valuable aid in marker assisted selection programs.

  20. IsoPlot: a database for comparison of mRNA isoforms in fruit fly and mosquitoes

    PubMed Central

    Ng, I-Man; Tsai, Shang-Chi

    2017-01-01

    Abstract Alternative splicing (AS), a mechanism by which different forms of mature messenger RNAs (mRNAs) are generated from the same gene, widely occurs in the metazoan genomes. Knowledge about isoform variants and abundance is crucial for understanding the functional context in the molecular diversity of the species. With increasing transcriptome data of model and non-model species, a database for visualization and comparison of AS events with up-to-date information is needed for further research. IsoPlot is a publicly available database with visualization tools for exploration of AS events, including three major species of mosquitoes, Aedes aegypti, Anopheles gambiae, and Culex quinquefasciatus, and fruit fly Drosophila melanogaster, the model insect species. IsoPlot includes not only 88,663 annotated transcripts but also 17,037 newly predicted transcripts from massive transcriptome data at different developmental stages of mosquitoes. The web interface enables users to explore the patterns and abundance of isoforms in different experimental conditions as well as cross-species sequence comparison of orthologous transcripts. IsoPlot provides a platform for researchers to access comprehensive information about AS events in mosquitoes and fruit fly. Our database is available on the web via an interactive user interface with an intuitive graphical design, which is applicable for the comparison of complex isoforms within or between species. Database URL: http://isoplot.iis.sinica.edu.tw/ PMID:29220459

  1. Genome-wide identification, phylogenetic analysis, and expression profiles of ATP-binding cassette transporter genes in the oriental fruit fly, Bactrocera dorsalis (Hendel) (Diptera: Tephritidae).

    PubMed

    Xiao, Lin-Fan; Zhang, Wei; Jing, Tian-Xing; Zhang, Meng-Yi; Miao, Ze-Qing; Wei, Dan-Dan; Yuan, Guo-Rui; Wang, Jin-Jun

    2018-03-01

    The ATP-binding cassette (ABC) is the largest transporter gene family and the genes play key roles in xenobiotic resistance, metabolism, and development of all phyla. However, the specific functions of ABC gene families in insects is unclear. We report a genome-wide identification, phylogenetic, and transcriptional analysis of the ABC genes in the oriental fruit fly, Bactrocera dorsalis (Hendel). We identified a total of 47 ABC genes (BdABCs) from the transcriptomic and genomic databases of B. dorsalis and classified these genes into eight subfamilies (A-H), including 7 ABCAs, 7 ABCBs, 9 ABCCs, 2 ABCDs, 1 ABCE, 3 ABCFs, 15 ABCGs, and 3 ABCHs. Comparative phylogenetic analysis of the ABCs suggests an orthologous relationship between B. dorsalis and other insect species in which these genes have been related to pesticide resistance and essential biological processes. Comparison of transcriptome and relative expression patterns of BdABCs indicated diverse multifunctions within different B. dorsalis tissues. The expression of 4, 10, and 14 BdABCs from 18 BdABCs was significantly upregulated after exposure to LD 50 s of malathion, avermectin, and beta-cypermethrin, respectively. The maximum expression level of most BdABCs (including BdABCFs, BdABCGs, and BdABCHs) occurred at 48h post exposures, whereas BdABCEs peaked at 24h after treatment. Furthermore, RNA interference-mediated suppression of BdABCB7 resulted in increased toxicity of malathion against B. dorsalis. These data suggest that ABC transporter genes might play key roles in xenobiotic metabolism and biosynthesis in B. dorsalis. Copyright © 2017 Elsevier Inc. All rights reserved.

  2. Evaluation of Potential Infectivity of Alzheimer and Parkinson Disease Proteins in Recipients of Cadaver-Derived Human Growth Hormone

    PubMed Central

    Irwin, David J.; Abrams, Joseph Y.; Schonberger, Lawrence B.; Leschek, Ellen Werber; Mills, James L.; Lee, Virginia M.-Y.; Trojanowski, John Q.

    2013-01-01

    Importance Growing evidence of cell-to-cell transmission of neurodegenerative disease (ND)–associated proteins (NDAPs) (ie, tau, Aβ, and α-synuclein) suggests possible similarities in the infectious prion protein (PrPsc) in spongiform encephalopathies. There are limited data on the potential human-to-human transmission of NDAPs associated with Alzheimer disease (AD) and other non-PrPsc ND. Objective To examine evidence for human-to-human transmission of AD, Parkinson disease (PD), and related NDAPs in cadaveric human growth hormone (c-hGH) recipients. Design We conducted a detailed immunohistochemical analysis of pathological NDAPs other than PrPsc in human pituitary glands. We also searched for ND in recipients of pituitary-derived c-hGH by reviewing the National Hormone and Pituitary Program (NHPP) cohort database and medical literature. Setting University-based academic center and agencies of the US Department of Health and Human Services. Participants Thirty-four routine autopsy subjects (10 non-ND controls and 24 patients with ND) and a US cohort of c-hGH recipients in the NHPP. Main Outcome Measures Detectable NDAPs in human pituitary sections and death certificate reports of non-PrPsc ND in the NHPP database. Results We found mild amounts of pathological tau, Aβ, and α-synuclein deposits in the adeno/neurohypophysis of patients with ND and control patients. No cases of AD or PD were identified, and 3 deaths attributed to amyotrophic lateral sclerosis (ALS) were found among US NHPP c-hGH recipients, including 2 of the 796 decedents in the originally confirmed NHPP c-hGH cohort database. Conclusions and Relevance Despite the likely frequent exposure of c-hGH recipients to NDAPs, and their markedly elevated risk of PrPsc-related disease, this population of NHPP c-hGH recipients does not appear to be at increased risk of AD or PD. We discovered 3 ALS cases of unclear significance among US c-hGH recipients despite the absence of pathological deposits of ALS-associated proteins (TDP-43, FUS, and ubiquilin) in human pituitary glands. In this unique in vivo model of human-to-human transmission, we found no evidence to support concerns that NDAPs underlying AD and PD transmit disease in humans despite evidence of their cell-to-cell transmission in model systems of these disorders. Further monitoring is required to confirm these conclusions. PMID:23380910

  3. Safety behaviors and sleep effort predict sleep disturbance and fatigue in an outpatient sample with anxiety and depressive disorders.

    PubMed

    Fairholme, Christopher P; Manber, Rachel

    2014-03-01

    Theoretical and empirical support for the role of dysfunctional beliefs, safety behaviors, and increased sleep effort in the maintenance of insomnia has begun to accumulate. It is not yet known how these factors predict sleep disturbance and fatigue occurring in the context of anxiety and mood disorders. It was hypothesized that these three insomnia-specific cognitive-behavioral factors would be uniquely associated with insomnia and fatigue among patients with emotional disorders after adjusting for current symptoms of anxiety and depression and trait levels of neuroticism and extraversion. Outpatients with a current anxiety or mood disorder (N = 63) completed self-report measures including the Dysfunctional Beliefs About Sleep Scale (DBAS), Sleep-Related Safety Behaviors Questionnaire (SRBQ), Glasgow Sleep Effort Scale (GSES), Pittsburgh Sleep Quality Index (PSQI), NEO Five-Factor Inventory (FFI), and the 21-item Depression Anxiety and Stress Scale (DASS). Multivariate path analysis was used to evaluate study hypotheses. SRBQ (B = .60, p < .001, 95% CI [.34, .86]) and GSES (B = .31, p < .01, 95% CI [.07, .55]) were both significantly associated with PSQI. There was a significant interaction between SRBQ and DBAS (B = .25, p < .05, 95% CI [.04, .47]) such that the relationship between safety behaviors and fatigue was strongest among individuals with greater levels of dysfunctional beliefs. Findings are consistent with cognitive behavioral models of insomnia and suggest that sleep-specific factors might be important treatment targets among patients with anxiety and depressive disorders with disturbed sleep. Copyright © 2013 Elsevier Inc. All rights reserved.

  4. Genetic expression programming-based DBA for enhancing peer-assisted music-on-demand service in EPON

    NASA Astrophysics Data System (ADS)

    Liem, Andrew Tanny; Hwang, I.-Shyan; Nikoukar, AliAkbar; Lee, Jhong-Yue

    2015-03-01

    Today, the popularity of peer-assisted music-on-demand (MoD) has increased significantly worldwide. This service allows users to access large music library tracks, listen to music, and share their playlist with other users. Unlike the conventional voice traffic, such an application maintains music quality that ranges from 160 kbps to 320 kbps, which most likely consumes more bandwidth than other traffics. In the access network, Ethernet passive optical network (EPON) is one of the best candidates for delivering such a service because of being cost-effective and with high bandwidth. To maintain music quality, a stutter needs to be prevented because of either network effects or when the due user was not receiving enough resources to play in a timely manner. Therefore, in this paper, we propose two genetic expression programming (GEP)-based dynamic bandwidth allocations (DBAs). The first DBA is a generic DBA that aims to find an optimum formula for voice, video, and data services. The second DBA aims to find optimum formulas so that Optical Line Terminal (OLT) can satisfy not only the voice and Peer-to-Peer (P2P) MoD traffics but also reduce the stutter. Optical Network Unit (ONU) traits such as REPORT and GATE messages, cycle time, and mean packet delay are set to be predictor variables. Simulation results show that our proposed DBAs can satisfy the voice and P2P MoD services packet delay and monitor other overall system performances such as expedited forwarding (EF) jitter, packet loss, bandwidth waste, and system throughputs.

  5. De novo sequencing, assembly and analysis of salivary gland transcriptome of Haemaphysalis flava and identification of sialoprotein genes.

    PubMed

    Xu, Xing-Li; Cheng, Tian-Yin; Yang, Hu; Yan, Fen; Yang, Ya

    2015-06-01

    Saliva plays an important role in feeding and pathogen transmission, identification and analysis of tick salivary gland (SG) proteins is considered as a hot spot in anti-tick researching area. Herein, we present the first description of SG transcriptome of Haemaphysalis flava using next-generation sequencing (NGS). A total of over 143 million high-quality reads were assembled into 54,357 unigenes, of which 20,145 (37.06%) had significant similarities to proteins in the Swiss-Prot database. 13,513 annotated sequences were associated with GO terms. Kyoto Encyclopedia of Genes and Genomes (KEGG) analysis showed that 14,280 unigenes were assigned to 279 KEGG pathways in total. Reads per kb per million reads (RPKM) analysis showed that there were 3035 down-regulated unigenes and 2260 up-regulated unigenes in the engorged ticks (ET) compared with the semi-engorged one (SET). Several important genes are associated with blood feeding and ingestion as secreted salivary proteins, concluding cysteine, longipain, 4D8, calreticulin, metalloproteases, serine protease inhibitor, enolase, heat shock protein and AV422 in SG, were identified. The qRT-PCR results confirmed that patterns of these genes (except for the longipain gene) expression were consistent with RNA-seq results. This de novo assembly of SG transcriptome of H. flava not only provides more chance for screening and cloning functional genes, but also forms a solid basis for further insight into the changes of salivary proteins during blood-feeding. Copyright © 2015 Elsevier B.V. All rights reserved.

  6. Uncovering the Complex Transcriptome Response of Mytilus chilensis against Saxitoxin: Implications of Harmful Algal Blooms on Mussel Populations.

    PubMed

    Detree, Camille; Núñez-Acuña, Gustavo; Roberts, Steven; Gallardo-Escárate, Cristian

    2016-01-01

    Saxitoxin (STX), a principal phycotoxin contributing to paralytic shellfish poisoning, is largely produced by marine microalgae of the genus Alexandrium. This toxin affects a wide range of species, inducing massive deaths in fish and other marine species. However, marine bivalves can resist and accumulate paralytic shellfish poisons. Despite numerous studies on the impact of STX in marine bivalves, knowledge regarding STX recognition at molecular level by benthic species remains scarce. Therefore, the aim of this study was to identify novel genes that interact with STX in the Chilean mussel Mytilus chilensis. For this, RNA-seq and RT-qPCR approaches were used to evaluate the transcriptomic response of M. chilensis to a purified STX as well as in vivo Alexandrium catenella exposure. Approximately 800 million reads were assembled, generating 138,883 contigs that were blasted against the UniProt Mollusca database. Pattern Recognition Receptors (PRRs) involved in mussel immunity, such as Toll-like receptors, tumor necrosis factor receptors, and scavenger-like receptors were found to be strongly upregulated at 8 and 16 h post-STX injection. These results suggest an involvement of PRRs in the response to STX, as well as identifying potential, novel STX-interacting receptors in this Chilean mussel. This study is the first transcriptomic overview of the STX-response in the edible species M. chilensis. However, the most significant contribution of this work is the identification of immune receptors and pathways potentially involved in the recognition and defense against STX's toxicity and its impact of harmful algae blooms on wild and cultivated mussel populations.

  7. Uncovering the Complex Transcriptome Response of Mytilus chilensis against Saxitoxin: Implications of Harmful Algal Blooms on Mussel Populations

    PubMed Central

    Detree, Camille; Núñez-Acuña, Gustavo; Roberts, Steven; Gallardo-Escárate, Cristian

    2016-01-01

    Saxitoxin (STX), a principal phycotoxin contributing to paralytic shellfish poisoning, is largely produced by marine microalgae of the genus Alexandrium. This toxin affects a wide range of species, inducing massive deaths in fish and other marine species. However, marine bivalves can resist and accumulate paralytic shellfish poisons. Despite numerous studies on the impact of STX in marine bivalves, knowledge regarding STX recognition at molecular level by benthic species remains scarce. Therefore, the aim of this study was to identify novel genes that interact with STX in the Chilean mussel Mytilus chilensis. For this, RNA-seq and RT-qPCR approaches were used to evaluate the transcriptomic response of M. chilensis to a purified STX as well as in vivo Alexandrium catenella exposure. Approximately 800 million reads were assembled, generating 138,883 contigs that were blasted against the UniProt Mollusca database. Pattern Recognition Receptors (PRRs) involved in mussel immunity, such as Toll-like receptors, tumor necrosis factor receptors, and scavenger-like receptors were found to be strongly upregulated at 8 and 16 h post-STX injection. These results suggest an involvement of PRRs in the response to STX, as well as identifying potential, novel STX-interacting receptors in this Chilean mussel. This study is the first transcriptomic overview of the STX-response in the edible species M. chilensis. However, the most significant contribution of this work is the identification of immune receptors and pathways potentially involved in the recognition and defense against STX’s toxicity and its impact of harmful algae blooms on wild and cultivated mussel populations. PMID:27764234

  8. Modeling hormonal and inflammatory contributions to preterm and term labor using uterine temporal transcriptomics.

    PubMed

    Migale, Roberta; MacIntyre, David A; Cacciatore, Stefano; Lee, Yun S; Hagberg, Henrik; Herbert, Bronwen R; Johnson, Mark R; Peebles, Donald; Waddington, Simon N; Bennett, Phillip R

    2016-06-13

    Preterm birth is now recognized as the primary cause of infant mortality worldwide. Interplay between hormonal and inflammatory signaling in the uterus modulates the onset of contractions; however, the relative contribution of each remains unclear. In this study we aimed to characterize temporal transcriptome changes in the uterus preceding term labor and preterm labor (PTL) induced by progesterone withdrawal or inflammation in the mouse and compare these findings with human data. Myometrium was collected at multiple time points during gestation and labor from three murine models of parturition: (1) term gestation; (2) PTL induced by RU486; and (3) PTL induced by lipopolysaccharide (LPS). RNA was extracted and cDNA libraries were prepared and sequenced using the Illumina HiSeq 2000 system. Resulting RNA-Seq data were analyzed using multivariate modeling approaches as well as pathway and causal network analyses and compared against human myometrial transcriptome data. We identified a core set of temporal myometrial gene changes associated with term labor and PTL in the mouse induced by either inflammation or progesterone withdrawal. Progesterone withdrawal initiated labor without inflammatory gene activation, yet LPS activation of uterine inflammation was sufficient to override the repressive effects of progesterone and induce a laboring phenotype. Comparison of human and mouse uterine transcriptomic datasets revealed that human labor more closely resembles inflammation-induced PTL in the mouse. Labor in the mouse can be achieved through inflammatory gene activation yet these changes are not a requisite for labor itself. Human labor more closely resembles LPS-induced PTL in the mouse, supporting an essential role for inflammatory mediators in human "functional progesterone withdrawal." This improved understanding of inflammatory and progesterone influence on the uterine transcriptome has important implications for the development of PTL prevention strategies.

  9. Whole Body Melanoma Transcriptome Response in Medaka

    PubMed Central

    Schartl, Manfred; Shen, Yingjia; Maurus, Katja; Walter, Ron; Tomlinson, Chad; Wilson, Richard K.; Postlethwait, John; Warren, Wesley C.

    2015-01-01

    The incidence of malignant melanoma continues to increase each year with poor prognosis for survival in many relapse cases. To reverse this trend, whole body response measures are needed to discover collaborative paths to primary and secondary malignancy. Several species of fish provide excellent melanoma models because fish and human melanocytes both appear in the epidermis, and fish and human pigment cell tumors share conserved gene expression signatures. For the first time, we have examined the whole body transcriptome response to invasive melanoma as a prelude to using transcriptome profiling to screen for drugs in a medaka (Oryzias latipes) model. We generated RNA-seq data from whole body RNA isolates for controls and melanoma fish. After testing for differential expression, 396 genes had significantly different expression (adjusted p-value <0.02) in the whole body transcriptome between melanoma and control fish; 379 of these genes were matched to human orthologs with 233 having annotated human gene symbols and 14 matched genes that contain putative deleterious variants in human melanoma at varying levels of recurrence. A detailed canonical pathway evaluation for significant enrichment showed the top scoring pathway to be antigen presentation but also included the expected melanocyte development and pigmentation signaling pathway. Results revealed a profound down-regulation of genes involved in the immune response, especially the innate immune system. We hypothesize that the developing melanoma actively suppresses the immune system responses of the body in reacting to the invasive malignancy, and that this mal-adaptive response contributes to disease progression, a result that suggests our whole-body transcriptomic approach merits further use. In these findings, we also observed novel genes not yet identified in human melanoma expression studies and uncovered known and new candidate drug targets for further testing in this malignant melanoma medaka model. PMID:26714172

  10. Whole Body Melanoma Transcriptome Response in Medaka.

    PubMed

    Schartl, Manfred; Shen, Yingjia; Maurus, Katja; Walter, Ron; Tomlinson, Chad; Wilson, Richard K; Postlethwait, John; Warren, Wesley C

    2015-01-01

    The incidence of malignant melanoma continues to increase each year with poor prognosis for survival in many relapse cases. To reverse this trend, whole body response measures are needed to discover collaborative paths to primary and secondary malignancy. Several species of fish provide excellent melanoma models because fish and human melanocytes both appear in the epidermis, and fish and human pigment cell tumors share conserved gene expression signatures. For the first time, we have examined the whole body transcriptome response to invasive melanoma as a prelude to using transcriptome profiling to screen for drugs in a medaka (Oryzias latipes) model. We generated RNA-seq data from whole body RNA isolates for controls and melanoma fish. After testing for differential expression, 396 genes had significantly different expression (adjusted p-value <0.02) in the whole body transcriptome between melanoma and control fish; 379 of these genes were matched to human orthologs with 233 having annotated human gene symbols and 14 matched genes that contain putative deleterious variants in human melanoma at varying levels of recurrence. A detailed canonical pathway evaluation for significant enrichment showed the top scoring pathway to be antigen presentation but also included the expected melanocyte development and pigmentation signaling pathway. Results revealed a profound down-regulation of genes involved in the immune response, especially the innate immune system. We hypothesize that the developing melanoma actively suppresses the immune system responses of the body in reacting to the invasive malignancy, and that this mal-adaptive response contributes to disease progression, a result that suggests our whole-body transcriptomic approach merits further use. In these findings, we also observed novel genes not yet identified in human melanoma expression studies and uncovered known and new candidate drug targets for further testing in this malignant melanoma medaka model.

  11. High-throughput RNA sequencing reveals structural differences of orthologous brain-expressed genes between western lowland gorillas and humans.

    PubMed

    Lipovich, Leonard; Hou, Zhuo-Cheng; Jia, Hui; Sinkler, Christopher; McGowen, Michael; Sterner, Kirstin N; Weckle, Amy; Sugalski, Amara B; Pipes, Lenore; Gatti, Domenico L; Mason, Christopher E; Sherwood, Chet C; Hof, Patrick R; Kuzawa, Christopher W; Grossman, Lawrence I; Goodman, Morris; Wildman, Derek E

    2016-02-01

    The human brain and human cognitive abilities are strikingly different from those of other great apes despite relatively modest genome sequence divergence. However, little is presently known about the interspecies divergence in gene structure and transcription that might contribute to these phenotypic differences. To date, most comparative studies of gene structure in the brain have examined humans, chimpanzees, and macaque monkeys. To add to this body of knowledge, we analyze here the brain transcriptome of the western lowland gorilla (Gorilla gorilla gorilla), an African great ape species that is phylogenetically closely related to humans, but with a brain that is approximately one-third the size. Manual transcriptome curation from a sample of the planum temporale region of the neocortex revealed 12 protein-coding genes and one noncoding-RNA gene with exons in the gorilla unmatched by public transcriptome data from the orthologous human loci. These interspecies gene structure differences accounted for a total of 134 amino acids in proteins found in the gorilla that were absent from protein products of the orthologous human genes. Proteins varying in structure between human and gorilla were involved in immunity and energy metabolism, suggesting their relevance to phenotypic differences. This gorilla neocortical transcriptome comprises an empirical, not homology- or prediction-driven, resource for orthologous gene comparisons between human and gorilla. These findings provide a unique repository of the sequences and structures of thousands of genes transcribed in the gorilla brain, pointing to candidate genes that may contribute to the traits distinguishing humans from other closely related great apes. © 2015 Wiley Periodicals, Inc.

  12. Global Transcriptomic Changes Induced by Infection of Cucumber (Cucumis sativus L.) with Mild and Severe Variants of Hop Stunt Viroid.

    PubMed

    Xia, Changjian; Li, Shifang; Hou, Wanying; Fan, Zaifeng; Xiao, Hong; Lu, Meiguang; Sano, Teruo; Zhang, Zhixiang

    2017-01-01

    Fifteen years after transfer to hops, hop stunt viroid-grapevine (HSVd-g) was replaced by HSVd-hop (HSVd-h), a sequence variant that contains changes at five different positions. HSVd-g54 is a laboratory mutant derived from HSVd-g that differs from its progenitor by a single G to A substitution at position 54. While infection by HSVd-h induces only mild stunting in cucumber ( Cucumis sativus L.), HSVd-g54 induces much more severe symptoms in this indicator host. Comparison of transcriptome profiles of cucumber infected with HSVd-h or HSVd-g54 with those of mock-inoculated controls obtained by whole transcriptome shotgun sequencing revealed that many genes related to photosynthesis were down-regulated following infection. In contrast, genes encoding RNA-dependent RNA polymerase 1 ( CsRDR1 ), especially CsRDR1c1 and CsRDR1c2 , as well as those related to basal defense responses were up-regulated. Expression of genes associated with phytohormone signaling pathways were also altered, indicating that viroid infection initiates a complex array of changes in the host transcriptome. HSVd-g54 induced an earlier and stronger response than HSVd-h, and further examination of these differences will contribute to a better understanding of the mechanisms that determine viroid pathogenicity.

  13. Impact of Transcriptomics on Our Understanding of Pulmonary Fibrosis

    PubMed Central

    Vukmirovic, Milica; Kaminski, Naftali

    2018-01-01

    Idiopathic pulmonary fibrosis (IPF) is a lethal fibrotic lung disease characterized by aberrant remodeling of the lung parenchyma with extensive changes to the phenotypes of all lung resident cells. The introduction of transcriptomics, genome scale profiling of thousands of RNA transcripts, caused a significant inversion in IPF research. Instead of generating hypotheses based on animal models of disease, or biological plausibility, with limited validation in humans, investigators were able to generate hypotheses based on unbiased molecular analysis of human samples and then use animal models of disease to test their hypotheses. In this review, we describe the insights made from transcriptomic analysis of human IPF samples. We describe how transcriptomic studies led to identification of novel genes and pathways involved in the human IPF lung such as: matrix metalloproteinases, WNT pathway, epithelial genes, role of microRNAs among others, as well as conceptual insights such as the involvement of developmental pathways and deep shifts in epithelial and fibroblast phenotypes. The impact of lung and transcriptomic studies on disease classification, endotype discovery, and reproducible biomarkers is also described in detail. Despite these impressive achievements, the impact of transcriptomic studies has been limited because they analyzed bulk tissue and did not address the cellular and spatial heterogeneity of the IPF lung. We discuss new emerging technologies and applications, such as single-cell RNAseq and microenvironment analysis that may address cellular and spatial heterogeneity. We end by making the point that most current tissue collections and resources are not amenable to analysis using the novel technologies. To take advantage of the new opportunities, we need new efforts of sample collections, this time focused on access to all the microenvironments and cells in the IPF lung. PMID:29670881

  14. Unique Transcriptome Patterns of the White and Grey Matter Corroborate Structural and Functional Heterogeneity in the Human Frontal Lobe

    PubMed Central

    Mills, James D.; Kavanagh, Tomas; Kim, Woojin S.; Chen, Bei Jun; Kawahara, Yoshihiro; Halliday, Glenda M.; Janitz, Michael

    2013-01-01

    The human frontal lobe has undergone accelerated evolution, leading to the development of unique human features such as language and self-reflection. Cortical grey matter and underlying white matter reflect distinct cellular compositions in the frontal lobe. Surprisingly little is known about the transcriptomal landscape of these distinct regions. Here, for the first time, we report a detailed transcriptomal profile of the frontal grey (GM) and white matter (WM) with resolution to alternatively spliced isoforms obtained using the RNA-Seq approach. We observed more vigorous transcriptome activity in GM compared to WM, presumably because of the presence of cellular bodies of neurons in the GM and RNA associated with the nucleus and perinuclear space. Among the top differentially expressed genes, we also identified a number of long intergenic non-coding RNAs (lincRNAs), specifically expressed in white matter, such as LINC00162. Furthermore, along with confirmation of expression of known markers for neurons and oligodendrocytes, we identified a number of genes and splicing isoforms that are exclusively expressed in GM or WM with examples of GABRB2 and PAK2 transcripts, respectively. Pathway analysis identified distinct physiological and biochemical processes specific to grey and white matter samples with a prevalence of synaptic processes in GM and myelination regulation and axonogenesis in the WM. Our study also revealed that expression of many genes, for example, the GPR123, is characterized by isoform switching, depending in which structure the gene is expressed. Our report clearly shows that GM and WM have perhaps surprisingly divergent transcriptome profiles, reflecting distinct roles in brain physiology. Further, this study provides the first reference data set for a normal human frontal lobe, which will be useful in comparative transcriptome studies of cerebral disorders, in particular, neurodegenerative diseases. PMID:24194939

  15. Gene discovery in Boophilus microplus, the cattle tick: the transcriptomes of ovaries, salivary glands, and hemocytes.

    PubMed

    Santos, Isabel K F de Miranda; Valenzuela, Jesus G; Ribeiro, José Marcos C; de Castro, Marilia; Costa, Juliana Nardelli; Costa, Ana Maria; da Silva, Edson Ramiro; Neto, Olavo Bilac Rego; Rocha, Clarisse; Daffre, Sirlei; Ferreira, Beatriz R; da Silva, João Santana; Szabó, Matias Pablo; Bechara, Gervasio Henrique

    2004-10-01

    The quest for new control strategies for ticks can profit from high throughput genomics. In order to identify genes that are involved in oogenesis and development, in defense, and in hematophagy, the transcriptomes of ovaries, hemocytes, and salivary glands from rapidly ingurgitating females, and of salivary glands from males of Boophilus microplus were PCR amplified, and the expressed sequence tags (EST) of random clones were mass sequenced. So far, more than 1,344 EST have been generated for these tissues, with approximately 30% novelty, depending on the the tissue studied. To date approximately 760 nucleotide sequences from B. microplus are deposited in the NCBI database. Mass sequencing of partial cDNAs of parasite genes can build up this scant database and rapidly generate a large quantity of useful information about potential targets for immunobiological or chemical control.

  16. Transcriptomic Analysis of Paeonia delavayi Wild Population Flowers to Identify Differentially Expressed Genes Involved in Purple-Red and Yellow Petal Pigmentation

    PubMed Central

    Wang, Yan; Li, Kui; Zheng, Baoqiang; Miao, Kun

    2015-01-01

    Tree peony (Paeonia suffruticosa Andrews) is a very famous traditional ornamental plant in China. P. delavayi is a species endemic to Southwest China that has aroused great interest from researchers as a precious genetic resource for flower color breeding. However, the current understanding of the molecular mechanisms of flower pigmentation in this plant is limited, hindering the genetic engineering of novel flower color in tree peonies. In this study, we conducted a large-scale transcriptome analysis based on Illumina HiSeq sequencing of cDNA libraries generated from yellow and purple-red P. delavayi petals. A total of 90,202 unigenes were obtained by de novo assembly, with an average length of 721 nt. Using Blastx, 44,811 unigenes (49.68%) were found to have significant similarity to accessions in the NR, NT, and Swiss-Prot databases. We also examined COG, GO and KEGG annotations to better understand the functions of these unigenes. Further analysis of the two digital transcriptomes revealed that 6,855 unigenes were differentially expressed between yellow and purple-red flower petals, with 3,430 up-regulated and 3,425 down-regulated. According to the RNA-Seq data and qRT-PCR analysis, we proposed that four up-regulated key structural genes, including F3H, DFR, ANS and 3GT, might play an important role in purple-red petal pigmentation, while high co-expression of THC2'GT, CHI and FNS II ensures the accumulation of pigments contributing to the yellow color. We also found 50 differentially expressed transcription factors that might be involved in flavonoid biosynthesis. This study is the first to report genetic information for P. delavayi. The large number of gene sequences produced by transcriptome sequencing and the candidate genes identified using pathway mapping and expression profiles will provide a valuable resource for future association studies aimed at better understanding the molecular mechanisms underlying flower pigmentation in tree peonies. PMID:26267644

  17. Cardiac fibroblast transcriptome analyses support a role for interferogenic, profibrotic, and inflammatory genes in anti-SSA/Ro-associated congenital heart block.

    PubMed

    Clancy, Robert M; Markham, Androo J; Jackson, Tanisha; Rasmussen, Sara E; Blumenberg, Miroslav; Buyon, Jill P

    2017-09-01

    The signature lesion of SSA/Ro autoantibody-associated congenital heart block (CHB) is fibrosis and a macrophage infiltrate, supporting an experimental focus on cues influencing the fibroblast component. The transcriptomes of human fetal cardiac fibroblasts were analyzed using two complementary approaches. Cardiac injury conditions were simulated in vitro by incubating human fetal cardiac fibroblasts with supernatants from macrophages transfected with the SSA/Ro-associated noncoding Y ssRNA. The top 10 upregulated transcripts in the stimulated fibroblasts reflected a type I interferon (IFN) response [e.g., IFN-induced protein 44-like (IFI44L), of MX dynamin-like GTPase (MX)1, MX2, and radical S -adenosyl methionine domain containing 2 (Rsad2)]. Within the fibrotic pathway, transcript levels of endothelin-1 (EDN1), phosphodiesterase (PDE)4D, chemokine (C-X-C motif) ligand (CXCL)2, and CXCL3 were upregulated, while others, including adenomedullin, RAP guanine nucleotide exchange factor 3 (RAPGEF3), tissue inhibitor of metalloproteinase (TIMP)1, TIMP3, and dual specificity phosphatase 1, were downregulated. Agnostic Database for Annotation, Visualization and Integrated Discovery analysis revealed a significant increase in inflammatory genes, including complement C3A receptor 1 (C3AR1), F2R-like thrombin/trypsin receptor 3, and neutrophil cytosolic factor 2. In addition, stimulated fibroblasts expressed high levels of phospho-MADS box transcription enhancer factor 2 [a substrate of MAPK5 (ERK5)], which was inhibited by BIX-02189, a specific inhibitor of ERK5. Translation to human disease leveraged an unprecedented opportunity to interrogate the transcriptome of fibroblasts freshly isolated and cell sorted without stimulation from a fetal heart with CHB and a matched healthy heart. Consistent with the in vitro data, five IFN response genes were among the top 10 most highly expressed transcripts in CHB fibroblasts. In addition, the expression of matrix-related genes reflected fibrosis. These data support the novel finding that cardiac injury in CHB may occur secondary to abnormal remodeling due in part to upregulation of type 1 IFN response genes. NEW & NOTEWORTHY Congenital heart block is a rare disease of the fetal heart associated with maternal anti-Ro autoantibodies which can result in death and for survivors, lifelong pacing. This study provides in vivo and in vitro transcriptome-support that injury may be mediated by an effect of Type I Interferon on fetal fibroblasts. Copyright © 2017 the American Physiological Society.

  18. Transcriptome analysis reveals intermittent fasting-induced genetic changes in ischemic stroke.

    PubMed

    Kim, Joonki; Kang, Sung-Wook; Mallilankaraman, Karthik; Baik, Sang-Ha; Lim, James C; Balaganapathy, Priyanka; She, David T; Lok, Ker-Zhing; Fann, David Y; Thambiayah, Uma; Tang, Sung-Chun; Stranahan, Alexis M; Dheen, S Thameem; Gelderblom, Mathias; Seet, Raymond C; Karamyan, Vardan T; Vemuganti, Raghu; Sobey, Christopher G; Mattson, Mark P; Jo, Dong-Gyu; Arumugam, Thiruma V

    2018-05-01

    Genetic changes due to dietary intervention in the form of either calorie restriction (CR) or intermittent fasting (IF) are not reported in detail until now. However, it is well established that both CR and IF extend the lifespan and protect against neurodegenerative diseases and stroke. The current research aims were first to describe the transcriptomic changes in brains of IF mice and, second, to determine whether IF induces extensive transcriptomic changes following ischemic stroke to protect the brain from injury. Mice were randomly assigned to ad libitum feeding (AL), 12 (IF12) or 16 (IF16) h daily fasting. Each diet group was then subjected to sham surgery or middle cerebral artery occlusion and consecutive reperfusion. Mid-coronal sections of ipsilateral cerebral tissue were harvested at the end of the 1 h ischemic period or at 3, 12, 24 or 72 h of reperfusion, and genome-wide mRNA expression was quantified by RNA sequencing. The cerebral transcriptome of mice in AL group exhibited robust, sustained up-regulation of detrimental genetic pathways under ischemic stroke, but activation of these pathways was suppressed in IF16 group. Interestingly, the cerebral transcriptome of AL mice was largely unchanged during the 1 h of ischemia, whereas mice in IF16 group exhibited extensive up-regulation of genetic pathways involved in neuroplasticity and down-regulation of protein synthesis. Our data provide a genetic molecular framework for understanding how IF protects brain cells against damage caused by ischemic stroke, and reveal cellular signaling and bioenergetic pathways to target in the development of clinical interventions.

  19. De novo assembly and characterization of fruit transcriptome in Litchi chinensis Sonn and analysis of differentially regulated genes in fruit in response to shading

    PubMed Central

    2013-01-01

    Background Litchi (Litchi chinensis Sonn.) is one of the most important fruit trees cultivated in tropical and subtropical areas. However, a lack of transcriptomic and genomic information hinders our understanding of the molecular mechanisms underlying fruit set and fruit development in litchi. Shading during early fruit development decreases fruit growth and induces fruit abscission. Here, high-throughput RNA sequencing (RNA-Seq) was employed for the de novo assembly and characterization of the fruit transcriptome in litchi, and differentially regulated genes, which are responsive to shading, were also investigated using digital transcript abundance(DTA)profiling. Results More than 53 million paired-end reads were generated and assembled into 57,050 unigenes with an average length of 601 bp. These unigenes were annotated by querying against various public databases, with 34,029 unigenes found to be homologous to genes in the NCBI GenBank database and 22,945 unigenes annotated based on known proteins in the Swiss-Prot database. In further orthologous analyses, 5,885 unigenes were assigned with one or more Gene Ontology terms, 10,234 hits were aligned to the 24 Clusters of Orthologous Groups classifications and 15,330 unigenes were classified into 266 Kyoto Encyclopedia of Genes and Genomes pathways. Based on the newly assembled transcriptome, the DTA profiling approach was applied to investigate the differentially expressed genes related to shading stress. A total of 3.6 million and 3.5 million high-quality tags were generated from shaded and non-shaded libraries, respectively. As many as 1,039 unigenes were shown to be significantly differentially regulated. Eleven of the 14 differentially regulated unigenes, which were randomly selected for more detailed expression comparison during the course of shading treatment, were identified as being likely to be involved in the process of fruitlet abscission in litchi. Conclusions The assembled transcriptome of litchi fruit provides a global description of expressed genes in litchi fruit development, and could serve as an ideal repository for future functional characterization of specific genes. The DTA analysis revealed that more than 1000 differentially regulated unigenes respond to the shading signal, some of which might be involved in the fruitlet abscission process in litchi, shedding new light on the molecular mechanisms underlying organ abscission. PMID:23941440

  20. Assessment of pleiotropic transcriptome perturbations in Arabidopsis engineered for indirect insect defence.

    PubMed

    Houshyani, Benyamin; van der Krol, Alexander R; Bino, Raoul J; Bouwmeester, Harro J

    2014-06-19

    Molecular characterization is an essential step of risk/safety assessment of genetically modified (GM) crops. Holistic approaches for molecular characterization using omics platforms can be used to confirm the intended impact of the genetic engineering, but can also reveal the unintended changes at the omics level as a first assessment of potential risks. The potential of omics platforms for risk assessment of GM crops has rarely been used for this purpose because of the lack of a consensus reference and statistical methods to judge the significance or importance of the pleiotropic changes in GM plants. Here we propose a meta data analysis approach to the analysis of GM plants, by measuring the transcriptome distance to untransformed wild-types. In the statistical analysis of the transcriptome distance between GM and wild-type plants, values are compared with naturally occurring transcriptome distances in non-GM counterparts obtained from a database. Using this approach we show that the pleiotropic effect of genes involved in indirect insect defence traits is substantially equivalent to the variation in gene expression occurring naturally in Arabidopsis. Transcriptome distance is a useful screening method to obtain insight in the pleiotropic effects of genetic modification.

  1. Sequence homology between HLA-bound cytomegalovirus and human peptides: A potential trigger for alloreactivity

    PubMed Central

    Koparde, Vishal N.; Jameson-Lee, Maximilian; Elnasseh, Abdelrhman G.; Scalora, Allison F.; Kobulnicky, David J.; Serrano, Myrna G.; Roberts, Catherine H.; Buck, Gregory A.; Neale, Michael C.; Nixon, Daniel E.; Toor, Amir A.

    2017-01-01

    Human cytomegalovirus (hCMV) reactivation may often coincide with the development of graft-versus-host-disease (GVHD) in stem cell transplantation (SCT). Seventy seven SCT donor-recipient pairs (DRP) (HLA matched unrelated donor (MUD), n = 50; matched related donor (MRD), n = 27) underwent whole exome sequencing to identify single nucleotide polymorphisms (SNPs) generating alloreactive peptide libraries for each DRP (9-mer peptide-HLA complexes); Human CMV CROSS (Cross-Reactive Open Source Sequence) database was compiled from NCBI; HLA class I binding affinity for each DRPs HLA was calculated by NetMHCpan 2.8 and hCMV- derived 9-mers algorithmically compared to the alloreactive peptide-HLA complex libraries. Short consecutive (≥6) amino acid (AA) sequence homology matching hCMV to recipient peptides was considered for HLA-bound-peptide (IC50<500nM) cross reactivity. Of the 70,686 hCMV 9-mers contained within the hCMV CROSS database, an average of 29,658 matched the MRD DRP alloreactive peptides and 52,910 matched MUD DRP peptides (p<0.001). In silico analysis revealed multiple high affinity, immunogenic CMV-Human peptide matches (IC50<500 nM) expressed in GVHD-affected tissue-specific manner. hCMV+GVHD was found in 18 patients, 13 developing hCMV viremia before GVHD onset. Analysis of patients with GVHD identified potential cross reactive peptide expression within affected organs. We propose that hCMV peptide sequence homology with human alloreactive peptides may contribute to the pathophysiology of GVHD. PMID:28800601

  2. Sequence homology between HLA-bound cytomegalovirus and human peptides: A potential trigger for alloreactivity.

    PubMed

    Hall, Charles E; Koparde, Vishal N; Jameson-Lee, Maximilian; Elnasseh, Abdelrhman G; Scalora, Allison F; Kobulnicky, David J; Serrano, Myrna G; Roberts, Catherine H; Buck, Gregory A; Neale, Michael C; Nixon, Daniel E; Toor, Amir A

    2017-01-01

    Human cytomegalovirus (hCMV) reactivation may often coincide with the development of graft-versus-host-disease (GVHD) in stem cell transplantation (SCT). Seventy seven SCT donor-recipient pairs (DRP) (HLA matched unrelated donor (MUD), n = 50; matched related donor (MRD), n = 27) underwent whole exome sequencing to identify single nucleotide polymorphisms (SNPs) generating alloreactive peptide libraries for each DRP (9-mer peptide-HLA complexes); Human CMV CROSS (Cross-Reactive Open Source Sequence) database was compiled from NCBI; HLA class I binding affinity for each DRPs HLA was calculated by NetMHCpan 2.8 and hCMV- derived 9-mers algorithmically compared to the alloreactive peptide-HLA complex libraries. Short consecutive (≥6) amino acid (AA) sequence homology matching hCMV to recipient peptides was considered for HLA-bound-peptide (IC50<500nM) cross reactivity. Of the 70,686 hCMV 9-mers contained within the hCMV CROSS database, an average of 29,658 matched the MRD DRP alloreactive peptides and 52,910 matched MUD DRP peptides (p<0.001). In silico analysis revealed multiple high affinity, immunogenic CMV-Human peptide matches (IC50<500 nM) expressed in GVHD-affected tissue-specific manner. hCMV+GVHD was found in 18 patients, 13 developing hCMV viremia before GVHD onset. Analysis of patients with GVHD identified potential cross reactive peptide expression within affected organs. We propose that hCMV peptide sequence homology with human alloreactive peptides may contribute to the pathophysiology of GVHD.

  3. Human cytosolic glutathione-S-transferases: quantitative analysis of expression, comparative analysis of structures and inhibition strategies of isozymes involved in drug resistance.

    PubMed

    Mohana, Krishnamoorthy; Achary, Anant

    2017-08-01

    Glutathione-S-transferase (GST) inhibition is a strategy to overcome drug resistance. Several isoforms of human GSTs are present and they are expressed in almost all the organs. Specific expression levels of GSTs in various organs are collected from the human transcriptome data and analysis of the organ-specific expression of GST isoforms is carried out. The variations in the level of expressions of GST isoforms are statistically significant. The GST expression differs in diseased conditions as reported by many investigators and some of the isoforms of GSTs are disease markers or drug targets. Structure analysis of various isoforms is carried out and literature mining has been performed to identify the differences in the active sites of the GSTs. The xenobiotic binding H site is classified into H1, H2, and H3 and the differences in the amino acid composition, the hydrophobicity and other structural features of H site of GSTs are discussed. The existing inhibition strategies are compared. The advent of rational drug design, mechanism-based inhibition strategies, availability of high-throughput screening, target specific, and selective inhibition of GST isoforms involved in drug resistance could be achieved for the reversal of drug resistance and aid in the treatment of diseases.

  4. KONAGAbase: a genomic and transcriptomic database for the diamondback moth, Plutella xylostella.

    PubMed

    Jouraku, Akiya; Yamamoto, Kimiko; Kuwazaki, Seigo; Urio, Masahiro; Suetsugu, Yoshitaka; Narukawa, Junko; Miyamoto, Kazuhisa; Kurita, Kanako; Kanamori, Hiroyuki; Katayose, Yuichi; Matsumoto, Takashi; Noda, Hiroaki

    2013-07-09

    The diamondback moth (DBM), Plutella xylostella, is one of the most harmful insect pests for crucifer crops worldwide. DBM has rapidly evolved high resistance to most conventional insecticides such as pyrethroids, organophosphates, fipronil, spinosad, Bacillus thuringiensis, and diamides. Therefore, it is important to develop genomic and transcriptomic DBM resources for analysis of genes related to insecticide resistance, both to clarify the mechanism of resistance of DBM and to facilitate the development of insecticides with a novel mode of action for more effective and environmentally less harmful insecticide rotation. To contribute to this goal, we developed KONAGAbase, a genomic and transcriptomic database for DBM (KONAGA is the Japanese word for DBM). KONAGAbase provides (1) transcriptomic sequences of 37,340 ESTs/mRNAs and 147,370 RNA-seq contigs which were clustered and assembled into 84,570 unigenes (30,695 contigs, 50,548 pseudo singletons, and 3,327 singletons); and (2) genomic sequences of 88,530 WGS contigs with 246,244 degenerate contigs and 106,455 singletons from which 6,310 de novo identified repeat sequences and 34,890 predicted gene-coding sequences were extracted. The unigenes and predicted gene-coding sequences were clustered and 32,800 representative sequences were extracted as a comprehensive putative gene set. These sequences were annotated with BLAST descriptions, Gene Ontology (GO) terms, and Pfam descriptions, respectively. KONAGAbase contains rich graphical user interface (GUI)-based web interfaces for easy and efficient searching, browsing, and downloading sequences and annotation data. Five useful search interfaces consisting of BLAST search, keyword search, BLAST result-based search, GO tree-based search, and genome browser are provided. KONAGAbase is publicly available from our website (http://dbm.dna.affrc.go.jp/px/) through standard web browsers. KONAGAbase provides DBM comprehensive transcriptomic and draft genomic sequences with useful annotation information with easy-to-use web interfaces, which helps researchers to efficiently search for target sequences such as insect resistance-related genes. KONAGAbase will be continuously updated and additional genomic/transcriptomic resources and analysis tools will be provided for further efficient analysis of the mechanism of insecticide resistance and the development of effective insecticides with a novel mode of action for DBM.

  5. Flower bud transcriptome analysis of Sapium sebiferum (Linn.) Roxb. and primary investigation of drought induced flowering: pathway construction and G-quadruplex prediction based on transcriptome.

    PubMed

    Yang, Minglei; Wu, Ying; Jin, Shan; Hou, Jinyan; Mao, Yingji; Liu, Wenbo; Shen, Yangcheng; Wu, Lifang

    2015-01-01

    Sapium sebiferum (Linn.) Roxb. (Chinese Tallow Tree) is a perennial woody tree and its seeds are rich in oil which hold great potential for biodiesel production. Despite a traditional woody oil plant, our understanding on S. sebiferum genetics and molecular biology remains scant. In this study, the first comprehensive transcriptome of S. sebiferum flower has been generated by sequencing and de novo assembly. A total of 149,342 unigenes were generated from raw reads, of which 24,289 unigenes were successfully matched to public database. A total of 61 MADS box genes and putative pathways involved in S. sebiferum flower development have been identified. Abiotic stress response network was also constructed in this work, where 2,686 unigenes are involved in the pathway. As for lipid biosynthesis, 161 unigenes have been identified in fatty acid (FA) and triacylglycerol (TAG) biosynthesis. Besides, the G-Quadruplexes in RNA of S. sebiferum also have been predicted. An interesting finding is that the stress-induced flowering was observed in S. sebiferum for the first time. According to the results of semi-quantitative PCR, expression tendencies of flowering-related genes, GA1, AP2 and CRY2, accorded with stress-related genes, such as GRX50435 and PRXⅡ39562. This transcriptome provides functional genomic information for further research of S. sebiferum, especially for the genetic engineering to shorten the juvenile period and improve yield by regulating flower development. It also offers a useful database for the research of other Euphorbiaceae family plants.

  6. Sequencing and Characterization of the Invasive Sycamore Lace Bug Corythucha ciliata (Hemiptera: Tingidae) Transcriptome

    PubMed Central

    Qu, Cheng; Fu, Ningning; Xu, Yihua

    2016-01-01

    The sycamore lace bug, Corythucha ciliata (Hemiptera: Tingidae), is an invasive forestry pest rapidly expanding in many countries. This pest poses a considerable threat to the urban forestry ecosystem, especially to Platanus spp. However, its molecular biology and biochemistry are poorly understood. This study reports the first C. ciliata transcriptome, encompassing three different life stages (Nymphs, adults female (AF) and adults male (AM)). In total, 26.53 GB of clean data and 60,879 unigenes were obtained from three RNA-seq libraries. These unigenes were annotated and classified by Nr (NCBI non-redundant protein sequences), Nt (NCBI non-redundant nucleotide sequences), Pfam (Protein family), KOG/COG (Clusters of Orthologous Groups of proteins), Swiss-Prot (A manually annotated and reviewed protein sequence database), and KO (KEGG Ortholog database). After all pairwise comparisons between these three different samples, a large number of differentially expressed genes were revealed. The dramatic differences in global gene expression profiles were found between distinct life stages (nymphs and AF, nymphs and AM) and sex difference (AF and AM), with some of the significantly differentially expressed genes (DEGs) being related to metamorphosis, digestion, immune and sex difference. The different express of unigenes were validated through quantitative Real-Time PCR (qRT-PCR) for 16 randomly selected unigenes. In addition, 17,462 potential simple sequence repeat molecular markers were identified in these transcriptome resources. These comprehensive C. ciliata transcriptomic information can be utilized to promote the development of environmentally friendly methodologies to disrupt the processes of metamorphosis, digestion, immune and sex differences. PMID:27494615

  7. Competing endogenous RNA and interactome bioinformatic analyses on human telomerase.

    PubMed

    Arancio, Walter; Pizzolanti, Giuseppe; Genovese, Swonild Ilenia; Baiamonte, Concetta; Giordano, Carla

    2014-04-01

    We present a classic interactome bioinformatic analysis and a study on competing endogenous (ce) RNAs for hTERT. The hTERT gene codes for the catalytic subunit and limiting component of the human telomerase complex. Human telomerase reverse transcriptase (hTERT) is essential for the integrity of telomeres. Telomere dysfunctions have been widely reported to be involved in aging, cancer, and cellular senescence. The hTERT gene network has been analyzed using the BioGRID interaction database (http://thebiogrid.org/) and related analysis tools such as Osprey (http://biodata.mshri.on.ca/osprey/servlet/Index) and GeneMANIA (http://genemania.org/). The network of interaction of hTERT transcripts has been further analyzed following the competing endogenous (ce) RNA hypotheses (messenger [m] RNAs cross-talk via micro [mi] RNAs) using the miRWalk database and tools (www.ma.uni-heidelberg.de/apps/zmf/mirwalk/). These analyses suggest a role for Akt, nuclear factor-κB (NF-κB), heat shock protein 90 (HSP90), p70/p80 autoantigen, 14-3-3 proteins, and dynein in telomere functions. Roles for histone acetylation/deacetylation and proteoglycan metabolism are also proposed.

  8. Combined transcriptome studies identify AFF3 as a mediator of the oncogenic effects of β-catenin in adrenocortical carcinoma

    PubMed Central

    Lefèvre, L; Omeiri, H; Drougat, L; Hantel, C; Giraud, M; Val, P; Rodriguez, S; Perlemoine, K; Blugeon, C; Beuschlein, F; de Reyniès, A; Rizk-Rabin, M; Bertherat, J; Ragazzon, B

    2015-01-01

    Adrenocortical cancer (ACC) is a very aggressive tumor, and genomics studies demonstrate that the most frequent alterations of driver genes in these cancers activate the Wnt/β-catenin signaling pathway. However, the adrenal-specific targets of oncogenic β-catenin-mediating tumorigenesis have not being established. A combined transcriptomic analysis from two series of human tumors and the human ACC cell line H295R harboring a spontaneous β-catenin activating mutation was done to identify the Wnt/β-catenin targets. Seven genes were consistently identified in the three studies. Among these genes, we found that AFF3 mediates the oncogenic effects of β-catenin in ACC. The Wnt response element site located at nucleotide position −1408 of the AFF3 transcriptional start sites (TSS) mediates the regulation by the Wnt/β-catenin signaling pathway. AFF3 silencing decreases cell proliferation and increases apoptosis in the ACC cell line H295R. AFF3 is located in nuclear speckles, which play an important role in RNA splicing. AFF3 overexpression in adrenocortical cells interferes with the organization and/or biogenesis of these nuclear speckles and alters the distribution of CDK9 and cyclin T1 such that they accumulate at the sites of AFF3/speckles. We demonstrate that AFF3 is a new target of Wnt/β-catenin pathway involved in ACC, acting on transcription and RNA splicing. PMID:26214578

  9. Comparative high-throughput transcriptome sequencing and development of SiESTa, the Silene EST annotation database

    PubMed Central

    2011-01-01

    Background The genus Silene is widely used as a model system for addressing ecological and evolutionary questions in plants, but advances in using the genus as a model system are impeded by the lack of available resources for studying its genome. Massively parallel sequencing cDNA has recently developed into an efficient method for characterizing the transcriptomes of non-model organisms, generating massive amounts of data that enable the study of multiple species in a comparative framework. The sequences generated provide an excellent resource for identifying expressed genes, characterizing functional variation and developing molecular markers, thereby laying the foundations for future studies on gene sequence and gene expression divergence. Here, we report the results of a comparative transcriptome sequencing study of eight individuals representing four Silene and one Dianthus species as outgroup. All sequences and annotations have been deposited in a newly developed and publicly available database called SiESTa, the Silene EST annotation database. Results A total of 1,041,122 EST reads were generated in two runs on a Roche GS-FLX 454 pyrosequencing platform. EST reads were analyzed separately for all eight individuals sequenced and were assembled into contigs using TGICL. These were annotated with results from BLASTX searches and Gene Ontology (GO) terms, and thousands of single-nucleotide polymorphisms (SNPs) were characterized. Unassembled reads were kept as singletons and together with the contigs contributed to the unigenes characterized in each individual. The high quality of unigenes is evidenced by the proportion (49%) that have significant hits in similarity searches with the A. thaliana proteome. The SiESTa database is accessible at http://www.siesta.ethz.ch. Conclusion The sequence collections established in the present study provide an important genomic resource for four Silene and one Dianthus species and will help to further develop Silene as a plant model system. The genes characterized will be useful for future research not only in the species included in the present study, but also in related species for which no genomic resources are yet available. Our results demonstrate the efficiency of massively parallel transcriptome sequencing in a comparative framework as an approach for developing genomic resources in diverse groups of non-model organisms. PMID:21791039

  10. Massively parallel pyrosequencing-based transcriptome analyses of small brown planthopper (Laodelphax striatellus), a vector insect transmitting rice stripe virus (RSV)

    PubMed Central

    2010-01-01

    Background The small brown planthopper (Laodelphax striatellus) is an important agricultural pest that not only damages rice plants by sap-sucking, but also acts as a vector that transmits rice stripe virus (RSV), which can cause even more serious yield loss. Despite being a model organism for studying entomology, population biology, plant protection, molecular interactions among plants, viruses and insects, only a few genomic sequences are available for this species. To investigate its transcriptome and determine the differences between viruliferous and naïve L. striatellus, we employed 454-FLX high-throughput pyrosequencing to generate EST databases of this insect. Results We obtained 201,281 and 218,681 high-quality reads from viruliferous and naïve L. striatellus, respectively, with an average read length as 230 bp. These reads were assembled into contigs and two EST databases were generated. When all reads were combined, 16,885 contigs and 24,607 singletons (a total of 41,492 unigenes) were obtained, which represents a transcriptome of the insect. BlastX search against the NCBI-NR database revealed that only 6,873 (16.6%) of these unigenes have significant matches. Comparison of the distribution of GO classification among viruliferous, naïve, and combined EST databases indicated that these libraries are broadly representative of the L. striatellus transcriptomes. Functionally diverse transcripts from RSV, endosymbiotic bacteria Wolbachia and yeast-like symbiotes were identified, which reflects the possible lifestyles of these microbial symbionts that live in the cells of the host insect. Comparative genomic analysis revealed that L. striatellus encodes similar innate immunity regulatory systems as other insects, such as RNA interference, JAK/STAT and partial Imd cascades, which might be involved in defense against viral infection. In addition, we determined the differences in gene expression between vector and naïve samples, which generated a list of candidate genes that are potentially involved in the symbiosis of L. striatellus and RSV. Conclusions To our knowledge, the present study is the first description of a genomic project for L. striatellus. The identification of transcripts from RSV, Wolbachia, yeast-like symbiotes and genes abundantly expressed in viruliferous insect, provided a starting-point for investigating the molecular basis of symbiosis among these organisms. PMID:20462456

  11. De novo transcriptome assembly and RNA-Seq expression analysis in blood from beluga whales of Bristol Bay, AK.

    PubMed

    Morey, Jeanine S; Burek Huntington, Kathy A; Campbell, Michelle; Clauss, Tonya M; Goertz, Caroline E; Hobbs, Roderick C; Lunardi, Denise; Moors, Amanda J; Neely, Marion G; Schwacke, Lori H; Van Dolah, Frances M

    2017-10-01

    Assessing the health of marine mammal sentinel species is crucial to understanding the impacts of environmental perturbations on marine ecosystems and human health. In Arctic regions, beluga whales, Delphinapterus leucas, are upper level predators that may serve as a sentinel species, potentially forecasting impacts on human health. While gene expression profiling from blood transcriptomes has widely been used to assess health status and environmental exposures in human and veterinary medicine, its use in wildlife has been limited due to the lack of available genomes and baseline data. To this end we constructed the first beluga whale blood transcriptome de novo from samples collected during annual health assessments of the healthy Bristol Bay, AK stock during 2012-2014 to establish baseline information on the content and variation of the beluga whale blood transcriptome. The Trinity transcriptome assembly from beluga was comprised of 91,325 transcripts that represented a wide array of cellular functions and processes and was extremely similar in content to the blood transcriptome of another cetacean, the bottlenose dolphin. Expression of hemoglobin transcripts was much lower in beluga (25.6% of TPM, transcripts per million) than has been observed in many other mammals. A T12A amino acid substitution in the HBB sequence of beluga whales, but not bottlenose dolphins, was identified and may play a role in low temperature adaptation. The beluga blood transcriptome was extremely stable between sex and year, with no apparent clustering of samples by principle components analysis and <4% of genes differentially expressed (EBseq, FDR<0.05). While the impacts of season, sexual maturity, disease, and geography on the beluga blood transcriptome must be established, the presence of transcripts involved in stress, detoxification, and immune functions indicate that blood gene expression analyses may provide information on health status and exposure. This study provides a wealth of transcriptomic data on beluga whales and provides a sizeable pool of preliminary data for comparison with other studies in beluga whale. Copyright © 2017 Elsevier B.V. All rights reserved.

  12. PATRIC, the bacterial bioinformatics database and analysis resource.

    PubMed

    Wattam, Alice R; Abraham, David; Dalay, Oral; Disz, Terry L; Driscoll, Timothy; Gabbard, Joseph L; Gillespie, Joseph J; Gough, Roger; Hix, Deborah; Kenyon, Ronald; Machi, Dustin; Mao, Chunhong; Nordberg, Eric K; Olson, Robert; Overbeek, Ross; Pusch, Gordon D; Shukla, Maulik; Schulman, Julie; Stevens, Rick L; Sullivan, Daniel E; Vonstein, Veronika; Warren, Andrew; Will, Rebecca; Wilson, Meredith J C; Yoo, Hyun Seung; Zhang, Chengdong; Zhang, Yan; Sobral, Bruno W

    2014-01-01

    The Pathosystems Resource Integration Center (PATRIC) is the all-bacterial Bioinformatics Resource Center (BRC) (http://www.patricbrc.org). A joint effort by two of the original National Institute of Allergy and Infectious Diseases-funded BRCs, PATRIC provides researchers with an online resource that stores and integrates a variety of data types [e.g. genomics, transcriptomics, protein-protein interactions (PPIs), three-dimensional protein structures and sequence typing data] and associated metadata. Datatypes are summarized for individual genomes and across taxonomic levels. All genomes in PATRIC, currently more than 10,000, are consistently annotated using RAST, the Rapid Annotations using Subsystems Technology. Summaries of different data types are also provided for individual genes, where comparisons of different annotations are available, and also include available transcriptomic data. PATRIC provides a variety of ways for researchers to find data of interest and a private workspace where they can store both genomic and gene associations, and their own private data. Both private and public data can be analyzed together using a suite of tools to perform comparative genomic or transcriptomic analysis. PATRIC also includes integrated information related to disease and PPIs. All the data and integrated analysis and visualization tools are freely available. This manuscript describes updates to the PATRIC since its initial report in the 2007 NAR Database Issue.

  13. PATRIC, the bacterial bioinformatics database and analysis resource

    PubMed Central

    Wattam, Alice R.; Abraham, David; Dalay, Oral; Disz, Terry L.; Driscoll, Timothy; Gabbard, Joseph L.; Gillespie, Joseph J.; Gough, Roger; Hix, Deborah; Kenyon, Ronald; Machi, Dustin; Mao, Chunhong; Nordberg, Eric K.; Olson, Robert; Overbeek, Ross; Pusch, Gordon D.; Shukla, Maulik; Schulman, Julie; Stevens, Rick L.; Sullivan, Daniel E.; Vonstein, Veronika; Warren, Andrew; Will, Rebecca; Wilson, Meredith J.C.; Yoo, Hyun Seung; Zhang, Chengdong; Zhang, Yan; Sobral, Bruno W.

    2014-01-01

    The Pathosystems Resource Integration Center (PATRIC) is the all-bacterial Bioinformatics Resource Center (BRC) (http://www.patricbrc.org). A joint effort by two of the original National Institute of Allergy and Infectious Diseases-funded BRCs, PATRIC provides researchers with an online resource that stores and integrates a variety of data types [e.g. genomics, transcriptomics, protein–protein interactions (PPIs), three-dimensional protein structures and sequence typing data] and associated metadata. Datatypes are summarized for individual genomes and across taxonomic levels. All genomes in PATRIC, currently more than 10 000, are consistently annotated using RAST, the Rapid Annotations using Subsystems Technology. Summaries of different data types are also provided for individual genes, where comparisons of different annotations are available, and also include available transcriptomic data. PATRIC provides a variety of ways for researchers to find data of interest and a private workspace where they can store both genomic and gene associations, and their own private data. Both private and public data can be analyzed together using a suite of tools to perform comparative genomic or transcriptomic analysis. PATRIC also includes integrated information related to disease and PPIs. All the data and integrated analysis and visualization tools are freely available. This manuscript describes updates to the PATRIC since its initial report in the 2007 NAR Database Issue. PMID:24225323

  14. Architecture of epigenetic reprogramming following Twist1-mediated epithelial-mesenchymal transition

    PubMed Central

    2013-01-01

    Background Epithelial-mesenchymal transition (EMT) is known to impart metastasis and stemness characteristics in breast cancer. To characterize the epigenetic reprogramming following Twist1-induced EMT, we characterized the epigenetic and transcriptome landscapes using whole-genome transcriptome analysis by RNA-seq, DNA methylation by digital restriction enzyme analysis of methylation (DREAM) and histone modifications by CHIP-seq of H3K4me3 and H3K27me3 in immortalized human mammary epithelial cells relative to cells induced to undergo EMT by Twist1. Results EMT is accompanied by focal hypermethylation and widespread global DNA hypomethylation, predominantly within transcriptionally repressed gene bodies. At the chromatin level, the number of gene promoters marked by H3K4me3 increases by more than one fifth; H3K27me3 undergoes dynamic genomic redistribution characterized by loss at half of gene promoters and overall reduction of peak size by almost half. This is paralleled by increased phosphorylation of EZH2 at serine 21. Among genes with highly altered mRNA expression, 23.1% switch between H3K4me3 and H3K27me3 marks, and those point to the master EMT targets and regulators CDH1, PDGFRα and ESRP1. Strikingly, Twist1 increases the number of bivalent genes by more than two fold. Inhibition of the H3K27 methyltransferases EZH2 and EZH1, which form part of the Polycomb repressive complex 2 (PRC2), blocks EMT and stemness properties. Conclusions Our findings demonstrate that the EMT program requires epigenetic remodeling by the Polycomb and Trithorax complexes leading to increased cellular plasticity. This suggests that inhibiting epigenetic remodeling and thus decrease plasticity will prevent EMT, and the associated breast cancer metastasis. PMID:24367927

  15. De novo Transcriptome Analysis of Rhizoctonia solani AG1 IA Strain Early Invasion in Zoysia japonica Root.

    PubMed

    Zhu, Chen; Ai, Lin; Wang, Li; Yin, Pingping; Liu, Chenglan; Li, Shanshan; Zeng, Huiming

    2016-01-01

    Zoysia japonica brown spot was caused by necrotrophic fungus Rhizoctonia solani invasion, which led to severe financial loss in city lawn and golf ground maintenance. However, little was known about the molecular mechanism of R. solani pathogenicity in Z. japonica. In this study we examined early stage interaction between R. solani AG1 IA strain and Z. japonica cultivar "Zenith" root by cell ultra-structure analysis, pathogenesis-related proteins assay and transcriptome analysis to explore molecular clues for AG1 IA strain pathogenicity in Z. japonica. No obvious cell structure damage was found in infected roots and most pathogenesis-related protein activities showedg a downward trend especially in 36 h post inoculation, which exhibits AG1 IA strain stealthy invasion characteristic. According to Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) database classification, most DEGs in infected "Zenith" roots dynamically changed especially in three aspects, signal transduction, gene translation, and protein synthesis. Total 3422 unigenes of "Zenith" root were predicted into 14 kinds of resistance (R) gene class. Potential fungal resistance related unigenes of "Zenith" root were involved in ligin biosynthesis, phytoalexin synthesis, oxidative burst, wax biosynthesis, while two down-regulated unigenes encoding leucine-rich repeat receptor protein kinase and subtilisin-like protease might be important for host-derived signal perception to AG1 IA strain invasion. According to Pathogen Host Interaction (PHI) database annotation, 1508 unigenes of AG1 IA strain were predicted and classified into 37 known pathogen species, in addition, unigenes encoding virulence, signaling, host stress tolerance, and potential effector were also predicted. This research uncovered transcriptional profiling during the early phase interaction between R. solani AG1 IA strain and Z. japonica, and will greatly help identify key pathogenicity of AG1 IA strain.

  16. Biochemical, transcriptomic and proteomic analyses of digestion in the scorpion Tityus serrulatus: insights into function and evolution of digestion in an ancient arthropod.

    PubMed

    Fuzita, Felipe J; Pinkse, Martijn W H; Patane, José S L; Juliano, Maria A; Verhaert, Peter D E M; Lopes, Adriana R

    2015-01-01

    Scorpions are among the oldest terrestrial arthropods and they have passed through small morphological changes during their evolutionary history on land. They are efficient predators capable of capturing and consuming large preys and due to envenomation these animals can become a human health challenge. Understanding the physiology of scorpions can not only lead to evolutionary insights but also is a crucial step in the development of control strategies. However, the digestive process in scorpions has been scarcely studied. In this work, we describe the combinatory use of next generation sequencing, proteomic analysis and biochemical assays in order to investigate the digestive process in the yellow scorpion Tityus serrulatus, mainly focusing in the initial protein digestion. The transcriptome generated database allowed the quantitative identification by mass spectrometry of different enzymes and proteins involved in digestion. All the results suggested that cysteine cathepsins play an important role in protein digestion. Two digestive cysteine cathepsins were isolated and characterized presenting acidic characteristics (pH optima and stability), zymogen conversion to the mature form after acidic activation and a cross-class inhibition by pepstatin. A more elucidative picture of the molecular mechanism of digestion in a scorpion was proposed based on our results from Tityus serrulatus. The midgut and midgut glands (MMG) are composed by secretory and digestive cells. In fasting animals, the secretory granules are ready for the next predation event, containing enzymes needed for alkaline extra-oral digestion which will compose the digestive fluid, such as trypsins, astacins and chitinase. The digestive vacuoles are filled with an acidic proteolytic cocktail to the intracellular digestion composed by cathepsins L, B, F, D and legumain. Other proteins as lipases, carbohydrases, ctenitoxins and a chitolectin with a perithrophin domain were also detected. Evolutionarily, a large gene duplication of cathepsin L occurred in Arachnida with the sequences from ticks being completely divergent from other arachnids probably due to the particular selective pressures over this group.

  17. BGDB: a database of bivalent genes

    PubMed Central

    Li, Qingyan; Lian, Shuabin; Dai, Zhiming; Xiang, Qian; Dai, Xianhua

    2013-01-01

    Bivalent gene is a gene marked with both H3K4me3 and H3K27me3 epigenetic modification in the same area, and is proposed to play a pivotal role related to pluripotency in embryonic stem (ES) cells. Identification of these bivalent genes and understanding their functions are important for further research of lineage specification and embryo development. So far, lots of genome-wide histone modification data were generated in mouse and human ES cells. These valuable data make it possible to identify bivalent genes, but no comprehensive data repositories or analysis tools are available for bivalent genes currently. In this work, we develop BGDB, the database of bivalent genes. The database contains 6897 bivalent genes in human and mouse ES cells, which are manually collected from scientific literature. Each entry contains curated information, including genomic context, sequences, gene ontology and other relevant information. The web services of BGDB database were implemented with PHP + MySQL + JavaScript, and provide diverse query functions. Database URL: http://dailab.sysu.edu.cn/bgdb/ PMID:23894186

  18. Heterochromatin assembly and transcriptome repression by Set1 in coordination with a class II histone deacetylase

    PubMed Central

    Lorenz, David R; Meyer, Lauren F; Grady, Patrick J R; Meyer, Michelle M; Cam, Hugh P

    2014-01-01

    Histone modifiers play essential roles in controlling transcription and organizing eukaryotic genomes into functional domains. Here, we show that Set1, the catalytic subunit of the highly conserved Set1C/COMPASS complex responsible for histone H3K4 methylation (H3K4me), behaves as a repressor of the transcriptome largely independent of Set1C and H3K4me in the fission yeast Schizosaccharomyces pombe. Intriguingly, while Set1 is enriched at highly expressed and repressed loci, Set1 binding levels do not generally correlate with the levels of transcription. We show that Set1 is recruited by the ATF/CREB homolog Atf1 to heterochromatic loci and promoters of stress-response genes. Moreover, we demonstrate that Set1 coordinates with the class II histone deacetylase Clr3 in heterochromatin assembly at prominent chromosomal landmarks and repression of the transcriptome that includes Tf2 retrotransposons, noncoding RNAs, and regulators of development and stress-responses. Our study delineates a molecular framework for elucidating the functional links between transcriptome control and chromatin organization. DOI: http://dx.doi.org/10.7554/eLife.04506.001 PMID:25497836

  19. Three Human Cell Types Respond to Multi-Walled Carbon Nanotubes and Titanium Dioxide Nanobelts with Cell-Specific Transcriptomic and Proteomic Expression Patterns.

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Tilton, Susan C.; Karin, Norman J.; Tolic, Ana

    2014-08-01

    The growing use of engineered nanoparticles (NPs) in commercial and medical applications raises the urgent need for tools that can predict NP toxicity. Global transcriptome and proteome analyses were conducted on three human cell types, exposed to two high aspect ratio NP types, to identify patterns of expression that might indicate high versus low NP toxicity. Three cell types representing the most common routes of human exposure to NPs, including macrophage-like (THP-1), small airway epithelial and intestinal (Caco-2/HT29-MTX) cells, were exposed to TiO2 nanobelts (TiO2-NB; high toxicity) and multi-walled carbon nanotubes (MWCNT; low toxicity) at low (10 µg/mL) and highmore » (100 µg/mL) concentrations for 1 and 24 h. Unique patterns of gene and protein expressions were identified for each cell type, with no differentially expressed (p < 0.05, 1.5-fold change) genes or proteins overlapping across all three cell types. While unique to each cell type, the early response was primarily independent of NP type, showing similar expression patterns in response to both TiO2-NB and MWCNT. The early response might, therefore, indicate a general response to insult. In contrast, the 24 h response was unique to each NP type. The most significantly (p < 0.05) enriched biological processes in THP-1 cells indicated TiO2-NB regulation of pathways associated with inflammation, apoptosis, cell cycle arrest, DNA replication stress and genomic instability, while MWCNT-regulated pathways indicated increased cell proliferation, DNA repair and anti-apoptosis. These two distinct sets of biological pathways might, therefore, underlie cellular responses to high and low NP toxicity, respectively.« less

  20. Rapid transcriptome sequencing of an invasive pest, the brown marmorated stink bug Halyomorpha halys.

    PubMed

    Ioannidis, Panagiotis; Lu, Yong; Kumar, Nikhil; Creasy, Todd; Daugherty, Sean; Chibucos, Marcus C; Orvis, Joshua; Shetty, Amol; Ott, Sandra; Flowers, Melissa; Sengamalay, Naomi; Tallon, Luke J; Pick, Leslie; Dunning Hotopp, Julie C

    2014-08-29

    Halyomorpha halys (Stål) (Insecta:Hemiptera;Pentatomidae), commonly known as the Brown Marmorated Stink Bug (BMSB), is an invasive pest of the mid-Atlantic region of the United States, causing economically important damage to a wide range of crops. Native to Asia, BMSB was first observed in Allentown, PA, USA, in 1996, and this pest is now well-established throughout the US mid-Atlantic region and beyond. In addition to the serious threat BMSB poses to agriculture, BMSB has become a nuisance to homeowners, invading home gardens and congregating in large numbers in human-made structures, including homes, to overwinter. Despite its significance as an agricultural pest with limited control options, only 100 bp of BMSB sequence data was available in public databases when this project began. Transcriptome sequencing was undertaken to provide a molecular resource to the research community to inform the development of pest control strategies and to provide molecular data for population genetics studies of BMSB. Using normalized, strand-specific libraries, we sequenced pools of all BMSB life stages on the Illumina HiSeq. Trinity was used to assemble 200,000 putative transcripts in >100,000 components. A novel bioinformatic method that analyzed the strand-specificity of the data reduced this to 53,071 putative transcripts from 18,573 components. By integrating multiple other data types, we narrowed this further to 13,211 representative transcripts. Bacterial endosymbiont genes were identified in this dataset, some of which have a copy number consistent with being lateral gene transfers between endosymbiont genomes and Hemiptera, including ankyrin-repeat related proteins, lysozyme, and mannanase. Such genes and endosymbionts may provide novel targets for BMSB-specific biocontrol. This study demonstrates the utility of strand-specific sequencing in generating shotgun transcriptomes and that rapid sequencing shotgun transcriptomes is possible without the need for extensive inbreeding to generate homozygous lines. Such sequencing can provide a rapid response to pest invasions similar to that already described for disease epidemiology.

  1. A high-throughput venom-gland transcriptome for the Eastern Diamondback Rattlesnake (Crotalus adamanteus) and evidence for pervasive positive selection across toxin classes.

    PubMed

    Rokyta, Darin R; Wray, Kenneth P; Lemmon, Alan R; Lemmon, Emily Moriarty; Caudle, S Brian

    2011-04-01

    Despite causing considerable human mortality and morbidity, animal toxins represent a valuable source of pharmacologically active macromolecules, a unique system for studying molecular adaptation, and a powerful framework for examining structure-function relationships in proteins. Snake venoms are particularly useful in the latter regard as they consist primarily of a moderate number of proteins and peptides that have been found to belong to just a handful of protein families. As these proteins and peptides are produced in dedicated glands, transcriptome sequencing has proven to be an effective approach to identifying the expressed toxin genes. We generated a venom-gland transcriptome for the Eastern Diamondback Rattlesnake (Crotalus adamanteus) using Roche 454 sequencing technology. In the current work, we focus on transcripts encoding toxins. We identified 40 unique toxin transcripts, 30 of which have full-length coding sequences, and 10 have only partial coding sequences. These toxins account for 24% of the total sequencing reads. We found toxins from 11 previously described families of snake-venom toxins and have discovered two putative, previously undescribed toxin classes. The most diverse and highly expressed toxin classes in the C. adamanteus venom-gland transcriptome are the serine proteinases, metalloproteinases, and C-type lectins. The serine proteinases are the most abundant class, accounting for 35% of the toxin sequencing reads. Metalloproteinases are the most diverse; 11 different forms have been identified. Using our sequences and those available in public databases, we detected positive selection in seven of the eight toxin families for which sufficient sequences were available for the analysis. We find that the vast majority of the genes that contribute directly to this vertebrate trait show evidence for a role for positive selection in their evolutionary history. Copyright © 2011 Elsevier Ltd. All rights reserved.

  2. Analysis of the skin transcriptome in two oujiang color varieties of common carp.

    PubMed

    Wang, Chenghui; Wachholtz, Michael; Wang, Jun; Liao, Xiaolin; Lu, Guoqing

    2014-01-01

    Body color and coloration patterns are important phenotypic traits to maintain survival and reproduction activities. The Oujiang color varieties of common carp (Cyprinus carpio var. color), with a narrow distribution in Zhejiang Province of China and a history of aquaculture for over 1,200 years, consistently exhibit a variety of body color patterns. The molecular mechanism underlying diverse color patterns in these variants is unknown. To the practical end, it is essential to develop molecular markers that can distinguish different phenotypes and assist selective breeding. In this exploratory study, we conducted Roche 454 transcriptome sequencing of two pooled skin tissue samples of Oujiang common carp, which correspond to distinct color patterns, red with big black spots (RB) and whole white (WW), and a total of 737,525 sequence reads were generated. The reads obtained in this study were co-assembled jointly with common carp Roche 454 sequencing reads downloaded from NCBI SRA database, resulting in 43,923 isotigs and 546,676 singletons. Over 31 thousand (31,445; 71.6%) isotigs were found with significant BLAST matches (E<1e-10) to the nr protein database, which corresponds to 12,597 annotated zebrafish genes. A total of 70,947 isotigs and singletons (transcripts) were annotated with Gene Ontology, and 60,221 transcripts were found with corresponding EC numbers. Out of 145 zebrafish pigmentation genes, orthologs for 117 were recovered in Oujiang color carp transcriptome, including 18 found only among singletons. Our transcriptome analysis revealed over 52,902 SNPs in Oujiang common carp, and identified 63 SNP markers that are putatively unique either for RB or WW. The transcriptome of Oujiang color varieties of common carp obtained through this study, along with the pigmentation genes recovered and the color pattern-specific molecular markers developed, will facilitate future research on the molecular mechanism of color patterns and promote aquaculture of Oujiang color varieties of common carp through molecular marker assisted-selective breeding.

  3. De Novo Transcriptomic Analysis of Peripheral Blood Lymphocytes from the Chinese Goose: Gene Discovery and Immune System Pathway Description

    PubMed Central

    Tariq, Mansoor; Chen, Rong; Yuan, Hongyu; Liu, Yanjie; Wu, Yanan; Wang, Junya; Xia, Chun

    2015-01-01

    Background The Chinese goose is one of the most economically important poultry birds and is a natural reservoir for many avian viruses. However, the nature and regulation of the innate and adaptive immune systems of this waterfowl species are not completely understood due to limited information on the goose genome. Recently, transcriptome sequencing technology was applied in the genomic studies focused on novel gene discovery. Thus, this study described the transcriptome of the goose peripheral blood lymphocytes to identify immunity relevant genes. Principal Findings De novo transcriptome assembly of the goose peripheral blood lymphocytes was sequenced by Illumina-Solexa technology. In total, 211,198 unigenes were assembled from the 69.36 million cleaned reads. The average length, N50 size and the maximum length of the assembled unigenes were 687 bp, 1,298 bp and 18,992 bp, respectively. A total of 36,854 unigenes showed similarity by BLAST search against the NCBI non-redundant (Nr) protein database. For functional classification, 163,161 unigenes were comprised of three Gene Ontology (Go) categories and 67 subcategories. A total of 15,334 unigenes were annotated into 25 eukaryotic orthologous groups (KOGs) categories. Kyoto Encyclopedia of Genes and Genomes (KEGG) database annotated 39,585 unigenes into six biological functional groups and 308 pathways. Among the 2,757 unigenes that participated in the 15 immune system KEGG pathways, 125 of the most important immune relevant genes were summarized and analyzed by STRING analysis to identify gene interactions and relationships. Moreover, 10 genes were confirmed by PCR and analyzed. Of these 125 unigenes, 109 unigenes, approximately 87%, were not previously identified in the goose. Conclusion This de novo transcriptome analysis could provide important Chinese goose sequence information and highlights the value of new gene discovery, pathways investigation and immune system gene identification, and comparison with other avian species as useful tools to understand the goose immune system. PMID:25816068

  4. Transcriptome analysis of genes involved in defense against alkaline stress in roots of wild jujube (Ziziphus acidojujuba)

    PubMed Central

    Tian, Shan; Wang, Bei; Zhao, Xusheng

    2017-01-01

    Wild jujube (Ziziphus acidojujuba Mill.) is highly tolerant to alkaline, saline and drought stress; however, no studies have performed transcriptome profiling to study the response of wild jujube to these and other abiotic stresses. In this study, we examined the tolerance of wild jujube to NaHCO3-NaOH solution and analyzed gene expression profiles in response to alkaline stress. Physiological experiments revealed that H2O2 content in leaves increased significantly and root activity decreased quickly during alkaline of pH 9.5 treatment. For transcriptome analysis, wild jujube plants grown hydroponically were treated with NaHCO3-NaOH solution for 0, 1, and 12 h and six transcriptomes from roots were built. In total, 32,758 genes were generated, and 3,604 differentially expressed genes (DEGs) were identified. After 1 h, 853 genes showed significantly different expression between control and treated plants; after 12 h, expression of 2,856 genes was significantly different. The expression pattern of nine genes was validated by quantitative real-time PCR. After gene annotation and gene ontology enrichment analysis, the genes encoding transcriptional factors, serine/threonine-protein kinases, heat shock proteins, cysteine-like kinases, calmodulin-like proteins, and reactive oxygen species (ROS) scavengers were found to be closely involved in alkaline stress response. These results will provide useful insights for elucidating the mechanisms underlying alkaline tolerance in wild jujube. PMID:28976994

  5. The role of transcriptome resilience in resistance of corals to bleaching.

    PubMed

    Seneca, Francois O; Palumbi, Stephen R

    2015-04-01

    Wild populations increasingly experience extreme conditions as climate change amplifies environmental variability. How individuals respond to environmental extremes determines the impact of climate change overall. The variability of response from individual to individual can represent the opportunity for natural selection to occur as a result of extreme conditions. Here, we experimentally replicated the natural exposure to extreme temperatures of the reef lagoon at Ofu Island (American Samoa), where corals can experience severe heat stress during midday low tide. We investigated the bleaching and transcriptome response of 20 Acropora hyacinthus colonies 5 and 20 h after exposure to control (29 °C) or heated (35 °C) conditions. We found a highly dynamic transcriptome response: 27% of the coral transcriptome was significantly regulated 1 h postheat exposure. Yet 15 h later, when heat-induced coral bleaching became apparent, only 12% of the transcriptome was differentially regulated. A large proportion of responsive genes at the first time point returned to control levels, others remained differentially expressed over time, while an entirely different subset of genes was successively regulated at the second time point. However, a noteworthy variability in gene expression was observed among individual coral colonies. Among the genes of which expression lingered over time, fast return to normal levels was associated with low bleaching. Colonies that maintained higher expression levels of these genes bleached severely. Return to normal levels of gene expression after stress has been termed transcriptome resilience, and in the case of some specific genes may signal the physiological health and response ability of individuals to environmental stress. © 2015 John Wiley & Sons Ltd.

  6. Blood transcriptomics and metabolomics for personalized medicine.

    PubMed

    Li, Shuzhao; Todor, Andrei; Luo, Ruiyan

    2016-01-01

    Molecular analysis of blood samples is pivotal to clinical diagnosis and has been intensively investigated since the rise of systems biology. Recent developments have opened new opportunities to utilize transcriptomics and metabolomics for personalized and precision medicine. Efforts from human immunology have infused into this area exquisite characterizations of subpopulations of blood cells. It is now possible to infer from blood transcriptomics, with fine accuracy, the contribution of immune activation and of cell subpopulations. In parallel, high-resolution mass spectrometry has brought revolutionary analytical capability, detecting > 10,000 metabolites, together with environmental exposure, dietary intake, microbial activity, and pharmaceutical drugs. Thus, the re-examination of blood chemicals by metabolomics is in order. Transcriptomics and metabolomics can be integrated to provide a more comprehensive understanding of the human biological states. We will review these new data and methods and discuss how they can contribute to personalized medicine.

  7. GenomewidePDB 2.0: A Newly Upgraded Versatile Proteogenomic Database for the Chromosome-Centric Human Proteome Project.

    PubMed

    Jeong, Seul-Ki; Hancock, William S; Paik, Young-Ki

    2015-09-04

    Since the launch of the Chromosome-centric Human Proteome Project (C-HPP) in 2012, the number of "missing" proteins has fallen to 2932, down from ∼5932 since the number was first counted in 2011. We compared the characteristics of missing proteins with those of already annotated proteins with respect to transcriptional expression pattern and the time periods in which newly identified proteins were annotated. We learned that missing proteins commonly exhibit lower levels of transcriptional expression and less tissue-specific expression compared with already annotated proteins. This makes it more difficult to identify missing proteins as time goes on. One of the C-HPP goals is to identify alternative spliced product of proteins (ASPs), which are usually difficult to find by shot-gun proteomic methods due to their sequence similarities with the representative proteins. To resolve this problem, it may be necessary to use a targeted proteomics approach (e.g., selected and multiple reaction monitoring [S/MRM] assays) and an innovative bioinformatics platform that enables the selection of target peptides for rarely expressed missing proteins or ASPs. Given that the success of efforts to identify missing proteins may rely on more informative public databases, it was necessary to upgrade the available integrative databases. To this end, we attempted to improve the features and utility of GenomewidePDB by integrating transcriptomic information (e.g., alternatively spliced transcripts), annotated peptide information, and an advanced search interface that can find proteins of interest when applying a targeted proteomics strategy. This upgraded version of the database, GenomewidePDB 2.0, may not only expedite identification of the remaining missing proteins but also enhance the exchange of information among the proteome community. GenomewidePDB 2.0 is available publicly at http://genomewidepdb.proteomix.org/.

  8. Single cell RNA sequencing of stem cell-derived retinal ganglion cells.

    PubMed

    Daniszewski, Maciej; Senabouth, Anne; Nguyen, Quan H; Crombie, Duncan E; Lukowski, Samuel W; Kulkarni, Tejal; Sluch, Valentin M; Jabbari, Jafar S; Chamling, Xitiz; Zack, Donald J; Pébay, Alice; Powell, Joseph E; Hewitt, Alex W

    2018-02-13

    We used single cell sequencing technology to characterize the transcriptomes of 1,174 human embryonic stem cell-derived retinal ganglion cells (RGCs) at the single cell level. The human embryonic stem cell line BRN3B-mCherry (A81-H7), was differentiated to RGCs using a guided differentiation approach. Cells were harvested at day 36 and prepared for single cell RNA sequencing. Our data indicates the presence of three distinct subpopulations of cells, with various degrees of maturity. One cluster of 288 cells showed increased expression of genes involved in axon guidance together with semaphorin interactions, cell-extracellular matrix interactions and ECM proteoglycans, suggestive of a more mature RGC phenotype.

  9. Functional organization of the transcriptome in human brain

    PubMed Central

    Oldham, Michael C; Konopka, Genevieve; Iwamoto, Kazuya; Langfelder, Peter; Kato, Tadafumi; Horvath, Steve; Geschwind, Daniel H

    2009-01-01

    The enormous complexity of the human brain ultimately derives from a finite set of molecular instructions encoded in the human genome. These instructions can be directly studied by exploring the organization of the brain’s transcriptome through systematic analysis of gene coexpression relationships. We analyzed gene coexpression relationships in microarray data generated from specific human brain regions and identified modules of coexpressed genes that correspond to neurons, oligodendrocytes, astrocytes and microglia. These modules provide an initial description of the transcriptional programs that distinguish the major cell classes of the human brain and indicate that cell type–specific information can be obtained from whole brain tissue without isolating homogeneous populations of cells. Other modules corresponded to additional cell types, organelles, synaptic function, gender differences and the subventricular neurogenic niche. We found that subventricular zone astrocytes, which are thought to function as neural stem cells in adults, have a distinct gene expression pattern relative to protoplasmic astrocytes. Our findings provide a new foundation for neurogenetic inquiries by revealing a robust and previously unrecognized organization to the human brain transcriptome. PMID:18849986

  10. De Novo Sequencing of Hypericum perforatum Transcriptome to Identify Potential Genes Involved in the Biosynthesis of Active Metabolites

    PubMed Central

    He, Miao; Wang, Ying; Hua, Wenping; Zhang, Yuan; Wang, Zhezhi

    2012-01-01

    Background Hypericum perforatum L. (St. John’s wort) is a medicinal plant with pharmacological properties that are antidepressant, anti-inflammatory, antiviral, anti-cancer, and antibacterial. Its major active metabolites are hypericins, hyperforins, and melatonin. However, little genetic information is available for this species, especially that concerning the biosynthetic pathways for active ingredients. Methodology/Principal Findings Using de novo transcriptome analysis, we obtained 59,184 unigenes covering the entire life cycle of these plants. In all, 40,813 unigenes (68.86%) were annotated and 2,359 were assigned to secondary metabolic pathways. Among them, 260 unigenes are involved in the production of hypericin, hyperforin, and melatonin. Another 2,291 unigenes are classified as potential Type III polyketide synthase. Our BlastX search against the AGRIS database reveals 1,772 unigenes that are homologous to 47 known Arabidopsis transcription factor families. Further analysis shows that 10.61% (6,277) of these unigenes contain 7,643 SSRs. Conclusion We have identified a set of putative genes involved in several secondary metabolism pathways, especially those related to the synthesis of its active ingredients. Our results will serve as an important platform for public information about gene expression, genomics, and functional genomics in H. perforatum. PMID:22860059

  11. The first report on transcriptome analysis of the venom gland of Iranian scorpion, Hemiscorpius lepturus.

    PubMed

    Kazemi-Lomedasht, Fatemeh; Khalaj, Vahid; Bagheri, Kamran Pooshang; Behdani, Mahdi; Shahbazzadeh, Delavar

    2017-01-01

    Hemiscorpius lepturus scorpion is one of the most venomous members of the Hemiscorpiidae family. H. lepturus is distributed in Iran, Iraq and Yemen. The prevalence and severity of scorpionism is high and health services are not able to control it. Scorpionism in Iran especially in the southern regions (Khuzestan, Sistan and Baluchestan, Hormozgan, Ilam) is one of the main health challenges. Due to the medical and health importance of scorpionism, the focus of various studies has been on the identification of H. lepturus venom components. Nevertheless, until now, only a few percent of H. lepturus venom components have been identified and there is no complete information about the venom components of H. lepturus. The current study reports transcriptome analysis of the venom gland of H. lepturus scorpion. Illumina Next Generation Sequencing results identified venom components of H. lepturus. When compared with other scorpion's venom, the venom of H. lepturus consists of mixtures of peptides, proteins and enzymes such as; phospholipases, metalloproteases, hyaluronidases, potassium channel toxins, calcium channel toxins, antimicrobial peptides (AMPs), venom proteins, venom toxins, allergens, La1-like peptides, proteases and scorpine-like peptides. Comparison of identified components of H. lepturus venom was carried out with venom components of reported scorpions and various identities and similarities between them were observed. With transcriptome analysis of H. lepturus venom unique sequences, coding venom components were investigated. Moreover, our study confirmed transcript expression of previously reported peptides; Hemitoxin, Hemicalcin and Hemilipin. The gene sequences of venom components were investigated employing transcriptome analysis of venom gland of H. lepturus. In summary, new bioactive molecules identified in this study, provide basis for venomics studies of scorpions of Hemiscorpiidae family and promises development of novel biotherapeutics. Copyright © 2016 Elsevier Ltd. All rights reserved.

  12. High-Throughput Sequence Analysis of Turbot (Scophthalmus maximus) Transcriptome Using 454-Pyrosequencing for the Discovery of Antiviral Immune Genes

    PubMed Central

    Pereiro, Patricia; Balseiro, Pablo; Romero, Alejandro; Dios, Sonia; Forn-Cuni, Gabriel; Fuste, Berta; Planas, Josep V.; Beltran, Sergi; Novoa, Beatriz; Figueras, Antonio

    2012-01-01

    Background Turbot (Scophthalmus maximus L.) is an important aquacultural resource both in Europe and Asia. However, there is little information on gene sequences available in public databases. Currently, one of the main problems affecting the culture of this flatfish is mortality due to several pathogens, especially viral diseases which are not treatable. In order to identify new genes involved in immune defense, we conducted 454-pyrosequencing of the turbot transcriptome after different immune stimulations. Methodology/Principal Findings Turbot were injected with viral stimuli to increase the expression level of immune-related genes. High-throughput deep sequencing using 454-pyrosequencing technology yielded 915,256 high-quality reads. These sequences were assembled into 55,404 contigs that were subjected to annotation steps. Intriguingly, 55.16% of the deduced protein was not significantly similar to any sequences in the databases used for the annotation and only 0.85% of the BLASTx top-hits matched S. maximus protein sequences. This relatively low level of annotation is possibly due to the limited information for this specie and other flatfish in the database. These results suggest the identification of a large number of new genes in turbot and in fish in general. A more detailed analysis showed the presence of putative members of several innate and specific immune pathways. Conclusions/Significance To our knowledge, this study is the first transcriptome analysis using 454-pyrosequencing for turbot. Previously, there were only 12,471 EST and less of 1,500 nucleotide sequences for S. maximus in NCBI database. Our results provide a rich source of data (55,404 contigs and 181,845 singletons) for discovering and identifying new genes, which will serve as a basis for microarray construction, gene expression characterization and for identification of genetic markers to be used in several applications. Immune stimulation in turbot was very effective, obtaining an enormous variety of sequences belonging to genes involved in the defense mechanisms. PMID:22629298

  13. Transcriptome analysis of the desert locust central nervous system: production and annotation of a Schistocerca gregaria EST database.

    PubMed

    Badisco, Liesbeth; Huybrechts, Jurgen; Simonet, Gert; Verlinden, Heleen; Marchal, Elisabeth; Huybrechts, Roger; Schoofs, Liliane; De Loof, Arnold; Vanden Broeck, Jozef

    2011-03-21

    The desert locust (Schistocerca gregaria) displays a fascinating type of phenotypic plasticity, designated as 'phase polyphenism'. Depending on environmental conditions, one genome can be translated into two highly divergent phenotypes, termed the solitarious and gregarious (swarming) phase. Although many of the underlying molecular events remain elusive, the central nervous system (CNS) is expected to play a crucial role in the phase transition process. Locusts have also proven to be interesting model organisms in a physiological and neurobiological research context. However, molecular studies in locusts are hampered by the fact that genome/transcriptome sequence information available for this branch of insects is still limited. We have generated 34,672 raw expressed sequence tags (EST) from the CNS of desert locusts in both phases. These ESTs were assembled in 12,709 unique transcript sequences and nearly 4,000 sequences were functionally annotated. Moreover, the obtained S. gregaria EST information is highly complementary to the existing orthopteran transcriptomic data. Since many novel transcripts encode neuronal signaling and signal transduction components, this paper includes an overview of these sequences. Furthermore, several transcripts being differentially represented in solitarious and gregarious locusts were retrieved from this EST database. The findings highlight the involvement of the CNS in the phase transition process and indicate that this novel annotated database may also add to the emerging knowledge of concomitant neuronal signaling and neuroplasticity events. In summary, we met the need for novel sequence data from desert locust CNS. To our knowledge, we hereby also present the first insect EST database that is derived from the complete CNS. The obtained S. gregaria EST data constitute an important new source of information that will be instrumental in further unraveling the molecular principles of phase polyphenism, in further establishing locusts as valuable research model organisms and in molecular evolutionary and comparative entomology.

  14. A global approach to analysis and interpretation of metabolic data for plant natural product discovery.

    PubMed

    Hur, Manhoi; Campbell, Alexis Ann; Almeida-de-Macedo, Marcia; Li, Ling; Ransom, Nick; Jose, Adarsh; Crispin, Matt; Nikolau, Basil J; Wurtele, Eve Syrkin

    2013-04-01

    Discovering molecular components and their functionality is key to the development of hypotheses concerning the organization and regulation of metabolic networks. The iterative experimental testing of such hypotheses is the trajectory that can ultimately enable accurate computational modelling and prediction of metabolic outcomes. This information can be particularly important for understanding the biology of natural products, whose metabolism itself is often only poorly defined. Here, we describe factors that must be in place to optimize the use of metabolomics in predictive biology. A key to achieving this vision is a collection of accurate time-resolved and spatially defined metabolite abundance data and associated metadata. One formidable challenge associated with metabolite profiling is the complexity and analytical limits associated with comprehensively determining the metabolome of an organism. Further, for metabolomics data to be efficiently used by the research community, it must be curated in publicly available metabolomics databases. Such databases require clear, consistent formats, easy access to data and metadata, data download, and accessible computational tools to integrate genome system-scale datasets. Although transcriptomics and proteomics integrate the linear predictive power of the genome, the metabolome represents the nonlinear, final biochemical products of the genome, which results from the intricate system(s) that regulate genome expression. For example, the relationship of metabolomics data to the metabolic network is confounded by redundant connections between metabolites and gene-products. However, connections among metabolites are predictable through the rules of chemistry. Therefore, enhancing the ability to integrate the metabolome with anchor-points in the transcriptome and proteome will enhance the predictive power of genomics data. We detail a public database repository for metabolomics, tools and approaches for statistical analysis of metabolomics data, and methods for integrating these datasets with transcriptomic data to create hypotheses concerning specialized metabolisms that generate the diversity in natural product chemistry. We discuss the importance of close collaborations among biologists, chemists, computer scientists and statisticians throughout the development of such integrated metabolism-centric databases and software.

  15. A global approach to analysis and interpretation of metabolic data for plant natural product discovery†

    PubMed Central

    Hur, Manhoi; Campbell, Alexis Ann; Almeida-de-Macedo, Marcia; Li, Ling; Ransom, Nick; Jose, Adarsh; Crispin, Matt; Nikolau, Basil J.

    2013-01-01

    Discovering molecular components and their functionality is key to the development of hypotheses concerning the organization and regulation of metabolic networks. The iterative experimental testing of such hypotheses is the trajectory that can ultimately enable accurate computational modelling and prediction of metabolic outcomes. This information can be particularly important for understanding the biology of natural products, whose metabolism itself is often only poorly defined. Here, we describe factors that must be in place to optimize the use of metabolomics in predictive biology. A key to achieving this vision is a collection of accurate time-resolved and spatially defined metabolite abundance data and associated metadata. One formidable challenge associated with metabolite profiling is the complexity and analytical limits associated with comprehensively determining the metabolome of an organism. Further, for metabolomics data to be efficiently used by the research community, it must be curated in publically available metabolomics databases. Such databases require clear, consistent formats, easy access to data and metadata, data download, and accessible computational tools to integrate genome system-scale datasets. Although transcriptomics and proteomics integrate the linear predictive power of the genome, the metabolome represents the nonlinear, final biochemical products of the genome, which results from the intricate system(s) that regulate genome expression. For example, the relationship of metabolomics data to the metabolic network is confounded by redundant connections between metabolites and gene-products. However, connections among metabolites are predictable through the rules of chemistry. Therefore, enhancing the ability to integrate the metabolome with anchor-points in the transcriptome and proteome will enhance the predictive power of genomics data. We detail a public database repository for metabolomics, tools and approaches for statistical analysis of metabolomics data, and methods for integrating these dataset with transcriptomic data to create hypotheses concerning specialized metabolism that generates the diversity in natural product chemistry. We discuss the importance of close collaborations among biologists, chemists, computer scientists and statisticians throughout the development of such integrated metabolism-centric databases and software. PMID:23447050

  16. Deep Sequencing Reveals Uncharted Isoform Heterogeneity of the Protein-Coding Transcriptome in Cerebral Ischemia.

    PubMed

    Bhattarai, Sunil; Aly, Ahmed; Garcia, Kristy; Ruiz, Diandra; Pontarelli, Fabrizio; Dharap, Ashutosh

    2018-06-03

    Gene expression in cerebral ischemia has been a subject of intense investigations for several years. Studies utilizing probe-based high-throughput methodologies such as microarrays have contributed significantly to our existing knowledge but lacked the capacity to dissect the transcriptome in detail. Genome-wide RNA-sequencing (RNA-seq) enables comprehensive examinations of transcriptomes for attributes such as strandedness, alternative splicing, alternative transcription start/stop sites, and sequence composition, thus providing a very detailed account of gene expression. Leveraging this capability, we conducted an in-depth, genome-wide evaluation of the protein-coding transcriptome of the adult mouse cortex after transient focal ischemia at 6, 12, or 24 h of reperfusion using RNA-seq. We identified a total of 1007 transcripts at 6 h, 1878 transcripts at 12 h, and 1618 transcripts at 24 h of reperfusion that were significantly altered as compared to sham controls. With isoform-level resolution, we identified 23 splice variants arising from 23 genes that were novel mRNA isoforms. For a subset of genes, we detected reperfusion time-point-dependent splice isoform switching, indicating an expression and/or functional switch for these genes. Finally, for 286 genes across all three reperfusion time-points, we discovered multiple, distinct, simultaneously expressed and differentially altered isoforms per gene that were generated via alternative transcription start/stop sites. Of these, 165 isoforms derived from 109 genes were novel mRNAs. Together, our data unravel the protein-coding transcriptome of the cerebral cortex at an unprecedented depth to provide several new insights into the flexibility and complexity of stroke-related gene transcription and transcript organization.

  17. Detailed tail proteomic analysis of axolotl (Ambystoma mexicanum) using an mRNA-seq reference database.

    PubMed

    Demircan, Turan; Keskin, Ilknur; Dumlu, Seda Nilgün; Aytürk, Nilüfer; Avşaroğlu, Mahmut Erhan; Akgün, Emel; Öztürk, Gürkan; Baykal, Ahmet Tarık

    2017-01-01

    Salamander axolotl has been emerging as an important model for stem cell research due to its powerful regenerative capacity. Several advantages, such as the high capability of advanced tissue, organ, and appendages regeneration, promote axolotl as an ideal model system to extend our current understanding on the mechanisms of regeneration. Acknowledging the common molecular pathways between amphibians and mammals, there is a great potential to translate the messages from axolotl research to mammalian studies. However, the utilization of axolotl is hindered due to the lack of reference databases of genomic, transcriptomic, and proteomic data. Here, we introduce the proteome analysis of the axolotl tail section searched against an mRNA-seq database. We translated axolotl mRNA sequences to protein sequences and annotated these to process the LC-MS/MS data and identified 1001 nonredundant proteins. Functional classification of identified proteins was performed by gene ontology searches. The presence of some of the identified proteins was validated by in situ antibody labeling. Furthermore, we have analyzed the proteome expressional changes postamputation at three time points to evaluate the underlying mechanisms of the regeneration process. Taken together, this work expands the proteomics data of axolotl to contribute to its establishment as a fully utilized model. © 2016 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  18. Combined transcriptome and metabolome analyses of metformin effects reveal novel links between metabolic networks in steroidogenic systems.

    PubMed

    Udhane, Sameer S; Legeza, Balazs; Marti, Nesa; Hertig, Damian; Diserens, Gaëlle; Nuoffer, Jean-Marc; Vermathen, Peter; Flück, Christa E

    2017-08-17

    Metformin is an antidiabetic drug, which inhibits mitochondrial respiratory-chain-complex I and thereby seems to affect the cellular metabolism in many ways. It is also used for the treatment of the polycystic ovary syndrome (PCOS), the most common endocrine disorder in women. In addition, metformin possesses antineoplastic properties. Although metformin promotes insulin-sensitivity and ameliorates reproductive abnormalities in PCOS, its exact mechanisms of action remain elusive. Therefore, we studied the transcriptome and the metabolome of metformin in human adrenal H295R cells. Microarray analysis revealed changes in 693 genes after metformin treatment. Using high resolution magic angle spinning nuclear magnetic resonance spectroscopy (HR-MAS-NMR), we determined 38 intracellular metabolites. With bioinformatic tools we created an integrated pathway analysis to understand different intracellular processes targeted by metformin. Combined metabolomics and transcriptomics data analysis showed that metformin affects a broad range of cellular processes centered on the mitochondrium. Data confirmed several known effects of metformin on glucose and androgen metabolism, which had been identified in clinical and basic studies previously. But more importantly, novel links between the energy metabolism, sex steroid biosynthesis, the cell cycle and the immune system were identified. These omics studies shed light on a complex interplay between metabolic pathways in steroidogenic systems.

  19. RNA-Seq transcriptomic analysis of the Morus alba L. leaves exposed to high-level UVB with or without dark treatment.

    PubMed

    Guan, Qijie; Yu, Jiaojiao; Zhu, Wei; Yang, Bingxian; Li, Yaohan; Zhang, Lin; Tian, Jingkui

    2018-03-01

    Ultraviolet-B (UVB) irradiation induces oxidative stress in plant cells due to the generation of excessive reactive oxygen species. Morus alba L. (M. abla) is an important medicinal plant used for the treatment of human diseases. Also, its leaves are widely used as food for silkworms. In our previous research, we found that a high level of UVB irradiation with dark incubation led to the accumulation of secondary metabolites in M. abla leaf. The aim of the present study was to describe and compare M. alba leaf transcriptomics with different treatments (control, UVB, UVB+dark). Leaf transcripts from M. alba were sequenced using an Illumina Hiseq 2000 system, which produced 14.27Gb of data including 153,204,462 paired-end reads among the three libraries. We de novo assembled 133,002 transcripts with an average length of 1270bp and filtered 69,728 non-redundant unigenes. A similarity search was performed against the non-redundant National Center of Biotechnology Information (NCBI) protein database, which returned 41.08% hits. Among the 20,040 unigenes annotated in UniProtKB/SwissProt database, 16,683 unigenes were assigned 102,232 gene ontology terms and 6667 unigenes were identified in 287 known metabolic pathways. Results of differential gene expression analysis together with real-time quantitative PCR tests indicated that UVB irradiation with dark incubation enhanced the flavonoid biosynthesis in M. alba leaf. Our findings provided a valuable proof for a better understanding of the metabolic mechanism under abiotic stresses in M. alba leaf. Copyright © 2017 Elsevier B.V. All rights reserved.

  20. Transcriptome profiling identified differentially expressed genes and pathways associated with tamoxifen resistance in human breast cancer

    PubMed Central

    Men, Xin; Ma, Jun; Wu, Tong; Pu, Junyi; Wen, Shaojia; Shen, Jianfeng; Wang, Xun; Wang, Yamin; Chen, Chao; Dai, Penggao

    2018-01-01

    Tamoxifen (TAM) resistance is an important clinical problem in the treatment of breast cancer. In order to identify the mechanism of TAM resistance for estrogen receptor (ER)-positive breast cancer, we screened the transcriptome using RNA-seq and compared the gene expression profiles between the MCF-7 mamma carcinoma cell line and the TAM-resistant cell line TAMR/MCF-7, 52 significant differential expression genes (DEGs) were identified including SLIT2, ROBO, LHX, KLF, VEGFC, BAMBI, LAMA1, FLT4, PNMT, DHRS2, MAOA and ALDH. The DEGs were annotated in the GO, COG and KEGG databases. Annotation of the function of the DEGs in the KEGG database revealed the top three pathways enriched with the most DEGs, including pathways in cancer, the PI3K-AKT pathway, and focal adhesion. Then we compared the gene expression profiles between the Clinical progressive disease (PD) and the complete response (CR) from the cancer genome altas (TCGA). 10 common DEGs were identified through combining the clinical and cellular analysis results. Protein-protein interaction network was applied to analyze the association of ER signal pathway with the 10 DEGs. 3 significant genes (GFRA3, NPY1R and PTPRN2) were closely related to ER related pathway. These significant DEGs regulated many biological activities such as cell proliferation and survival, motility and migration, and tumor cell invasion. The interactions between these DEGs and drug resistance phenomenon need to be further elucidated at a functional level in further studies. Based on our findings, we believed that these DEGs could be therapeutic targets, which can be explored to develop new treatment options. PMID:29423105

  1. SmedGD 2.0: The Schmidtea mediterranea genome database

    PubMed Central

    Robb, Sofia M.C.; Gotting, Kirsten; Ross, Eric; Sánchez Alvarado, Alejandro

    2016-01-01

    Planarians have emerged as excellent models for the study of key biological processes such as stem cell function and regulation, axial polarity specification, regeneration, and tissue homeostasis among others. The most widely used organism for these studies is the free-living flatworm Schmidtea mediterranea. In 2007, the Schmidtea mediterranea Genome Database (SmedGD) was first released to provide a much needed resource for the small, but growing planarian community. SmedGD 1.0 has been a depository for genome sequence, a draft assembly, and related experimental data (e.g., RNAi phenotypes, in situ hybridization images, and differential gene expression results). We report here a comprehensive update to SmedGD (SmedGD 2.0) that aims to expand its role as an interactive community resource. The new database includes more recent, and up-to-date transcription data, provides tools that enhance interconnectivity between different genome assemblies and transcriptomes, including next generation assemblies for both the sexual and asexual biotypes of S. mediterranea. SmedGD 2.0 (http://smedgd.stowers.org) not only provides significantly improved gene annotations, but also tools for data sharing, attributes that will help both the planarian and biomedical communities to more efficiently mine the genomics and transcriptomics of S. mediterranea. PMID:26138588

  2. CyanoEXpress: A web database for exploration and visualisation of the integrated transcriptome of cyanobacterium Synechocystis sp. PCC6803.

    PubMed

    Hernandez-Prieto, Miguel A; Futschik, Matthias E

    2012-01-01

    Synechocystis sp. PCC6803 is one of the best studied cyanobacteria and an important model organism for our understanding of photosynthesis. The early availability of its complete genome sequence initiated numerous transcriptome studies, which have generated a wealth of expression data. Analysis of the accumulated data can be a powerful tool to study transcription in a comprehensive manner and to reveal underlying regulatory mechanisms, as well as to annotate genes whose functions are yet unknown. However, use of divergent microarray platforms, as well as distributed data storage make meta-analyses of Synechocystis expression data highly challenging, especially for researchers with limited bioinformatic expertise and resources. To facilitate utilisation of the accumulated expression data for a wider research community, we have developed CyanoEXpress, a web database for interactive exploration and visualisation of transcriptional response patterns in Synechocystis. CyanoEXpress currently comprises expression data for 3073 genes and 178 environmental and genetic perturbations obtained in 31 independent studies. At present, CyanoEXpress constitutes the most comprehensive collection of expression data available for Synechocystis and can be freely accessed. The database is available for free at http://cyanoexpress.sysbiolab.eu.

  3. Reptilian-transcriptome v1.0, a glimpse in the brain transcriptome of five divergent Sauropsida lineages and the phylogenetic position of turtles.

    PubMed

    Tzika, Athanasia C; Helaers, Raphaël; Schramm, Gerrit; Milinkovitch, Michel C

    2011-09-26

    Reptiles are largely under-represented in comparative genomics despite the fact that they are substantially more diverse in many respects than mammals. Given the high divergence of reptiles from classical model species, next-generation sequencing of their transcriptomes is an approach of choice for gene identification and annotation. Here, we use 454 technology to sequence the brain transcriptome of four divergent reptilian and one reference avian species: the Nile crocodile, the corn snake, the bearded dragon, the red-eared turtle, and the chicken. Using an in-house pipeline for recursive similarity searches of >3,000,000 reads against multiple databases from 7 reference vertebrates, we compile a reptilian comparative transcriptomics dataset, with homology assignment for 20,000 to 31,000 transcripts per species and a cumulated non-redundant sequence length of 248.6 Mbases. Our approach identifies the majority (87%) of chicken brain transcripts and about 50% of de novo assembled reptilian transcripts. In addition to 57,502 microsatellite loci, we identify thousands of SNP and indel polymorphisms for population genetic and linkage analyses. We also build very large multiple alignments for Sauropsida and mammals (two million residues per species) and perform extensive phylogenetic analyses suggesting that turtles are not basal living reptiles but are rather associated with Archosaurians, hence, potentially answering a long-standing question in the phylogeny of Amniotes. The reptilian transcriptome (freely available at http://www.reptilian-transcriptomes.org) should prove a useful new resource as reptiles are becoming important new models for comparative genomics, ecology, and evolutionary developmental genetics.

  4. Optimization of De Novo Short Read Assembly of Seabuckthorn (Hippophae rhamnoides L.) Transcriptome

    PubMed Central

    Ghangal, Rajesh; Chaudhary, Saurabh; Jain, Mukesh; Purty, Ram Singh; Chand Sharma, Prakash

    2013-01-01

    Seabuckthorn ( Hippophae rhamnoides L.) is known for its medicinal, nutritional and environmental importance since ancient times. However, very limited efforts have been made to characterize the genome and transcriptome of this wonder plant. Here, we report the use of next generation massive parallel sequencing technology (Illumina platform) and de novo assembly to gain a comprehensive view of the seabuckthorn transcriptome. We assembled 86,253,874 high quality short reads using six assembly tools. At our hand, assembly of non-redundant short reads following a two-step procedure was found to be the best considering various assembly quality parameters. Initially, ABySS tool was used following an additive k-mer approach. The assembled transcripts were subsequently subjected to TGICL suite. Finally, de novo short read assembly yielded 88,297 transcripts (> 100 bp), representing about 53 Mb of seabuckthorn transcriptome. The average length of transcripts was 610 bp, N50 length 1198 BP and 91% of the short reads uniquely mapped back to seabuckthorn transcriptome. A total of 41,340 (46.8%) transcripts showed significant similarity with sequences present in nr protein databases of NCBI (E-value < 1E-06). We also screened the assembled transcripts for the presence of transcription factors and simple sequence repeats. Our strategy involving the use of short read assembler (ABySS) followed by TGICL will be useful for the researchers working with a non-model organism’s transcriptome in terms of saving time and reducing complexity in data management. The seabuckthorn transcriptome data generated here provide a valuable resource for gene discovery and development of functional molecular markers. PMID:23991119

  5. Integrative "omic" analysis of experimental bacteremia identifies a metabolic signature that distinguishes human sepsis from systemic inflammatory response syndromes.

    PubMed

    Langley, Raymond J; Tipper, Jennifer L; Bruse, Shannon; Baron, Rebecca M; Tsalik, Ephraim L; Huntley, James; Rogers, Angela J; Jaramillo, Richard J; O'Donnell, Denise; Mega, William M; Keaton, Mignon; Kensicki, Elizabeth; Gazourian, Lee; Fredenburgh, Laura E; Massaro, Anthony F; Otero, Ronny M; Fowler, Vance G; Rivers, Emanuel P; Woods, Chris W; Kingsmore, Stephen F; Sopori, Mohan L; Perrella, Mark A; Choi, Augustine M K; Harrod, Kevin S

    2014-08-15

    Sepsis is a leading cause of morbidity and mortality. Currently, early diagnosis and the progression of the disease are difficult to make. The integration of metabolomic and transcriptomic data in a primate model of sepsis may provide a novel molecular signature of clinical sepsis. To develop a biomarker panel to characterize sepsis in primates and ascertain its relevance to early diagnosis and progression of human sepsis. Intravenous inoculation of Macaca fascicularis with Escherichia coli produced mild to severe sepsis, lung injury, and death. Plasma samples were obtained before and after 1, 3, and 5 days of E. coli challenge and at the time of killing. At necropsy, blood, lung, kidney, and spleen samples were collected. An integrative analysis of the metabolomic and transcriptomic datasets was performed to identify a panel of sepsis biomarkers. The extent of E. coli invasion, respiratory distress, lethargy, and mortality was dependent on the bacterial dose. Metabolomic and transcriptomic changes characterized severe infections and death, and indicated impaired mitochondrial, peroxisomal, and liver functions. Analysis of the pulmonary transcriptome and plasma metabolome suggested impaired fatty acid catabolism regulated by peroxisome-proliferator activated receptor signaling. A representative four-metabolite model effectively diagnosed sepsis in primates (area under the curve, 0.966) and in two human sepsis cohorts (area under the curve, 0.78 and 0.82). A model of sepsis based on reciprocal metabolomic and transcriptomic data was developed in primates and validated in two human patient cohorts. It is anticipated that the identified parameters will facilitate early diagnosis and management of sepsis.

  6. CBrowse: a SAM/BAM-based contig browser for transcriptome assembly visualization and analysis.

    PubMed

    Li, Pei; Ji, Guoli; Dong, Min; Schmidt, Emily; Lenox, Douglas; Chen, Liangliang; Liu, Qi; Liu, Lin; Zhang, Jie; Liang, Chun

    2012-09-15

    To address the impending need for exploring rapidly increased transcriptomics data generated for non-model organisms, we developed CBrowse, an AJAX-based web browser for visualizing and analyzing transcriptome assemblies and contigs. Designed in a standard three-tier architecture with a data pre-processing pipeline, CBrowse is essentially a Rich Internet Application that offers many seamlessly integrated web interfaces and allows users to navigate, sort, filter, search and visualize data smoothly. The pre-processing pipeline takes the contig sequence file in FASTA format and its relevant SAM/BAM file as the input; detects putative polymorphisms, simple sequence repeats and sequencing errors in contigs and generates image, JSON and database-compatible CSV text files that are directly utilized by different web interfaces. CBowse is a generic visualization and analysis tool that facilitates close examination of assembly quality, genetic polymorphisms, sequence repeats and/or sequencing errors in transcriptome sequencing projects. CBrowse is distributed under the GNU General Public License, available at http://bioinfolab.muohio.edu/CBrowse/ liangc@muohio.edu or liangc.mu@gmail.com; glji@xmu.edu.cn Supplementary data are available at Bioinformatics online.

  7. Comparative transcriptomics with self-organizing map reveals cryptic photosynthetic differences between two accessions of North American Lake cress.

    PubMed

    Nakayama, Hokuto; Sakamoto, Tomoaki; Okegawa, Yuki; Kaminoyama, Kaori; Fujie, Manabu; Ichihashi, Yasunori; Kurata, Tetsuya; Motohashi, Ken; Al-Shehbaz, Ihsan; Sinha, Neelima; Kimura, Seisuke

    2018-02-19

    Because natural variation in wild species is likely the result of local adaptation, it provides a valuable resource for understanding plant-environmental interactions. Rorippa aquatica (Brassicaceae) is a semi-aquatic North American plant with morphological differences between several accessions, but little information available on any physiological differences. Here, we surveyed the transcriptomes of two R. aquatica accessions and identified cryptic physiological differences between them. We first reconstructed a Rorippa phylogeny to confirm relationships between the accessions. We performed large-scale RNA-seq and de novo assembly; the resulting 87,754 unigenes were then annotated via comparisons to different databases. Between-accession physiological variation was identified with transcriptomes from both accessions. Transcriptome data were analyzed with principal component analysis and self-organizing map. Results of analyses suggested that photosynthetic capability differs between the accessions. Indeed, physiological experiments revealed between-accession variation in electron transport rate and the redox state of the plastoquinone pool. These results indicated that one accession may have adapted to differences in temperature or length of the growing season.

  8. Aberrant activation of the human sex-determining gene in early embryonic development results in postnatal growth retardation and lethality in mice.

    PubMed

    Kido, Tatsuo; Sun, Zhaoyu; Lau, Yun-Fai Chris

    2017-06-23

    Sexual dimorphisms are prevalent in development, physiology and diseases in humans. Currently, the contributions of the genes on the male-specific region of the Y chromosome (MSY) in these processes are uncertain. Using a transgene activation system, the human sex-determining gene hSRY is activated in the single-cell embryos of the mouse. Pups with hSRY activated (hSRY ON ) are born of similar sizes as those of non-activated controls. However, they retard significantly in postnatal growth and development and all die of multi-organ failure before two weeks of age. Pathological and molecular analyses indicate that hSRY ON pups lack innate suckling activities, and develop fatty liver disease, arrested alveologenesis in the lung, impaired neurogenesis in the brain and occasional myocardial fibrosis and minimized thymus development. Transcriptome analysis shows that, in addition to those unique to the respective organs, various cell growth and survival pathways and functions are differentially affected in the transgenic mice. These observations suggest that ectopic activation of a Y-located SRY gene could exert male-specific effects in development and physiology of multiple organs, thereby contributing to sexual dimorphisms in normal biological functions and disease processes in affected individuals.

  9. De novo transcriptome characterization and gene expression profiling of the desiccation tolerant moss Bryum argenteum following rehydration.

    PubMed

    Gao, Bei; Zhang, Daoyuan; Li, Xiaoshuang; Yang, Honglan; Zhang, Yuanming; Wood, Andrew J

    2015-05-28

    The desiccation-tolerant moss Bryum argenteum is an important component of the Biological Soil Crusts (BSCs) found in the Gurbantunggut desert. Desiccation tolerance is defined as the ability to revive from the air dried state. To elucidate the molecular mechanisms related to desiccation tolerance, we employed RNA-Seq and digital gene expression (DGE) technologies to study the genome-wide expression profiles of the dehydration and rehydration processes in this important desert plant. We applied a two-step approach to investigate the gene expression profile upon rehydration in the moss Bryum argenteum using Illumina HiSeq2000 sequencing platform. First, a total of 57,247 transcript assembly contigs (TACs) were obtained from 54.79 million reads by de novo assembly, with an average length of 863 bp and N50 of 1,372 bp. Among the reconstructed TACs, 36,916 (64.5%) revealed similarity with existing protein sequences in the public databases. 23,509 and 21,607 TACs were assigned GO and KEGG annotation information, respectively. Second, samples were taken from 3 hydration stages: desiccated (Dry), rehydrated 2 h (R2) and rehydrated 24 h (R24), and DEG libraries were constructed for Differentially Expressed Genes (DEGs) discovery. 4,081 and 6,709 DEGs were identified in R2 and R24, compared with Dry, respectively. Compared to the desiccated sample, up-regulated genes after two hours of hydration are primarily related to stress responses. GO function enrichment network, EKGG metabolic pathway and MapMan analysis supports the idea of the rapid recovery of photosynthesis after 24 h of rehydration. We identified 770 transcription factors (TFs) which were classified into 50 TF families. 142 TF transcripts were up-regulated upon rehydration including 23 members of the ERF family. In this study, we constructed a pioneering, high-quality reference transcriptome in B. argenteum and generated three DGE libraries to elucidate the changes of gene expression upon rehydration. Expression profiles consistent with the rapid recovery of photosynthesis (at R2) and the re-establishment of a positive carbon balance following rehydration (at R24) were observed. Our study will extend our knowledge of bryophyte transcriptomes and provide further insight into the molecular mechanisms related to rehydration and desiccation-tolerance.

  10. Effects of cognitive behavioral therapy in patients with depressive disorder and comorbid insomnia: A propensity score-matched outcome study.

    PubMed

    Hsu, Hui-Min; Chou, Kuei-Ru; Lin, Kuan-Chia; Chen, Kuan-Yu; Su, Shu-Fang; Chung, Min-Huey

    2015-10-01

    We evaluated the effects of cognitive behavioral therapy for insomnia (CBT-I) in inpatients with a diagnosis of depression and comorbid insomnia. This study used a prospective, parallel-group design. The experimental group received CBT-I for no more than 90 min once weekly for 6 weeks and the control group only have health education manuals for insomnia. The following questionnaires were administered at baseline: the Hamilton Rating Scale for Depression (HAM-D), Dysfunctional Beliefs and Attitudes about Sleep (DBAS), Presleep Arousal Scale (PSAS), Sleep Hygiene Practice (SHP), and Pittsburgh Sleep Quality Index. The questionnaires were readministered after the completion of the 6-wk CBT-I intervention and 1 month following the completion of CBT-I, to determine the effects of the CBT-I intervention over time. The analysis of Generalized Estimation Equations was identified the difference between the experimental group and the control group by controlling for the variables in BZD dose and propensity score of gender, age, and the scores for the DBAS-16, PSAS, SHPS, and HAM-D. Consequently, the significant difference in the PSQI scores was observed at the 1-month follow-up assessment however, no significant intergroup difference in the PSQI scores was found at the completion of the CBT-I intervention between two groups. As a conclusion, we found that overall sleep quality significantly improved in patients who received CBT-I after we controlled for the BZD dose and propensity score, which suggests that CBT-I may represent a useful clinical strategy for improving sleep quality in patients with depression and comorbid insomnia. Copyright © 2015 Elsevier Ltd. All rights reserved.

  11. Insight into Catechins Metabolic Pathways of Camellia sinensis Based on Genome and Transcriptome Analysis.

    PubMed

    Wang, Wenzhao; Zhou, Yihui; Wu, Yingling; Dai, Xinlong; Liu, Yajun; Qian, Yumei; Li, Mingzhuo; Jiang, Xiaolan; Wang, Yunsheng; Gao, Liping; Xia, Tao

    2018-04-25

    Tea is an important economic crop with a 3.02 Gb genome. It accumulates various bioactive compounds, especially catechins, which are closely associated with tea flavor and quality. Catechins are biosynthesized through the phenylpropanoid and flavonoid pathways, with 12 structural genes being involved in their synthesis. However, we found that in Camellia sinensis the understanding of the basic profile of catechins biosynthesis is still unclear. The gene structure, locus, transcript number, transcriptional variation, and function of multigene families have not yet been clarified. Our previous studies demonstrated that the accumulation of flavonoids in tea is species, tissue, and induction specific, which indicates that gene coexpression patterns may be involved in tea catechins and flavonoids biosynthesis. In this paper, we screened candidate genes of multigene families involved in the phenylpropanoid and flavonoid pathways based on an analysis of genome and transcriptome sequence data. The authenticity of candidate genes was verified by PCR cloning, and their function was validated by reverse genetic methods. In the present study, 36 genes from 12 gene families were identified and were accessed in the NCBI database. During this process, some intron retention events of the CsCHI and CsDFR genes were found. Furthermore, the transcriptome sequencing of various tea tissues and subcellular location assays revealed coexpression and colocalization patterns. The correlation analysis showed that CsCHIc, CsF3'H, and CsANRb expression levels are associated significantly with the concentration of soluble PA as well as the expression levels of CsPALc and CsPALf with the concentration of insoluble PA. This work provides insights into catechins metabolism in tea and provides a foundation for future studies.

  12. Primary Cell Culture of Live Neurosurgically Resected Aged Adult Human Brain Cells and Single Cell Transcriptomics.

    PubMed

    Spaethling, Jennifer M; Na, Young-Ji; Lee, Jaehee; Ulyanova, Alexandra V; Baltuch, Gordon H; Bell, Thomas J; Brem, Steven; Chen, H Isaac; Dueck, Hannah; Fisher, Stephen A; Garcia, Marcela P; Khaladkar, Mugdha; Kung, David K; Lucas, Timothy H; O'Rourke, Donald M; Stefanik, Derek; Wang, Jinhui; Wolf, John A; Bartfai, Tamas; Grady, M Sean; Sul, Jai-Yoon; Kim, Junhyong; Eberwine, James H

    2017-01-17

    Investigation of human CNS disease and drug effects has been hampered by the lack of a system that enables single-cell analysis of live adult patient brain cells. We developed a culturing system, based on a papain-aided procedure, for resected adult human brain tissue removed during neurosurgery. We performed single-cell transcriptomics on over 300 cells, permitting identification of oligodendrocytes, microglia, neurons, endothelial cells, and astrocytes after 3 weeks in culture. Using deep sequencing, we detected over 12,000 expressed genes, including hundreds of cell-type-enriched mRNAs, lncRNAs and pri-miRNAs. We describe cell-type- and patient-specific transcriptional hierarchies. Single-cell transcriptomics on cultured live adult patient derived cells is a prime example of the promise of personalized precision medicine. Because these cells derive from subjects ranging in age into their sixties, this system permits human aging studies previously possible only in rodent systems. Copyright © 2017 The Author(s). Published by Elsevier Inc. All rights reserved.

  13. De Novo Transcriptome Sequencing Reveals Important Molecular Networks and Metabolic Pathways of the Plant, Chlorophytum borivilianum

    PubMed Central

    Kalra, Shikha; Puniya, Bhanwar Lal; Kulshreshtha, Deepika; Kumar, Sunil; Kaur, Jagdeep; Ramachandran, Srinivasan; Singh, Kashmir

    2013-01-01

    Chlorophytum borivilianum, an endangered medicinal plant species is highly recognized for its aphrodisiac properties provided by saponins present in the plant. The transcriptome information of this species is limited and only few hundred expressed sequence tags (ESTs) are available in the public databases. To gain molecular insight of this plant, high throughput transcriptome sequencing of leaf RNA was carried out using Illumina's HiSeq 2000 sequencing platform. A total of 22,161,444 single end reads were retrieved after quality filtering. Available (e.g., De-Bruijn/Eulerian graph) and in-house developed bioinformatics tools were used for assembly and annotation of transcriptome. A total of 101,141 assembled transcripts were obtained, with coverage size of 22.42 Mb and average length of 221 bp. Guanine-cytosine (GC) content was found to be 44%. Bioinformatics analysis, using non-redundant proteins, gene ontology (GO), enzyme commission (EC) and kyoto encyclopedia of genes and genomes (KEGG) databases, extracted all the known enzymes involved in saponin and flavonoid biosynthesis. Few genes of the alkaloid biosynthesis, along with anticancer and plant defense genes, were also discovered. Additionally, several cytochrome P450 (CYP450) and glycosyltransferase unique sequences were also found. We identified simple sequence repeat motifs in transcripts with an abundance of di-nucleotide simple sequence repeat (SSR; 43.1%) markers. Large scale expression profiling through Reads per Kilobase per Million mapped reads (RPKM) showed major genes involved in different metabolic pathways of the plant. Genes, expressed sequence tags (ESTs) and unique sequences from this study provide an important resource for the scientific community, interested in the molecular genetics and functional genomics of C. borivilianum. PMID:24376689

  14. Transcriptome Analysis of Dendrobium officinale and its Application to the Identification of Genes Associated with Polysaccharide Synthesis

    PubMed Central

    Zhang, Jianxia; He, Chunmei; Wu, Kunlin; Teixeira da Silva, Jaime A.; Zeng, Songjun; Zhang, Xinhua; Yu, Zhenming; Xia, Haoqiang; Duan, Jun

    2016-01-01

    Dendrobium officinale is one of the most important Chinese medicinal herbs. Polysaccharides are one of the main active ingredients of D. officinale. To identify the genes that maybe related to polysaccharides synthesis, two cDNA libraries were prepared from juvenile and adult D. officinale, and were named Dendrobium-1 and Dendrobium-2, respectively. Illumina sequencing for Dendrobium-1 generated 102 million high quality reads that were assembled into 93,881 unigenes with an average sequence length of 790 base pairs. The sequencing for Dendrobium-2 generated 86 million reads that were assembled into 114,098 unigenes with an average sequence length of 695 base pairs. Two transcriptome databases were integrated and assembled into a total of 145,791 unigenes. Among them, 17,281 unigenes were assigned to 126 KEGG pathways while 135 unigenes were involved in fructose and mannose metabolism. Gene Ontology analysis revealed that the majority of genes were associated with metabolic and cellular processes. Furthermore, 430 glycosyltransferase and 89 cellulose synthase genes were identified. Comparative analysis of both transcriptome databases revealed a total of 32,794 differential expression genes (DEGs), including 22,051 up-regulated and 10,743 down-regulated genes in Dendrobium-2 compared to Dendrobium-1. Furthermore, a total of 1142 and 7918 unigenes showed unique expression in Dendrobium-1 and Dendrobium-2, respectively. These DEGs were mainly correlated with metabolic pathways and the biosynthesis of secondary metabolites. In addition, 170 DEGs belonged to glycosyltransferase genes, 37 DEGs were related to cellulose synthase genes and 627 DEGs encoded transcription factors. This study substantially expands the transcriptome information for D. officinale and provides valuable clues for identifying candidate genes involved in polysaccharide biosynthesis and elucidating the mechanism of polysaccharide biosynthesis. PMID:26904032

  15. De Novo transcriptome sequencing reveals important molecular networks and metabolic pathways of the plant, Chlorophytum borivilianum.

    PubMed

    Kalra, Shikha; Puniya, Bhanwar Lal; Kulshreshtha, Deepika; Kumar, Sunil; Kaur, Jagdeep; Ramachandran, Srinivasan; Singh, Kashmir

    2013-01-01

    Chlorophytum borivilianum, an endangered medicinal plant species is highly recognized for its aphrodisiac properties provided by saponins present in the plant. The transcriptome information of this species is limited and only few hundred expressed sequence tags (ESTs) are available in the public databases. To gain molecular insight of this plant, high throughput transcriptome sequencing of leaf RNA was carried out using Illumina's HiSeq 2000 sequencing platform. A total of 22,161,444 single end reads were retrieved after quality filtering. Available (e.g., De-Bruijn/Eulerian graph) and in-house developed bioinformatics tools were used for assembly and annotation of transcriptome. A total of 101,141 assembled transcripts were obtained, with coverage size of 22.42 Mb and average length of 221 bp. Guanine-cytosine (GC) content was found to be 44%. Bioinformatics analysis, using non-redundant proteins, gene ontology (GO), enzyme commission (EC) and kyoto encyclopedia of genes and genomes (KEGG) databases, extracted all the known enzymes involved in saponin and flavonoid biosynthesis. Few genes of the alkaloid biosynthesis, along with anticancer and plant defense genes, were also discovered. Additionally, several cytochrome P450 (CYP450) and glycosyltransferase unique sequences were also found. We identified simple sequence repeat motifs in transcripts with an abundance of di-nucleotide simple sequence repeat (SSR; 43.1%) markers. Large scale expression profiling through Reads per Kilobase per Million mapped reads (RPKM) showed major genes involved in different metabolic pathways of the plant. Genes, expressed sequence tags (ESTs) and unique sequences from this study provide an important resource for the scientific community, interested in the molecular genetics and functional genomics of C. borivilianum.

  16. FusionHub: A unified web platform for annotation and visualization of gene fusion events in human cancer.

    PubMed

    Panigrahi, Priyabrata; Jere, Abhay; Anamika, Krishanpal

    2018-01-01

    Gene fusion is a chromosomal rearrangement event which plays a significant role in cancer due to the oncogenic potential of the chimeric protein generated through fusions. At present many databases are available in public domain which provides detailed information about known gene fusion events and their functional role. Existing gene fusion detection tools, based on analysis of transcriptomics data usually report a large number of fusion genes as potential candidates, which could be either known or novel or false positives. Manual annotation of these putative genes is indeed time-consuming. We have developed a web platform FusionHub, which acts as integrated search engine interfacing various fusion gene databases and simplifies large scale annotation of fusion genes in a seamless way. In addition, FusionHub provides three ways of visualizing fusion events: circular view, domain architecture view and network view. Design of potential siRNA molecules through ensemble method is another utility integrated in FusionHub that could aid in siRNA-based targeted therapy. FusionHub is freely available at https://fusionhub.persistent.co.in.

  17. The de novo Transcriptome and Its Analysis in the Worldwide Vegetable Pest, Delia antiqua (Diptera: Anthomyiidae)

    PubMed Central

    Zhang, Yu-Juan; Hao, Youjin; Si, Fengling; Ren, Shuang; Hu, Ganyu; Shen, Li; Chen, Bin

    2014-01-01

    The onion maggot Delia antiqua is a major insect pest of cultivated vegetables, especially the onion, and a good model to investigate the molecular mechanisms of diapause. To better understand the biology and diapause mechanism of the insect pest species, D. antiqua, the transcriptome was sequenced using Illumina paired-end sequencing technology. Approximately 54 million reads were obtained, trimmed, and assembled into 29,659 unigenes, with an average length of 607 bp and an N50 of 818 bp. Among these unigenes, 21,605 (72.8%) were annotated in the public databases. All unigenes were then compared against Drosophila melanogaster and Anopheles gambiae. Codon usage bias was analyzed and 332 simple sequence repeats (SSRs) were detected in this organism. These data represent the most comprehensive transcriptomic resource currently available for D. antiqua and will facilitate the study of genetics, genomics, diapause, and further pest control of D. antiqua. PMID:24615268

  18. Transcriptome sequencing and annotation of the halophytic microalga Dunaliella salina * #

    PubMed Central

    Hong, Ling; Liu, Jun-li; Midoun, Samira Z.; Miller, Philip C.

    2017-01-01

    The unicellular green alga Dunaliella salina is well adapted to salt stress and contains compounds (including β-carotene and vitamins) with potential commercial value. A large transcriptome database of D. salina during the adjustment, exponential and stationary growth phases was generated using a high throughput sequencing platform. We characterized the metabolic processes in D. salina with a focus on valuable metabolites, with the aim of manipulating D. salina to achieve greater economic value in large-scale production through a bioengineering strategy. Gene expression profiles under salt stress verified using quantitative polymerase chain reaction (qPCR) implied that salt can regulate the expression of key genes. This study generated a substantial fraction of D. salina transcriptional sequences for the entire growth cycle, providing a basis for the discovery of novel genes. This first full-scale transcriptome study of D. salina establishes a foundation for further comparative genomic studies. PMID:28990374

  19. Dataset of potential targets for Mycobacterium tuberculosis H37Rv through comparative genome analysis.

    PubMed

    Asif, Siddiqui M; Asad, Amir; Faizan, Ahmad; Anjali, Malik S; Arvind, Arya; Neelesh, Kapoor; Hirdesh, Kumar; Sanjay, Kumar

    2009-12-31

    Mycobacterium tuberculosis is the causative agent of the disease, tuberculosis and H37Rv is the most studied clinical strain. We use comparative genome analysis of Mycobacterium tuberculosis H37Rv and human for the identification of potential targets dataset. We used DEG (Database of Essential Genes) to identify essential genes in the H37Rv strain. The analysis shows that 628 of the 3989 genes in Mycobacterium tuberculosis H37Rv were found to be essential of which 324 genes lack similarity to the human genome. Subsequently hypothetical proteins were removed through manual curation. This further resulted in a dataset of 135 proteins with essential function and no homology to human.

  20. A comprehensive analysis of the human placenta transcriptome

    USDA-ARS?s Scientific Manuscript database

    As the conduit for nutrients and growth signals, the placenta is critical to establishing an environment sufficient for fetal growth and development. To better understand the mechanisms regulating placental development and gene expression, we characterized the transcriptome of term placenta from 20 ...

  1. Publishing SNP genotypes of human embryonic stem cell lines: policy statement of the International Stem Cell Forum Ethics Working Party.

    PubMed

    Knoppers, Bartha M; Isasi, Rosario; Benvenisty, Nissim; Kim, Ock-Joo; Lomax, Geoffrey; Morris, Clive; Murray, Thomas H; Lee, Eng Hin; Perry, Margery; Richardson, Genevra; Sipp, Douglas; Tanner, Klaus; Wahlström, Jan; de Wert, Guido; Zeng, Fanyi

    2011-09-01

    Novel methods and associated tools permitting individual identification in publicly accessible SNP databases have become a debatable issue. There is growing concern that current technical and ethical safeguards to protect the identities of donors could be insufficient. In the context of human embryonic stem cell research, there are no studies focusing on the probability that an hESC line donor could be identified by analyzing published SNP profiles and associated genotypic and phenotypic information. We present the International Stem Cell Forum (ISCF) Ethics Working Party's Policy Statement on "Publishing SNP Genotypes of Human Embryonic Stem Cell Lines (hESC)". The Statement prospectively addresses issues surrounding the publication of genotypic data and associated annotations of hESC lines in open access databases. It proposes a balanced approach between the goals of open science and data sharing with the respect for fundamental bioethical principles (autonomy, privacy, beneficence, justice and research merit and integrity).

  2. Comparative genomics and transcriptome analysis of Aspergillus niger and metabolic engineering for citrate production

    PubMed Central

    Yin, Xian; Shin, Hyun-dong; Li, Jianghua; Du, Guocheng; Liu, Long; Chen, Jian

    2017-01-01

    Despite a long and successful history of citrate production in Aspergillus niger, the molecular mechanism of citrate accumulation is only partially understood. In this study, we used comparative genomics and transcriptome analysis of citrate-producing strains—namely, A. niger H915-1 (citrate titer: 157 g L−1), A1 (117 g L−1), and L2 (76 g L−1)—to gain a genome-wide view of the mechanism of citrate accumulation. Compared with A. niger A1 and L2, A. niger H915-1 contained 92 mutated genes, including a succinate-semialdehyde dehydrogenase in the γ-aminobutyric acid shunt pathway and an aconitase family protein involved in citrate synthesis. Furthermore, transcriptome analysis of A. niger H915-1 revealed that the transcription levels of 479 genes changed between the cell growth stage (6 h) and the citrate synthesis stage (12 h, 24 h, 36 h, and 48 h). In the glycolysis pathway, triosephosphate isomerase was up-regulated, whereas pyruvate kinase was down-regulated. Two cytosol ATP-citrate lyases, which take part in the cycle of citrate synthesis, were up-regulated, and may coordinate with the alternative oxidases in the alternative respiratory pathway for energy balance. Finally, deletion of the oxaloacetate acetylhydrolase gene in H915-1 eliminated oxalate formation but neither influence on pH decrease nor difference in citrate production were observed. PMID:28106122

  3. Transcriptome Network Analysis Reveals Aging-Related Mitochondrial and Proteasomal Dysfunction and Immune Activation in Human Thyroid

    PubMed Central

    Cho, Byuri Angela; Yoo, Seong-Keun; Song, Young Shin; Kim, Su-jin; Lee, Kyu Eun; Shong, Minho

    2018-01-01

    Background: Elucidating aging-related transcriptomic changes in human organs is necessary to understand the aging physiology and mechanisms, but little is known regarding the thyroid gland. We investigated aging-related transcriptomic alterations in the human thyroid gland and characterized the related molecular functions. Methods: Publicly available RNA sequencing data of 322 thyroid tissue samples from the Genotype-Tissue Expression project were analyzed. In addition, our own 64 RNA sequencing data of normal thyroid tissue samples were used as a validation set. To comprehensively evaluate the associations between aging and transcriptomic changes, we performed a weighted gene coexpression network analysis and pathway enrichment analysis. The thyroid differentiation score was then used for further analysis, defining the correlations between thyroid differentiation and aging. Results: The most significant aging-related transcriptomic change in thyroid was the downregulation of genes related to the mitochondrial and proteasomal functions (p = 3 × 10−6). Moreover, genes that are associated with immune processes were significantly upregulated with age (p = 3 × 10−4), and all of them overlapped with the upregulated genes in the thyroid glands affected by lymphocytic thyroiditis. Furthermore, these aging-related changes were not significantly different according to sex, but in terms of the thyroid differentiation, females were more susceptible to aging-related changes (p for trend = 0.03). Conclusions: Aging-related transcriptomic changes in the thyroid gland were associated with mitochondrial and proteasomal dysfunction, loss of differentiation, and activation of autoimmune processes. Our results provide clues to better understanding the age-related decline in thyroid function and higher susceptibility to autoimmune thyroid disease. PMID:29652618

  4. De novo assembly and characterization of bark transcriptome using Illumina sequencing and development of EST-SSR markers in rubber tree (Hevea brasiliensis Muell. Arg.)

    PubMed Central

    2012-01-01

    Background In rubber tree, bark is one of important agricultural and biological organs. However, the molecular mechanism involved in the bark formation and development in rubber tree remains largely unknown, which is at least partially due to lack of bark transcriptomic and genomic information. Therefore, it is necessary to carried out high-throughput transcriptome sequencing of rubber tree bark to generate enormous transcript sequences for the functional characterization and molecular marker development. Results In this study, more than 30 million sequencing reads were generated using Illumina paired-end sequencing technology. In total, 22,756 unigenes with an average length of 485 bp were obtained with de novo assembly. The similarity search indicated that 16,520 and 12,558 unigenes showed significant similarities to known proteins from NCBI non-redundant and Swissprot protein databases, respectively. Among these annotated unigenes, 6,867 and 5,559 unigenes were separately assigned to Gene Ontology (GO) and Clusters of Orthologous Group (COG). When 22,756 unigenes searched against the Kyoto Encyclopedia of Genes and Genomes Pathway (KEGG) database, 12,097 unigenes were assigned to 5 main categories including 123 KEGG pathways. Among the main KEGG categories, metabolism was the biggest category (9,043, 74.75%), suggesting the active metabolic processes in rubber tree bark. In addition, a total of 39,257 EST-SSRs were identified from 22,756 unigenes, and the characterizations of EST-SSRs were further analyzed in rubber tree. 110 potential marker sites were randomly selected to validate the assembly quality and develop EST-SSR markers. Among 13 Hevea germplasms, PCR success rate and polymorphism rate of 110 markers were separately 96.36% and 55.45% in this study. Conclusion By assembling and analyzing de novo transcriptome sequencing data, we reported the comprehensive functional characterization of rubber tree bark. This research generated a substantial fraction of rubber tree transcriptome sequences, which were very useful resources for gene annotation and discovery, molecular markers development, genome assembly and annotation, and microarrays development in rubber tree. The EST-SSR markers identified and developed in this study will facilitate marker-assisted selection breeding in rubber tree. Moreover, this study also supported that transcriptome analysis based on Illumina paired-end sequencing is a powerful tool for transcriptome characterization and molecular marker development in non-model species, especially those with large and complex genomes. PMID:22607098

  5. Dibenz[a,h]anthracene

    Integrated Risk Information System (IRIS)

    Dibenz [ a , h ] anthracene ; CASRN 53 - 70 - 3 Human health assessment information on a chemical substance is included in the IRIS database only after a comprehensive review of toxicity data , as outlined in the IRIS assessment development process . Sections I ( Health Hazard Assessments for Noncar

  6. Transcriptomic Analysis of the Endangered Neritid Species Clithon retropictus: De Novo Assembly, Functional Annotation, and Marker Discovery

    PubMed Central

    Park, So Young; Patnaik, Bharat Bhusan; Kang, Se Won; Hwang, Hee-Ju; Chung, Jong Min; Song, Dae Kwon; Sang, Min Kyu; Patnaik, Hongray Howrelia; Lee, Jae Bong; Noh, Mi Young; Kim, Changmu; Kim, Soonok; Park, Hong Seog; Lee, Jun Sang; Han, Yeon Soo; Lee, Yong Seok

    2016-01-01

    An aquatic gastropod belonging to the family Neritidae, Clithon retropictus is listed as an endangered class II species in South Korea. The lack of information on its genomic background limits the ability to obtain functional data resources and inhibits informed conservation planning for this species. In the present study, the transcriptomic sequencing and de novo assembly of C. retropictus generated a total of 241,696,750 high-quality reads. These assembled to 282,838 unigenes with mean and N50 lengths of 736.9 and 1201 base pairs, respectively. Of these, 125,616 unigenes were subjected to annotation analysis with known proteins in Protostome DB, COG, GO, and KEGG protein databases (BLASTX; E ≤ 0.00001) and with known nucleotides in the Unigene database (BLASTN; E ≤ 0.00001). The GO analysis indicated that cellular process, cell, and catalytic activity are the predominant GO terms in the biological process, cellular component, and molecular function categories, respectively. In addition, 2093 unigenes were distributed in 107 different KEGG pathways. Furthermore, 49,280 simple sequence repeats were identified in the unigenes (>1 kilobase sequences). This is the first report on the identification of transcriptomic and microsatellite resources for C. retropictus, which opens up the possibility of exploring traits related to the adaptation and acclimatization of this species. PMID:27455329

  7. Transcriptome of the Australian Mollusc Dicathais orbita Provides Insights into the Biosynthesis of Indoles and Choline Esters

    PubMed Central

    Baten, Abdul; Ngangbam, Ajit Kumar; Waters, Daniel L. E.; Benkendorff, Kirsten

    2016-01-01

    Dicathais orbita is a mollusc of the Muricidae family and is well known for the production of the expensive dye Tyrian purple and its brominated precursors that have anticancer properties, in addition to choline esters with muscle-relaxing properties. However, the biosynthetic pathways that produce these secondary metabolites in D. orbita are not known. Illumina HiSeq 2000 transcriptome sequencing of hypobranchial glands, prostate glands, albumen glands, capsule glands, and mantle and foot tissues of D. orbita generated over 201 million high quality reads that were de novo assembled into 219,437 contigs. Annotation with reference to the Nr, Swiss-Prot and Kyoto Encyclopedia of Genes and Genomes (KEGG) databases identified candidate-coding regions in 76,152 of these contigs, with transcripts for many enzymes in various metabolic pathways associated with secondary metabolite biosynthesis represented. This study revealed that D. orbita expresses a number of genes associated with indole, sulfur and histidine metabolism pathways that are relevant to Tyrian purple precursor biosynthesis, and many of which were not found in the fully annotated genomes of three other molluscs in the KEGG database. However, there were no matches to known bromoperoxidase enzymes within the D. orbita transcripts. These transcriptome data provide a significant molecular resource for gastropod research in general and Tyrian purple producing Muricidae in particular. PMID:27447649

  8. Leaf transcriptome of two highly divergent genotypes of Urochloa humidicola (Poaceae), a tropical polyploid forage grass adapted to acidic soils and temporary flooding areas.

    PubMed

    Vigna, Bianca Baccili Zanotto; de Oliveira, Fernanda Ancelmo; de Toledo-Silva, Guilherme; da Silva, Carla Cristina; do Valle, Cacilda Borges; de Souza, Anete Pereira

    2016-11-11

    Urochloa humidicola (Koronivia grass) is a polyploid (6x to 9x) species that is used as forage in the tropics. Facultative apospory apomixis is present in most of the genotypes of this species, although one individual has been described as sexual. Molecular studies have been restricted to molecular marker approaches for genetic diversity estimations and linkage map construction. The objectives of the present study were to describe and compare the leaf transcriptome of two important genotypes that are highly divergent in terms of their phenotypes and reproduction modes: the sexual BH031 and the aposporous apomictic cultivar BRS Tupi. We sequenced the leaf transcriptome of Koronivia grass using an Illumina GAIIx system, which produced 13.09 Gb of data that consisted of 163,575,526 paired-end reads between the two libraries. We de novo-assembled 76,196 transcripts with an average length of 1,152 bp and filtered 35,093 non-redundant unigenes. A similarity search against the non-redundant National Center of Biotechnology Information (NCBI) protein database returned 65 % hits. We annotated 24,133 unigenes in the Phytozome database and 14,082 unigenes in the UniProtKB/Swiss-Prot database, assigned 108,334 gene ontology terms to 17,255 unigenes and identified 5,324 unigenes in 327 known metabolic pathways. Comparisons with other grasses via a reciprocal BLAST search revealed a larger number of orthologous genes for the Panicum species. The unigenes were involved in C4 photosynthesis, lignocellulose biosynthesis and flooding stress responses. A search for functional molecular markers revealed 4,489 microsatellites and 560,298 single nucleotide polymorphisms (SNPs). A quantitative real-time PCR analysis validated the RNA-seq expression analysis and allowed for the identification of transcriptomic differences between the two evaluated genotypes. Moreover, 192 unannotated sequences were classified as containing complete open reading frames, suggesting that the new, potentially exclusive genes should be further investigated. The present study represents the first whole-transcriptome sequencing of U. humidicola leaves, providing an important public information source of transcripts and functional molecular markers. The qPCR analysis indicated that the expression of certain transcripts confirmed the differential expression observed in silico, which demonstrated that RNA-seq is useful for identifying differentially expressed and unique genes. These results corroborate the findings from previous studies and suggest a hybrid origin for BH031.

  9. Androgen-responsive non-coding small RNAs extend the potential of HCG stimulation to act as a bioassay of androgen sufficiency.

    PubMed

    Rodie, M E; Mudaliar, M A V; Herzyk, P; McMillan, M; Boroujerdi, M; Chudleigh, S; Tobias, E S; Ahmed, S F

    2017-10-01

    It is unclear whether a short-term change in circulating androgens is associated with changes in the transcriptome of the peripheral blood mononuclear cells (PBMC). To explore the effect of hCG stimulation on the PBMC transcriptome, 12 boys with a median age (range) of 0.7 years (0.3, 11.2) who received intramuscular hCG 1500u on 3 consecutive days as part of their investigations underwent transcriptomic array analysis on RNA extracted from peripheral blood mononuclear cells before and after hCG stimulation. Median pre- and post-hCG testosterone for the overall group was 0.7 nmol/L (<0.5, 6) and 7.9 nmol/L (<0.5, 31.5), respectively. Of the 12 boys, 3 (25%) did not respond to hCG stimulation with a pre and post median serum testosterone of <0.5 nmol/L and <0.5 nmol/L, respectively. When corrected for gene expression changes in the non-responders to exclude hCG effects, all 9 of the hCG responders consistently demonstrated a 20% or greater increase in the expression of piR-37153 and piR-39248 , non-coding PIWI-interacting RNAs (piRNAs). In addition, of the 9 responders, 8, 6 and 4 demonstrated a 30, 40 and 50% rise, respectively, in a total of 2 further piRNAs. In addition, 3 of the responders showed a 50% or greater rise in the expression of another small RNA, SNORD5 . On comparing fold-change in serum testosterone with fold-change in the above transcripts, a positive correlation was detected for SNORD5 ( P  = 0.01). The identification of a dynamic and androgen-responsive PBMC transcriptome extends the potential value of the hCG test for the assessment of androgen sufficiency. © 2017 The authors.

  10. Gene networks and toxicity pathways induced by acute cadmium exposure in adult largemouth bass (Micropterus salmoides).

    PubMed

    Mehinto, Alvine C; Prucha, Melinda S; Colli-Dula, Reyna C; Kroll, Kevin J; Lavelle, Candice M; Barber, David S; Vulpe, Christopher D; Denslow, Nancy D

    2014-07-01

    Cadmium is a heavy metal that can accumulate to toxic levels in the environment leading to detrimental effects in animals and humans including kidney, liver and lung injuries. Using a transcriptomics approach, genes and cellular pathways affected by a low dose of cadmium were investigated. Adult largemouth bass were intraperitoneally injected with 20μg/kg of cadmium chloride (mean exposure level - 2.6μg of cadmium per fish) and microarray analyses were conducted in the liver and testis 48h after injection. Transcriptomic profiles identified in response to cadmium exposure were tissue-specific with the most differential expression changes found in the liver tissues, which also contained much higher levels of cadmium than the testis. Acute exposure to a low dose of cadmium induced oxidative stress response and oxidative damage pathways in the liver. The mRNA levels of antioxidants such as catalase increased and numerous transcripts related to DNA damage and DNA repair were significantly altered. Hepatic mRNA levels of metallothionein, a molecular marker of metal exposure, did not increase significantly after 48h exposure. Carbohydrate metabolic pathways were also disrupted with hepatic transcripts such as UDP-glucose, pyrophosphorylase 2, and sorbitol dehydrogenase highly induced. Both tissues exhibited a disruption of steroid signaling pathways. In the testis, estrogen receptor beta and transcripts linked to cholesterol metabolism were suppressed. On the contrary, genes involved in cholesterol metabolism were highly increased in the liver including genes encoding for the rate limiting steroidogenic acute regulatory protein and the catalytic enzyme 7-dehydrocholesterol reductase. Integration of the transcriptomic data using functional enrichment analyses revealed a number of enriched gene networks associated with previously reported adverse outcomes of cadmium exposure such as liver toxicity and impaired reproduction. Copyright © 2014 Elsevier B.V. All rights reserved.

  11. Transcriptomic analysis of the venom glands from the scorpion Hadogenes troglodytes revealed unique and extremely high diversity of the venom peptides.

    PubMed

    Zhong, Jie; Zeng, Xian-Chun; Zeng, Xin; Nie, Yao; Zhang, Lei; Wu, Shifen; Bao, Aorigele

    2017-01-06

    Hadogenes is a genus of large African scorpions with 18 described species. However, little is known about the venom peptide composition of any species from Hadogenes so far. Here, we fully explored the composition of venom gland peptides from Hadogenes troglodytes using transcriptomic approach. We discovered 121 novel peptides from the scorpion, including 20 new-type peptides cross-linked with one, two, three, four or seven disulfide bridges, respectively, 11 novel K + -channel toxin-like peptides, 2 novel ryanodine receptors-specific toxin-like peptides, a unique peptide containing the cysteine knots of spider toxins, 15 novel La1-like toxins, 3 novel TIL domain-containing peptides, 5 novel peptides with atypical cysteine patterns, 19 novel antimicrobial peptides, 6 novel cysteine-free peptides and 39 new-type cysteine-free peptides. Among them, the new-type peptides are largely dominant; this highlights the unique diversity of the venom gland peptides from H. troglodytes. Some of the new peptides would serve as new molecular probes for the investigations of cellular ion channels and other receptors, or offer new templates for the development of therapeutic drugs for the treatment of ion channel-associated diseases, and infections caused by antibiotics-resistant pathogens. In this study, we fully explored the composition of venom gland peptides from the scorpion Hadogenes troglodytes using transcriptomic approach. We discovered a total of 121 novel peptides from the venom glands of the scorpion, of which new-type peptides are largely dominant. These data highlight the unique diversity of the venom gland peptides from the scorpion H. troglodytes, gain insights into new mechanisms for the scorpion to subdue its prey and predators, and enlarge the protein database of scorpion venom glands. The discovery of a lot of novel peptides provides new templates for the development of therapeutic drugs, and offers new molecular materials for the basic researches of various cellular receptors, and for the evolutionary investigations of scorpion toxins. Copyright © 2016 Elsevier B.V. All rights reserved.

  12. The quest to make fully functional human pancreatic beta cells from embryonic stem cells: climbing a mountain in the clouds.

    PubMed

    Johnson, James D

    2016-10-01

    The production of fully functional insulin-secreting cells to treat diabetes is a major goal of regenerative medicine. In this article, I review progress towards this goal over the last 15 years from the perspective of a beta cell biologist. I describe the current state-of-the-art, and speculate on the general approaches that will be required to identify and achieve our ultimate goal of producing functional beta cells. The need for deeper phenotyping of heterogeneous cultures of stem cell derived islet-like cells in parallel with a better understanding of the heterogeneity of the target cell type(s) is emphasised. This deep phenotyping should include high-throughput single-cell analysis, as well as comprehensive 'omics technologies to provide unbiased characterisation of cell products and human beta cells. There are justified calls for more detailed and well-powered studies of primary human pancreatic beta cell physiology, and I propose online databases of standardised human beta cell responses to physiological stimuli, including both functional and metabolomic/proteomic/transcriptomic profiles. With a concerted, community-wide effort, including both basic and applied scientists, beta cell replacement will become a clinical reality for patients with diabetes.

  13. dbWFA: a web-based database for functional annotation of Triticum aestivum transcripts

    PubMed Central

    Vincent, Jonathan; Dai, Zhanwu; Ravel, Catherine; Choulet, Frédéric; Mouzeyar, Said; Bouzidi, M. Fouad; Agier, Marie; Martre, Pierre

    2013-01-01

    The functional annotation of genes based on sequence homology with genes from model species genomes is time-consuming because it is necessary to mine several unrelated databases. The aim of the present work was to develop a functional annotation database for common wheat Triticum aestivum (L.). The database, named dbWFA, is based on the reference NCBI UniGene set, an expressed gene catalogue built by expressed sequence tag clustering, and on full-length coding sequences retrieved from the TriFLDB database. Information from good-quality heterogeneous sources, including annotations for model plant species Arabidopsis thaliana (L.) Heynh. and Oryza sativa L., was gathered and linked to T. aestivum sequences through BLAST-based homology searches. Even though the complexity of the transcriptome cannot yet be fully appreciated, we developed a tool to easily and promptly obtain information from multiple functional annotation systems (Gene Ontology, MapMan bin codes, MIPS Functional Categories, PlantCyc pathway reactions and TAIR gene families). The use of dbWFA is illustrated here with several query examples. We were able to assign a putative function to 45% of the UniGenes and 81% of the full-length coding sequences from TriFLDB. Moreover, comparison of the annotation of the whole T. aestivum UniGene set along with curated annotations of the two model species assessed the accuracy of the annotation provided by dbWFA. To further illustrate the use of dbWFA, genes specifically expressed during the early cell division or late storage polymer accumulation phases of T. aestivum grain development were identified using a clustering analysis and then annotated using dbWFA. The annotation of these two sets of genes was consistent with previous analyses of T. aestivum grain transcriptomes and proteomes. Database URL: urgi.versailles.inra.fr/dbWFA/ PMID:23660284

  14. Mass fingerprinting of the venom and transcriptome of venom gland of scorpion Centruroides tecomanus.

    PubMed

    Valdez-Velázquez, Laura L; Quintero-Hernández, Verónica; Romero-Gutiérrez, Maria Teresa; Coronas, Fredy I V; Possani, Lourival D

    2013-01-01

    Centruroides tecomanus is a Mexican scorpion endemic of the State of Colima, that causes human fatalities. This communication describes a proteome analysis obtained from milked venom and a transcriptome analysis from a cDNA library constructed from two pairs of venom glands of this scorpion. High perfomance liquid chromatography separation of soluble venom produced 80 fractions, from which at least 104 individual components were identified by mass spectrometry analysis, showing to contain molecular masses from 259 to 44,392 Da. Most of these components are within the expected molecular masses for Na(+)- and K(+)-channel specific toxic peptides, supporting the clinical findings of intoxication, when humans are stung by this scorpion. From the cDNA library 162 clones were randomly chosen, from which 130 sequences of good quality were identified and were clustered in 28 contigs containing, each, two or more expressed sequence tags (EST) and 49 singlets with only one EST. Deduced amino acid sequence analysis from 53% of the total ESTs showed that 81% (24 sequences) are similar to known toxic peptides that affect Na(+)-channel activity, and 19% (7 unique sequences) are similar to K(+)-channel especific toxins. Out of the 31 sequences, at least 8 peptides were confirmed by direct Edman degradation, using components isolated directly from the venom. The remaining 19%, 4%, 4%, 15% and 5% of the ESTs correspond respectively to proteins involved in cellular processes, antimicrobial peptides, venom components, proteins without defined function and sequences without similarity in databases. Among the cloned genes are those similar to metalloproteinases.

  15. Transcriptomic analysis of grain amaranth (Amaranthus hypochondriacus) using 454 pyrosequencing: comparison with A. tuberculatus, expression profiling in stems and in response to biotic and abiotic stress

    PubMed Central

    2011-01-01

    Background Amaranthus hypochondriacus, a grain amaranth, is a C4 plant noted by its ability to tolerate stressful conditions and produce highly nutritious seeds. These possess an optimal amino acid balance and constitute a rich source of health-promoting peptides. Although several recent studies, mostly involving subtractive hybridization strategies, have contributed to increase the relatively low number of grain amaranth expressed sequence tags (ESTs), transcriptomic information of this species remains limited, particularly regarding tissue-specific and biotic stress-related genes. Thus, a large scale transcriptome analysis was performed to generate stem- and (a)biotic stress-responsive gene expression profiles in grain amaranth. Results A total of 2,700,168 raw reads were obtained from six 454 pyrosequencing runs, which were assembled into 21,207 high quality sequences (20,408 isotigs + 799 contigs). The average sequence length was 1,064 bp and 930 bp for isotigs and contigs, respectively. Only 5,113 singletons were recovered after quality control. Contigs/isotigs were further incorporated into 15,667 isogroups. All unique sequences were queried against the nr, TAIR, UniRef100, UniRef50 and Amaranthaceae EST databases for annotation. Functional GO annotation was performed with all contigs/isotigs that produced significant hits with the TAIR database. Only 8,260 sequences were found to be homologous when the transcriptomes of A. tuberculatus and A. hypochondriacus were compared, most of which were associated with basic house-keeping processes. Digital expression analysis identified 1,971 differentially expressed genes in response to at least one of four stress treatments tested. These included several multiple-stress-inducible genes that could represent potential candidates for use in the engineering of stress-resistant plants. The transcriptomic data generated from pigmented stems shared similarity with findings reported in developing stems of Arabidopsis and black cottonwood (Populus trichocarpa). Conclusions This study represents the first large-scale transcriptomic analysis of A. hypochondriacus, considered to be a highly nutritious and stress-tolerant crop. Numerous genes were found to be induced in response to (a)biotic stress, many of which could further the understanding of the mechanisms that contribute to multiple stress-resistance in plants, a trait that has potential biotechnological applications in agriculture. PMID:21752295

  16. De novo characterization of the Chinese fir (Cunninghamia lanceolata) transcriptome and analysis of candidate genes involved in cellulose and lignin biosynthesis

    PubMed Central

    2012-01-01

    Background Chinese fir (Cunninghamia lanceolata) is an important timber species that accounts for 20–30% of the total commercial timber production in China. However, the available genomic information of Chinese fir is limited, and this severely encumbers functional genomic analysis and molecular breeding in Chinese fir. Recently, major advances in transcriptome sequencing have provided fast and cost-effective approaches to generate large expression datasets that have proven to be powerful tools to profile the transcriptomes of non-model organisms with undetermined genomes. Results In this study, the transcriptomes of nine tissues from Chinese fir were analyzed using the Illumina HiSeq™ 2000 sequencing platform. Approximately 40 million paired-end reads were obtained, generating 3.62 gigabase pairs of sequencing data. These reads were assembled into 83,248 unique sequences (i.e. Unigenes) with an average length of 449 bp, amounting to 37.40 Mb. A total of 73,779 Unigenes were supported by more than 5 reads, 42,663 (57.83%) had homologs in the NCBI non-redundant and Swiss-Prot protein databases, corresponding to 27,224 unique protein entries. Of these Unigenes, 16,750 were assigned to Gene Ontology classes, and 14,877 were clustered into orthologous groups. A total of 21,689 (29.40%) were mapped to 119 pathways by BLAST comparison against the Kyoto Encyclopedia of Genes and Genomes (KEGG) database. The majority of the genes encoding the enzymes in the biosynthetic pathways of cellulose and lignin were identified in the Unigene dataset by targeted searches of their annotations. And a number of candidate Chinese fir genes in the two metabolic pathways were discovered firstly. Eighteen genes related to cellulose and lignin biosynthesis were cloned for experimental validating of transcriptome data. Overall 49 Unigenes, covering different regions of these selected genes, were found by alignment. Their expression patterns in different tissues were analyzed by qRT-PCR to explore their putative functions. Conclusions A substantial fraction of transcript sequences was obtained from the deep sequencing of Chinese fir. The assembled Unigene dataset was used to discover candidate genes of cellulose and lignin biosynthesis. This transcriptome dataset will provide a comprehensive sequence resource for molecular genetics research of C. lanceolata. PMID:23171398

  17. De novo transcriptome assembly and differential gene expression analysis of the calanoid copepod Acartia tonsa exposed to nickel nanoparticles.

    PubMed

    Zhou, Chao; Carotenuto, Ylenia; Vitiello, Valentina; Wu, Changwen; Zhang, Jianshe; Buttino, Isabella

    2018-06-14

    The calanoid copepod Acartia tonsa is a reference species in standardized ecotoxicology bioassay. Despite this interest, there is a lack of knowledge on molecular responses of A. tonsa to contaminants. We generated a de novo assembled transcriptome of A. tonsa exposed 4 days to 8.5 and 17 mg/L nickel nanoparticles (NiNPs), which have been shown to reduce egg hatching success and larval survival but had no effects on the adults. Aims of our study were to 1) improve the knowledge on the molecular responses of A. tonsa copepod and 2) increase the genomic resources of this copepod for further identification of potential biomarkers of NP exposure. The de novo assembled transcriptome of A. tonsa consisted of 53,619 unigenes, which were further annotated to nr, GO, KOG and KEGG databases. In particular, most unigenes were assigned to Metabolic and Cellular processes (34-45%) GO terms, and to Human disease (28%) and Organismal systems (23%) KEGG categories. Comparison among treatments showed that 373 unigenes were differentially expressed in A. tonsa exposed to NiNPs at 8.5 and 17 mg/L, with respect to control. Most of these genes were downregulated and took part in ribosome biogenesis, translation and protein turnover, thus suggesting that NiNPs could affect the copepod ribosome synthesis machinery and functioning. Overall, our study highlights the potential of toxicogenomic approach in gaining more mechanistic and functional information about the mode of action of emerging compounds on marine organisms, for biomarker discovering in crustaceans. Copyright © 2018 Elsevier Ltd. All rights reserved.

  18. Developing the anemone Aiptasia as a tractable model for cnidarian-dinoflagellate symbiosis: the transcriptome of aposymbiotic A. pallida.

    PubMed

    Lehnert, Erik M; Burriesci, Matthew S; Pringle, John R

    2012-06-22

    Coral reefs are hotspots of oceanic biodiversity, forming the foundation of ecosystems that are important both ecologically and for their direct practical impacts on humans. Corals are declining globally due to a number of stressors, including rising sea-surface temperatures and pollution; such stresses can lead to a breakdown of the essential symbiotic relationship between the coral host and its endosymbiotic dinoflagellates, a process known as coral bleaching. Although the environmental stresses causing this breakdown are largely known, the cellular mechanisms of symbiosis establishment, maintenance, and breakdown are still largely obscure. Investigating the symbiosis using an experimentally tractable model organism, such as the small sea anemone Aiptasia, should improve our understanding of exactly how the environmental stressors affect coral survival and growth. We assembled the transcriptome of a clonal population of adult, aposymbiotic (dinoflagellate-free) Aiptasia pallida from ~208 million reads, yielding 58,018 contigs. We demonstrated that many of these contigs represent full-length or near-full-length transcripts that encode proteins similar to those from a diverse array of pathways in other organisms, including various metabolic enzymes, cytoskeletal proteins, and neuropeptide precursors. The contigs were annotated by sequence similarity, assigned GO terms, and scanned for conserved protein domains. We analyzed the frequency and types of single-nucleotide variants and estimated the size of the Aiptasia genome to be ~421 Mb. The contigs and annotations are available through NCBI (Transcription Shotgun Assembly database, accession numbers JV077153-JV134524) and at http://pringlelab.stanford.edu/projects.html. The availability of an extensive transcriptome assembly for A. pallida will facilitate analyses of gene-expression changes, identification of proteins of interest, and other studies in this important emerging model system.

  19. A chromosome-centric human proteome project (C-HPP) to characterize the sets of proteins encoded in chromosome 17.

    PubMed

    Liu, Suli; Im, Hogune; Bairoch, Amos; Cristofanilli, Massimo; Chen, Rui; Deutsch, Eric W; Dalton, Stephen; Fenyo, David; Fanayan, Susan; Gates, Chris; Gaudet, Pascale; Hincapie, Marina; Hanash, Samir; Kim, Hoguen; Jeong, Seul-Ki; Lundberg, Emma; Mias, George; Menon, Rajasree; Mu, Zhaomei; Nice, Edouard; Paik, Young-Ki; Uhlen, Mathias; Wells, Lance; Wu, Shiaw-Lin; Yan, Fangfei; Zhang, Fan; Zhang, Yue; Snyder, Michael; Omenn, Gilbert S; Beavis, Ronald C; Hancock, William S

    2013-01-04

    We report progress assembling the parts list for chromosome 17 and illustrate the various processes that we have developed to integrate available data from diverse genomic and proteomic knowledge bases. As primary resources, we have used GPMDB, neXtProt, PeptideAtlas, Human Protein Atlas (HPA), and GeneCards. All sites share the common resource of Ensembl for the genome modeling information. We have defined the chromosome 17 parts list with the following information: 1169 protein-coding genes, the numbers of proteins confidently identified by various experimental approaches as documented in GPMDB, neXtProt, PeptideAtlas, and HPA, examples of typical data sets obtained by RNASeq and proteomic studies of epithelial derived tumor cell lines (disease proteome) and a normal proteome (peripheral mononuclear cells), reported evidence of post-translational modifications, and examples of alternative splice variants (ASVs). We have constructed a list of the 59 "missing" proteins as well as 201 proteins that have inconclusive mass spectrometric (MS) identifications. In this report we have defined a process to establish a baseline for the incorporation of new evidence on protein identification and characterization as well as related information from transcriptome analyses. This initial list of "missing" proteins that will guide the selection of appropriate samples for discovery studies as well as antibody reagents. Also we have illustrated the significant diversity of protein variants (including post-translational modifications, PTMs) using regions on chromosome 17 that contain important oncogenes. We emphasize the need for mandated deposition of proteomics data in public databases, the further development of improved PTM, ASV, and single nucleotide variant (SNV) databases, and the construction of Web sites that can integrate and regularly update such information. In addition, we describe the distribution of both clustered and scattered sets of protein families on the chromosome. Since chromosome 17 is rich in cancer-associated genes, we have focused the clustering of cancer-associated genes in such genomic regions and have used the ERBB2 amplicon as an example of the value of a proteogenomic approach in which one integrates transcriptomic with proteomic information and captures evidence of coexpression through coordinated regulation.

  20. OMICS-strategies and methods in the fight against doping.

    PubMed

    Reichel, Christian

    2011-12-10

    During the past decade OMICS-methods not only continued to have their impact on research strategies in life sciences and in particular molecular biology, but also started to be used for anti-doping control purposes. Research activities were mainly reasoned by the fact that several substances and methods, which were prohibited by the World Anti-Doping Agency (WADA), were or still are difficult to detect by direct methods. Transcriptomics, proteomics, and metabolomics in theory offer ideal platforms for the discovery of biomarkers for the indirect detection of the abuse of these substances and methods. Traditionally, the main focus of transcriptomics and proteomics projects has been on the prolonged detection of the misuse of human growth hormone (hGH), recombinant erythropoietin (rhEpo), and autologous blood transfusion. An additional benefit of the indirect or marker approach would also be that similarly acting substances might then be detected by a single method, without being forced to develop new direct detection methods for new but comparable prohibited substances (as has been the case, e.g. for the various forms of Epo analogs and biosimilars). While several non-OMICS-derived parameters for the indirect detection of doping are currently in use, for example the blood parameters of the hematological module of the athlete's biological passport, the outcome of most non-targeted OMICS-projects led to no direct application in routine doping control so far. The main reason is the inherent complexity of human transcriptomes, proteomes, and metabolomes and their inter-individual variability. The article reviews previous and recent research projects and their results and discusses future strategies for a more efficient application of OMICS-methods in doping control. Copyright © 2011 Elsevier Ireland Ltd. All rights reserved.

  1. Integrated Left Ventricular Global Transcriptome and Proteome Profiling in Human End-Stage Dilated Cardiomyopathy.

    PubMed

    Colak, Dilek; Alaiya, Ayodele A; Kaya, Namik; Muiya, Nzioka P; AlHarazi, Olfat; Shinwari, Zakia; Andres, Editha; Dzimiri, Nduna

    2016-01-01

    The disease pathways leading to idiopathic dilated cardiomyopathy (DCM) are still elusive. The present study investigated integrated global transcriptional and translational changes in human DCM for disease biomarker discovery. We used identical myocardial tissues from five DCM hearts compared to five non-failing (NF) donor hearts for both transcriptome profiling using the ABI high-density oligonucleotide microarrays and proteome expression with One-Dimensional Nano Acquity liquid chromatography coupled with tandem mass spectrometry on the Synapt G2 system. We identified 1262 differentially expressed genes (DEGs) and 269 proteins (DEPs) between DCM cases and healthy controls. Among the most significantly upregulated (>5-fold) proteins were GRK5, APOA2, IGHG3, ANXA6, HSP90AA1, and ATP5C1 (p< 0.01). On the other hand, the most significantly downregulated proteins were GSTM5, COX17, CAV1 and ANXA3. At least ten entities were concomitantly upregulated on the two analysis platforms: GOT1, ALDH4A1, PDHB, BDH1, SLC2A11, HSP90AA1, HSP90AB1, H2AFV, HSPA5 and NDUFV1. Gene ontology analyses of DEGs and DEPs revealed significant overlap with enrichment of genes/proteins related to metabolic process, biosynthetic process, cellular component organization, oxidative phosphorylation, alterations in glycolysis and ATP synthesis, Alzheimer's disease, chemokine-mediated inflammation and cytokine signalling pathways. The concomitant use of transcriptome and proteome expression to evaluate global changes in DCM has led to the identification of sixteen commonly altered entities as well as novel genes, proteins and pathways whose cardiac functions have yet to be deciphered. This data should contribute towards better management of the disease.

  2. Digital transcriptome profiling using selective hexamer priming for cDNA synthesis.

    PubMed

    Armour, Christopher D; Castle, John C; Chen, Ronghua; Babak, Tomas; Loerch, Patrick; Jackson, Stuart; Shah, Jyoti K; Dey, John; Rohl, Carol A; Johnson, Jason M; Raymond, Christopher K

    2009-09-01

    We developed a procedure for the preparation of whole transcriptome cDNA libraries depleted of ribosomal RNA from only 1 microg of total RNA. The method relies on a collection of short, computationally selected oligonucleotides, called 'not-so-random' (NSR) primers, to obtain full-length, strand-specific representation of nonribosomal RNA transcripts. In this study we validated the technique by profiling human whole brain and universal human reference RNA using ultra-high-throughput sequencing.

  3. Benzo[g,h,i]perylene

    Integrated Risk Information System (IRIS)

    Benzo [ g , h , i ] perylene ; CASRN 191 - 24 - 2 Human health assessment information on a chemical substance is included in the IRIS database only after a comprehensive review of toxicity data , as outlined in the IRIS assessment development process . Sections I ( Health Hazard Assessments for Nonc

  4. Identification and Single-Cell Functional Characterization of an Endodermally Biased Pluripotent Substate in Human Embryonic Stem Cells.

    PubMed

    Allison, Thomas F; Smith, Andrew J H; Anastassiadis, Konstantinos; Sloane-Stanley, Jackie; Biga, Veronica; Stavish, Dylan; Hackland, James; Sabri, Shan; Langerman, Justin; Jones, Mark; Plath, Kathrin; Coca, Daniel; Barbaric, Ivana; Gokhale, Paul; Andrews, Peter W

    2018-05-09

    Human embryonic stem cells (hESCs) display substantial heterogeneity in gene expression, implying the existence of discrete substates within the stem cell compartment. To determine whether these substates impact fate decisions of hESCs we used a GFP reporter line to investigate the properties of fractions of putative undifferentiated cells defined by their differential expression of the endoderm transcription factor, GATA6, together with the hESC surface marker, SSEA3. By single-cell cloning, we confirmed that substates characterized by expression of GATA6 and SSEA3 include pluripotent stem cells capable of long-term self-renewal. When clonal stem cell colonies were formed from GATA6-positive and GATA6-negative cells, more of those derived from GATA6-positive cells contained spontaneously differentiated endoderm cells than similar colonies derived from the GATA6-negative cells. We characterized these discrete cellular states using single-cell transcriptomic analysis, identifying a potential role for SOX17 in the establishment of the endoderm-biased stem cell state. Copyright © 2018 The Authors. Published by Elsevier Inc. All rights reserved.

  5. Reptilian-transcriptome v1.0, a glimpse in the brain transcriptome of five divergent Sauropsida lineages and the phylogenetic position of turtles

    PubMed Central

    2011-01-01

    Background Reptiles are largely under-represented in comparative genomics despite the fact that they are substantially more diverse in many respects than mammals. Given the high divergence of reptiles from classical model species, next-generation sequencing of their transcriptomes is an approach of choice for gene identification and annotation. Results Here, we use 454 technology to sequence the brain transcriptome of four divergent reptilian and one reference avian species: the Nile crocodile, the corn snake, the bearded dragon, the red-eared turtle, and the chicken. Using an in-house pipeline for recursive similarity searches of >3,000,000 reads against multiple databases from 7 reference vertebrates, we compile a reptilian comparative transcriptomics dataset, with homology assignment for 20,000 to 31,000 transcripts per species and a cumulated non-redundant sequence length of 248.6 Mbases. Our approach identifies the majority (87%) of chicken brain transcripts and about 50% of de novo assembled reptilian transcripts. In addition to 57,502 microsatellite loci, we identify thousands of SNP and indel polymorphisms for population genetic and linkage analyses. We also build very large multiple alignments for Sauropsida and mammals (two million residues per species) and perform extensive phylogenetic analyses suggesting that turtles are not basal living reptiles but are rather associated with Archosaurians, hence, potentially answering a long-standing question in the phylogeny of Amniotes. Conclusions The reptilian transcriptome (freely available at http://www.reptilian-transcriptomes.org) should prove a useful new resource as reptiles are becoming important new models for comparative genomics, ecology, and evolutionary developmental genetics. PMID:21943375

  6. Cell host response to infection with novel human coronavirus EMC predicts potential antivirals and important differences with SARS coronavirus.

    PubMed

    Josset, Laurence; Menachery, Vineet D; Gralinski, Lisa E; Agnihothram, Sudhakar; Sova, Pavel; Carter, Victoria S; Yount, Boyd L; Graham, Rachel L; Baric, Ralph S; Katze, Michael G

    2013-04-30

    A novel human coronavirus (HCoV-EMC) was recently identified in the Middle East as the causative agent of a severe acute respiratory syndrome (SARS) resembling the illness caused by SARS coronavirus (SARS-CoV). Although derived from the CoV family, the two viruses are genetically distinct and do not use the same receptor. Here, we investigated whether HCoV-EMC and SARS-CoV induce similar or distinct host responses after infection of a human lung epithelial cell line. HCoV-EMC was able to replicate as efficiently as SARS-CoV in Calu-3 cells and similarly induced minimal transcriptomic changes before 12 h postinfection. Later in infection, HCoV-EMC induced a massive dysregulation of the host transcriptome, to a much greater extent than SARS-CoV. Both viruses induced a similar activation of pattern recognition receptors and the interleukin 17 (IL-17) pathway, but HCoV-EMC specifically down-regulated the expression of several genes within the antigen presentation pathway, including both type I and II major histocompatibility complex (MHC) genes. This could have an important impact on the ability of the host to mount an adaptive host response. A unique set of 207 genes was dysregulated early and permanently throughout infection with HCoV-EMC, and was used in a computational screen to predict potential antiviral compounds, including kinase inhibitors and glucocorticoids. Overall, HCoV-EMC and SARS-CoV elicit distinct host gene expression responses, which might impact in vivo pathogenesis and could orient therapeutic strategies against that emergent virus. Identification of a novel coronavirus causing fatal respiratory infection in humans raises concerns about a possible widespread outbreak of severe respiratory infection similar to the one caused by SARS-CoV. Using a human lung epithelial cell line and global transcriptomic profiling, we identified differences in the host response between HCoV-EMC and SARS-CoV. This enables rapid assessment of viral properties and the ability to anticipate possible differences in human clinical responses to HCoV-EMC and SARS-CoV. We used this information to predict potential effective drugs against HCoV-EMC, a method that could be more generally used to identify candidate therapeutics in future disease outbreaks. These data will help to generate hypotheses and make rapid advancements in characterizing this new virus.

  7. Comparative genomics and transcriptomics of Escherichia coli isolates carrying virulence factors of both enteropathogenic and enterotoxigenic E. coli.

    PubMed

    Hazen, Tracy H; Michalski, Jane; Luo, Qingwei; Shetty, Amol C; Daugherty, Sean C; Fleckenstein, James M; Rasko, David A

    2017-06-14

    Escherichia coli that are capable of causing human disease are often classified into pathogenic variants (pathovars) based on their virulence gene content. However, disease-associated hybrid E. coli, containing unique combinations of multiple canonical virulence factors have also been described. Such was the case of the E. coli O104:H4 outbreak in 2011, which caused significant morbidity and mortality. Among the pathovars of diarrheagenic E. coli that cause significant human disease are the enteropathogenic E. coli (EPEC) and enterotoxigenic E. coli (ETEC). In the current study we use comparative genomics, transcriptomics, and functional studies to characterize isolates that contain virulence factors of both EPEC and ETEC. Based on phylogenomic analysis, these hybrid isolates are more genomically-related to EPEC, but appear to have acquired ETEC virulence genes. Global transcriptional analysis using RNA sequencing, demonstrated that the EPEC and ETEC virulence genes of these hybrid isolates were differentially-expressed under virulence-inducing laboratory conditions, similar to reference isolates. Immunoblot assays further verified that the virulence gene products were produced and that the T3SS effector EspB of EPEC, and heat-labile toxin of ETEC were secreted. These findings document the existence and virulence potential of an E. coli pathovar hybrid that blurs the distinction between E. coli pathovars.

  8. The low-abundance transcriptome reveals novel biomarkers, specific intracellular pathways and targetable genes associated with advanced gastric cancer.

    PubMed

    Bizama, Carolina; Benavente, Felipe; Salvatierra, Edgardo; Gutiérrez-Moraga, Ana; Espinoza, Jaime A; Fernández, Elmer A; Roa, Iván; Mazzolini, Guillermo; Sagredo, Eduardo A; Gidekel, Manuel; Podhajcer, Osvaldo L

    2014-02-15

    Studies on the low-abundance transcriptome are of paramount importance for identifying the intimate mechanisms of tumor progression that can lead to novel therapies. The aim of the present study was to identify novel markers and targetable genes and pathways in advanced human gastric cancer through analyses of the low-abundance transcriptome. The procedure involved an initial subtractive hybridization step, followed by global gene expression analysis using microarrays. We observed profound differences, both at the single gene and gene ontology levels, between the low-abundance transcriptome and the whole transcriptome. Analysis of the low-abundance transcriptome led to the identification and validation by tissue microarrays of novel biomarkers, such as LAMA3 and TTN; moreover, we identified cancer type-specific intracellular pathways and targetable genes, such as IRS2, IL17, IFNγ, VEGF-C, WISP1, FZD5 and CTBP1 that were not detectable by whole transcriptome analyses. We also demonstrated that knocking down the expression of CTBP1 sensitized gastric cancer cells to mainstay chemotherapeutic drugs. We conclude that the analysis of the low-abundance transcriptome provides useful insights into the molecular basis and treatment of cancer. © 2013 UICC.

  9. A combined strategy involving Sanger and 454 pyrosequencing increases genomic resources to aid in the management of reproduction, disease control and genetic selection in the turbot (Scophthalmus maximus)

    PubMed Central

    2013-01-01

    Background Genomic resources for plant and animal species that are under exploitation primarily for human consumption are increasingly important, among other things, for understanding physiological processes and for establishing adequate genetic selection programs. Current available techniques for high-throughput sequencing have been implemented in a number of species, including fish, to obtain a proper description of the transcriptome. The objective of this study was to generate a comprehensive transcriptomic database in turbot, a highly priced farmed fish species in Europe, with potential expansion to other areas of the world, for which there are unsolved production bottlenecks, to understand better reproductive- and immune-related functions. This information is essential to implement marker assisted selection programs useful for the turbot industry. Results Expressed sequence tags were generated by Sanger sequencing of cDNA libraries from different immune-related tissues after several parasitic challenges. The resulting database (“Turbot 2 database”) was enlarged with sequences generated from a 454 sequencing run of brain-hypophysis-gonadal axis-derived RNA obtained from turbot at different development stages. The assembly of Sanger and 454 sequences generated 52,427 consensus sequences (“Turbot 3 database”), of which 23,661 were successfully annotated. A total of 1,410 sequences were confirmed to be related to reproduction and key genes involved in sex differentiation and maturation were identified for the first time in turbot (AR, AMH, SRY-related genes, CYP19A, ZPGs, STAR FSHR, etc.). Similarly, 2,241 sequences were related to the immune system and several novel key immune genes were identified (BCL, TRAF, NCK, CD28 and TOLLIP, among others). The number of genes of many relevant reproduction- and immune-related pathways present in the database was 50–90% of the total gene count of each pathway. In addition, 1,237 microsatellites and 7,362 single nucleotide polymorphisms (SNPs) were also compiled. Further, 2,976 putative natural antisense transcripts (NATs) including microRNAs were also identified. Conclusions The combined sequencing strategies employed here significantly increased the turbot genomic resources available, including 34,400 novel sequences. The generated database contains a larger number of genes relevant for reproduction- and immune-associated studies, with an excellent coverage of most genes present in many relevant physiological pathways. This database also allowed the identification of many microsatellites and SNP markers that will be very useful for population and genome screening and a valuable aid in marker assisted selection programs. PMID:23497389

  10. OpenFluDB, a database for human and animal influenza virus

    PubMed Central

    Liechti, Robin; Gleizes, Anne; Kuznetsov, Dmitry; Bougueleret, Lydie; Le Mercier, Philippe; Bairoch, Amos; Xenarios, Ioannis

    2010-01-01

    Although research on influenza lasted for more than 100 years, it is still one of the most prominent diseases causing half a million human deaths every year. With the recent observation of new highly pathogenic H5N1 and H7N7 strains, and the appearance of the influenza pandemic caused by the H1N1 swine-like lineage, a collaborative effort to share observations on the evolution of this virus in both animals and humans has been established. The OpenFlu database (OpenFluDB) is a part of this collaborative effort. It contains genomic and protein sequences, as well as epidemiological data from more than 27 000 isolates. The isolate annotations include virus type, host, geographical location and experimentally tested antiviral resistance. Putative enhanced pathogenicity as well as human adaptation propensity are computed from protein sequences. Each virus isolate can be associated with the laboratories that collected, sequenced and submitted it. Several analysis tools including multiple sequence alignment, phylogenetic analysis and sequence similarity maps enable rapid and efficient mining. The contents of OpenFluDB are supplied by direct user submission, as well as by a daily automatic procedure importing data from public repositories. Additionally, a simple mechanism facilitates the export of OpenFluDB records to GenBank. This resource has been successfully used to rapidly and widely distribute the sequences collected during the recent human swine flu outbreak and also as an exchange platform during the vaccine selection procedure. Database URL: http://openflu.vital-it.ch. PMID:20624713

  11. Transcriptomic responses to ocean acidification in larval sea urchins from a naturally variable pH environment.

    PubMed

    Evans, Tyler G; Chan, Francis; Menge, Bruce A; Hofmann, Gretchen E

    2013-03-01

    Some marine ecosystems already experience natural declines in pH approximating those predicted with future anthropogenic ocean acidification (OA), the decline in seawater pH caused by the absorption of atmospheric CO2 . The molecular mechanisms that allow organisms to inhabit these low pH environments, particularly those building calcium carbonate skeletons, are unknown. Also uncertain is whether an enhanced capacity to cope with present day pH variation will confer resistance to future OA. To address these issues, we monitored natural pH dynamics within an intertidal habitat in the Northeast Pacific, demonstrating that upwelling exposes resident species to pH regimes not predicted to occur elsewhere until 2100. Next, we cultured the progeny of adult purple sea urchins (Strongylocentrotus purpuratus) collected from this region in CO2 -acidified seawater representing present day and near future ocean scenarios and monitored gene expression using transcriptomics. We hypothesized that persistent exposure to upwelling during evolutionary history will have selected for increased pH tolerance in this population and that their transcriptomic response to low pH seawater would provide insight into mechanisms underlying pH tolerance in a calcifying species. Resulting expression patterns revealed two important trends. Firstly, S. purpuratus larvae may alter the bioavailability of calcium and adjust skeletogenic pathways to sustain calcification in a low pH ocean. Secondly, larvae use different strategies for coping with different magnitudes of pH stress: initiating a robust transcriptional response to present day pH regimes but a muted response to near future conditions. Thus, an enhanced capacity to cope with present day pH variation may not translate into success in future oceans. © 2013 Blackwell Publishing Ltd.

  12. Identification of Hub Genes and Pathways in Zika Virus Infection Using RNA-Seq Data: A Network-Based Computational Approach.

    PubMed

    Brahma, Rahul; Gurumayum, Sanathoi; Naorem, Leimarembi Devi; Muthaiyan, Mathavan; Gopal, Jeyakodi; Venkatesan, Amouda

    2018-05-01

    Zika virus (ZIKV), a single-strand RNA flavivirus, is transmitted primarily through Aedes aegypti. The recent outbreaks in America and unexpected association between ZIKV infection and birth defects have triggered the global attention. This vouches to understand the molecular mechanisms of ZIKV infection to develop effective drug therapy. A systems-level understanding of biological process affected by ZIKV infection in fetal brain sample led us to identify the candidate genes for pharmaceutical intervention and potential biomarkers for diagnosis. To identify the key genes, transcriptomics data (RNA-Seq) with GSE93385 of ZIKV (Strain: MR766) infected human fetal neural stem cell are analyzed. In total, 1,084 differentially expressed genes (DEGs) are identified, that is, 471 upregulated and 613 downregulated genes. Further analysis such as the gene ontology term suggested that the downregulated genes are mostly enriched in defense response to virus, receptor binding, laminin binding, extracellular matrix, endoplasmic reticulum, and for upregulated DEGs: translation initiation, RNA binding, cytosol, and nucleosome are enriched. And through pathway analysis, systemic lupus erythematosus (SLE) is found to be the most enriched pathway. Protein-protein interaction (PPI) network is constructed to find the hub genes using STRING database. The seven key genes namely cyclin-dependent kinase 1 (CDK1), cyclin B1 (CCNB1), histone cluster 1 H2B family member K, (HIST1H2BK) histone cluster 1 H2B family member O (HIST1H2BO), and histone cluster 1 H2B family member B (HIST1H2BB), polo-like kinase 1 (PLK1), and cell division cycle 20 (CDC20) with highest degree are found to be hub genes using Centiscape, a Cytoscape plugin. The modules of PPI network using Molecular Complex Detection plugin are found significant in structural constituent of ribosome, defense response to virus, nucleosome, SLE, extracellular region, and regulation of gene silencing. Thus, identified key hub genes and pathways shed light on molecular mechanism that may contribute to the discovery of novel therapeutic targets and development of new strategies for the intervention of ZIKV disease.

  13. Optimization Of A High-Throughput Transcriptomic (HTTr) Bioactivity Screen In MCF7 Cells Using Targeted RNA-Seq (SOT)

    EPA Science Inventory

    Recent advances in targeted RNA-Seq technology allow researchers to efficiently and cost-effectively obtain whole transcriptome profiles using picograms of mRNA from human cell lysates. Low mRNA input requirements and sample multiplexing capabilities has made time- and concentrat...

  14. Genomic, transcriptomic and phenomic variation reveals the complex adaptation to stress response of modern maize breeding

    USDA-ARS?s Scientific Manuscript database

    Early maize adaptation to different agricultural environments was an important process associated with the creation of a stable food supply that allowed the evolution of human civilization in the Americas. To explore the mechanisms of maize adaptation, genomic, transcriptomic and phenomic data were ...

  15. Mining functionally relevant gene sets for analyzing physiologically novel clinical expression data.

    PubMed

    Turcan, Sevin; Vetter, Douglas E; Maron, Jill L; Wei, Xintao; Slonim, Donna K

    2011-01-01

    Gene set analyses have become a standard approach for increasing the sensitivity of transcriptomic studies. However, analytical methods incorporating gene sets require the availability of pre-defined gene sets relevant to the underlying physiology being studied. For novel physiological problems, relevant gene sets may be unavailable or existing gene set databases may bias the results towards only the best-studied of the relevant biological processes. We describe a successful attempt to mine novel functional gene sets for translational projects where the underlying physiology is not necessarily well characterized in existing annotation databases. We choose targeted training data from public expression data repositories and define new criteria for selecting biclusters to serve as candidate gene sets. Many of the discovered gene sets show little or no enrichment for informative Gene Ontology terms or other functional annotation. However, we observe that such gene sets show coherent differential expression in new clinical test data sets, even if derived from different species, tissues, and disease states. We demonstrate the efficacy of this method on a human metabolic data set, where we discover novel, uncharacterized gene sets that are diagnostic of diabetes, and on additional data sets related to neuronal processes and human development. Our results suggest that our approach may be an efficient way to generate a collection of gene sets relevant to the analysis of data for novel clinical applications where existing functional annotation is relatively incomplete.

  16. Transcriptome Analysis and Screening for Potential Target Genes for RNAi-Mediated Pest Control of the Beet Armyworm, Spodoptera exigua.

    PubMed

    Li, Hang; Jiang, Weihua; Zhang, Zan; Xing, Yanru; Li, Fei

    2013-01-01

    The beet armyworm, Spodoptera exigua (Hübner), is a serious pest worldwide that causes significant losses in crops. Unfortunately, genetic resources for the beet armyworm is extremely scarce. To improve these resources we sequenced the transcriptome of S. exigua representing all stages including eggs, 1(st) to 5(th) instar larvae, pupae, male and female adults using the Illumina Solexa platform. We assembled the transcriptome with Trinity that yielded 31,414 contigs. Of these contigs, 18,592 were annotated as protein coding genes by Blast searches against the NCBI nr database. It has been shown that knockdown of important insect genes by dsRNAs or siRNAs is a feasible mechanism to control insect pests. The first key step towards developing an efficient RNAi-mediated pest control technique is to find suitable target genes. To screen for effective target genes in the beet armyworm, we selected nine candidate genes. The sequences of these genes were amplified using the RACE strategy. Then, siRNAs were designed and chemically synthesized. We injected 2 µl siRNA (2 µg/µl) into the 4(th) instar larvae to knock down the respective target genes. The mRNA abundance of target genes decreased to different levels (∼20-94.3%) after injection of siRNAs. Knockdown of eight genes including chitinase7, PGCP, chitinase1, ATPase, tubulin1, arf2, tubulin2 and arf1 caused a significantly high level of mortality compared to the negative control (P<0.05). About 80% of the surviving insects in the siRNA-treated group of five genes (PGCP, chitinase1, tubulin1, tubulin2 and helicase) showed retarded development. In chitinase1-siRNA and chitinase7-siRNA administered groups, 12.5% survivors exhibited "half-ecdysis". In arf1-siRNA and arf2-siRNA groups, the body color of 15% became black 48 h after injections. In summary, the transcriptome could be a valuable genetic resource for identification of genes in S. exigua and this study provided putative targets for RNAi pest control.

  17. Transcriptome Analysis and Screening for Potential Target Genes for RNAi-Mediated Pest Control of the Beet Armyworm, Spodoptera exigua

    PubMed Central

    Zhang, Zan; Xing, Yanru; Li, Fei

    2013-01-01

    The beet armyworm, Spodoptera exigua (Hübner), is a serious pest worldwide that causes significant losses in crops. Unfortunately, genetic resources for the beet armyworm is extremely scarce. To improve these resources we sequenced the transcriptome of S. exigua representing all stages including eggs, 1st to 5th instar larvae, pupae, male and female adults using the Illumina Solexa platform. We assembled the transcriptome with Trinity that yielded 31,414 contigs. Of these contigs, 18,592 were annotated as protein coding genes by Blast searches against the NCBI nr database. It has been shown that knockdown of important insect genes by dsRNAs or siRNAs is a feasible mechanism to control insect pests. The first key step towards developing an efficient RNAi-mediated pest control technique is to find suitable target genes. To screen for effective target genes in the beet armyworm, we selected nine candidate genes. The sequences of these genes were amplified using the RACE strategy. Then, siRNAs were designed and chemically synthesized. We injected 2 µl siRNA (2 µg/µl) into the 4th instar larvae to knock down the respective target genes. The mRNA abundance of target genes decreased to different levels (∼20–94.3%) after injection of siRNAs. Knockdown of eight genes including chitinase7, PGCP, chitinase1, ATPase, tubulin1, arf2, tubulin2 and arf1 caused a significantly high level of mortality compared to the negative control (P<0.05). About 80% of the surviving insects in the siRNA-treated group of five genes (PGCP, chitinase1, tubulin1, tubulin2 and helicase) showed retarded development. In chitinase1-siRNA and chitinase7-siRNA administered groups, 12.5% survivors exhibited “half-ecdysis”. In arf1-siRNA and arf2-siRNA groups, the body color of 15% became black 48 h after injections. In summary, the transcriptome could be a valuable genetic resource for identification of genes in S. exigua and this study provided putative targets for RNAi pest control. PMID:23823756

  18. PeanutDB: an integrated bioinformatics web portal for Arachis hypogaea transcriptomics

    PubMed Central

    2012-01-01

    Background The peanut (Arachis hypogaea) is an important crop cultivated worldwide for oil production and food sources. Its complex genetic architecture (e.g., the large and tetraploid genome possibly due to unique cross of wild diploid relatives and subsequent chromosome duplication: 2n = 4x = 40, AABB, 2800 Mb) presents a major challenge for its genome sequencing and makes it a less-studied crop. Without a doubt, transcriptome sequencing is the most effective way to harness the genome structure and gene expression dynamics of this non-model species that has a limited genomic resource. Description With the development of next generation sequencing technologies such as 454 pyro-sequencing and Illumina sequencing by synthesis, the transcriptomics data of peanut is rapidly accumulated in both the public databases and private sectors. Integrating 187,636 Sanger reads (103,685,419 bases), 1,165,168 Roche 454 reads (333,862,593 bases) and 57,135,995 Illumina reads (4,073,740,115 bases), we generated the first release of our peanut transcriptome assembly that contains 32,619 contigs. We provided EC, KEGG and GO functional annotations to these contigs and detected SSRs, SNPs and other genetic polymorphisms for each contig. Based on both open-source and our in-house tools, PeanutDB presents many seamlessly integrated web interfaces that allow users to search, filter, navigate and visualize easily the whole transcript assembly, its annotations and detected polymorphisms and simple sequence repeats. For each contig, sequence alignment is presented in both bird’s-eye view and nucleotide level resolution, with colorfully highlighted regions of mismatches, indels and repeats that facilitate close examination of assembly quality, genetic polymorphisms, sequence repeats and/or sequencing errors. Conclusion As a public genomic database that integrates peanut transcriptome data from different sources, PeanutDB (http://bioinfolab.muohio.edu/txid3818v1) provides the Peanut research community with an easy-to-use web portal that will definitely facilitate genomics research and molecular breeding in this less-studied crop. PMID:22712730

  19. Short read Illumina data for the de novo assembly of a non-model snail species transcriptome (Radix balthica, Basommatophora, Pulmonata), and a comparison of assembler performance

    PubMed Central

    2011-01-01

    Background Until recently, read lengths on the Solexa/Illumina system were too short to reliably assemble transcriptomes without a reference sequence, especially for non-model organisms. However, with read lengths up to 100 nucleotides available in the current version, an assembly without reference genome should be possible. For this study we created an EST data set for the common pond snail Radix balthica by Illumina sequencing of a normalized transcriptome. Performance of three different short read assemblers was compared with respect to: the number of contigs, their length, depth of coverage, their quality in various BLAST searches and the alignment to mitochondrial genes. Results A single sequencing run of a normalized RNA pool resulted in 16,923,850 paired end reads with median read length of 61 bases. The assemblies generated by VELVET, OASES, and SeqMan NGEN differed in the total number of contigs, contig length, the number and quality of gene hits obtained by BLAST searches against various databases, and contig performance in the mt genome comparison. While VELVET produced the highest overall number of contigs, a large fraction of these were of small size (< 200bp), and gave redundant hits in BLAST searches and the mt genome alignment. The best overall contig performance resulted from the NGEN assembly. It produced the second largest number of contigs, which on average were comparable to the OASES contigs but gave the highest number of gene hits in two out of four BLAST searches against different reference databases. A subsequent meta-assembly of the four contig sets resulted in larger contigs, less redundancy and a higher number of BLAST hits. Conclusion Our results document the first de novo transcriptome assembly of a non-model species using Illumina sequencing data. We show that de novo transcriptome assembly using this approach yields results useful for downstream applications, in particular if a meta-assembly of contig sets is used to increase contig quality. These results highlight the ongoing need for improvements in assembly methodology. PMID:21679424

  20. Transcriptome Analysis of the Dihydrotestosterone-Exposed Fetal Rat Gubernaculum Identifies Common Androgen and Insulin-Like 3 Targets1

    PubMed Central

    Barthold, Julia S.; Wang, Yanping; Robbins, Alan; Pike, Jack; McDowell, Erin; Johnson, Kamin J.; McCahan, Suzanne M.

    2013-01-01

    ABSTRACT Androgens and insulin-like 3 (INSL3) are required for development of the fetal gubernaculum and testicular descent. Previous studies suggested that the INSL3-exposed fetal gubernacular transcriptome is enriched for genes involved in neural pathways. In the present study, we profiled the transcriptome of fetal gubernaculum explants exposed to dihydrotestosterone (DHT) and compared this response to that with INSL3. We exposed fetal (Embryonic Day 17) rat gubernacula to DHT for 24 h (10 and 30 nM) or 6 h (1 and 10 nM) in organ culture and analyzed gene expression relative to that of vehicle-treated controls using Affymetrix arrays. Results were annotated using functional, pathway, and promoter analyses and independently validated for selected transcripts using quantitative RT-PCR (qRT-PCR). Transcripts were differentially expressed after 24 h but not 6 h. Most highly overrepresented functional categories included those related to gene expression, skeletal and muscular development and function, and Wnt signaling. Promoter response elements enriched in the DHT-specific transcriptome included consensus sequences for c-ETS1, ELK1, CREB, CRE-BP1/c-June, NRF2, and USF. We observed that 55% of DHT probe sets were also differentially expressed after INSL3 exposure and that the direction of change was the same in 96%. The qRT-PCR results confirmed that DHT increased expression of the INSL3-responsive genes Crlf1 and Chrdl2 but reduced expression of Wnt4. We also validated reduced Tgfb2 and Cxcl12 and increased Slit3 expression following DHT exposure. These data suggest a robust overlap in the DHT- and INSL3-regulated transcriptome that may be mediated in part by CREB signaling and a common Wnt pathway response for both hormones in the fetal gubernaculum. PMID:24174575

  1. Phenome-genome association studies of pancreatic cancer: new targets for therapy and diagnosis.

    PubMed

    Narayanan, Ramaswamy

    2015-01-01

    Pancreatic cancer, has a very high mortality rate and requires novel molecular targets for diagnosis and therapy. Genetic association studies over databases offer an attractive starting point for gene discovery. The National Center for Biotechnology Information (NCBI) Phenome Genome Integrator (PheGenI) tool was enriched for pancreatic cancer-associated traits. The genes associated with the trait were characterized using diverse bioinformatics tools for Genome-Wide Association (GWA), transcriptome and proteome profile and protein classes for motif and domain. Two hundred twenty-six genes were identified that had a genetic association with pancreatic cancer in the human genome. This included 25 uncharacterized open reading frames (ORFs). Bioinformatics analysis of these ORFs identified putative druggable proteins and biomarkers including enzymes, transporters and G-protein-coupled receptor signaling proteins. Secreted proteins including a neuroendocrine factor and a chemokine were identified. Five out of these ORFs encompassed non coding RNAs. The ORF protein expression was detected in numerous body fluids, such as ascites, bile, pancreatic juice, milk, plasma, serum and saliva. Transcriptome and proteome analyses showed a correlation of mRNA and protein expression for nine ORFs. Analysis of the Catalogue of Somatic Mutations in Cancer (COSMIC) database revealed a strong correlation across copy number variations and mRNA over-expression for four ORFs. Mining of the International Cancer Gene Consortium (ICGC) database identified somatic mutations in a significant number of pancreatic patients' tumors for most of these ORFs. The pancreatic cancer-associated ORFs were also found to be genetically associated with other neoplasms, including leukemia, malignant melanoma, neuroblastoma and prostate carcinomas, as well as other unrelated diseases and disorders, such as Alzheimer's disease, Crohn's disease, coronary diseases, attention deficit disorder and addiction. Based on Genome-Wide Association Studies (GWAS), copy number variations, somatic mutational status and correlation of gene expression in pancreatic tumors at the mRNA and protein level, expression specificity in normal tissues and detection in body fluids, six ORFs emerged as putative leads for pancreatic cancer. These six targets provide a basis for accelerated drug discovery and diagnostic marker development for pancreatic cancer. Copyright© 2015, International Institute of Anticancer Research (Dr. John G. Delinasios), All rights reserved.

  2. Transcriptomic responses of the liver and adipose tissues to altered carbohydrate-fat ratio in diet: an isoenergetic study in young rats.

    PubMed

    Tanaka, Mitsuru; Yasuoka, Akihito; Shimizu, Manae; Saito, Yoshikazu; Kumakura, Kei; Asakura, Tomiko; Nagai, Toshitada

    2017-01-01

    To elucidate the effects of altered dietary carbohydrate and fat balance on liver and adipose tissue transcriptomes, 3-week-old rats were fed three kinds of diets: low-, moderate-, and high-fat diets (L, M, and H) containing a different ratio of carbohydrate-fat (C-F) (65:15, 60:20, and 35:45 in energy percent, respectively). The rats consumed the diets for 9 weeks and were subjected to biochemical and DNA microarray analyses. The rats in the H-group exhibited lower serum triacylglycerol (TG) levels but higher liver TG and cholesterol content than rats in the L-group. The analysis of differentially expressed genes (DEGs) between each group (L vs M, M vs H, and L vs H) in the liver revealed about 35% of L vs H DEGs that were regulated in the same way as M vs H DEGs, and most of the others were L- vs H-specific. Gene ontology analysis of these L vs H DEGs indicated that those related to fatty acid synthesis and circadian rhythm were enriched. Interestingly, about 30% of L vs M DEGs were regulated in a reverse way compared with L vs H and M vs H DEGs. These reversed liver DEGs included M-up/H-down genes ( Sds for gluconeogenesis from amino acids) and M-down/H-up genes ( Gpd2 for gluconeogenesis from glycerol, Agpat9 for TG synthesis, and Acot1 for beta-oxidation). We also analyzed L vs H DEGs in white (WAT) and brown (BAT) adipose tissues and found that both oxidation and synthesis of fatty acids were inhibited in these tissues. These results indicate that the alteration of dietary C-F balance differentially affects the transcriptomes of metabolizing and energy-storing tissues.

  3. Analysis of the Macaca mulatta transcriptome and the sequence divergence between Macaca and human.

    PubMed

    Magness, Charles L; Fellin, P Campion; Thomas, Matthew J; Korth, Marcus J; Agy, Michael B; Proll, Sean C; Fitzgibbon, Matthew; Scherer, Christina A; Miner, Douglas G; Katze, Michael G; Iadonato, Shawn P

    2005-01-01

    We report the initial sequencing and comparative analysis of the Macaca mulatta transcriptome. Cloned sequences from 11 tissues, nine animals, and three species (M. mulatta, M. fascicularis, and M. nemestrina) were sampled, resulting in the generation of 48,642 sequence reads. These data represent an initial sampling of the putative rhesus orthologs for 6,216 human genes. Mean nucleotide diversity within M. mulatta and sequence divergence among M. fascicularis, M. nemestrina, and M. mulatta are also reported.

  4. Altered hepatic lipid metabolism in mice lacking both the melanocortin type 4 receptor and low density lipoprotein receptor.

    PubMed

    Lede, Vera; Meusel, Andrej; Garten, Antje; Popkova, Yulia; Penke, Melanie; Franke, Christin; Ricken, Albert; Schulz, Angela; Kiess, Wieland; Huster, Daniel; Schöneberg, Torsten; Schiller, Jürgen

    2017-01-01

    Obesity is often associated with dyslipidemia and hepatosteatosis. A number of animal models of non-alcoholic fatty liver disease (NAFLD) are established but they significantly differ in the molecular and biochemical changes depending on the genetic modification and diet used. Mice deficient for melanocortin type 4 receptor (Mc4rmut) develop hyperphagia, obesity, and subsequently NAFLD already under regular chow and resemble more closely the energy supply-driven obesity found in humans. This animal model was used to assess the molecular and biochemical consequences of hyperphagia-induced obesity on hepatic lipid metabolism. We analyzed transcriptome changes in Mc4rmut mice by RNA sequencing and used high resolution 1H magic angle spinning NMR spectroscopy and MALDI-TOF mass spectrometry to assess changes in the lipid composition. On the transcriptomic level we found significant changes in components of the triacylglycerol metabolism, unsaturated fatty acids biosynthesis, peroxisome proliferator-activated receptor signaling pathways, and lipid transport and storage compared to the wild-type. These findings were supported by increases in triacylglycerol, monounsaturated fatty acid, and arachidonic acid levels. The transcriptome signatures significantly differ from those of other NAFLD mouse models supporting the concept of hepatic subphenotypes depending on the genetic background and diet. Comparative analyses of our data with previous studies allowed for the identification of common changes and genotype-specific components and pathways involved in obesity-associated NAFLD.

  5. De novo assembly and analysis of the Artemisia argyi transcriptome and identification of genes involved in terpenoid biosynthesis.

    PubMed

    Liu, Miaomiao; Zhu, Jinhang; Wu, Shengbing; Wang, Chenkai; Guo, Xingyi; Wu, Jiawen; Zhou, Meiqi

    2018-04-11

    Artemisia argyi Lev. et Vant. (A. argyi) is widely utilized for moxibustion in Chinese medicine, and the mechanism underlying terpenoid biosynthesis in its leaves is suggested to play an important role in its medicinal use. However, the A. argyi transcriptome has not been sequenced. Herein, we performed RNA sequencing for A. argyi leaf, root and stem tissues to identify as many as possible of the transcribed genes. In total, 99,807 unigenes were assembled by analysing the expression profiles generated from the three tissue types, and 67,446 of those unigenes were annotated in public databases. We further performed differential gene expression analysis to compare leaf tissue with the other two tissue types and identified numerous genes that were specifically expressed or up-regulated in leaf tissue. Specifically, we identified multiple genes encoding significant enzymes or transcription factors related to terpenoid synthesis. This study serves as a valuable resource for transcriptome information, as many transcribed genes related to terpenoid biosynthesis were identified in the A. argyi transcriptome, providing a functional genomic basis for additional studies on molecular mechanisms underlying the medicinal use of A. argyi.

  6. Transcriptome Sequencing of Gracilariopsis lemaneiformis to Analyze the Genes Related to Optically Active Phycoerythrin Synthesis.

    PubMed

    Huang, Xiaoyun; Zang, Xiaonan; Wu, Fei; Jin, Yuming; Wang, Haitao; Liu, Chang; Ding, Yating; He, Bangxiang; Xiao, Dongfang; Song, Xinwei; Liu, Zhu

    2017-01-01

    Gracilariopsis lemaneiformis (aka Gracilaria lemaneiformis) is a red macroalga rich in phycoerythrin, which can capture light efficiently and transfer it to photosystemⅡ. However, little is known about the synthesis of optically active phycoerythrinin in G. lemaneiformis at the molecular level. With the advent of high-throughput sequencing technology, analysis of genetic information for G. lemaneiformis by transcriptome sequencing is an effective means to get a deeper insight into the molecular mechanism of phycoerythrin synthesis. Illumina technology was employed to sequence the transcriptome of two strains of G. lemaneiformis- the wild type and a green-pigmented mutant. We obtained a total of 86915 assembled unigenes as a reference gene set, and 42884 unigenes were annotated in at least one public database. Taking the above transcriptome sequencing as a reference gene set, 4041 differentially expressed genes were screened to analyze and compare the gene expression profiles of the wild type and green mutant. By GO and KEGG pathway analysis, we concluded that three factors, including a reduction in the expression level of apo-phycoerythrin, an increase of chlorophyll light-harvesting complex synthesis, and reduction of phycoerythrobilin by competitive inhibition, caused the reduction of optically active phycoerythrin in the green-pigmented mutant.

  7. Characterization of the myometrial transcriptome in women with an arrest of dilatation during labor

    PubMed Central

    Chaemsaithong, Piya; Madan, Ichchha; Romero, Roberto; Than, Nandor G; Tarca, Adi L; Draghici, Sorin; Bhatti, Gaurav; Mazor, Moshe; Kim, Chong Jai; Hassan, Sonia S; Chaiworapongsa, Tinnakorn

    2014-01-01

    Objective The molecular basis of failure to progress in labor is poorly understood. This study was undertaken to characterize the myometrial transcriptome of patients with an arrest of dilatation (AODIL). Study design Human myometrium was prospectively collected from women in the following groups: 1) spontaneous term labor (TL; n=29); and 2) arrest of dilatation (AODIL; n=14). Gene expression was characterized using Illumina® HumanHT-12 microarrays. A moderated student t-test and false discovery rate adjustment were used for analysis. Quantitative reverse transcription-polymerase chain reaction (qRT-PCR) of selected genes was performed in an independent sample set. Pathway analysis was performed on the Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway database using Pathway Analysis with Down-weighting of Overlapping Genes (PADOG). The Metacore knowledge base was also mined for pathway analysis. Results 1) 42 genes differentially expressed were identified in women with an AODIL; 2) gene ontology analysis indicated enrichment of biological processes, which included: regulation of angiogenesis, response to hypoxia, inflammatory response, and chemokine-mediated signaling pathway. Enriched molecular functions included: transcription repressor activity, Heat shock protein (Hsp) 90 binding, and nitric oxide synthase (NOS) activity; 3) Metacore analysis identified immune response chemokine (C-C motif) ligand 2 (CCL2) signaling, muscle contraction regulation of eNOS activity in endothelial cells, and Triiodothyronine and Thyroxine signaling as significantly over-represented (FDR<0.05); 4) qRT-PCR confirmed overexpression of Nitric oxide synthase 3 NOS3; hypoxic ischemic factor (HIF1A), Chemokine (C-C motif) ligand 2 (CCL2); angiopoietin-like 4 (ANGPTL4), ADAM metallopeptidase with thrombospondin type 1, motif 9 (ADAMTS9), G protein-coupled receptor 4 (GPR4), metallothionein 1A (MT1A), MT2A, selectin E (SELE) in an AODIL. Conclusion The myometrium of women with arrest of dilatation have a stereotypic transcriptome profile. This disorder was associated with a pattern of gene expression involved in muscle contraction, an inflammatory response, and hypoxia. This is the first comprehensive and unbiased examination of the molecular basis of an AODIL. PMID:23893668

  8. Analysis of drought-responsive signalling network in two contrasting rice cultivars using transcriptome-based approach

    PubMed Central

    Borah, Pratikshya; Sharma, Eshan; Kaur, Amarjot; Chandel, Girish; Mohapatra, Trilochan; Kapoor, Sanjay; Khurana, Jitendra P.

    2017-01-01

    Traditional cultivars of rice in India exhibit tolerance to drought stress due to their inherent genetic variations. Here we present comparative physiological and transcriptome analyses of two contrasting cultivars, drought tolerant Dhagaddeshi (DD) and susceptible IR20. Microarray analysis revealed several differentially expressed genes (DEGs) exclusively in DD as compared to IR20 seedlings exposed to 3 h drought stress. Physiologically, DD seedlings showed higher cell membrane stability and differential ABA accumulation in response to dehydration, coupled with rapid changes in gene expression. Detailed analyses of metabolic pathways enriched in expression data suggest interplay of ABA dependent along with secondary and redox metabolic networks that activate osmotic and detoxification signalling in DD. By co-localization of DEGs with QTLs from databases or published literature for physiological traits of DD and IR20, candidate genes were identified including those underlying major QTL qDTY1.1 in DD. Further, we identified previously uncharacterized genes from both DD and IR20 under drought conditions including OsWRKY51, OsVP1 and confirmed their expression by qPCR in multiple rice cultivars. OsFBK1 was also functionally validated in susceptible PB1 rice cultivar and Arabidopsis for providing drought tolerance. Some of the DEGs mapped to the known QTLs could thus, be of potential significance for marker-assisted breeding. PMID:28181537

  9. Familial Dysautonomia (FD) Human Embryonic Stem Cell Derived PNS Neurons Reveal that Synaptic Vesicular and Neuronal Transport Genes Are Directly or Indirectly Affected by IKBKAP Downregulation

    PubMed Central

    Kantor, Gal; Cheishvili, David; Even, Aviel; Birger, Anastasya; Turetsky, Tikva; Gil, Yaniv; Even-Ram, Sharona; Aizenman, Einat; Bashir, Nibal; Maayan, Channa; Razin, Aharon; Reubinoff, Benjamim E.; Weil, Miguel

    2015-01-01

    A splicing mutation in the IKBKAP gene causes Familial Dysautonomia (FD), affecting the IKAP protein expression levels and proper development and function of the peripheral nervous system (PNS). Here we found new molecular insights for the IKAP role and the impact of the FD mutation in the human PNS lineage by using a novel and unique human embryonic stem cell (hESC) line homozygous to the FD mutation originated by pre implantation genetic diagnosis (PGD) analysis. We found that IKBKAP downregulation during PNS differentiation affects normal migration in FD-hESC derived neural crest cells (NCC) while at later stages the PNS neurons show reduced intracellular colocalization between vesicular proteins and IKAP. Comparative wide transcriptome analysis of FD and WT hESC-derived neurons together with the analysis of human brains from FD and WT 12 weeks old embryos and experimental validation of the results confirmed that synaptic vesicular and neuronal transport genes are directly or indirectly affected by IKBKAP downregulation in FD neurons. Moreover we show that kinetin (a drug that corrects IKBKAP alternative splicing) promotes the recovery of IKAP expression and these IKAP functional associated genes identified in the study. Altogether, these results support the view that IKAP might be a vesicular like protein that might be involved in neuronal transport in hESC derived PNS neurons. This function seems to be mostly affected in FD-hESC derived PNS neurons probably reflecting some PNS neuronal dysfunction observed in FD. PMID:26437462

  10. Transcriptomic response of the Antarctic pteropod Limacina helicina antarctica to ocean acidification.

    PubMed

    Johnson, Kevin M; Hofmann, Gretchen E

    2017-10-23

    Ocean acidification (OA), a change in ocean chemistry due to the absorption of atmospheric CO 2 into surface oceans, challenges biogenic calcification in many marine organisms. Ocean acidification is expected to rapidly progress in polar seas, with regions of the Southern Ocean expected to experience severe OA within decades. Biologically, the consequences of OA challenge calcification processes and impose an energetic cost. In order to better characterize the response of a polar calcifier to conditions of OA, we assessed differential gene expression in the Antarctic pteropod, Limacina helicina antarctica. Experimental levels of pCO 2 were chosen to create both contemporary pH conditions, and to mimic future pH expected in OA scenarios. Significant changes in the transcriptome were observed when juvenile L. h. antarctica were acclimated for 21 days to low-pH (7.71), mid-pH (7.9) or high-pH (8.13) conditions. Differential gene expression analysis of individuals maintained in the low-pH treatment identified down-regulation of genes involved in cytoskeletal structure, lipid transport, and metabolism. High pH exposure led to increased expression and enrichment for genes involved in shell formation, calcium ion binding, and DNA binding. Significant differential gene expression was observed in four major cellular and physiological processes: shell formation, the cellular stress response, metabolism, and neural function. Across these functional groups, exposure to conditions that mimic ocean acidification led to rapid suppression of gene expression. Results of this study demonstrated that the transcriptome of the juvenile pteropod, L. h. antarctica, was dynamic and changed in response to different levels of pCO 2 . In a global change context, exposure of L. h. antarctica to the low pH, high pCO 2 OA conditions resulted in a suppression of transcripts for genes involved in key physiological processes: calcification, metabolism, and the cellular stress response. The transcriptomic response at both acute and longer-term acclimation time frames indicated that contemporary L. h. antarctica may not have the physiological plasticity necessary for adaptation to OA conditions expected in future decades. Lastly, the differential gene expression results further support the role of shelled pteropods such as L. h. antarctica as sentinel organisms for the impacts of ocean acidification.

  11. Comparative de novo transcriptome analysis of male and female Sea buckthorn.

    PubMed

    Bansal, Ankush; Salaria, Mehul; Sharma, Tashil; Stobdan, Tsering; Kant, Anil

    2018-02-01

    Sea buckthorn is a dioecious medicinal plant found at high altitude. The plant has both male and female reproductive organs in separate individuals. In this article, whole transcriptome de novo assemblies of male and female flower bud samples were carried out using Illumina NextSeq 500 platform to determine the role of the genes involved in sex determination. Moreover, genes with differential expression in male and female transcriptomes were identified to understand the underlying sex determination mechanism. The current study showed 63,904 and 62,272 coding sequences (CDS) in female and male transcriptome data sets, respectively. 16,831 common CDS were screened out from both transcriptomes, out of which 625 were upregulated and 491 were found to be downregulated. To understand the potential regulatory roles of differentially expressed genes in metabolic networks and biosynthetic pathways: KEGG mapping, gene ontology, and co-expression network analysis were performed. Comparison with Flowering Interactive Database (FLOR-ID) resulted in eight differentially expressed genes viz. CHD3-type chromatin-remodeling factor PICKLE ( PKL ), phytochrome-associated serine/threonine-protein phosphatase ( FYPP ), protein TOPLESS ( TPL ), sensitive to freezing 6 ( SFR6 ), lysine-specific histone demethylase 1 homolog 1 ( LDL1 ), pre-mRNA-processing-splicing factor 8A ( PRP8A ), sucrose synthase 4 ( SUS4 ), ubiquitin carboxyl-terminal hydrolase 12 ( UBP12 ), known to be broadly involved in flowering, photoperiodism, embryo development, and cold response pathways. Male and female flower bud transcriptome data of Sea buckthorn may provide comprehensive information at genomic level for the identification of genetic regulation involved in sex determination.

  12. Targeting of Repeated Sequences Unique to a Gene Results in Significant Increases in Antisense Oligonucleotide Potency

    PubMed Central

    Vickers, Timothy A.; Freier, Susan M.; Bui, Huynh-Hoa; Watt, Andrew; Crooke, Stanley T.

    2014-01-01

    A new strategy for identifying potent RNase H-dependent antisense oligonucleotides (ASOs) is presented. Our analysis of the human transcriptome revealed that a significant proportion of genes contain unique repeated sequences of 16 or more nucleotides in length. Activities of ASOs targeting these repeated sites in several representative genes were compared to those of ASOs targeting unique single sites in the same transcript. Antisense activity at repeated sites was also evaluated in a highly controlled minigene system. Targeting both native and minigene repeat sites resulted in significant increases in potency as compared to targeting of non-repeated sites. The increased potency at these sites is a result of increased frequency of ASO/RNA interactions which, in turn, increases the probability of a productive interaction between the ASO/RNA heteroduplex and human RNase H1 in the cell. These results suggest a new, highly efficient strategy for rapid identification of highly potent ASOs. PMID:25334092

  13. Temporal transcriptome changes induced by methyl jasmonate in Salvia sclarea.

    PubMed

    Hao, Da Cheng; Chen, Shi Lin; Osbourn, Anne; Kontogianni, Vassiliki G; Liu, Li Wei; Jordán, Maria J

    2015-03-01

    Salvia sclarea is a traditional medicinal and aromatic plant that grows in Europe and produces various economically important compounds, including phenylpropanoid derivatives and terpenoids. Methyl jasmonate (MeJA) is commonly used to elicit plant stress responses. However, how MeJA enhances production of secondary metabolites in S. sclarea is not well understood. We performed a genome-wide analysis of temporal gene expression in S. sclarea leaves and roots. The transcriptome profiles 0, 10 and 26 h after MeJA treatment were analyzed by Illumina RNA-Seq. A total of 16,142 isogenes (average length 866bp; N50 1035bp) were obtained by de novo assembly of 35,757,567 raw sequencing reads. When these sequencing reads were mapped onto the assembled Unigenes, 3236, 2792 and 798 Unigenes were found to be expressed differentially between 0 and 10h, 0 and 26 h, and 10 and 26h, respectively. These included many secondary metabolite biosynthesis, stress and defense-related genes. A qRT-PCR analysis confirmed the expression profiles of selected differentially expressed genes (DEGs) revealed by RNA-Seq data, and also extended our analysis of differential gene expression to 73 h. Our investigations revealed temporal differences in the responses of S. sclarea to MeJA treatment. MeJA treatment induced the expression of a large number of genes involved in phenylpropanoid biosynthesis, especially between 0 and 10h, and 0 and 26 h. Additionally, many genes encoding transcription factors, cytochrome P450s, glycosyltransferases, methyltransferases and transporters were shown to respond to MeJA elicitation. DEGs related to structural molecule activity and cell death showed a significant temporal variation. A chromatographic analysis of metabolites at 26h, 73h and six days after MeJA treatment indicated that these transcriptomic changes precede MeJA-induced changes in secondary metabolite content. This study sheds light on the molecular mechanisms of MeJA elicitation and is helpful in understanding how exogenous MeJA treatment mediates extensive plant transcriptome reprogramming/remodeling. Our results can be utilized to characterize genes related to secondary metabolism and their regulation, and in breeding S. sclarea for desirable chemotypes. Copyright © 2014 Elsevier B.V. All rights reserved.

  14. Comparative Characterization of the Leaf Tissue of Physalis alkekengi and Physalis peruviana Using RNA-seq and Metabolite Profiling

    PubMed Central

    Fukushima, Atsushi; Nakamura, Michimi; Suzuki, Hideyuki; Yamazaki, Mami; Knoch, Eva; Mori, Tetsuya; Umemoto, Naoyuki; Morita, Masaki; Hirai, Go; Sodeoka, Mikiko; Saito, Kazuki

    2016-01-01

    The genus Physalis in the Solanaceae family contains several species of benefit to humans. Examples include P. alkekengi (Chinese-lantern plant, hôzuki in Japanese) used for medicinal and for decorative purposes, and P. peruviana, also known as Cape gooseberry, which bears an edible, vitamin-rich fruit. Members of the Physalis genus are a valuable resource for phytochemicals needed for the development of medicines and functional foods. To fully utilize the potential of these phytochemicals we need to understand their biosynthesis, and for this we need genomic data, especially comprehensive transcriptome datasets for gene discovery. We report the de novo assembly of the transcriptome from leaves of P. alkekengi and P. peruviana using Illumina RNA-seq technologies. We identified 75,221 unigenes in P. alkekengi and 54,513 in P. peruviana. All unigenes were annotated with gene ontology (GO), Enzyme Commission (EC) numbers, and pathway information from the Kyoto Encyclopedia of Genes and Genomes (KEGG). We classified unigenes encoding enzyme candidates putatively involved in the secondary metabolism and identified more than one unigenes for each step in terpenoid backbone- and steroid biosynthesis in P. alkekengi and P. peruviana. To measure the variability of the withanolides including physalins and provide insights into their chemical diversity in Physalis, we also analyzed the metabolite content in leaves of P. alkekengi and P. peruviana at five different developmental stages by liquid chromatography-mass spectrometry. We discuss that comprehensive transcriptome approaches within a family can yield a clue for gene discovery in Physalis and provide insights into their complex chemical diversity. The transcriptome information we submit here will serve as an important public resource for further studies of the specialized metabolism of Physalis species. PMID:28066454

  15. Comparative Characterization of the Leaf Tissue of Physalis alkekengi and Physalis peruviana Using RNA-seq and Metabolite Profiling.

    PubMed

    Fukushima, Atsushi; Nakamura, Michimi; Suzuki, Hideyuki; Yamazaki, Mami; Knoch, Eva; Mori, Tetsuya; Umemoto, Naoyuki; Morita, Masaki; Hirai, Go; Sodeoka, Mikiko; Saito, Kazuki

    2016-01-01

    The genus Physalis in the Solanaceae family contains several species of benefit to humans. Examples include P. alkekengi (Chinese-lantern plant, hôzuki in Japanese) used for medicinal and for decorative purposes, and P. peruviana , also known as Cape gooseberry, which bears an edible, vitamin-rich fruit. Members of the Physalis genus are a valuable resource for phytochemicals needed for the development of medicines and functional foods. To fully utilize the potential of these phytochemicals we need to understand their biosynthesis, and for this we need genomic data, especially comprehensive transcriptome datasets for gene discovery. We report the de novo assembly of the transcriptome from leaves of P. alkekengi and P. peruviana using Illumina RNA-seq technologies. We identified 75,221 unigenes in P. alkekengi and 54,513 in P. peruviana . All unigenes were annotated with gene ontology (GO), Enzyme Commission (EC) numbers, and pathway information from the Kyoto Encyclopedia of Genes and Genomes (KEGG). We classified unigenes encoding enzyme candidates putatively involved in the secondary metabolism and identified more than one unigenes for each step in terpenoid backbone- and steroid biosynthesis in P. alkekengi and P. peruviana . To measure the variability of the withanolides including physalins and provide insights into their chemical diversity in Physalis , we also analyzed the metabolite content in leaves of P. alkekengi and P. peruviana at five different developmental stages by liquid chromatography-mass spectrometry. We discuss that comprehensive transcriptome approaches within a family can yield a clue for gene discovery in Physalis and provide insights into their complex chemical diversity. The transcriptome information we submit here will serve as an important public resource for further studies of the specialized metabolism of Physalis species.

  16. An Efficient Method for Electroporation of Small Interfering RNAs into ENCODE Project Tier 1 GM12878 and K562 Cell Lines.

    PubMed

    Muller, Ryan Y; Hammond, Ming C; Rio, Donald C; Lee, Yeon J

    2015-12-01

    The Encyclopedia of DNA Elements (ENCODE) Project aims to identify all functional sequence elements in the human genome sequence by use of high-throughput DNA/cDNA sequencing approaches. To aid the standardization, comparison, and integration of data sets produced from different technologies and platforms, the ENCODE Consortium selected several standard human cell lines to be used by the ENCODE Projects. The Tier 1 ENCODE cell lines include GM12878, K562, and H1 human embryonic stem cell lines. GM12878 is a lymphoblastoid cell line, transformed with the Epstein-Barr virus, that was selected by the International HapMap Project for whole genome and transcriptome sequencing by use of the Illumina platform. K562 is an immortalized myelogenous leukemia cell line. The GM12878 cell line is attractive for the ENCODE Projects, as it offers potential synergy with the International HapMap Project. Despite the vast amount of sequencing data available on the GM12878 cell line through the ENCODE Project, including transcriptome, chromatin immunoprecipitation-sequencing for histone marks, and transcription factors, no small interfering siRNA-mediated knockdown studies have been performed in the GM12878 cell line, as cationic lipid-mediated transfection methods are inefficient for lymphoid cell lines. Here, we present an efficient and reproducible method for transfection of a variety of siRNAs into the GM12878 and K562 cell lines, which subsequently results in targeted protein depletion.

  17. Differences in acid tolerance between Bifidobacterium breve BB8 and its acid-resistant derivative B. breve BB8dpH, revealed by RNA-sequencing and physiological analysis.

    PubMed

    Yang, Xu; Hang, Xiaomin; Tan, Jing; Yang, Hong

    2015-06-01

    Bifidobacteria are common inhabitants of the human gastrointestinal tract, and their application has increased dramatically in recent years due to their health-promoting effects. The ability of bifidobacteria to tolerate acidic environments is particularly important for their function as probiotics because they encounter such environments in food products and during passage through the gastrointestinal tract. In this study, we generated a derivative, Bifidobacterium breve BB8dpH, which displayed a stable, acid-resistant phenotype. To investigate the possible reasons for the higher acid tolerance of B. breve BB8dpH, as compared with its parental strain B. breve BB8, a combined transcriptome and physiological approach was used to characterize differences between the two strains. An analysis of the transcriptome by RNA-sequencing indicated that the expression of 121 genes was increased by more than 2-fold, while the expression of 146 genes was reduced more than 2-fold, in B. breve BB8dpH. Validation of the RNA-sequencing data using real-time quantitative PCR analysis demonstrated that the RNA-sequencing results were highly reliable. The comparison analysis, based on differentially expressed genes, suggested that the acid tolerance of B. breve BB8dpH was enhanced by regulating the expression of genes involved in carbohydrate transport and metabolism, energy production, synthesis of cell envelope components (peptidoglycan and exopolysaccharide), synthesis and transport of glutamate and glutamine, and histidine synthesis. Furthermore, an analysis of physiological data showed that B. breve BB8dpH displayed higher production of exopolysaccharide and lower H(+)-ATPase activity than B. breve BB8. The results presented here will improve our understanding of acid tolerance in bifidobacteria, and they will lead to the development of new strategies to enhance the acid tolerance of bifidobacterial strains. Copyright © 2015 Elsevier Ltd. All rights reserved.

  18. The Porcelain Crab Transcriptome and PCAD, the Porcelain Crab Microarray and Sequence Database

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Tagmount, Abderrahmane; Wang, Mei; Lindquist, Erika

    2010-01-27

    Background: With the emergence of a completed genome sequence of the freshwater crustacean Daphnia pulex, construction of genomic-scale sequence databases for additional crustacean sequences are important for comparative genomics and annotation. Porcelain crabs, genus Petrolisthes, have been powerful crustacean models for environmental and evolutionary physiology with respect to thermal adaptation and understanding responses of marine organisms to climate change. Here, we present a large-scale EST sequencing and cDNA microarray database project for the porcelain crab Petrolisthes cinctipes. Methodology/Principal Findings: A set of ~;;30K unique sequences (UniSeqs) representing ~;;19K clusters were generated from ~;;98K high quality ESTs from a set ofmore » tissue specific non-normalized and mixed-tissue normalized cDNA libraries from the porcelain crab Petrolisthes cinctipes. Homology for each UniSeq was assessed using BLAST, InterProScan, GO and KEGG database searches. Approximately 66percent of the UniSeqs had homology in at least one of the databases. All EST and UniSeq sequences along with annotation results and coordinated cDNA microarray datasets have been made publicly accessible at the Porcelain Crab Array Database (PCAD), a feature-enriched version of the Stanford and Longhorn Array Databases.Conclusions/Significance: The EST project presented here represents the third largest sequencing effort for any crustacean, and the largest effort for any crab species. Our assembly and clustering results suggest that our porcelain crab EST data set is equally diverse to the much larger EST set generated in the Daphnia pulex genome sequencing project, and thus will be an important resource to the Daphnia research community. Our homology results support the pancrustacea hypothesis and suggest that Malacostraca may be ancestral to Branchiopoda and Hexapoda. Our results also suggest that our cDNA microarrays cover as much of the transcriptome as can reasonably be captured in EST library sequencing approaches, and thus represent a rich resource for studies of environmental genomics.« less

  19. Transcriptomic profiling as a screening tool to detect trenbolone treatment in beef cattle.

    PubMed

    Pegolo, S; Cannizzo, F T; Biolatti, B; Castagnaro, M; Bargelloni, L

    2014-06-01

    The effects of steroid hormone implants containing trenbolone alone (Finaplix-H), combined with 17β-oestradiol (17β-E; Revalor-H), or with 17β-E and dexamethasone (Revalor-H plus dexamethasone per os) on the bovine muscle transcriptome were examined by DNA-microarray. Overall, large sets of genes were shown to be modulated by the different growth promoters (GPs) and the regulated pathways and biological processes were mostly shared among the treatment groups. Using the Prediction Analysis of Microarray program, GP-treated animals were accurately identified by a small number of predictive genes. A meta-analysis approach was also carried out for the Revalor group to potentially increase the robustness of class prediction analysis. After data pre-processing, a high level of accuracy (90%) was obtained in the classification of samples, using 105 predictive gene markers. Transcriptomics could thus help in the identification of indirect biomarkers for anabolic treatment in beef cattle to be applied for the screening of muscle samples collected after slaughtering. Copyright © 2014 Elsevier Ltd. All rights reserved.

  20. Evaluating intra- and inter-individual variation in the human placental transcriptome.

    PubMed

    Hughes, David A; Kircher, Martin; He, Zhisong; Guo, Song; Fairbrother, Genevieve L; Moreno, Carlos S; Khaitovich, Philipp; Stoneking, Mark

    2015-03-19

    Gene expression variation is a phenotypic trait of particular interest as it represents the initial link between genotype and other phenotypes. Analyzing how such variation apportions among and within groups allows for the evaluation of how genetic and environmental factors influence such traits. It also provides opportunities to identify genes and pathways that may have been influenced by non-neutral processes. Here we use a population genetics framework and next generation sequencing to evaluate how gene expression variation is apportioned among four human groups in a natural biological tissue, the placenta. We estimate that on average, 33.2%, 58.9%, and 7.8% of the placental transcriptome is explained by variation within individuals, among individuals, and among human groups, respectively. Additionally, when technical and biological traits are included in models of gene expression they each account for roughly 2% of total gene expression variation. Notably, the variation that is significantly different among groups is enriched in biological pathways associated with immune response, cell signaling, and metabolism. Many biological traits demonstrate correlated changes in expression in numerous pathways of potential interest to clinicians and evolutionary biologists. Finally, we estimate that the majority of the human placental transcriptome exhibits expression profiles consistent with neutrality; the remainder are consistent with stabilizing selection, directional selection, or diversifying selection. We apportion placental gene expression variation into individual, population, and biological trait factors and identify how each influence the transcriptome. Additionally, we advance methods to associate expression profiles with different forms of selection.

  1. Human somatic cells subjected to genetic induction with six germ line-related factors display meiotic germ cell-like features

    PubMed Central

    Medrano, Jose V.; Martínez-Arroyo, Ana M.; Míguez, Jose M.; Moreno, Inmaculada; Martínez, Sebastián; Quiñonero, Alicia; Díaz-Gimeno, Patricia; Marqués-Marí, Ana I.; Pellicer, Antonio; Remohí, Jose; Simón, Carlos

    2016-01-01

    The in vitro derivation of human germ cells has attracted interest in the last years, but their direct conversion from human somatic cells has not yet been reported. Here we tested the ability of human male somatic cells to directly convert into a meiotic germ cell-like phenotype by inducing them with a combination of selected key germ cell developmental factors. We started with a pool of 12 candidates that were reduced to 6, demonstrating that ectopic expression of the germ line-related genes PRDM1, PRDM14, LIN28A, DAZL, VASA and SYCP3 induced direct conversion of somatic cells (hFSK (46, XY), and hMSC (46, XY)) into a germ cell-like phenotype in vitro. Induced germ cell-like cells showed a marked switch in their transcriptomic profile and expressed several post-meiotic germ line related markers, showed meiotic progression, evidence of epigenetic reprogramming, and approximately 1% were able to complete meiosis as demonstrated by their haploid status and the expression of several post-meiotic markers. Furthermore, xenotransplantation assays demonstrated that a subset of induced cells properly colonize the spermatogonial niche. Knowledge obtained from this work can be used to create in vitro models to study gamete-related diseases in humans. PMID:27112843

  2. Transcriptome Analysis of the Octopus vulgaris Central Nervous System

    PubMed Central

    Zhang, Xiang; Mao, Yong; Huang, Zixia; Qu, Meng; Chen, Jun; Ding, Shaoxiong; Hong, Jingni; Sun, Tiantian

    2012-01-01

    Background Cephalopoda are a class of Mollusca species found in all the world's oceans. They are an important model organism in neurobiology. Unfortunately, the lack of neuronal molecular sequences, such as ESTs, transcriptomic or genomic information, has limited the development of molecular neurobiology research in this unique model organism. Results With high-throughput Illumina Solexa sequencing technology, we have generated 59,859 high quality sequences from 12,918,391 paired-end reads. Using BLASTx/BLASTn, 12,227 contigs have blast hits in the Swissprot, NR protein database and NT nucleotide database with E-value cutoff 1e−5. The comparison between the Octopus vulgaris central nervous system (CNS) library and the Aplysia californica/Lymnaea stagnalis CNS ESTs library yielded 5.93%/13.45% of O. vulgaris sequences with significant matches (1e−5) using BLASTn/tBLASTx. Meanwhile the hit percentage of the recently published Schistocerca gregaria, Tilapia or Hirudo medicinalis CNS library to the O. vulgaris CNS library is 21.03%–46.19%. We constructed the Phylogenetic tree using two genes related to CNS function, Synaptotagmin-7 and Synaptophysin. Lastly, we demonstrated that O. vulgaris may have a vertebrate-like Blood-Brain Barrier based on bioinformatic analysis. Conclusion This study provides a mass of molecular information that will contribute to further molecular biology research on O. vulgaris. In our presentation of the first CNS transcriptome analysis of O. vulgaris, we hope to accelerate the study of functional molecular neurobiology and comparative evolutionary biology. PMID:22768275

  3. Genome-wide transcriptome profiling reveals novel insights into Luffa cylindrica browning.

    PubMed

    Chen, Xia; Tan, Taiming; Xu, Changcheng; Huang, Shuping; Tan, Jie; Zhang, Min; Wang, Chunli; Xie, Conghua

    2015-08-07

    Luffa cylindrica (sponge gourd) is one of the most popular vegetables in China. Production and consumption of L. cylindrica are limited due to postharvest browning; however, little is known about the genetic regulation of the browning process. In the present study, transcriptome profiles of L. cylindrica cultivars, YLB05 (browning resistant) and XTR05 (browning sensitive), were analyzed using next-generation sequencing to clarify the genes and mechanisms associated with browning. A total of 9.1 Gb of valid data including 116,703 unigenes (>200 bp) were obtained and 39,473 sequences were annotated by alignment against five public databases. Of these, there were 27,407 genes assigned to 747 Gene Ontology functional categories; and 12,350 genes were annotated with 25 Eukaryotic Orthologous Groups (KOG) categories with 343 KOG functional terms. Additionally, by searching against the Kyoto Encyclopedia of Genes and Genomes database, 8689 unigenes were mapped to 189 pathways. Furthermore, there were 24,556 sequences found to be differentially regulated, including 4344 annotated unigenes. Several genes potentially associated with phenolic oxidation, carbohydrate and hormone metabolism were found differentially regulated between the cultivars of different browning sensitivities. Our results suggest that elements involved in enzymatic processes and other pathways might be responsible for L. cylindrica browning. The present study provides a comprehensive transcriptome sequence resource, which will facilitate further studies on gene discovery and exploiting the fruit browning mechanism of L. cylindrica. Copyright © 2015 Elsevier Inc. All rights reserved.

  4. Is chloroplastic class IIA aldolase a marine enzyme?

    PubMed

    Miyasaka, Hitoshi; Ogata, Takeru; Tanaka, Satoshi; Ohama, Takeshi; Kano, Sanae; Kazuhiro, Fujiwara; Hayashi, Shuhei; Yamamoto, Shinjiro; Takahashi, Hiro; Matsuura, Hideyuki; Hirata, Kazumasa

    2016-11-01

    Expressed sequence tag analyses revealed that two marine Chlorophyceae green algae, Chlamydomonas sp. W80 and Chlamydomonas sp. HS5, contain genes coding for chloroplastic class IIA aldolase (fructose-1, 6-bisphosphate aldolase: FBA). These genes show robust monophyly with those of the marine Prasinophyceae algae genera Micromonas, Ostreococcus and Bathycoccus, indicating that the acquisition of this gene through horizontal gene transfer by an ancestor of the green algal lineage occurred prior to the divergence of the core chlorophytes (Chlorophyceae and Trebouxiophyceae) and the prasinophytes. The absence of this gene in some freshwater chlorophytes, such as Chlamydomonas reinhardtii, Volvox carteri, Chlorella vulgaris, Chlorella variabilis and Coccomyxa subellipsoidea, can therefore be explained by the loss of this gene somewhere in the evolutionary process. Our survey on the distribution of this gene in genomic and transcriptome databases suggests that this gene occurs almost exclusively in marine algae, with a few exceptions, and as such, we propose that chloroplastic class IIA FBA is a marine environment-adapted enzyme. This hypothesis was also experimentally tested using Chlamydomonas W80, for which we found that the transcript levels of this gene to be significantly lower under low-salt (that is, simulated terrestrial) conditions. Expression analyses of transcriptome data for two algae, Prymnesium parvum and Emiliania huxleyi, taken from the Sequence Read Archive database also indicated that the expression of this gene under terrestrial conditions (low NaCl and low sulfate) is significantly downregulated. Thus, these experimental and transcriptome data provide support for our hypothesis.

  5. Fish and chips: Various methodologies demonstrate utility of a 16,006-gene salmonid microarray

    PubMed Central

    von Schalburg, Kristian R; Rise, Matthew L; Cooper, Glenn A; Brown, Gordon D; Gibbs, A Ross; Nelson, Colleen C; Davidson, William S; Koop, Ben F

    2005-01-01

    Background We have developed and fabricated a salmonid microarray containing cDNAs representing 16,006 genes. The genes spotted on the array have been stringently selected from Atlantic salmon and rainbow trout expressed sequence tag (EST) databases. The EST databases presently contain over 300,000 sequences from over 175 salmonid cDNA libraries derived from a wide variety of tissues and different developmental stages. In order to evaluate the utility of the microarray, a number of hybridization techniques and screening methods have been developed and tested. Results We have analyzed and evaluated the utility of a microarray containing 16,006 (16K) salmonid cDNAs in a variety of potential experimental settings. We quantified the amount of transcriptome binding that occurred in cross-species, organ complexity and intraspecific variation hybridization studies. We also developed a methodology to rapidly identify and confirm the contents of a bacterial artificial chromosome (BAC) library containing Atlantic salmon genomic DNA. Conclusion We validate and demonstrate the usefulness of the 16K microarray over a wide range of teleosts, even for transcriptome targets from species distantly related to salmonids. We show the potential of the use of the microarray in a variety of experimental settings through hybridization studies that examine the binding of targets derived from different organs and tissues. Intraspecific variation in transcriptome expression is evaluated and discussed. Finally, BAC hybridizations are demonstrated as a rapid and accurate means to identify gene content. PMID:16164747

  6. Is chloroplastic class IIA aldolase a marine enzyme?

    PubMed Central

    Miyasaka, Hitoshi; Ogata, Takeru; Tanaka, Satoshi; Ohama, Takeshi; Kano, Sanae; Kazuhiro, Fujiwara; Hayashi, Shuhei; Yamamoto, Shinjiro; Takahashi, Hiro; Matsuura, Hideyuki; Hirata, Kazumasa

    2016-01-01

    Expressed sequence tag analyses revealed that two marine Chlorophyceae green algae, Chlamydomonas sp. W80 and Chlamydomonas sp. HS5, contain genes coding for chloroplastic class IIA aldolase (fructose-1, 6-bisphosphate aldolase: FBA). These genes show robust monophyly with those of the marine Prasinophyceae algae genera Micromonas, Ostreococcus and Bathycoccus, indicating that the acquisition of this gene through horizontal gene transfer by an ancestor of the green algal lineage occurred prior to the divergence of the core chlorophytes (Chlorophyceae and Trebouxiophyceae) and the prasinophytes. The absence of this gene in some freshwater chlorophytes, such as Chlamydomonas reinhardtii, Volvox carteri, Chlorella vulgaris, Chlorella variabilis and Coccomyxa subellipsoidea, can therefore be explained by the loss of this gene somewhere in the evolutionary process. Our survey on the distribution of this gene in genomic and transcriptome databases suggests that this gene occurs almost exclusively in marine algae, with a few exceptions, and as such, we propose that chloroplastic class IIA FBA is a marine environment-adapted enzyme. This hypothesis was also experimentally tested using Chlamydomonas W80, for which we found that the transcript levels of this gene to be significantly lower under low-salt (that is, simulated terrestrial) conditions. Expression analyses of transcriptome data for two algae, Prymnesium parvum and Emiliania huxleyi, taken from the Sequence Read Archive database also indicated that the expression of this gene under terrestrial conditions (low NaCl and low sulfate) is significantly downregulated. Thus, these experimental and transcriptome data provide support for our hypothesis. PMID:27058504

  7. RNAseq versus genome-predicted transcriptomes: a large population of novel transcripts identified in an Illumina-454 Hydra transcriptome.

    PubMed

    Wenger, Yvan; Galliot, Brigitte

    2013-03-25

    Evolutionary studies benefit from deep sequencing technologies that generate genomic and transcriptomic sequences from a variety of organisms. Genome sequencing and RNAseq have complementary strengths. In this study, we present the assembly of the most complete Hydra transcriptome to date along with a comparative analysis of the specific features of RNAseq and genome-predicted transcriptomes currently available in the freshwater hydrozoan Hydra vulgaris. To produce an accurate and extensive Hydra transcriptome, we combined Illumina and 454 Titanium reads, giving the primacy to Illumina over 454 reads to correct homopolymer errors. This strategy yielded an RNAseq transcriptome that contains 48'909 unique sequences including splice variants, representing approximately 24'450 distinct genes. Comparative analysis to the available genome-predicted transcriptomes identified 10'597 novel Hydra transcripts that encode 529 evolutionarily-conserved proteins. The annotation of 170 human orthologs points to critical functions in protein biosynthesis, FGF and TOR signaling, vesicle transport, immunity, cell cycle regulation, cell death, mitochondrial metabolism, transcription and chromatin regulation. However, a majority of these novel transcripts encodes short ORFs, at least 767 of them corresponding to pseudogenes. This RNAseq transcriptome also lacks 11'270 predicted transcripts that correspond either to silent genes or to genes expressed below the detection level of this study. We established a simple and powerful strategy to combine Illumina and 454 reads and we produced, with genome assistance, an extensive and accurate Hydra transcriptome. The comparative analysis of the RNAseq transcriptome with genome-predicted transcriptomes lead to the identification of large populations of novel as well as missing transcripts that might reflect Hydra-specific evolutionary events.

  8. RNAseq versus genome-predicted transcriptomes: a large population of novel transcripts identified in an Illumina-454 Hydra transcriptome

    PubMed Central

    2013-01-01

    Background Evolutionary studies benefit from deep sequencing technologies that generate genomic and transcriptomic sequences from a variety of organisms. Genome sequencing and RNAseq have complementary strengths. In this study, we present the assembly of the most complete Hydra transcriptome to date along with a comparative analysis of the specific features of RNAseq and genome-predicted transcriptomes currently available in the freshwater hydrozoan Hydra vulgaris. Results To produce an accurate and extensive Hydra transcriptome, we combined Illumina and 454 Titanium reads, giving the primacy to Illumina over 454 reads to correct homopolymer errors. This strategy yielded an RNAseq transcriptome that contains 48’909 unique sequences including splice variants, representing approximately 24’450 distinct genes. Comparative analysis to the available genome-predicted transcriptomes identified 10’597 novel Hydra transcripts that encode 529 evolutionarily-conserved proteins. The annotation of 170 human orthologs points to critical functions in protein biosynthesis, FGF and TOR signaling, vesicle transport, immunity, cell cycle regulation, cell death, mitochondrial metabolism, transcription and chromatin regulation. However, a majority of these novel transcripts encodes short ORFs, at least 767 of them corresponding to pseudogenes. This RNAseq transcriptome also lacks 11’270 predicted transcripts that correspond either to silent genes or to genes expressed below the detection level of this study. Conclusions We established a simple and powerful strategy to combine Illumina and 454 reads and we produced, with genome assistance, an extensive and accurate Hydra transcriptome. The comparative analysis of the RNAseq transcriptome with genome-predicted transcriptomes lead to the identification of large populations of novel as well as missing transcripts that might reflect Hydra-specific evolutionary events. PMID:23530871

  9. Evaluation of human embryonic stem cells and their differentiated fibroblastic progenies as cellular models for in vitro genotoxicity screening.

    PubMed

    Vinoth, Kumar Jayaseelan; Manikandan, Jayapal; Sethu, Swaminathan; Balakrishnan, Lakshmidevi; Heng, Alexis; Lu, Kai; Hande, Manoor Prakash; Cao, Tong

    2014-08-20

    This study evaluated human embryonic stem cells (hESC) and their differentiated fibroblastic progenies as cellular models for genotoxicity screening. The DNA damage response of hESCs and their differentiated fibroblastic progenies were compared to a fibroblastic cell line (HEPM, CRL1486) and primary cultures of peripheral blood lymphocytes (PBL), upon exposure to Mitomycin C, gamma irradiation and H2O2. It was demonstrated that hESC-derived fibroblastic progenies (H1F) displayed significantly higher chromosomal aberrations, micronuclei formation and double strand break (DSB) formation, as compared to undifferentiated hESC upon exposure to genotoxic stress. Nevertheless, H1F cell types displayed comparable sensitivities to genotoxic challenge as HEPM and PBL, both of which are representative of somatic cell types commonly used for genotoxicity screening. Subsequently, transcriptomic and pathways analysis identified differential expression of critical genes involved in cell death and DNA damage response upon exposure to gamma irradiation. The results thus demonstrate that hESC-derived fibroblastic progenies are as sensitive as commonly-used somatic cell types for genotoxicity screening. Moreover, hESCs have additional advantages, such as their genetic normality compared to immortalized cell lines, as well as their amenability to scale-up for producing large, standardized quantities of cells for genotoxicity screening on an industrial scale, something which can never be achieved with primary cell cultures. Copyright © 2014. Published by Elsevier B.V.

  10. Probing the evolution, ecology and physiology of marine protists using transcriptomics.

    PubMed

    Caron, David A; Alexander, Harriet; Allen, Andrew E; Archibald, John M; Armbrust, E Virginia; Bachy, Charles; Bell, Callum J; Bharti, Arvind; Dyhrman, Sonya T; Guida, Stephanie M; Heidelberg, Karla B; Kaye, Jonathan Z; Metzner, Julia; Smith, Sarah R; Worden, Alexandra Z

    2017-01-01

    Protists, which are single-celled eukaryotes, critically influence the ecology and chemistry of marine ecosystems, but genome-based studies of these organisms have lagged behind those of other microorganisms. However, recent transcriptomic studies of cultured species, complemented by meta-omics analyses of natural communities, have increased the amount of genetic information available for poorly represented branches on the tree of eukaryotic life. This information is providing insights into the adaptations and interactions between protists and other microorganisms and macroorganisms, but many of the genes sequenced show no similarity to sequences currently available in public databases. A better understanding of these newly discovered genes will lead to a deeper appreciation of the functional diversity and metabolic processes in the ocean. In this Review, we summarize recent developments in our understanding of the ecology, physiology and evolution of protists, derived from transcriptomic studies of cultured strains and natural communities, and discuss how these novel large-scale genetic datasets will be used in the future.

  11. The de novo transcriptome and its analysis in the worldwide vegetable pest, Delia antiqua (Diptera: Anthomyiidae).

    PubMed

    Zhang, Yu-Juan; Hao, Youjin; Si, Fengling; Ren, Shuang; Hu, Ganyu; Shen, Li; Chen, Bin

    2014-03-10

    The onion maggot Delia antiqua is a major insect pest of cultivated vegetables, especially the onion, and a good model to investigate the molecular mechanisms of diapause. To better understand the biology and diapause mechanism of the insect pest species, D. antiqua, the transcriptome was sequenced using Illumina paired-end sequencing technology. Approximately 54 million reads were obtained, trimmed, and assembled into 29,659 unigenes, with an average length of 607 bp and an N50 of 818 bp. Among these unigenes, 21,605 (72.8%) were annotated in the public databases. All unigenes were then compared against Drosophila melanogaster and Anopheles gambiae. Codon usage bias was analyzed and 332 simple sequence repeats (SSRs) were detected in this organism. These data represent the most comprehensive transcriptomic resource currently available for D. antiqua and will facilitate the study of genetics, genomics, diapause, and further pest control of D. antiqua. Copyright © 2014 Zhang et al.

  12. Data of first de-novo transcriptome assembly of a non-model species, hawksbill sea turtle, Eretmochelys imbricate, nesting of the Colombian Caribean.

    PubMed

    Hernández-Fernández, Javier

    2017-12-01

    The hawksbill sea turtle, Eretmochelys imbricata, is an endangered species of the Caribbean Colombian coast due to anthropic and natural factors that have decreased their population levels. Little is known about the genes that are involved in their immune system, sex determination, aging and others important functions. The data generated represents RNA sequencing and the first de-novo assembly of transcripts expressed in the blood of the hawksbill sea turtle. The raw FASTQ files were deposited in the NCBI SRA database with accession number SRX2653641. A total of 5.7 Gb raw sequence data were obtained, corresponding to 47,555,108 raw reads. Trinity was used to perform a first de-novo assembly, and we were able to identify 47,586 transcripts of the female hawksbill turtle transcriptome with an N50 of 1100 bp. The obtained transcriptome data will be useful for further studies of the physiology, biochemistry and evolution in this species.

  13. De novo transcriptomic analysis and development of EST-SSRs for Sorbus pohuashanensis (Hance) Hedl.

    PubMed Central

    Guan, Xuelian; Fu, Qiang; Zhang, Ze; Hu, Zenghui; Zheng, Jian; Lu, Yizeng; Li, Wei

    2017-01-01

    Sorbus pohuashanensis is a native tree species of northern China that is used for a variety of ecological purposes. The species is often grown as an ornamental landscape tree because of its beautiful form, silver flowers in early summer, attractive pinnate leaves in summer, and red leaves and fruits in autumn. However, development and further utilization of the species are hindered by the lack of comprehensive genetic information, which impedes research into its genetics and molecular biology. Recent advances in de novo transcriptome sequencing (RNA-seq) technology have provided an effective means to obtain genomic information from non-model species. Here, we applied RNA-seq for sequencing S. pohuashanensis leaves and obtained a total of 137,506 clean reads. After assembly, 96,213 unigenes with an average length of 770 bp were obtained. We found that 64.5% of the unigenes could be annotated using bioinformatics tools to analyze gene function and alignment with the NCBI database. Overall, 59,089 unigenes were annotated using the Nr database(non-redundant protein database), 35,225 unigenes were annotated using the GO (Gene Ontology categories) database, and 33,168 unigenes were annotated using COG (Cluster of Orthologous Groups). Analysis of the unigenes using the KEGG (Kyoto Encyclopedia of Genes and Genomes) database indicated that 13,953 unigenes were involved in 322 metabolic pathways. Finally, simple sequence repeat (SSR) site detection identified 6,604 unigenes that included EST-SSRs and a total of 7,473 EST-SSRs in the unigene sequences. Fifteen polymorphic SSRs were screened and found to be of use for future genetic research. These unigene sequences will provide important genetic resources for genetic improvement and investigation of biochemical processes in S. pohuashanensis. PMID:28614366

  14. Transcriptome Profiling Identifies Multiplexin as a Target of SAGA Deubiquitinase Activity in Glia Required for Precise Axon Guidance During Drosophila Visual Development.

    PubMed

    Ma, Jingqun; Brennan, Kaelan J; D'Aloia, Mitch R; Pascuzzi, Pete E; Weake, Vikki M

    2016-08-09

    The Spt-Ada-Gcn5 Acetyltransferase (SAGA) complex is a transcriptional coactivator with histone acetylase and deubiquitinase activities that plays an important role in visual development and function. In Drosophila melanogaster, four SAGA subunits are required for the deubiquitination of monoubiquitinated histone H2B (ubH2B): Nonstop, Sgf11, E(y)2, and Ataxin 7. Mutations that disrupt SAGA deubiquitinase activity cause defects in neuronal connectivity in the developing Drosophila visual system. In addition, mutations in SAGA result in the human progressive visual disorder spinocerebellar ataxia type 7 (SCA7). Glial cells play a crucial role in both the neuronal connectivity defect in nonstop and sgf11 flies, and in the retinal degeneration observed in SCA7 patients. Thus, we sought to identify the gene targets of SAGA deubiquitinase activity in glia in the Drosophila larval central nervous system. To do this, we enriched glia from wild-type, nonstop, and sgf11 larval optic lobes using affinity-purification of KASH-GFP tagged nuclei, and then examined each transcriptome using RNA-seq. Our analysis showed that SAGA deubiquitinase activity is required for proper expression of 16% of actively transcribed genes in glia, especially genes involved in proteasome function, protein folding and axon guidance. We further show that the SAGA deubiquitinase-activated gene Multiplexin (Mp) is required in glia for proper photoreceptor axon targeting. Mutations in the human ortholog of Mp, COL18A1, have been identified in a family with a SCA7-like progressive visual disorder, suggesting that defects in the expression of this gene in SCA7 patients could play a role in the retinal degeneration that is unique to this ataxia. Copyright © 2016 Ma et al.

  15. Transcriptome database derived from the Texas Deutsch outbreak strain population of the cattle tick, Rhipicephalus (Boophilus) microplus

    USDA-ARS?s Scientific Manuscript database

    The Southern cattle tick, Rhipicephalus (Boophilus) microplus, vectors Babesia bovis and B. bigemina, which are the protozoans causing cattle fever, a disease that is responsible for significant production losses to cattle producers in much of Africa, Central and South America, and Australia. We ini...

  16. miRNA Signature and Dicer Requirement during Human Endometrial Stromal Decidualization In Vitro

    PubMed Central

    Estella, Carlos; Herrer, Isabel; Moreno-Moya, Juan Manuel; Quiñonero, Alicia; Martínez, Sebastián; Pellicer, Antonio; Simón, Carlos

    2012-01-01

    Decidualization is a morphological and biochemical transformation of endometrial stromal fibroblast into differentiated decidual cells, which is critical for embryo implantation and pregnancy establishment. The complex regulatory networks have been elucidated at both the transcriptome and the proteome levels, however very little is known about the post-transcriptional regulation of this process. miRNAs regulate multiple physiological pathways and their de-regulation is associated with human disorders including gynaecological conditions such as endometriosis and preeclampsia. In this study we profile the miRNAs expression throughout human endometrial stromal (hESCs) decidualization and analyze the requirement of the miRNA biogenesis enzyme Dicer during this process. A total of 26 miRNAs were upregulated and 17 miRNAs downregulated in decidualized hESCs compared to non-decidualized hESCs. Three miRNAs families, miR-181, miR-183 and miR-200, are down-regulated during the decidualization process. Using miRNAs target prediction algorithms we have identified the potential targets and pathways regulated by these miRNAs. The knockdown of Dicer has a minor effect on hESCs during in vitro decidualization. We have analyzed a battery of decidualization markers such as cell morphology, Prolactin, IGFBP-1, MPIF-1 and TIMP-3 secretion as well as HOXA10, COX2, SP1, C/EBPß and FOXO1 expression in decidualized hESCs with decreased Dicer function. We found decreased levels of HOXA10 and altered intracellular organization of actin filaments in Dicer knockdown decidualized hESCs compared to control. Our results provide the miRNA signature of hESC during the decidualization process in vitro. We also provide the first functional characterization of Dicer during human endometrial decidualization although surprisingly we found that Dicer plays a minor role regulating this process suggesting that alternative biogenesis miRNAs pathways must be involved in human endometrial decidualization. PMID:22911744

  17. Gene evolution and gene expression after whole genome duplication in fish: the PhyloFish database.

    PubMed

    Pasquier, Jeremy; Cabau, Cédric; Nguyen, Thaovi; Jouanno, Elodie; Severac, Dany; Braasch, Ingo; Journot, Laurent; Pontarotti, Pierre; Klopp, Christophe; Postlethwait, John H; Guiguen, Yann; Bobe, Julien

    2016-05-18

    With more than 30,000 species, ray-finned fish represent approximately half of vertebrates. The evolution of ray-finned fish was impacted by several whole genome duplication (WGD) events including a teleost-specific WGD event (TGD) that occurred at the root of the teleost lineage about 350 million years ago (Mya) and more recent WGD events in salmonids, carps, suckers and others. In plants and animals, WGD events are associated with adaptive radiations and evolutionary innovations. WGD-spurred innovation may be especially relevant in the case of teleost fish, which colonized a wide diversity of habitats on earth, including many extreme environments. Fish biodiversity, the use of fish models for human medicine and ecological studies, and the importance of fish in human nutrition, fuel an important need for the characterization of gene expression repertoires and corresponding evolutionary histories of ray-finned fish genes. To this aim, we performed transcriptome analyses and developed the PhyloFish database to provide (i) de novo assembled gene repertoires in 23 different ray-finned fish species including two holosteans (i.e. a group that diverged from teleosts before TGD) and 21 teleosts (including six salmonids), and (ii) gene expression levels in ten different tissues and organs (and embryos for many) in the same species. This resource was generated using a common deep RNA sequencing protocol to obtain the most exhaustive gene repertoire possible in each species that allows between-species comparisons to study the evolution of gene expression in different lineages. The PhyloFish database described here can be accessed and searched using RNAbrowse, a simple and efficient solution to give access to RNA-seq de novo assembled transcripts.

  18. Ensembl core software resources: storage and programmatic access for DNA sequence and genome annotation.

    PubMed

    Ruffier, Magali; Kähäri, Andreas; Komorowska, Monika; Keenan, Stephen; Laird, Matthew; Longden, Ian; Proctor, Glenn; Searle, Steve; Staines, Daniel; Taylor, Kieron; Vullo, Alessandro; Yates, Andrew; Zerbino, Daniel; Flicek, Paul

    2017-01-01

    The Ensembl software resources are a stable infrastructure to store, access and manipulate genome assemblies and their functional annotations. The Ensembl 'Core' database and Application Programming Interface (API) was our first major piece of software infrastructure and remains at the centre of all of our genome resources. Since its initial design more than fifteen years ago, the number of publicly available genomic, transcriptomic and proteomic datasets has grown enormously, accelerated by continuous advances in DNA-sequencing technology. Initially intended to provide annotation for the reference human genome, we have extended our framework to support the genomes of all species as well as richer assembly models. Cross-referenced links to other informatics resources facilitate searching our database with a variety of popular identifiers such as UniProt and RefSeq. Our comprehensive and robust framework storing a large diversity of genome annotations in one location serves as a platform for other groups to generate and maintain their own tailored annotation. We welcome reuse and contributions: our databases and APIs are publicly available, all of our source code is released with a permissive Apache v2.0 licence at http://github.com/Ensembl and we have an active developer mailing list ( http://www.ensembl.org/info/about/contact/index.html ). http://www.ensembl.org. © The Author(s) 2017. Published by Oxford University Press.

  19. Complex and dynamic landscape of RNA polyadenylation revealed by PAS-Seq

    PubMed Central

    Shepard, Peter J.; Choi, Eun-A; Lu, Jente; Flanagan, Lisa A.; Hertel, Klemens J.; Shi, Yongsheng

    2011-01-01

    Alternative polyadenylation (APA) of mRNAs has emerged as an important mechanism for post-transcriptional gene regulation in higher eukaryotes. Although microarrays have recently been used to characterize APA globally, they have a number of serious limitations that prevents comprehensive and highly quantitative analysis. To better characterize APA and its regulation, we have developed a deep sequencing-based method called Poly(A) Site Sequencing (PAS-Seq) for quantitatively profiling RNA polyadenylation at the transcriptome level. PAS-Seq not only accurately and comprehensively identifies poly(A) junctions in mRNAs and noncoding RNAs, but also provides quantitative information on the relative abundance of polyadenylated RNAs. PAS-Seq analyses of human and mouse transcriptomes showed that 40%–50% of all expressed genes produce alternatively polyadenylated mRNAs. Furthermore, our study detected evolutionarily conserved polyadenylation of histone mRNAs and revealed novel features of mitochondrial RNA polyadenylation. Finally, PAS-Seq analyses of mouse embryonic stem (ES) cells, neural stem/progenitor (NSP) cells, and neurons not only identified more poly(A) sites than what was found in the entire mouse EST database, but also detected significant changes in the global APA profile that lead to lengthening of 3′ untranslated regions (UTR) in many mRNAs during stem cell differentiation. Together, our PAS-Seq analyses revealed a complex landscape of RNA polyadenylation in mammalian cells and the dynamic regulation of APA during stem cell differentiation. PMID:21343387

  20. Transcriptome analysis in tardigrade species reveals specific molecular pathways for stress adaptations.

    PubMed

    Förster, Frank; Beisser, Daniela; Grohme, Markus A; Liang, Chunguang; Mali, Brahim; Siegl, Alexander Matthias; Engelmann, Julia C; Shkumatov, Alexander V; Schokraie, Elham; Müller, Tobias; Schnölzer, Martina; Schill, Ralph O; Frohme, Marcus; Dandekar, Thomas

    2012-01-01

    Tardigrades have unique stress-adaptations that allow them to survive extremes of cold, heat, radiation and vacuum. To study this, encoded protein clusters and pathways from an ongoing transcriptome study on the tardigrade Milnesium tardigradum were analyzed using bioinformatics tools and compared to expressed sequence tags (ESTs) from Hypsibius dujardini, revealing major pathways involved in resistance against extreme environmental conditions. ESTs are available on the Tardigrade Workbench along with software and databank updates. Our analysis reveals that RNA stability motifs for M. tardigradum are different from typical motifs known from higher animals. M. tardigradum and H. dujardini protein clusters and conserved domains imply metabolic storage pathways for glycogen, glycolipids and specific secondary metabolism as well as stress response pathways (including heat shock proteins, bmh2, and specific repair pathways). Redox-, DNA-, stress- and protein protection pathways complement specific repair capabilities to achieve the strong robustness of M. tardigradum. These pathways are partly conserved in other animals and their manipulation could boost stress adaptation even in human cells. However, the unique combination of resistance and repair pathways make tardigrades and M. tardigradum in particular so highly stress resistant.

  1. Understanding the molecular mechanisms underlying the effects of light intensity on flavonoid production by RNA-seq analysis in Epimedium pseudowushanense B.L.Guo.

    PubMed

    Pan, Junqian; Chen, Haimei; Guo, Baolin; Liu, Chang

    2017-01-01

    Epimedium pseudowushanense B.L.Guo, a light-demanding shade herb, is used in traditional medicine to increase libido and strengthen muscles and bones. The recognition of the health benefits of Epimedium has increased its market demand. However, its resource recycling rate is low and environmentally dependent. Furthermore, its natural sources are endangered, further increasing prices. Commercial culture can address resource constraints of it.Understanding the effects of environmental factors on the production of its active components would improve the technology for cultivation and germplasm conservation. Here, we studied the effects of light intensities on the flavonoid production and revealed the molecular mechanism using RNA-seq analysis. Plants were exposed to five levels of light intensity through the periods of germination to flowering, the flavonoid contents were measured using HPLC. Quantification of epimedin A, epimedin B, epimedin C, and icariin showed that the flavonoid contents varied with different light intensity levels. And the largest amount of epimedin C was produced at light intensity level 4 (I4). Next, the leaves under the treatment of three light intensity levels ("L", "M" and "H") with the largest differences in the flavonoid content, were subjected to RNA-seq analysis. Transcriptome reconstruction identified 43,657 unigenes. All unigene sequences were annotated by searching against the Nr, Gene Ontology, and Kyoto Encyclopedia of Genes and Genomes (KEGG) databases. In total, 4008, 5260, and 3591 significant differentially expressed genes (DEGs) were identified between the groups L vs. M, M vs. H and L vs. H. Particularly, twenty-one full-length genes involved in flavonoid biosynthesis were identified. The expression levels of the flavonol synthase, chalcone synthase genes were strongly associated with light-induced flavonoid abundance with the highest expression levels found in the H group. Furthermore, 65 transcription factors, including 31 FAR1, 17 MYB-related, 12 bHLH, and 5 WRKY, were differentially expressed after light induction. Finally, a model was proposed to explain the light-induced flavonoid production. This study provided valuable information to improve cultivation practices and produced the first comprehensive resource for E. pseudowushanense transcriptomes.

  2. Targeted exploration and analysis of large cross-platform human transcriptomic compendia

    PubMed Central

    Zhu, Qian; Wong, Aaron K; Krishnan, Arjun; Aure, Miriam R; Tadych, Alicja; Zhang, Ran; Corney, David C; Greene, Casey S; Bongo, Lars A; Kristensen, Vessela N; Charikar, Moses; Li, Kai; Troyanskaya, Olga G.

    2016-01-01

    We present SEEK (http://seek.princeton.edu), a query-based search engine across very large transcriptomic data collections, including thousands of human data sets from almost 50 microarray and next-generation sequencing platforms. SEEK uses a novel query-level cross-validation-based algorithm to automatically prioritize data sets relevant to the query and a robust search approach to identify query-coregulated genes, pathways, and processes. SEEK provides cross-platform handling, multi-gene query search, iterative metadata-based search refinement, and extensive visualization-based analysis options. PMID:25581801

  3. Microbiome and ecotypic adaption of Holcus lanatus (L.) to extremes of its soil pH range, investigated through transcriptome sequencing.

    PubMed

    Young, Ellen; Carey, Manus; Meharg, Andrew A; Meharg, Caroline

    2018-03-20

    Plants can adapt to edaphic stress, such as nutrient deficiency, toxicity and biotic challenges, by controlled transcriptomic responses, including microbiome interactions. Traditionally studied in model plant species with controlled microbiota inoculation treatments, molecular plant-microbiome interactions can be functionally investigated via RNA-Seq. Complex, natural plant-microbiome studies are limited, typically focusing on microbial rRNA and omitting functional microbiome investigations, presenting a fundamental knowledge gap. Here, root and shoot meta-transcriptome analyses, in tandem with shoot elemental content and root staining, were employed to investigate transcriptome responses in the wild grass Holcus lanatus and its associated natural multi-species eukaryotic microbiome. A full factorial reciprocal soil transplant experiment was employed, using plant ecotypes from two widely contrasting natural habitats, acid bog and limestone quarry soil, to investigate naturally occurring, and ecologically meaningful, edaphically driven molecular plant-microbiome interactions. Arbuscular mycorrhizal (AM) and non-AM fungal colonization was detected in roots in both soils. Staining showed greater levels of non-AM fungi, and transcriptomics indicated a predominance of Ascomycota-annotated genes. Roots in acid bog soil were dominated by Phialocephala-annotated transcripts, a putative growth-promoting endophyte, potentially involved in N nutrition and ion homeostasis. Limestone roots in acid bog soil had greater expression of other Ascomycete genera and Oomycetes and lower expression of Phialocephala-annotated transcripts compared to acid ecotype roots, which corresponded with reduced induction of pathogen defense processes, particularly lignin biosynthesis in limestone ecotypes. Ascomycota dominated in shoots and limestone soil roots, but Phialocephala-annotated transcripts were insignificant, and no single Ascomycete genus dominated. Fusarium-annotated transcripts were the most common genus in shoots, with Colletotrichum and Rhizophagus (AM fungi) most numerous in limestone soil roots. The latter coincided with upregulation of plant genes involved in AM symbiosis initiation and AM-based P acquisition in an environment where P availability is low. Meta-transcriptome analyses provided novel insights into H. lanatus transcriptome responses, associated eukaryotic microbiota functions and taxonomic community composition. Significant edaphic and plant ecotype effects were identified, demonstrating that meta-transcriptome-based functional analysis is a powerful tool for the study of natural plant-microbiome interactions.

  4. Comparative study of the hemagglutinin and neuraminidase genes of influenza A virus H3N2, H9N2, and H5N1 subtypes using bioinformatics techniques.

    PubMed

    Ahn, Insung; Son, Hyeon S

    2007-07-01

    To investigate the genomic patterns of influenza A virus subtypes, such as H3N2, H9N2, and H5N1, we collected 1842 sequences of the hemagglutinin and neuraminidase genes from the NCBI database and parsed them into 7 categories: accession number, host species, sampling year, country, subtype, gene name, and sequence. The sequences that were isolated from the human, avian, and swine populations were extracted and stored in a MySQL database for intensive analysis. The GC content and relative synonymous codon usage (RSCU) values were calculated using JAVA codes. As a result, correspondence analysis of the RSCU values yielded the unique codon usage pattern (CUP) of each subtype and revealed no extreme differences among the human, avian, and swine isolates. H5N1 subtype viruses exhibited little variation in CUPs compared with other subtypes, suggesting that the H5N1 CUP has not yet undergone significant changes within each host species. Moreover, some observations may be relevant to CUP variation that has occurred over time among the H3N2 subtype viruses isolated from humans. All the sequences were divided into 3 groups over time, and each group seemed to have preferred synonymous codon patterns for each amino acid, especially for arginine, glycine, leucine, and valine. The bioinformatics technique we introduce in this study may be useful in predicting the evolutionary patterns of pandemic viruses.

  5. Discovery of genes related to insecticide resistance in Bactrocera dorsalis by functional genomic analysis of a de novo assembled transcriptome.

    PubMed

    Hsu, Ju-Chun; Chien, Ting-Ying; Hu, Chia-Cheng; Chen, Mei-Ju May; Wu, Wen-Jer; Feng, Hai-Tung; Haymer, David S; Chen, Chien-Yu

    2012-01-01

    Insecticide resistance has recently become a critical concern for control of many insect pest species. Genome sequencing and global quantization of gene expression through analysis of the transcriptome can provide useful information relevant to this challenging problem. The oriental fruit fly, Bactrocera dorsalis, is one of the world's most destructive agricultural pests, and recently it has been used as a target for studies of genetic mechanisms related to insecticide resistance. However, prior to this study, the molecular data available for this species was largely limited to genes identified through homology. To provide a broader pool of gene sequences of potential interest with regard to insecticide resistance, this study uses whole transcriptome analysis developed through de novo assembly of short reads generated by next-generation sequencing (NGS). The transcriptome of B. dorsalis was initially constructed using Illumina's Solexa sequencing technology. Qualified reads were assembled into contigs and potential splicing variants (isotigs). A total of 29,067 isotigs have putative homologues in the non-redundant (nr) protein database from NCBI, and 11,073 of these correspond to distinct D. melanogaster proteins in the RefSeq database. Approximately 5,546 isotigs contain coding sequences that are at least 80% complete and appear to represent B. dorsalis genes. We observed a strong correlation between the completeness of the assembled sequences and the expression intensity of the transcripts. The assembled sequences were also used to identify large numbers of genes potentially belonging to families related to insecticide resistance. A total of 90 P450-, 42 GST-and 37 COE-related genes, representing three major enzyme families involved in insecticide metabolism and resistance, were identified. In addition, 36 isotigs were discovered to contain target site sequences related to four classes of resistance genes. Identified sequence motifs were also analyzed to characterize putative polypeptide translational products and associate them with specific genes and protein functions.

  6. Tissue-Specific Transcriptome Profiling of Plutella Xylostella Third Instar Larval Midgut

    PubMed Central

    Xie, Wen; Lei, Yanyuan; Fu, Wei; Yang, Zhongxia; Zhu, Xun; Guo, Zhaojiang; Wu, Qingjun; Wang, Shaoli; Xu, Baoyun; Zhou, Xuguo; Zhang, Youjun

    2012-01-01

    The larval midgut of diamondback moth, Plutella xylostella, is a dynamic tissue that interfaces with a diverse array of physiological and toxicological processes, including nutrient digestion and allocation, xenobiotic detoxification, innate and adaptive immune response, and pathogen defense. Despite its enormous agricultural importance, the genomic resources for P. xylostella are surprisingly scarce. In this study, a Bt resistant P. xylostella strain was subjected to the in-depth transcriptome analysis to identify genes and gene networks putatively involved in various physiological and toxicological processes in the P. xylostella larval midgut. Using Illumina deep sequencing, we obtained roughly 40 million reads containing approximately 3.6 gigabases of sequence data. De novo assembly generated 63,312 ESTs with an average read length of 416bp, and approximately half of the P. xylostella sequences (45.4%, 28,768) showed similarity to the non-redundant database in GenBank with a cut-off E-value below 10-5. Among them, 11,092 unigenes were assigned to one or multiple GO terms and 16,732 unigenes were assigned to 226 specific pathways. In-depth analysis indentified genes putatively involved in insecticide resistance, nutrient digestion, and innate immune defense. Besides conventional detoxification enzymes and insecticide targets, novel genes, including 28 chymotrypsins and 53 ABC transporters, have been uncovered in the P. xylostella larval midgut transcriptome; which are potentially linked to the Bt toxicity and resistance. Furthermore, an unexpectedly high number of ESTs, including 46 serpins and 7 lysozymes, were predicted to be involved in the immune defense. As the first tissue-specific transcriptome analysis of P. xylostella, this study sheds light on the molecular understanding of insecticide resistance, especially Bt resistance in an agriculturally important insect pest, and lays the foundation for future functional genomics research. In addition, current sequencing effort greatly enriched the existing P. xylostella EST database, and makes RNAseq a viable option in the future genomic analysis. PMID:23091412

  7. Tissue-specific transcriptome profiling of Plutella xylostella third instar larval midgut.

    PubMed

    Xie, Wen; Lei, Yanyuan; Fu, Wei; Yang, Zhongxia; Zhu, Xun; Guo, Zhaojiang; Wu, Qingjun; Wang, Shaoli; Xu, Baoyun; Zhou, Xuguo; Zhang, Youjun

    2012-01-01

    The larval midgut of diamondback moth, Plutella xylostella, is a dynamic tissue that interfaces with a diverse array of physiological and toxicological processes, including nutrient digestion and allocation, xenobiotic detoxification, innate and adaptive immune response, and pathogen defense. Despite its enormous agricultural importance, the genomic resources for P. xylostella are surprisingly scarce. In this study, a Bt resistant P. xylostella strain was subjected to the in-depth transcriptome analysis to identify genes and gene networks putatively involved in various physiological and toxicological processes in the P. xylostella larval midgut. Using Illumina deep sequencing, we obtained roughly 40 million reads containing approximately 3.6 gigabases of sequence data. De novo assembly generated 63,312 ESTs with an average read length of 416 bp, and approximately half of the P. xylostella sequences (45.4%, 28,768) showed similarity to the non-redundant database in GenBank with a cut-off E-value below 10(-5). Among them, 11,092 unigenes were assigned to one or multiple GO terms and 16,732 unigenes were assigned to 226 specific pathways. In-depth analysis identified genes putatively involved in insecticide resistance, nutrient digestion, and innate immune defense. Besides conventional detoxification enzymes and insecticide targets, novel genes, including 28 chymotrypsins and 53 ABC transporters, have been uncovered in the P. xylostella larval midgut transcriptome; which are potentially linked to the Bt toxicity and resistance. Furthermore, an unexpectedly high number of ESTs, including 46 serpins and 7 lysozymes, were predicted to be involved in the immune defense.As the first tissue-specific transcriptome analysis of P. xylostella, this study sheds light on the molecular understanding of insecticide resistance, especially Bt resistance in an agriculturally important insect pest, and lays the foundation for future functional genomics research. In addition, current sequencing effort greatly enriched the existing P. xylostella EST database, and makes RNAseq a viable option in the future genomic analysis.

  8. Transcriptome changes and cAMP oscillations in an archaeal cell cycle.

    PubMed

    Baumann, Anke; Lange, Christian; Soppa, Jörg

    2007-06-11

    The cell cycle of all organisms includes mass increase by a factor of two, replication of the genetic material, segregation of the genome to different parts of the cell, and cell division into two daughter cells. It is tightly regulated and typically includes cell cycle-specific oscillations of the levels of transcripts, proteins, protein modifications, and signaling molecules. Until now cell cycle-specific transcriptome changes have been described for four eukaryotic species ranging from yeast to human, but only for two prokaryotic species. Similarly, oscillations of small signaling molecules have been identified in very few eukaryotic species, but not in any prokaryote. A synchronization procedure for the archaeon Halobacterium salinarum was optimized, so that nearly 100% of all cells divide in a time interval that is 1/4th of the generation time of exponentially growing cells. The method was used to characterize cell cycle-dependent transcriptome changes using a genome-wide DNA microarray. The transcript levels of 87 genes were found to be cell cycle-regulated, corresponding to 3% of all genes. They could be clustered into seven groups with different transcript level profiles. Cluster-specific sequence motifs were detected around the start of the genes that are predicted to be involved in cell cycle-specific transcriptional regulation. Notably, many cell cycle genes that have oscillating transcript levels in eukaryotes are not regulated on the transcriptional level in H. salinarum. Synchronized cultures were also used to identify putative small signaling molecules. H. salinarum was found to contain a basal cAMP concentration of 200 microM, considerably higher than that of yeast. The cAMP concentration is shortly induced directly prior to and after cell division, and thus cAMP probably is an important signal for cell cycle progression. The analysis of cell cycle-specific transcriptome changes of H. salinarum allowed to identify a strategy of transcript level regulation that is different from all previously characterized species. The transcript levels of only 3% of all genes are regulated, a fraction that is considerably lower than has been reported for four eukaryotic species (6%-28%) and for the bacterium C. crescentus (19%). It was shown that cAMP is present in significant concentrations in an archaeon, and the phylogenetic profile of the adenylate cyclase indicates that this signaling molecule is widely distributed in archaea. The occurrence of cell cycle-dependent oscillations of the cAMP concentration in an archaeon and in several eukaryotic species indicates that cAMP level changes might be a phylogenetically old signal for cell cycle progression.

  9. RNA-Seq analysis and transcriptome assembly for blackberry (Rubus sp. Var. Lochness) fruit.

    PubMed

    Garcia-Seco, Daniel; Zhang, Yang; Gutierrez-Mañero, Francisco J; Martin, Cathie; Ramos-Solano, Beatriz

    2015-01-22

    There is an increasing interest in berries, especially blackberries in the diet, because of recent reports of their health benefits due to their high content of flavonoids. A broad range of genomic tools are available for other Rosaceae species but these tools are still lacking in the Rubus genus, thus limiting gene discovery and the breeding of improved varieties. De novo RNA-seq of ripe blackberries grown under field conditions was performed using Illumina Hiseq 2000. Almost 9 billion nucleotide bases were sequenced in total. Following assembly, 42,062 consensus sequences were detected. For functional annotation, 33,040 (NR), 32,762 (NT), 21,932 (Swiss-Prot), 20,134 (KEGG), 13,676 (COG), 24,168 (GO) consensus sequences were annotated using different databases; in total 34,552 annotated sequences were identified. For protein prediction analysis, the number of coding DNA sequences (CDS) that mapped to the protein database was 32,540. Non redundant (NR), annotation showed that 25,418 genes (73.5%) has the highest similarity with Fragaria vesca subspecies vesca. Reanalysis was undertaken by aligning the reads with this reference genome for a deeper analysis of the transcriptome. We demonstrated that de novo assembly, using Trinity and later annotation with Blast using different databases, were complementary to alignment to the reference sequence using SOAPaligner/SOAP2. The Fragaria reference genome belongs to a species in the same family as blackberry (Rosaceae) but to a different genus. Since blackberries are tetraploids, the possibility of artefactual gene chimeras resulting from mis-assembly was tested with one of the genes sequenced by RNAseq, Chalcone Synthase (CHS). cDNAs encoding this protein were cloned and sequenced. Primers designed to the assembled sequences accurately distinguished different contigs, at least for chalcone synthase genes. We prepared and analysed transcriptome data from ripe blackberries, for which prior genomic information was limited. This new sequence information will improve the knowledge of this important and healthy fruit, providing an invaluable new tool for biological research.

  10. Annotation of the Transcriptome from Taenia pisiformis and Its Comparative Analysis with Three Taeniidae Species

    PubMed Central

    Yang, Deying; Fu, Yan; Wu, Xuhang; Xie, Yue; Nie, Huaming; Chen, Lin; Nong, Xiang; Gu, Xiaobin; Wang, Shuxian; Peng, Xuerong; Yan, Ning; Zhang, Runhui; Zheng, Wanpeng; Yang, Guangyou

    2012-01-01

    Background Taenia pisiformis is one of the most common intestinal tapeworms and can cause infections in canines. Adult T. pisiformis (canines as definitive hosts) and Cysticercus pisiformis (rabbits as intermediate hosts) cause significant health problems to the host and considerable socio-economic losses as a consequence. No complete genomic data regarding T. pisiformis are currently available in public databases. RNA-seq provides an effective approach to analyze the eukaryotic transcriptome to generate large functional gene datasets that can be used for further studies. Methodology/Principal Findings In this study, 2.67 million sequencing clean reads and 72,957 unigenes were generated using the RNA-seq technique. Based on a sequence similarity search with known proteins, a total of 26,012 unigenes (no redundancy) were identified after quality control procedures via the alignment of four databases. Overall, 15,920 unigenes were mapped to 203 Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways. Through analyzing the glycolysis/gluconeogenesis and axonal guidance pathways, we achieved an in-depth understanding of the biochemistry of T. pisiformis. Here, we selected four unigenes at random and obtained their full-length cDNA clones using RACE PCR. Functional distribution characteristics were gained through comparing four cestode species (72,957 unigenes of T. pisiformis, 30,700 ESTs of T. solium, 1,058 ESTs of Eg+Em [conserved ESTs between Echinococcus granulosus and Echinococcus multilocularis]), with the cluster of orthologous groups (COG) and gene ontology (GO) functional classification systems. Furthermore, the conserved common genes in these four cestode species were obtained and aligned by the KEGG database. Conclusion This study provides an extensive transcriptome dataset obtained from the deep sequencing of T. pisiformis in a non-model whole genome. The identification of conserved genes may provide novel approaches for potential drug targets and vaccinations against cestode infections. Research can now accelerate into the functional genomics, immunity and gene expression profiles of cestode species. PMID:22514598

  11. Comparative transcriptome analysis between planarian Dugesia japonica and other platyhelminth species.

    PubMed

    Nishimura, Osamu; Hirao, Yukako; Tarui, Hiroshi; Agata, Kiyokazu

    2012-06-29

    Planarians are considered to be among the extant animals close to one of the earliest groups of organisms that acquired a central nervous system (CNS) during evolution. Planarians have a bilobed brain with nine lateral branches from which a variety of external signals are projected into different portions of the main lobes. Various interneurons process different signals to regulate behavior and learning/memory. Furthermore, planarians have robust regenerative ability and are attracting attention as a new model organism for the study of regeneration. Here we conducted large-scale EST analysis of the head region of the planarian Dugesia japonica to construct a database of the head-region transcriptome, and then performed comparative analyses among related species. A total of 54,752 high-quality EST reads were obtained from a head library of the planarian Dugesia japonica, and 13,167 unigene sequences were produced by de novo assembly. A new method devised here revealed that proteins related to metabolism and defense mechanisms have high flexibility of amino-acid substitutions within the planarian family. Eight-two CNS-development genes were found in the planarian (cf. C. elegans 3; chicken 129). Comparative analysis revealed that 91% of the planarian CNS-development genes could be mapped onto the schistosome genome, but one-third of these shared genes were not expressed in the schistosome. We constructed a database that is a useful resource for comparative planarian transcriptome studies. Analysis comparing homologous genes between two planarian species showed that the potential of genes is important for accumulation of amino-acid substitutions. The presence of many CNS-development genes in our database supports the notion that the planarian has a fundamental brain with regard to evolution and development at not only the morphological/functional, but also the genomic, level. In addition, our results indicate that the planarian CNS-development genes already existed before the divergence of planarians and schistosomes from their common ancestor.

  12. Chicken interferome: avian interferon-stimulated genes identified by microarray and RNA-seq of primary chick embryo fibroblasts treated with a chicken type I interferon (IFN-α).

    PubMed

    Giotis, Efstathios S; Robey, Rebecca C; Skinner, Natalie G; Tomlinson, Christopher D; Goodbourn, Stephen; Skinner, Michael A

    2016-08-05

    Viruses that infect birds pose major threats-to the global supply of chicken, the major, universally-acceptable meat, and as zoonotic agents (e.g. avian influenza viruses H5N1 and H7N9). Controlling these viruses in birds as well as understanding their emergence into, and transmission amongst, humans will require considerable ingenuity and understanding of how different species defend themselves. The type I interferon-coordinated response constitutes the major antiviral innate defence. Although interferon was discovered in chicken cells, details of the response, particularly the identity of hundreds of stimulated genes, are far better described in mammals. Viruses induce interferon-stimulated genes but they also regulate the expression of many hundreds of cellular metabolic and structural genes to facilitate their replication. This study focusses on the potentially anti-viral genes by identifying those induced just by interferon in primary chick embryo fibroblasts. Three transcriptomic technologies were exploited: RNA-seq, a classical 3'-biased chicken microarray and a high density, "sense target", whole transcriptome chicken microarray, with each recognising 120-150 regulated genes (curated for duplication and incorrect assignment of some microarray probesets). Overall, the results are considered robust because 128 of the compiled, curated list of 193 regulated genes were detected by two, or more, of the technologies.

  13. A global view of the nonprotein-coding transcriptome in Plasmodium falciparum

    PubMed Central

    Raabe, Carsten A.; Sanchez, Cecilia P.; Randau, Gerrit; Robeck, Thomas; Skryabin, Boris V.; Chinni, Suresh V.; Kube, Michael; Reinhardt, Richard; Ng, Guey Hooi; Manickam, Ravichandran; Kuryshev, Vladimir Y.; Lanzer, Michael; Brosius, Juergen; Tang, Thean Hock; Rozhdestvensky, Timofey S.

    2010-01-01

    Nonprotein-coding RNAs (npcRNAs) represent an important class of regulatory molecules that act in many cellular pathways. Here, we describe the experimental identification and validation of the small npcRNA transcriptome of the human malaria parasite Plasmodium falciparum. We identified 630 novel npcRNA candidates. Based on sequence and structural motifs, 43 of them belong to the C/D and H/ACA-box subclasses of small nucleolar RNAs (snoRNAs) and small Cajal body-specific RNAs (scaRNAs). We further observed the exonization of a functional H/ACA snoRNA gene, which might contribute to the regulation of ribosomal protein L7a gene expression. Some of the small npcRNA candidates are from telomeric and subtelomeric repetitive regions, suggesting their potential involvement in maintaining telomeric integrity and subtelomeric gene silencing. We also detected 328 cis-encoded antisense npcRNAs (asRNAs) complementary to P. falciparum protein-coding genes of a wide range of biochemical pathways, including determinants of virulence and pathology. All cis-encoded asRNA genes tested exhibit lifecycle-specific expression profiles. For all but one of the respective sense–antisense pairs, we deduced concordant patterns of expression. Our findings have important implications for a better understanding of gene regulatory mechanisms in P. falciparum, revealing an extended and sophisticated npcRNA network that may control the expression of housekeeping genes and virulence factors. PMID:19864253

  14. A global view of the nonprotein-coding transcriptome in Plasmodium falciparum.

    PubMed

    Raabe, Carsten A; Sanchez, Cecilia P; Randau, Gerrit; Robeck, Thomas; Skryabin, Boris V; Chinni, Suresh V; Kube, Michael; Reinhardt, Richard; Ng, Guey Hooi; Manickam, Ravichandran; Kuryshev, Vladimir Y; Lanzer, Michael; Brosius, Juergen; Tang, Thean Hock; Rozhdestvensky, Timofey S

    2010-01-01

    Nonprotein-coding RNAs (npcRNAs) represent an important class of regulatory molecules that act in many cellular pathways. Here, we describe the experimental identification and validation of the small npcRNA transcriptome of the human malaria parasite Plasmodium falciparum. We identified 630 novel npcRNA candidates. Based on sequence and structural motifs, 43 of them belong to the C/D and H/ACA-box subclasses of small nucleolar RNAs (snoRNAs) and small Cajal body-specific RNAs (scaRNAs). We further observed the exonization of a functional H/ACA snoRNA gene, which might contribute to the regulation of ribosomal protein L7a gene expression. Some of the small npcRNA candidates are from telomeric and subtelomeric repetitive regions, suggesting their potential involvement in maintaining telomeric integrity and subtelomeric gene silencing. We also detected 328 cis-encoded antisense npcRNAs (asRNAs) complementary to P. falciparum protein-coding genes of a wide range of biochemical pathways, including determinants of virulence and pathology. All cis-encoded asRNA genes tested exhibit lifecycle-specific expression profiles. For all but one of the respective sense-antisense pairs, we deduced concordant patterns of expression. Our findings have important implications for a better understanding of gene regulatory mechanisms in P. falciparum, revealing an extended and sophisticated npcRNA network that may control the expression of housekeeping genes and virulence factors.

  15. Global Transcriptome Analysis of Staphylococcus aureus Response to Hydrogen Peroxide†

    PubMed Central

    Chang, Wook; Small, David A.; Toghrol, Freshteh; Bentley, William E.

    2006-01-01

    Staphylococcus aureus responds with protective strategies against phagocyte-derived reactive oxidants to infect humans. Herein, we report the transcriptome analysis of the cellular response of S. aureus to hydrogen peroxide-induced oxidative stress. The data indicate that the oxidative response includes the induction of genes involved in virulence, DNA repair, and notably, anaerobic metabolism. PMID:16452450

  16. 20180312 - Application of a Multiplexed High Content Imaging (HCI) Based Cell Viability and Apoptosis Chemical Screening Assay with Results in MCF-7 Cells (SOT)

    EPA Science Inventory

    The NCCT high throughput transcriptomics (HTTr) screening program uses whole transcriptome profiling assay in human-derived cells to collect concentration-response data for large numbers (100s-1000s) of environmental chemicals. To contextualize HTTr data, chemical effects on cell...

  17. Leukotriene signaling in the extinct human subspecies Homo denisovan and Homo neanderthalensis. Structural and functional comparison with Homo sapiens.

    PubMed

    Adel, Susan; Kakularam, Kumar Reddy; Horn, Thomas; Reddanna, Pallu; Kuhn, Hartmut; Heydeck, Dagmar

    2015-01-01

    Mammalian lipoxygenases (LOXs) have been implicated in cell differentiation and in the biosynthesis of pro- and anti-inflammatory lipid mediators. The initial draft sequence of the Homo neanderthalensis genome (coverage of 1.3-fold) suggested defective leukotriene signaling in this archaic human subspecies since expression of essential proteins appeared to be corrupted. Meanwhile high quality genomic sequence data became available for two extinct human subspecies (H. neanderthalensis, Homo denisovan) and completion of the human 1000 genome project provided a comprehensive database characterizing the genetic variability of the human genome. For this study we extracted the nucleotide sequences of selected eicosanoid relevant genes (ALOX5, ALOX15, ALOX12, ALOX15B, ALOX12B, ALOXE3, COX1, COX2, LTA4H, LTC4S, ALOX5AP, CYSLTR1, CYSLTR2, BLTR1, BLTR2) from the corresponding databases. Comparison of the deduced amino acid sequences in connection with site-directed mutagenesis studies and structural modeling suggested that the major enzymes and receptors of leukotriene signaling as well as the two cyclooxygenase isoforms were fully functional in these two extinct human subspecies. Copyright © 2014 Elsevier Inc. All rights reserved.

  18. Comparative Genomics and Transcriptomics Analyses Reveal Divergent Lifestyle Features of Nematode Endoparasitic Fungus Hirsutella minnesotensis

    PubMed Central

    Lai, Yiling; Liu, Keke; Zhang, Xinyu; Zhang, Xiaoling; Li, Kuan; Wang, Niuniu; Shu, Chi; Wu, Yunpeng; Wang, Chengshu; Bushley, Kathryn E.; Xiang, Meichun; Liu, Xingzhong

    2014-01-01

    Hirsutella minnesotensis [Ophiocordycipitaceae (Hypocreales, Ascomycota)] is a dominant endoparasitic fungus by using conidia that adhere to and penetrate the secondary stage juveniles of soybean cyst nematode. Its genome was de novo sequenced and compared with five entomopathogenic fungi in the Hypocreales and three nematode-trapping fungi in the Orbiliales (Ascomycota). The genome of H. minnesotensis is 51.4 Mb and encodes 12,702 genes enriched with transposable elements up to 32%. Phylogenomic analysis revealed that H. minnesotensis was diverged from entomopathogenic fungi in Hypocreales. Genome of H. minnesotensis is similar to those of entomopathogenic fungi to have fewer genes encoding lectins for adhesion and glycoside hydrolases for cellulose degradation, but is different from those of nematode-trapping fungi to possess more genes for protein degradation, signal transduction, and secondary metabolism. Those results indicate that H. minnesotensis has evolved different mechanism for nematode endoparasitism compared with nematode-trapping fungi. Transcriptomics analyses for the time-scale parasitism revealed the upregulations of lectins, secreted proteases and the genes for biosynthesis of secondary metabolites that could be putatively involved in host surface adhesion, cuticle degradation, and host manipulation. Genome and transcriptome analyses provided comprehensive understanding of the evolution and lifestyle of nematode endoparasitism. PMID:25359922

  19. Comparative transcriptome analysis of Haematococcus pluvialis on astaxanthin biosynthesis in response to irradiation with red or blue LED wavelength.

    PubMed

    Lee, Changsu; Ahn, Joon-Woo; Kim, Jin-Baek; Kim, Jee Young; Choi, Yoon-E

    2018-06-18

    The unicellular green microalga Haematococcus pluvialis has the highest content of the natural antioxidant, astaxanthin. Previously, it was determined that astaxanthin accumulation in H. pluvialis could be induced by blue-wavelength irradiation; however, the molecular mechanism remains unknown. The present study aimed to compare the transcriptome of H. pluvialis, with respect to astaxanthin biosynthesis, under the monochromatic red (660 nm) or blue (450 nm) light-emitting diode (LED) irradiation. Among a total of 165,372 transcripts, we identified 67,703 unigenes, of which 2245 and 171 were identified as differentially expressed genes (DEGs) in response to blue and red irradiation, respectively. Interestingly, expressional changes of blue light receptor cryptochromes were detected in response to blue and/or red LED irradiation in H. pluvialis, which may directly and indirectly regulate astaxanthin biosynthesis. In accordance with this observation, expression of the BKT and CHY genes, which are part of the downstream section of the astaxanthin biosynthetic pathway, was significantly upregulated by blue LED irradiation compared with their expression under control white irradiation. Contrastingly, they were downregulated by red LED irradiation. Our transcriptome study provided molecular insights that highlighted the different of responses of H. pluvialis to red and blue irradiation, especially for astaxanthin biosynthesis.

  20. Deep RNA sequencing reveals dynamic regulation of myocardial noncoding RNAs in failing human heart and remodeling with mechanical circulatory support.

    PubMed

    Yang, Kai-Chien; Yamada, Kathryn A; Patel, Akshar Y; Topkara, Veli K; George, Isaac; Cheema, Faisal H; Ewald, Gregory A; Mann, Douglas L; Nerbonne, Jeanne M

    2014-03-04

    Microarrays have been used extensively to profile transcriptome remodeling in failing human heart, although the genomic coverage provided is limited and fails to provide a detailed picture of the myocardial transcriptome landscape. Here, we describe sequencing-based transcriptome profiling, providing comprehensive analysis of myocardial mRNA, microRNA (miRNA), and long noncoding RNA (lncRNA) expression in failing human heart before and after mechanical support with a left ventricular (LV) assist device (LVAD). Deep sequencing of RNA isolated from paired nonischemic (NICM; n=8) and ischemic (ICM; n=8) human failing LV samples collected before and after LVAD and from nonfailing human LV (n=8) was conducted. These analyses revealed high abundance of mRNA (37%) and lncRNA (71%) of mitochondrial origin. miRNASeq revealed 160 and 147 differentially expressed miRNAs in ICM and NICM, respectively, compared with nonfailing LV. Among these, only 2 (ICM) and 5 (NICM) miRNAs are normalized with LVAD. RNASeq detected 18 480, including 113 novel, lncRNAs in human LV. Among the 679 (ICM) and 570 (NICM) lncRNAs differentially expressed with heart failure, ≈10% are improved or normalized with LVAD. In addition, the expression signature of lncRNAs, but not miRNAs or mRNAs, distinguishes ICM from NICM. Further analysis suggests that cis-gene regulation represents a major mechanism of action of human cardiac lncRNAs. The myocardial transcriptome is dynamically regulated in advanced heart failure and after LVAD support. The expression profiles of lncRNAs, but not mRNAs or miRNAs, can discriminate failing hearts of different pathologies and are markedly altered in response to LVAD support. These results suggest an important role for lncRNAs in the pathogenesis of heart failure and in reverse remodeling observed with mechanical support.

Top