Su, Zhipeng; Zhu, Jiawen; Xu, Zhuofei; Xiao, Ran; Zhou, Rui; Li, Lu; Chen, Huanchun
2016-01-01
Actinobacillus pleuropneumoniae is the pathogen of porcine contagious pleuropneumoniae, a highly contagious respiratory disease of swine. Although the genome of A. pleuropneumoniae was sequenced several years ago, limited information is available on the genome-wide transcriptional analysis to accurately annotate the gene structures and regulatory elements. High-throughput RNA sequencing (RNA-seq) has been applied to study the transcriptional landscape of bacteria, which can efficiently and accurately identify gene expression regions and unknown transcriptional units, especially small non-coding RNAs (sRNAs), UTRs and regulatory regions. The aim of this study is to comprehensively analyze the transcriptome of A. pleuropneumoniae by RNA-seq in order to improve the existing genome annotation and promote our understanding of A. pleuropneumoniae gene structures and RNA-based regulation. In this study, we utilized RNA-seq to construct a single nucleotide resolution transcriptome map of A. pleuropneumoniae. More than 3.8 million high-quality reads (average length ~90 bp) from a cDNA library were generated and aligned to the reference genome. We identified 32 open reading frames encoding novel proteins that were mis-annotated in the previous genome annotations. The start sites for 35 genes based on the current genome annotation were corrected. Furthermore, 51 sRNAs in the A. pleuropneumoniae genome were discovered, of which 40 sRNAs were never reported in previous studies. The transcriptome map also enabled visualization of 5'- and 3'-UTR regions, in which contained 11 sRNAs. In addition, 351 operons covering 1230 genes throughout the whole genome were identified. The RNA-Seq based transcriptome map validated annotated genes and corrected annotations of open reading frames in the genome, and led to the identification of many functional elements (e.g. regions encoding novel proteins, non-coding sRNAs and operon structures). The transcriptional units described in this study provide a foundation for future studies concerning the gene functions and the transcriptional regulatory architectures of this pathogen. PMID:27018591
Unprecedented high-resolution view of bacterial operon architecture revealed by RNA sequencing.
Conway, Tyrrell; Creecy, James P; Maddox, Scott M; Grissom, Joe E; Conkle, Trevor L; Shadid, Tyler M; Teramoto, Jun; San Miguel, Phillip; Shimada, Tomohiro; Ishihama, Akira; Mori, Hirotada; Wanner, Barry L
2014-07-08
We analyzed the transcriptome of Escherichia coli K-12 by strand-specific RNA sequencing at single-nucleotide resolution during steady-state (logarithmic-phase) growth and upon entry into stationary phase in glucose minimal medium. To generate high-resolution transcriptome maps, we developed an organizational schema which showed that in practice only three features are required to define operon architecture: the promoter, terminator, and deep RNA sequence read coverage. We precisely annotated 2,122 promoters and 1,774 terminators, defining 1,510 operons with an average of 1.98 genes per operon. Our analyses revealed an unprecedented view of E. coli operon architecture. A large proportion (36%) of operons are complex with internal promoters or terminators that generate multiple transcription units. For 43% of operons, we observed differential expression of polycistronic genes, despite being in the same operons, indicating that E. coli operon architecture allows fine-tuning of gene expression. We found that 276 of 370 convergent operons terminate inefficiently, generating complementary 3' transcript ends which overlap on average by 286 nucleotides, and 136 of 388 divergent operons have promoters arranged such that their 5' ends overlap on average by 168 nucleotides. We found 89 antisense transcripts of 397-nucleotide average length, 7 unannotated transcripts within intergenic regions, and 18 sense transcripts that completely overlap operons on the opposite strand. Of 519 overlapping transcripts, 75% correspond to sequences that are highly conserved in E. coli (>50 genomes). Our data extend recent studies showing unexpected transcriptome complexity in several bacteria and suggest that antisense RNA regulation is widespread. Importance: We precisely mapped the 5' and 3' ends of RNA transcripts across the E. coli K-12 genome by using a single-nucleotide analytical approach. Our resulting high-resolution transcriptome maps show that ca. one-third of E. coli operons are complex, with internal promoters and terminators generating multiple transcription units and allowing differential gene expression within these operons. We discovered extensive antisense transcription that results from more than 500 operons, which fully overlap or extensively overlap adjacent divergent or convergent operons. The genomic regions corresponding to these antisense transcripts are highly conserved in E. coli (including Shigella species), although it remains to be proven whether or not they are functional. Our observations of features unearthed by single-nucleotide transcriptome mapping suggest that deeper layers of transcriptional regulation in bacteria are likely to be revealed in the future. Copyright © 2014 Conway et al.
Burkhard, Silja Barbara
2018-01-01
Development of specialized cells and structures in the heart is regulated by spatially -restricted molecular pathways. Disruptions in these pathways can cause severe congenital cardiac malformations or functional defects. To better understand these pathways and how they regulate cardiac development we used tomo-seq, combining high-throughput RNA-sequencing with tissue-sectioning, to establish a genome-wide expression dataset with high spatial resolution for the developing zebrafish heart. Analysis of the dataset revealed over 1100 genes differentially expressed in sub-compartments. Pacemaker cells in the sinoatrial region induce heart contractions, but little is known about the mechanisms underlying their development. Using our transcriptome map, we identified spatially restricted Wnt/β-catenin signaling activity in pacemaker cells, which was controlled by Islet-1 activity. Moreover, Wnt/β-catenin signaling controls heart rate by regulating pacemaker cellular response to parasympathetic stimuli. Thus, this high-resolution transcriptome map incorporating all cell types in the embryonic heart can expose spatially restricted molecular pathways critical for specific cardiac functions. PMID:29400650
RNA-Seq Based Transcriptional Map of Bovine Respiratory Disease Pathogen “Histophilus somni 2336”
Kumar, Ranjit; Lawrence, Mark L.; Watt, James; Cooksey, Amanda M.; Burgess, Shane C.; Nanduri, Bindu
2012-01-01
Genome structural annotation, i.e., identification and demarcation of the boundaries for all the functional elements in a genome (e.g., genes, non-coding RNAs, proteins and regulatory elements), is a prerequisite for systems level analysis. Current genome annotation programs do not identify all of the functional elements of the genome, especially small non-coding RNAs (sRNAs). Whole genome transcriptome analysis is a complementary method to identify “novel” genes, small RNAs, regulatory regions, and operon structures, thus improving the structural annotation in bacteria. In particular, the identification of non-coding RNAs has revealed their widespread occurrence and functional importance in gene regulation, stress and virulence. However, very little is known about non-coding transcripts in Histophilus somni, one of the causative agents of Bovine Respiratory Disease (BRD) as well as bovine infertility, abortion, septicemia, arthritis, myocarditis, and thrombotic meningoencephalitis. In this study, we report a single nucleotide resolution transcriptome map of H. somni strain 2336 using RNA-Seq method. The RNA-Seq based transcriptome map identified 94 sRNAs in the H. somni genome of which 82 sRNAs were never predicted or reported in earlier studies. We also identified 38 novel potential protein coding open reading frames that were absent in the current genome annotation. The transcriptome map allowed the identification of 278 operon (total 730 genes) structures in the genome. When compared with the genome sequence of a non-virulent strain 129Pt, a disproportionate number of sRNAs (∼30%) were located in genomic region unique to strain 2336 (∼18% of the total genome). This observation suggests that a number of the newly identified sRNAs in strain 2336 may be involved in strain-specific adaptations. PMID:22276113
RNA-seq based transcriptional map of bovine respiratory disease pathogen "Histophilus somni 2336".
Kumar, Ranjit; Lawrence, Mark L; Watt, James; Cooksey, Amanda M; Burgess, Shane C; Nanduri, Bindu
2012-01-01
Genome structural annotation, i.e., identification and demarcation of the boundaries for all the functional elements in a genome (e.g., genes, non-coding RNAs, proteins and regulatory elements), is a prerequisite for systems level analysis. Current genome annotation programs do not identify all of the functional elements of the genome, especially small non-coding RNAs (sRNAs). Whole genome transcriptome analysis is a complementary method to identify "novel" genes, small RNAs, regulatory regions, and operon structures, thus improving the structural annotation in bacteria. In particular, the identification of non-coding RNAs has revealed their widespread occurrence and functional importance in gene regulation, stress and virulence. However, very little is known about non-coding transcripts in Histophilus somni, one of the causative agents of Bovine Respiratory Disease (BRD) as well as bovine infertility, abortion, septicemia, arthritis, myocarditis, and thrombotic meningoencephalitis. In this study, we report a single nucleotide resolution transcriptome map of H. somni strain 2336 using RNA-Seq method.The RNA-Seq based transcriptome map identified 94 sRNAs in the H. somni genome of which 82 sRNAs were never predicted or reported in earlier studies. We also identified 38 novel potential protein coding open reading frames that were absent in the current genome annotation. The transcriptome map allowed the identification of 278 operon (total 730 genes) structures in the genome. When compared with the genome sequence of a non-virulent strain 129Pt, a disproportionate number of sRNAs (∼30%) were located in genomic region unique to strain 2336 (∼18% of the total genome). This observation suggests that a number of the newly identified sRNAs in strain 2336 may be involved in strain-specific adaptations.
Moore, Michael; Zhang, Chaolin; Gantman, Emily Conn; Mele, Aldo; Darnell, Jennifer C.; Darnell, Robert B.
2014-01-01
Summary Identifying sites where RNA binding proteins (RNABPs) interact with target RNAs opens the door to understanding the vast complexity of RNA regulation. UV-crosslinking and immunoprecipitation (CLIP) is a transformative technology in which RNAs purified from in vivo cross-linked RNA-protein complexes are sequenced to reveal footprints of RNABP:RNA contacts. CLIP combined with high throughput sequencing (HITS-CLIP) is a generalizable strategy to produce transcriptome-wide RNA binding maps with higher accuracy and resolution than standard RNA immunoprecipitation (RIP) profiling or purely computational approaches. Applying CLIP to Argonaute proteins has expanded the utility of this approach to mapping binding sites for microRNAs and other small regulatory RNAs. Finally, recent advances in data analysis take advantage of crosslinked-induced mutation sites (CIMS) to refine RNA-binding maps to single-nucleotide resolution. Once IP conditions are established, HITS-CLIP takes approximately eight days to prepare RNA for sequencing. Established pipelines for data analysis, including for CIMS, take 3-4 days. PMID:24407355
Smola, Matthew J; Rice, Greggory M; Busan, Steven; Siegfried, Nathan A; Weeks, Kevin M
2015-11-01
Selective 2'-hydroxyl acylation analyzed by primer extension (SHAPE) chemistries exploit small electrophilic reagents that react with 2'-hydroxyl groups to interrogate RNA structure at single-nucleotide resolution. Mutational profiling (MaP) identifies modified residues by using reverse transcriptase to misread a SHAPE-modified nucleotide and then counting the resulting mutations by massively parallel sequencing. The SHAPE-MaP approach measures the structure of large and transcriptome-wide systems as accurately as can be done for simple model RNAs. This protocol describes the experimental steps, implemented over 3 d, that are required to perform SHAPE probing and to construct multiplexed SHAPE-MaP libraries suitable for deep sequencing. Automated processing of MaP sequencing data is accomplished using two software packages. ShapeMapper converts raw sequencing files into mutational profiles, creates SHAPE reactivity plots and provides useful troubleshooting information. SuperFold uses these data to model RNA secondary structures, identify regions with well-defined structures and visualize probable and alternative helices, often in under 1 d. SHAPE-MaP can be used to make nucleotide-resolution biophysical measurements of individual RNA motifs, rare components of complex RNA ensembles and entire transcriptomes.
The developmental transcriptome of Drosophila melanogaster
DOE Office of Scientific and Technical Information (OSTI.GOV)
University of Connecticut; Graveley, Brenton R.; Brooks, Angela N.
Drosophila melanogaster is one of the most well studied genetic model organisms; nonetheless, its genome still contains unannotated coding and non-coding genes, transcripts, exons and RNA editing sites. Full discovery and annotation are pre-requisites for understanding how the regulation of transcription, splicing and RNA editing directs the development of this complex organism. Here we used RNA-Seq, tiling microarrays and cDNA sequencing to explore the transcriptome in 30 distinct developmental stages. We identified 111,195 new elements, including thousands of genes, coding and non-coding transcripts, exons, splicing and editing events, and inferred protein isoforms that previously eluded discovery using established experimental, predictionmore » and conservation-based approaches. These data substantially expand the number of known transcribed elements in the Drosophila genome and provide a high-resolution view of transcriptome dynamics throughout development. Drosophila melanogaster is an important non-mammalian model system that has had a critical role in basic biological discoveries, such as identifying chromosomes as the carriers of genetic information and uncovering the role of genes in development. Because it shares a substantial genic content with humans, Drosophila is increasingly used as a translational model for human development, homeostasis and disease. High-quality maps are needed for all functional genomic elements. Previous studies demonstrated that a rich collection of genes is deployed during the life cycle of the fly. Although expression profiling using microarrays has revealed the expression of, 13,000 annotated genes, it is difficult to map splice junctions and individual base modifications generated by RNA editing using such approaches. Single-base resolution is essential to define precisely the elements that comprise the Drosophila transcriptome. Estimates of the number of transcript isoforms are less accurate than estimates of the number of genes. Whereas, 20% of Drosophila genes are annotated as encoding alternatively spliced premRNAs, splice-junction microarray experiments indicate that this number is at least 40% (ref. 7). Determining the diversity of mRNAs generated by alternative promoters, alternative splicing and RNA editing will substantially increase the inferred protein repertoire. Non-coding RNA genes (ncRNAs) including short interfering RNAs (siRNAs) and microRNAS (miRNAs) (reviewed in ref. 10), and longer ncRNAs such as bxd (ref. 11) and rox (ref. 12), have important roles in gene regulation, whereas others such as small nucleolar RNAs (snoRNAs)and small nuclear RNAs (snRNAs) are important components of macromolecular machines such as the ribosome and spliceosome. The transcription and processing of these ncRNAs must also be fully documented and mapped. As part of the modENCODE project to annotate the functional elements of the D. melanogaster and Caenorhabditis elegans genomes, we used RNA-Seq and tiling microarrays to sample the Drosophila transcriptome at unprecedented depth throughout development from early embryo to ageing male and female adults. We report on a high-resolution view of the discovery, structure and dynamic expression of the D. melanogaster transcriptome.« less
High-confidence coding and noncoding transcriptome maps
2017-01-01
The advent of high-throughput RNA sequencing (RNA-seq) has led to the discovery of unprecedentedly immense transcriptomes encoded by eukaryotic genomes. However, the transcriptome maps are still incomplete partly because they were mostly reconstructed based on RNA-seq reads that lack their orientations (known as unstranded reads) and certain boundary information. Methods to expand the usability of unstranded RNA-seq data by predetermining the orientation of the reads and precisely determining the boundaries of assembled transcripts could significantly benefit the quality of the resulting transcriptome maps. Here, we present a high-performing transcriptome assembly pipeline, called CAFE, that significantly improves the original assemblies, respectively assembled with stranded and/or unstranded RNA-seq data, by orienting unstranded reads using the maximum likelihood estimation and by integrating information about transcription start sites and cleavage and polyadenylation sites. Applying large-scale transcriptomic data comprising 230 billion RNA-seq reads from the ENCODE, Human BodyMap 2.0, The Cancer Genome Atlas, and GTEx projects, CAFE enabled us to predict the directions of about 220 billion unstranded reads, which led to the construction of more accurate transcriptome maps, comparable to the manually curated map, and a comprehensive lncRNA catalog that includes thousands of novel lncRNAs. Our pipeline should not only help to build comprehensive, precise transcriptome maps from complex genomes but also to expand the universe of noncoding genomes. PMID:28396519
Hayakawa-Yano, Yoshika; Suyama, Satoshi; Nogami, Masahiro; Yugami, Masato; Koya, Ikuko; Furukawa, Takako; Zhou, Li; Abe, Manabu; Sakimura, Kenji; Takebayashi, Hirohide; Nakanishi, Atsushi; Okano, Hideyuki; Yano, Masato
2017-09-15
Cell type-specific transcriptomes are enabled by the action of multiple regulators, which are frequently expressed within restricted tissue regions. In the present study, we identify one such regulator, Quaking 5 (Qki5), as an RNA-binding protein (RNABP) that is expressed in early embryonic neural stem cells and subsequently down-regulated during neurogenesis. mRNA sequencing analysis in neural stem cell culture indicates that Qki proteins play supporting roles in the neural stem cell transcriptome and various forms of mRNA processing that may result from regionally restricted expression and subcellular localization. Also, our in utero electroporation gain-of-function study suggests that the nuclear-type Qki isoform Qki5 supports the neural stem cell state. We next performed in vivo transcriptome-wide protein-RNA interaction mapping to search for direct targets of Qki5 and elucidate how Qki5 regulates neural stem cell function. Combined with our transcriptome analysis, this mapping analysis yielded a bona fide map of Qki5-RNA interaction at single-nucleotide resolution, the identification of 892 Qki5 direct target genes, and an accurate Qki5-dependent alternative splicing rule in the developing brain. Last, our target gene list provides the first compelling evidence that Qki5 is associated with specific biological events; namely, cell-cell adhesion. This prediction was confirmed by histological analysis of mice in which Qki proteins were genetically ablated, which revealed disruption of the apical surface of the lateral wall in the developing brain. These data collectively indicate that Qki5 regulates communication between neural stem cells by mediating numerous RNA processing events and suggest new links between splicing regulation and neural stem cell states. © 2017 Hayakawa-Yano et al.; Published by Cold Spring Harbor Laboratory Press.
Chopra, Ratan; Burow, Gloria; Farmer, Andrew; Mudge, Joann; Simpson, Charles E; Wilkins, Thea A; Baring, Michael R; Puppala, Naveen; Chamberlin, Kelly D; Burow, Mark D
2015-06-01
Single-nucleotide polymorphisms, which can be identified in the thousands or millions from comparisons of transcriptome or genome sequences, are ideally suited for making high-resolution genetic maps, investigating population evolutionary history, and discovering marker-trait linkages. Despite significant results from their use in human genetics, progress in identification and use in plants, and particularly polyploid plants, has lagged. As part of a long-term project to identify and use SNPs suitable for these purposes in cultivated peanut, which is tetraploid, we generated transcriptome sequences of four peanut cultivars, namely OLin, New Mexico Valencia C, Tamrun OL07 and Jupiter, which represent the four major market classes of peanut grown in the world, and which are important economically to the US southwest peanut growing region. CopyDNA libraries of each genotype were used to generate 2 × 54 paired-end reads using an Illumina GAIIx sequencer. Raw reads were mapped to a custom reference consisting of Tifrunner 454 sequences plus peanut ESTs in GenBank, compromising 43,108 contigs; 263,840 SNP and indel variants were identified among four genotypes compared to the reference. A subset of 6 variants was assayed across 24 genotypes representing four market types using KASP chemistry to assess the criteria for SNP selection. Results demonstrated that transcriptome sequencing can identify SNPs usable as selectable DNA-based markers in complex polyploid species such as peanut. Criteria for effective use of SNPs as markers are discussed in this context.
Smola, Matthew J.; Rice, Greggory M.; Busan, Steven; Siegfried, Nathan A.; Weeks, Kevin M.
2016-01-01
SHAPE chemistries exploit small electrophilic reagents that react with the 2′-hydroxyl group to interrogate RNA structure at single-nucleotide resolution. Mutational profiling (MaP) identifies modified residues based on the ability of reverse transcriptase to misread a SHAPE-modified nucleotide and then counting the resulting mutations by massively parallel sequencing. The SHAPE-MaP approach measures the structure of large and transcriptome-wide systems as accurately as for simple model RNAs. This protocol describes the experimental steps, implemented over three days, required to perform SHAPE probing and construct multiplexed SHAPE-MaP libraries suitable for deep sequencing. These steps include RNA folding and SHAPE structure probing, mutational profiling by reverse transcription, library construction, and sequencing. Automated processing of MaP sequencing data is accomplished using two software packages. ShapeMapper converts raw sequencing files into mutational profiles, creates SHAPE reactivity plots, and provides useful troubleshooting information, often within an hour. SuperFold uses these data to model RNA secondary structures, identify regions with well-defined structures, and visualize probable and alternative helices, often in under a day. We illustrate these algorithms with the E. coli thiamine pyrophosphate riboswitch, E. coli 16S rRNA, and HIV-1 genomic RNAs. SHAPE-MaP can be used to make nucleotide-resolution biophysical measurements of individual RNA motifs, rare components of complex RNA ensembles, and entire transcriptomes. The straightforward MaP strategy greatly expands the number, length, and complexity of analyzable RNA structures. PMID:26426499
Transcriptome-wide identification of RNA-binding protein and microRNA target sites by PAR-CLIP
Hafner, Markus; Landthaler, Markus; Burger, Lukas; Khorshid, Mohsen; Hausser, Jean; Berninger, Philipp; Rothballer, Andrea; Ascano, Manuel; Jungkamp, Anna-Carina; Munschauer, Mathias; Ulrich, Alexander; Wardle, Greg S.; Dewell, Scott; Zavolan, Mihaela; Tuschl, Thomas
2010-01-01
Summary RNA transcripts are subject to post-transcriptional gene regulation involving hundreds of RNA-binding proteins (RBPs) and microRNA-containing ribonucleoprotein complexes (miRNPs) expressed in a cell-type dependent fashion. We developed a cell-based crosslinking approach to determine at high resolution and transcriptome-wide the binding sites of cellular RBPs and miRNPs. The crosslinked sites are revealed by thymidine to cytidine transitions in the cDNAs prepared from immunopurified RNPs of 4-thiouridine-treated cells. We determined the binding sites and regulatory consequences for several intensely studied RBPs and miRNPs, including PUM2, QKI, IGF2BP1-3, AGO/EIF2C1-4 and TNRC6A-C. Our study revealed that these factors bind thousands of sites containing defined sequence motifs and have distinct preferences for exonic versus intronic or coding versus untranslated transcript regions. The precise mapping of binding sites across the transcriptome will be critical to the interpretation of the rapidly emerging data on genetic variation between individuals and how these variations contribute to complex genetic diseases. PMID:20371350
2011-01-01
Background Several tools have been developed to perform global gene expression profile data analysis, to search for specific chromosomal regions whose features meet defined criteria as well as to study neighbouring gene expression. However, most of these tools are tailored for a specific use in a particular context (e.g. they are species-specific, or limited to a particular data format) and they typically accept only gene lists as input. Results TRAM (Transcriptome Mapper) is a new general tool that allows the simple generation and analysis of quantitative transcriptome maps, starting from any source listing gene expression values for a given gene set (e.g. expression microarrays), implemented as a relational database. It includes a parser able to assign univocal and updated gene symbols to gene identifiers from different data sources. Moreover, TRAM is able to perform intra-sample and inter-sample data normalization, including an original variant of quantile normalization (scaled quantile), useful to normalize data from platforms with highly different numbers of investigated genes. When in 'Map' mode, the software generates a quantitative representation of the transcriptome of a sample (or of a pool of samples) and identifies if segments of defined lengths are over/under-expressed compared to the desired threshold. When in 'Cluster' mode, the software searches for a set of over/under-expressed consecutive genes. Statistical significance for all results is calculated with respect to genes localized on the same chromosome or to all genome genes. Transcriptome maps, showing differential expression between two sample groups, relative to two different biological conditions, may be easily generated. We present the results of a biological model test, based on a meta-analysis comparison between a sample pool of human CD34+ hematopoietic progenitor cells and a sample pool of megakaryocytic cells. Biologically relevant chromosomal segments and gene clusters with differential expression during the differentiation toward megakaryocyte were identified. Conclusions TRAM is designed to create, and statistically analyze, quantitative transcriptome maps, based on gene expression data from multiple sources. The release includes FileMaker Pro database management runtime application and it is freely available at http://apollo11.isto.unibo.it/software/, along with preconfigured implementations for mapping of human, mouse and zebrafish transcriptomes. PMID:21333005
Sun, Cheng; Yu, Guoliang; Bao, Manzhu; Zheng, Bo; Ning, Guogui
2014-06-27
Odd traits in few of plant species usually implicate potential biology significances in plant evolutions. The genus Helwingia Willd, a dioecious medical shrub in Aquifoliales order, has an odd floral architecture-epiphyllous inflorescence. The potential significances and possible evolutionary origin of this specie are not well understood due to poorly available data of biological and genetic studies. In addition, the advent of genomics-based technologies has widely revolutionized plant species with unknown genomic information. Morphological and biological pattern were detailed via anatomical and pollination analyses. An RNA sequencing based transcriptomic analysis were undertaken and a high-resolution phylogenetic analysis was conducted based on single-copy genes in more than 80 species of seed plants, including H. japonica. It is verified that a potential fusion of rachis to the leaf midvein facilitates insect pollination. RNA sequencing yielded a total of 111450 unigenes; half of them had significant similarity with proteins in the public database, and 20281 unigenes were mapped to 119 pathways. Deduced from the phylogenetic analysis based on single-copy genes, the group of Helwingia is closer with Euasterids II and rather than Euasterids, congruent with previous reports using plastid sequences. The odd flower architecture make H. Willd adapt to insect pollination by hosting those insects larger than the flower in size via leave, which has little common character that other insect pollination plants hold. Further the present transcriptome greatly riches genomics information of Helwingia species and nucleus genes based phylogenetic analysis also greatly improve the resolution and robustness of phylogenetic reconstruction in H. japonica.
Identifier mapping performance for integrating transcriptomics and proteomics experimental results
2011-01-01
Background Studies integrating transcriptomic data with proteomic data can illuminate the proteome more clearly than either separately. Integromic studies can deepen understanding of the dynamic complex regulatory relationship between the transcriptome and the proteome. Integrating these data dictates a reliable mapping between the identifier nomenclature resultant from the two high-throughput platforms. However, this kind of analysis is well known to be hampered by lack of standardization of identifier nomenclature among proteins, genes, and microarray probe sets. Therefore data integration may also play a role in critiquing the fallible gene identifications that both platforms emit. Results We compared three freely available internet-based identifier mapping resources for mapping UniProt accessions (ACCs) to Affymetrix probesets identifications (IDs): DAVID, EnVision, and NetAffx. Liquid chromatography-tandem mass spectrometry analyses of 91 endometrial cancer and 7 noncancer samples generated 11,879 distinct ACCs. For each ACC, we compared the retrieval sets of probeset IDs from each mapping resource. We confirmed a high level of discrepancy among the mapping resources. On the same samples, mRNA expression was available. Therefore, to evaluate the quality of each ACC-to-probeset match, we calculated proteome-transcriptome correlations, and compared the resources presuming that better mapping of identifiers should generate a higher proportion of mapped pairs with strong inter-platform correlations. A mixture model for the correlations fitted well and supported regression analysis, providing a window into the performance of the mapping resources. The resources have added and dropped matches over two years, but their overall performance has not changed. Conclusions The methods presented here serve to achieve concrete context-specific insight, to support well-informed decisions in choosing an ID mapping strategy for "omic" data merging. PMID:21619611
Wei, Lin; Li, Shenghua; Liu, Shenggui; He, Anna; Wang, Dan; Wang, Jie; Tang, Yulian; Wu, Xianjin
2014-01-01
Houttuynia cordata Thunb. is an important traditional medical herb in China and other Asian countries, with high medicinal and economic value. However, a lack of available genomic information has become a limitation for research on this species. Thus, we carried out high-throughput transcriptomic sequencing of H. cordata to generate an enormous transcriptome sequence dataset for gene discovery and molecular marker development. Illumina paired-end sequencing technology produced over 56 million sequencing reads from H. cordata mRNA. Subsequent de novo assembly yielded 63,954 unigenes, 39,982 (62.52%) and 26,122 (40.84%) of which had significant similarity to proteins in the NCBI nonredundant protein and Swiss-Prot databases (E-value <10(-5)), respectively. Of these annotated unigenes, 30,131 and 15,363 unigenes were assigned to gene ontology categories and clusters of orthologous groups, respectively. In addition, 24,434 (38.21%) unigenes were mapped onto 128 pathways using the KEGG pathway database and 17,964 (44.93%) unigenes showed homology to Vitis vinifera (Vitaceae) genes in BLASTx analysis. Furthermore, 4,800 cDNA SSRs were identified as potential molecular markers. Fifty primer pairs were randomly selected to detect polymorphism among 30 samples of H. cordata; 43 (86%) produced fragments of expected size, suggesting that the unigenes were suitable for specific primer design and of high quality, and the SSR marker could be widely used in marker-assisted selection and molecular breeding of H. cordata in the future. This is the first application of Illumina paired-end sequencing technology to investigate the whole transcriptome of H. cordata and to assemble RNA-seq reads without a reference genome. These data should help researchers investigating the evolution and biological processes of this species. The SSR markers developed can be used for construction of high-resolution genetic linkage maps and for gene-based association analyses in H. cordata. This work will enable future functional genomic research and research into the distinctive active constituents of this genus.
Microglia Transcriptome Changes in a Model of Depressive Behavior after Immune Challenge
Gonzalez-Pena, Dianelys; Nixon, Scott E.; O’Connor, Jason C.; Southey, Bruce R.; Lawson, Marcus A.; McCusker, Robert H.; Borras, Tania; Machuca, Debbie; Hernandez, Alvaro G.; Dantzer, Robert; Kelley, Keith W.; Rodriguez-Zas, Sandra L.
2016-01-01
Depression symptoms following immune response to a challenge have been reported after the recovery from sickness. A RNA-Seq study of the dysregulation of the microglia transcriptome in a model of inflammation-associated depressive behavior was undertaken. The transcriptome of microglia from mice at day 7 after Bacille Calmette Guérin (BCG) challenge was compared to that from unchallenged Control mice and to the transcriptome from peripheral macrophages from the same mice. Among the 562 and 3,851 genes differentially expressed between BCG-challenged and Control mice in microglia and macrophages respectively, 353 genes overlapped between these cells types. Among the most differentially expressed genes in the microglia, serum amyloid A3 (Saa3) and cell adhesion molecule 3 (Cadm3) were over-expressed and coiled-coil domain containing 162 (Ccdc162) and titin-cap (Tcap) were under-expressed in BCG-challenged relative to Control. Many of the differentially expressed genes between BCG-challenged and Control mice were associated with neurological disorders encompassing depression symptoms. Across cell types, S100 calcium binding protein A9 (S100A9), interleukin 1 beta (Il1b) and kynurenine 3-monooxygenase (Kmo) were differentially expressed between challenged and control mice. Immune response, chemotaxis, and chemokine activity were among the functional categories enriched by the differentially expressed genes. Functional categories enriched among the 9,117 genes differentially expressed between cell types included leukocyte regulation and activation, chemokine and cytokine activities, MAP kinase activity, and apoptosis. More than 200 genes exhibited alternative splicing events between cell types including WNK lysine deficient protein kinase 1 (Wnk1) and microtubule-actin crosslinking factor 1(Macf1). Network visualization revealed the capability of microglia to exhibit transcriptome dysregulation in response to immune challenge still after resolution of sickness symptoms, albeit lower than that observed in macrophages. The persistent transcriptome dysregulation in the microglia shared patterns with neurological disorders indicating that the associated persistent depressive symptoms share a common transcriptome basis. PMID:26959683
Microglia Transcriptome Changes in a Model of Depressive Behavior after Immune Challenge.
Gonzalez-Pena, Dianelys; Nixon, Scott E; O'Connor, Jason C; Southey, Bruce R; Lawson, Marcus A; McCusker, Robert H; Borras, Tania; Machuca, Debbie; Hernandez, Alvaro G; Dantzer, Robert; Kelley, Keith W; Rodriguez-Zas, Sandra L
2016-01-01
Depression symptoms following immune response to a challenge have been reported after the recovery from sickness. A RNA-Seq study of the dysregulation of the microglia transcriptome in a model of inflammation-associated depressive behavior was undertaken. The transcriptome of microglia from mice at day 7 after Bacille Calmette Guérin (BCG) challenge was compared to that from unchallenged Control mice and to the transcriptome from peripheral macrophages from the same mice. Among the 562 and 3,851 genes differentially expressed between BCG-challenged and Control mice in microglia and macrophages respectively, 353 genes overlapped between these cells types. Among the most differentially expressed genes in the microglia, serum amyloid A3 (Saa3) and cell adhesion molecule 3 (Cadm3) were over-expressed and coiled-coil domain containing 162 (Ccdc162) and titin-cap (Tcap) were under-expressed in BCG-challenged relative to Control. Many of the differentially expressed genes between BCG-challenged and Control mice were associated with neurological disorders encompassing depression symptoms. Across cell types, S100 calcium binding protein A9 (S100A9), interleukin 1 beta (Il1b) and kynurenine 3-monooxygenase (Kmo) were differentially expressed between challenged and control mice. Immune response, chemotaxis, and chemokine activity were among the functional categories enriched by the differentially expressed genes. Functional categories enriched among the 9,117 genes differentially expressed between cell types included leukocyte regulation and activation, chemokine and cytokine activities, MAP kinase activity, and apoptosis. More than 200 genes exhibited alternative splicing events between cell types including WNK lysine deficient protein kinase 1 (Wnk1) and microtubule-actin crosslinking factor 1(Macf1). Network visualization revealed the capability of microglia to exhibit transcriptome dysregulation in response to immune challenge still after resolution of sickness symptoms, albeit lower than that observed in macrophages. The persistent transcriptome dysregulation in the microglia shared patterns with neurological disorders indicating that the associated persistent depressive symptoms share a common transcriptome basis.
Lovatt, Ditte; Ruble, Brittani K.; Lee, Jaehee; Dueck, Hannah; Kim, Tae Kyung; Fisher, Stephen; Francis, Chantal; Spaethling, Jennifer M.; Wolf, John A.; Grady, M. Sean; Ulyanova, Alexandra V.; Yeldell, Sean B.; Griepenburg, Julianne C.; Buckley, Peter T.; Kim, Junhyong; Sul, Jai-Yoon; Dmochowski, Ivan J.; Eberwine, James
2014-01-01
Transcriptome profiling is an indispensable tool in advancing the understanding of single cell biology, but depends upon methods capable of isolating mRNA at the spatial resolution of a single cell. Current capture methods lack sufficient spatial resolution to isolate mRNA from individual in vivo resident cells without damaging adjacent tissue. Because of this limitation, it has been difficult to assess the influence of the microenvironment on the transcriptome of individual neurons. Here, we engineered a Transcriptome In Vivo Analysis (TIVA)-tag, which upon photoactivation enables mRNA capture from single cells in live tissue. Using the TIVA-tag in combination with RNA-seq to analyze transcriptome variance among single dispersed cells and in vivo resident mouse and human neurons, we show that the tissue microenvironment shapes the transcriptomic landscape of individual cells. The TIVA methodology provides the first noninvasive approach for capturing mRNA from single cells in their natural microenvironment. PMID:24412976
DOE Office of Scientific and Technical Information (OSTI.GOV)
Hasin-Brumshtein, Yehudit; Khan, Arshad H.; Hormozdiari, Farhad
2016-09-13
Previous studies had shown that the integration of genome wide expression profiles, in metabolic tissues, with genetic and phenotypic variance, provided valuable insight into the underlying molecular mechanisms. We used RNA-Seq to characterize hypothalamic transcriptome in 99 inbred strains of mice from the Hybrid Mouse Diversity Panel (HMDP), a reference resource population for cardiovascular and metabolic traits. We report numerous novel transcripts supported by proteomic analyses, as well as novel non coding RNAs. High resolution genetic mapping of transcript levels in HMDP, reveals bothlocalandtransexpression Quantitative Trait Loci (eQTLs) demonstrating 2transeQTL 'hotspots' associated with expression of hundreds of genes. We alsomore » report thousands of alternative splicing events regulated by genetic variants. Finally, comparison with about 150 metabolic and cardiovascular traits revealed many highly significant associations. Our data provide a rich resource for understanding the many physiologic functions mediated by the hypothalamus and their genetic regulation.« less
Genome-wide mapping of alternative splicing in Arabidopsis thaliana
Filichkin, Sergei A.; Priest, Henry D.; Givan, Scott A.; Shen, Rongkun; Bryant, Douglas W.; Fox, Samuel E.; Wong, Weng-Keen; Mockler, Todd C.
2010-01-01
Alternative splicing can enhance transcriptome plasticity and proteome diversity. In plants, alternative splicing can be manifested at different developmental stages, and is frequently associated with specific tissue types or environmental conditions such as abiotic stress. We mapped the Arabidopsis transcriptome at single-base resolution using the Illumina platform for ultrahigh-throughput RNA sequencing (RNA-seq). Deep transcriptome sequencing confirmed a majority of annotated introns and identified thousands of novel alternatively spliced mRNA isoforms. Our analysis suggests that at least ∼42% of intron-containing genes in Arabidopsis are alternatively spliced; this is significantly higher than previous estimates based on cDNA/expressed sequence tag sequencing. Random validation confirmed that novel splice isoforms empirically predicted by RNA-seq can be detected in vivo. Novel introns detected by RNA-seq were substantially enriched in nonconsensus terminal dinucleotide splice signals. Alternative isoforms with premature termination codons (PTCs) comprised the majority of alternatively spliced transcripts. Using an example of an essential circadian clock gene, we show that intron retention can generate relatively abundant PTC+ isoforms and that this specific event is highly conserved among diverse plant species. Alternatively spliced PTC+ isoforms can be potentially targeted for degradation by the nonsense mediated mRNA decay (NMD) surveillance machinery or regulate the level of functional transcripts by the mechanism of regulated unproductive splicing and translation (RUST). We demonstrate that the relative ratios of the PTC+ and reference isoforms for several key regulatory genes can be considerably shifted under abiotic stress treatments. Taken together, our results suggest that like in animals, NMD and RUST may be widespread in plants and may play important roles in regulating gene expression. PMID:19858364
van der Meulen, Sjoerd B; de Jong, Anne; Kok, Jan
2016-01-01
RNA sequencing has revolutionized genome-wide transcriptome analyses, and the identification of non-coding regulatory RNAs in bacteria has thus increased concurrently. Here we reveal the transcriptome map of the lactic acid bacterial paradigm Lactococcus lactis MG1363 by employing differential RNA sequencing (dRNA-seq) and a combination of manual and automated transcriptome mining. This resulted in a high-resolution genome annotation of L. lactis and the identification of 60 cis-encoded antisense RNAs (asRNAs), 186 trans-encoded putative regulatory RNAs (sRNAs) and 134 novel small ORFs. Based on the putative targets of asRNAs, a novel classification is proposed. Several transcription factor DNA binding motifs were identified in the promoter sequences of (a)sRNAs, providing insight in the interplay between lactococcal regulatory RNAs and transcription factors. The presence and lengths of 14 putative sRNAs were experimentally confirmed by differential Northern hybridization, including the abundant RNA 6S that is differentially expressed depending on the available carbon source. For another sRNA, LLMGnc_147, functional analysis revealed that it is involved in carbon uptake and metabolism. L. lactis contains 13% leaderless mRNAs (lmRNAs) that, from an analysis of overrepresentation in GO classes, seem predominantly involved in nucleotide metabolism and DNA/RNA binding. Moreover, an A-rich sequence motif immediately following the start codon was uncovered, which could provide novel insight in the translation of lmRNAs. Altogether, this first experimental genome-wide assessment of the transcriptome landscape of L. lactis and subsequent sRNA studies provide an extensive basis for the investigation of regulatory RNAs in L. lactis and related lactococcal species.
The aquatic animals' transcriptome resource for comparative functional analysis.
Chou, Chih-Hung; Huang, Hsi-Yuan; Huang, Wei-Chih; Hsu, Sheng-Da; Hsiao, Chung-Der; Liu, Chia-Yu; Chen, Yu-Hung; Liu, Yu-Chen; Huang, Wei-Yun; Lee, Meng-Lin; Chen, Yi-Chang; Huang, Hsien-Da
2018-05-09
Aquatic animals have great economic and ecological importance. Among them, non-model organisms have been studied regarding eco-toxicity, stress biology, and environmental adaptation. Due to recent advances in next-generation sequencing techniques, large amounts of RNA-seq data for aquatic animals are publicly available. However, currently there is no comprehensive resource exist for the analysis, unification, and integration of these datasets. This study utilizes computational approaches to build a new resource of transcriptomic maps for aquatic animals. This aquatic animal transcriptome map database dbATM provides de novo assembly of transcriptome, gene annotation and comparative analysis of more than twenty aquatic organisms without draft genome. To improve the assembly quality, three computational tools (Trinity, Oases and SOAPdenovo-Trans) were employed to enhance individual transcriptome assembly, and CAP3 and CD-HIT-EST software were then used to merge these three assembled transcriptomes. In addition, functional annotation analysis provides valuable clues to gene characteristics, including full-length transcript coding regions, conserved domains, gene ontology and KEGG pathways. Furthermore, all aquatic animal genes are essential for comparative genomics tasks such as constructing homologous gene groups and blast databases and phylogenetic analysis. In conclusion, we establish a resource for non model organism aquatic animals, which is great economic and ecological importance and provide transcriptomic information including functional annotation and comparative transcriptome analysis. The database is now publically accessible through the URL http://dbATM.mbc.nctu.edu.tw/ .
Zhang, Zijun; Xing, Yi
2017-09-19
Crosslinking or RNA immunoprecipitation followed by sequencing (CLIP-seq or RIP-seq) allows transcriptome-wide discovery of RNA regulatory sites. As CLIP-seq/RIP-seq reads are short, existing computational tools focus on uniquely mapped reads, while reads mapped to multiple loci are discarded. We present CLAM (CLIP-seq Analysis of Multi-mapped reads). CLAM uses an expectation-maximization algorithm to assign multi-mapped reads and calls peaks combining uniquely and multi-mapped reads. To demonstrate the utility of CLAM, we applied it to a wide range of public CLIP-seq/RIP-seq datasets involving numerous splicing factors, microRNAs and m6A RNA methylation. CLAM recovered a large number of novel RNA regulatory sites inaccessible by uniquely mapped reads. The functional significance of these sites was demonstrated by consensus motif patterns and association with alternative splicing (splicing factors), transcript abundance (AGO2) and mRNA half-life (m6A). CLAM provides a useful tool to discover novel protein-RNA interactions and RNA modification sites from CLIP-seq and RIP-seq data, and reveals the significant contribution of repetitive elements to the RNA regulatory landscape of the human transcriptome. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.
Chatterjee, Aniruddha; Stockwell, Peter A.; Rodger, Euan J.; Duncan, Elizabeth J.; Parry, Matthew F.; Weeks, Robert J.; Morison, Ian M.
2015-01-01
The extent of variation in DNA methylation patterns in healthy individuals is not yet well documented. Identification of inter-individual epigenetic variation is important for understanding phenotypic variation and disease susceptibility. Using neutrophils from a cohort of healthy individuals, we generated base-resolution DNA methylation maps to document inter-individual epigenetic variation. We identified 12851 autosomal inter-individual variably methylated fragments (iVMFs). Gene promoters were the least variable, whereas gene body and upstream regions showed higher variation in DNA methylation. The iVMFs were relatively enriched in repetitive elements compared to non-iVMFs, and were associated with genome regulation and chromatin function elements. Further, variably methylated genes were disproportionately associated with regulation of transcription, responsive function and signal transduction pathways. Transcriptome analysis indicates that iVMF methylation at differentially expressed exons has a positive correlation and local effect on the inclusion of that exon in the mRNA transcript. PMID:26612583
The vagal ganglia transcriptome identifies candidate therapeutics for airway hyperreactivity.
Reznikov, Leah R; Meyerholz, David K; Abou Alaiwa, Mahmoud H; Kuan, Shin-Ping; Liao, Yan-Shin J; Bormann, Nicholas L; Bair, Thomas B; Price, Margaret; Stoltz, David A; Welsh, Michael J
2018-04-05
Mainstay therapeutics are ineffective in some people with asthma, suggesting a need for additional agents. In the current study, we used vagal ganglia transcriptome profiling and connectivity mapping to identify compounds beneficial for alleviating airway hyperreactivity. As a comparison, we also utilized previously published transcriptome data from sensitized mouse lungs and human asthmatic endobronchial biopsies. All transcriptomes revealed agents beneficial for mitigating airway hyperreactivity; however, only the vagal ganglia transcriptome identified agents used clinically to treat asthma (flunisolide, isoetarine). We also tested one compound identified by vagal ganglia transcriptome profiling that had not previously been linked to asthma and found that it had bronchodilator effects in both mouse and pig airways. These data suggest that transcriptome profiling of the vagal ganglia might be a novel strategy to identify potential asthma therapeutics.
Wei, Lin; Li, Shenghua; Liu, Shenggui; He, Anna; Wang, Dan; Wang, Jie; Tang, Yulian; Wu, Xianjin
2014-01-01
Background Houttuynia cordata Thunb. is an important traditional medical herb in China and other Asian countries, with high medicinal and economic value. However, a lack of available genomic information has become a limitation for research on this species. Thus, we carried out high-throughput transcriptomic sequencing of H. cordata to generate an enormous transcriptome sequence dataset for gene discovery and molecular marker development. Principal Findings Illumina paired-end sequencing technology produced over 56 million sequencing reads from H. cordata mRNA. Subsequent de novo assembly yielded 63,954 unigenes, 39,982 (62.52%) and 26,122 (40.84%) of which had significant similarity to proteins in the NCBI nonredundant protein and Swiss-Prot databases (E-value <10−5), respectively. Of these annotated unigenes, 30,131 and 15,363 unigenes were assigned to gene ontology categories and clusters of orthologous groups, respectively. In addition, 24,434 (38.21%) unigenes were mapped onto 128 pathways using the KEGG pathway database and 17,964 (44.93%) unigenes showed homology to Vitis vinifera (Vitaceae) genes in BLASTx analysis. Furthermore, 4,800 cDNA SSRs were identified as potential molecular markers. Fifty primer pairs were randomly selected to detect polymorphism among 30 samples of H. cordata; 43 (86%) produced fragments of expected size, suggesting that the unigenes were suitable for specific primer design and of high quality, and the SSR marker could be widely used in marker-assisted selection and molecular breeding of H. cordata in the future. Conclusions This is the first application of Illumina paired-end sequencing technology to investigate the whole transcriptome of H. cordata and to assemble RNA-seq reads without a reference genome. These data should help researchers investigating the evolution and biological processes of this species. The SSR markers developed can be used for construction of high-resolution genetic linkage maps and for gene-based association analyses in H. cordata. This work will enable future functional genomic research and research into the distinctive active constituents of this genus. PMID:24392108
Tao, Xuelian; Chen, Jianning; Jiang, Yanzhi; Wei, Yingying; Chen, Yan; Xu, Huaming; Zhu, Li; Tang, Guoqing; Li, Mingzhou; Jiang, Anan; Shuai, Surong; Bai, Lin; Liu, Haifeng; Ma, Jideng; Jin, Long; Wen, Anxiang; Wang, Qin; Zhu, Guangxiang; Xie, Meng; Wu, Jiayun; He, Tao; Huang, Chunyu; Gao, Xiang; Li, Xuewei
2017-04-28
N 6 -methyladenosine (m 6 A) is the most prevalent internal form of modification in messenger RNA in higher eukaryotes and potential regulatory functions of reversible m 6 A methylation on mRNA have been revealed by mapping of m 6 A methylomes in several species. m 6 A modification in active gene regulation manifests itself as altered methylation profiles in a tissue-specific manner or in response to changing cellular or species living environment. However, up to date, there has no data on m 6 A porcine transcriptome-wide map and its potential biological roles in adipose deposition and muscle growth. In this work, we used methylated RNA immunoprecipitation with next-generation sequencing (MeRIP-Seq) technique to acquire the first ever m 6 A porcine transcriptome-wide map. Transcriptomes of muscle and adipose tissues from three different pig breeds, the wild boar, Landrace, and Rongchang pig, were used to generate these maps. Our findings show that there were 5,872 and 2,826 m 6 A peaks respectively, in the porcine muscle and adipose tissue transcriptomes. Stop codons, 3'-untranslated regions, and coding regions were found to be mainly enriched for m 6 A peaks. Gene ontology analysis revealed that common m 6 A peaks in nuclear genes are associated with transcriptional factors, suggestive of a relationship between m 6 A mRNA methylation and nuclear genome transcription. Some genes showed tissue- and breed-differential methylation, and have novel biological functions. We also found a relationship between the m 6 A methylation extent and the transcript level, suggesting a regulatory role for m 6 A in gene expression. This comprehensive map provides a solid basis for the determination of potential functional roles for RNA m 6 A modification in adipose deposition and muscle growth.
Cui, Yi; Liu, Jing; Irudayaraj, Joseph
2017-07-01
In situ analysis offers a venue for dissecting the complex transcriptome in its natural context to tap into cellular processes that could explain the phenotypic physiology and pathology yet to be understood. Over the past decades, enormous progress has been made to improve the resolution, sensitivity, and specificity of single-cell technologies. The continued efforts in RNA research not only facilitates mechanistic studies of molecular biology but also provides state-of-the-art strategies for diagnostic purposes. The implementation of novel bio-imaging platforms has yielded valuable information for inspecting gene expression, mapping regulatory networks, and classifying cell types. In this article, we discuss the merits and technical challenges in single-molecule in situ RNA profiling. Advanced in situ hybridization methodologies developed for a variety of detection modalities are reviewed. Considering the fact that in mammalian cells the number of protein products immensely exceeds that of the actual coding genes due to pre-mRNA alternative splicing, tools capable of elucidating this process in intact cells are highlighted. To conclude, we point out future directions for in situ transcriptome analysis and expect a plethora of opportunities and discoveries in this field. WIREs Nanomed Nanobiotechnol 2017, 9:e1443. doi: 10.1002/wnan.1443 For further resources related to this article, please visit the WIREs website. © 2016 Wiley Periodicals, Inc.
Customizing the Connectivity Map Approach for Functional Evaluation in Toxicogenomics Studies (SOT)
Evaluating effects on the transcriptome can provide insight on putative chemical-specific mechanisms of action (MOAs). With whole genome transcriptomics technologies becoming more amenable to high-throughput screening, libraries of chemicals can be evaluated in vitro to produce l...
Naithani, Sushma; Sullivan, Chris; Preece, Justin; Tiwari, Vijay K.; Elser, Justin; Leonard, Jeffrey M.; Sage, Abigail; Gresham, Cathy; Kerhornou, Arnaud; Bolser, Dan; McCarthy, Fiona; Kersey, Paul; Lazo, Gerard R.; Jaiswal, Pankaj
2014-01-01
Background Triticum monococcum (2n) is a close ancestor of T. urartu, the A-genome progenitor of cultivated hexaploid wheat, and is therefore a useful model for the study of components regulating photomorphogenesis in diploid wheat. In order to develop genetic and genomic resources for such a study, we constructed genome-wide transcriptomes of two Triticum monococcum subspecies, the wild winter wheat T. monococcum ssp. aegilopoides (accession G3116) and the domesticated spring wheat T. monococcum ssp. monococcum (accession DV92) by generating de novo assemblies of RNA-Seq data derived from both etiolated and green seedlings. Principal Findings The de novo transcriptome assemblies of DV92 and G3116 represent 120,911 and 117,969 transcripts, respectively. We successfully mapped ∼90% of these transcripts from each accession to barley and ∼95% of the transcripts to T. urartu genomes. However, only ∼77% transcripts mapped to the annotated barley genes and ∼85% transcripts mapped to the annotated T. urartu genes. Differential gene expression analyses revealed 22% more light up-regulated and 35% more light down-regulated transcripts in the G3116 transcriptome compared to DV92. The DV92 and G3116 mRNA sequence reads aligned against the reference barley genome led to the identification of ∼500,000 single nucleotide polymorphism (SNP) and ∼22,000 simple sequence repeat (SSR) sites. Conclusions De novo transcriptome assemblies of two accessions of the diploid wheat T. monococcum provide new empirical transcriptome references for improving Triticeae genome annotations, and insights into transcriptional programming during photomorphogenesis. The SNP and SSR sites identified in our analysis provide additional resources for the development of molecular markers. PMID:24821410
Sequencing, Annotation and Analysis of the Syrian Hamster (Mesocricetus auratus) Transcriptome
Tchitchek, Nicolas; Safronetz, David; Rasmussen, Angela L.; Martens, Craig; Virtaneva, Kimmo; Porcella, Stephen F.; Feldmann, Heinz
2014-01-01
Background The Syrian hamster (golden hamster, Mesocricetus auratus) is gaining importance as a new experimental animal model for multiple pathogens, including emerging zoonotic diseases such as Ebola. Nevertheless there are currently no publicly available transcriptome reference sequences or genome for this species. Results A cDNA library derived from mRNA and snRNA isolated and pooled from the brains, lungs, spleens, kidneys, livers, and hearts of three adult female Syrian hamsters was sequenced. Sequence reads were assembled into 62,482 contigs and 111,796 reads remained unassembled (singletons). This combined contig/singleton dataset, designated as the Syrian hamster transcriptome, represents a total of 60,117,204 nucleotides. Our Mesocricetus auratus Syrian hamster transcriptome mapped to 11,648 mouse transcripts representing 9,562 distinct genes, and mapped to a similar number of transcripts and genes in the rat. We identified 214 quasi-complete transcripts based on mouse annotations. Canonical pathways involved in a broad spectrum of fundamental biological processes were significantly represented in the library. The Syrian hamster transcriptome was aligned to the current release of the Chinese hamster ovary (CHO) cell transcriptome and genome to improve the genomic annotation of this species. Finally, our Syrian hamster transcriptome was aligned against 14 other rodents, primate and laurasiatheria species to gain insights about the genetic relatedness and placement of this species. Conclusions This Syrian hamster transcriptome dataset significantly improves our knowledge of the Syrian hamster's transcriptome, especially towards its future use in infectious disease research. Moreover, this library is an important resource for the wider scientific community to help improve genome annotation of the Syrian hamster and other closely related species. Furthermore, these data provide the basis for development of expression microarrays that can be used in functional genomics studies. PMID:25398096
Wang, Yejun; MacKenzie, Keith D; White, Aaron P
2015-05-07
As sequencing costs are being lowered continuously, RNA-seq has gradually been adopted as the first choice for comparative transcriptome studies with bacteria. Unlike microarrays, RNA-seq can directly detect cDNA derived from mRNA transcripts at a single nucleotide resolution. Not only does this allow researchers to determine the absolute expression level of genes, but it also conveys information about transcript structure. Few automatic software tools have yet been established to investigate large-scale RNA-seq data for bacterial transcript structure analysis. In this study, 54 directional RNA-seq libraries from Salmonella serovar Typhimurium (S. Typhimurium) 14028s were examined for potential relationships between read mapping patterns and transcript structure. We developed an empirical method, combined with statistical tests, to automatically detect key transcript features, including transcriptional start sites (TSSs), transcriptional termination sites (TTSs) and operon organization. Using our method, we obtained 2,764 TSSs and 1,467 TTSs for 1331 and 844 different genes, respectively. Identification of TSSs facilitated further discrimination of 215 putative sigma 38 regulons and 863 potential sigma 70 regulons. Combining the TSSs and TTSs with intergenic distance and co-expression information, we comprehensively annotated the operon organization in S. Typhimurium 14028s. Our results show that directional RNA-seq can be used to detect transcriptional borders at an acceptable resolution of ±10-20 nucleotides. Technical limitations of the RNA-seq procedure may prevent single nucleotide resolution. The automatic transcript border detection methods, statistical models and operon organization pipeline that we have described could be widely applied to RNA-seq studies in other bacteria. Furthermore, the TSSs, TTSs, operons, promoters and unstranslated regions that we have defined for S. Typhimurium 14028s may constitute valuable resources that can be used for comparative analyses with other Salmonella serotypes.
In-cell RNA structure probing with SHAPE-MaP.
Smola, Matthew J; Weeks, Kevin M
2018-06-01
This protocol is an extension to: Nat. Protoc. 10, 1643-1669 (2015); doi:10.1038/nprot.2015.103; published online 01 October 2015RNAs play key roles in many cellular processes. The underlying structure of RNA is an important determinant of how transcripts function, are processed, and interact with RNA-binding proteins and ligands. RNA structure analysis by selective 2'-hydroxyl acylation analyzed by primer extension (SHAPE) takes advantage of the reactivity of small electrophilic chemical probes that react with the 2'-hydroxyl group to assess RNA structure at nucleotide resolution. When coupled with mutational profiling (MaP), in which modified nucleotides are detected as internal miscodings during reverse transcription and then read out by massively parallel sequencing, SHAPE yields quantitative per-nucleotide measurements of RNA structure. Here, we provide an extension to our previous in vitro SHAPE-MaP protocol with detailed guidance for undertaking and analyzing SHAPE-MaP probing experiments in live cells. The MaP strategy works for both abundant-transcriptome experiments and for cellular RNAs of low to moderate abundance, which are not well examined by whole-transcriptome methods. In-cell SHAPE-MaP, performed in roughly 3 d, can be applied in cell types ranging from bacteria to cultured mammalian cells and is compatible with a variety of structure-probing reagents. We detail several strategies by which in-cell SHAPE-MaP can inform new biological hypotheses and emphasize downstream analyses that reveal sequence or structure motifs important for RNA interactions in cells.
Rosinski-Chupin, Isabelle; Sauvage, Elisabeth; Sismeiro, Odile; Villain, Adrien; Da Cunha, Violette; Caliot, Marie-Elise; Dillies, Marie-Agnès; Trieu-Cuot, Patrick; Bouloc, Philippe; Lartigue, Marie-Frédérique; Glaser, Philippe
2015-05-30
Streptococcus agalactiae, or Group B Streptococcus, is a leading cause of neonatal infections and an increasing cause of infections in adults with underlying diseases. In an effort to reconstruct the transcriptional networks involved in S. agalactiae physiology and pathogenesis, we performed an extensive and robust characterization of its transcriptome through a combination of differential RNA-sequencing in eight different growth conditions or genetic backgrounds and strand-specific RNA-sequencing. Our study identified 1,210 transcription start sites (TSSs) and 655 transcript ends as well as 39 riboswitches and cis-regulatory regions, 39 cis-antisense non-coding RNAs and 47 small RNAs potentially acting in trans. Among these putative regulatory RNAs, ten were differentially expressed in response to an acid stress and two riboswitches sensed directly or indirectly the pH modification. Strikingly, 15% of the TSSs identified were associated with the incorporation of pseudo-templated nucleotides, showing that reiterative transcription is a pervasive process in S. agalactiae. In particular, 40% of the TSSs upstream genes involved in nucleotide metabolism show reiterative transcription potentially regulating gene expression, as exemplified for pyrG and thyA encoding the CTP synthase and the thymidylate synthase respectively. This comprehensive map of the transcriptome at the single nucleotide resolution led to the discovery of new regulatory mechanisms in S. agalactiae. It also provides the basis for in depth analyses of transcriptional networks in S. agalactiae and of the regulatory role of reiterative transcription following variations of intra-cellular nucleotide pools.
Offenbacher, Steven; Barros, Silvana P; Paquette, David W; Winston, J Leslie; Biesbrock, Aaron R; Thomason, Ryan G; Gibb, Roger D; Fulmer, Andy W; Tiesman, Jay P; Juhlin, Kenton D; Wang, Shuo L; Reichling, Tim D; Chen, Ker-Sang; Ho, Begonia
2009-12-01
To our knowledge, changes in the patterns of whole-transcriptome gene expression that occur during the induction and resolution of experimental gingivitis in humans were not previously explored using bioinformatic tools. Gingival biopsy samples collected from 14 subjects during a 28-day stent-induced experimental gingivitis model, followed by treatment, and resolution at days 28 through 35 were analyzed using gene-expression arrays. Biopsy samples were collected at different sites within each subject at baseline (day 0), at the peak of gingivitis (day 28), and at resolution (day 35) and processed using whole-transcriptome gene-expression arrays. Gene-expression data were analyzed to identify biologic themes and pathways associated with changes in gene-expression profiles that occur during the induction and resolution of experimental gingivitis using bioinformatic tools. During disease induction and resolution, the dominant expression pathway was the immune response, with 131 immune response genes significantly up- or downregulated during induction, during resolution, or during both at P <0.05. During induction, there was significant transient increase in the expression of inflammatory and oxidative stress mediators, including interleukin (IL)-1 alpha (IL1A), IL-1 beta (IL1B), IL8, RANTES, colony stimulating factor 3 (CSF3), and superoxide dismutase 2 (SOD2), and a decreased expression of IP10, interferon inducible T-cell alpha chemoattractant (ITAC), matrix metalloproteinase 10 (MMP10), and beta 4 defensin (DEFB4). These genes reversed expression patterns upon resolution in parallel with the reversal of gingival inflammation. A relatively small subset (11.9%) of the immune response genes analyzed by array was transiently activated in response to biofilm overgrowth, suggesting a degree of specificity in the transcriptome-expression response. The fact that this same subset demonstrates a reversal in expression patterns during clinical resolution implicates these genes as being critical for maintaining tissue homeostasis at the biofilm-gingival interface. In addition to the immune response pathway as the dominant response theme, new candidate genes and pathways were identified as being selectively modulated in experimental gingivitis, including neural processes, epithelial defenses, angiogenesis, and wound healing.
Metabolomics for undergraduates: Identification and pathway assignment of mitochondrial metabolites.
Marques, Ana Patrícia; Serralheiro, Maria Luisa; Ferreira, António E N; Freire, Ana Ponces; Cordeiro, Carlos; Silva, Marta Sousa
2016-01-01
Metabolomics is a key discipline in systems biology, together with genomics, transcriptomics, and proteomics. In this omics cascade, the metabolome represents the biochemical products that arise from cellular processes and is often regarded as the final response of a biological system to environmental or genetic changes. The overall screening approach to identify all the metabolites in a given biological system is called metabolic fingerprinting. Using high-resolution and high-mass accuracy mass spectrometry, large metabolome coverage, sensitivity, and specificity can be attained. Although the theoretical concepts of this methodology are usually provided in life-science programs, hands-on laboratory experiments are not usually accessible to undergraduate students. Even if the instruments are available, there are not simple laboratory protocols created specifically for teaching metabolomics. We designed a straightforward hands-on laboratory experiment to introduce students to this methodology, relating it to biochemical knowledge through metabolic pathway mapping of the identified metabolites. This study focuses on mitochondrial metabolomics since mitochondria have a well-known, medium-sized cellular sub-metabolome. These features facilitate both data processing and pathway mapping. In this experiment, students isolate mitochondria from potatoes, extract the metabolites, and analyze them by high-resolution mass spectrometry (using an FT-ICR mass spectrometer). The resulting mass list is submitted to an online program for metabolite identification, and compounds associated with mitochondrial pathways can be highlighted in a metabolic network map. © 2015 The International Union of Biochemistry and Molecular Biology.
Chao, Michael C.; Pritchard, Justin R.; Zhang, Yanjia J.; Rubin, Eric J.; Livny, Jonathan; Davis, Brigid M.; Waldor, Matthew K.
2013-01-01
The coupling of high-density transposon mutagenesis to high-throughput DNA sequencing (transposon-insertion sequencing) enables simultaneous and genome-wide assessment of the contributions of individual loci to bacterial growth and survival. We have refined analysis of transposon-insertion sequencing data by normalizing for the effect of DNA replication on sequencing output and using a hidden Markov model (HMM)-based filter to exploit heretofore unappreciated information inherent in all transposon-insertion sequencing data sets. The HMM can smooth variations in read abundance and thereby reduce the effects of read noise, as well as permit fine scale mapping that is independent of genomic annotation and enable classification of loci into several functional categories (e.g. essential, domain essential or ‘sick’). We generated a high-resolution map of genomic loci (encompassing both intra- and intergenic sequences) that are required or beneficial for in vitro growth of the cholera pathogen, Vibrio cholerae. This work uncovered new metabolic and physiologic requirements for V. cholerae survival, and by combining transposon-insertion sequencing and transcriptomic data sets, we also identified several novel noncoding RNA species that contribute to V. cholerae growth. Our findings suggest that HMM-based approaches will enhance extraction of biological meaning from transposon-insertion sequencing genomic data. PMID:23901011
Wong, Kim; Navarro, José Fernández; Bergenstråhle, Ludvig; Ståhl, Patrik L; Lundeberg, Joakim
2018-06-01
Spatial Transcriptomics (ST) is a method which combines high resolution tissue imaging with high troughput transcriptome sequencing data. This data must be aligned with the images for correct visualization, a process that involves several manual steps. Here we present ST Spot Detector, a web tool that automates and facilitates this alignment through a user friendly interface. jose.fernandez.navarro@scilifelab.se. Supplementary data are available at Bioinformatics online.
Yu, Yang; Wei, Jiankai; Zhang, Xiaojun; Liu, Jingwen; Liu, Chengzhang; Li, Fuhua; Xiang, Jianhai
2014-01-01
The application of next generation sequencing technology has greatly facilitated high throughput single nucleotide polymorphism (SNP) discovery and genotyping in genetic research. In the present study, SNPs were discovered based on two transcriptomes of Litopenaeus vannamei (L. vannamei) generated from Illumina sequencing platform HiSeq 2000. One transcriptome of L. vannamei was obtained through sequencing on the RNA from larvae at mysis stage and its reference sequence was de novo assembled. The data from another transcriptome were downloaded from NCBI and the reads of the two transcriptomes were mapped separately to the assembled reference by BWA. SNP calling was performed using SAMtools. A total of 58,717 and 36,277 SNPs with high quality were predicted from the two transcriptomes, respectively. SNP calling was also performed using the reads of two transcriptomes together, and a total of 96,040 SNPs with high quality were predicted. Among these 96,040 SNPs, 5,242 and 29,129 were predicted as non-synonymous and synonymous SNPs respectively. Characterization analysis of the predicted SNPs in L. vannamei showed that the estimated SNP frequency was 0.21% (one SNP per 476 bp) and the estimated ratio for transition to transversion was 2.0. Fifty SNPs were randomly selected for validation by Sanger sequencing after PCR amplification and 76% of SNPs were confirmed, which indicated that the SNPs predicted in this study were reliable. These SNPs will be very useful for genetic study in L. vannamei, especially for the high density linkage map construction and genome-wide association studies. PMID:24498047
High-throughput illumina strand-specific RNA sequencing library preparation
USDA-ARS?s Scientific Manuscript database
Conventional Illumina RNA-Seq does not have the resolution to decode the complex eukaryote transcriptome due to the lack of RNA polarity information. Strand-specific RNA sequencing (ssRNA-Seq) can overcome these limitations and as such is better suited for genome annotation, de novo transcriptome as...
A rat RNA-Seq transcriptomic BodyMap across 11 organs and 4 developmental stages
Yu, Ying; Fuscoe, James C.; Zhao, Chen; Guo, Chao; Jia, Meiwen; Qing, Tao; Bannon, Desmond I.; Lancashire, Lee; Bao, Wenjun; Du, Tingting; Luo, Heng; Su, Zhenqiang; Jones, Wendell D.; Moland, Carrie L.; Branham, William S.; Qian, Feng; Ning, Baitang; Li, Yan; Hong, Huixiao; Guo, Lei; Mei, Nan; Shi, Tieliu; Wang, Kevin Y.; Wolfinger, Russell D.; Nikolsky, Yuri; Walker, Stephen J.; Duerksen-Hughes, Penelope; Mason, Christopher E.; Tong, Weida; Thierry-Mieg, Jean; Thierry-Mieg, Danielle; Shi, Leming; Wang, Charles
2014-01-01
The rat has been used extensively as a model for evaluating chemical toxicities and for understanding drug mechanisms. However, its transcriptome across multiple organs, or developmental stages, has not yet been reported. Here we show, as part of the SEQC consortium efforts, a comprehensive rat transcriptomic BodyMap created by performing RNA-Seq on 320 samples from 11 organs of both sexes of juvenile, adolescent, adult and aged Fischer 344 rats. We catalogue the expression profiles of 40,064 genes, 65,167 transcripts, 31,909 alternatively spliced transcript variants and 2,367 non-coding genes/non-coding RNAs (ncRNAs) annotated in AceView. We find that organ-enriched, differentially expressed genes reflect the known organ-specific biological activities. A large number of transcripts show organ-specific, age-dependent or sex-specific differential expression patterns. We create a web-based, open-access rat BodyMap database of expression profiles with crosslinks to other widely used databases, anticipating that it will serve as a primary resource for biomedical research using the rat model. PMID:24510058
Nakayama, Hokuto; Sakamoto, Tomoaki; Okegawa, Yuki; Kaminoyama, Kaori; Fujie, Manabu; Ichihashi, Yasunori; Kurata, Tetsuya; Motohashi, Ken; Al-Shehbaz, Ihsan; Sinha, Neelima; Kimura, Seisuke
2018-02-19
Because natural variation in wild species is likely the result of local adaptation, it provides a valuable resource for understanding plant-environmental interactions. Rorippa aquatica (Brassicaceae) is a semi-aquatic North American plant with morphological differences between several accessions, but little information available on any physiological differences. Here, we surveyed the transcriptomes of two R. aquatica accessions and identified cryptic physiological differences between them. We first reconstructed a Rorippa phylogeny to confirm relationships between the accessions. We performed large-scale RNA-seq and de novo assembly; the resulting 87,754 unigenes were then annotated via comparisons to different databases. Between-accession physiological variation was identified with transcriptomes from both accessions. Transcriptome data were analyzed with principal component analysis and self-organizing map. Results of analyses suggested that photosynthetic capability differs between the accessions. Indeed, physiological experiments revealed between-accession variation in electron transport rate and the redox state of the plastoquinone pool. These results indicated that one accession may have adapted to differences in temperature or length of the growing season.
Su, Chang; Wang, Chao; He, Lin; Yang, Chuanping; Wang, Yucheng
2014-01-01
DNA methylation plays a critical role in the regulation of gene expression. Most studies of DNA methylation have been performed in herbaceous plants, and little is known about the methylation patterns in tree genomes. In the present study, we generated a map of methylated cytosines at single base pair resolution for Betula platyphylla (white birch) by bisulfite sequencing combined with transcriptomics to analyze DNA methylation and its effects on gene expression. We obtained a detailed view of the function of DNA methylation sequence composition and distribution in the genome of B. platyphylla. There are 34,460 genes in the whole genome of birch, and 31,297 genes are methylated. Conservatively, we estimated that 14.29% of genomic cytosines are methylcytosines in birch. Among the methylation sites, the CHH context accounts for 48.86%, and is the largest proportion. Combined transcriptome and methylation analysis showed that the genes with moderate methylation levels had higher expression levels than genes with high and low methylation. In addition, methylated genes are highly enriched for the GO subcategories of binding activities, catalytic activities, cellular processes, response to stimulus and cell death, suggesting that methylation mediates these pathways in birch trees. PMID:25514241
An anatomically comprehensive atlas of the adult human brain transcriptome
Guillozet-Bongaarts, Angela L.; Shen, Elaine H.; Ng, Lydia; Miller, Jeremy A.; van de Lagemaat, Louie N.; Smith, Kimberly A.; Ebbert, Amanda; Riley, Zackery L.; Abajian, Chris; Beckmann, Christian F.; Bernard, Amy; Bertagnolli, Darren; Boe, Andrew F.; Cartagena, Preston M.; Chakravarty, M. Mallar; Chapin, Mike; Chong, Jimmy; Dalley, Rachel A.; David Daly, Barry; Dang, Chinh; Datta, Suvro; Dee, Nick; Dolbeare, Tim A.; Faber, Vance; Feng, David; Fowler, David R.; Goldy, Jeff; Gregor, Benjamin W.; Haradon, Zeb; Haynor, David R.; Hohmann, John G.; Horvath, Steve; Howard, Robert E.; Jeromin, Andreas; Jochim, Jayson M.; Kinnunen, Marty; Lau, Christopher; Lazarz, Evan T.; Lee, Changkyu; Lemon, Tracy A.; Li, Ling; Li, Yang; Morris, John A.; Overly, Caroline C.; Parker, Patrick D.; Parry, Sheana E.; Reding, Melissa; Royall, Joshua J.; Schulkin, Jay; Sequeira, Pedro Adolfo; Slaughterbeck, Clifford R.; Smith, Simon C.; Sodt, Andy J.; Sunkin, Susan M.; Swanson, Beryl E.; Vawter, Marquis P.; Williams, Derric; Wohnoutka, Paul; Zielke, H. Ronald; Geschwind, Daniel H.; Hof, Patrick R.; Smith, Stephen M.; Koch, Christof; Grant, Seth G. N.; Jones, Allan R.
2014-01-01
Neuroanatomically precise, genome-wide maps of transcript distributions are critical resources to complement genomic sequence data and to correlate functional and genetic brain architecture. Here we describe the generation and analysis of a transcriptional atlas of the adult human brain, comprising extensive histological analysis and comprehensive microarray profiling of ~900 neuroanatomically precise subdivisions in two individuals. Transcriptional regulation varies enormously by anatomical location, with different regions and their constituent cell types displaying robust molecular signatures that are highly conserved between individuals. Analysis of differential gene expression and gene co-expression relationships demonstrates that brain-wide variation strongly reflects the distributions of major cell classes such as neurons, oligodendrocytes, astrocytes and microglia. Local neighbourhood relationships between fine anatomical subdivisions are associated with discrete neuronal subtypes and genes involved with synaptic transmission. The neocortex displays a relatively homogeneous transcriptional pattern, but with distinct features associated selectively with primary sensorimotor cortices and with enriched frontal lobe expression. Notably, the spatial topography of the neocortex is strongly reflected in its molecular topography— the closer two cortical regions, the more similar their transcriptomes. This freely accessible online data resource forms a high-resolution transcriptional baseline for neurogenetic studies of normal and abnormal human brain function. PMID:22996553
Necklace: combining reference and assembled transcriptomes for more comprehensive RNA-Seq analysis.
Davidson, Nadia M; Oshlack, Alicia
2018-05-01
RNA sequencing (RNA-seq) analyses can benefit from performing a genome-guided and de novo assembly, in particular for species where the reference genome or the annotation is incomplete. However, tools for integrating an assembled transcriptome with reference annotation are lacking. Necklace is a software pipeline that runs genome-guided and de novo assembly and combines the resulting transcriptomes with reference genome annotations. Necklace constructs a compact but comprehensive superTranscriptome out of the assembled and reference data. Reads are subsequently aligned and counted in preparation for differential expression testing. Necklace allows a comprehensive transcriptome to be built from a combination of assembled and annotated transcripts, which results in a more comprehensive transcriptome for the majority of organisms. In addition RNA-seq data are mapped back to this newly created superTranscript reference to enable differential expression testing with standard methods.
Global survey of genomic imprinting by transcriptome sequencing.
Babak, Tomas; Deveale, Brian; Armour, Christopher; Raymond, Christopher; Cleary, Michele A; van der Kooy, Derek; Johnson, Jason M; Lim, Lee P
2008-11-25
Genomic imprinting restricts gene expression to a paternal or maternal allele. To date, approximately 90 imprinted transcripts have been identified in mouse, of which the majority were detected after intense interrogation of clusters of imprinted genes identified by phenotype-driven assays in mice with uniparental disomies [1]. Here we use selective priming and parallel sequencing to measure allelic bias in whole transcriptomes. By distinguishing parent-of-origin bias from strain-specific bias in embryos derived from a reciprocal cross of mice, we constructed a genome-wide map of imprinted transcription. This map was able to objectively locate over 80% of known imprinted loci and allowed the detection and confirmation of six novel imprinted genes. Even in the intensely studied embryonic day 9.5 developmental stage that we analyzed, more than half of all imprinted single-nucleotide polymorphisms did not overlap previously discovered imprinted transcripts; a large fraction of these represent novel noncoding RNAs within known imprinted loci. For example, a previously unnoticed, maternally expressed antisense transcript was mapped within the Grb10 locus. This study demonstrates the feasibility of using transcriptome sequencing for mapping of imprinted gene expression in physiologically normal animals. Such an approach will allow researchers to study imprinting without restricting themselves to individual loci or specific transcripts.
Brereton, Nicholas J. B.; Marleau, Julie; Nissim, Werther Guidi; Labrecque, Michel; Joly, Simon; Pitre, Frederic E.
2016-01-01
Metatranscriptomic study of nonmodel organisms requires strategies that retain the highly resolved genetic information generated from model organisms while allowing for identification of the unexpected. A real-world biological application of phytoremediation, the field growth of 10 Salix cultivars on polluted soils, was used as an exemplar nonmodel and multifaceted crop response well-disposed to the study of gene expression. Sequence reads were assembled de novo to create 10 independent transcriptomes, a global transcriptome, and were mapped against the Salix purpurea 94006 reference genome. Annotation of assembled contigs was performed without a priori assumption of the originating organism. Global transcriptome construction from 3.03 billion paired-end reads revealed 606,880 unique contigs annotated from 1588 species, often common in all 10 cultivars. Comparisons between transcriptomic and metatranscriptomic methodologies provide clear evidence that nonnative RNA can mistakenly map to reference genomes, especially to conserved regions of common housekeeping genes, such as actin, α/β-tubulin, and elongation factor 1-α. In Salix, Rubisco activase transcripts were down-regulated in contaminated trees across all 10 cultivars, whereas thiamine thizole synthase and CP12, a Calvin Cycle master regulator, were uniformly up-regulated. De novo assembly approaches, with unconstrained annotation, can improve data quality; care should be taken when exploring such plant genetics to reduce de facto data exclusion by mapping to a single reference genome alone. Salix gene expression patterns strongly suggest cultivar-wide alteration of specific photosynthetic apparatus and protection of the antenna complexes from oxidation damage in contaminated trees, providing an insight into common stress tolerance strategies in a real-world phytoremediation system. PMID:27002060
Xu, Hai-Ming; Kong, Xiang-Dong; Chen, Fei; Huang, Ji-Xiang; Lou, Xiang-Yang; Zhao, Jian-Yi
2015-10-24
Brassica napus is an important oilseed crop. Dissection of the genetic architecture underlying oil-related biological processes will greatly facilitates the genetic improvement of rapeseed. The differential gene expression during pod development offers a snapshot on the genes responsible for oil accumulation in. To identify candidate genes in the linkage peaks reported previously, we used RNA sequencing (RNA-Seq) technology to analyze the pod transcriptomes of German cultivar Sollux and Chinese inbred line Gaoyou. The RNA samples were collected for RNA-Seq at 5-7, 15-17 and 25-27 days after flowering (DAF). Bioinformatics analysis was performed to investigate differentially expressed genes (DEGs). Gene annotation analysis was integrated with QTL mapping and Brassica napus pod transcriptome profiling to detect potential candidate genes in oilseed. Four hundred sixty five and two thousand, one hundred fourteen candidate DEGs were identified, respectively, between two varieties at the same stages and across different periods of each variety. Then, 33 DEGs between Sollux and Gaoyou were identified as the candidate genes affecting seed oil content by combining those DEGs with the quantitative trait locus (QTL) mapping results, of which, one was found to be homologous to Arabidopsis thaliana lipid-related genes. Intervarietal DEGs of lipid pathways in QTL regions represent important candidate genes for oil-related traits. Integrated analysis of transcriptome profiling, QTL mapping and comparative genomics with other relative species leads to efficient identification of most plausible functional genes underlying oil-content related characters, offering valuable resources for bettering breeding program of Brassica napus. This study provided a comprehensive overview on the pod transcriptomes of two varieties with different oil-contents at the three developmental stages.
Spatial reconstruction of single-cell gene expression data.
Satija, Rahul; Farrell, Jeffrey A; Gennert, David; Schier, Alexander F; Regev, Aviv
2015-05-01
Spatial localization is a key determinant of cellular fate and behavior, but methods for spatially resolved, transcriptome-wide gene expression profiling across complex tissues are lacking. RNA staining methods assay only a small number of transcripts, whereas single-cell RNA-seq, which measures global gene expression, separates cells from their native spatial context. Here we present Seurat, a computational strategy to infer cellular localization by integrating single-cell RNA-seq data with in situ RNA patterns. We applied Seurat to spatially map 851 single cells from dissociated zebrafish (Danio rerio) embryos and generated a transcriptome-wide map of spatial patterning. We confirmed Seurat's accuracy using several experimental approaches, then used the strategy to identify a set of archetypal expression patterns and spatial markers. Seurat correctly localizes rare subpopulations, accurately mapping both spatially restricted and scattered groups. Seurat will be applicable to mapping cellular localization within complex patterned tissues in diverse systems.
Strand-specific transcriptome profiling with directly labeled RNA on genomic tiling microarrays
2011-01-01
Background With lower manufacturing cost, high spot density, and flexible probe design, genomic tiling microarrays are ideal for comprehensive transcriptome studies. Typically, transcriptome profiling using microarrays involves reverse transcription, which converts RNA to cDNA. The cDNA is then labeled and hybridized to the probes on the arrays, thus the RNA signals are detected indirectly. Reverse transcription is known to generate artifactual cDNA, in particular the synthesis of second-strand cDNA, leading to false discovery of antisense RNA. To address this issue, we have developed an effective method using RNA that is directly labeled, thus by-passing the cDNA generation. This paper describes this method and its application to the mapping of transcriptome profiles. Results RNA extracted from laboratory cultures of Porphyromonas gingivalis was fluorescently labeled with an alkylation reagent and hybridized directly to probes on genomic tiling microarrays specifically designed for this periodontal pathogen. The generated transcriptome profile was strand-specific and produced signals close to background level in most antisense regions of the genome. In contrast, high levels of signal were detected in the antisense regions when the hybridization was done with cDNA. Five antisense areas were tested with independent strand-specific RT-PCR and none to negligible amplification was detected, indicating that the strong antisense cDNA signals were experimental artifacts. Conclusions An efficient method was developed for mapping transcriptome profiles specific to both coding strands of a bacterial genome. This method chemically labels and uses extracted RNA directly in microarray hybridization. The generated transcriptome profile was free of cDNA artifactual signals. In addition, this method requires fewer processing steps and is potentially more sensitive in detecting small amount of RNA compared to conventional end-labeling methods due to the incorporation of more fluorescent molecules per RNA fragment. PMID:21235785
A High-Density Linkage Map for Astyanax mexicanus Using Genotyping-by-Sequencing Technology
Carlson, Brian M.; Onusko, Samuel W.; Gross, Joshua B.
2014-01-01
The Mexican tetra, Astyanax mexicanus, is a unique model system consisting of cave-adapted and surface-dwelling morphotypes that diverged >1 million years (My) ago. This remarkable natural experiment has enabled powerful genetic analyses of cave adaptation. Here, we describe the application of next-generation sequencing technology to the creation of a high-density linkage map. Our map comprises more than 2200 markers populating 25 linkage groups constructed from genotypic data generated from a single genotyping-by-sequencing project. We leveraged emergent genomic and transcriptomic resources to anchor hundreds of anonymous Astyanax markers to the genome of the zebrafish (Danio rerio), the most closely related model organism to our study species. This facilitated the identification of 784 distinct connections between our linkage map and the Danio rerio genome, highlighting several regions of conserved genomic architecture between the two species despite ∼150 My of divergence. Using a Mendelian cave-associated trait as a proof-of-principle, we successfully recovered the genomic position of the albinism locus near the gene Oca2. Further, our map successfully informed the positions of unplaced Astyanax genomic scaffolds within particular linkage groups. This ability to identify the relative location, orientation, and linear order of unaligned genomic scaffolds will facilitate ongoing efforts to improve on the current early draft and assemble future versions of the Astyanax physical genome. Moreover, this improved linkage map will enable higher-resolution genetic analyses and catalyze the discovery of the genetic basis for cave-associated phenotypes. PMID:25520037
Dominguez, Daniel; Tsai, Yi-Hsuan; Gomez, Nicholas; Jha, Deepak Kumar; Davis, Ian; Wang, Zefeng
2016-01-01
Progression through the cell cycle is largely dependent on waves of periodic gene expression, and the regulatory networks for these transcriptome dynamics have emerged as critical points of vulnerability in various aspects of tumor biology. Through RNA-sequencing of human cells during two continuous cell cycles (>2.3 billion paired reads), we identified over 1 000 mRNAs, non-coding RNAs and pseudogenes with periodic expression. Periodic transcripts are enriched in functions related to DNA metabolism, mitosis, and DNA damage response, indicating these genes likely represent putative cell cycle regulators. Using our set of periodic genes, we developed a new approach termed “mitotic trait” that can classify primary tumors and normal tissues by their transcriptome similarity to different cell cycle stages. By analyzing >4 000 tumor samples in The Cancer Genome Atlas (TCGA) and other expression data sets, we found that mitotic trait significantly correlates with genetic alterations, tumor subtype and, notably, patient survival. We further defined a core set of 67 genes with robust periodic expression in multiple cell types. Proteins encoded by these genes function as major hubs of protein-protein interaction and are mostly required for cell cycle progression. The core genes also have unique chromatin features including increased levels of CTCF/RAD21 binding and H3K36me3. Loss of these features in uterine and kidney cancers is associated with altered expression of the core 67 genes. Our study suggests new chromatin-associated mechanisms for periodic gene regulation and offers a predictor of cancer patient outcomes. PMID:27364684
Dutta, Sutapa; Kumawat, Giriraj; Singh, Bikram P; Gupta, Deepak K; Singh, Sangeeta; Dogra, Vivek; Gaikwad, Kishor; Sharma, Tilak R; Raje, Ranjeet S; Bandhopadhya, Tapas K; Datta, Subhojit; Singh, Mahendra N; Bashasab, Fakrudin; Kulwal, Pawan; Wanjari, K B; K Varshney, Rajeev; Cook, Douglas R; Singh, Nagendra K
2011-01-20
Pigeonpea [Cajanus cajan (L.) Millspaugh], one of the most important food legumes of semi-arid tropical and subtropical regions, has limited genomic resources, particularly expressed sequence based (genic) markers. We report a comprehensive set of validated genic simple sequence repeat (SSR) markers using deep transcriptome sequencing, and its application in genetic diversity analysis and mapping. In this study, 43,324 transcriptome shotgun assembly unigene contigs were assembled from 1.696 million 454 GS-FLX sequence reads of separate pooled cDNA libraries prepared from leaf, root, stem and immature seed of two pigeonpea varieties, Asha and UPAS 120. A total of 3,771 genic-SSR loci, excluding homopolymeric and compound repeats, were identified; of which 2,877 PCR primer pairs were designed for marker development. Dinucleotide was the most common repeat motif with a frequency of 60.41%, followed by tri- (34.52%), hexa- (2.62%), tetra- (1.67%) and pentanucleotide (0.76%) repeat motifs. Primers were synthesized and tested for 772 of these loci with repeat lengths of ≥ 18 bp. Of these, 550 markers were validated for consistent amplification in eight diverse pigeonpea varieties; 71 were found to be polymorphic on agarose gel electrophoresis. Genetic diversity analysis was done on 22 pigeonpea varieties and eight wild species using 20 highly polymorphic genic-SSR markers. The number of alleles at these loci ranged from 4-10 and the polymorphism information content values ranged from 0.46 to 0.72. Neighbor-joining dendrogram showed distinct separation of the different groups of pigeonpea cultivars and wild species. Deep transcriptome sequencing of the two parental lines helped in silico identification of polymorphic genic-SSR loci to facilitate the rapid development of an intra-species reference genetic map, a subset of which was validated for expected allelic segregation in the reference mapping population. We developed 550 validated genic-SSR markers in pigeonpea using deep transcriptome sequencing. From these, 20 highly polymorphic markers were used to evaluate the genetic relationship among species of the genus Cajanus. A comprehensive set of genic-SSR markers was developed as an important genomic resource for diversity analysis and genetic mapping in pigeonpea.
2011-01-01
Background Pigeonpea [Cajanus cajan (L.) Millspaugh], one of the most important food legumes of semi-arid tropical and subtropical regions, has limited genomic resources, particularly expressed sequence based (genic) markers. We report a comprehensive set of validated genic simple sequence repeat (SSR) markers using deep transcriptome sequencing, and its application in genetic diversity analysis and mapping. Results In this study, 43,324 transcriptome shotgun assembly unigene contigs were assembled from 1.696 million 454 GS-FLX sequence reads of separate pooled cDNA libraries prepared from leaf, root, stem and immature seed of two pigeonpea varieties, Asha and UPAS 120. A total of 3,771 genic-SSR loci, excluding homopolymeric and compound repeats, were identified; of which 2,877 PCR primer pairs were designed for marker development. Dinucleotide was the most common repeat motif with a frequency of 60.41%, followed by tri- (34.52%), hexa- (2.62%), tetra- (1.67%) and pentanucleotide (0.76%) repeat motifs. Primers were synthesized and tested for 772 of these loci with repeat lengths of ≥18 bp. Of these, 550 markers were validated for consistent amplification in eight diverse pigeonpea varieties; 71 were found to be polymorphic on agarose gel electrophoresis. Genetic diversity analysis was done on 22 pigeonpea varieties and eight wild species using 20 highly polymorphic genic-SSR markers. The number of alleles at these loci ranged from 4-10 and the polymorphism information content values ranged from 0.46 to 0.72. Neighbor-joining dendrogram showed distinct separation of the different groups of pigeonpea cultivars and wild species. Deep transcriptome sequencing of the two parental lines helped in silico identification of polymorphic genic-SSR loci to facilitate the rapid development of an intra-species reference genetic map, a subset of which was validated for expected allelic segregation in the reference mapping population. Conclusion We developed 550 validated genic-SSR markers in pigeonpea using deep transcriptome sequencing. From these, 20 highly polymorphic markers were used to evaluate the genetic relationship among species of the genus Cajanus. A comprehensive set of genic-SSR markers was developed as an important genomic resource for diversity analysis and genetic mapping in pigeonpea. PMID:21251263
Integrated Quantitative Transcriptome Maps of Human Trisomy 21 Tissues and Cells
Pelleri, Maria Chiara; Cattani, Chiara; Vitale, Lorenza; Antonaros, Francesca; Strippoli, Pierluigi; Locatelli, Chiara; Cocchi, Guido; Piovesan, Allison; Caracausi, Maria
2018-01-01
Down syndrome (DS) is due to the presence of an extra full or partial chromosome 21 (Hsa21). The identification of genes contributing to DS pathogenesis could be the key to any rational therapy of the associated intellectual disability. We aim at generating quantitative transcriptome maps in DS integrating all gene expression profile datasets available for any cell type or tissue, to obtain a complete model of the transcriptome in terms of both expression values for each gene and segmental trend of gene expression along each chromosome. We used the TRAM (Transcriptome Mapper) software for this meta-analysis, comparing transcript expression levels and profiles between DS and normal brain, lymphoblastoid cell lines, blood cells, fibroblasts, thymus and induced pluripotent stem cells, respectively. TRAM combined, normalized, and integrated datasets from different sources and across diverse experimental platforms. The main output was a linear expression value that may be used as a reference for each of up to 37,181 mapped transcripts analyzed, related to both known genes and expression sequence tag (EST) clusters. An independent example in vitro validation of fibroblast transcriptome map data was performed through “Real-Time” reverse transcription polymerase chain reaction showing an excellent correlation coefficient (r = 0.93, p < 0.0001) with data obtained in silico. The availability of linear expression values for each gene allowed the testing of the gene dosage hypothesis of the expected 3:2 DS/normal ratio for Hsa21 as well as other human genes in DS, in addition to listing genes differentially expressed with statistical significance. Although a fraction of Hsa21 genes escapes dosage effects, Hsa21 genes are selectively over-expressed in DS samples compared to genes from other chromosomes, reflecting a decisive role in the pathogenesis of the syndrome. Finally, the analysis of chromosomal segments reveals a high prevalence of Hsa21 over-expressed segments over the other genomic regions, suggesting, in particular, a specific region on Hsa21 that appears to be frequently over-expressed (21q22). Our complete datasets are released as a new framework to investigate transcription in DS for individual genes as well as chromosomal segments in different cell types and tissues. PMID:29740474
Lee, Jungeun; Kang, Yoonjee; Shin, Seung Chul; Park, Hyun; Lee, Hyoungseok
2014-01-01
Background Antarctic hairgrass (Deschampsia antarctica Desv.) is the only natural grass species in the maritime Antarctic. It has been researched as an important ecological marker and as an extremophile plant for studies on stress tolerance. Despite its importance, little genomic information is available for D. antarctica. Here, we report the complete chloroplast genome, transcriptome profiles of the coding/noncoding genes, and the posttranscriptional processing by RNA editing in the chloroplast system. Results The complete chloroplast genome of D. antarctica is 135,362 bp in length with a typical quadripartite structure, including the large (LSC: 79,881 bp) and small (SSC: 12,519 bp) single-copy regions, separated by a pair of identical inverted repeats (IR: 21,481 bp). It contains 114 unique genes, including 81 unique protein-coding genes, 29 tRNA genes, and 4 rRNA genes. Sequence divergence analysis with other plastomes from the BEP clade of the grass family suggests a sister relationship between D. antarctica, Festuca arundinacea and Lolium perenne of the Poeae tribe, based on the whole plastome. In addition, we conducted high-resolution mapping of the chloroplast-derived transcripts. Thus, we created an expression profile for 81 protein-coding genes and identified ndhC, psbJ, rps19, psaJ, and psbA as the most highly expressed chloroplast genes. Small RNA-seq analysis identified 27 small noncoding RNAs of chloroplast origin that were preferentially located near the 5′- or 3′-ends of genes. We also found >30 RNA-editing sites in the D. antarctica chloroplast genome, with a dominance of C-to-U conversions. Conclusions We assembled and characterized the complete chloroplast genome sequence of D. antarctica and investigated the features of the plastid transcriptome. These data may contribute to a better understanding of the evolution of D. antarctica within the Poaceae family for use in molecular phylogenetic studies and may also help researchers understand the characteristics of the chloroplast transcriptome. PMID:24647560
Lee, Jungeun; Kang, Yoonjee; Shin, Seung Chul; Park, Hyun; Lee, Hyoungseok
2014-01-01
Antarctic hairgrass (Deschampsia antarctica Desv.) is the only natural grass species in the maritime Antarctic. It has been researched as an important ecological marker and as an extremophile plant for studies on stress tolerance. Despite its importance, little genomic information is available for D. antarctica. Here, we report the complete chloroplast genome, transcriptome profiles of the coding/noncoding genes, and the posttranscriptional processing by RNA editing in the chloroplast system. The complete chloroplast genome of D. antarctica is 135,362 bp in length with a typical quadripartite structure, including the large (LSC: 79,881 bp) and small (SSC: 12,519 bp) single-copy regions, separated by a pair of identical inverted repeats (IR: 21,481 bp). It contains 114 unique genes, including 81 unique protein-coding genes, 29 tRNA genes, and 4 rRNA genes. Sequence divergence analysis with other plastomes from the BEP clade of the grass family suggests a sister relationship between D. antarctica, Festuca arundinacea and Lolium perenne of the Poeae tribe, based on the whole plastome. In addition, we conducted high-resolution mapping of the chloroplast-derived transcripts. Thus, we created an expression profile for 81 protein-coding genes and identified ndhC, psbJ, rps19, psaJ, and psbA as the most highly expressed chloroplast genes. Small RNA-seq analysis identified 27 small noncoding RNAs of chloroplast origin that were preferentially located near the 5'- or 3'-ends of genes. We also found >30 RNA-editing sites in the D. antarctica chloroplast genome, with a dominance of C-to-U conversions. We assembled and characterized the complete chloroplast genome sequence of D. antarctica and investigated the features of the plastid transcriptome. These data may contribute to a better understanding of the evolution of D. antarctica within the Poaceae family for use in molecular phylogenetic studies and may also help researchers understand the characteristics of the chloroplast transcriptome.
A comprehensive catalogue of the coding and non-coding transcripts of the human inner ear
Corneveaux, Jason J.; Ohmen, Jeffrey; White, Cory; Allen, April N.; Lusis, Aldons J.; Van Camp, Guy; Huentelman, Matthew J.; Friedman, Rick A.
2015-01-01
The mammalian inner ear consists of the cochlea and the vestibular labyrinth (utricle, saccule, and semicircular canals), which participate in both hearing and balance. Proper development and life-long function of these structures involves a highly complex coordinated system of spatial and temporal gene expression. The characterization of the inner ear transcriptome is likely important for the functional study of auditory and vestibular components, yet, primarily due to tissue unavailability, detailed expression catalogues of the human inner ear remain largely incomplete. We report here, for the first time, comprehensive transcriptome characterization of the adult human cochlea, ampulla, saccule and utricle of the vestibule obtained from patients without hearing abnormalities. Using RNA-Seq, we measured the expression of >50,000 predicted genes corresponding to approximately 200,000 transcripts, in the adult inner ear and compared it to 32 other human tissues. First, we identified genes preferentially expressed in the inner ear, and unique either to the vestibule or cochlea. Next, we examined expression levels of specific groups of potentially interesting RNAs, such as genes implicated in hearing loss, long non-coding RNAs, pseudogenes and transcripts subject to nonsense mediated decay (NMD). We uncover the spatial specificity of expression of these RNAs in the hearing/balance system, and reveal evidence of tissue specific NMD. Lastly, we investigated the non-syndromic deafness loci to which no gene has been mapped, and narrow the list of potential candidates for each locus. These data represent the first high-resolution transcriptome catalogue of the adult human inner ear. A comprehensive identification of coding and non-coding RNAs in the inner ear will enable pathways of auditory and vestibular function to be further defined in the study of hearing and balance. Expression data are freely accessible at https://www.tgen.org/home/research/research-divisions/neurogenomics/supplementary-data/inner-ear-transcriptome.aspx PMID:26341477
2009-01-01
Background ESTs or variable sequence reads can be available in prokaryotic studies well before a complete genome is known. Use cases include (i) transcriptome studies or (ii) single cell sequencing of bacteria. Without suitable software their further analysis and mapping would have to await finalization of the corresponding genome. Results The tool JANE rapidly maps ESTs or variable sequence reads in prokaryotic sequencing and transcriptome efforts to related template genomes. It provides an easy-to-use graphics interface for information retrieval and a toolkit for EST or nucleotide sequence function prediction. Furthermore, we developed for rapid mapping an enhanced sequence alignment algorithm which reassembles and evaluates high scoring pairs provided from the BLAST algorithm. Rapid assembly on and replacement of the template genome by sequence reads or mapped ESTs is achieved. This is illustrated (i) by data from Staphylococci as well as from a Blattabacteria sequencing effort, (ii) mapping single cell sequencing reads is shown for poribacteria to sister phylum representative Rhodopirellula Baltica SH1. The algorithm has been implemented in a web-server accessible at http://jane.bioapps.biozentrum.uni-wuerzburg.de. Conclusion Rapid prokaryotic EST mapping or mapping of sequence reads is achieved applying JANE even without knowing the cognate genome sequence. PMID:19943962
Gluck, Christian; Min, Sangwon; Oyelakin, Akinsola; Smalley, Kirsten; Sinha, Satrajit; Romano, Rose-Anne
2016-11-16
Mouse models have served a valuable role in deciphering various facets of Salivary Gland (SG) biology, from normal developmental programs to diseased states. To facilitate such studies, gene expression profiling maps have been generated for various stages of SG organogenesis. However these prior studies fall short of capturing the transcriptional complexity due to the limited scope of gene-centric microarray-based technology. Compared to microarray, RNA-sequencing (RNA-seq) offers unbiased detection of novel transcripts, broader dynamic range and high specificity and sensitivity for detection of genes, transcripts, and differential gene expression. Although RNA-seq data, particularly under the auspices of the ENCODE project, have covered a large number of biological specimens, studies on the SG have been lacking. To better appreciate the wide spectrum of gene expression profiles, we isolated RNA from mouse submandibular salivary glands at different embryonic and adult stages. In parallel, we processed RNA-seq data for 24 organs and tissues obtained from the mouse ENCODE consortium and calculated the average gene expression values. To identify molecular players and pathways likely to be relevant for SG biology, we performed functional gene enrichment analysis, network construction and hierarchal clustering of the RNA-seq datasets obtained from different stages of SG development and maturation, and other mouse organs and tissues. Our bioinformatics-based data analysis not only reaffirmed known modulators of SG morphogenesis but revealed novel transcription factors and signaling pathways unique to mouse SG biology and function. Finally we demonstrated that the unique SG gene signature obtained from our mouse studies is also well conserved and can demarcate features of the human SG transcriptome that is different from other tissues. Our RNA-seq based Atlas has revealed a high-resolution cartographic view of the dynamic transcriptomic landscape of the mouse SG at various stages. These RNA-seq datasets will complement pre-existing microarray based datasets, including the Salivary Gland Molecular Anatomy Project by offering a broader systems-biology based perspective rather than the classical gene-centric view. Ultimately such resources will be valuable in providing a useful toolkit to better understand how the diverse cell population of the SG are organized and controlled during development and differentiation.
Unique cell-type-specific patterns of DNA methylation in the root meristem.
Kawakatsu, Taiji; Stuart, Tim; Valdes, Manuel; Breakfield, Natalie; Schmitz, Robert J; Nery, Joseph R; Urich, Mark A; Han, Xinwei; Lister, Ryan; Benfey, Philip N; Ecker, Joseph R
2016-04-29
DNA methylation is an epigenetic modification that differs between plant organs and tissues, but the extent of variation between cell types is not known. Here, we report single-base-resolution whole-genome DNA methylomes, mRNA transcriptomes and small RNA transcriptomes for six cell populations covering the major cell types of the Arabidopsis root meristem. We identify widespread cell-type-specific patterns of DNA methylation, especially in the CHH sequence context, where H is A, C or T. The genome of the columella root cap is the most highly methylated Arabidopsis cell characterized so far. It is hypermethylated within transposable elements (TEs), accompanied by increased abundance of transcripts encoding RNA-directed DNA methylation (RdDM) pathway components and 24-nt small RNAs (smRNAs). The absence of the nucleosome remodeller DECREASED DNA METHYLATION 1 (DDM1), required for maintenance of DNA methylation, and low abundance of histone transcripts involved in heterochromatin formation suggests that a loss of heterochromatin may occur in the columella, thus allowing access of RdDM factors to the whole genome, and producing an excess of 24-nt smRNAs in this tissue. Together, these maps provide new insights into the epigenomic diversity that exists between distinct plant somatic cell types.
Breaking the 1000-gene barrier for Mimivirus using ultra-deep genome and transcriptome sequencing.
Legendre, Matthieu; Santini, Sébastien; Rico, Alain; Abergel, Chantal; Claverie, Jean-Michel
2011-03-04
Mimivirus, a giant dsDNA virus infecting Acanthamoeba, is the prototype of the mimiviridae family, the latest addition to the family of the nucleocytoplasmic large DNA viruses (NCLDVs). Its 1.2 Mb-genome was initially predicted to encode 917 genes. A subsequent RNA-Seq analysis precisely mapped many transcript boundaries and identified 75 new genes. We now report a much deeper analysis using the SOLiD™ technology combining RNA-Seq of the Mimivirus transcriptome during the infectious cycle (202.4 Million reads), and a complete genome re-sequencing (45.3 Million reads). This study corrected the genome sequence and identified several single nucleotide polymorphisms. Our results also provided clear evidence of previously overlooked transcription units, including an important RNA polymerase subunit distantly related to Euryarchea homologues. The total Mimivirus gene count is now 1018, 11% greater than the original annotation. This study highlights the huge progress brought about by ultra-deep sequencing for the comprehensive annotation of virus genomes, opening the door to a complete one-nucleotide resolution level description of their transcriptional activity, and to the realistic modeling of the viral genome expression at the ultimate molecular level. This work also illustrates the need to go beyond bioinformatics-only approaches for the annotation of short protein and non-coding genes in viral genomes.
Pseudotemporal Ordering of Single Cells Reveals Metabolic Control of Postnatal β Cell Proliferation.
Zeng, Chun; Mulas, Francesca; Sui, Yinghui; Guan, Tiffany; Miller, Nathanael; Tan, Yuliang; Liu, Fenfen; Jin, Wen; Carrano, Andrea C; Huising, Mark O; Shirihai, Orian S; Yeo, Gene W; Sander, Maike
2017-05-02
Pancreatic β cell mass for appropriate blood glucose control is established during early postnatal life. β cell proliferative capacity declines postnatally, but the extrinsic cues and intracellular signals that cause this decline remain unknown. To obtain a high-resolution map of β cell transcriptome dynamics after birth, we generated single-cell RNA-seq data of β cells from multiple postnatal time points and ordered cells based on transcriptional similarity using a new analytical tool. This analysis captured signatures of immature, proliferative β cells and established high expression of amino acid metabolic, mitochondrial, and Srf/Jun/Fos transcription factor genes as their hallmark feature. Experimental validation revealed high metabolic activity in immature β cells and a role for reactive oxygen species and Srf/Jun/Fos transcription factors in driving postnatal β cell proliferation and mass expansion. Our work provides the first high-resolution molecular characterization of state changes in postnatal β cells and paves the way for the identification of novel therapeutic targets to stimulate β cell regeneration. Copyright © 2017 Elsevier Inc. All rights reserved.
Celedon, Jose M; Yuen, Macaire M S; Chiang, Angela; Henderson, Hannah; Reid, Karen E; Bohlmann, Jörg
2017-11-01
Plant defenses often involve specialized cells and tissues. In conifers, specialized cells of the bark are important for defense against insects and pathogens. Using laser microdissection, we characterized the transcriptomes of cortical resin duct cells, phenolic cells and phloem of white spruce (Picea glauca) bark under constitutive and methyl jasmonate (MeJa)-induced conditions, and we compared these transcriptomes with the transcriptome of the bark tissue complex. Overall, ~3700 bark transcripts were differentially expressed in response to MeJa. Approximately 25% of transcripts were expressed in only one cell type, revealing cell specialization at the transcriptome level. MeJa caused cell-type-specific transcriptome responses and changed the overall patterns of cell-type-specific transcript accumulation. Comparison of transcriptomes of the conifer bark tissue complex and specialized cells resolved a masking effect inherent to transcriptome analysis of complex tissues, and showed the actual cell-type-specific transcriptome signatures. Characterization of cell-type-specific transcriptomes is critical to reveal the dynamic patterns of spatial and temporal display of constitutive and induced defense systems in a complex plant tissue or organ. This was demonstrated with the improved resolution of spatially restricted expression of sets of genes of secondary metabolism in the specialized cell types. © 2017 The Authors The Plant Journal published by John Wiley & Sons Ltd and Society for Experimental Biology.
LungMAP: The Molecular Atlas of Lung Development Program
Ardini-Poleske, Maryanne E.; Ansong, Charles; Carson, James P.; Corley, Richard A.; Deutsch, Gail H.; Hagood, James S.; Kaminski, Naftali; Mariani, Thomas J.; Potter, Steven S.; Pryhuber, Gloria S.; Warburton, David; Whitsett, Jeffrey A.; Palmer, Scott M.; Ambalavanan, Namasivayam
2017-01-01
The National Heart, Lung, and Blood Institute is funding an effort to create a molecular atlas of the developing lung (LungMAP) to serve as a research resource and public education tool. The lung is a complex organ with lengthy development time driven by interactive gene networks and dynamic cross talk among multiple cell types to control and coordinate lineage specification, cell proliferation, differentiation, migration, morphogenesis, and injury repair. A better understanding of the processes that regulate lung development, particularly alveologenesis, will have a significant impact on survival rates for premature infants born with incomplete lung development and will facilitate lung injury repair and regeneration in adults. A consortium of four research centers, a data coordinating center, and a human tissue repository provides high-quality molecular data of developing human and mouse lungs. LungMAP includes mouse and human data for cross correlation of developmental processes across species. LungMAP is generating foundational data and analysis, creating a web portal for presentation of results and public sharing of data sets, establishing a repository of young human lung tissues obtained through organ donor organizations, and developing a comprehensive lung ontology that incorporates the latest findings of the consortium. The LungMAP website (www.lungmap.net) currently contains more than 6,000 high-resolution lung images and transcriptomic, proteomic, and lipidomic human and mouse data and provides scientific information to stimulate interest in research careers for young audiences. This paper presents a brief description of research conducted by the consortium, database, and portal development and upcoming features that will enhance the LungMAP experience for a community of users. PMID:28798251
Murray, John Isaac
2018-05-01
The convergence of developmental biology and modern genomics tools brings the potential for a comprehensive understanding of developmental systems. This is especially true for the Caenorhabditis elegans embryo because its small size, invariant developmental lineage, and powerful genetic and genomic tools provide the prospect of a cellular resolution understanding of messenger RNA (mRNA) expression and regulation across the organism. We describe here how a systems biology framework might allow large-scale determination of the embryonic regulatory relationships encoded in the C. elegans genome. This framework consists of two broad steps: (a) defining the "parts list"-all genes expressed in all cells at each time during development and (b) iterative steps of computational modeling and refinement of these models by experimental perturbation. Substantial progress has been made towards defining the parts list through imaging methods such as large-scale green fluorescent protein (GFP) reporter analysis. Imaging results are now being augmented by high-resolution transcriptome methods such as single-cell RNA sequencing, and it is likely the complete expression patterns of all genes across the embryo will be known within the next few years. In contrast, the modeling and perturbation experiments performed so far have focused largely on individual cell types or genes, and improved methods will be needed to expand them to the full genome and organism. This emerging comprehensive map of embryonic expression and regulatory function will provide a powerful resource for developmental biologists, and would also allow scientists to ask questions not accessible without a comprehensive picture. This article is categorized under: Invertebrate Organogenesis > Worms Technologies > Analysis of the Transcriptome Gene Expression and Transcriptional Hierarchies > Gene Networks and Genomics. © 2018 Wiley Periodicals, Inc.
Ndeve, Arsenio Daniel; Huynh, Bao-Lam; Matthews, William Charles; Roberts, Philip Alan
2018-01-01
Cowpea is one of the most important food and forage legumes in drier regions of the tropics and subtropics. However, cowpea yield worldwide is markedly below the known potential due to abiotic and biotic stresses, including parasitism by root-knot nematodes (Meloidogyne spp., RKN). Two resistance genes with dominant effect, Rk and Rk2, have been reported to provide resistance against RKN in cowpea. Despite their description and use in breeding for resistance to RKN and particularly genetic mapping of the Rk locus, the exact genes conferring resistance to RKN remain unknown. In the present work, QTL mapping using recombinant inbred line (RIL) population 524B x IT84S-2049 segregating for a newly mapped locus and analysis of the transcriptome changes in two cowpea near-isogenic lines (NIL) were used to identify candidate genes for Rk and the newly mapped locus. A major QTL, designated QRk-vu9.1, associated with resistance to Meloidogyne javanica reproduction, was detected and mapped on linkage group LG9 at position 13.37 cM using egg production data. Transcriptome analysis on resistant and susceptible NILs 3 and 9 days after inoculation revealed up-regulation of 109 and 98 genes and down-regulation of 110 and 89 genes, respectively, out of 19,922 unique genes mapped to the common bean reference genome. Among the differentially expressed genes, four and nine genes were found within the QRk-vu9.1 and QRk-vu11.1 QTL intervals, respectively. Six of these genes belong to the TIR-NBS-LRR family of resistance genes and three were upregulated at one or more time-points. Quantitative RT-PCR validated gene expression to be positively correlated with RNA-seq expression pattern for eight genes. Future functional analysis of these cowpea genes will enhance our understanding of Rk-mediated resistance and identify the specific gene responsible for the resistance. PMID:29300744
Santos, Jansen Rodrigo Pereira; Ndeve, Arsenio Daniel; Huynh, Bao-Lam; Matthews, William Charles; Roberts, Philip Alan
2018-01-01
Cowpea is one of the most important food and forage legumes in drier regions of the tropics and subtropics. However, cowpea yield worldwide is markedly below the known potential due to abiotic and biotic stresses, including parasitism by root-knot nematodes (Meloidogyne spp., RKN). Two resistance genes with dominant effect, Rk and Rk2, have been reported to provide resistance against RKN in cowpea. Despite their description and use in breeding for resistance to RKN and particularly genetic mapping of the Rk locus, the exact genes conferring resistance to RKN remain unknown. In the present work, QTL mapping using recombinant inbred line (RIL) population 524B x IT84S-2049 segregating for a newly mapped locus and analysis of the transcriptome changes in two cowpea near-isogenic lines (NIL) were used to identify candidate genes for Rk and the newly mapped locus. A major QTL, designated QRk-vu9.1, associated with resistance to Meloidogyne javanica reproduction, was detected and mapped on linkage group LG9 at position 13.37 cM using egg production data. Transcriptome analysis on resistant and susceptible NILs 3 and 9 days after inoculation revealed up-regulation of 109 and 98 genes and down-regulation of 110 and 89 genes, respectively, out of 19,922 unique genes mapped to the common bean reference genome. Among the differentially expressed genes, four and nine genes were found within the QRk-vu9.1 and QRk-vu11.1 QTL intervals, respectively. Six of these genes belong to the TIR-NBS-LRR family of resistance genes and three were upregulated at one or more time-points. Quantitative RT-PCR validated gene expression to be positively correlated with RNA-seq expression pattern for eight genes. Future functional analysis of these cowpea genes will enhance our understanding of Rk-mediated resistance and identify the specific gene responsible for the resistance.
Draft De Novo Transcriptome of the Rat Kangaroo Potorous tridactylus as a Tool for Cell Biology
Udy, Dylan B.; Voorhies, Mark; Chan, Patricia P.; Lowe, Todd M.; Dumont, Sophie
2015-01-01
The rat kangaroo (long-nosed potoroo, Potorous tridactylus) is a marsupial native to Australia. Cultured rat kangaroo kidney epithelial cells (PtK) are commonly used to study cell biological processes. These mammalian cells are large, adherent, and flat, and contain large and few chromosomes—and are thus ideal for imaging intra-cellular dynamics such as those of mitosis. Despite this, neither the rat kangaroo genome nor transcriptome have been sequenced, creating a challenge for probing the molecular basis of these cellular dynamics. Here, we present the sequencing, assembly and annotation of the draft rat kangaroo de novo transcriptome. We sequenced 679 million reads that mapped to 347,323 Trinity transcripts and 20,079 Unigenes. We present statistics emerging from transcriptome-wide analyses, and analyses suggesting that the transcriptome covers full-length sequences of most genes, many with multiple isoforms. We also validate our findings with a proof-of-concept gene knockdown experiment. We expect that this high quality transcriptome will make rat kangaroo cells a more tractable system for linking molecular-scale function and cellular-scale dynamics. PMID:26252667
Draft De Novo Transcriptome of the Rat Kangaroo Potorous tridactylus as a Tool for Cell Biology.
Udy, Dylan B; Voorhies, Mark; Chan, Patricia P; Lowe, Todd M; Dumont, Sophie
2015-01-01
The rat kangaroo (long-nosed potoroo, Potorous tridactylus) is a marsupial native to Australia. Cultured rat kangaroo kidney epithelial cells (PtK) are commonly used to study cell biological processes. These mammalian cells are large, adherent, and flat, and contain large and few chromosomes-and are thus ideal for imaging intra-cellular dynamics such as those of mitosis. Despite this, neither the rat kangaroo genome nor transcriptome have been sequenced, creating a challenge for probing the molecular basis of these cellular dynamics. Here, we present the sequencing, assembly and annotation of the draft rat kangaroo de novo transcriptome. We sequenced 679 million reads that mapped to 347,323 Trinity transcripts and 20,079 Unigenes. We present statistics emerging from transcriptome-wide analyses, and analyses suggesting that the transcriptome covers full-length sequences of most genes, many with multiple isoforms. We also validate our findings with a proof-of-concept gene knockdown experiment. We expect that this high quality transcriptome will make rat kangaroo cells a more tractable system for linking molecular-scale function and cellular-scale dynamics.
... Sheets A Brief Guide to Genomics About NHGRI Research About the International HapMap Project Biological Pathways Chromosome Abnormalities Chromosomes Cloning Comparative Genomics DNA Microarray Technology DNA Sequencing Deoxyribonucleic Acid ( ...
Ament, Seth A; Pearl, Jocelynn R; Grindeland, Andrea; St Claire, Jason; Earls, John C; Kovalenko, Marina; Gillis, Tammy; Mysore, Jayalakshmi; Gusella, James F; Lee, Jong-Min; Kwak, Seung; Howland, David; Lee, Min Young; Baxter, David; Scherler, Kelsey; Wang, Kai; Geman, Donald; Carroll, Jeffrey B; MacDonald, Marcy E; Carlson, George; Wheeler, Vanessa C; Price, Nathan D; Hood, Leroy E
2017-03-01
Huntington's disease is a dominantly inherited neurodegenerative disease caused by the expansion of a CAG repeat in the HTT gene. In addition to the length of the CAG expansion, factors such as genetic background have been shown to contribute to the age at onset of neurological symptoms. A central challenge in understanding the disease progression that leads from the HD mutation to massive cell death in the striatum is the ability to characterize the subtle and early functional consequences of the CAG expansion longitudinally. We used dense time course sampling between 4 and 20 postnatal weeks to characterize early transcriptomic, molecular and cellular phenotypes in the striatum of six distinct knock-in mouse models of the HD mutation. We studied the effects of the HttQ111 allele on the C57BL/6J, CD-1, FVB/NCr1, and 129S2/SvPasCrl genetic backgrounds, and of two additional alleles, HttQ92 and HttQ50, on the C57BL/6J background. We describe the emergence of a transcriptomic signature in HttQ111/+ mice involving hundreds of differentially expressed genes and changes in diverse molecular pathways. We also show that this time course spanned the onset of mutant huntingtin nuclear localization phenotypes and somatic CAG-length instability in the striatum. Genetic background strongly influenced the magnitude and age at onset of these effects. This work provides a foundation for understanding the earliest transcriptional and molecular changes contributing to HD pathogenesis. © The Author 2017. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
Munger, Steven C.; Raghupathy, Narayanan; Choi, Kwangbom; Simons, Allen K.; Gatti, Daniel M.; Hinerfeld, Douglas A.; Svenson, Karen L.; Keller, Mark P.; Attie, Alan D.; Hibbs, Matthew A.; Graber, Joel H.; Chesler, Elissa J.; Churchill, Gary A.
2014-01-01
Massively parallel RNA sequencing (RNA-seq) has yielded a wealth of new insights into transcriptional regulation. A first step in the analysis of RNA-seq data is the alignment of short sequence reads to a common reference genome or transcriptome. Genetic variants that distinguish individual genomes from the reference sequence can cause reads to be misaligned, resulting in biased estimates of transcript abundance. Fine-tuning of read alignment algorithms does not correct this problem. We have developed Seqnature software to construct individualized diploid genomes and transcriptomes for multiparent populations and have implemented a complete analysis pipeline that incorporates other existing software tools. We demonstrate in simulated and real data sets that alignment to individualized transcriptomes increases read mapping accuracy, improves estimation of transcript abundance, and enables the direct estimation of allele-specific expression. Moreover, when applied to expression QTL mapping we find that our individualized alignment strategy corrects false-positive linkage signals and unmasks hidden associations. We recommend the use of individualized diploid genomes over reference sequence alignment for all applications of high-throughput sequencing technology in genetically diverse populations. PMID:25236449
Characterization of Chiton Ischnochiton hakodadensis Foot Based on Transcriptome Sequencing
NASA Astrophysics Data System (ADS)
Dou, Huaiqian; Miao, Yan; Li, Yuli; Li, Yangping; Dai, Xiaoting; Zhang, Xiaokang; Liang, Pengyu; Liu, Weizhi; Wang, Shi; Bao, Zhenmin
2018-06-01
Chiton ( Ischnochiton hakodadensis) is one of marine mollusks well known for its eight separate shell plates. I. hakodadensis is important, which plays a vital role in the ecosystems it inhabits. So far, the genetic studies on the chiton are scarce due in part to insufficient genomic resources available for this species. In this study, we investigated the transcriptome of the chiton foot using Illumina sequencing technology. The reads were assembled and clustered into 256461 unigenes, of which 42247 were divided into diverse functional categories by Gene Ontology (GO) annotation terms, and 17256 mapped onto 365 pathways by KEGG pathway mapping. Meanwhile, a set of differentially expressed genes (DEGs) between distal and proximal muscles were identified as the foot adhesive locomotion associated, thus were useful for our future studies. Moreover, up to 679384 high-quality single nucleotide polymorphisms (SNPs) and 19814 simple sequence repeats (SSRs) were identified in this study, which are valuable for subsequent studies on genetic diversity and variation. The transcriptomic resource obtained in this study should aid to future genetic and genomic studies of chiton.
Xu, Yiming; Smith, Scot E; Grunwald, Sabine; Abd-Elrahman, Amr; Wani, Suhas P; Nair, Vimala D
2017-09-11
Digital soil mapping (DSM) is gaining momentum as a technique to help smallholder farmers secure soil security and food security in developing regions. However, communications of the digital soil mapping information between diverse audiences become problematic due to the inconsistent scale of DSM information. Spatial downscaling can make use of accessible soil information at relatively coarse spatial resolution to provide valuable soil information at relatively fine spatial resolution. The objective of this research was to disaggregate the coarse spatial resolution soil exchangeable potassium (K ex ) and soil total nitrogen (TN) base map into fine spatial resolution soil downscaled map using weighted generalized additive models (GAMs) in two smallholder villages in South India. By incorporating fine spatial resolution spectral indices in the downscaling process, the soil downscaled maps not only conserve the spatial information of coarse spatial resolution soil maps but also depict the spatial details of soil properties at fine spatial resolution. The results of this study demonstrated difference between the fine spatial resolution downscaled maps and fine spatial resolution base maps is smaller than the difference between coarse spatial resolution base maps and fine spatial resolution base maps. The appropriate and economical strategy to promote the DSM technique in smallholder farms is to develop the relatively coarse spatial resolution soil prediction maps or utilize available coarse spatial resolution soil maps at the regional scale and to disaggregate these maps to the fine spatial resolution downscaled soil maps at farm scale.
A method for the further assembly of targeted unigenes in a transcriptome after assembly by Trinity
Xiao, Xinlong; Ma, Jinbiao; Sun, Yufang; Yao, Yinan
2015-01-01
RNA-sequencing has been widely used to obtain high throughput transcriptome sequences in various species, but the assembly of a full set of complete transcripts is still a significant challenge. Judging by the number of expected transcripts and assembled unigenes in a transcriptome library, we believe that some unigenes could be reassembled. In this study, using the nitrate transporter (NRT) gene family and phosphate transporter (PHT) gene family in Salicornia europaea as examples, we introduced an approach to further assemble unigenes found in transcriptome libraries which had been previously generated by Trinity. To find the unigenes of a particular transcript that contained gaps, we respectively selected 16 NRT candidate unigene pairs and 12 PHT candidate unigene pairs for which the two unigenes had the same annotations, the same expression patterns among various RNA-seq samples, and different positions of the proteins coded as mapped to a reference protein. To fill a gap between the two unigenes, PCR was performed using primers that mapped to the two unigenes and the PCR products were sequenced, which demonstrated that 5 unigene pairs of NRT and 3 unigene pairs of PHT could be reassembled when the gaps were filled using the corresponding PCR product sequences. This fast and simple method will reduce the redundancy of targeted unigenes and allow acquisition of complete coding sequences (CDS). PMID:26528307
Plouhinec, Jean-Louis; Medina-Ruiz, Sofía; Borday, Caroline; Bernard, Elsa; Vert, Jean-Philippe; Eisen, Michael B; Harland, Richard M; Monsoro-Burq, Anne H
2017-10-01
During vertebrate neurulation, the embryonic ectoderm is patterned into lineage progenitors for neural plate, neural crest, placodes and epidermis. Here, we use Xenopus laevis embryos to analyze the spatial and temporal transcriptome of distinct ectodermal domains in the course of neurulation, during the establishment of cell lineages. In order to define the transcriptome of small groups of cells from a single germ layer and to retain spatial information, dorsal and ventral ectoderm was subdivided along the anterior-posterior and medial-lateral axes by microdissections. Principal component analysis on the transcriptomes of these ectoderm fragments primarily identifies embryonic axes and temporal dynamics. This provides a genetic code to define positional information of any ectoderm sample along the anterior-posterior and dorsal-ventral axes directly from its transcriptome. In parallel, we use nonnegative matrix factorization to predict enhanced gene expression maps onto early and mid-neurula embryos, and specific signatures for each ectoderm area. The clustering of spatial and temporal datasets allowed detection of multiple biologically relevant groups (e.g., Wnt signaling, neural crest development, sensory placode specification, ciliogenesis, germ layer specification). We provide an interactive network interface, EctoMap, for exploring synexpression relationships among genes expressed in the neurula, and suggest several strategies to use this comprehensive dataset to address questions in developmental biology as well as stem cell or cancer research.
Borday, Caroline; Bernard, Elsa; Vert, Jean-Philippe; Eisen, Michael B.; Harland, Richard M.
2017-01-01
During vertebrate neurulation, the embryonic ectoderm is patterned into lineage progenitors for neural plate, neural crest, placodes and epidermis. Here, we use Xenopus laevis embryos to analyze the spatial and temporal transcriptome of distinct ectodermal domains in the course of neurulation, during the establishment of cell lineages. In order to define the transcriptome of small groups of cells from a single germ layer and to retain spatial information, dorsal and ventral ectoderm was subdivided along the anterior-posterior and medial-lateral axes by microdissections. Principal component analysis on the transcriptomes of these ectoderm fragments primarily identifies embryonic axes and temporal dynamics. This provides a genetic code to define positional information of any ectoderm sample along the anterior-posterior and dorsal-ventral axes directly from its transcriptome. In parallel, we use nonnegative matrix factorization to predict enhanced gene expression maps onto early and mid-neurula embryos, and specific signatures for each ectoderm area. The clustering of spatial and temporal datasets allowed detection of multiple biologically relevant groups (e.g., Wnt signaling, neural crest development, sensory placode specification, ciliogenesis, germ layer specification). We provide an interactive network interface, EctoMap, for exploring synexpression relationships among genes expressed in the neurula, and suggest several strategies to use this comprehensive dataset to address questions in developmental biology as well as stem cell or cancer research. PMID:29049289
Kudapa, Himabindu; Bharti, Arvind K; Cannon, Steven B; Farmer, Andrew D; Mulaosmanovic, Benjamin; Kramer, Robin; Bohra, Abhishek; Weeks, Nathan T; Crow, John A; Tuteja, Reetu; Shah, Trushar; Dutta, Sutapa; Gupta, Deepak K; Singh, Archana; Gaikwad, Kishor; Sharma, Tilak R; May, Gregory D; Singh, Nagendra K; Varshney, Rajeev K
2012-09-01
A comprehensive transcriptome assembly for pigeonpea has been developed by analyzing 128.9 million short Illumina GA IIx single end reads, 2.19 million single end FLX/454 reads, and 18 353 Sanger expressed sequenced tags from more than 16 genotypes. The resultant transcriptome assembly, referred to as CcTA v2, comprised 21 434 transcript assembly contigs (TACs) with an N50 of 1510 bp, the largest one being ~8 kb. Of the 21 434 TACs, 16 622 (77.5%) could be mapped on to the soybean genome build 1.0.9 under fairly stringent alignment parameters. Based on knowledge of intron junctions, 10 009 primer pairs were designed from 5033 TACs for amplifying intron spanning regions (ISRs). By using in silico mapping of BAC-end-derived SSR loci of pigeonpea on the soybean genome as a reference, putative mapping positions at the chromosome level were predicted for 6284 ISR markers, covering all 11 pigeonpea chromosomes. A subset of 128 ISR markers were analyzed on a set of eight genotypes. While 116 markers were validated, 70 markers showed one to three alleles, with an average of 0.16 polymorphism information content (PIC) value. In summary, the CcTA v2 transcript assembly and ISR markers will serve as a useful resource to accelerate genetic research and breeding applications in pigeonpea.
The protein expression landscape of the Arabidopsis root
Petricka, Jalean J.; Schauer, Monica A.; Megraw, Molly; Breakfield, Natalie W.; Thompson, J. Will; Georgiev, Stoyan; Soderblom, Erik J.; Ohler, Uwe; Moseley, Martin Arthur; Grossniklaus, Ueli; Benfey, Philip N.
2012-01-01
Because proteins are the major functional components of cells, knowledge of their cellular localization is crucial to gaining an understanding of the biology of multicellular organisms. We have generated a protein expression map of the Arabidopsis root providing the identity and cell type-specific localization of nearly 2,000 proteins. Grouping proteins into functional categories revealed unique cellular functions and identified cell type-specific biomarkers. Cellular colocalization provided support for numerous protein–protein interactions. With a binary comparison, we found that RNA and protein expression profiles are weakly correlated. We then performed peak integration at cell type-specific resolution and found an improved correlation with transcriptome data using continuous values. We performed GeLC-MS/MS (in-gel tryptic digestion followed by liquid chromatography-tandem mass spectrometry) proteomic experiments on mutants with ectopic and no root hairs, providing complementary proteomic data. Finally, among our root hair-specific proteins we identified two unique regulators of root hair development. PMID:22447775
Droplet barcoding for single cell transcriptomics applied to embryonic stem cells
Klein, Allon M; Mazutis, Linas; Akartuna, Ilke; Tallapragada, Naren; Veres, Adrian; Li, Victor; Peshkin, Leonid; Weitz, David A; Kirschner, Marc W
2015-01-01
Summary It has long been the dream of biologists to map gene expression at the single cell level. With such data one might track heterogeneous cell sub-populations, and infer regulatory relationships between genes and pathways. Recently, RNA sequencing has achieved single cell resolution. What is limiting is an effective way to routinely isolate and process large numbers of individual cells for quantitative in-depth sequencing. We have developed a high-throughput droplet-microfluidic approach for barcoding the RNA from thousands of individual cells for subsequent analysis by next-generation sequencing. The method shows a surprisingly low noise profile and is readily adaptable to other sequencing-based assays. We analyzed mouse embryonic stem cells, revealing in detail the population structure and the heterogeneous onset of differentiation after LIF withdrawal. The reproducibility of these high-throughput single cell data allowed us to deconstruct cell populations and infer gene expression relationships. PMID:26000487
Genome sequence and genetic diversity of the common carp, Cyprinus carpio.
Xu, Peng; Zhang, Xiaofeng; Wang, Xumin; Li, Jiongtang; Liu, Guiming; Kuang, Youyi; Xu, Jian; Zheng, Xianhu; Ren, Lufeng; Wang, Guoliang; Zhang, Yan; Huo, Linhe; Zhao, Zixia; Cao, Dingchen; Lu, Cuiyun; Li, Chao; Zhou, Yi; Liu, Zhanjiang; Fan, Zhonghua; Shan, Guangle; Li, Xingang; Wu, Shuangxiu; Song, Lipu; Hou, Guangyuan; Jiang, Yanliang; Jeney, Zsigmond; Yu, Dan; Wang, Li; Shao, Changjun; Song, Lai; Sun, Jing; Ji, Peifeng; Wang, Jian; Li, Qiang; Xu, Liming; Sun, Fanyue; Feng, Jianxin; Wang, Chenghui; Wang, Shaolin; Wang, Baosen; Li, Yan; Zhu, Yaping; Xue, Wei; Zhao, Lan; Wang, Jintu; Gu, Ying; Lv, Weihua; Wu, Kejing; Xiao, Jingfa; Wu, Jiayan; Zhang, Zhang; Yu, Jun; Sun, Xiaowen
2014-11-01
The common carp, Cyprinus carpio, is one of the most important cyprinid species and globally accounts for 10% of freshwater aquaculture production. Here we present a draft genome of domesticated C. carpio (strain Songpu), whose current assembly contains 52,610 protein-coding genes and approximately 92.3% coverage of its paleotetraploidized genome (2n = 100). The latest round of whole-genome duplication has been estimated to have occurred approximately 8.2 million years ago. Genome resequencing of 33 representative individuals from worldwide populations demonstrates a single origin for C. carpio in 2 subspecies (C. carpio Haematopterus and C. carpio carpio). Integrative genomic and transcriptomic analyses were used to identify loci potentially associated with traits including scaling patterns and skin color. In combination with the high-resolution genetic map, the draft genome paves the way for better molecular studies and improved genome-assisted breeding of C. carpio and other closely related species.
Oh, Dong-Ha; Barkla, Bronwyn J; Vera-Estrella, Rosario; Pantoja, Omar; Lee, Sang-Yeol; Bohnert, Hans J; Dassanayake, Maheshi
2015-08-01
Mesembryanthemum crystallinum (ice plant) exhibits extreme tolerance to salt. Epidermal bladder cells (EBCs), developing on the surface of aerial tissues and specialized in sodium sequestration and other protective functions, are critical for the plant's stress adaptation. We present the first transcriptome analysis of EBCs isolated from intact plants, to investigate cell type-specific responses during plant salt adaptation. We developed a de novo assembled, nonredundant EBC reference transcriptome. Using RNAseq, we compared the expression patterns of the EBC-specific transcriptome between control and salt-treated plants. The EBC reference transcriptome consists of 37 341 transcript-contigs, of which 7% showed significantly different expression between salt-treated and control samples. We identified significant changes in ion transport, metabolism related to energy generation and osmolyte accumulation, stress signalling, and organelle functions, as well as a number of lineage-specific genes of unknown function, in response to salt treatment. The salinity-induced EBC transcriptome includes active transcript clusters, refuting the view of EBCs as passive storage compartments in the whole-plant stress response. EBC transcriptomes, differing from those of whole plants or leaf tissue, exemplify the importance of cell type-specific resolution in understanding stress adaptive mechanisms. No claim to original US government works. New Phytologist © 2015 New Phytologist Trust.
LungMAP: The Molecular Atlas of Lung Development Program.
Ardini-Poleske, Maryanne E; Clark, Robert F; Ansong, Charles; Carson, James P; Corley, Richard A; Deutsch, Gail H; Hagood, James S; Kaminski, Naftali; Mariani, Thomas J; Potter, Steven S; Pryhuber, Gloria S; Warburton, David; Whitsett, Jeffrey A; Palmer, Scott M; Ambalavanan, Namasivayam
2017-11-01
The National Heart, Lung, and Blood Institute is funding an effort to create a molecular atlas of the developing lung (LungMAP) to serve as a research resource and public education tool. The lung is a complex organ with lengthy development time driven by interactive gene networks and dynamic cross talk among multiple cell types to control and coordinate lineage specification, cell proliferation, differentiation, migration, morphogenesis, and injury repair. A better understanding of the processes that regulate lung development, particularly alveologenesis, will have a significant impact on survival rates for premature infants born with incomplete lung development and will facilitate lung injury repair and regeneration in adults. A consortium of four research centers, a data coordinating center, and a human tissue repository provides high-quality molecular data of developing human and mouse lungs. LungMAP includes mouse and human data for cross correlation of developmental processes across species. LungMAP is generating foundational data and analysis, creating a web portal for presentation of results and public sharing of data sets, establishing a repository of young human lung tissues obtained through organ donor organizations, and developing a comprehensive lung ontology that incorporates the latest findings of the consortium. The LungMAP website (www.lungmap.net) currently contains more than 6,000 high-resolution lung images and transcriptomic, proteomic, and lipidomic human and mouse data and provides scientific information to stimulate interest in research careers for young audiences. This paper presents a brief description of research conducted by the consortium, database, and portal development and upcoming features that will enhance the LungMAP experience for a community of users. Copyright © 2017 the American Physiological Society.
Salvermoser, Melanie; Chotewutmontri, Sasithorn; Braspenning-Wesch, Ilona; Hasche, Daniel; Rösl, Frank; Vinzón, Sabrina E
2016-07-01
Mastomys coucha, an African rodent, is a useful animal model of papillomavirus infection, as it develops both premalignant and malignant skin tumors as a consequence of a persistent infection with Mastomys natalensis papillomavirus (MnPV). In this study, we mapped the MnPV transcriptome in productive lesions by both classical molecular techniques and high-throughput RNA sequencing. Combination of these methods revealed a complex and comprehensive transcription map, with novel splicing events not described in other papillomaviruses. Furthermore, these splicing occurrences could potentially lead to the expression of novel E2, E1∧E4, E7 and L2 isoforms. Expression level estimation of each transcript showed that late-region mRNAs considerably outnumber early transcripts, with species coding for L1 and E1∧E4 being the most abundant. In summary, the full transcription map assembled in this study will allow us to further understand MnPV gene expression and the mechanisms that lead to natural tumour development.
2011-01-01
Background Apomixis, asexual seed production in plants, holds great potential for agriculture as a means to fix hybrid vigor. Apospory is a form of apomixis where the embryo develops from an unreduced egg that is derived from a somatic nucellar cell, the aposporous initial, via mitosis. Understanding the molecular mechanism regulating aposporous initial specification will be a critical step toward elucidation of apomixis and also provide insight into developmental regulation and downstream signaling that results in apomixis. To discover candidate transcripts for regulating aposporous initial specification in P. squamulatum, we compared two transcriptomes derived from microdissected ovules at the stage of aposporous initial formation between the apomictic donor parent, P. squamulatum (accession PS26), and an apomictic derived backcross 8 (BC8) line containing only the Apospory-Specific Genomic Region (ASGR)-carrier chromosome from P. squamulatum. Toward this end, two transcriptomes derived from ovules of an apomictic donor parent and its apomictic backcross derivative at the stage of apospory initiation, were sequenced using 454-FLX technology. Results Using 454-FLX technology, we generated 332,567 reads with an average read length of 147 base pairs (bp) for the PS26 ovule transcriptome library and 363,637 reads with an average read length of 142 bp for the BC8 ovule transcriptome library. A total of 33,977 contigs from the PS26 ovule transcriptome library and 26,576 contigs from the BC8 ovule transcriptome library were assembled using the Multifunctional Inertial Reference Assembly program. Using stringent in silico parameters, 61 transcripts were predicted to map to the ASGR-carrier chromosome, of which 49 transcripts were verified as ASGR-carrier chromosome specific. One of the alien expressed genes could be assigned as tightly linked to the ASGR by screening of apomictic and sexual F1s. Only one transcript, which did not map to the ASGR, showed expression primarily in reproductive tissue. Conclusions Our results suggest that a strategy of comparative sequencing of transcriptomes between donor parent and backcross lines containing an alien chromosome of interest can be an efficient method of identifying transcripts derived from an alien chromosome in a chromosome addition line. PMID:21521529
Transcriptome-enabled marker discovery and mapping of plastochron-related genes in Petunia spp.
Guo, Yufang; Wiegert-Rininger, Krystle E; Vallejo, Veronica A; Barry, Cornelius S; Warner, Ryan M
2015-09-24
Petunia (Petunia × hybrida), derived from a hybrid between P. axillaris and P. integrifolia, is one of the most economically important bedding plant crops and Petunia spp. serve as model systems for investigating the mechanisms underlying diverse mating systems and pollination syndromes. In addition, we have previously described genetic variation and quantitative trait loci (QTL) related to petunia development rate and morphology, which represent important breeding targets for the floriculture industry to improve crop production and performance. Despite the importance of petunia as a crop, the floriculture industry has been slow to adopt marker assisted selection to facilitate breeding strategies and there remains a limited availability of sequences and molecular markers from the genus compared to other economically important members of the Solanaceae family such as tomato, potato and pepper. Here we report the de novo assembly, annotation and characterization of transcriptomes from P. axillaris, P. exserta and P. integrifolia. Each transcriptome assembly was derived from five tissue libraries (callus, 3-week old seedlings, shoot apices, flowers of mixed developmental stages, and trichomes). A total of 74,573, 54,913, and 104,739 assembled transcripts were recovered from P. axillaris, P. exserta and P. integrifolia, respectively and following removal of multiple isoforms, 32,994 P. axillaris, 30,225 P. exserta, and 33,540 P. integrifolia high quality representative transcripts were extracted for annotation and expression analysis. The transcriptome data was mined for single nucleotide polymorphisms (SNP) and simple sequence repeat (SSR) markers, yielding 89,007 high quality SNPs and 2949 SSRs, respectively. 15,701 SNPs were computationally converted into user-friendly cleaved amplified polymorphic sequence (CAPS) markers and a subset of SNP and CAPS markers were experimentally verified. CAPS markers developed from plastochron-related homologous transcripts from P. axillaris were mapped in an interspecific Petunia population and evaluated for co-localization with QTL for development rate. The high quality of the three Petunia spp. transcriptomes coupled with the utility of the SNP data will serve as a resource for further exploration of genetic diversity within the genus and will facilitate efforts to develop genetic and physical maps to aid the identification of QTL associated with traits of interest.
Survey of the transcriptome of Aspergillus oryzae via massively parallel mRNA sequencing
Wang, Bin; Guo, Guangwu; Wang, Chao; Lin, Ying; Wang, Xiaoning; Zhao, Mouming; Guo, Yong; He, Minghui; Zhang, Yong; Pan, Li
2010-01-01
Aspergillus oryzae, an important filamentous fungus used in food fermentation and the enzyme industry, has been shown through genome sequencing and various other tools to have prominent features in its genomic composition. However, the functional complexity of the A. oryzae transcriptome has not yet been fully elucidated. Here, we applied direct high-throughput paired-end RNA-sequencing (RNA-Seq) to the transcriptome of A. oryzae under four different culture conditions. With the high resolution and sensitivity afforded by RNA-Seq, we were able to identify a substantial number of novel transcripts, new exons, untranslated regions, alternative upstream initiation codons and upstream open reading frames, which provide remarkable insight into the A. oryzae transcriptome. We were also able to assess the alternative mRNA isoforms in A. oryzae and found a large number of genes undergoing alternative splicing. Many genes and pathways that might be involved in higher levels of protein production in solid-state culture than in liquid culture were identified by comparing gene expression levels between different cultures. Our analysis indicated that the transcriptome of A. oryzae is much more complex than previously anticipated, and these results may provide a blueprint for further study of the A. oryzae transcriptome. PMID:20392818
Survey of the transcriptome of Aspergillus oryzae via massively parallel mRNA sequencing.
Wang, Bin; Guo, Guangwu; Wang, Chao; Lin, Ying; Wang, Xiaoning; Zhao, Mouming; Guo, Yong; He, Minghui; Zhang, Yong; Pan, Li
2010-08-01
Aspergillus oryzae, an important filamentous fungus used in food fermentation and the enzyme industry, has been shown through genome sequencing and various other tools to have prominent features in its genomic composition. However, the functional complexity of the A. oryzae transcriptome has not yet been fully elucidated. Here, we applied direct high-throughput paired-end RNA-sequencing (RNA-Seq) to the transcriptome of A. oryzae under four different culture conditions. With the high resolution and sensitivity afforded by RNA-Seq, we were able to identify a substantial number of novel transcripts, new exons, untranslated regions, alternative upstream initiation codons and upstream open reading frames, which provide remarkable insight into the A. oryzae transcriptome. We were also able to assess the alternative mRNA isoforms in A. oryzae and found a large number of genes undergoing alternative splicing. Many genes and pathways that might be involved in higher levels of protein production in solid-state culture than in liquid culture were identified by comparing gene expression levels between different cultures. Our analysis indicated that the transcriptome of A. oryzae is much more complex than previously anticipated, and these results may provide a blueprint for further study of the A. oryzae transcriptome.
Blood transcriptomics and metabolomics for personalized medicine.
Li, Shuzhao; Todor, Andrei; Luo, Ruiyan
2016-01-01
Molecular analysis of blood samples is pivotal to clinical diagnosis and has been intensively investigated since the rise of systems biology. Recent developments have opened new opportunities to utilize transcriptomics and metabolomics for personalized and precision medicine. Efforts from human immunology have infused into this area exquisite characterizations of subpopulations of blood cells. It is now possible to infer from blood transcriptomics, with fine accuracy, the contribution of immune activation and of cell subpopulations. In parallel, high-resolution mass spectrometry has brought revolutionary analytical capability, detecting > 10,000 metabolites, together with environmental exposure, dietary intake, microbial activity, and pharmaceutical drugs. Thus, the re-examination of blood chemicals by metabolomics is in order. Transcriptomics and metabolomics can be integrated to provide a more comprehensive understanding of the human biological states. We will review these new data and methods and discuss how they can contribute to personalized medicine.
Reptilian Transcriptomes v2.0: An Extensive Resource for Sauropsida Genomics and Transcriptomics
Tzika, Athanasia C.; Ullate-Agote, Asier; Grbic, Djordje; Milinkovitch, Michel C.
2015-01-01
Despite the availability of deep-sequencing techniques, genomic and transcriptomic data remain unevenly distributed across phylogenetic groups. For example, reptiles are poorly represented in sequence databases, hindering functional evolutionary and developmental studies in these lineages substantially more diverse than mammals. In addition, different studies use different assembly and annotation protocols, inhibiting meaningful comparisons. Here, we present the “Reptilian Transcriptomes Database 2.0,” which provides extensive annotation of transcriptomes and genomes from species covering the major reptilian lineages. To this end, we sequenced normalized complementary DNA libraries of multiple adult tissues and various embryonic stages of the leopard gecko and the corn snake and gathered published reptilian sequence data sets from representatives of the four extant orders of reptiles: Squamata (snakes and lizards), the tuatara, crocodiles, and turtles. The LANE runner 2.0 software was implemented to annotate all assemblies within a single integrated pipeline. We show that this approach increases the annotation completeness of the assembled transcriptomes/genomes. We then built large concatenated protein alignments of single-copy genes and inferred phylogenetic trees that support the positions of turtles and the tuatara as sister groups of Archosauria and Squamata, respectively. The Reptilian Transcriptomes Database 2.0 resource will be updated to include selected new data sets as they become available, thus making it a reference for differential expression studies, comparative genomics and transcriptomics, linkage mapping, molecular ecology, and phylogenomic analyses involving reptiles. The database is available at www.reptilian-transcriptomes.org and can be enquired using a wwwblast server installed at the University of Geneva. PMID:26133641
Characterization and fine mapping of qkc7.03: a major locus for kernel cracking in maize.
Yang, Mingtao; Chen, Lin; Wu, Xun; Gao, Xing; Li, Chunhui; Song, Yanchun; Zhang, Dengfeng; Shi, Yunsu; Li, Yu; Li, Yong-Xiang; Wang, Tianyu
2018-02-01
A major locus conferring kernel cracking in maize was characterized and fine mapped to an interval of 416.27 kb. Meanwhile, combining the results of transcriptomic analysis, the candidate gene was inferred. Seed development requires a proper structural and physiological balance between the maternal tissues and the internal structures of the seeds. In maize, kernel cracking is a disorder in this balance that seriously limits quality and yield and is characterized by a cracked pericarp at the kernel top and endosperm everting. This study elucidated the genetic basis and characterization of kernel cracking. Primarily, a near isogenic line (NIL) with a B73 background exhibited steady kernel cracking across environments. Therefore, deprived mapping populations were developed from this NIL and its recurrent parent B73. A major locus on chromosome 7, qkc7.03, was identified to be associated with the cracking performance. According to a progeny test of recombination events, qkc7.03 was fine mapped to a physical interval of 416.27 kb. In addition, obvious differences were observed in embryo development and starch granule arrangement within the endosperm between the NIL and its recurrent parent upon the occurrence of kernel cracking. Moreover, compared to its recurrent parent, the transcriptome of the NIL showed a significantly down-regulated expression of genes related to zeins, carbohydrate synthesis and MADS-domain transcription factors. The transcriptomic analysis revealed ten annotated genes within the target region of qkc7.03, and only GRMZM5G899476 was differently expressed between the NIL and its recurrent parent, indicating that this gene might be a candidate gene for kernel cracking. The results of this study facilitate the understanding of the potential mechanism underlying kernel cracking in maize.
Suzuki, Mami; Nakabayashi, Ryo; Ogata, Yoshiyuki; Sakurai, Nozomu; Tokimatsu, Toshiaki; Goto, Susumu; Suzuki, Makoto; Jasinski, Michal; Martinoia, Enrico; Otagaki, Shungo; Matsumoto, Shogo; Saito, Kazuki; Shiratake, Katsuhiro
2015-01-01
Grape (Vitis vinifera) accumulates various polyphenolic compounds, which protect against environmental stresses, including ultraviolet-C (UV-C) light and pathogens. In this study, we looked at the transcriptome and metabolome in grape berry skin after UV-C irradiation, which demonstrated the effectiveness of omics approaches to clarify important traits of grape. We performed transcriptome analysis using a genome-wide microarray, which revealed 238 genes up-regulated more than 5-fold by UV-C light. Enrichment analysis of Gene Ontology terms showed that genes encoding stilbene synthase, a key enzyme for resveratrol synthesis, were enriched in the up-regulated genes. We performed metabolome analysis using liquid chromatography-quadrupole time-of-flight mass spectrometry, and 2,012 metabolite peaks, including unidentified peaks, were detected. Principal component analysis using the peaks showed that only one metabolite peak, identified as resveratrol, was highly induced by UV-C light. We updated the metabolic pathway map of grape in the Kyoto Encyclopedia of Genes and Genomes (KEGG) database and in the KaPPA-View 4 KEGG system, then projected the transcriptome and metabolome data on a metabolic pathway map. The map showed specific induction of the resveratrol synthetic pathway by UV-C light. Our results showed that multiomics is a powerful tool to elucidate the accumulation mechanisms of secondary metabolites, and updated systems, such as KEGG and KaPPA-View 4 KEGG for grape, can support such studies. PMID:25761715
Suzuki, Mami; Nakabayashi, Ryo; Ogata, Yoshiyuki; Sakurai, Nozomu; Tokimatsu, Toshiaki; Goto, Susumu; Suzuki, Makoto; Jasinski, Michal; Martinoia, Enrico; Otagaki, Shungo; Matsumoto, Shogo; Saito, Kazuki; Shiratake, Katsuhiro
2015-05-01
Grape (Vitis vinifera) accumulates various polyphenolic compounds, which protect against environmental stresses, including ultraviolet-C (UV-C) light and pathogens. In this study, we looked at the transcriptome and metabolome in grape berry skin after UV-C irradiation, which demonstrated the effectiveness of omics approaches to clarify important traits of grape. We performed transcriptome analysis using a genome-wide microarray, which revealed 238 genes up-regulated more than 5-fold by UV-C light. Enrichment analysis of Gene Ontology terms showed that genes encoding stilbene synthase, a key enzyme for resveratrol synthesis, were enriched in the up-regulated genes. We performed metabolome analysis using liquid chromatography-quadrupole time-of-flight mass spectrometry, and 2,012 metabolite peaks, including unidentified peaks, were detected. Principal component analysis using the peaks showed that only one metabolite peak, identified as resveratrol, was highly induced by UV-C light. We updated the metabolic pathway map of grape in the Kyoto Encyclopedia of Genes and Genomes (KEGG) database and in the KaPPA-View 4 KEGG system, then projected the transcriptome and metabolome data on a metabolic pathway map. The map showed specific induction of the resveratrol synthetic pathway by UV-C light. Our results showed that multiomics is a powerful tool to elucidate the accumulation mechanisms of secondary metabolites, and updated systems, such as KEGG and KaPPA-View 4 KEGG for grape, can support such studies. © 2015 American Society of Plant Biologists. All Rights Reserved.
RNA-Seq Technology and Its Application in Fish Transcriptomics
Ba, Yi; Zhuang, Qianfeng
2014-01-01
Abstract High-throughput sequencing technologies, also known as next-generation sequencing (NGS) technologies, have revolutionized the way that genomic research is advancing. In addition to the static genome, these state-of-art technologies have been recently exploited to analyze the dynamic transcriptome, and the resulting technology is termed RNA sequencing (RNA-seq). RNA-seq is free from many limitations of other transcriptomic approaches, such as microarray and tag-based sequencing method. Although RNA-seq has only been available for a short time, studies using this method have completely changed our perspective of the breadth and depth of eukaryotic transcriptomes. In terms of the transcriptomics of teleost fishes, both model and non-model species have benefited from the RNA-seq approach and have undergone tremendous advances in the past several years. RNA-seq has helped not only in mapping and annotating fish transcriptome but also in our understanding of many biological processes in fish, such as development, adaptive evolution, host immune response, and stress response. In this review, we first provide an overview of each step of RNA-seq from library construction to the bioinformatic analysis of the data. We then summarize and discuss the recent biological insights obtained from the RNA-seq studies in a variety of fish species. PMID:24380445
Abdelrahman, Hisham; ElHady, Mohamed; Alcivar-Warren, Acacia; Allen, Standish; Al-Tobasei, Rafet; Bao, Lisui; Beck, Ben; Blackburn, Harvey; Bosworth, Brian; Buchanan, John; Chappell, Jesse; Daniels, William; Dong, Sheng; Dunham, Rex; Durland, Evan; Elaswad, Ahmed; Gomez-Chiarri, Marta; Gosh, Kamal; Guo, Ximing; Hackett, Perry; Hanson, Terry; Hedgecock, Dennis; Howard, Tiffany; Holland, Leigh; Jackson, Molly; Jin, Yulin; Khalil, Karim; Kocher, Thomas; Leeds, Tim; Li, Ning; Lindsey, Lauren; Liu, Shikai; Liu, Zhanjiang; Martin, Kyle; Novriadi, Romi; Odin, Ramjie; Palti, Yniv; Peatman, Eric; Proestou, Dina; Qin, Guyu; Reading, Benjamin; Rexroad, Caird; Roberts, Steven; Salem, Mohamed; Severin, Andrew; Shi, Huitong; Shoemaker, Craig; Stiles, Sheila; Tan, Suxu; Tang, Kathy F J; Thongda, Wilawan; Tiersch, Terrence; Tomasso, Joseph; Prabowo, Wendy Tri; Vallejo, Roger; van der Steen, Hein; Vo, Khoi; Waldbieser, Geoff; Wang, Hanping; Wang, Xiaozhu; Xiang, Jianhai; Yang, Yujia; Yant, Roger; Yuan, Zihao; Zeng, Qifan; Zhou, Tao
2017-02-20
Advancing the production efficiency and profitability of aquaculture is dependent upon the ability to utilize a diverse array of genetic resources. The ultimate goals of aquaculture genomics, genetics and breeding research are to enhance aquaculture production efficiency, sustainability, product quality, and profitability in support of the commercial sector and for the benefit of consumers. In order to achieve these goals, it is important to understand the genomic structure and organization of aquaculture species, and their genomic and phenomic variations, as well as the genetic basis of traits and their interrelationships. In addition, it is also important to understand the mechanisms of regulation and evolutionary conservation at the levels of genome, transcriptome, proteome, epigenome, and systems biology. With genomic information and information between the genomes and phenomes, technologies for marker/causal mutation-assisted selection, genome selection, and genome editing can be developed for applications in aquaculture. A set of genomic tools and resources must be made available including reference genome sequences and their annotations (including coding and non-coding regulatory elements), genome-wide polymorphic markers, efficient genotyping platforms, high-density and high-resolution linkage maps, and transcriptome resources including non-coding transcripts. Genomic and genetic control of important performance and production traits, such as disease resistance, feed conversion efficiency, growth rate, processing yield, behaviour, reproductive characteristics, and tolerance to environmental stressors like low dissolved oxygen, high or low water temperature and salinity, must be understood. QTL need to be identified, validated across strains, lines and populations, and their mechanisms of control understood. Causal gene(s) need to be identified. Genetic and epigenetic regulation of important aquaculture traits need to be determined, and technologies for marker-assisted selection, causal gene/mutation-assisted selection, genome selection, and genome editing using CRISPR and other technologies must be developed, demonstrated with applicability, and application to aquaculture industries.Major progress has been made in aquaculture genomics for dozens of fish and shellfish species including the development of genetic linkage maps, physical maps, microarrays, single nucleotide polymorphism (SNP) arrays, transcriptome databases and various stages of genome reference sequences. This paper provides a general review of the current status, challenges and future research needs of aquaculture genomics, genetics, and breeding, with a focus on major aquaculture species in the United States: catfish, rainbow trout, Atlantic salmon, tilapia, striped bass, oysters, and shrimp. While the overall research priorities and the practical goals are similar across various aquaculture species, the current status in each species should dictate the next priority areas within the species. This paper is an output of the USDA Workshop for Aquaculture Genomics, Genetics, and Breeding held in late March 2016 in Auburn, Alabama, with participants from all parts of the United States.
Lloréns-Rico, Verónica; Serrano, Luis; Lluch-Senar, Maria
2014-07-29
RNA sequencing methods have already altered our view of the extent and complexity of bacterial and eukaryotic transcriptomes, revealing rare transcript isoforms (circular RNAs, RNA chimeras) that could play an important role in their biology. We performed an analysis of chimera formation by four different computational approaches, including a custom designed pipeline, to study the transcriptomes of M. pneumoniae and P. aeruginosa, as well as mixtures of both. We found that rare transcript isoforms detected by conventional pipelines of analysis could be artifacts of the experimental procedure used in the library preparation, and that they are protocol-dependent. By using a customized pipeline we show that optimal library preparation protocol and the pipeline to analyze the results are crucial to identify real chimeric RNAs.
The cancer transcriptome is shaped by genetic changes, variation in gene transcription, mRNA processing, editing and stability, and the cancer microbiome. Deciphering this variation and understanding its implications on tumorigenesis requires sophisticated computational analyses. Most RNA-Seq analyses rely on methods that first map short reads to a reference genome, and then compare them to annotated transcripts or assemble them. However, this strategy can be limited when the cancer genome is substantially different than the reference or for detecting sequences from the cancer microbiome.
Bacillus anthracis genome organization in light of whole transcriptome sequencing
DOE Office of Scientific and Technical Information (OSTI.GOV)
Martin, Jeffrey; Zhu, Wenhan; Passalacqua, Karla D.
2010-03-22
Emerging knowledge of whole prokaryotic transcriptomes could validate a number of theoretical concepts introduced in the early days of genomics. What are the rules connecting gene expression levels with sequence determinants such as quantitative scores of promoters and terminators? Are translation efficiency measures, e.g. codon adaptation index and RBS score related to gene expression? We used the whole transcriptome shotgun sequencing of a bacterial pathogen Bacillus anthracis to assess correlation of gene expression level with promoter, terminator and RBS scores, codon adaptation index, as well as with a new measure of gene translational efficiency, average translation speed. We compared computationalmore » predictions of operon topologies with the transcript borders inferred from RNA-Seq reads. Transcriptome mapping may also improve existing gene annotation. Upon assessment of accuracy of current annotation of protein-coding genes in the B. anthracis genome we have shown that the transcriptome data indicate existence of more than a hundred genes missing in the annotation though predicted by an ab initio gene finder. Interestingly, we observed that many pseudogenes possess not only a sequence with detectable coding potential but also promoters that maintain transcriptional activity.« less
Ball, Robyn L; Fujiwara, Yasuhiro; Sun, Fengyun; Hu, Jianjun; Hibbs, Matthew A; Handel, Mary Ann; Carter, Gregory W
2016-08-12
The continuous and non-synchronous nature of postnatal male germ-cell development has impeded stage-specific resolution of molecular events of mammalian meiotic prophase in the testis. Here the juvenile onset of spermatogenesis in mice is analyzed by combining cytological and transcriptomic data in a novel computational analysis that allows decomposition of the transcriptional programs of spermatogonia and meiotic prophase substages. Germ cells from testes of individual mice were obtained at two-day intervals from 8 to 18 days post-partum (dpp), prepared as surface-spread chromatin and immunolabeled for meiotic stage-specific protein markers (STRA8, SYCP3, phosphorylated H2AFX, and HISTH1T). Eight stages were discriminated cytologically by combinatorial antibody labeling, and RNA-seq was performed on the same samples. Independent principal component analyses of cytological and transcriptomic data yielded similar patterns for both data types, providing strong evidence for substage-specific gene expression signatures. A novel permutation-based maximum covariance analysis (PMCA) was developed to map co-expressed transcripts to one or more of the eight meiotic prophase substages, thereby linking distinct molecular programs to cytologically defined cell states. Expression of meiosis-specific genes is not substage-limited, suggesting regulation of substage transitions at other levels. This integrated analysis provides a general method for resolving complex cell populations. Here it revealed not only features of meiotic substage-specific gene expression, but also a network of substage-specific transcription factors and relationships to potential target genes.
Kogelman, Lisette J A; Zhernakova, Daria V; Westra, Harm-Jan; Cirera, Susanna; Fredholm, Merete; Franke, Lude; Kadarmideen, Haja N
2015-10-20
Obesity is a multi-factorial health problem in which genetic factors play an important role. Limited results have been obtained in single-gene studies using either genomic or transcriptomic data. RNA sequencing technology has shown its potential in gaining accurate knowledge about the transcriptome, and may reveal novel genes affecting complex diseases. Integration of genomic and transcriptomic variation (expression quantitative trait loci [eQTL] mapping) has identified causal variants that affect complex diseases. We integrated transcriptomic data from adipose tissue and genomic data from a porcine model to investigate the mechanisms involved in obesity using a systems genetics approach. Using a selective gene expression profiling approach, we selected 36 animals based on a previously created genomic Obesity Index for RNA sequencing of subcutaneous adipose tissue. Differential expression analysis was performed using the Obesity Index as a continuous variable in a linear model. eQTL mapping was then performed to integrate 60 K porcine SNP chip data with the RNA sequencing data. Results were restricted based on genome-wide significant single nucleotide polymorphisms, detected differentially expressed genes, and previously detected co-expressed gene modules. Further data integration was performed by detecting co-expression patterns among eQTLs and integration with protein data. Differential expression analysis of RNA sequencing data revealed 458 differentially expressed genes. The eQTL mapping resulted in 987 cis-eQTLs and 73 trans-eQTLs (false discovery rate < 0.05), of which the cis-eQTLs were associated with metabolic pathways. We reduced the eQTL search space by focusing on differentially expressed and co-expressed genes and disease-associated single nucleotide polymorphisms to detect obesity-related genes and pathways. Building a co-expression network using eQTLs resulted in the detection of a module strongly associated with lipid pathways. Furthermore, we detected several obesity candidate genes, for example, ENPP1, CTSL, and ABHD12B. To our knowledge, this is the first study to perform an integrated genomics and transcriptomics (eQTL) study using, and modeling, genomic and subcutaneous adipose tissue RNA sequencing data on obesity in a porcine model. We detected several pathways and potential causal genes for obesity. Further validation and investigation may reveal their exact function and association with obesity.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Peterson, Elena S.; McCue, Lee Ann; Rutledge, Alexandra C.
2012-04-25
Visual Exploration and Statistics to Promote Annotation (VESPA) is an interactive visual analysis software tool that facilitates the discovery of structural mis-annotations in prokaryotic genomes. VESPA integrates high-throughput peptide-centric proteomics data and oligo-centric or RNA-Seq transcriptomics data into a genomic context. The data may be interrogated via visual analysis across multiple levels of genomic resolution, linked searches, exports and interaction with BLAST to rapidly identify location of interest within the genome and evaluate potential mis-annotations.
Stare, Tjaša; Stare, Katja; Weckwerth, Wolfram; Wienkoop, Stefanie; Gruden, Kristina
2017-07-06
Plant diseases caused by viral infection are affecting all major crops. Being an obligate intracellular organisms, chemical control of these pathogens is so far not applied in the field except to control the insect vectors of the viruses. Understanding of molecular responses of plant immunity is therefore economically important, guiding the enforcement of crop resistance. To disentangle complex regulatory mechanisms of the plant immune responses, understanding system as a whole is a must. However, integrating data from different molecular analysis (transcriptomics, proteomics, metabolomics, smallRNA regulation etc.) is not straightforward. We evaluated the response of potato ( Solanum tuberosum L.) following the infection with potato virus Y (PVY). The response has been analyzed on two molecular levels, with microarray transcriptome analysis and mass spectroscopy-based proteomics. Within this report, we performed detailed analysis of the results on both levels and compared two different approaches for analysis of proteomic data (spectral count versus MaxQuant). To link the data on different molecular levels, each protein was mapped to the corresponding potato transcript according to StNIB paralogue grouping. Only 33% of the proteins mapped to microarray probes in a one-to-one relation and additionally many showed discordance in detected levels of proteins with corresponding transcripts. We discussed functional importance of true biological differences between both levels and showed that the reason for the discordance between transcript and protein abundance lies partly in complexity and structure of biological regulation of proteome and transcriptome and partly in technical issues contributing to it.
Stare, Tjaša; Stare, Katja; Weckwerth, Wolfram; Wienkoop, Stefanie
2017-01-01
Plant diseases caused by viral infection are affecting all major crops. Being an obligate intracellular organisms, chemical control of these pathogens is so far not applied in the field except to control the insect vectors of the viruses. Understanding of molecular responses of plant immunity is therefore economically important, guiding the enforcement of crop resistance. To disentangle complex regulatory mechanisms of the plant immune responses, understanding system as a whole is a must. However, integrating data from different molecular analysis (transcriptomics, proteomics, metabolomics, smallRNA regulation etc.) is not straightforward. We evaluated the response of potato (Solanum tuberosum L.) following the infection with potato virus Y (PVY). The response has been analyzed on two molecular levels, with microarray transcriptome analysis and mass spectroscopy-based proteomics. Within this report, we performed detailed analysis of the results on both levels and compared two different approaches for analysis of proteomic data (spectral count versus MaxQuant). To link the data on different molecular levels, each protein was mapped to the corresponding potato transcript according to StNIB paralogue grouping. Only 33% of the proteins mapped to microarray probes in a one-to-one relation and additionally many showed discordance in detected levels of proteins with corresponding transcripts. We discussed functional importance of true biological differences between both levels and showed that the reason for the discordance between transcript and protein abundance lies partly in complexity and structure of biological regulation of proteome and transcriptome and partly in technical issues contributing to it. PMID:28684682
NREL: International Activities - Afghanistan Resource Maps
facilities, load centers, terrain conditions, and land use. The high-resolution (1-km) annual wind power maps . The high-resolution (10-km) annual and seasonal solar resource maps were developed using weather -km Resolution Annual Maps (Direct) Low-Res (JPG 104 KB) | High-Res (ZIP 330 KB) 40-km Resolution
Kamphuis, Lars G; Hane, James K; Nelson, Matthew N; Gao, Lingling; Atkins, Craig A; Singh, Karam B
2015-01-01
Narrow-leafed lupin (NLL; Lupinus angustifolius L.) is an important grain legume crop that is valuable for sustainable farming and is becoming recognized as a human health food. NLL breeding is directed at improving grain production, disease resistance, drought tolerance and health benefits. However, genetic and genomic studies have been hindered by a lack of extensive genomic resources for the species. Here, the generation, de novo assembly and annotation of transcriptome datasets derived from five different NLL tissue types of the reference accession cv. Tanjil are described. The Tanjil transcriptome was compared to transcriptomes of an early domesticated cv. Unicrop, a wild accession P27255, as well as accession 83A:476, together being the founding parents of two recombinant inbred line (RIL) populations. In silico predictions for transcriptome-derived gene-based length and SNP polymorphic markers were conducted and corroborated using a survey assembly sequence for NLL cv. Tanjil. This yielded extensive indel and SNP polymorphic markers for the two RIL populations. A total of 335 transcriptome-derived markers and 66 BAC-end sequence-derived markers were evaluated, and 275 polymorphic markers were selected to genotype the reference NLL 83A:476 × P27255 RIL population. This significantly improved the completeness, marker density and quality of the reference NLL genetic map. PMID:25060816
Cui, Z; Hui, M; Liu, Y; Song, C; Li, X; Li, Y; Liu, L; Shi, G; Wang, S; Li, F; Zhang, X; Liu, C; Xiang, J; Chu, K H
2015-01-01
The sex determination system in crabs is believed to be XY-XX from karyotypy, but centromeres could not be identified in some chromosomes and their morphology is not completely clear. Using quantitative trait locus mapping of the gender phenotype, we revealed a ZW-ZZ sex determination system in Eriocheir sinensis and presented a high-density linkage map covering ~98.5% of the genome, with 73 linkage groups corresponding to the haploid chromosome number. All sex-linked markers in the family we used were located on a single linkage group, LG60, and sex linkage was confirmed by genome-wide association studies (GWAS). Forty-six markers detected by GWAS were heterozygous and segregated only in the female parent. The female LG60 was thus the putative W chromosome, with the homologous male LG60 as the Z chromosome. The putative Z and W sex chromosomes were identical in size and carried many homologous loci. Sex ratio (5:1) skewing towards females in induced triploids using unrelated animals also supported a ZW-ZZ system. Transcriptome data were used to search for candidate sex-determining loci, but only one LG60 gene was identified as an ankyrin-2 gene. Double sex- and mab3-related transcription factor 1 (Dmrt1), a Z-linked gene in birds, was located on a putative autosome. With complete genome sequencing and transcriptomic data, more genes on putative sex chromosomes will be characterised, thus leading towards a comprehensive understanding of the sex determination and differentiation mechanisms of E. sinensis, and decapod crustaceans in general. PMID:25873149
Transcriptome profiling analysis of Vibrio vulnificus during human infection.
Bisharat, Naiel; Bronstein, Michal; Korner, Mira; Schnitzer, Temima; Koton, Yael
2013-09-01
Vibrio vulnificus is a waterborne pathogen that was responsible for an outbreak of severe soft-tissue infections among fish farmers and fish consumers in Israel. Several factors have been shown to be associated with virulence. However, the transcriptome profile of the pathogen during human infection has not been determined yet. We compared the transcriptome profile, using RNA sequencing, of a human-pathogenic strain harvested directly from tissue of a patient suffering from severe soft-tissue infection with necrotizing fasciitis, with the same strain and three other environmental strains grown in vitro. The five sequenced libraries were aligned to the reference genomes of V. vulnificus strains CMCP6 and YJ016. Approximately 47.8 to 62.3 million paired-end raw reads were generated from the five runs. Nearly 84 % of the genome was covered by reads from at least one of the five runs, suggesting that nearly 16 % of the genome is not transcribed or is transcribed at low levels. We identified 123 genes that were differentially expressed during the acute phase of infection. Sixty-three genes were mapped to the large chromosome, 47 genes mapped to the small chromosome and 13 genes mapped to the YJ016 plasmid. The 123 genes fell into a variety of functional categories including transcription, signal transduction, cell motility, carbohydrate metabolism, intracellular trafficking and cell envelope biogenesis. Among the genes differentially expressed during human infection we identified genes encoding bacterial toxin (RtxA1) and genes involved in flagellar components, Flp-coding region, GGDEF family protein, iron acquisition system and sialic acid metabolism.
Dissecting the Molecular Mechanism of RhoC GTPase Expression in the Normal and Malignant Breast
2010-09-01
Headquarters Services , Directorate for Information Operations and Reports (0704-0188), 1215 Jefferson Davis Highway, Suite 1204, Arlington, VA 22202- 4302...transcriptome and categorized as (i) mapping to same gene, (ii) mapping to different genes (chimera candidates), (iii) nonmapping, (iv) mitochondrial, (v) quality ...mitochondrial, (iv) quality control, (v) chimera can- didates, and (vi) nonmapping. Chimera candidates and nonmapping categories were used for gene fusion
Ma, Qi-Feng; Wu, Chun-Hui; Wu, Man; Pei, Wen-Feng; Li, Xing-Li; Wang, Wen-Kui; Zhang, Jinfa; Yu, Ji-Wen; Yu, Shu-Xun
2016-01-01
To investigate the molecular mechanisms of fiber initiation in cotton (Gossypium spp.), an integrated approach combining transcriptome, iTRAQ-based proteome and genetic mapping was taken to compare the ovules of the Xuzhou 142 wild type (WT) with its fuzzless-lintless (fl) mutant at −3 and 0 day post-anthesis. A total of 1,953 mRNAs, 187 proteins, and 131 phosphoproteins were differentially expressed (DE) between WT and fl, and the levels of transcripts and their encoded proteins and phosphoproteins were highly congruent. A functional analysis suggested that the abundance of proteins were mainly involved in amino sugar, nucleotide sugar and fatty acid metabolism, one carbon pool for folate metabolism and flavonoid biosynthesis. qRT-PCR, Western blotting, and enzymatic assays were performed to confirm the regulation of these transcripts and proteins. A molecular mapping located the lintless gene li3 in the fl mutant on chromosome 26 for the first time. A further in-silico physical mapping of DE genes with sequence variations between fl and WT identified one and four candidate genes in the li3 and n2 regions, respectively. Taken together, the transcript abundance, phosphorylation status of proteins at the fiber initiation stage and candidate genes have provided insights into regulatory processes underlying cotton fiber initiation. PMID:27075604
Lamm, Ayelet T; Stadler, Michael R; Zhang, Huibin; Gent, Jonathan I; Fire, Andrew Z
2011-02-01
We have used a combination of three high-throughput RNA capture and sequencing methods to refine and augment the transcriptome map of a well-studied genetic model, Caenorhabditis elegans. The three methods include a standard (non-directional) library preparation protocol relying on cDNA priming and foldback that has been used in several previous studies for transcriptome characterization in this species, and two directional protocols, one involving direct capture of single-stranded RNA fragments and one involving circular-template PCR (CircLigase). We find that each RNA-seq approach shows specific limitations and biases, with the application of multiple methods providing a more complete map than was obtained from any single method. Of particular note in the analysis were substantial advantages of CircLigase-based and ssRNA-based capture for defining sequences and structures of the precise 5' ends (which were lost using the double-strand cDNA capture method). Of the three methods, ssRNA capture was most effective in defining sequences to the poly(A) junction. Using data sets from a spectrum of C. elegans strains and stages and the UCSC Genome Browser, we provide a series of tools, which facilitate rapid visualization and assignment of gene structures.
Hartono, Stella R; Malapert, Amélie; Legros, Pénélope; Bernard, Pascal; Chédin, Frédéric; Vanoosthuyse, Vincent
2018-02-02
R-loops, which result from the formation of stable DNA:RNA hybrids, can both threaten genome integrity and act as physiological regulators of gene expression and chromatin patterning. To characterize R-loops in fission yeast, we used the S9.6 antibody-based DRIPc-seq method to sequence the RNA strand of R-loops and obtain strand-specific R-loop maps at near nucleotide resolution. Surprisingly, preliminary DRIPc-seq experiments identified mostly RNase H-resistant but exosome-sensitive RNAs that mapped to both DNA strands and resembled RNA:RNA hybrids (dsRNAs), suggesting that dsRNAs form widely in fission yeast. We confirmed in vitro that S9.6 can immuno-precipitate dsRNAs and provide evidence that dsRNAs can interfere with its binding to R-loops. dsRNA elimination by RNase III treatment prior to DRIPc-seq allowed the genome-wide and strand-specific identification of genuine R-loops that responded in vivo to RNase H levels and displayed classical features associated with R-loop formation. We also found that most transcripts whose levels were altered by in vivo manipulation of RNase H levels did not form detectable R-loops, suggesting that prolonged manipulation of R-loop levels could indirectly alter the transcriptome. We discuss the implications of our work in the design of experimental strategies to probe R-loop functions. Copyright © 2017 Elsevier Ltd. All rights reserved.
Droplet barcoding for single-cell transcriptomics applied to embryonic stem cells.
Klein, Allon M; Mazutis, Linas; Akartuna, Ilke; Tallapragada, Naren; Veres, Adrian; Li, Victor; Peshkin, Leonid; Weitz, David A; Kirschner, Marc W
2015-05-21
It has long been the dream of biologists to map gene expression at the single-cell level. With such data one might track heterogeneous cell sub-populations, and infer regulatory relationships between genes and pathways. Recently, RNA sequencing has achieved single-cell resolution. What is limiting is an effective way to routinely isolate and process large numbers of individual cells for quantitative in-depth sequencing. We have developed a high-throughput droplet-microfluidic approach for barcoding the RNA from thousands of individual cells for subsequent analysis by next-generation sequencing. The method shows a surprisingly low noise profile and is readily adaptable to other sequencing-based assays. We analyzed mouse embryonic stem cells, revealing in detail the population structure and the heterogeneous onset of differentiation after leukemia inhibitory factor (LIF) withdrawal. The reproducibility of these high-throughput single-cell data allowed us to deconstruct cell populations and infer gene expression relationships. VIDEO ABSTRACT. Copyright © 2015 Elsevier Inc. All rights reserved.
The draft genome of tropical fruit durian (Durio zibethinus).
Teh, Bin Tean; Lim, Kevin; Yong, Chern Han; Ng, Cedric Chuan Young; Rao, Sushma Ramesh; Rajasegaran, Vikneswari; Lim, Weng Khong; Ong, Choon Kiat; Chan, Ki; Cheng, Vincent Kin Yuen; Soh, Poh Sheng; Swarup, Sanjay; Rozen, Steven G; Nagarajan, Niranjan; Tan, Patrick
2017-11-01
Durian (Durio zibethinus) is a Southeast Asian tropical plant known for its hefty, spine-covered fruit and sulfury and onion-like odor. Here we present a draft genome assembly of D. zibethinus, representing the third plant genus in the Malvales order and first in the Helicteroideae subfamily to be sequenced. Single-molecule sequencing and chromosome contact maps enabled assembly of the highly heterozygous durian genome at chromosome-scale resolution. Transcriptomic analysis showed upregulation of sulfur-, ethylene-, and lipid-related pathways in durian fruits. We observed paleopolyploidization events shared by durian and cotton and durian-specific gene expansions in MGL (methionine γ-lyase), associated with production of volatile sulfur compounds (VSCs). MGL and the ethylene-related gene ACS (aminocyclopropane-1-carboxylic acid synthase) were upregulated in fruits concomitantly with their downstream metabolites (VSCs and ethylene), suggesting a potential association between ethylene biosynthesis and methionine regeneration via the Yang cycle. The durian genome provides a resource for tropical fruit biology and agronomy.
Introduction to Single-Cell RNA Sequencing.
Olsen, Thale Kristin; Baryawno, Ninib
2018-04-01
During the last decade, high-throughput sequencing methods have revolutionized the entire field of biology. The opportunity to study entire transcriptomes in great detail using RNA sequencing (RNA-seq) has fueled many important discoveries and is now a routine method in biomedical research. However, RNA-seq is typically performed in "bulk," and the data represent an average of gene expression patterns across thousands to millions of cells; this might obscure biologically relevant differences between cells. Single-cell RNA-seq (scRNA-seq) represents an approach to overcome this problem. By isolating single cells, capturing their transcripts, and generating sequencing libraries in which the transcripts are mapped to individual cells, scRNA-seq allows assessment of fundamental biological properties of cell populations and biological systems at unprecedented resolution. Here, we present the most common scRNA-seq protocols in use today and the basics of data analysis and discuss factors that are important to consider before planning and designing an scRNA-seq project. © 2018 by John Wiley & Sons, Inc. Copyright © 2018 John Wiley & Sons, Inc.
Optimization of De Novo Short Read Assembly of Seabuckthorn (Hippophae rhamnoides L.) Transcriptome
Ghangal, Rajesh; Chaudhary, Saurabh; Jain, Mukesh; Purty, Ram Singh; Chand Sharma, Prakash
2013-01-01
Seabuckthorn ( Hippophae rhamnoides L.) is known for its medicinal, nutritional and environmental importance since ancient times. However, very limited efforts have been made to characterize the genome and transcriptome of this wonder plant. Here, we report the use of next generation massive parallel sequencing technology (Illumina platform) and de novo assembly to gain a comprehensive view of the seabuckthorn transcriptome. We assembled 86,253,874 high quality short reads using six assembly tools. At our hand, assembly of non-redundant short reads following a two-step procedure was found to be the best considering various assembly quality parameters. Initially, ABySS tool was used following an additive k-mer approach. The assembled transcripts were subsequently subjected to TGICL suite. Finally, de novo short read assembly yielded 88,297 transcripts (> 100 bp), representing about 53 Mb of seabuckthorn transcriptome. The average length of transcripts was 610 bp, N50 length 1198 BP and 91% of the short reads uniquely mapped back to seabuckthorn transcriptome. A total of 41,340 (46.8%) transcripts showed significant similarity with sequences present in nr protein databases of NCBI (E-value < 1E-06). We also screened the assembled transcripts for the presence of transcription factors and simple sequence repeats. Our strategy involving the use of short read assembler (ABySS) followed by TGICL will be useful for the researchers working with a non-model organism’s transcriptome in terms of saving time and reducing complexity in data management. The seabuckthorn transcriptome data generated here provide a valuable resource for gene discovery and development of functional molecular markers. PMID:23991119
Spatial resolution requirements for urban land cover mapping from space
NASA Technical Reports Server (NTRS)
Todd, William J.; Wrigley, Robert C.
1986-01-01
Very low resolution (VLR) satellite data (Advanced Very High Resolution Radiometer, DMSP Operational Linescan System), low resolution (LR) data (Landsat MSS), medium resolution (MR) data (Landsat TM), and high resolution (HR) satellite data (Spot HRV, Large Format Camera) were evaluated and compared for interpretability at differing spatial resolutions. VLR data (500 m - 1.0 km) is useful for Level 1 (urban/rural distinction) mapping at 1:1,000,000 scale. Feature tone/color is utilized to distinguish generalized urban land cover using LR data (80 m) for 1:250,000 scale mapping. Advancing to MR data (30 m) and 1:100,000 scale mapping, confidence in land cover mapping is greatly increased, owing to the element of texture/pattern which is now evident in the imagery. Shape and shadow contribute to detailed Level II/III urban land use mapping possible if the interpreter can use HR (10-15 m) satellite data; mapping scales can be 1:25,000 - 1:50,000.
Wang, Yonglin; Xiong, Dianguang; Jiang, Ning; Li, Xuewu; Yang, Qiqing; Tian, Chengming
2016-01-01
Arceuthobium (dwarf mistletoes) are hemiparasites that may cause great damage to infected trees belonging to Pinaceae and Cupressaceae. Currently, dwarf mistletoe control involves the use of the ethylene-producing product ethephon (ETH), which acts by inducing dwarf mistletoe shoot abscission. However, the process by which ETH functions is mostly unknown. Therefore, the transcriptome of the ETH-exposed plants was compared to non-exposed controls to identify genes associated with the response to ethephon. In this study, the reference transcriptome was contained 120,316 annotated unigenes, with a total of 21,764 ETH-responsive differentially expressed unigenes were identified. These ETH-associated genes clustered into 20 distinctly expressed pattern groups, providing a view of molecular events with good spatial and temporal resolution. As expected, the greatest number of unigenes with changed expression were observed at the onset of abscission, suggesting induction by ethylene. ETH also affected genes associated with shoot abscission processes including hormone biosynthesis and signaling, cell wall hydrolysis and modification, lipid transference, and more. The comprehensive transcriptome data set provides a wealth of genomic resources for dwarf mistletoe communities and contributes to a better understanding of the molecular regulatory mechanism of ethylene-caused shoots abscission. PMID:27941945
Choi, Sun Young; Park, Byeonghyeok; Choi, In-Geol; Sim, Sang Jun; Lee, Sun-Mi; Um, Youngsoon; Woo, Han Min
2016-01-01
The development of high-throughput technology using RNA-seq has allowed understanding of cellular mechanisms and regulations of bacterial transcription. In addition, transcriptome analysis with RNA-seq has been used to accelerate strain improvement through systems metabolic engineering. Synechococcus elongatus PCC 7942, a photosynthetic bacterium, has remarkable potential for biochemical and biofuel production due to photoautotrophic cell growth and direct CO2 conversion. Here, we performed a transcriptome analysis of S. elongatus PCC 7942 using RNA-seq to understand the changes of cellular metabolism and regulation for nitrogen starvation responses. As a result, differentially expressed genes (DEGs) were identified and functionally categorized. With mapping onto metabolic pathways, we probed transcriptional perturbation and regulation of carbon and nitrogen metabolisms relating to nitrogen starvation responses. Experimental evidence such as chlorophyll a and phycobilisome content and the measurement of CO2 uptake rate validated the transcriptome analysis. The analysis suggests that S. elongatus PCC 7942 reacts to nitrogen starvation by not only rearranging the cellular transport capacity involved in carbon and nitrogen assimilation pathways but also by reducing protein synthesis and photosynthesis activities. PMID:27488818
Gingival tissue transcriptomes in experimental gingivitis
Jönsson, Daniel; Ramberg, Per; Demmer, Ryan T.; Kebschull, Moritz; Dahlén, Gunnar; Papapanou, Panos N.
2012-01-01
Aims We investigated the sequential gene expression in the gingiva during the induction and resolution of experimental gingivitis. Methods Twenty periodontally and systemically healthy non-smoking volunteers participated in a 3-week experimental gingivitis protocol, followed by debridement and 2-week regular plaque control. We recorded clinical indices and harvested gingival tissue samples from 4 interproximal palatal sites in half of the participants at baseline, Day 7, 14 and 21 (‘induction phase’), and at day 21, 25, 30 and 35 in the other half (‘resolution phase’). RNA was extracted, amplified, reversed transcribed, amplified, labeled and hybridized with Affymetrix Human Genome U133Plus2.0 microarrays. Paired t-tests compared gene expression changes between consecutive time points. Gene ontology analyses summarized the expression patterns into biologically relevant categories. Results The median gingival index was 0 at baseline, 2 at Day 21 and 1 at Day 35. Differential gene regulation peaked during the third week of induction and the first four days of resolution. Leukocyte transmigration, cell adhesion and antigen processing/presentation were the top differentially regulated pathways. Conclusions Transcriptomic studies enhance our understanding of the pathobiology of the reversible inflammatory gingival lesion and provide a detailed account of the dynamic tissue responses during induction and resolution of experimental gingivitis. PMID:21501207
Lan, DaoLiang; Xiong, XianRong; Wei, YanLi; Xu, Tong; Zhong, JinCheng; Zhi, XiangDong; Wang, Yong; Li, Jian
2014-09-01
RNA-Seq, a high-throughput (HT) sequencing technique, has been used effectively in large-scale transcriptomic studies, and is particularly useful for improving gene structure information and mining of new genes. In this study, RNA-Seq HT technology was employed to analyze the transcriptome of yak ovary. After Illumina-Solexa deep sequencing, 26826516 clean reads with a total of 4828772880 bp were obtained from the ovary library. Alignment analysis showed that 16992 yak genes mapped to the yak genome and 3734 of these genes were involved in alternative splicing. Gene structure refinement analysis showed that 7340 genes that were annotated in the yak genome could be extended at the 5' or 3' ends based on the alignments been the transcripts and the genome sequence. Novel transcript prediction analysis identified 6321 new transcripts with lengths ranging from 180 to 14884 bp, and 2267 of them were predicted to code proteins. BLAST analysis of the new transcripts showed that 1200?4933 mapped to the non-redundant (nr), nucleotide (nt) and/or SwissProt sequence databases. Comparative statistical analysis of the new mapped transcripts showed that the majority of them were similar to genes in Bos taurus (41.4%), Bos grunniens mutus (33.0%), Ovis aries (6.3%), Homo sapiens (2.8%), Mus musculus (1.6%) and other species. Functional analysis showed that these expressed genes were involved in various Gene Ontology (GO) categories and Kyoto Encyclopedia of Genes and Genomes pathways. GO analysis of the new transcripts found that the largest proportion of them was associated with reproduction. The results of this study will provide a basis for describing the normal transcriptome map of yak ovary and for future studies on yak breeding performance. Moreover, the results confirmed that RNA-Seq HT technology is highly advantageous in improving gene structure information and mining of new genes, as well as in providing valuable data to expand the yak genome information.
Long, Yong; Li, Qing; Zhou, Bolan; Song, Guili; Li, Tao; Cui, Zongbin
2013-01-01
Fish skin serves as the first line of defense against a wide variety of chemical, physical and biological stressors. Secretion of mucus is among the most prominent characteristics of fish skin and numerous innate immune factors have been identified in the epidermal mucus. However, molecular mechanisms underlying the mucus secretion and immune activities of fish skin remain largely unclear due to the lack of genomic and transcriptomic data for most economically important fish species. In this study, we characterized the skin transcriptome of mud loach using Illumia paired-end sequencing. A total of 40364 unigenes were assembled from 86.6 million (3.07 gigabases) filtered reads. The mean length, N50 size and maximum length of assembled transcripts were 387, 611 and 8670 bp, respectively. A total of 17336 (43.76%) unigenes were annotated by blast searches against the NCBI non-redundant protein database. Gene ontology mapping assigned a total of 108513 GO terms to 15369 (38.08%) unigenes. KEGG orthology mapping annotated 9337 (23.23%) unigenes. Among the identified KO categories, immune system is the largest category that contains various components of multiple immune pathways such as chemokine signaling, leukocyte transendothelial migration and T cell receptor signaling, suggesting the complexity of immune mechanisms in fish skin. As for mucin biosynthesis, 37 unigenes were mapped to 7 enzymes of the mucin type O-glycan biosynthesis pathway and 8 members of the polypeptide N-acetylgalactosaminyltransferase family were identified. Additionally, 38 unigenes were mapped to 23 factors of the SNARE interactions in vesicular transport pathway, indicating that the activity of this pathway is required for the processes of epidermal mucus storage and release. Moreover, 1754 simple sequence repeats (SSRs) were detected in 1564 unigenes and dinucleotide repeats represented the most abundant type. These findings have laid the foundation for further understanding the secretary processes and immune functions of loach skin mucus. PMID:23437293
Long, Yong; Li, Qing; Zhou, Bolan; Song, Guili; Li, Tao; Cui, Zongbin
2013-01-01
Fish skin serves as the first line of defense against a wide variety of chemical, physical and biological stressors. Secretion of mucus is among the most prominent characteristics of fish skin and numerous innate immune factors have been identified in the epidermal mucus. However, molecular mechanisms underlying the mucus secretion and immune activities of fish skin remain largely unclear due to the lack of genomic and transcriptomic data for most economically important fish species. In this study, we characterized the skin transcriptome of mud loach using Illumia paired-end sequencing. A total of 40364 unigenes were assembled from 86.6 million (3.07 gigabases) filtered reads. The mean length, N50 size and maximum length of assembled transcripts were 387, 611 and 8670 bp, respectively. A total of 17336 (43.76%) unigenes were annotated by blast searches against the NCBI non-redundant protein database. Gene ontology mapping assigned a total of 108513 GO terms to 15369 (38.08%) unigenes. KEGG orthology mapping annotated 9337 (23.23%) unigenes. Among the identified KO categories, immune system is the largest category that contains various components of multiple immune pathways such as chemokine signaling, leukocyte transendothelial migration and T cell receptor signaling, suggesting the complexity of immune mechanisms in fish skin. As for mucin biosynthesis, 37 unigenes were mapped to 7 enzymes of the mucin type O-glycan biosynthesis pathway and 8 members of the polypeptide N-acetylgalactosaminyltransferase family were identified. Additionally, 38 unigenes were mapped to 23 factors of the SNARE interactions in vesicular transport pathway, indicating that the activity of this pathway is required for the processes of epidermal mucus storage and release. Moreover, 1754 simple sequence repeats (SSRs) were detected in 1564 unigenes and dinucleotide repeats represented the most abundant type. These findings have laid the foundation for further understanding the secretary processes and immune functions of loach skin mucus.
Amaradasa, Bimal S; Amundsen, Keenan
2016-01-01
Buffalograss (Bouteloua dactyloides) is a low maintenance U. S. native turfgrass species with exceptional drought, heat, and cold tolerance. Leaf spot caused by Curvularia inaequalis negatively impacts buffalograss visual quality. Two leaf spot susceptible and two resistant buffalograss lines were challenged with C. inaequalis. Samples were collected from treated and untreated leaves when susceptible lines showed symptoms. Transcriptome sequencing was done and differentially expressed genes were identified. Approximately 27 million raw sequencing reads were produced per sample. More than 86% of the sequencing reads mapped to an existing buffalograss reference transcriptome. De novo assembly of unmapped reads was merged with the existing reference to produce a more complete transcriptome. There were 461 differentially expressed transcripts between the resistant and susceptible lines when challenged with the pathogen and 1552 in its absence. Previously characterized defense-related genes were identified among the differentially expressed transcripts. Twenty one resistant line transcripts were similar to genes regulating pattern triggered immunity and 20 transcripts were similar to genes regulating effector triggered immunity. There were also nine up-regulated transcripts in resistance lines which showed potential to initiate systemic acquired resistance (SAR) and three transcripts encoding pathogenesis-related proteins which are downstream products of SAR. This is the first study characterizing changes in the buffalograss transcriptome when challenged with C. inaequalis.
van Veen, Hans; Vashisht, Divya; Akman, Melis; Girke, Thomas; Mustroph, Angelika; Reinen, Emilie; Kooiker, Maarten; van Tienderen, Peter; Voesenek, Laurentius A.C.J.
2016-01-01
Climate change has increased the frequency and severity of flooding events, with significant negative impact on agricultural productivity. These events often submerge plant aerial organs and roots, limiting growth and survival due to a severe reduction in light reactions and gas exchange necessary for photosynthesis and respiration, respectively. To distinguish molecular responses to the compound stress imposed by submergence, we investigated transcriptomic adjustments to darkness in air and under submerged conditions using eight Arabidopsis (Arabidopsis thaliana) accessions differing significantly in sensitivity to submergence. Evaluation of root and rosette transcriptomes revealed an early transcriptional and posttranscriptional response signature that was conserved primarily across genotypes, although flooding susceptibility-associated and genotype-specific responses also were uncovered. Posttranscriptional regulation encompassed darkness- and submergence-induced alternative splicing of transcripts from pathways involved in the alternative mobilization of energy reserves. The organ-specific transcriptome adjustments reflected the distinct physiological status of roots and shoots. Root-specific transcriptome changes included marked up-regulation of chloroplast-encoded photosynthesis and redox-related genes, whereas those of the rosette were related to the regulation of development and growth processes. We identified a novel set of tolerance genes, recognized mainly by quantitative differences. These included a transcriptome signature of more pronounced gluconeogenesis in tolerant accessions, a response that included stress-induced alternative splicing. This study provides organ-specific molecular resolution of genetic variation in submergence responses involving interactions between darkness and low-oxygen constraints of flooding stress and demonstrates that early transcriptome plasticity, including alternative splicing, is associated with the ability to cope with a compound environmental stress. PMID:27208254
Differential DNA methylation and transcription profiles in date palm roots exposed to salinity
Al-Harrasi, Ibtisam; Al-Yahyai, Rashid
2018-01-01
As a salt-adaptive plant, the date palm (Phoenix dactylifera L.) requires a suitable mechanism to adapt to the stress of saline soils. There is growing evidence that DNA methylation plays an important role in regulating gene expression in response to abiotic stresses, including salinity. Thus, the present study sought to examine the differential methylation status that occurs in the date palm genome when plants are exposed to salinity, and to identify salinity responsive genes that are regulated by DNA methylation. To achieve these, whole-genome bisulfite sequencing (WGBS) was employed and mRNA was sequenced from salinity-treated and untreated roots. The WGBS analysis included 324,987,795 and 317,056,091 total reads of the control and the salinity-treated samples, respectively. The analysis covered about 81% of the total genomic DNA with about 40% of mapping efficiency of the sequenced reads and an average read depth of 17-fold coverage per DNA strand, and with a bisulfite conversion rate of around 99%. The level of methylation within the differentially methylated regions (DMRs) was significantly (p < 0.05, FDR ≤ 0.05) increased in response to salinity specifically at the mCHG and mCHH sequence contexts. Consistently, the mass spectrometry and the enzyme-linked immunosorbent assay (ELISA) showed that there was a significant (p < 0.05) increase in the global DNA methylation in response to salinity. mRNA sequencing revealed the presence of 6,405 differentially regulated genes with a significant value (p < 0.001, FDR ≤ 0.05) in response to salinity. Integration of high-resolution methylome and transcriptome analyses revealed a negative correlation between mCG methylation located within the promoters and the gene expression, while a positive correlation was noticed between mCHG/mCHH methylation rations and gene expression specifically when plants grew under control conditions. Therefore, the methylome and transcriptome relationships vary based on the methylated sequence context, the methylated region within the gene, the protein-coding ability of the gene, and the salinity treatment. These results provide insights into interplay among DNA methylation and gene expression, and highlight the effect of salinity on the nature of this relationship, which may involve other genetic and epigenetic players under salt stress conditions. The results obtained from this project provide the first draft map of the differential methylome and transcriptome of date palm when exposed to an abiotic stress. PMID:29352281
Differential DNA methylation and transcription profiles in date palm roots exposed to salinity.
Al-Harrasi, Ibtisam; Al-Yahyai, Rashid; Yaish, Mahmoud W
2018-01-01
As a salt-adaptive plant, the date palm (Phoenix dactylifera L.) requires a suitable mechanism to adapt to the stress of saline soils. There is growing evidence that DNA methylation plays an important role in regulating gene expression in response to abiotic stresses, including salinity. Thus, the present study sought to examine the differential methylation status that occurs in the date palm genome when plants are exposed to salinity, and to identify salinity responsive genes that are regulated by DNA methylation. To achieve these, whole-genome bisulfite sequencing (WGBS) was employed and mRNA was sequenced from salinity-treated and untreated roots. The WGBS analysis included 324,987,795 and 317,056,091 total reads of the control and the salinity-treated samples, respectively. The analysis covered about 81% of the total genomic DNA with about 40% of mapping efficiency of the sequenced reads and an average read depth of 17-fold coverage per DNA strand, and with a bisulfite conversion rate of around 99%. The level of methylation within the differentially methylated regions (DMRs) was significantly (p < 0.05, FDR ≤ 0.05) increased in response to salinity specifically at the mCHG and mCHH sequence contexts. Consistently, the mass spectrometry and the enzyme-linked immunosorbent assay (ELISA) showed that there was a significant (p < 0.05) increase in the global DNA methylation in response to salinity. mRNA sequencing revealed the presence of 6,405 differentially regulated genes with a significant value (p < 0.001, FDR ≤ 0.05) in response to salinity. Integration of high-resolution methylome and transcriptome analyses revealed a negative correlation between mCG methylation located within the promoters and the gene expression, while a positive correlation was noticed between mCHG/mCHH methylation rations and gene expression specifically when plants grew under control conditions. Therefore, the methylome and transcriptome relationships vary based on the methylated sequence context, the methylated region within the gene, the protein-coding ability of the gene, and the salinity treatment. These results provide insights into interplay among DNA methylation and gene expression, and highlight the effect of salinity on the nature of this relationship, which may involve other genetic and epigenetic players under salt stress conditions. The results obtained from this project provide the first draft map of the differential methylome and transcriptome of date palm when exposed to an abiotic stress.
Casey, Maura E; Meade, Kieran G; Nalpas, Nicolas C; Taraktsoglou, Maria; Browne, John A; Killick, Kate E; Park, Stephen D E; Gormley, Eamonn; Hokamp, Karsten; Magee, David A; MacHugh, David E
2015-01-01
Johne's disease, caused by infection with Mycobacterium avium subsp. paratuberculosis, (MAP), is a chronic intestinal disease of ruminants with serious economic consequences for cattle production in the United States and elsewhere. During infection, MAP bacilli are phagocytosed and subvert host macrophage processes, resulting in subclinical infections that can lead to immunopathology and dissemination of disease. Analysis of the host macrophage transcriptome during infection can therefore shed light on the molecular mechanisms and host-pathogen interplay associated with Johne's disease. Here, we describe results of an in vitro study of the bovine monocyte-derived macrophage (MDM) transcriptome response during MAP infection using RNA-seq. MDM were obtained from seven age- and sex-matched Holstein-Friesian cattle and were infected with MAP across a 6-h infection time course with non-infected controls. We observed 245 and 574 differentially expressed (DE) genes in MAP-infected versus non-infected control samples (adjusted P value ≤0.05) at 2 and 6 h post-infection, respectively. Functional analyses of these DE genes, including biological pathway enrichment, highlighted potential functional roles for genes that have not been previously described in the host response to infection with MAP bacilli. In addition, differential expression of pro- and anti-inflammatory cytokine genes, such as those associated with the IL-10 signaling pathway, and other immune-related genes that encode proteins involved in the bovine macrophage response to MAP infection emphasize the balance between protective host immunity and bacilli survival and proliferation. Systematic comparisons of RNA-seq gene expression results with Affymetrix(®) microarray data generated from the same experimental samples also demonstrated that RNA-seq represents a superior technology for studying host transcriptional responses to intracellular infection.
Casey, Maura E.; Meade, Kieran G.; Nalpas, Nicolas C.; Taraktsoglou, Maria; Browne, John A.; Killick, Kate E.; Park, Stephen D. E.; Gormley, Eamonn; Hokamp, Karsten; Magee, David A.; MacHugh, David E.
2015-01-01
Johne’s disease, caused by infection with Mycobacterium avium subsp. paratuberculosis, (MAP), is a chronic intestinal disease of ruminants with serious economic consequences for cattle production in the United States and elsewhere. During infection, MAP bacilli are phagocytosed and subvert host macrophage processes, resulting in subclinical infections that can lead to immunopathology and dissemination of disease. Analysis of the host macrophage transcriptome during infection can therefore shed light on the molecular mechanisms and host-pathogen interplay associated with Johne’s disease. Here, we describe results of an in vitro study of the bovine monocyte-derived macrophage (MDM) transcriptome response during MAP infection using RNA-seq. MDM were obtained from seven age- and sex-matched Holstein-Friesian cattle and were infected with MAP across a 6-h infection time course with non-infected controls. We observed 245 and 574 differentially expressed (DE) genes in MAP-infected versus non-infected control samples (adjusted P value ≤0.05) at 2 and 6 h post-infection, respectively. Functional analyses of these DE genes, including biological pathway enrichment, highlighted potential functional roles for genes that have not been previously described in the host response to infection with MAP bacilli. In addition, differential expression of pro- and anti-inflammatory cytokine genes, such as those associated with the IL-10 signaling pathway, and other immune-related genes that encode proteins involved in the bovine macrophage response to MAP infection emphasize the balance between protective host immunity and bacilli survival and proliferation. Systematic comparisons of RNA-seq gene expression results with Affymetrix® microarray data generated from the same experimental samples also demonstrated that RNA-seq represents a superior technology for studying host transcriptional responses to intracellular infection. PMID:25699042
ORMAN: optimal resolution of ambiguous RNA-Seq multimappings in the presence of novel isoforms.
Dao, Phuong; Numanagić, Ibrahim; Lin, Yen-Yi; Hach, Faraz; Karakoc, Emre; Donmez, Nilgun; Collins, Colin; Eichler, Evan E; Sahinalp, S Cenk
2014-03-01
RNA-Seq technology is promising to uncover many novel alternative splicing events, gene fusions and other variations in RNA transcripts. For an accurate detection and quantification of transcripts, it is important to resolve the mapping ambiguity for those RNA-Seq reads that can be mapped to multiple loci: >17% of the reads from mouse RNA-Seq data and 50% of the reads from some plant RNA-Seq data have multiple mapping loci. In this study, we show how to resolve the mapping ambiguity in the presence of novel transcriptomic events such as exon skipping and novel indels towards accurate downstream analysis. We introduce ORMAN ( O ptimal R esolution of M ultimapping A mbiguity of R N A-Seq Reads), which aims to compute the minimum number of potential transcript products for each gene and to assign each multimapping read to one of these transcripts based on the estimated distribution of the region covering the read. ORMAN achieves this objective through a combinatorial optimization formulation, which is solved through well-known approximation algorithms, integer linear programs and heuristics. On a simulated RNA-Seq dataset including a random subset of transcripts from the UCSC database, the performance of several state-of-the-art methods for identifying and quantifying novel transcripts, such as Cufflinks, IsoLasso and CLIIQ, is significantly improved through the use of ORMAN. Furthermore, in an experiment using real RNA-Seq reads, we show that ORMAN is able to resolve multimapping to produce coverage values that are similar to the original distribution, even in genes with highly non-uniform coverage. ORMAN is available at http://orman.sf.net
Transcriptional atlas of cardiogenesis maps congenital heart disease interactome.
Li, Xing; Martinez-Fernandez, Almudena; Hartjes, Katherine A; Kocher, Jean-Pierre A; Olson, Timothy M; Terzic, Andre; Nelson, Timothy J
2014-07-01
Mammalian heart development is built on highly conserved molecular mechanisms with polygenetic perturbations resulting in a spectrum of congenital heart diseases (CHD). However, knowledge of cardiogenic ontogeny that regulates proper cardiogenesis remains largely based on candidate-gene approaches. Mapping the dynamic transcriptional landscape of cardiogenesis from a genomic perspective is essential to integrate the knowledge of heart development into translational applications that accelerate disease discovery efforts toward mechanistic-based treatment strategies. Herein, we designed a time-course transcriptome analysis to investigate the genome-wide dynamic expression landscape of innate murine cardiogenesis ranging from embryonic stem cells to adult cardiac structures. This comprehensive analysis generated temporal and spatial expression profiles, revealed stage-specific gene functions, and mapped the dynamic transcriptome of cardiogenesis to curated pathways. Reconciling known genetic underpinnings of CHD, we deconstructed a disease-centric dynamic interactome encoded within this cardiogenic atlas to identify stage-specific developmental disturbances clustered on regulation of epithelial-to-mesenchymal transition (EMT), BMP signaling, NF-AT signaling, TGFb-dependent EMT, and Notch signaling. Collectively, this cardiogenic transcriptional landscape defines the time-dependent expression of cardiac ontogeny and prioritizes regulatory networks at the interface between health and disease. Copyright © 2014 the American Physiological Society.
ReadXplorer—visualization and analysis of mapped sequences
Hilker, Rolf; Stadermann, Kai Bernd; Doppmeier, Daniel; Kalinowski, Jörn; Stoye, Jens; Straube, Jasmin; Winnebald, Jörn; Goesmann, Alexander
2014-01-01
Motivation: Fast algorithms and well-arranged visualizations are required for the comprehensive analysis of the ever-growing size of genomic and transcriptomic next-generation sequencing data. Results: ReadXplorer is a software offering straightforward visualization and extensive analysis functions for genomic and transcriptomic DNA sequences mapped on a reference. A unique specialty of ReadXplorer is the quality classification of the read mappings. It is incorporated in all analysis functions and displayed in ReadXplorer's various synchronized data viewers for (i) the reference sequence, its base coverage as (ii) normalizable plot and (iii) histogram, (iv) read alignments and (v) read pairs. ReadXplorer's analysis capability covers RNA secondary structure prediction, single nucleotide polymorphism and deletion–insertion polymorphism detection, genomic feature and general coverage analysis. Especially for RNA-Seq data, it offers differential gene expression analysis, transcription start site and operon detection as well as RPKM value and read count calculations. Furthermore, ReadXplorer can combine or superimpose coverage of different datasets. Availability and implementation: ReadXplorer is available as open-source software at http://www.readxplorer.org along with a detailed manual. Contact: rhilker@mikrobio.med.uni-giessen.de Supplementary information: Supplementary data are available at Bioinformatics online. PMID:24790157
2013-01-01
Background Adenosine-to-inosine (A-to-I) RNA editing is recognized as a cellular mechanism for generating both RNA and protein diversity. Inosine base pairs with cytidine during reverse transcription and therefore appears as guanosine during sequencing of cDNA. Current approaches of RNA editing identification largely depend on the comparison between transcriptomes and genomic DNA (gDNA) sequencing datasets from the same individuals, and it has been challenging to identify editing candidates from transcriptomes in the absence of gDNA information. Results We have developed a new strategy to accurately predict constitutive RNA editing sites from publicly available human RNA-seq datasets in the absence of relevant genomic sequences. Our approach establishes new parameters to increase the ability to map mismatches and to minimize sequencing/mapping errors and unreported genome variations. We identified 695 novel constitutive A-to-I editing sites that appear in clusters (named “editing boxes”) in multiple samples and which exhibit spatial and dynamic regulation across human tissues. Some of these editing boxes are enriched in non-repetitive regions lacking inverted repeat structures and contain an extremely high conversion frequency of As to Is. We validated a number of editing boxes in multiple human cell lines and confirmed that ADAR1 is responsible for the observed promiscuous editing events in non-repetitive regions, further expanding our knowledge of the catalytic substrate of A-to-I RNA editing by ADAR enzymes. Conclusions The approach we present here provides a novel way of identifying A-to-I RNA editing events by analyzing only RNA-seq datasets. This method has allowed us to gain new insights into RNA editing and should also aid in the identification of more constitutive A-to-I editing sites from additional transcriptomes. PMID:23537002
Spatial organization shapes the turnover of a bacterial transcriptome
Moffitt, Jeffrey R; Pandey, Shristi; Boettiger, Alistair N; Wang, Siyuan; Zhuang, Xiaowei
2016-01-01
Spatial organization of the transcriptome has emerged as a powerful means for regulating the post-transcriptional fate of RNA in eukaryotes; however, whether prokaryotes use RNA spatial organization as a mechanism for post-transcriptional regulation remains unclear. Here we used super-resolution microscopy to image the E. coli transcriptome and observed a genome-wide spatial organization of RNA: mRNAs encoding inner-membrane proteins are enriched at the membrane, whereas mRNAs encoding outer-membrane, cytoplasmic and periplasmic proteins are distributed throughout the cytoplasm. Membrane enrichment is caused by co-translational insertion of signal peptides recognized by the signal-recognition particle. Time-resolved RNA-sequencing revealed that degradation rates of inner-membrane-protein mRNAs are on average greater that those of the other mRNAs and that this selective destabilization of inner-membrane-protein mRNAs is abolished by dissociating the RNA degradosome from the membrane. Together, these results demonstrate that the bacterial transcriptome is spatially organized and suggest that this organization shapes the post-transcriptional dynamics of mRNAs. DOI: http://dx.doi.org/10.7554/eLife.13065.001 PMID:27198188
Tan, Chee K.; Carey, Alison J.; Cui, Xiangqin; Webb, Richard I.; Ipe, Deepak; Crowley, Michael; Cripps, Allan W.; Benjamin, William H.; Ulett, Kimberly B.; Schembri, Mark A.
2012-01-01
The most common causes of urinary tract infections (UTIs) are Gram-negative pathogens such as Escherichia coli; however, Gram-positive organisms, including Streptococcus agalactiae, or group B streptococcus (GBS), also cause UTI. In GBS infection, UTI progresses to cystitis once the bacteria colonize the bladder, but the host responses triggered in the bladder immediately following infection are largely unknown. Here, we used genome-wide expression profiling to map the bladder transcriptome of GBS UTI in mice infected transurethrally with uropathogenic GBS that was cultured from a 35-year-old women with cystitis. RNA from bladders was applied to Affymetrix Gene-1.0ST microarrays; quantitative reverse transcriptase PCR (qRT-PCR) was used to analyze selected gene responses identified in array data sets. A surprisingly small significant-gene list of 172 genes was identified at 24 h; this compared to 2,507 genes identified in a side-by-side comparison with uropathogenic E. coli (UPEC). No genes exhibited significantly altered expression at 2 h in GBS-infected mice according to arrays despite high bladder bacterial loads at this early time point. The absence of a marked early host response to GBS juxtaposed with broad-based bladder responses activated by UPEC at 2 h. Bioinformatics analyses, including integrative system-level network mapping, revealed multiple activated biological pathways in the GBS bladder transcriptome that regulate leukocyte activation, inflammation, apoptosis, and cytokine-chemokine biosynthesis. These findings define a novel, minimalistic type of bladder host response triggered by GBS UTI, which comprises collective antimicrobial pathways that differ dramatically from those activated by UPEC. Overall, this study emphasizes the unique nature of bladder immune activation mechanisms triggered by distinct uropathogens. PMID:22733575
Cooper, Laurel D.; Kishore, Venkata K.; Knapp, Steven J.; Kling, Jennifer G.
2015-01-01
The seed oil of meadowfoam, a new crop in the Limnanthaceae family, is highly enriched in very long chain fatty acids that are desaturated at the Δ5 position. The unusual oil is desirable for cosmetics and innovative industrial applications and the seed meal remaining after oil extraction contains glucolimnanthin, a methoxylated benzylglucosinolate whose degradation products are herbicidal and anti-microbial. Here we describe EST analysis of the developing seed transcriptome that identified major genes involved in biosynthesis and assembly of the seed oil and in glucosinolate metabolic pathways. mRNAs encoding acyl-CoA Δ5 desaturase were notably abundant. The library was searched for simple sequence repeats (SSRs) and single nucleotide polymorphisms (SNPs). Fifty-four new SSR markers and eight candidate gene markers were developed and combined with previously developed SSRs to construct a new genetic map for Limnanthes alba. Mapped genes in the lipid biosynthetic pathway encode 3-ketoacyl-CoA synthase (KCS), Δ5 desaturase (Δ5DS), lysophosphatidylacyl-acyl transferase (LPAT), and acyl-CoA diacylglycerol acyl transferase (DGAT). Mapped genes in glucosinolate biosynthetic and degradation pathways encode CYP79A, myrosinase (TGG), and epithiospecifier modifier protein (ESM). The resources developed in this study will further the domestication and improvement of meadowfoam as an oilseed crop. PMID:26038713
Phelix, C F; Feltus, F A
2015-01-01
Measuring biomarkers from plant tissue samples is challenging and expensive when the desire is to integrate transcriptomics, fluxomics, metabolomics, lipidomics, proteomics, physiomics and phenomics. We present a computational biology method where only the transcriptome needs to be measured and is used to derive a set of parameters for deterministic kinetic models of metabolic pathways. The technology is called Transcriptome-To-Metabolome (TTM) biosimulations, currently under commercial development, but available for non-commercial use by researchers. The simulated results on metabolites of 30 primary and secondary metabolic pathways in rice (Oryza sativa) were used as the biomarkers to predict whether the transcriptome was from a plant that had been under drought conditions. The rice transcriptomes were accessed from public archives and each individual plant was simulated. This unique quality of the TTM technology allows standard analyses on biomarker assessments, i.e. sensitivity, specificity, positive and negative predictive values, accuracy, receiver operator characteristics (ROC) curve and area under the ROC curve (AUC). Two validation methods were also used, the holdout and 10-fold cross validations. Initially 17 metabolites were identified as candidate biomarkers based on either statistical significance on binary phenotype when compared with control samples or recognition from the literature. The top three biomarkers based on AUC were gibberellic acid 12 (0.89), trehalose (0.80) and sn1-palmitate-sn2-oleic-phosphatidylglycerol (0.70). Neither heat map analyses of transcriptomes nor all 300 metabolites clustered the stressed and control groups effectively. The TTM technology allows the emergent properties of the integrated system to generate unique and useful 'Omics' information. © 2014 German Botanical Society and The Royal Botanical Society of the Netherlands.
Dufresnes, Christophe; Brelsford, Alan; Béziers, Paul; Perrin, Nicolas
2014-07-01
A simple way to quickly optimize microsatellites in nonmodel organisms is to reuse loci available in closely related taxa; however, this approach can be limited by the stochastic and low cross-amplification success experienced in some groups (e.g. amphibians). An efficient alternative is to develop loci from transcriptome sequences. Transcriptomic microsatellites have been found to vary in their levels of cross-species amplification and variability, but this has to date never been tested in amphibians. Here, we compare the patterns of cross-amplification and levels of polymorphism of 18 published anonymous microsatellites isolated from genomic DNA vs. 17 loci derived from a transcriptome, across nine species of tree frogs (Hyla arborea and Hyla cinerea group). We established a clear negative relationship between divergence time and amplification success, which was much steeper for anonymous than transcriptomic markers, with half-lives (time at which 50% of the markers still amplify) of 1.1 and 37 My, respectively. Transcriptomic markers are significantly less polymorphic than anonymous loci, but remain variable across diverged taxa. We conclude that the exploitation of amphibian transcriptomes for developing microsatellites seems an optimal approach for multispecies surveys (e.g. analyses of hybrid zones, comparative linkage mapping), whereas anonymous microsatellites may be more informative for fine-scale analyses of intraspecific variation. Moreover, our results confirm the pattern that microsatellite cross-amplification is greatly variable among amphibians and should be assessed independently within target lineages. Finally, we provide a bank of microsatellites for Palaearctic tree frogs (so far only available for H. arborea), which will be useful for conservation and evolutionary studies in this radiation. © 2013 John Wiley & Sons Ltd.
Kulkarni, Kalyani S.; Madhu Babu, P.; Sanjeeva Rao, D.; Surekha, K.; Ravindra Babu, V
2018-01-01
Polished rice is poor source of micronutrients, however wide genotypic variability exists for zinc uptake and remobilization and zinc content in brown and polished grains in rice. Two landraces (Chittimutyalu and Kala Jeera Joha) and one popular improved variety (BPT 5204) were grown under zinc sufficient soil and their analyses showed high zinc in straw of improved variety, but high zinc in polished rice in landraces suggesting better translocation ability of zinc into the grain in landraces. Transcriptome analyses of the panicle tissue showed 41182 novel transcripts across three samples. Out of 1011 differentially expressed exclusive transcripts by two landraces, 311 were up regulated and 534 were down regulated. Phosphate transporter-exporter (PHO), proton-coupled peptide transporters (POT) and vacuolar iron transporter (VIT) showed enhanced and significant differential expression in landraces. Out of 24 genes subjected to quantitative real time analyses for confirmation, eight genes showed significant differential expression in landraces. Through mapping, six rice microsatellite markers spanning the genomic regions of six differentially expressed genes were validated for their association with zinc in brown and polished rice using recombinant inbred lines (RIL) of BPT 5204/Chittimutyalu. Thus, this study reports repertoire of genes associated with high zinc in polished rice and a proof concept for deployment of transcriptome information for validation in mapping population and its use in marker assisted selection for biofortification of rice with zinc. PMID:29394277
Huang, Guangzao; Yuan, Mingshun; Chen, Moliang; Li, Lei; You, Wenjie; Li, Hanjie; Cai, James J; Ji, Guoli
2017-10-07
The application of machine learning in cancer diagnostics has shown great promise and is of importance in clinic settings. Here we consider applying machine learning methods to transcriptomic data derived from tumor-educated platelets (TEPs) from individuals with different types of cancer. We aim to define a reliability measure for diagnostic purposes to increase the potential for facilitating personalized treatments. To this end, we present a novel classification method called MFRB (for Multiple Fitting Regression and Bayes decision), which integrates the process of multiple fitting regression (MFR) with Bayes decision theory. MFR is first used to map multidimensional features of the transcriptomic data into a one-dimensional feature. The probability density function of each class in the mapped space is then adjusted using the Gaussian probability density function. Finally, the Bayes decision theory is used to build a probabilistic classifier with the estimated probability density functions. The output of MFRB can be used to determine which class a sample belongs to, as well as to assign a reliability measure for a given class. The classical support vector machine (SVM) and probabilistic SVM (PSVM) are used to evaluate the performance of the proposed method with simulated and real TEP datasets. Our results indicate that the proposed MFRB method achieves the best performance compared to SVM and PSVM, mainly due to its strong generalization ability for limited, imbalanced, and noisy data.
Neeraja, C N; Kulkarni, Kalyani S; Madhu Babu, P; Sanjeeva Rao, D; Surekha, K; Ravindra Babu, V
2018-01-01
Polished rice is poor source of micronutrients, however wide genotypic variability exists for zinc uptake and remobilization and zinc content in brown and polished grains in rice. Two landraces (Chittimutyalu and Kala Jeera Joha) and one popular improved variety (BPT 5204) were grown under zinc sufficient soil and their analyses showed high zinc in straw of improved variety, but high zinc in polished rice in landraces suggesting better translocation ability of zinc into the grain in landraces. Transcriptome analyses of the panicle tissue showed 41182 novel transcripts across three samples. Out of 1011 differentially expressed exclusive transcripts by two landraces, 311 were up regulated and 534 were down regulated. Phosphate transporter-exporter (PHO), proton-coupled peptide transporters (POT) and vacuolar iron transporter (VIT) showed enhanced and significant differential expression in landraces. Out of 24 genes subjected to quantitative real time analyses for confirmation, eight genes showed significant differential expression in landraces. Through mapping, six rice microsatellite markers spanning the genomic regions of six differentially expressed genes were validated for their association with zinc in brown and polished rice using recombinant inbred lines (RIL) of BPT 5204/Chittimutyalu. Thus, this study reports repertoire of genes associated with high zinc in polished rice and a proof concept for deployment of transcriptome information for validation in mapping population and its use in marker assisted selection for biofortification of rice with zinc.
A 2-D guinea pig lung proteome map
USDA-ARS?s Scientific Manuscript database
Guinea pigs represent an important model for a number of infectious and non-infectious pulmonary diseases. The guinea pig genome has recently been sequenced to full coverage, opening up new research avenues using genomics, transcriptomics and proteomics techniques in this species. In order to furth...
Massively parallel digital transcriptional profiling of single cells
Zheng, Grace X. Y.; Terry, Jessica M.; Belgrader, Phillip; Ryvkin, Paul; Bent, Zachary W.; Wilson, Ryan; Ziraldo, Solongo B.; Wheeler, Tobias D.; McDermott, Geoff P.; Zhu, Junjie; Gregory, Mark T.; Shuga, Joe; Montesclaros, Luz; Underwood, Jason G.; Masquelier, Donald A.; Nishimura, Stefanie Y.; Schnall-Levin, Michael; Wyatt, Paul W.; Hindson, Christopher M.; Bharadwaj, Rajiv; Wong, Alexander; Ness, Kevin D.; Beppu, Lan W.; Deeg, H. Joachim; McFarland, Christopher; Loeb, Keith R.; Valente, William J.; Ericson, Nolan G.; Stevens, Emily A.; Radich, Jerald P.; Mikkelsen, Tarjei S.; Hindson, Benjamin J.; Bielas, Jason H.
2017-01-01
Characterizing the transcriptome of individual cells is fundamental to understanding complex biological systems. We describe a droplet-based system that enables 3′ mRNA counting of tens of thousands of single cells per sample. Cell encapsulation, of up to 8 samples at a time, takes place in ∼6 min, with ∼50% cell capture efficiency. To demonstrate the system's technical performance, we collected transcriptome data from ∼250k single cells across 29 samples. We validated the sensitivity of the system and its ability to detect rare populations using cell lines and synthetic RNAs. We profiled 68k peripheral blood mononuclear cells to demonstrate the system's ability to characterize large immune populations. Finally, we used sequence variation in the transcriptome data to determine host and donor chimerism at single-cell resolution from bone marrow mononuclear cells isolated from transplant patients. PMID:28091601
Wu, Jing Qin; Wang, Xi; Beveridge, Natalie J.; Tooney, Paul A.; Scott, Rodney J.; Carr, Vaughan J.; Cairns, Murray J.
2012-01-01
Background While hybridization based analysis of the cortical transcriptome has provided important insight into the neuropathology of schizophrenia, it represents a restricted view of disease-associated gene activity based on predetermined probes. By contrast, sequencing technology can provide un-biased analysis of transcription at nucleotide resolution. Here we use this approach to investigate schizophrenia-associated cortical gene expression. Methodology/Principal Findings The data was generated from 76 bp reads of RNA-Seq, aligned to the reference genome and assembled into transcripts for quantification of exons, splice variants and alternative promoters in postmortem superior temporal gyrus (STG/BA22) from 9 male subjects with schizophrenia and 9 matched non-psychiatric controls. Differentially expressed genes were then subjected to further sequence and functional group analysis. The output, amounting to more than 38 Gb of sequence, revealed significant alteration of gene expression including many previously shown to be associated with schizophrenia. Gene ontology enrichment analysis followed by functional map construction identified three functional clusters highly relevant to schizophrenia including neurotransmission related functions, synaptic vesicle trafficking, and neural development. Significantly, more than 2000 genes displayed schizophrenia-associated alternative promoter usage and more than 1000 genes showed differential splicing (FDR<0.05). Both types of transcriptional isoforms were exemplified by reads aligned to the neurodevelopmentally significant doublecortin-like kinase 1 (DCLK1) gene. Conclusions This study provided the first deep and un-biased analysis of schizophrenia-associated transcriptional diversity within the STG, and revealed variants with important implications for the complex pathophysiology of schizophrenia. PMID:22558445
The Secret Life of RNA: Lessons from Emerging Methodologies.
Medioni, Caroline; Besse, Florence
2018-01-01
The last past decade has witnessed a revolution in our appreciation of transcriptome complexity and regulation. This remarkable expansion in our knowledge largely originates from the advent of high-throughput methodologies, and the consecutive discovery that up to 90% of eukaryotic genomes are transcribed, thus generating an unanticipated large range of noncoding RNAs (Hangauer et al., 15(4):112, 2014). Besides leading to the identification of new noncoding RNA species, transcriptome-wide studies have uncovered novel layers of posttranscriptional regulatory mechanisms controlling RNA processing, maturation or translation, and each contributing to the precise and dynamic regulation of gene expression. Remarkably, the development of systems-level studies has been accompanied by tremendous progress in the visualization of individual RNA molecules in single cells, such that it is now possible to image RNA species with a single-molecule resolution from birth to translation or decay. Monitoring quantitatively, with unprecedented spatiotemporal resolution, the fate of individual molecules has been key to understanding the molecular mechanisms underlying the different steps of RNA regulation. This has also revealed biologically relevant, intracellular and intercellular heterogeneities in RNA distribution or regulation. More recently, the convergence of imaging and high-throughput technologies has led to the emergence of spatially resolved transcriptomic techniques that provide a means to perform large-scale analyses while preserving spatial information. By generating transcriptome-wide data on single-cell RNA content, or even subcellular RNA distribution, these methodologies are opening avenues to a wide range of network-level studies at the cell and organ-level, and promise to strongly improve disease diagnostic and treatment.In this introductory chapter, we highlight how recently developed technologies aiming at detecting and visualizing RNA molecules have contributed to the emergence of entirely new research fields, and to dramatic progress in our understanding of gene expression regulation.
Coastal habitat mapping in the Aegean Sea using high resolution orthophoto maps
NASA Astrophysics Data System (ADS)
Topouzelis, Konstantinos; Papakonstantinou, Apostolos; Doukari, Michaela; Stamatis, Panagiotis; Makri, Despina; Katsanevakis, Stelios
2017-09-01
The significance of coastal habitat mapping lies in the need to prevent from anthropogenic interventions and other factors. Until 2015, Landsat-8 (30m) imagery were used as medium spatial resolution satellite imagery. So far, Sentinel-2 satellite imagery is very useful for more detailed regional scale mapping. However, the use of high resolution orthophoto maps, which are determined from UAV data, is expected to improve the mapping accuracy. This is due to small spatial resolution of the orthophoto maps (30 cm). This paper outlines the integration of UAS for data acquisition and Structure from Motion (SfM) pipeline for the visualization of selected coastal areas in the Aegean Sea. Additionally, the produced orthophoto maps analyzed through an object-based image analysis (OBIA) and nearest-neighbor classification for mapping the coastal habitats. Classification classes included the main general habitat types, i.e. seagrass, soft bottom, and hard bottom The developed methodology applied at the Koumbara beach (Ios Island - Greece). Results showed that UAS's data revealed the sub-bottom complexity in large shallow areas since they provide such information in the spatial resolution that permits the mapping of seagrass meadows with extreme detail. The produced habitat vectors are ideal as reference data for studies with satellite data of lower spatial resolution.
Transcriptomics of cortical gray matter thickness decline during normal aging
Kochunov, P; Charlesworth, J; Winkler, A; Hong, LE; Nichols, T; Curran, JE; Sprooten, E; Jahanshad, N; Thompson, PM; Johnson, MP; Kent, JW; Landman, BA; Mitchell, B; Cole, SA; Dyer, TD; Moses, EK; Goring, HHH; Almasy, L; Duggirala, R; Olvera, RL; Glahn, DC; Blangero, J
2013-01-01
Introduction We performed a whole-transcriptome correlation analysis, followed by the pathway enrichment and testing of innate immune response pathways analyses to evaluate the hypothesis that transcriptional activity can predict cortical gray matter thickness (GMT) variability during normal cerebral aging Methods Transcriptome and GMT data were availabe for 379 individuals (age range=28–85) community-dwelling members of large extended Mexican-American families. Collection of transcriptome data preceded that of neuroimaging data by 17 years. Genome-wide gene transcriptome data consisted of 20,413 heritable lymphocytes-based transcripts. GMT measurements were performed from high-resolution (isotropic 800µm) T1-weighted MRI. Transcriptome-wide and pathway enrichment analysis was used to classify genes correlated with GMT. Transcripts for sixty genes from seven innate immune pathways were tested as specific predictors of GMT variability. Results Transcripts for eight genes (IGFBP3, LRRN3, CRIP2, SCD, IDS, TCF4, GATA3, HN1) passed the transcriptome-wide significance threshold. Four orthogonal factors extracted from this set predicted 31.9% of the variability in the whole-brain and between 23.4 and 35% of regional GMT measurements. Pathway enrichment analysis identified six functional categories including cellular proliferation, aggregation, differentiation, viral infection, and metabolism. The integrin signaling pathway was significantly (p<10−6) enriched with GMT. Finally, three innate immune pathways (complement signaling, toll-receptors and scavenger and immunoglobulins) were significantly associated with GMT. Conclusion Expression activity for the genes that regulate cellular proliferation, adhesion, differentiation and inflammation can explain a significant proportion of individual variability in cortical GMT. Our findings suggest that normal cerebral aging is the product of a progressive decline in regenerative capacity and increased neuroinflammation. PMID:23707588
Transcriptomics of cortical gray matter thickness decline during normal aging.
Kochunov, P; Charlesworth, J; Winkler, A; Hong, L E; Nichols, T E; Curran, J E; Sprooten, E; Jahanshad, N; Thompson, P M; Johnson, M P; Kent, J W; Landman, B A; Mitchell, B; Cole, S A; Dyer, T D; Moses, E K; Goring, H H H; Almasy, L; Duggirala, R; Olvera, R L; Glahn, D C; Blangero, J
2013-11-15
We performed a whole-transcriptome correlation analysis, followed by the pathway enrichment and testing of innate immune response pathway analyses to evaluate the hypothesis that transcriptional activity can predict cortical gray matter thickness (GMT) variability during normal cerebral aging. Transcriptome and GMT data were available for 379 individuals (age range=28-85) community-dwelling members of large extended Mexican American families. Collection of transcriptome data preceded that of neuroimaging data by 17 years. Genome-wide gene transcriptome data consisted of 20,413 heritable lymphocytes-based transcripts. GMT measurements were performed from high-resolution (isotropic 800 μm) T1-weighted MRI. Transcriptome-wide and pathway enrichment analysis was used to classify genes correlated with GMT. Transcripts for sixty genes from seven innate immune pathways were tested as specific predictors of GMT variability. Transcripts for eight genes (IGFBP3, LRRN3, CRIP2, SCD, IDS, TCF4, GATA3, and HN1) passed the transcriptome-wide significance threshold. Four orthogonal factors extracted from this set predicted 31.9% of the variability in the whole-brain and between 23.4 and 35% of regional GMT measurements. Pathway enrichment analysis identified six functional categories including cellular proliferation, aggregation, differentiation, viral infection, and metabolism. The integrin signaling pathway was significantly (p<10(-6)) enriched with GMT. Finally, three innate immune pathways (complement signaling, toll-receptors and scavenger and immunoglobulins) were significantly associated with GMT. Expression activity for the genes that regulate cellular proliferation, adhesion, differentiation and inflammation can explain a significant proportion of individual variability in cortical GMT. Our findings suggest that normal cerebral aging is the product of a progressive decline in regenerative capacity and increased neuroinflammation. Copyright © 2013 Elsevier Inc. All rights reserved.
Higher resolution satellite remote sensing and the impact on image mapping
Watkins, Allen H.; Thormodsgard, June M.
1987-01-01
Recent advances in spatial, spectral, and temporal resolution of civil land remote sensing satellite data are presenting new opportunities for image mapping applications. The U.S. Geological Survey's experimental satellite image mapping program is evolving toward larger scale image map products with increased information content as a result of improved image processing techniques and increased resolution. Thematic mapper data are being used to produce experimental image maps at 1:100,000 scale that meet established U.S. and European map accuracy standards. Availability of high quality, cloud-free, 30-meter ground resolution multispectral data from the Landsat thematic mapper sensor, along with 10-meter ground resolution panchromatic and 20-meter ground resolution multispectral data from the recently launched French SPOT satellite, present new cartographic and image processing challenges.The need to fully exploit these higher resolution data increases the complexity of processing the images into large-scale image maps. The removal of radiometric artifacts and noise prior to geometric correction can be accomplished by using a variety of image processing filters and transforms. Sensor modeling and image restoration techniques allow maximum retention of spatial and radiometric information. An optimum combination of spectral information and spatial resolution can be obtained by merging different sensor types. These processing techniques are discussed and examples are presented.
Triwitayakorn, Kanokporn; Chatkulkawin, Pornsupa; Kanjanawattanawong, Supanath; Sraphet, Supajit; Yoocha, Thippawan; Sangsrakru, Duangjai; Chanprasert, Juntima; Ngamphiw, Chumpol; Jomchai, Nukoon; Therawattanasuk, Kanikar; Tangphatsornruang, Sithichoke
2011-01-01
To obtain more information on the Hevea brasiliensis genome, we sequenced the transcriptome from the vegetative shoot apex yielding 2 311 497 reads. Clustering and assembly of the reads produced a total of 113 313 unique sequences, comprising 28 387 isotigs and 84 926 singletons. Also, 17 819 expressed sequence tag (EST)-simple sequence repeats (SSRs) were identified from the data set. To demonstrate the use of this EST resource for marker development, primers were designed for 430 of the EST-SSRs. Three hundred and twenty-three primer pairs were amplifiable in H. brasiliensis clones. Polymorphic information content values of selected 47 SSRs among 20 H. brasiliensis clones ranged from 0.13 to 0.71, with an average of 0.51. A dendrogram of genetic similarities between the 20 H. brasiliensis clones using these 47 EST-SSRs suggested two distinct groups that correlated well with clone pedigree. These novel EST-SSRs together with the published SSRs were used for the construction of an integrated parental linkage map of H. brasiliensis based on 81 lines of an F1 mapping population. The map consisted of 97 loci, consisting of 37 novel EST-SSRs and 60 published SSRs, distributed on 23 linkage groups and covered 842.9 cM with a mean interval of 11.9 cM and ∼4 loci per linkage group. Although the numbers of linkage groups exceed the haploid number (18), but with several common markers between homologous linkage groups with the previous map indicated that the F1 map in this study is appropriate for further study in marker-assisted selection. PMID:22086998
Castellanos, Elena; Aranaz, Alicia; de Juan, Lucia; Dominguez, Lucas; Linedale, Richard; Bull, Tim J
2012-09-14
In this study we characterise the genomic and transcriptomic variability of a natural deletion strain of Mycobacterium avium subspecies paratuberculosis (MAP) prevalent in Spanish Guadarrama goats. Using a pan-genome microarray including MAP and M. avium subspecies hominissuis 104 genomes (MAPAC) we demonstrate the genotype to be MAP Type II with a single deletion of 19 contiguous ORFs (16 kb) including a complete mammalian cell entry (mce7_1) operon and adjacent proline-glutamic acid (PE)/proline-proline-glutamic acid (PPE) genes. A deletion specific PCR test was developed and a subsequent screening identified four goat herds infected with the variant strain. Each was located in central Spain and showed epidemiological links suggestive of transmission between herds. A majority of animals infected with the variant manifested a paucibacillary form of the disease. Comparisons between virulent complete genome compliment strains isolated from multibacillary diseased goats and the MAP variant strain during entry into activated macrophages demonstrated an increased sensitivity in the variant to intracellular killing in human and ovine macrophages. As PPE and mce genes are associated with mycobacterial virulence and pathogenesis we investigated the interplay of these gene sets during cell entry using the MAPAC array. This showed significant differential transcriptome profiles compared to full genome complement MAP controls that included changes in other undeleted mce operons and PE/PPE genes, esx-like signalling operons and stress response/fatty acid metabolism pathways. This strain represents the first report of a MAP Type II genotype with significant natural genomic deletions which remains able to cause disease and is transmissible in goats. Copyright © 2012 Elsevier B.V. All rights reserved.
Kopf, Matthias; Klähn, Stephan; Scholz, Ingeborg; Hess, Wolfgang R; Voß, Björn
2015-04-22
In all studied organisms, a substantial portion of the transcriptome consists of non-coding RNAs that frequently execute regulatory functions. Here, we have compared the primary transcriptomes of the cyanobacteria Synechocystis sp. PCC 6714 and PCC 6803 under 10 different conditions. These strains share 2854 protein-coding genes and a 16S rRNA identity of 99.4%, indicating their close relatedness. Conserved major transcriptional start sites (TSSs) give rise to non-coding transcripts within the sigB gene, from the 5'UTRs of cmpA and isiA, and 168 loci in antisense orientation. Distinct differences include single nucleotide polymorphisms rendering promoters inactive in one of the strains, e.g., for cmpR and for the asRNA PsbA2R. Based on the genome-wide mapped location, regulation and classification of TSSs, non-coding transcripts were identified as the most dynamic component of the transcriptome. We identified a class of mRNAs that originate by read-through from an sRNA that accumulates as a discrete and abundant transcript while also serving as the 5'UTR. Such an sRNA/mRNA structure, which we name 'actuaton', represents another way for bacteria to remodel their transcriptional network. Our findings support the hypothesis that variations in the non-coding transcriptome constitute a major evolutionary element of inter-strain divergence and capability for physiological adaptation.
High resolution physical mapping of single gene fragments on pachytene chromosome 4 and 7 of Rosa.
Kirov, Ilya V; Van Laere, Katrijn; Khrustaleva, Ludmila I
2015-07-02
Rosaceae is a family containing many economically important fruit and ornamental species. Although fluorescence in situ hybridization (FISH)-based physical mapping of plant genomes is a valuable tool for map-based cloning, comparative genomics and evolutionary studies, no studies using high resolution physical mapping have been performed in this family. Previously we proved that physical mapping of single-copy genes as small as 1.1 kb is possible on mitotic metaphase chromosomes of Rosa wichurana using Tyramide-FISH. In this study we aimed to further improve the physical map of Rosa wichurana by applying high resolution FISH to pachytene chromosomes. Using high resolution Tyramide-FISH and multicolor Tyramide-FISH, 7 genes (1.7-3 kb) were successfully mapped on pachytene chromosomes 4 and 7 of Rosa wichurana. Additionally, by using multicolor Tyramide-FISH three closely located genes were simultaneously visualized on chromosome 7. A detailed map of heterochromatine/euchromatine patterns of chromosome 4 and 7 was developed with indication of the physical position of these 7 genes. Comparison of the gene order between Rosa wichurana and Fragaria vesca revealed a poor collinearity for chromosome 7, but a perfect collinearity for chromosome 4. High resolution physical mapping of short probes on pachytene chromosomes of Rosa wichurana was successfully performed for the first time. Application of Tyramide-FISH on pachytene chromosomes allowed the mapping resolution to be increased up to 20 times compared to mitotic metaphase chromosomes. High resolution Tyramide-FISH and multicolor Tyramide-FISH might become useful tools for further physical mapping of single-copy genes and for the integration of physical and genetic maps of Rosa wichurana and other members of the Rosaceae.
Transcriptional activity of transposable elements in coelacanth.
Forconi, Mariko; Chalopin, Domitille; Barucca, Marco; Biscotti, Maria Assunta; De Moro, Gianluca; Galiana, Delphine; Gerdol, Marco; Pallavicini, Alberto; Canapa, Adriana; Olmo, Ettore; Volff, Jean-Nicolas
2014-09-01
The morphological stasis of coelacanths has long suggested a slow evolutionary rate. General genomic stasis might also imply a decrease of transposable elements activity. To evaluate the potential activity of transposable elements (TEs) in "living fossil" species, transcriptomic data of Latimeria chalumnae and its Indonesian congener Latimeria menadoensis were compared through the RNA-sequencing mapping procedures in three different organs (liver, testis, and muscle). The analysis of coelacanth transcriptomes highlights a significant percentage of transcribed TEs in both species. Major contributors are LINE retrotransposons, especially from the CR1 family. Furthermore, some particular elements such as a LF-SINE and a LINE2 sequences seem to be more expressed than other elements. The amount of TEs expressed in testis suggests possible transposition burst in incoming generations. Moreover, significant amount of TEs in liver and muscle transcriptomes were also observed. Analyses of elements displaying marked organ-specific expression gave us the opportunity to highlight exaptation cases, that is, the recruitment of TEs as new cellular genes, but also to identify a new Latimeria-specific family of Short Interspersed Nuclear Elements called CoeG-SINEs. Overall, transcriptome results do not seem to be in line with a slow-evolving genome with poor TE activity. © 2013 Wiley Periodicals, Inc.
Shen, Yingjia; Venu, R.C.; Nobuta, Kan; Wu, Xiaohui; Notibala, Varun; Demirci, Caghan; Meyers, Blake C.; Wang, Guo-Liang; Ji, Guoli; Li, Qingshun Q.
2011-01-01
Polyadenylation sites mark the ends of mRNA transcripts. Alternative polyadenylation (APA) may alter sequence elements and/or the coding capacity of transcripts, a mechanism that has been demonstrated to regulate gene expression and transcriptome diversity. To study the role of APA in transcriptome dynamics, we analyzed a large-scale data set of RNA “tags” that signify poly(A) sites and expression levels of mRNA. These tags were derived from a wide range of tissues and developmental stages that were mutated or exposed to environmental treatments, and generated using digital gene expression (DGE)–based protocols of the massively parallel signature sequencing (MPSS-DGE) and the Illumina sequencing-by-synthesis (SBS-DGE) sequencing platforms. The data offer a global view of APA and how it contributes to transcriptome dynamics. Upon analysis of these data, we found that ∼60% of Arabidopsis genes have multiple poly(A) sites. Likewise, ∼47% and 82% of rice genes use APA, supported by MPSS-DGE and SBS-DGE tags, respectively. In both species, ∼49%–66% of APA events were mapped upstream of annotated stop codons. Interestingly, 10% of the transcriptomes are made up of APA transcripts that are differentially distributed among developmental stages and in tissues responding to environmental stresses, providing an additional level of transcriptome dynamics. Examples of pollen-specific APA switching and salicylic acid treatment-specific APA clearly demonstrated such dynamics. The significance of these APAs is more evident in the 3034 genes that have conserved APA events between rice and Arabidopsis. PMID:21813626
Comparative de novo transcriptome analysis of male and female Sea buckthorn.
Bansal, Ankush; Salaria, Mehul; Sharma, Tashil; Stobdan, Tsering; Kant, Anil
2018-02-01
Sea buckthorn is a dioecious medicinal plant found at high altitude. The plant has both male and female reproductive organs in separate individuals. In this article, whole transcriptome de novo assemblies of male and female flower bud samples were carried out using Illumina NextSeq 500 platform to determine the role of the genes involved in sex determination. Moreover, genes with differential expression in male and female transcriptomes were identified to understand the underlying sex determination mechanism. The current study showed 63,904 and 62,272 coding sequences (CDS) in female and male transcriptome data sets, respectively. 16,831 common CDS were screened out from both transcriptomes, out of which 625 were upregulated and 491 were found to be downregulated. To understand the potential regulatory roles of differentially expressed genes in metabolic networks and biosynthetic pathways: KEGG mapping, gene ontology, and co-expression network analysis were performed. Comparison with Flowering Interactive Database (FLOR-ID) resulted in eight differentially expressed genes viz. CHD3-type chromatin-remodeling factor PICKLE ( PKL ), phytochrome-associated serine/threonine-protein phosphatase ( FYPP ), protein TOPLESS ( TPL ), sensitive to freezing 6 ( SFR6 ), lysine-specific histone demethylase 1 homolog 1 ( LDL1 ), pre-mRNA-processing-splicing factor 8A ( PRP8A ), sucrose synthase 4 ( SUS4 ), ubiquitin carboxyl-terminal hydrolase 12 ( UBP12 ), known to be broadly involved in flowering, photoperiodism, embryo development, and cold response pathways. Male and female flower bud transcriptome data of Sea buckthorn may provide comprehensive information at genomic level for the identification of genetic regulation involved in sex determination.
Intercomparison of Satellite-Derived Snow-Cover Maps
NASA Technical Reports Server (NTRS)
Hall, Dorothy K.; Tait, Andrew B.; Foster, James L.; Chang, Alfred T. C.; Allen, Milan
1999-01-01
In anticipation of the launch of the Earth Observing System (EOS) Terra, and the PM-1 spacecraft in 1999 and 2000, respectively, efforts are ongoing to determine errors of satellite-derived snow-cover maps. EOS Moderate Resolution Imaging Spectroradiometer (MODIS) and Advanced Microwave Scanning Radiometer-E (AMSR-E) snow-cover products will be produced. For this study we compare snow maps covering the same study area acquired from different sensors using different snow- mapping algorithms. Four locations are studied: 1) southern Saskatchewan; 2) a part of New England (New Hampshire, Vermont and Massachusetts) and eastern New York; 3) central Idaho and western Montana; and 4) parts of North and South Dakota. Snow maps were produced using a prototype MODIS snow-mapping algorithm used on Landsat Thematic Mapper (TM) scenes of each study area at 30-m and when the TM data were degraded to 1 -km resolution. National Operational Hydrologic Remote Sensing Center (NOHRSC) 1 -km resolution snow maps were also used, as were snow maps derived from 1/2 deg. x 1/2 deg. resolution Special Sensor Microwave Imager (SSM/1) data. A land-cover map derived from the International Geosphere-Biosphere Program (IGBP) land-cover map of North America was also registered to the scenes. The TM, NOHRSC and SSM/I snow maps, and land-cover maps were compared digitally. In most cases, TM-derived maps show less snow cover than the NOHRSC and SSM/I maps because areas of incomplete snow cover in forests (e.g., tree canopies, branches and trunks) are seen in the TM data, but not in the coarser-resolution maps. The snow maps generally agree with respect to the spatial variability of the snow cover. The 30-m resolution TM data provide the most accurate snow maps, and are thus used as the baseline for comparison with the other maps. Comparisons show that the percent change in amount of snow cover relative to the 3 0-m resolution TM maps is lowest using the TM I -km resolution maps, ranging from 0 to 40%. The highest percent change (less than 100%) is found in the New England study area, probably due to the presence of patchy snow cover. A scene with patchy snow cover is more difficult to map accurately than is a scene with a well-defined snowline such as is found on the North and South Dakota scene where the percent change ranged from 0 to 40%. There are also some important differences in the amount of snow mapped using the two different SSM/I algorithms because they utilize different channels.
Rosa-Garrido, Manuel; Chapski, Douglas J.; Schmitt, Anthony D.; Kimball, Todd H.; Karbassi, Elaheh; Monte, Emma; Balderas, Enrique; Pellegrini, Matteo; Shih, Tsai-Ting; Soehalim, Elizabeth; Liem, David; Ping, Peipei; Galjart, Niels J.; Ren, Shuxun; Wang, Yibin; Ren, Bing
2017-01-01
Background: Cardiovascular disease is associated with epigenomic changes in the heart; however, the endogenous structure of cardiac myocyte chromatin has never been determined. Methods: To investigate the mechanisms of epigenomic function in the heart, genome-wide chromatin conformation capture (Hi-C) and DNA sequencing were performed in adult cardiac myocytes following development of pressure overload–induced hypertrophy. Mice with cardiac-specific deletion of CTCF (a ubiquitous chromatin structural protein) were generated to explore the role of this protein in chromatin structure and cardiac phenotype. Transcriptome analyses by RNA-seq were conducted as a functional readout of the epigenomic structural changes. Results: Depletion of CTCF was sufficient to induce heart failure in mice, and human patients with heart failure receiving mechanical unloading via left ventricular assist devices show increased CTCF abundance. Chromatin structural analyses revealed interactions within the cardiac myocyte genome at 5-kb resolution, enabling examination of intra- and interchromosomal events, and providing a resource for future cardiac epigenomic investigations. Pressure overload or CTCF depletion selectively altered boundary strength between topologically associating domains and A/B compartmentalization, measurements of genome accessibility. Heart failure involved decreased stability of chromatin interactions around disease-causing genes. In addition, pressure overload or CTCF depletion remodeled long-range interactions of cardiac enhancers, resulting in a significant decrease in local chromatin interactions around these functional elements. Conclusions: These findings provide a high-resolution chromatin architecture resource for cardiac epigenomic investigations and demonstrate that global structural remodeling of chromatin underpins heart failure. The newly identified principles of endogenous chromatin structure have key implications for epigenetic therapy. PMID:28802249
Rosa-Garrido, Manuel; Chapski, Douglas J; Schmitt, Anthony D; Kimball, Todd H; Karbassi, Elaheh; Monte, Emma; Balderas, Enrique; Pellegrini, Matteo; Shih, Tsai-Ting; Soehalim, Elizabeth; Liem, David; Ping, Peipei; Galjart, Niels J; Ren, Shuxun; Wang, Yibin; Ren, Bing; Vondriska, Thomas M
2017-10-24
Cardiovascular disease is associated with epigenomic changes in the heart; however, the endogenous structure of cardiac myocyte chromatin has never been determined. To investigate the mechanisms of epigenomic function in the heart, genome-wide chromatin conformation capture (Hi-C) and DNA sequencing were performed in adult cardiac myocytes following development of pressure overload-induced hypertrophy. Mice with cardiac-specific deletion of CTCF (a ubiquitous chromatin structural protein) were generated to explore the role of this protein in chromatin structure and cardiac phenotype. Transcriptome analyses by RNA-seq were conducted as a functional readout of the epigenomic structural changes. Depletion of CTCF was sufficient to induce heart failure in mice, and human patients with heart failure receiving mechanical unloading via left ventricular assist devices show increased CTCF abundance. Chromatin structural analyses revealed interactions within the cardiac myocyte genome at 5-kb resolution, enabling examination of intra- and interchromosomal events, and providing a resource for future cardiac epigenomic investigations. Pressure overload or CTCF depletion selectively altered boundary strength between topologically associating domains and A/B compartmentalization, measurements of genome accessibility. Heart failure involved decreased stability of chromatin interactions around disease-causing genes. In addition, pressure overload or CTCF depletion remodeled long-range interactions of cardiac enhancers, resulting in a significant decrease in local chromatin interactions around these functional elements. These findings provide a high-resolution chromatin architecture resource for cardiac epigenomic investigations and demonstrate that global structural remodeling of chromatin underpins heart failure. The newly identified principles of endogenous chromatin structure have key implications for epigenetic therapy. © 2017 The Authors.
Hendre, Prasad S.; Aggarwal, Ramesh K.
2014-01-01
Coffee breeding and improvement efforts can be greatly facilitated by availability of a large repository of simple sequence repeats (SSRs) based microsatellite markers, which provides efficiency and high-resolution in genetic analyses. This study was aimed to improve SSR availability in coffee by developing new genic−/genomic-SSR markers using in-silico bioinformatics and streptavidin-biotin based enrichment approach, respectively. The expressed sequence tag (EST) based genic microsatellite markers (EST-SSRs) were developed using the publicly available dataset of 13,175 unigene ESTs, which showed a distribution of 1 SSR/3.4 kb of coffee transcriptome. Genomic SSRs, on the other hand, were developed from an SSR-enriched small-insert partial genomic library of robusta coffee. In total, 69 new SSRs (44 EST-SSRs and 25 genomic SSRs) were developed and validated as suitable genetic markers. Diversity analysis of selected coffee genotypes revealed these to be highly informative in terms of allelic diversity and PIC values, and eighteen of these markers (∼27%) could be mapped on a robusta linkage map. Notably, the markers described here also revealed a very high cross-species transferability. In addition to the validated markers, we have also designed primer pairs for 270 putative EST-SSRs, which are expected to provide another ca. 200 useful genetic markers considering the high success rate (88%) of marker conversion of similar pairs tested/validated in this study. PMID:25461752
A developmental transcriptome map for allotetraploid arachis hypogaea
USDA-ARS?s Scientific Manuscript database
The advent of the genome sequences of Arachis duranensis and Arachis ipaensis has ushered in a new era for peanut genomics. With the goal of producing a gene atlas for cultivated peanut (Arachis hypogaea), 22 different tissue types and ontogenies that represent the full development of peanut were s...
Whole-genome expression analysis of mammalian-wide interspersed repeat elements in human cell lines.
Carnevali, Davide; Conti, Anastasia; Pellegrini, Matteo; Dieci, Giorgio
2017-02-01
With more than 500,000 copies, mammalian-wide interspersed repeats (MIRs), a sub-group of SINEs, represent ∼2.5% of the human genome and one of the most numerous family of potential targets for the RNA polymerase (Pol) III transcription machinery. Since MIR elements ceased to amplify ∼130 myr ago, previous studies primarily focused on their genomic impact, while the issue of their expression has not been extensively addressed. We applied a dedicated bioinformatic pipeline to ENCODE RNA-Seq datasets of seven human cell lines and, for the first time, we were able to define the Pol III-driven MIR transcriptome at single-locus resolution. While the majority of Pol III-transcribed MIR elements are cell-specific, we discovered a small set of ubiquitously transcribed MIRs mapping within Pol II-transcribed genes in antisense orientation that could influence the expression of the overlapping gene. We also identified novel Pol III-transcribed ncRNAs, deriving from transcription of annotated MIR fragments flanked by unique MIR-unrelated sequences, and confirmed the role of Pol III-specific internal promoter elements in MIR transcription. Besides demonstrating widespread transcription at these retrotranspositionally inactive elements in human cells, the ability to profile MIR expression at single-locus resolution will facilitate their study in different cell types and states including pathological alterations. © The Author 2016. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.
An atlas of high-resolution IRAS maps on nearby galaxies
NASA Technical Reports Server (NTRS)
Rice, Walter
1993-01-01
An atlas of far-infrared IRAS maps with near 1 arcmin angular resolution of 30 optically large galaxies is presented. The high-resolution IRAS maps were produced with the Maximum Correlation Method (MCM) image construction and enhancement technique developed at IPAC. The MCM technique, which recovers the spatial information contained in the overlapping detector data samples of the IRAS all-sky survey scans, is outlined and tests to verify the structural reliability and photometric integrity of the high-resolution maps are presented. The infrared structure revealed in individual galaxies is discussed. The atlas complements the IRAS Nearby Galaxy High-Resolution Image Atlas, the high-resolution galaxy images encoded in FITS format, which is provided to the astronomical community as an IPAC product.
Next generation sequencing applications for microRNA biomarker discovery in toxicological studies
Next Generation Sequencing (NGS) technology will be reviewed for its base pair resolution, wide dynamic range, and insights into the genome and transcriptome, with special focus upon the biomarker potential of microRNAs (miRNAs). The first part of this presentation reviews commo...
A high-resolution cattle CNV map by population-scale genome sequencing
USDA-ARS?s Scientific Manuscript database
Copy Number Variations (CNVs) are common genomic structural variations that have been linked to human diseases and phenotypic traits. Prior studies in cattle have produced low-resolution CNV maps. We constructed a draft, high-resolution map of cattle CNVs based on whole genome sequencing data from 7...
[Prospects of molecular breeding in medical plants].
Ma, Xiao-Jun; Mo, Chang-Ming
2017-06-01
The molecular-assisted breeding, transgenic breeding and molecular designing breeding are three development directions of plant molecular breeding. Base on these three development directions, this paper summarizes developing status and new tendency of research field of genetic linkage mapping, QTL mapping, association mapping, molecular-assisted selections, pollen-mediated transformations, agrobacterium-mediated transformations, particle gun-mediated transformations, genome editing technologies, whole-genome sequencing, transcriptome sequencing, proteome sequencing and varietal molecular designing. The objective and existing problem of medical plant molecular breeding were discussed the prospect of these three molecular breeding technologies application on medical plant molecular breeding was outlooked. Copyright© by the Chinese Pharmaceutical Association.
Validation of Satellite Snow Cover Maps in North America and Norway
NASA Technical Reports Server (NTRS)
Hall, Dorothy K.; Solberg, Rune; Riggs, George A.
2002-01-01
Satellite-derived snow maps from NASA's Earth Observing System Moderate Resolution Imaging Spectroradiometer (MODIS) have been produced since February of 2000. The global maps are available daily at 500-m resolution, and at a climate-modeling grid (CMG) resolution of 1/20 deg (approximately 5.6 km). We compared the 8-day composite CMG MODIS-derived global maps from November 1,2001, through March 21,2002, and daily CMG maps from February 26 - March 5,2002, with National Oceanic and Atmospheric Administration (NOAA) Interactive Multisensor Snow and Ice Mapping System (IMS) 25-km resolution maps for North America. For the Norwegian study area, national snow maps, based on synoptic measurements as well as visual interpretation of AVHRR images, published by the Det Norske Meteorologiske Institutt (Norwegian Meteorological Institute) (MI) maps, as well as Landsat ETM+ images were compared with the MODIS maps. The MODIS-derived maps agreed over most areas with the IMS or MI maps, however, there are important areas of disagreement between the maps, especially when the 8-day composite maps were used. It is concluded that MODIS daily CMG maps should be studied for validation purposes rather than the 8-day composite maps, despite the limitations imposed by cloud obscuration when using the daily maps.
Peterson, Elena S; McCue, Lee Ann; Schrimpe-Rutledge, Alexandra C; Jensen, Jeffrey L; Walker, Hyunjoo; Kobold, Markus A; Webb, Samantha R; Payne, Samuel H; Ansong, Charles; Adkins, Joshua N; Cannon, William R; Webb-Robertson, Bobbie-Jo M
2012-04-05
The procedural aspects of genome sequencing and assembly have become relatively inexpensive, yet the full, accurate structural annotation of these genomes remains a challenge. Next-generation sequencing transcriptomics (RNA-Seq), global microarrays, and tandem mass spectrometry (MS/MS)-based proteomics have demonstrated immense value to genome curators as individual sources of information, however, integrating these data types to validate and improve structural annotation remains a major challenge. Current visual and statistical analytic tools are focused on a single data type, or existing software tools are retrofitted to analyze new data forms. We present Visual Exploration and Statistics to Promote Annotation (VESPA) is a new interactive visual analysis software tool focused on assisting scientists with the annotation of prokaryotic genomes though the integration of proteomics and transcriptomics data with current genome location coordinates. VESPA is a desktop Java™ application that integrates high-throughput proteomics data (peptide-centric) and transcriptomics (probe or RNA-Seq) data into a genomic context, all of which can be visualized at three levels of genomic resolution. Data is interrogated via searches linked to the genome visualizations to find regions with high likelihood of mis-annotation. Search results are linked to exports for further validation outside of VESPA or potential coding-regions can be analyzed concurrently with the software through interaction with BLAST. VESPA is demonstrated on two use cases (Yersinia pestis Pestoides F and Synechococcus sp. PCC 7002) to demonstrate the rapid manner in which mis-annotations can be found and explored in VESPA using either proteomics data alone, or in combination with transcriptomic data. VESPA is an interactive visual analytics tool that integrates high-throughput data into a genomic context to facilitate the discovery of structural mis-annotations in prokaryotic genomes. Data is evaluated via visual analysis across multiple levels of genomic resolution, linked searches and interaction with existing bioinformatics tools. We highlight the novel functionality of VESPA and core programming requirements for visualization of these large heterogeneous datasets for a client-side application. The software is freely available at https://www.biopilot.org/docs/Software/Vespa.php.
2012-01-01
Background The procedural aspects of genome sequencing and assembly have become relatively inexpensive, yet the full, accurate structural annotation of these genomes remains a challenge. Next-generation sequencing transcriptomics (RNA-Seq), global microarrays, and tandem mass spectrometry (MS/MS)-based proteomics have demonstrated immense value to genome curators as individual sources of information, however, integrating these data types to validate and improve structural annotation remains a major challenge. Current visual and statistical analytic tools are focused on a single data type, or existing software tools are retrofitted to analyze new data forms. We present Visual Exploration and Statistics to Promote Annotation (VESPA) is a new interactive visual analysis software tool focused on assisting scientists with the annotation of prokaryotic genomes though the integration of proteomics and transcriptomics data with current genome location coordinates. Results VESPA is a desktop Java™ application that integrates high-throughput proteomics data (peptide-centric) and transcriptomics (probe or RNA-Seq) data into a genomic context, all of which can be visualized at three levels of genomic resolution. Data is interrogated via searches linked to the genome visualizations to find regions with high likelihood of mis-annotation. Search results are linked to exports for further validation outside of VESPA or potential coding-regions can be analyzed concurrently with the software through interaction with BLAST. VESPA is demonstrated on two use cases (Yersinia pestis Pestoides F and Synechococcus sp. PCC 7002) to demonstrate the rapid manner in which mis-annotations can be found and explored in VESPA using either proteomics data alone, or in combination with transcriptomic data. Conclusions VESPA is an interactive visual analytics tool that integrates high-throughput data into a genomic context to facilitate the discovery of structural mis-annotations in prokaryotic genomes. Data is evaluated via visual analysis across multiple levels of genomic resolution, linked searches and interaction with existing bioinformatics tools. We highlight the novel functionality of VESPA and core programming requirements for visualization of these large heterogeneous datasets for a client-side application. The software is freely available at https://www.biopilot.org/docs/Software/Vespa.php. PMID:22480257
2012-01-01
Background Cucurbita pepo is a member of the Cucurbitaceae family, the second- most important horticultural family in terms of economic importance after Solanaceae. The "summer squash" types, including Zucchini and Scallop, rank among the highest-valued vegetables worldwide. There are few genomic tools available for this species. The first Cucurbita transcriptome, along with a large collection of Single Nucleotide Polymorphisms (SNP), was recently generated using massive sequencing. A set of 384 SNP was selected to generate an Illumina GoldenGate assay in order to construct the first SNP-based genetic map of Cucurbita and map quantitative trait loci (QTL). Results We herein present the construction of the first SNP-based genetic map of Cucurbita pepo using a population derived from the cross of two varieties with contrasting phenotypes, representing the main cultivar groups of the species' two subspecies: Zucchini (subsp. pepo) × Scallop (subsp. ovifera). The mapping population was genotyped with 384 SNP, a set of selected EST-SNP identified in silico after massive sequencing of the transcriptomes of both parents, using the Illumina GoldenGate platform. The global success rate of the assay was higher than 85%. In total, 304 SNP were mapped, along with 11 SSR from a previous map, giving a map density of 5.56 cM/marker. This map was used to infer syntenic relationships between C. pepo and cucumber and to successfully map QTL that control plant, flowering and fruit traits that are of benefit to squash breeding. The QTL effects were validated in backcross populations. Conclusion Our results show that massive sequencing in different genotypes is an excellent tool for SNP discovery, and that the Illumina GoldenGate platform can be successfully applied to constructing genetic maps and performing QTL analysis in Cucurbita. This is the first SNP-based genetic map in the Cucurbita genus and is an invaluable new tool for biological research, especially considering that most of these markers are located in the coding regions of genes involved in different physiological processes. The platform will also be useful for future mapping and diversity studies, and will be essential in order to accelerate the process of breeding new and better-adapted squash varieties. PMID:22356647
Miller, Charlotte N; Harper, Andrea L; Trick, Martin; Werner, Peter; Waldron, Keith; Bancroft, Ian
2016-07-16
The current approach to reducing the tendency for wheat grown under high fertilizer conditions to collapse (lodge) under the weight of its grain is based on reducing stem height via the introduction of Rht genes. However, these reduce the yield of straw (itself an important commodity) and introduce other undesirable characteristics. Identification of alternative height-control loci is therefore of key interest. In addition, the improvement of stem mechanical strength provides a further way through which lodging can be reduced. To investigate the prospects for genetic alternatives to Rht, we assessed variation for plant height and stem strength properties in a training genetic diversity panel of 100 wheat accessions fixed for Rht. Using mRNAseq data derived from RNA purified from leaves, functional genotypes were developed for the panel comprising 42,066 Single Nucleotide Polymorphism (SNP) markers and 94,060 Gene Expression Markers (GEMs). In the first application in wheat of the recently-developed method of Associative Transcriptomics, we identified associations between trait variation and both SNPs and GEMs. Analysis of marker-trait associations revealed candidates for the causative genes underlying the trait variation, implicating xylan acetylation and the COP9 signalosome as contributing to stem strength and auxin in the control of the observed variation for plant height. Predictive capabilities of key markers for stem strength were validated using a test genetic diversity panel of 30 further wheat accessions. This work illustrates the power of Associative Transcriptomics for the exploration of complex traits of high agronomic importance in wheat. The careful selection of genotypes included in the analysis, allowed for high resolution mapping of novel trait-controlling loci in this staple crop. The use of Gene Expression markers coupled with the more traditional sequence-based markers, provides the power required to understand the biological context of the marker-trait associations observed. This not only adds to the wealth of knowledge that we strive to accumulate regarding gene function and plant adaptation, but also provides breeders with the information required to make more informed decisions regarding the potential consequences of incorporating the use of particular markers into future breeding programmes.
2017-01-01
Mapping gene expression as a quantitative trait using whole genome-sequencing and transcriptome analysis allows to discover the functional consequences of genetic variation. We developed a novel method and ultra-fast software Findr for higly accurate causal inference between gene expression traits using cis-regulatory DNA variations as causal anchors, which improves current methods by taking into consideration hidden confounders and weak regulations. Findr outperformed existing methods on the DREAM5 Systems Genetics challenge and on the prediction of microRNA and transcription factor targets in human lymphoblastoid cells, while being nearly a million times faster. Findr is publicly available at https://github.com/lingfeiwang/findr. PMID:28821014
Pal, Tarun; Malhotra, Nikhil; Chanumolu, Sree Krishna; Chauhan, Rajinder Singh
2015-07-01
The transcriptomes of Aconitum heterophyllum were assembled and characterized for the first time to decipher molecular components contributing to biosynthesis and accumulation of metabolites in tuberous roots. Aconitum heterophyllum Wall., popularly known as Atis, is a high-value medicinal herb of North-Western Himalayas. No information exists as of today on genetic factors contributing to the biosynthesis of secondary metabolites accumulating in tuberous roots, thereby, limiting genetic interventions towards genetic improvement of A. heterophyllum. Illumina paired-end sequencing followed by de novo assembly yielded 75,548 transcripts for root transcriptome and 39,100 transcripts for shoot transcriptome with minimum length of 200 bp. Biological role analysis of root versus shoot transcriptomes assigned 27,596 and 16,604 root transcripts; 12,340 and 9398 shoot transcripts into gene ontology and clusters of orthologous group, respectively. KEGG pathway mapping assigned 37 and 31 transcripts onto starch-sucrose metabolism while 329 and 341 KEGG orthologies associated with transcripts were found to be involved in biosynthesis of various secondary metabolites for root and shoot transcriptomes, respectively. In silico expression profiling of the mevalonate/2-C-methyl-D-erythritol 4-phosphate (non-mevalonate) pathway genes for aconites biosynthesis revealed 4 genes HMGR (3-hydroxy-3-methylglutaryl-CoA reductase), MVK (mevalonate kinase), MVDD (mevalonate diphosphate decarboxylase) and HDS (1-hydroxy-2-methyl-2-(E)-butenyl 4-diphosphate synthase) with higher expression in root transcriptome compared to shoot transcriptome suggesting their key role in biosynthesis of aconite alkaloids. Five genes, GMPase (geranyl diphosphate mannose pyrophosphorylase), SHAGGY, RBX1 (RING-box protein 1), SRF receptor kinases and β-amylase, implicated in tuberous root formation in other plant species showed higher levels of expression in tuberous roots compared to shoots. A total of 15,487 transcription factors belonging to bHLH, MYB, bZIP families and 399 ABC transporters which regulate biosynthesis and accumulation of bioactive compounds were identified in root and shoot transcriptomes. The expression of 5 ABC transporters involved in tuberous root development was validated by quantitative PCR analysis. Network connectivity diagrams were drawn for starch-sucrose metabolism and isoquinoline alkaloid biosynthesis associated with tuberous root growth and secondary metabolism, respectively, in root transcriptome of A. heterophyllum. The current endeavor will be of practical importance in planning a suitable genetic intervention strategy for the improvement of A. heterophyllum.
Distinct contributions of replication and transcription to mutation rate variation of human genomes.
Cui, Peng; Ding, Feng; Lin, Qiang; Zhang, Lingfang; Li, Ang; Zhang, Zhang; Hu, Songnian; Yu, Jun
2012-02-01
Here, we evaluate the contribution of two major biological processes--DNA replication and transcription--to mutation rate variation in human genomes. Based on analysis of the public human tissue transcriptomics data, high-resolution replicating map of Hela cells and dbSNP data, we present significant correlations between expression breadth, replication time in local regions and SNP density. SNP density of tissue-specific (TS) genes is significantly higher than that of housekeeping (HK) genes. TS genes tend to locate in late-replicating genomic regions and genes in such regions have a higher SNP density compared to those in early-replication regions. In addition, SNP density is found to be positively correlated with expression level among HK genes. We conclude that the process of DNA replication generates stronger mutational pressure than transcription-associated biological processes do, resulting in an increase of mutation rate in TS genes while having weaker effects on HK genes. In contrast, transcription-associated processes are mainly responsible for the accumulation of mutations in highly-expressed HK genes. Copyright © 2012 Beijing Genomics Institute. Published by Elsevier Ltd. All rights reserved.
NREL: International Activities - Pakistan Resource Maps
. The high-resolution (1-km) annual wind power maps were developed using a numerical modeling approach along with NREL's empirical validation methodology. The high-resolution (10-km) annual and seasonal KB) | High-Res (ZIP 281 KB) 40-km Resolution Annual Maps (Direct) Low-Res (JPG 156 KB) | High-Res
Estimating the resolution limit of the map equation in community detection
NASA Astrophysics Data System (ADS)
Kawamoto, Tatsuro; Rosvall, Martin
2015-01-01
A community detection algorithm is considered to have a resolution limit if the scale of the smallest modules that can be resolved depends on the size of the analyzed subnetwork. The resolution limit is known to prevent some community detection algorithms from accurately identifying the modular structure of a network. In fact, any global objective function for measuring the quality of a two-level assignment of nodes into modules must have some sort of resolution limit or an external resolution parameter. However, it is yet unknown how the resolution limit affects the so-called map equation, which is known to be an efficient objective function for community detection. We derive an analytical estimate and conclude that the resolution limit of the map equation is set by the total number of links between modules instead of the total number of links in the full network as for modularity. This mechanism makes the resolution limit much less restrictive for the map equation than for modularity; in practice, it is orders of magnitudes smaller. Furthermore, we argue that the effect of the resolution limit often results from shoehorning multilevel modular structures into two-level descriptions. As we show, the hierarchical map equation effectively eliminates the resolution limit for networks with nested multilevel modular structures.
Heifetz, Eliyahu M; Soller, Morris
2015-07-07
High-resolution mapping of the loci (QTN) responsible for genetic variation in quantitative traits is essential for positional cloning of candidate genes, and for effective marker assisted selection. The confidence interval (QTL) flanking the point estimate of QTN-location is proportional to the number of individuals in the mapping population carrying chromosomes recombinant in the given interval. Consequently, many designs for high resolution QTN mapping are based on increasing the proportion of recombinants in the mapping population. The "Targeted Recombinant Progeny" (TRP) design is a new design for high resolution mapping of a target QTN in crosses between pure, or inbred lines. It is a three-generation procedure generating a large number of recombinant individuals within a QTL previously shown to contain a QTN. This is achieved by having individuals that carry chromosomes recombinant across the target QTL interval as parents of a large mapping population; most of whom will therefore carry recombinant chromosomes targeted to the given QTL. The TRP design is particularly useful for high resolution mapping of QTN that differentiate inbred or pure lines, and hence are not amenable to high resolution mapping by genome-wide association tests. In the absence of residual polygenic variation, population sizes required for achieving given mapping resolution by the TRP-F2 design relative to a standard F2 design ranged from 0.289 for a QTN with standardized allele substitution effect = 0.2, mapped to an initial QTL of 0.2 Morgan to 0.041 for equivalent QTN mapped to an initial QTL of 0.02 M. In the presence of residual polygenic variation, the relative effectiveness of the TRP design ranges from 1.068 to 0.151 for the same initial QTL intervals and QTN effect. Thus even in the presence of polygenic variation, the TRP can still provide major savings. Simulation showed that mapping by TRP should be based on 30-50 markers spanning the initial interval; and on at least 50 or more G2 families representing this number of recombination points,. The TRP design can be an effective procedure for achieving high and ultra-high mapping resolution of a target QTN previously mapped to a known confidence interval (QTL).
Computational methods for constructing protein structure models from 3D electron microscopy maps.
Esquivel-Rodríguez, Juan; Kihara, Daisuke
2013-10-01
Protein structure determination by cryo-electron microscopy (EM) has made significant progress in the past decades. Resolutions of EM maps have been improving as evidenced by recently reported structures that are solved at high resolutions close to 3Å. Computational methods play a key role in interpreting EM data. Among many computational procedures applied to an EM map to obtain protein structure information, in this article we focus on reviewing computational methods that model protein three-dimensional (3D) structures from a 3D EM density map that is constructed from two-dimensional (2D) maps. The computational methods we discuss range from de novo methods, which identify structural elements in an EM map, to structure fitting methods, where known high resolution structures are fit into a low-resolution EM map. A list of available computational tools is also provided. Copyright © 2013 Elsevier Inc. All rights reserved.
Comparison of next generation sequencing technologies for transcriptome characterization
2009-01-01
Background We have developed a simulation approach to help determine the optimal mixture of sequencing methods for most complete and cost effective transcriptome sequencing. We compared simulation results for traditional capillary sequencing with "Next Generation" (NG) ultra high-throughput technologies. The simulation model was parameterized using mappings of 130,000 cDNA sequence reads to the Arabidopsis genome (NCBI Accession SRA008180.19). We also generated 454-GS20 sequences and de novo assemblies for the basal eudicot California poppy (Eschscholzia californica) and the magnoliid avocado (Persea americana) using a variety of methods for cDNA synthesis. Results The Arabidopsis reads tagged more than 15,000 genes, including new splice variants and extended UTR regions. Of the total 134,791 reads (13.8 MB), 119,518 (88.7%) mapped exactly to known exons, while 1,117 (0.8%) mapped to introns, 11,524 (8.6%) spanned annotated intron/exon boundaries, and 3,066 (2.3%) extended beyond the end of annotated UTRs. Sequence-based inference of relative gene expression levels correlated significantly with microarray data. As expected, NG sequencing of normalized libraries tagged more genes than non-normalized libraries, although non-normalized libraries yielded more full-length cDNA sequences. The Arabidopsis data were used to simulate additional rounds of NG and traditional EST sequencing, and various combinations of each. Our simulations suggest a combination of FLX and Solexa sequencing for optimal transcriptome coverage at modest cost. We have also developed ESTcalc http://fgp.huck.psu.edu/NG_Sims/ngsim.pl, an online webtool, which allows users to explore the results of this study by specifying individualized costs and sequencing characteristics. Conclusion NG sequencing technologies are a highly flexible set of platforms that can be scaled to suit different project goals. In terms of sequence coverage alone, the NG sequencing is a dramatic advance over capillary-based sequencing, but NG sequencing also presents significant challenges in assembly and sequence accuracy due to short read lengths, method-specific sequencing errors, and the absence of physical clones. These problems may be overcome by hybrid sequencing strategies using a mixture of sequencing methodologies, by new assemblers, and by sequencing more deeply. Sequencing and microarray outcomes from multiple experiments suggest that our simulator will be useful for guiding NG transcriptome sequencing projects in a wide range of organisms. PMID:19646272
Bernknopf, R.L.; Wein, A.M.; St-Onge, M. R.; Lucas, S.B.
2007-01-01
This bulletin/professional paper focuses on the value of geoscientific information and knowledge, as provided in published government bedrock geological maps, to the mineral exploration sector. An economic model is developed that uses an attribute- ranking approach to convert geological maps into domains of mineral favourability. Information about known deposits in these (or analogous) favourability domains allow the calculation of exploration search statistics that provide input into measures of exploration efficiency, productivity, effectiveness, risk, and cost stemming from the use of the published geological maps. Two case studies, the Flin Flon Belt (Manitoba and Saskatchewan) and the south Baffin Island area (Nunavut), demonstrate that updated, finer resolution maps can be used to identify more exploration campaign options, and campaigns thats are more efficient, more effective, and less risky than old, coarser resolution maps when used as a guide for mineral exploration. The Flin Flon Belt study illustrates that an updated, coarser resolution bedrock map enables improved mineral exploration efficiency, productivity, and effectiveness by locating 60% more targets and supporting an exploration campaign that is 44% more efficient. Refining the map resolution provides an additional 17% reduction in search effort across all favourable domains and a 55% reduction in search effort in the most favourable domain. The south Baffin Island case study projects a 40% increase in expected targets and a 27% reduction in search effort when the updated, finer resolution map is used in lieu of the old, coarser resolution map. On southern Baffin Island, the economic value of the up dated map ranges from CAN$2.28 million to CAN$15.21 million, which can be compared to the CAN$1.86 million that it cost to produce the map (a multiplier effect of up to eight).
Towards the Optimal Pixel Size of dem for Automatic Mapping of Landslide Areas
NASA Astrophysics Data System (ADS)
Pawłuszek, K.; Borkowski, A.; Tarolli, P.
2017-05-01
Determining appropriate spatial resolution of digital elevation model (DEM) is a key step for effective landslide analysis based on remote sensing data. Several studies demonstrated that choosing the finest DEM resolution is not always the best solution. Various DEM resolutions can be applicable for diverse landslide applications. Thus, this study aims to assess the influence of special resolution on automatic landslide mapping. Pixel-based approach using parametric and non-parametric classification methods, namely feed forward neural network (FFNN) and maximum likelihood classification (ML), were applied in this study. Additionally, this allowed to determine the impact of used classification method for selection of DEM resolution. Landslide affected areas were mapped based on four DEMs generated at 1 m, 2 m, 5 m and 10 m spatial resolution from airborne laser scanning (ALS) data. The performance of the landslide mapping was then evaluated by applying landslide inventory map and computation of confusion matrix. The results of this study suggests that the finest scale of DEM is not always the best fit, however working at 1 m DEM resolution on micro-topography scale, can show different results. The best performance was found at 5 m DEM-resolution for FFNN and 1 m DEM resolution for results. The best performance was found to be using 5 m DEM-resolution for FFNN and 1 m DEM resolution for ML classification.
Carmona, Rosario; Zafra, Adoración; Seoane, Pedro; Castro, Antonio J.; Guerrero-Fernández, Darío; Castillo-Castillo, Trinidad; Medina-García, Ana; Cánovas, Francisco M.; Aldana-Montes, José F.; Navas-Delgado, Ismael; Alché, Juan de Dios; Claros, M. Gonzalo
2015-01-01
Plant reproductive transcriptomes have been analyzed in different species due to the agronomical and biotechnological importance of plant reproduction. Here we presented an olive tree reproductive transcriptome database with samples from pollen and pistil at different developmental stages, and leaf and root as control vegetative tissues http://reprolive.eez.csic.es). It was developed from 2,077,309 raw reads to 1,549 Sanger sequences. Using a pre-defined workflow based on open-source tools, sequences were pre-processed, assembled, mapped, and annotated with expression data, descriptions, GO terms, InterPro signatures, EC numbers, KEGG pathways, ORFs, and SSRs. Tentative transcripts (TTs) were also annotated with the corresponding orthologs in Arabidopsis thaliana from TAIR and RefSeq databases to enable Linked Data integration. It results in a reproductive transcriptome comprising 72,846 contigs with average length of 686 bp, of which 63,965 (87.8%) included at least one functional annotation, and 55,356 (75.9%) had an ortholog. A minimum of 23,568 different TTs was identified and 5,835 of them contain a complete ORF. The representative reproductive transcriptome can be reduced to 28,972 TTs for further gene expression studies. Partial transcriptomes from pollen, pistil, and vegetative tissues as control were also constructed. ReprOlive provides free access and download capability to these results. Retrieval mechanisms for sequences and transcript annotations are provided. Graphical localization of annotated enzymes into KEGG pathways is also possible. Finally, ReprOlive has included a semantic conceptualisation by means of a Resource Description Framework (RDF) allowing a Linked Data search for extracting the most updated information related to enzymes, interactions, allergens, structures, and reactive oxygen species. PMID:26322066
Luo, Hui; Xiao, Shijun; Ye, Hua; Zhang, Zhengshi; Lv, Changhuan; Zheng, Shuming; Wang, Zhiyong; Wang, Xiaoqing
2016-01-01
Schizothorax prenanti (S. prenanti) is mainly distributed in the upstream regions of the Yangtze River and its tributaries in China. This species is indigenous and commercially important. However, in recent years, wild populations and aquacultures have faced the serious challenges of germplasm variation loss and an increased susceptibility to a range of pathogens. Currently, the genetics and immune mechanisms of S. prenanti are unknown, partly due to a lack of genome and transcriptome information. Here, we sought to identify genes related to immune functions and to identify molecular markers to study the function of these genes and for trait mapping. To this end, the transcriptome from spleen tissues of S. prenanti was analyzed and sequenced. Using paired-end reads from the Illumina Hiseq2500 platform, 48,517 transcripts were isolated from the spleen transcriptome. These transcripts could be clustered into 37,785 unigenes with an N50 length of 2,539 bp. The majority of the unigenes (35,653, 94.4%) were successfully annotated using non-redundant nucleotide sequence analysis (nt), and the non-redundant protein (nr), Swiss-Prot, Gene Ontology (GO), and Kyoto Encyclopedia of Genes and Genomes (KEGG) databases. KEGG pathway assignment identified more than 500 immune-related genes. Furthermore, 7,545 putative simple sequence repeats (SSRs), 857,535 single nucleotide polymorphisms (SNPs), and 53,481 insertion/deletion (InDels) were detected from the transcriptome. This is the first reported high-throughput transcriptome analysis of S. prenanti, and it provides valuable genetic resources for the investigation of immune mechanisms, conservation of germplasm, and molecular marker-assisted breeding of S. prenanti.
A large-scale full-length cDNA analysis to explore the budding yeast transcriptome
Miura, Fumihito; Kawaguchi, Noriko; Sese, Jun; Toyoda, Atsushi; Hattori, Masahira; Morishita, Shinichi; Ito, Takashi
2006-01-01
We performed a large-scale cDNA analysis to explore the transcriptome of the budding yeast Saccharomyces cerevisiae. We sequenced two cDNA libraries, one from the cells exponentially growing in a minimal medium and the other from meiotic cells. Both libraries were generated by using a vector-capping method that allows the accurate mapping of transcription start sites (TSSs). Consequently, we identified 11,575 TSSs associated with 3,638 annotated genomic features, including 3,599 ORFs, to suggest that most yeast genes have two or more TSSs. In addition, we identified 45 previously undescribed introns, including those affecting current ORF annotations and those spliced alternatively. Furthermore, the analysis revealed 667 transcription units in the intergenic regions and transcripts derived from antisense strands of 367 known features. We also found that 348 ORFs carry TSSs in their 3′-halves to generate sense transcripts starting from inside the ORFs. These results indicate that the budding yeast transcriptome is considerably more complex than previously thought, and it shares many recently revealed characteristics with the transcriptomes of mammals and other higher eukaryotes. Thus, the genome-wide active transcription that generates novel classes of transcripts appears to be an intrinsic feature of the eukaryotic cells. The budding yeast will serve as a versatile model for the studies on these aspects of transcriptome, and the full-length cDNA clones can function as an invaluable resource in such studies. PMID:17101987
Duan, Xinle; Wang, Kang; Su, Sha; Tian, Ruizheng; Li, Yuting; Chen, Maohua
2017-01-01
The bird cherry-oat aphid, Rhopalosiphum padi (L.), is one of the most abundant aphid pests of cereals and has a global distribution. Next-generation sequencing (NGS) is a rapid and efficient method for developing molecular markers. However, transcriptomic and genomic resources of R. padi have not been investigated. In this study, we used transcriptome information obtained by RNA-Seq to develop polymorphic microsatellites for investigating population genetics in this species. The transcriptome of R. padi was sequenced on an Illumina HiSeq 2000 platform. A total of 114.4 million raw reads with a GC content of 40.03% was generated. The raw reads were cleaned and assembled into 29,467 unigenes with an N50 length of 1,580 bp. Using several public databases, 82.47% of these unigenes were annotated. Of the annotated unigenes, 8,022 were assigned to COG pathways, 9,895 were assigned to GO pathways, and 14,586 were mapped to 257 KEGG pathways. A total of 7,936 potential microsatellites were identified in 5,564 unigenes, 60 of which were selected randomly and amplified using specific primer pairs. Fourteen loci were found to be polymorphic in the four R. padi populations. The transcriptomic data presented herein will facilitate gene discovery, gene analyses, and development of molecular markers for future studies of R. padi and other closely related aphid species.
Kopf, Matthias; Klähn, Stephan; Scholz, Ingeborg; Hess, Wolfgang R.; Voß, Björn
2015-01-01
In all studied organisms, a substantial portion of the transcriptome consists of non-coding RNAs that frequently execute regulatory functions. Here, we have compared the primary transcriptomes of the cyanobacteria Synechocystis sp. PCC 6714 and PCC 6803 under 10 different conditions. These strains share 2854 protein-coding genes and a 16S rRNA identity of 99.4%, indicating their close relatedness. Conserved major transcriptional start sites (TSSs) give rise to non-coding transcripts within the sigB gene, from the 5′UTRs of cmpA and isiA, and 168 loci in antisense orientation. Distinct differences include single nucleotide polymorphisms rendering promoters inactive in one of the strains, e.g., for cmpR and for the asRNA PsbA2R. Based on the genome-wide mapped location, regulation and classification of TSSs, non-coding transcripts were identified as the most dynamic component of the transcriptome. We identified a class of mRNAs that originate by read-through from an sRNA that accumulates as a discrete and abundant transcript while also serving as the 5′UTR. Such an sRNA/mRNA structure, which we name ‘actuaton’, represents another way for bacteria to remodel their transcriptional network. Our findings support the hypothesis that variations in the non-coding transcriptome constitute a major evolutionary element of inter-strain divergence and capability for physiological adaptation. PMID:25902393
USDA-ARS?s Scientific Manuscript database
The development of resources for genomic studies in Mangifera indica (mango) will allow marker-assisted selection and identification of genetically diverse germplasm, greatly aiding mango breeding programs. We report here a first step in developing such resources, our identification of thousands una...
Leigh Hawkins; Marilyn Warburton; Juliet Tang; John Tomashek; Dafne Alves Oliveira; Oluwaseun Ogunola; J. Smith; W. Williams
2018-01-01
Many projects have identified candidate genes for resistance to aflatoxin accumulation or Aspergillus flavus infection and growth in maize using genetic mapping, genomics, transcriptomics and/or proteomics studies. However, only a small percentage of these candidates have been validated in field conditions, and their relative contribution to...
Gehan, Malia A; Mockler, Todd C; Weinig, Cynthia; Ewers, Brent E
2017-01-01
The dynamics of local climates make development of agricultural strategies challenging. Yield improvement has progressed slowly, especially in drought-prone regions where annual crop production suffers from episodic aridity. Underlying drought responses are circadian and diel control of gene expression that regulate daily variations in metabolic and physiological pathways. To identify transcriptomic changes that occur in the crop Brassica rapa during initial perception of drought, we applied a co-expression network approach to associate rhythmic gene expression changes with physiological responses. Coupled analysis of transcriptome and physiological parameters over a two-day time course in control and drought-stressed plants provided temporal resolution necessary for correlation of network modules with dynamic changes in stomatal conductance, photosynthetic rate, and photosystem II efficiency. This approach enabled the identification of drought-responsive genes based on their differential rhythmic expression profiles in well-watered versus droughted networks and provided new insights into the dynamic physiological changes that occur during drought. PMID:28826479
Single-cell sequencing and tumorigenesis: improved understanding of tumor evolution and metastasis.
Ellsworth, Darrell L; Blackburn, Heather L; Shriver, Craig D; Rabizadeh, Shahrooz; Soon-Shiong, Patrick; Ellsworth, Rachel E
2017-12-01
Extensive genomic and transcriptomic heterogeneity in human cancer often negatively impacts treatment efficacy and survival, thus posing a significant ongoing challenge for modern treatment regimens. State-of-the-art DNA- and RNA-sequencing methods now provide high-resolution genomic and gene expression portraits of individual cells, facilitating the study of complex molecular heterogeneity in cancer. Important developments in single-cell sequencing (SCS) technologies over the past 5 years provide numerous advantages over traditional sequencing methods for understanding the complexity of carcinogenesis, but significant hurdles must be overcome before SCS can be clinically useful. In this review, we: (1) highlight current methodologies and recent technological advances for isolating single cells, single-cell whole-genome and whole-transcriptome amplification using minute amounts of nucleic acids, and SCS, (2) summarize research investigating molecular heterogeneity at the genomic and transcriptomic levels and how this heterogeneity affects clonal evolution and metastasis, and (3) discuss the promise for integrating SCS in the clinical care arena for improved patient care.
The technology and biology of single-cell RNA sequencing.
Kolodziejczyk, Aleksandra A; Kim, Jong Kyoung; Svensson, Valentine; Marioni, John C; Teichmann, Sarah A
2015-05-21
The differences between individual cells can have profound functional consequences, in both unicellular and multicellular organisms. Recently developed single-cell mRNA-sequencing methods enable unbiased, high-throughput, and high-resolution transcriptomic analysis of individual cells. This provides an additional dimension to transcriptomic information relative to traditional methods that profile bulk populations of cells. Already, single-cell RNA-sequencing methods have revealed new biology in terms of the composition of tissues, the dynamics of transcription, and the regulatory relationships between genes. Rapid technological developments at the level of cell capture, phenotyping, molecular biology, and bioinformatics promise an exciting future with numerous biological and medical applications. Copyright © 2015 Elsevier Inc. All rights reserved.
High-Resolution Global Soil Moisture Map
2015-05-19
High-resolution global soil moisture map from NASA SMAP combined radar and radiometer instruments, acquired between May 4 and May 11, 2015 during SMAP commissioning phase. The map has a resolution of 5.6 miles (9 kilometers). The data gap is due to turning the instruments on and off during testing. http://photojournal.jpl.nasa.gov/catalog/PIA19337
Mansfeldt, Cresten B.; Rowe, Annette R.; Heavner, Gretchen L. W.; Zinder, Stephen H.
2014-01-01
A cDNA-microarray was designed and used to monitor the transcriptomic profile of Dehalococcoides mccartyi strain 195 (in a mixed community) respiring various chlorinated organics, including chloroethenes and 2,3-dichlorophenol. The cultures were continuously fed in order to establish steady-state respiration rates and substrate levels. The organization of array data into a clustered heat map revealed two major experimental partitions. This partitioning in the data set was further explored through principal component analysis. The first two principal components separated the experiments into those with slow (1.6 ± 0.6 μM Cl−/h)- and fast (22.9 ± 9.6 μM Cl−/h)-respiring cultures. Additionally, the transcripts with the highest loadings in these principal components were identified, suggesting that those transcripts were responsible for the partitioning of the experiments. By analyzing the transcriptomes (n = 53) across experiments, relationships among transcripts were identified, and hypotheses about the relationships between electron transport chain members were proposed. One hypothesis, that the hydrogenases Hup and Hym and the formate dehydrogenase-like oxidoreductase (DET0186-DET0187) form a complex (as displayed by their tight clustering in the heat map analysis), was explored using a nondenaturing protein separation technique combined with proteomic sequencing. Although these proteins did not migrate as a single complex, DET0112 (an FdhB-like protein encoded in the Hup operon) was found to comigrate with DET0187 rather than with the catalytic Hup subunit DET0110. On closer inspection of the genome annotations of all Dehalococcoides strains, the DET0185-to-DET0187 operon was found to lack a key subunit, an FdhB-like protein. Therefore, on the basis of the transcriptomic, genomic, and proteomic evidence, the place of the missing subunit in the DET0185-to-DET0187 operon is likely filled by recruiting a subunit expressed from the Hup operon (DET0112). PMID:25063656
Transcriptome Assembly, Gene Annotation and Tissue Gene Expression Atlas of the Rainbow Trout
Salem, Mohamed; Paneru, Bam; Al-Tobasei, Rafet; Abdouni, Fatima; Thorgaard, Gary H.; Rexroad, Caird E.; Yao, Jianbo
2015-01-01
Efforts to obtain a comprehensive genome sequence for rainbow trout are ongoing and will be complemented by transcriptome information that will enhance genome assembly and annotation. Previously, transcriptome reference sequences were reported using data from different sources. Although the previous work added a great wealth of sequences, a complete and well-annotated transcriptome is still needed. In addition, gene expression in different tissues was not completely addressed in the previous studies. In this study, non-normalized cDNA libraries were sequenced from 13 different tissues of a single doubled haploid rainbow trout from the same source used for the rainbow trout genome sequence. A total of ~1.167 billion paired-end reads were de novo assembled using the Trinity RNA-Seq assembler yielding 474,524 contigs > 500 base-pairs. Of them, 287,593 had homologies to the NCBI non-redundant protein database. The longest contig of each cluster was selected as a reference, yielding 44,990 representative contigs. A total of 4,146 contigs (9.2%), including 710 full-length sequences, did not match any mRNA sequences in the current rainbow trout genome reference. Mapping reads to the reference genome identified an additional 11,843 transcripts not annotated in the genome. A digital gene expression atlas revealed 7,678 housekeeping and 4,021 tissue-specific genes. Expression of about 16,000–32,000 genes (35–71% of the identified genes) accounted for basic and specialized functions of each tissue. White muscle and stomach had the least complex transcriptomes, with high percentages of their total mRNA contributed by a small number of genes. Brain, testis and intestine, in contrast, had complex transcriptomes, with a large numbers of genes involved in their expression patterns. This study provides comprehensive de novo transcriptome information that is suitable for functional and comparative genomics studies in rainbow trout, including annotation of the genome. PMID:25793877
Babineau, Marielle; Mahmood, Khalid; Mathiassen, Solvejg K; Kudsk, Per; Kristensen, Michael
2017-02-06
Loose silky bentgrass (Apera spica-venti) is an important weed in Europe with a recent increase in herbicide resistance cases. The lack of genetic information about this noxious weed limits its biological understanding such as growth, reproduction, genetic variation, molecular ecology and metabolic herbicide resistance. This study produced a reference transcriptome for A. spica-venti from different tissues (leaf, root, stem) and various growth stages (seed at phenological stages 05, 07, 08, 09). The de novo assembly was performed on individual and combined dataset followed by functional annotations. Individual transcripts and gene families involved in metabolic based herbicide resistance were identified. Eight separate transcriptome assemblies were performed and compared. The combined transcriptome assembly consists of 83,349 contigs with an N50 and average contig length of 762 and 658 bp, respectively. This dataset contains 74,724 transcripts consisting of total 54,846,111 bp. Among them 94% had a homologue to UniProtKB, 73% retrieved a GO mapping, and 50% were functionally annotated. Compared with other grass species, A. spica-venti has 26% proteins in common to Brachypodium distachyon, and 41% to Lolium spp. Glycosyltransferases had the highest number of transcripts in each tissue followed by the cytochrome P450s. The GSTF1 and CYP89A2 transcripts were recovered from the majority of tissues and aligned at a maximum of 66 and 30% to proven herbicide resistant allele from Alopecurus myosuroides and Lolium rigidum, respectively. De novo transcriptome assembly enabled the generation of the first reference transcriptome of A. spica-venti. This can serve as stepping stone for understanding the metabolic herbicide resistance as well as the general biology of this problematic weed. Furthermore, this large-scale sequence data is a valuable scientific resource for comparative transcriptome analysis for Poaceae grasses.
Chen, Tzu-Han; Shiau, Hsin-Chieh
2018-01-01
Single cell transcriptome (SCT) analysis provides superior resolution to illustrate tumor cell heterogeneity for clinical implications. We characterized four SCTs of MCF-7 using 143 housekeeping genes (HKGs) as control, of which lactate dehydrogenase B (LDHB) expression is silenced. These SCT libraries mapped to 11,423, 11,486, 10,380, and 11,306 RefSeq genes (UCSC), respectively. High consistency in HKG expression levels across all four SCTs, along with transcriptional silencing of LDHB, was observed, suggesting a high sensitivity and reproducibility of the SCT analysis. Cross-library comparison on expression levels by scatter plotting revealed a linear correlation and an 83–94% overlap in transcript isoforms and expressed genes were also observed. To gain insight of transcriptional diversity among the SCTs, expressed genes were split into consistently expressed (CE) (expressed in all SCTs) and inconsistently expressed (IE) (expressed in some but not all SCTs) genes for further characterization, along with the 142 expressed HKGs as a reference. Distinct transcriptional strengths were found among these groups, with averages of 1,612.0, 88.0 and 1.2 FPKM for HKGs, CE and IE, respectively. Comparison between CE and IE groups further indicated that expressions of CE genes vary more significantly than that of IE genes. Gene Ontology analysis indicated that proteins encoded by CE genes are mainly involved in fundamental intracellular activities, while proteins encoded by IE genes are mainly for extracellular activities, especially acting as receptors or ion channels. The diversified gene expressions, especially for those encoded by IE genes, may contribute to cancer drug resistance. PMID:29920548
Yang, Yu-Jiao; Wang, Shuai; Zhang, Biao; Shen, Hong-Bin
2018-06-25
As a relatively new technology to solve the three-dimensional (3D) structure of a protein or protein complex, single-particle reconstruction (SPR) of cryogenic electron microscopy (cryo-EM) images shows much superiority and is in a rapidly developing stage. Resolution measurement in SPR, which evaluates the quality of a reconstructed 3D density map, plays a critical role in promoting methodology development of SPR and structural biology. Because there is no benchmark map in the generation of a new structure, how to realize the resolution estimation of a new map is still an open problem. Existing approaches try to generate a hypothetical benchmark map by reconstructing two 3D models from two halves of the original 2D images for cross-reference, which may result in a premature estimation with a half-data model. In this paper, we report a new self-reference-based resolution estimation protocol, called SRes, that requires only a single reconstructed 3D map. The core idea of SRes is to perform a multiscale spectral analysis (MSSA) on the map through multiple size-variable masks segmenting the map. The MSSA-derived multiscale spectral signal-to-noise ratios (mSSNRs) reveal that their corresponding estimated resolutions will show a cliff jump phenomenon, indicating a significant change in the SSNR properties. The critical point on the cliff borderline is demonstrated to be the right estimator for the resolution of the map.
NASA Astrophysics Data System (ADS)
Qin, Yuanwei; Xiao, Xiangming; Dong, Jinwei; Zhou, Yuting; Zhu, Zhe; Zhang, Geli; Du, Guoming; Jin, Cui; Kou, Weili; Wang, Jie; Li, Xiangping
2015-07-01
Accurate and timely rice paddy field maps with a fine spatial resolution would greatly improve our understanding of the effects of paddy rice agriculture on greenhouse gases emissions, food and water security, and human health. Rice paddy field maps were developed using optical images with high temporal resolution and coarse spatial resolution (e.g., Moderate Resolution Imaging Spectroradiometer (MODIS)) or low temporal resolution and high spatial resolution (e.g., Landsat TM/ETM+). In the past, the accuracy and efficiency for rice paddy field mapping at fine spatial resolutions were limited by the poor data availability and image-based algorithms. In this paper, time series MODIS and Landsat ETM+/OLI images, and the pixel- and phenology-based algorithm are used to map paddy rice planting area. The unique physical features of rice paddy fields during the flooding/open-canopy period are captured with the dynamics of vegetation indices, which are then used to identify rice paddy fields. The algorithm is tested in the Sanjiang Plain (path/row 114/27) in China in 2013. The overall accuracy of the resulted map of paddy rice planting area generated by both Landsat ETM+ and OLI is 97.3%, when evaluated with areas of interest (AOIs) derived from geo-referenced field photos. The paddy rice planting area map also agrees reasonably well with the official statistics at the level of state farms (R2 = 0.94). These results demonstrate that the combination of fine spatial resolution images and the phenology-based algorithm can provide a simple, robust, and automated approach to map the distribution of paddy rice agriculture in a year.
Qin, Yuanwei; Xiao, Xiangming; Dong, Jinwei; Zhou, Yuting; Zhu, Zhe; Zhang, Geli; Du, Guoming; Jin, Cui; Kou, Weili; Wang, Jie; Li, Xiangping
2015-07-01
Accurate and timely rice paddy field maps with a fine spatial resolution would greatly improve our understanding of the effects of paddy rice agriculture on greenhouse gases emissions, food and water security, and human health. Rice paddy field maps were developed using optical images with high temporal resolution and coarse spatial resolution (e.g., Moderate Resolution Imaging Spectroradiometer (MODIS)) or low temporal resolution and high spatial resolution (e.g., Landsat TM/ETM+). In the past, the accuracy and efficiency for rice paddy field mapping at fine spatial resolutions were limited by the poor data availability and image-based algorithms. In this paper, time series MODIS and Landsat ETM+/OLI images, and the pixel- and phenology-based algorithm are used to map paddy rice planting area. The unique physical features of rice paddy fields during the flooding/open-canopy period are captured with the dynamics of vegetation indices, which are then used to identify rice paddy fields. The algorithm is tested in the Sanjiang Plain (path/row 114/27) in China in 2013. The overall accuracy of the resulted map of paddy rice planting area generated by both Landsat ETM+ and OLI is 97.3%, when evaluated with areas of interest (AOIs) derived from geo-referenced field photos. The paddy rice planting area map also agrees reasonably well with the official statistics at the level of state farms ( R 2 = 0.94). These results demonstrate that the combination of fine spatial resolution images and the phenology-based algorithm can provide a simple, robust, and automated approach to map the distribution of paddy rice agriculture in a year.
High-resolution mid-infrared observations of NGC 7469
NASA Technical Reports Server (NTRS)
Miles, J. W.; Houck, J. R.; Hayward, T. L.
1994-01-01
We present a high-resolution 11.7 micrometer image of the starburst/Seyfert hybrid galaxy NGC 7469 using the Hale 5 m telescope at Palomar Observatory. Our map, with diffraction limited spatial resolution of 0.6 sec, shows a 3 sec diameter ring of emission around an unresolved nucleus. The map is similar to the Very Large Array (VLA) 6 cm map of this galaxy made with 0.4 sec resolution by Wilson et al. (1991). About half of the mid-infrared flux in our map emerges from the unresolved nucleus. We also present spatially resolved low resolution spectra that show that the 11.3 micrometer polycyclic aromatic hydrocarbon (PAH) feature comes from the circumnuclear ring but not from the nucleus of the galaxy.
NASA Technical Reports Server (NTRS)
Madsen, Soren N.; Carsey, Frank D.; Turtle, Elizabeth P.
2003-01-01
The topographic data set obtained by MOLA has provided an unprecedented level of information about Mars' geologic features. The proposed flight of JIMO provides an opportunity to accomplish a similar mapping of and comparable scientific discovery for the Jovian moons through us of an interferometric imaging radar analogous to the Shuttle radar that recently generated a new topographic map of Earth. A Ka-band single pass across-track synthetic aperture radar (SAR) interferometer can provide very high resolution surface elevation maps. The concept would use two antennas mounted at the ends of a deployable boom (similar to the Shuttle Radar Topographic Mapper) extended orthogonal to the direction of flight. Assuming an orbit altitude of approximately 100 km and a ground velocity of approximately 1.5 km/sec, horizontal resolutions at the 10 meter level and vertical resolutions at the sub-meter level are possible.
NASA Technical Reports Server (NTRS)
Madsen, S. N.; Carsey, F. D.; Turtle, E. P.
2003-01-01
The topographic data set obtained by MOLA has provided an unprecedented level of information about Mars' geologic features. The proposed flight of JIMO provides an opportunity to accomplish a similar mapping of and comparable scientific discovery for the Jovian moons through use of an interferometric imaging radar analogous to the Shuttle radar that recently generated a new topographic map of Earth. A Ka-band single pass across-track synthetic aperture radar (SAR) interferometer can provide very high resolution surface elevation maps. The concept would use two antennas mounted at the ends of a deployable boom (similar to the Shuttle Radar Topographic Mapper) extended orthogonal to the direction of flight. Assuming an orbit altitude of approximately 100km and a ground velocity of approximately 1.5 km/sec, horizontal resolutions at the 10 meter level and vertical resolutions at the sub-meter level are possible.
NASA Technical Reports Server (NTRS)
Huang, Dong; Yang, Wenze; Tan, Bin; Rautiainen, Miina; Zhang, Ping; Hu, Jiannan; Shabanov, Nikolay V.; Linder, Sune; Knyazikhin, Yuri; Myneni, Ranga B.
2006-01-01
The validation of moderate-resolution satellite leaf area index (LAI) products such as those operationally generated from the Moderate Resolution Imaging Spectroradiometer (MODIS) sensor data requires reference LAI maps developed from field LAI measurements and fine-resolution satellite data. Errors in field measurements and satellite data determine the accuracy of the reference LAI maps. This paper describes a method by which reference maps of known accuracy can be generated with knowledge of errors in fine-resolution satellite data. The method is demonstrated with data from an international field campaign in a boreal coniferous forest in northern Sweden, and Enhanced Thematic Mapper Plus images. The reference LAI map thus generated is used to assess modifications to the MODIS LAI/fPAR algorithm recently implemented to derive the next generation of the MODIS LAI/fPAR product for this important biome type.
Towards a Full-sky, High-resolution Dust Extinction Map with WISE and Planck
NASA Astrophysics Data System (ADS)
Meisner, Aaron M.; Finkbeiner, D. P.
2014-01-01
We have recently completed a custom processing of the entire WISE 12 micron All-sky imaging data set. The result is a full-sky map of diffuse, mid-infrared Galactic dust emission with angular resolution of 15 arcseconds, and with contaminating artifacts such as compact sources removed. At the same time, the 2013 Planck HFI maps represent a complementary data set in the far-infrared, with zero-point relatively immune to zodiacal contamination and angular resolution superior to previous full-sky data sets at similar frequencies. Taken together, these WISE and Planck data products present an opportunity to improve upon the SFD (1998) dust extinction map, by virtue of enhanced angular resolution and potentially better-controlled systematics on large scales. We describe our continuing efforts to construct and test high-resolution dust extinction and temperature maps based on our custom WISE processing and Planck HFI data.
Benton, Matthew A; Kenny, Nathan J; Conrads, Kai H; Roth, Siegfried; Lynch, Jeremy A
2016-01-01
Despite recent efforts to sample broadly across metazoan and insect diversity, current sequence resources in the Coleoptera do not adequately describe the diversity of the clade. Here we present deep, staged transcriptomic data for two coleopteran species, Atrachya menetriesi (Faldermann 1835) and Callosobruchus maculatus (Fabricius 1775). Our sampling covered key stages in ovary and early embryonic development in each species. We utilized this data to build combined assemblies for each species which were then analysed in detail. The combined A. menetriesi assembly consists of 228,096 contigs with an N50 of 1,598 bp, while the combined C. maculatus assembly consists of 128,837 contigs with an N50 of 2,263 bp. For these assemblies, 34.6% and 32.4% of contigs were identified using Blast2GO, and 97% and 98.3% of the BUSCO set of metazoan orthologs were present, respectively. We also carried out manual annotation of developmental signalling pathways and found that nearly all expected genes were present in each transcriptome. Our analyses show that both transcriptomes are of high quality. Lastly, we performed read mapping utilising our timed, stage specific RNA samples to identify differentially expressed contigs. The resources presented here will provide a firm basis for a variety of experimentation, both in developmental biology and in comparative genomic studies.
Zhao, Feng; Yan, Chao; Wang, Xuan; Yang, Yang; Wang, Guangyin; Lee, Wenhui; Xiang, Yang; Zhang, Yun
2014-01-01
Amphibians occupy a key phylogenetic position in vertebrates and evolution of the immune system. But, the resources of its transcriptome or genome are still little now. Bombina maxima possess strong ability to survival in very harsh environment with a more mature immune system. We obtained a comprehensive transcriptome by RNA-sequencing technology. 14.3% of transcripts were identified to be skin-specific genes, most of which were not isolated from skin secretion in previous works or novel non-coding RNAs. 27.9% of transcripts were mapped into 242 predicted KEGG pathways and 6.16% of transcripts related to human disease and cancer. Of 39 448 transcripts with the coding sequence, at least 1501 transcripts (570 genes) related to the immune system process. The molecules of immune signalling pathway were almost presented, several transcripts with high expression in skin and stomach. Experiments showed that lipopolysaccharide or bacteria challenge stimulated pro-inflammatory cytokine production and activation of pro-inflammatory caspase-1. These frog's data can remarkably expand the existing genome or transcriptome resources of amphibians, especially immunity data. The entity of the data provides a valuable platform for further investigation on more detailed immune response in B. maxima and a comparative study with other amphibians. PMID:23942912
High-resolution mapping of forest carbon stocks in the Colombian Amazon
NASA Astrophysics Data System (ADS)
Asner, G. P.; Clark, J. K.; Mascaro, J.; Galindo García, G. A.; Chadwick, K. D.; Navarrete Encinales, D. A.; Paez-Acosta, G.; Cabrera Montenegro, E.; Kennedy-Bowdoin, T.; Duque, Á.; Balaji, A.; von Hildebrand, P.; Maatoug, L.; Bernal, J. F. Phillips; Yepes Quintero, A. P.; Knapp, D. E.; García Dávila, M. C.; Jacobson, J.; Ordóñez, M. F.
2012-07-01
High-resolution mapping of tropical forest carbon stocks can assist forest management and improve implementation of large-scale carbon retention and enhancement programs. Previous high-resolution approaches have relied on field plot and/or light detection and ranging (LiDAR) samples of aboveground carbon density, which are typically upscaled to larger geographic areas using stratification maps. Such efforts often rely on detailed vegetation maps to stratify the region for sampling, but existing tropical forest maps are often too coarse and field plots too sparse for high-resolution carbon assessments. We developed a top-down approach for high-resolution carbon mapping in a 16.5 million ha region (> 40%) of the Colombian Amazon - a remote landscape seldom documented. We report on three advances for large-scale carbon mapping: (i) employing a universal approach to airborne LiDAR-calibration with limited field data; (ii) quantifying environmental controls over carbon densities; and (iii) developing stratification- and regression-based approaches for scaling up to regions outside of LiDAR coverage. We found that carbon stocks are predicted by a combination of satellite-derived elevation, fractional canopy cover and terrain ruggedness, allowing upscaling of the LiDAR samples to the full 16.5 million ha region. LiDAR-derived carbon maps have 14% uncertainty at 1 ha resolution, and the regional map based on stratification has 28% uncertainty in any given hectare. High-resolution approaches with quantifiable pixel-scale uncertainties will provide the most confidence for monitoring changes in tropical forest carbon stocks. Improved confidence will allow resource managers and decision makers to more rapidly and effectively implement actions that better conserve and utilize forests in tropical regions.
High-resolution Mapping of Forest Carbon Stocks in the Colombian Amazon
NASA Astrophysics Data System (ADS)
Asner, G. P.; Clark, J. K.; Mascaro, J.; Galindo García, G. A.; Chadwick, K. D.; Navarrete Encinales, D. A.; Paez-Acosta, G.; Cabrera Montenegro, E.; Kennedy-Bowdoin, T.; Duque, Á.; Balaji, A.; von Hildebrand, P.; Maatoug, L.; Bernal, J. F. Phillips; Knapp, D. E.; García Dávila, M. C.; Jacobson, J.; Ordóñez, M. F.
2012-03-01
High-resolution mapping of tropical forest carbon stocks can assist forest management and improve implementation of large-scale carbon retention and enhancement programs. Previous high-resolution approaches have relied on field plot and/or Light Detection and Ranging (LiDAR) samples of aboveground carbon density, which are typically upscaled to larger geographic areas using stratification maps. Such efforts often rely on detailed vegetation maps to stratify the region for sampling, but existing tropical forest maps are often too coarse and field plots too sparse for high resolution carbon assessments. We developed a top-down approach for high-resolution carbon mapping in a 16.5 million ha region (>40 %) of the Colombian Amazon - a remote landscape seldom documented. We report on three advances for large-scale carbon mapping: (i) employing a universal approach to airborne LiDAR-calibration with limited field data; (ii) quantifying environmental controls over carbon densities; and (iii) developing stratification- and regression-based approaches for scaling up to regions outside of LiDAR coverage. We found that carbon stocks are predicted by a combination of satellite-derived elevation, fractional canopy cover and terrain ruggedness, allowing upscaling of the LiDAR samples to the full 16.5 million ha region. LiDAR-derived carbon mapping samples had 14.6 % uncertainty at 1 ha resolution, and regional maps based on stratification and regression approaches had 25.6 % and 29.6 % uncertainty, respectively, in any given hectare. High-resolution approaches with reported local-scale uncertainties will provide the most confidence for monitoring changes in tropical forest carbon stocks. Improved confidence will allow resource managers and decision-makers to more rapidly and effectively implement actions that better conserve and utilize forests in tropical regions.
Mills, James D.; Kavanagh, Tomas; Kim, Woojin S.; Chen, Bei Jun; Kawahara, Yoshihiro; Halliday, Glenda M.; Janitz, Michael
2013-01-01
The human frontal lobe has undergone accelerated evolution, leading to the development of unique human features such as language and self-reflection. Cortical grey matter and underlying white matter reflect distinct cellular compositions in the frontal lobe. Surprisingly little is known about the transcriptomal landscape of these distinct regions. Here, for the first time, we report a detailed transcriptomal profile of the frontal grey (GM) and white matter (WM) with resolution to alternatively spliced isoforms obtained using the RNA-Seq approach. We observed more vigorous transcriptome activity in GM compared to WM, presumably because of the presence of cellular bodies of neurons in the GM and RNA associated with the nucleus and perinuclear space. Among the top differentially expressed genes, we also identified a number of long intergenic non-coding RNAs (lincRNAs), specifically expressed in white matter, such as LINC00162. Furthermore, along with confirmation of expression of known markers for neurons and oligodendrocytes, we identified a number of genes and splicing isoforms that are exclusively expressed in GM or WM with examples of GABRB2 and PAK2 transcripts, respectively. Pathway analysis identified distinct physiological and biochemical processes specific to grey and white matter samples with a prevalence of synaptic processes in GM and myelination regulation and axonogenesis in the WM. Our study also revealed that expression of many genes, for example, the GPR123, is characterized by isoform switching, depending in which structure the gene is expressed. Our report clearly shows that GM and WM have perhaps surprisingly divergent transcriptome profiles, reflecting distinct roles in brain physiology. Further, this study provides the first reference data set for a normal human frontal lobe, which will be useful in comparative transcriptome studies of cerebral disorders, in particular, neurodegenerative diseases. PMID:24194939
Transcriptome landscape of a bacterial pathogen under plant immunity.
Nobori, Tatsuya; Velásquez, André C; Wu, Jingni; Kvitko, Brian H; Kremer, James M; Wang, Yiming; He, Sheng Yang; Tsuda, Kenichi
2018-03-27
Plant pathogens can cause serious diseases that impact global agriculture. The plant innate immunity, when fully activated, can halt pathogen growth in plants. Despite extensive studies into the molecular and genetic bases of plant immunity against pathogens, the influence of plant immunity in global pathogen metabolism to restrict pathogen growth is poorly understood. Here, we developed RNA sequencing pipelines for analyzing bacterial transcriptomes in planta and determined high-resolution transcriptome patterns of the foliar bacterial pathogen Pseudomonas syringae in Arabidopsis thaliana with a total of 27 combinations of plant immunity mutants and bacterial strains. Bacterial transcriptomes were analyzed at 6 h post infection to capture early effects of plant immunity on bacterial processes and to avoid secondary effects caused by different bacterial population densities in planta We identified specific "immune-responsive" bacterial genes and processes, including those that are activated in susceptible plants and suppressed by plant immune activation. Expression patterns of immune-responsive bacterial genes at the early time point were tightly linked to later bacterial growth levels in different host genotypes. Moreover, we found that a bacterial iron acquisition pathway is commonly suppressed by multiple plant immune-signaling pathways. Overexpression of a P. syringae sigma factor gene involved in iron regulation and other processes partially countered bacterial growth restriction during the plant immune response triggered by AvrRpt2. Collectively, this study defines the effects of plant immunity on the transcriptome of a bacterial pathogen and sheds light on the enigmatic mechanisms of bacterial growth inhibition during the plant immune response.
NOAA's Use of High-Resolution Imagery
NASA Technical Reports Server (NTRS)
Hund, Erik
2007-01-01
NOAA's use of high-resolution imagery consists of: a) Shoreline mapping and nautical chart revision; b) Coastal land cover mapping; c) Benthic habitat mapping; d) Disaster response; and e) Imagery collection and support for coastal programs.
L. Arroyo; S.P. Healey; W.B. Cohen; D. Cocero; J.A. Manzanera
2006-01-01
Knowledge of fuel load and composition is critical in fighting, preventing, and understanding wildfires. Commonly, the generation of fuel maps from remotely sensed imagery has made use of medium-resolution sensors such as Landsat. This paper presents a methodology to generate fuel type maps from high spatial resolution satellite data through object-oriented...
Hisatake, S; Kobayashi, T
2006-12-25
We demonstrate a time-to-space mapping of an optical signal with a picosecond time resolution based on an electrooptic beam deflection. A time axis of the optical signal is mapped into a spatial replica by the deflection. We theoretically derive a minimum time resolution of the time-to-space mapping and confirm it experimentally on the basis of the pulse width of the optical pulses picked out from the deflected beam through a narrow slit which acts as a temporal window. We have achieved the minimum time resolution of 1.6+/-0.2 ps.
Daily High-Resolution Flood Maps of Africa: 1992-present with Near Real Time Updates
NASA Astrophysics Data System (ADS)
Picton, J.; Galantowicz, J. F.; Root, B.
2016-12-01
The ability to characterize past and current flood extents frequently, accurately, and at high resolution is needed for many applications including risk assessment, wetlands monitoring, and emergency management. However, remote sensing methods have not been capable of meeting all of these requirements simultaneously. Cloud cover too often obscures the surface for visual and infrared sensors and observations from radar sensors are too infrequent to create consistent historical databases or monitor evolving events. Lower-resolution (10-50 km) passive microwave sensors, such as SSM/I, AMSR-E, and AMSR2, are sensitive to water cover, acquire useful data during clear and cloudy conditions, have revisit periods of up to twice daily, and provide a continuous record of data from 1992 to the present. What they lack most is the resolution needed to map flood extent. We will present results from a flood mapping system capable of producing high-resolution (90-m) flood extent depictions from lower resolution microwave data. The system uses the strong sensitivity of microwave data to surface water coverage combined with land surface and atmospheric data to derive daily flooded fraction estimates on a sensor-footprint basis. The system downscales flooded fraction to make high-resolution Boolean flood extent depictions that are spatially continuous and consistent with the lower resolution data. The downscaling step is based on a relative floodability (RF) index derived from higher-resolution topographic and hydrological data. We process RF to create a flooded fraction threshold map that relates each 90-m grid point to the surrounding terrain at the microwave scale. We have derived daily, 90-m resolution flood maps for Africa covering 1992-present using SSM/I, AMSR-E, and AMSR2 data and we are now producing new daily maps in near real time. The flood maps are being used by the African Risk Capacity (ARC) Agency to underpin an intergovernmental river flood insurance program in Africa. We will present results showing daily flood extents during major events and discuss: validation of the flood maps against MODIS-derived maps; analyses of minimum detectable flood size; aggregate analyses of flood extent over time; flood map use in ARC's insurance model; and results applying the system to the Americas.
Single-Cell Sequencing for Drug Discovery and Drug Development.
Wu, Hongjin; Wang, Charles; Wu, Shixiu
2017-01-01
Next-generation sequencing (NGS), particularly single-cell sequencing, has revolutionized the scale and scope of genomic and biomedical research. Recent technological advances in NGS and singlecell studies have made the deep whole-genome (DNA-seq), whole epigenome and whole-transcriptome sequencing (RNA-seq) at single-cell level feasible. NGS at the single-cell level expands our view of genome, epigenome and transcriptome and allows the genome, epigenome and transcriptome of any organism to be explored without a priori assumptions and with unprecedented throughput. And it does so with single-nucleotide resolution. NGS is also a very powerful tool for drug discovery and drug development. In this review, we describe the current state of single-cell sequencing techniques, which can provide a new, more powerful and precise approach for analyzing effects of drugs on treated cells and tissues. Our review discusses single-cell whole genome/exome sequencing (scWGS/scWES), single-cell transcriptome sequencing (scRNA-seq), single-cell bisulfite sequencing (scBS), and multiple omics of single-cell sequencing. We also highlight the advantages and challenges of each of these approaches. Finally, we describe, elaborate and speculate the potential applications of single-cell sequencing for drug discovery and drug development. Copyright© Bentham Science Publishers; For any queries, please email at epub@benthamscience.org.
Simulated cosmic microwave background maps at 0.5 deg resolution: Basic results
NASA Technical Reports Server (NTRS)
Hinshaw, G.; Bennett, C. L.; Kogut, A.
1995-01-01
We have simulated full-sky maps of the cosmic microwave background (CMB) anisotropy expected from cold dark matter (CDM) models at 0.5 deg and 1.0 deg angular resolution. Statistical properties of the maps are presented as a function of sky coverage, angular resolution, and instrument noise, and the implications of these results for observability of the Doppler peak are discussed. The rms fluctuations in a map are not a particularly robust probe of the existence of a Doppler peak; however, a full correlation analysis can provide reasonable sensitivity. We find that sensitivity to the Doppler peak depends primarily on the fraction of sky covered, and only secondarily on the angular resolution and noise level. Color plates of the simulated maps are presented to illustrate the anisotropies.
The validation process for a moderate resolution leaf area index (LAI) product (i.e., MODIS) involves the creation of a high spatial resolution LAI reference map (Lai-RM), which when scaled to the moderate LAI resolution (i.e., >1 km) allows for comparison and analysis with this ...
NASA Astrophysics Data System (ADS)
Chan, Y. C.; Shih, N. C.; Hsieh, Y. C.
2016-12-01
Geologic maps have provided fundamental information for many scientific and engineering applications in human societies. Geologic maps directly influence the reliability of research results or the robustness of engineering projects. In the past, geologic maps were mainly produced by field geologists through direct field investigations and 2D topographic maps. However, the quality of traditional geologic maps was significantly compromised by field conditions, particularly, when the map area is covered by heavy forest canopies. Recent developments in airborne LiDAR technology may virtually remove trees or buildings, thus, providing a useful data set for improving geological mapping. Because high-quality topographic information still needs to be interpreted in terms of geology, there are many fundamental questions regarding how to best apply the data set for high-resolution geological mapping. In this study, we aim to test the quality and reliability of high-resolution geologic maps produced by recent technological methods through an example from the fold-and-thrust belt in northern Taiwan. We performed the geological mapping by applying the LiDAR-derived DEM, self-developed program tools and many layers of relevant information at interactive 3D environments. Our mapping results indicate that the proposed methods will considerably improve the quality and consistency of the geologic maps. The study also shows that in order to gain consistent mapping results, future high-resolution geologic maps should be produced at interactive 3D environments on the basis of existing geologic maps.
Identifying grain-size dependent errors on global forest area estimates and carbon studies
Daolan Zheng; Linda S. Heath; Mark J. Ducey
2008-01-01
Satellite-derived coarse-resolution data are typically used for conducting global analyses. But the forest areas estimated from coarse-resolution maps (e.g., 1 km) inevitably differ from a corresponding fine-resolution map (such as a 30-m map) that would be closer to ground truth. A better understanding of changes in grain size on area estimation will improve our...
USDA-ARS?s Scientific Manuscript database
Construction of genetic linkage map is essential for genetic and genomic studies. Recent advances in sequencing and genotyping technologies made it possible to generate high-density and high-resolution genetic linkage maps, especially for the organisms lacking extensive genomic resources. In the pre...
2013-01-01
Background Sequencing of the genome of Propionibacterium acnes produced a catalogue of genes many of which enable this organism to colonise skin and survive exposure to the elements. Despite this platform, there was little understanding of the gene regulation that gives rise to an organism that has a major impact on human health and wellbeing and causes infections beyond the skin. To address this situation, we have undertaken a genome–wide study of gene regulation using a combination of improved differential and global RNA-sequencing and an analytical approach that takes into account the inherent noise within the data. Results We have produced nucleotide-resolution transcriptome maps that identify and differentiate sites of transcription initiation from sites of stable RNA processing and mRNA cleavage. Moreover, analysis of these maps provides strong evidence for ‘pervasive’ transcription and shows that contrary to initial indications it is not biased towards the production of antisense RNAs. In addition, the maps reveal an extensive array of riboswitches, leaderless mRNAs and small non-protein-coding RNAs alongside vegetative promoters and post-transcriptional events, which includes unusual tRNA processing. The identification of such features will inform models of complex gene regulation, as illustrated here for ribonucleotide reductases and a potential quorum-sensing, two-component system. Conclusions The approach described here, which is transferable to any bacterial species, has produced a step increase in whole-cell knowledge of gene regulation in P. acnes. Continued expansion of our maps to include transcription associated with different growth conditions and genetic backgrounds will provide a new platform from which to computationally model the gene expression that determines the physiology of P. acnes and its role in human disease. PMID:24034785
NASA Astrophysics Data System (ADS)
Solis, Julio; Gutierrez, Andres; Mangu, Venkata; Sanchez, Eduardo; Bedre, Renesh; Linscombe, Steve; Baisakh, Niranjan
2017-12-01
Drought stress is a constant threat to rice production worldwide. Most Mmodern rice cultivars are sensitive to drought, and the effect is severe at the reproductive stage. Conventional breeding for drought resistant (DR) rice varieties is slow and limited due to the quantitative nature of the DR traits. Identification of genes (QTLs)/markers associated with DR traits is a prerequisite for marker-assisted breeding. Grain yield is the most important trait and to this end drought yield QTLs have been identified under field conditions. The present study reports identification of drought yield QTLs under controlled conditions without confounding effects of other factors prevalent under natural conditions. A linkage map covering 1,781.5 cM with an average resolution of 9.76 cM was constructed using an F2 population from a cross between two Japonica cultivars, Cocodrie (drought sensitive) and Vandana (drought tolerant) with 213 markers distributed over 12 rice chromosomes. A subset of 59 markers (22 genic SSRs and 37 SNPs) derived from the transcriptome of the parents were also placed in the map. Single marker analysis using 187 F2:3 progeny identified 6 markers distributed on chromosomes 1, 5, and 8 to be associated with grain yield under drought (GYD). Composite interval mapping identified six genomic regions/quantitative trait loci (QTL) on chromosome 1, 5, 8, and 9 to be associated with GYD. QTLs located on chromosome 1 (qGYD1.2, qGYD1.3), chromosome 5 (qGYD5.1) and chromosome 8 (qGYD8.1) were contributed by Vandana alleles, whereas the QTLs, qGYD1.1 and qQYD9.1 were contributed by Cocodrie alelles. The additive positive phenotypic variance explained by the QTLs ranged from 30.0% to 34.0%. Candidate genes annotation within QTLs suggested the role of transcription factors and genes involved in osmotic potential regulation through catalytic/metabolic pathways in drought resistance tolerance mechanism contributing to yield.
Almeida, Nuno F.; Krezdorn, Nicolas; Rotter, Björn; Winter, Peter; Rubiales, Diego; Vaz Patto, Maria C.
2015-01-01
Lathyrus sativus (grass pea) is a temperate grain legume crop with a great potential for expansion in dry areas or zones that are becoming more drought-prone. It is also recognized as a potential source of resistance to several important diseases in legumes, such as ascochyta blight. Nevertheless, the lack of detailed genomic and/or transcriptomic information hampers further exploitation of grass pea resistance-related genes in precision breeding. To elucidate the pathways differentially regulated during ascochyta-grass pea interaction and to identify resistance candidate genes, we compared the early response of the leaf gene expression profile of a resistant L. sativus genotype to Ascochyta lathyri infection with a non-inoculated control sample from the same genotype employing deepSuperSAGE. This analysis generated 14.387 UniTags of which 95.7% mapped to a reference grass pea/rust interaction transcriptome. From the total mapped UniTags, 738 were significantly differentially expressed between control and inoculated leaves. The results indicate that several gene classes acting in different phases of the plant/pathogen interaction are involved in the L. sativus response to A. lathyri infection. Most notably a clear up-regulation of defense-related genes involved in and/or regulated by the ethylene pathway was observed. There was also evidence of alterations in cell wall metabolism indicated by overexpression of cellulose synthase and lignin biosynthesis genes. This first genome-wide overview of the gene expression profile of the L. sativus response to ascochyta infection delivered a valuable set of candidate resistance genes for future use in precision breeding. PMID:25852725
Herman, Dorota; Slabbinck, Bram; Pè, Mario Enrico
2016-01-01
Leaves are vital organs for biomass and seed production because of their role in the generation of metabolic energy and organic compounds. A better understanding of the molecular networks underlying leaf development is crucial to sustain global requirements for food and renewable energy. Here, we combined transcriptome profiling of proliferative leaf tissue with in-depth phenotyping of the fourth leaf at later stages of development in 197 recombinant inbred lines of two different maize (Zea mays) populations. Previously, correlation analysis in a classical biparental mapping population identified 1,740 genes correlated with at least one of 14 traits. Here, we extended these results with data from a multiparent advanced generation intercross population. As expected, the phenotypic variability was found to be larger in the latter population than in the biparental population, although general conclusions on the correlations among the traits are comparable. Data integration from the two diverse populations allowed us to identify a set of 226 genes that are robustly associated with diverse leaf traits. This set of genes is enriched for transcriptional regulators and genes involved in protein synthesis and cell wall metabolism. In order to investigate the molecular network context of the candidate gene set, we integrated our data with publicly available functional genomics data and identified a growth regulatory network of 185 genes. Our results illustrate the power of combining in-depth phenotyping with transcriptomics in mapping populations to dissect the genetic control of complex traits and present a set of candidate genes for use in biomass improvement. PMID:26754667
Genome-wide inference of regulatory networks in Streptomyces coelicolor.
Castro-Melchor, Marlene; Charaniya, Salim; Karypis, George; Takano, Eriko; Hu, Wei-Shou
2010-10-18
The onset of antibiotics production in Streptomyces species is co-ordinated with differentiation events. An understanding of the genetic circuits that regulate these coupled biological phenomena is essential to discover and engineer the pharmacologically important natural products made by these species. The availability of genomic tools and access to a large warehouse of transcriptome data for the model organism, Streptomyces coelicolor, provides incentive to decipher the intricacies of the regulatory cascades and develop biologically meaningful hypotheses. In this study, more than 500 samples of genome-wide temporal transcriptome data, comprising wild-type and more than 25 regulatory gene mutants of Streptomyces coelicolor probed across multiple stress and medium conditions, were investigated. Information based on transcript and functional similarity was used to update a previously-predicted whole-genome operon map and further applied to predict transcriptional networks constituting modules enriched in diverse functions such as secondary metabolism, and sigma factor. The predicted network displays a scale-free architecture with a small-world property observed in many biological networks. The networks were further investigated to identify functionally-relevant modules that exhibit functional coherence and a consensus motif in the promoter elements indicative of DNA-binding elements. Despite the enormous experimental as well as computational challenges, a systems approach for integrating diverse genome-scale datasets to elucidate complex regulatory networks is beginning to emerge. We present an integrated analysis of transcriptome data and genomic features to refine a whole-genome operon map and to construct regulatory networks at the cistron level in Streptomyces coelicolor. The functionally-relevant modules identified in this study pose as potential targets for further studies and verification.
Zeng, Shaohua; Xiao, Gong; Wang, Gan; Wang, Ying; Peng, Ming; Huang, Hongwen
2015-01-01
Red-fleshed kiwifruit (Actinidia chinensis Planch. ‘Hongyang’) is a promising commercial cultivar due to its nutritious value and unique flesh color, derived from vitamin C and anthocyanins. In this study, we obtained transcriptome data of ‘Hongyang’ from seven developmental stages using Illumina sequencing. We mapped 39–54 million reads to the recently sequenced kiwifruit genome and other databases to define gene structure, to analyze alternative splicing, and to quantify gene transcript abundance at different developmental stages. The transcript profiles throughout red kiwifruit development were constructed and analyzed, with a focus on the biosynthesis and metabolism of compounds such as phytohormones, sugars, starch and L-ascorbic acid, which are indispensable for the development and formation of quality fruit. Candidate genes for these pathways were identified through MapMan and phylogenetic analysis. The transcript levels of genes involved in sucrose and starch metabolism were consistent with the change in soluble sugar and starch content throughout kiwifruit development. The metabolism of L-ascorbic acid was very active, primarily through the L-galactose pathway. The genes responsible for the accumulation of anthocyanin in red kiwifruit were identified, and their expression levels were investigated during kiwifruit development. This survey of gene expression during kiwifruit development paves the way for further investigation of the development of this uniquely colored and nutritious fruit and reveals which factors are needed for high quality fruit formation. This transcriptome data and its analysis will be useful for improving kiwifruit genome annotation, for basic fruit molecular biology research, and for kiwifruit breeding and improvement. PMID:26301713
Zhu, Qian-Hao; Spriggs, Andrew; Taylor, Jennifer M.; Llewellyn, Danny; Wilson, Iain
2014-01-01
Varietal single nucleotide polymorphisms (SNPs) are the differences within one of the two subgenomes between different tetraploid cotton varieties and have not been practically used in cotton genetics and breeding because they are difficult to identify due to low genetic diversity and very high sequence identity between homeologous genes in cotton. We have used transcriptome and restriction site−associated DNA sequencing to identify varietal SNPs among 18 G. hirsutum varieties based on the rationale that varietal SNPs can be more confidently called when flanked by subgenome-specific SNPs. Using transcriptome data, we successfully identified 37,413 varietal SNPs and, of these, 22,121 did not have an additional varietal SNP within their 20-bp flanking regions so can be used in most SNP genotyping assays. From restriction site−associated DNA sequencing data, we identified an additional 3090 varietal SNPs between two of the varieties. Of the 1583 successful SNP assays achieved using different genotyping platforms, 1363 were verified. Many of the SNPs behaved as dominant markers because of coamplification from homeologous loci, but the number of SNPs acting as codominant markers increased when one or more subgenome-specific SNP(s) were incorporated in their assay primers, giving them greater utility for breeding applications. A G. hirsutum genetic map with 1244 SNP markers was constructed covering 5557.42 centiMorgan and used to map qualitative and quantitative traits. This collection of G. hirsutum varietal SNPs complements existing intra-specific SNPs and provides the cotton community with a valuable marker resource applicable to genetic analyses and breeding programs. PMID:25106949
Ruggles, Kelly V; Tang, Zuojian; Wang, Xuya; Grover, Himanshu; Askenazi, Manor; Teubl, Jennifer; Cao, Song; McLellan, Michael D; Clauser, Karl R; Tabb, David L; Mertins, Philipp; Slebos, Robbert; Erdmann-Gilmore, Petra; Li, Shunqiang; Gunawardena, Harsha P; Xie, Ling; Liu, Tao; Zhou, Jian-Ying; Sun, Shisheng; Hoadley, Katherine A; Perou, Charles M; Chen, Xian; Davies, Sherri R; Maher, Christopher A; Kinsinger, Christopher R; Rodland, Karen D; Zhang, Hui; Zhang, Zhen; Ding, Li; Townsend, R Reid; Rodriguez, Henry; Chan, Daniel; Smith, Richard D; Liebler, Daniel C; Carr, Steven A; Payne, Samuel; Ellis, Matthew J; Fenyő, David
2016-03-01
Improvements in mass spectrometry (MS)-based peptide sequencing provide a new opportunity to determine whether polymorphisms, mutations, and splice variants identified in cancer cells are translated. Herein, we apply a proteogenomic data integration tool (QUILTS) to illustrate protein variant discovery using whole genome, whole transcriptome, and global proteome datasets generated from a pair of luminal and basal-like breast-cancer-patient-derived xenografts (PDX). The sensitivity of proteogenomic analysis for singe nucleotide variant (SNV) expression and novel splice junction (NSJ) detection was probed using multiple MS/MS sample process replicates defined here as an independent tandem MS experiment using identical sample material. Despite analysis of over 30 sample process replicates, only about 10% of SNVs (somatic and germline) detected by both DNA and RNA sequencing were observed as peptides. An even smaller proportion of peptides corresponding to NSJ observed by RNA sequencing were detected (<0.1%). Peptides mapping to DNA-detected SNVs without a detectable mRNA transcript were also observed, suggesting that transcriptome coverage was incomplete (∼80%). In contrast to germline variants, somatic variants were less likely to be detected at the peptide level in the basal-like tumor than in the luminal tumor, raising the possibility of differential translation or protein degradation effects. In conclusion, this large-scale proteogenomic integration allowed us to determine the degree to which mutations are translated and identify gaps in sequence coverage, thereby benchmarking current technology and progress toward whole cancer proteome and transcriptome analysis. © 2016 by The American Society for Biochemistry and Molecular Biology, Inc.
Baute, Joke; Herman, Dorota; Coppens, Frederik; De Block, Jolien; Slabbinck, Bram; Dell'Acqua, Matteo; Pè, Mario Enrico; Maere, Steven; Nelissen, Hilde; Inzé, Dirk
2016-03-01
Leaves are vital organs for biomass and seed production because of their role in the generation of metabolic energy and organic compounds. A better understanding of the molecular networks underlying leaf development is crucial to sustain global requirements for food and renewable energy. Here, we combined transcriptome profiling of proliferative leaf tissue with in-depth phenotyping of the fourth leaf at later stages of development in 197 recombinant inbred lines of two different maize (Zea mays) populations. Previously, correlation analysis in a classical biparental mapping population identified 1,740 genes correlated with at least one of 14 traits. Here, we extended these results with data from a multiparent advanced generation intercross population. As expected, the phenotypic variability was found to be larger in the latter population than in the biparental population, although general conclusions on the correlations among the traits are comparable. Data integration from the two diverse populations allowed us to identify a set of 226 genes that are robustly associated with diverse leaf traits. This set of genes is enriched for transcriptional regulators and genes involved in protein synthesis and cell wall metabolism. In order to investigate the molecular network context of the candidate gene set, we integrated our data with publicly available functional genomics data and identified a growth regulatory network of 185 genes. Our results illustrate the power of combining in-depth phenotyping with transcriptomics in mapping populations to dissect the genetic control of complex traits and present a set of candidate genes for use in biomass improvement. © 2016 American Society of Plant Biologists. All Rights Reserved.
Baranski, Matthew; Gopikrishna, Gopalapillay; Robinson, Nicholas A.; Katneni, Vinaya Kumar; Shekhar, Mudagandur S.; Shanmugakarthik, Jayakani; Jothivel, Sarangapani; Gopal, Chavali; Ravichandran, Pitchaiyappan; Kent, Matthew; Arnyasi, Mariann; Ponniah, Alphis G.
2014-01-01
Transcriptome sequencing using Illumina RNA-seq was performed on populations of black tiger shrimp from India. Samples were collected from (i) four landing centres around the east coastline (EC) of India, (ii) survivors of a severe WSSV infection during pond culture (SUR) and (iii) the Andaman Islands (AI) in the Bay of Bengal. Equal quantities of purified total RNA from homogenates of hepatopancreas, muscle, nervous tissue, intestinal tract, heart, gonad, gills, pleopod and lymphoid organs were combined to create AI, EC and SUR pools for RNA sequencing. De novo transcriptome assembly resulted in 136,223 contigs (minimum size 100 base pairs, bp) with a total length 61 Mb, an average length of 446 bp and an average coverage of 163× across all pools. Approximately 16% of contigs were annotated with BLAST hit information and gene ontology annotations. A total of 473,620 putative SNPs/indels were identified. An Illumina iSelect genotyping array containing 6,000 SNPs was developed and used to genotype 1024 offspring belonging to seven full-sibling families. A total of 3959 SNPs were mapped to 44 linkage groups. The linkage groups consisted of between 16–129 and 13–130 markers, of length between 139–10.8 and 109.1–10.5 cM and with intervals averaging between 1.2 and 0.9 cM for the female and male maps respectively. The female map was 28% longer than the male map (4060 and 2917 cM respectively) with a 1.6 higher recombination rate observed for female compared to male meioses. This approach has substantially increased expressed sequence and DNA marker resources for tiger shrimp and is a useful resource for QTL mapping and association studies for evolutionarily and commercially important traits. PMID:24465553
Cheng, Benson Yee Hin; Zhi, Jizu; Santana, Alexis; Khan, Sohail; Salinas, Eduardo; Forrest, J. Craig; Zheng, Yueting; Jaggi, Shirin; Leatherwood, Janet
2012-01-01
We applied a custom tiled microarray to examine murine gammaherpesvirus 68 (MHV68) polyadenylated transcript expression in a time course of de novo infection of fibroblast cells and following phorbol ester-mediated reactivation from a latently infected B cell line. During de novo infection, all open reading frames (ORFs) were transcribed and clustered into four major temporal groups that were overlapping yet distinct from clusters based on the phorbol ester-stimulated B cell reactivation time course. High-density transcript analysis at 2-h intervals during de novo infection mapped gene boundaries with a 20-nucleotide resolution, including a previously undefined ORF73 transcript and the MHV68 ORF63 homolog of Kaposi's sarcoma-associated herpesvirus vNLRP1. ORF6 transcript initiation was mapped by tiled array and confirmed by 5′ rapid amplification of cDNA ends. The ∼1.3-kb region upstream of ORF6 was responsive to lytic infection and MHV68 RTA, identifying a novel RTA-responsive promoter. Transcription in intergenic regions consistent with the previously defined expressed genomic regions was detected during both types of productive infection. We conclude that the MHV68 transcriptome is dynamic and distinct during de novo fibroblast infection and upon phorbol ester-stimulated B cell reactivation, highlighting the need to evaluate further transcript structure and the context-dependent molecular events that govern viral gene expression during chronic infection. PMID:22318145
NASA Astrophysics Data System (ADS)
Ikeshima, D.; Yamazaki, D.; Yoshikawa, S.; Kanae, S.
2015-12-01
The specification of worldwide water body distribution is important for discovering hydrological cycle. Global 3-second Water Body Map (G3WBM) is a global scale map, which indicates the distribution of water body in 90m resolutions (http://hydro.iis.u-tokyo.ac.jp/~yamadai/G3WBM/index.html). This dataset was mainly built to identify the width of river channels, which is one of major uncertainties of continental-scale river hydrodynamics models. To survey the true width of the river channel, this water body map distinguish Permanent Water Body from Temporary Water Body, which means separating river channel and flood plain. However, rivers with narrower width, which is a major case in usual river, could not be observed in this map. To overcome this problem, updating the algorithm of G3WBM and enhancing the resolutions to 30m is the goal of this research. Although this 30m-resolution water body map uses similar algorithm as G3WBM, there are many technical issues attributed to relatively high resolutions. Those are such as lack of same high-resolution digital elevation map, or contamination problem of sub-pixel scale object on satellite acquired image, or invisibility of well-vegetated water body such as swamp. To manage those issues, this research used more than 30,000 satellite images of Landsat Global Land Survey (GLS), and lately distributed topography data of Shuttle Rader Topography Mission (SRTM) 1 arc-second (30m) digital elevation map. Also the effect of aerosol, which would scatter the sun reflectance and disturb the acquired result image, was considered. Due to these revises, the global water body distribution was established in more precise resolution.
Xu, Zhifeng; Zhu, Wenyi; Liu, Yanchao; Liu, Xing; Chen, Qiushuang; Peng, Miao; Wang, Xiangzun; Shen, Guangmao; He, Lin
2014-01-01
The carmine spider mite (CSM), Tetranychus cinnabarinus, is an important pest mite in agriculture, because it can develop insecticide resistance easily. To gain valuable gene information and molecular basis for the future insecticide resistance study of CSM, the first transcriptome analysis of CSM was conducted. A total of 45,016 contigs and 25,519 unigenes were generated from the de novo transcriptome assembly, and 15,167 unigenes were annotated via BLAST querying against current databases, including nr, SwissProt, the Clusters of Orthologous Groups (COGs), Kyoto Encyclopedia of Genes and Genomes (KEGG) and Gene Ontology (GO). Aligning the transcript to Tetranychus urticae genome, the 19255 (75.45%) of the transcripts had significant (e-value <10-5) matches to T. urticae DNA genome, 19111 sequences matched to T. urticae proteome with an average protein length coverage of 42.55%. Core Eukaryotic Genes Mapping Approach (CEGMA) analysis identified 435 core eukaryotic genes (CEGs) in the CSM dataset corresponding to 95% coverage. Ten gene categories that relate to insecticide resistance in arthropod were generated from CSM transcriptome, including 53 P450-, 22 GSTs-, 23 CarEs-, 1 AChE-, 7 GluCls-, 9 nAChRs-, 8 GABA receptor-, 1 sodium channel-, 6 ATPase- and 12 Cyt b genes. We developed significant molecular resources for T. cinnabarinus putatively involved in insecticide resistance. The transcriptome assembly analysis will significantly facilitate our study on the mechanism of adapting environmental stress (including insecticide) in CSM at the molecular level, and will be very important for developing new control strategies against this pest mite.
Hsiang, Chien-Yun; Chen, Yueh-Sheng; Ho, Tin-Yun
2009-06-01
Establishment of a comprehensive platform for the assessment of host-biomaterial interaction in vivo is an important issue. Nuclear factor-kappaB (NF-kappaB) is an inducible transcription factor that is activated by numerous stimuli. Therefore, NF-kappaB-dependent luminescent signal in transgenic mice carrying the luciferase genes was used as the guide to monitor the biomaterials-affected organs, and transcriptomic analysis was further applied to evaluate the complex host responses in affected organs in this study. In vivo imaging showed that genipin-cross-linked gelatin conduit (GGC) implantation evoked the strong NF-kappaB activity at 6h in the implanted region, and transcriptomic analysis showed that the expressions of interleukin-6 (IL-6), IL-24, and IL-1 family were up-regulated. A strong luminescent signal was observed in spleen on 14 d, suggesting that GGC implantation might elicit the biological events in spleen. Transcriptomic analysis of spleen showed that 13 Kyoto Encyclopedia of Genes and Genomes pathways belonging to cell cycles, immune responses, and metabolism were significantly altered by GGC implants. Connectivity Map analysis suggested that the gene signatures of GGC were similar to those of compounds that affect lipid or glucose metabolism. GeneSetTest analysis further showed that host responses to GGC implants might be related to diseases states, especially the metabolic and cardiovascular diseases. In conclusion, our data provided a concept of molecular imaging-guided transcriptomic platform for the evaluation and the prediction of host-biomaterial interaction in vivo.
Perigone Lobe Transcriptome Analysis Provides Insights into Rafflesia cantleyi Flower Development.
Lee, Xin-Wei; Mat-Isa, Mohd-Noor; Mohd-Elias, Nur-Atiqah; Aizat-Juhari, Mohd Afiq; Goh, Hoe-Han; Dear, Paul H; Chow, Keng-See; Haji Adam, Jumaat; Mohamed, Rahmah; Firdaus-Raih, Mohd; Wan, Kiew-Lian
2016-01-01
Rafflesia is a biologically enigmatic species that is very rare in occurrence and possesses an extraordinary morphology. This parasitic plant produces a gigantic flower up to one metre in diameter with no leaves, stem or roots. However, little is known about the floral biology of this species especially at the molecular level. In an effort to address this issue, we have generated and characterised the transcriptome of the Rafflesia cantleyi flower, and performed a comparison with the transcriptome of its floral bud to predict genes that are expressed and regulated during flower development. Approximately 40 million sequencing reads were generated and assembled de novo into 18,053 transcripts with an average length of 641 bp. Of these, more than 79% of the transcripts had significant matches to annotated sequences in the public protein database. A total of 11,756 and 7,891 transcripts were assigned to Gene Ontology categories and clusters of orthologous groups respectively. In addition, 6,019 transcripts could be mapped to 129 pathways in Kyoto Encyclopaedia of Genes and Genomes Pathway database. Digital abundance analysis identified 52 transcripts with very high expression in the flower transcriptome of R. cantleyi. Subsequently, analysis of differential expression between developing flower and the floral bud revealed a set of 105 transcripts with potential role in flower development. Our work presents a deep transcriptome resource analysis for the developing flower of R. cantleyi. Genes potentially involved in the growth and development of the R. cantleyi flower were identified and provide insights into biological processes that occur during flower development.
High-Resolution Land Use and Land Cover Mapping
,
1999-01-01
As the Nation?s population grows, quantifying, monitoring, and managing land use becomes increasingly important. The U.S. Geological Survey (USGS) has a long heritage of leadership and innovation in land use and land cover (LULC) mapping that has been the model both nationally and internationally for over 20 years. At present, the USGS is producing high-resolution LULC data for several watershed and urban areas within the United States. This high-resolution LULC mapping is part of an ongoing USGS Land Cover Characterization Program (LCCP). The four components of the LCCP are global (1:2,000,000-scale), national (1:100,000-scale), urban (1:24,000-scale), and special projects (various scales and time periods). Within the urban and special project components, the USGS Rocky Mountain Mapping Center (RMMC) is collecting historical as well as contemporary high-resolution LULC data. RMMC?s high-resolution LULC mapping builds on the heritage and success of previous USGS LULC programs and provides LULC information to meet user requirements.
Estes, Lyndon; Chen, Peng; Debats, Stephanie; Evans, Tom; Ferreira, Stefanus; Kuemmerle, Tobias; Ragazzo, Gabrielle; Sheffield, Justin; Wolf, Adam; Wood, Eric; Caylor, Kelly
2018-01-01
Land cover maps increasingly underlie research into socioeconomic and environmental patterns and processes, including global change. It is known that map errors impact our understanding of these phenomena, but quantifying these impacts is difficult because many areas lack adequate reference data. We used a highly accurate, high-resolution map of South African cropland to assess (1) the magnitude of error in several current generation land cover maps, and (2) how these errors propagate in downstream studies. We first quantified pixel-wise errors in the cropland classes of four widely used land cover maps at resolutions ranging from 1 to 100 km, and then calculated errors in several representative "downstream" (map-based) analyses, including assessments of vegetative carbon stocks, evapotranspiration, crop production, and household food security. We also evaluated maps' spatial accuracy based on how precisely they could be used to locate specific landscape features. We found that cropland maps can have substantial biases and poor accuracy at all resolutions (e.g., at 1 km resolution, up to ∼45% underestimates of cropland (bias) and nearly 50% mean absolute error (MAE, describing accuracy); at 100 km, up to 15% underestimates and nearly 20% MAE). National-scale maps derived from higher-resolution imagery were most accurate, followed by multi-map fusion products. Constraining mapped values to match survey statistics may be effective at minimizing bias (provided the statistics are accurate). Errors in downstream analyses could be substantially amplified or muted, depending on the values ascribed to cropland-adjacent covers (e.g., with forest as adjacent cover, carbon map error was 200%-500% greater than in input cropland maps, but ∼40% less for sparse cover types). The average locational error was 6 km (600%). These findings provide deeper insight into the causes and potential consequences of land cover map error, and suggest several recommendations for land cover map users. © 2017 John Wiley & Sons Ltd.
2010-10-14
High-Resolution Functional Mapping of the Venezuelan Equine Encephalitis Virus Genome by Insertional Mutagenesis and Massively Parallel Sequencing...Venezuelan equine encephalitis virus (VEEV) genome. We initially used a capillary electrophoresis method to gain insight into the role of the VEEV...Smith JM, Schmaljohn CS (2010) High-Resolution Functional Mapping of the Venezuelan Equine Encephalitis Virus Genome by Insertional Mutagenesis and
Human-specific features of spatial gene expression and regulation in eight brain regions.
Xu, Chuan; Li, Qian; Efimova, Olga; He, Liu; Tatsumoto, Shoji; Stepanova, Vita; Oishi, Takao; Udono, Toshifumi; Yamaguchi, Katsushi; Shigenobu, Shuji; Kakita, Akiyoshi; Nawa, Hiroyuki; Khaitovich, Philipp; Go, Yasuhiro
2018-06-13
Molecular maps of the human brain alone do not inform us of the features unique to humans. Yet, the identification of these features is important for understanding both the evolution and nature of human cognition. Here, we approached this question by analyzing gene expression and H3K27ac chromatin modification data collected in eight brain regions of humans, chimpanzees, gorillas, a gibbon and macaques. An analysis of spatial transcriptome trajectories across eight brain regions in four primate species revealed 1,851 genes showing human-specific transcriptome differences in one or multiple brain regions, in contrast to 240 chimpanzee-specific ones. More than half of these human-specific differences represented elevated expression of genes enriched in neuronal and astrocytic markers in the human hippocampus, while the rest were enriched in microglial markers and displayed human-specific expression in several frontal cortical regions and the cerebellum. An analysis of the predicted regulatory interactions driving these differences revealed the role of transcription factors in species-specific transcriptome changes, while epigenetic modifications were linked to spatial expression differences conserved across species. Published by Cold Spring Harbor Laboratory Press.
Transcriptome map of plant mitochondria reveals islands of unexpected transcribed regions.
Fujii, Sota; Toda, Takushi; Kikuchi, Shunsuke; Suzuki, Ryutaro; Yokoyama, Koji; Tsuchida, Hiroko; Yano, Kentaro; Toriyama, Kinya
2011-06-01
Plant mitochondria contain a relatively large amount of genetic information, suggesting that their functional regulation may not be as straightforward as that of metazoans. We used a genomic tiling array to draw a transcriptomic atlas of Oryza sativa japonica (rice) mitochondria, which was predicted to be approximately 490-kb long. Whereas statistical analysis verified the transcription of all previously known functional genes such as the ones related to oxidative phosphorylation, a similar extent of RNA expression was frequently observed in the inter-genic regions where none of the previously annotated genes are located. The newly identified open reading frames (ORFs) predicted in these transcribed inter-genic regions were generally not conserved among flowering plant species, suggesting that these ORFs did not play a role in mitochondrial principal functions. We also identified two partial fragments of retrotransposon sequences as being transcribed in rice mitochondria. The present study indicated the previously unexpected complexity of plant mitochondrial RNA metabolism. Our transcriptomic data (Oryza sativa Mitochondrial rna Expression Server: OsMES) is publicly accessible at [http://bioinf.mind.meiji.ac.jp/cgi-bin/gbrowse/OsMes/#search].
Sollmann, Nico; Hauck, Theresa; Tussis, Lorena; Ille, Sebastian; Maurer, Stefanie; Boeckh-Behrens, Tobias; Ringel, Florian; Meyer, Bernhard; Krieg, Sandro M
2016-10-24
The spatial resolution of repetitive navigated transcranial magnetic stimulation (rTMS) for language mapping is largely unknown. Thus, to determine a minimum spatial resolution of rTMS for language mapping, we evaluated the mapping sessions derived from 19 healthy volunteers for cortical hotspots of no-response errors. Then, the distances between hotspots (stimulation points with a high error rate) and adjacent mapping points (stimulation points with low error rates) were evaluated. Mean distance values of 13.8 ± 6.4 mm (from hotspots to ventral points, range 0.7-30.7 mm), 10.8 ± 4.8 mm (from hotspots to dorsal points, range 2.0-26.5 mm), 16.6 ± 4.8 mm (from hotspots to apical points, range 0.9-27.5 mm), and 13.8 ± 4.3 mm (from hotspots to caudal points, range 2.0-24.2 mm) were measured. According to the results, the minimum spatial resolution of rTMS should principally allow for the identification of a particular gyrus, and according to the literature, it is in good accordance with the spatial resolution of direct cortical stimulation (DCS). Since measurement was performed between hotspots and adjacent mapping points and not on a finer-grained basis, we only refer to a minimum spatial resolution. Furthermore, refinement of our results within the scope of a prospective study combining rTMS and DCS for resolution measurement during language mapping should be the next step.
High-Resolution Geologic Mapping of Martian Terraced Fan Deposits
NASA Astrophysics Data System (ADS)
Wolak, J. M.; Patterson, A. B.; Smith, S. D.; Robbins, N. N.
2018-06-01
This abstract documents our initial progress (year 1) mapping terraced fan features on Mars. Our objective is to investigate the role of fluids during fan formation and produce the first high-resolution geologic map (1:18k) of a terraced fan.
Full-sky, High-resolution Maps of Interstellar Dust
NASA Astrophysics Data System (ADS)
Meisner, Aaron Michael
We present full-sky, high-resolution maps of interstellar dust based on data from the Wide-field Infrared Survey Explorer (WISE) and Planck missions. We describe our custom processing of the entire WISE 12 micron All-Sky imaging data set, and present the resulting 15 arcsecond resolution, full-sky map of diffuse Galactic dust emission, free of compact sources and other contaminating artifacts. Our derived 12 micron dust map offers angular resolution far superior to that of all other existing full-sky, infrared dust emission maps, revealing a wealth of small-scale filamentary structure. We also apply the Finkbeiner et al. (1999) two-component thermal dust emission model to the Planck HFI maps. We derive full-sky 6.1 arcminute resolution maps of dust optical depth and temperature by fitting this two-component model to Planck 217-857 GHz along with DIRBE/IRAS 100 micron data. In doing so, we obtain the first ever full-sky 100-3000 GHz Planck-based thermal dust emission model, as well as a dust temperature correction with ~10 times enhanced angular resolution relative to DIRBE-based temperature maps. Analyzing the joint Planck/DIRBE dust spectrum, we show that two-component models provide a better fit to the 100-3000 GHz emission than do single-MBB models, though by a lesser margin than found by Finkbeiner et al. (1999) based on FIRAS and DIRBE. We find that, in diffuse sky regions, our two-component 100-217 GHz predictions are on average accurate to within 2.2%, while extrapolating the Planck Collaboration (2013) single-MBB model systematically underpredicts emission by 18.8% at 100 GHz, 12.6% at 143 GHz and 7.9% at 217 GHz. We calibrate our two-component optical depth to reddening, and compare with reddening estimates based on stellar spectra. We find the dominant systematic problems in our temperature/reddening maps to be zodiacal light on large angular scales and the cosmic infrared background anisotropy on small angular scales. Future work will focus on combining our WISE 12 micron dust map and Planck dust model to create a next-generation, full-sky dust extinction map with angular resolution several times better than Schlegel et al. (1998).
ABMapper: a suffix array-based tool for multi-location searching and splice-junction mapping.
Lou, Shao-Ke; Ni, Bing; Lo, Leung-Yau; Tsui, Stephen Kwok-Wing; Chan, Ting-Fung; Leung, Kwong-Sak
2011-02-01
Sequencing reads generated by RNA-sequencing (RNA-seq) must first be mapped back to the genome through alignment before they can be further analyzed. Current fast and memory-saving short-read mappers could give us a quick view of the transcriptome. However, they are neither designed for reads that span across splice junctions nor for repetitive reads, which can be mapped to multiple locations in the genome (multi-reads). Here, we describe a new software package: ABMapper, which is specifically designed for exploring all putative locations of reads that are mapped to splice junctions or repetitive in nature. The software is freely available at: http://abmapper.sourceforge.net/. The software is written in C++ and PERL. It runs on all major platforms and operating systems including Windows, Mac OS X and LINUX.
Fish connectivity mapping intermediate data files and outputs
RLWrankedLists.tar.gz:These lists linked to various chemical treatment conditions serve as the target collection of Cmap. Probes of the entire microarray are sorted based on their log fold changes over control conditions. RLWsignatures2015.tar.gz: These signatures linked to various chemical treatment conditions serve as queries in Cmap.This dataset is associated with the following publication:Wang , R., A. Biales , N. Garcia-Reyero, E. Perkins, D. Villeneuve, G. Ankley, and D. Bencic. Fish Connectivity Mapping: Linking Chemical Stressors by Their MOA-Driven Transcriptomic Profiles. BMC Genomics. BioMed Central Ltd, London, UK, 17(84): 1-20, (2016).
NASA Technical Reports Server (NTRS)
Hammerling, Dorit M.; Michalak, Anna M.; Kawa, S. Randolph
2012-01-01
Satellite observations of CO2 offer new opportunities to improve our understanding of the global carbon cycle. Using such observations to infer global maps of atmospheric CO2 and their associated uncertainties can provide key information about the distribution and dynamic behavior of CO2, through comparison to atmospheric CO2 distributions predicted from biospheric, oceanic, or fossil fuel flux emissions estimates coupled with atmospheric transport models. Ideally, these maps should be at temporal resolutions that are short enough to represent and capture the synoptic dynamics of atmospheric CO2. This study presents a geostatistical method that accomplishes this goal. The method can extract information about the spatial covariance structure of the CO2 field from the available CO2 retrievals, yields full coverage (Level 3) maps at high spatial resolutions, and provides estimates of the uncertainties associated with these maps. The method does not require information about CO2 fluxes or atmospheric transport, such that the Level 3 maps are informed entirely by available retrievals. The approach is assessed by investigating its performance using synthetic OCO-2 data generated from the PCTM/ GEOS-4/CASA-GFED model, for time periods ranging from 1 to 16 days and a target spatial resolution of 1deg latitude x 1.25deg longitude. Results show that global CO2 fields from OCO-2 observations can be predicted well at surprisingly high temporal resolutions. Even one-day Level 3 maps reproduce the large-scale features of the atmospheric CO2 distribution, and yield realistic uncertainty bounds. Temporal resolutions of two to four days result in the best performance for a wide range of investigated scenarios, providing maps at an order of magnitude higher temporal resolution relative to the monthly or seasonal Level 3 maps typically reported in the literature.
Malenfant, René M; Coltman, David W; Davis, Corey S
2015-05-01
Single-nucleotide polymorphisms (SNPs) offer numerous advantages over anonymous markers such as microsatellites, including improved estimation of population parameters, finer-scale resolution of population structure and more precise genomic dissection of quantitative traits. However, many SNPs are needed to equal the resolution of a single microsatellite, and reliable large-scale genotyping of SNPs remains a challenge in nonmodel species. Here, we document the creation of a 9K Illumina Infinium BeadChip for polar bears (Ursus maritimus), which will be used to investigate: (i) the fine-scale population structure among Canadian polar bears and (ii) the genomic architecture of phenotypic traits in the Western Hudson Bay subpopulation. To this end, we used restriction-site associated DNA (RAD) sequencing from 38 bears across their circumpolar range, as well as blood/fat transcriptome sequencing of 10 individuals from Western Hudson Bay. Six-thousand RAD SNPs and 3000 transcriptomic SNPs were selected for the chip, based primarily on genomic spacing and gene function respectively. Of the 9000 SNPs ordered from Illumina, 8042 were successfully printed, and - after genotyping 1450 polar bears - 5441 of these SNPs were found to be well clustered and polymorphic. Using this array, we show rapid linkage disequilibrium decay among polar bears, we demonstrate that in a subsample of 78 individuals, our SNPs detect known genetic structure more clearly than 24 microsatellites genotyped for the same individuals and that these results are not driven by the SNP ascertainment scheme. Here, we present one of the first large-scale genotyping resources designed for a threatened species. © 2014 John Wiley & Sons Ltd.
NASA Technical Reports Server (NTRS)
Hall, Dorothy K.; Riggs, George A.; Salomonson, Vincent V.; Scharfen, Greg R.
2000-01-01
Following the 1999 launch of the Earth Observing System (EOS) Moderate Resolution Imaging Spectroradiometer (MODIS), the capability exists to produce global snow-cover maps on a daily basis at 500-m resolution. Eight-day composite snow-cover maps will also be available. MODIS snow-cover products are produced at Goddard Space Flight Center and archived and distributed by the National Snow and Ice Data Center (NSIDC) in Boulder, Colorado. The products are available in both orbital and gridded formats. An online search and order tool and user-services staff will be available at NSIDC to assist users with the snow products. The snow maps are available at a spatial resolution of 500 m, and 1/4 degree x 1/4 degree spatial resolution, and provide information on sub-pixel (fractional) snow cover. Pre-launch validation work has shown that the MODIS snow-mapping algorithms perform best under conditions of continuous snow cover in low vegetation areas, but can also map snow cover in dense forests. Post-launch validation activities will be performed using field and aircraft measurements from a February 2000 validation mission, as well as from existing satellite-derived snow-cover maps from NOAA and Landsat-7 Enhanced Thematic Mapper Plus (ETM+).
A Synopsis of Global Mapping of Freshwater Habitats and Biodiversity: Implications for Conservation
DOE Office of Scientific and Technical Information (OSTI.GOV)
McManamay, Ryan A.; Griffiths, Natalie A.; DeRolph, Christopher R.
Accurately mapping freshwater habitats and biodiversity at high-resolutions across the globe is essential for assessing the vulnerability and threats to freshwater organisms and prioritizing conservation efforts. Since the 2000s, extensive efforts have been devoted to mapping global freshwater habitats (rivers, lakes, and wetlands), the spatial representation of which has changed dramatically over time with new geospatial data products and improved remote sensing technologies. Some of these mapping efforts, however, are still coarse representations of actual conditions. Likewise, the resolution and scope of global freshwater biodiversity compilation efforts have also increased, but are yet to mirror the spatial resolution and fidelitymore » of mapped freshwater environments. In our synopsis, we find that efforts to map freshwater habitats have been conducted independently of those for freshwater biodiversity; subsequently, there is little congruence in the spatial representation and resolution of the two efforts. We suggest that global species distribution models are needed to fill this information gap; however, limiting data on habitat characteristics at scales that complement freshwater habitats has prohibited global high-resolution biogeography efforts. Emerging research trends, such as mapping habitat alteration in freshwater ecosystems and trait biogeography, show great promise in mechanistically linking global anthropogenic stressors to freshwater biodiversity decline and extinction risk.« less
Expression Profiling Smackdown: Human Transcriptome Array HTA 2.0 vs. RNA-Seq
Palermo, Meghann; Driscoll, Heather; Tighe, Scott; Dragon, Julie; Bond, Jeff; Shukla, Arti; Vangala, Mahesh; Vincent, James; Hunter, Tim
2014-01-01
The advent of both microarray and massively parallel sequencing have revolutionized high-throughput analysis of the human transcriptome. Due to limitations in microarray technology, detecting and quantifying coding transcript isoforms, in addition to non-coding transcripts, has been challenging. As a result, RNA-Seq has been the preferred method for characterizing the full human transcriptome, until now. A new high-resolution array from Affymetrix, GeneChip Human Transcriptome Array 2.0 (HTA 2.0), has been designed to interrogate all transcript isoforms in the human transcriptome with >6 million probes targeting coding transcripts, exon-exon splice junctions, and non-coding transcripts. Here we compare expression results from GeneChip HTA 2.0 and RNA-Seq data using identical RNA extractions from three samples each of healthy human mesothelial cells in culture, LP9-C1, and healthy mesothelial cells treated with asbestos, LP9-A1. For GeneChip HTA 2.0 sample preparation, we chose to compare two target preparation methods, NuGEN Ovation Pico WTA V2 with the Encore Biotin Module versus Affymetrix's GeneChip WT PLUS with the WT Terminal Labeling Kit, on identical RNA extractions from both untreated and treated samples. These same RNA extractions were used for the RNA-Seq library preparation. All analyses were performed in Partek Genomics Suite 6.6. Expression profiles for control and asbestos-treated mesothelial cells prepared with NuGEN versus Affymetrix target preparation methods (GeneChip HTA 2.0) are compared to each other as well as to RNA-Seq results.
Automatic public access to documents and maps stored on and internal secure system.
NASA Astrophysics Data System (ADS)
Trench, James; Carter, Mary
2013-04-01
The Geological Survey of Ireland operates a Document Management System for providing documents and maps stored internally in high resolution and in a high level secure environment, to an external service where the documents are automatically presented in a lower resolution to members of the public. Security is devised through roles and Individual Users where role level and folder level can be set. The application is an electronic document/data management (EDM) system which has a Geographical Information System (GIS) component integrated to allow users to query an interactive map of Ireland for data that relates to a particular area of interest. The data stored in the database consists of Bedrock Field Sheets, Bedrock Notebooks, Bedrock Maps, Geophysical Surveys, Geotechnical Maps & Reports, Groundwater, GSI Publications, Marine, Mine Records, Mineral Localities, Open File, Quaternary and Unpublished Reports. The Konfig application Tool is both an internal and public facing application. It acts as a tool for high resolution data entry which are stored in a high resolution vault. The public facing application is a mirror of the internal application and differs only in that the application furnishes high resolution data into low resolution format which is stored in a low resolution vault thus, making the data web friendly to the end user for download.
High resolution optical DNA mapping
NASA Astrophysics Data System (ADS)
Baday, Murat
Many types of diseases including cancer and autism are associated with copy-number variations in the genome. Most of these variations could not be identified with existing sequencing and optical DNA mapping methods. We have developed Multi-color Super-resolution technique, with potential for high throughput and low cost, which can allow us to recognize more of these variations. Our technique has made 10--fold improvement in the resolution of optical DNA mapping. Using a 180 kb BAC clone as a model system, we resolved dense patterns from 108 fluorescent labels of two different colors representing two different sequence-motifs. Overall, a detailed DNA map with 100 bp resolution was achieved, which has the potential to reveal detailed information about genetic variance and to facilitate medical diagnosis of genetic disease.
Conrads, Kai H.; Roth, Siegfried; Lynch, Jeremy A.
2016-01-01
Despite recent efforts to sample broadly across metazoan and insect diversity, current sequence resources in the Coleoptera do not adequately describe the diversity of the clade. Here we present deep, staged transcriptomic data for two coleopteran species, Atrachya menetriesi (Faldermann 1835) and Callosobruchus maculatus (Fabricius 1775). Our sampling covered key stages in ovary and early embryonic development in each species. We utilized this data to build combined assemblies for each species which were then analysed in detail. The combined A. menetriesi assembly consists of 228,096 contigs with an N50 of 1,598 bp, while the combined C. maculatus assembly consists of 128,837 contigs with an N50 of 2,263 bp. For these assemblies, 34.6% and 32.4% of contigs were identified using Blast2GO, and 97% and 98.3% of the BUSCO set of metazoan orthologs were present, respectively. We also carried out manual annotation of developmental signalling pathways and found that nearly all expected genes were present in each transcriptome. Our analyses show that both transcriptomes are of high quality. Lastly, we performed read mapping utilising our timed, stage specific RNA samples to identify differentially expressed contigs. The resources presented here will provide a firm basis for a variety of experimentation, both in developmental biology and in comparative genomic studies. PMID:27907180
De Novo Transcriptome Analysis of Allium cepa L. (Onion) Bulb to Identify Allergens and Epitopes.
Rajkumar, Hemalatha; Ramagoni, Ramesh Kumar; Anchoju, Vijayendra Chary; Vankudavath, Raju Naik; Syed, Arshi Uz Zaman
2015-01-01
Allium cepa (onion) is a diploid plant with one of the largest nuclear genomes among all diploids. Onion is an example of an under-researched crop which has a complex heterozygous genome. There are no allergenic proteins and genomic data available for onions. This study was conducted to establish a transcriptome catalogue of onion bulb that will enable us to study onion related genes involved in medicinal use and allergies. Transcriptome dataset generated from onion bulb using the Illumina HiSeq 2000 technology showed a total of 99,074,309 high quality raw reads (~20 Gb). Based on sequence homology onion genes were categorized into 49 different functional groups. Most of the genes however, were classified under 'unknown' in all three gene ontology categories. Of the categorized genes, 61.2% showed metabolic functions followed by cellular components such as binding, cellular processes; catalytic activity and cell part. With BLASTx top hit analysis, a total of 2,511 homologous allergenic sequences were found, which had 37-100% similarity with 46 different types of allergens existing in the database. From the 46 contigs or allergens, 521 B-cell linear epitopes were identified using BepiPred linear epitope prediction tool. This is the first comprehensive insight into the transcriptome of onion bulb tissue using the NGS technology, which can be used to map IgE epitopes and prediction of structures and functions of various proteins.
Wenli Huang; Anu Swatantran; Kristofer Johnson; Laura Duncanson; Hao Tang; Jarlath O' Neil Dunne; George Hurtt; Ralph Dubayah
2015-01-01
Continental-scale aboveground biomass maps are increasingly available, but their estimates vary widely, particularly at high resolution. A comprehensive understanding of map discrepancies is required to improve their effectiveness in carbon accounting and local decision-making. To this end, we compare four continental-scale maps with a recent high-resolution lidar-...
Remotely sensed high resolution irrigated area mapping in India for 2000 to 2015
Ambika, Anukesh Krishnankutty; Wardlow, Brian; Mishra, Vimal
2016-01-01
India is among the countries that uses a significant fraction of available water for irrigation. Irrigated area in India has increased substantially after the Green revolution and both surface and groundwater have been extensively used. Under warming climate projections, irrigation frequency may increase leading to increased irrigation water demands. Water resources planning and management in agriculture need spatially-explicit irrigated area information for different crops and different crop growing seasons. However, annual, high-resolution irrigated area maps for India for an extended historical record that can be used for water resources planning and management are unavailable. Using 250 m normalized difference vegetation index (NDVI) data from Moderate Resolution Imaging Spectroradiometer (MODIS) and 56 m land use/land cover data, high-resolution irrigated area maps are developed for all the agroecological zones in India for the period of 2000–2015. The irrigated area maps were evaluated using the agricultural statistics data from ground surveys and were compared with the previously developed irrigation maps. High resolution (250 m) irrigated area maps showed satisfactory accuracy (R2=0.95) and can be used to understand interannual variability in irrigated area at various spatial scales. PMID:27996974
Chen, Li M; Turner, Gregory H; Friedman, Robert M; Zhang, Na; Gore, John C; Roe, Anna W; Avison, Malcolm J
2007-08-22
Although blood oxygenation level-dependent (BOLD) functional magnetic resonance imaging (fMRI) has been widely used to explore human brain function, questions remain regarding the ultimate spatial resolution of positive BOLD fMRI, and indeed the extent to which functional maps revealed by positive BOLD correlate spatially with maps obtained with other high-spatial-resolution mapping techniques commonly used in animals, such as optical imaging of intrinsic signal (OIS) and single-unit electrophysiology. Here, we demonstrate that the positive BOLD signal at 9.4T can reveal the fine topography of individual fingerpads in single-condition activation maps in nonhuman primates. These digit maps are similar to maps obtained from the same animal using intrinsic optical imaging. Furthermore, BOLD fMRI reliably resolved submillimeter spatial shifts in activation in area 3b previously identified with OIS (Chen et al., 2003) as neural correlates of the "funneling illusion." These data demonstrate that at high field, high-spatial-resolution topographic maps can be achieved using the positive BOLD signal, weakening previous notions regarding the spatial specificity of the positive BOLD signal.
Dumas, Marc-Emmanuel; Domange, Céline; Calderari, Sophie; Martínez, Andrea Rodríguez; Ayala, Rafael; Wilder, Steven P; Suárez-Zamorano, Nicolas; Collins, Stephan C; Wallis, Robert H; Gu, Quan; Wang, Yulan; Hue, Christophe; Otto, Georg W; Argoud, Karène; Navratil, Vincent; Mitchell, Steve C; Lindon, John C; Holmes, Elaine; Cazier, Jean-Baptiste; Nicholson, Jeremy K; Gauguier, Dominique
2016-09-30
The genetic regulation of metabolic phenotypes (i.e., metabotypes) in type 2 diabetes mellitus occurs through complex organ-specific cellular mechanisms and networks contributing to impaired insulin secretion and insulin resistance. Genome-wide gene expression profiling systems can dissect the genetic contributions to metabolome and transcriptome regulations. The integrative analysis of multiple gene expression traits and metabolic phenotypes (i.e., metabotypes) together with their underlying genetic regulation remains a challenge. Here, we introduce a systems genetics approach based on the topological analysis of a combined molecular network made of genes and metabolites identified through expression and metabotype quantitative trait locus mapping (i.e., eQTL and mQTL) to prioritise biological characterisation of candidate genes and traits. We used systematic metabotyping by 1 H NMR spectroscopy and genome-wide gene expression in white adipose tissue to map molecular phenotypes to genomic blocks associated with obesity and insulin secretion in a series of rat congenic strains derived from spontaneously diabetic Goto-Kakizaki (GK) and normoglycemic Brown-Norway (BN) rats. We implemented a network biology strategy approach to visualize the shortest paths between metabolites and genes significantly associated with each genomic block. Despite strong genomic similarities (95-99 %) among congenics, each strain exhibited specific patterns of gene expression and metabotypes, reflecting the metabolic consequences of series of linked genetic polymorphisms in the congenic intervals. We subsequently used the congenic panel to map quantitative trait loci underlying specific mQTLs and genome-wide eQTLs. Variation in key metabolites like glucose, succinate, lactate, or 3-hydroxybutyrate and second messenger precursors like inositol was associated with several independent genomic intervals, indicating functional redundancy in these regions. To navigate through the complexity of these association networks we mapped candidate genes and metabolites onto metabolic pathways and implemented a shortest path strategy to highlight potential mechanistic links between metabolites and transcripts at colocalized mQTLs and eQTLs. Minimizing the shortest path length drove prioritization of biological validations by gene silencing. These results underline the importance of network-based integration of multilevel systems genetics datasets to improve understanding of the genetic architecture of metabotype and transcriptomic regulation and to characterize novel functional roles for genes determining tissue-specific metabolism.
Plessy, Charles; Desbois, Linda; Fujii, Teruo; Carninci, Piero
2013-02-01
Tissues contain complex populations of cells. Like countries, which are comprised of mixed populations of people, tissues are not homogeneous. Gene expression studies that analyze entire populations of cells from tissues as a mixture are blind to this diversity. Thus, critical information is lost when studying samples rich in specialized but diverse cells such as tumors, iPS colonies, or brain tissue. High throughput methods are needed to address, model and understand the constitutive and stochastic differences between individual cells. Here, we describe microfluidics technologies that utilize a combination of molecular biology and miniaturized labs on chips to study gene expression at the single cell level. We discuss how the characterization of the transcriptome of each cell in a sample will open a new field in gene expression analysis, population transcriptomics, that will change the academic and biomedical analysis of complex samples by defining them as quantified populations of single cells. Copyright © 2013 WILEY Periodicals, Inc.
Shiroguchi, Katsuyuki; Jia, Tony Z.; Sims, Peter A.; Xie, X. Sunney
2012-01-01
RNA sequencing (RNA-Seq) is a powerful tool for transcriptome profiling, but is hampered by sequence-dependent bias and inaccuracy at low copy numbers intrinsic to exponential PCR amplification. We developed a simple strategy for mitigating these complications, allowing truly digital RNA-Seq. Following reverse transcription, a large set of barcode sequences is added in excess, and nearly every cDNA molecule is uniquely labeled by random attachment of barcode sequences to both ends. After PCR, we applied paired-end deep sequencing to read the two barcodes and cDNA sequences. Rather than counting the number of reads, RNA abundance is measured based on the number of unique barcode sequences observed for a given cDNA sequence. We optimized the barcodes to be unambiguously identifiable, even in the presence of multiple sequencing errors. This method allows counting with single-copy resolution despite sequence-dependent bias and PCR-amplification noise, and is analogous to digital PCR but amendable to quantifying a whole transcriptome. We demonstrated transcriptome profiling of Escherichia coli with more accurate and reproducible quantification than conventional RNA-Seq. PMID:22232676
Olarerin-George, Anthony O; Jaffrey, Samie R
2017-05-15
An increasing number of studies are mapping protein binding and nucleotide modifications sites throughout the transcriptome. Often, these sites cluster in certain regions of the transcript, giving clues to their function. Hence, it is informative to summarize where in the transcript these sites occur. A metagene is a simple and effective tool for visualizing the distribution of sites along a simplified transcript model. In this work, we introduce MetaPlotR, a Perl/R pipeline for creating metagene plots. The code and associated tutorial are available at https://github.com/olarerin/metaPlotR . srj2003@med.cornell.edu. © The Author 2017. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com
Zhao, M; Wang, T; Adamson, K J; Storey, K B; Cummins, S F
2016-02-08
The land snail Theba pisana is native to the Mediterranean region but has become one of the most abundant invasive species worldwide. Here, we present three transcriptomes of this agriculture pest derived from three tissues: the central nervous system, hepatopancreas (digestive gland), and foot muscle. Sequencing of the three tissues produced 339,479,092 high quality reads and a global de novo assembly generated a total of 250,848 unique transcripts (unigenes). BLAST analysis mapped 52,590 unigenes to NCBI non-redundant protein databases and further functional analysis annotated 21,849 unigenes with gene ontology. We report that T. pisana transcripts have representatives in all functional classes and a comparison of differentially expressed transcripts amongst all three tissues demonstrates enormous differences in their potential metabolic activities. The genes differentially expressed include those with sequence similarity to those genes associated with multiple bacterial diseases and neurological diseases. To provide a valuable resource that will assist functional genomics study, we have implemented a user-friendly web interface, ThebaDB (http://thebadb.bioinfo-minzhao.org/). This online database allows for complex text queries, sequence searches, and data browsing by enriched functional terms and KEGG mapping.
Rowland, Lisa J; Alkharouf, Nadim; Darwish, Omar; Ogden, Elizabeth L; Polashock, James J; Bassil, Nahla V; Main, Dorrie
2012-04-02
There has been increased consumption of blueberries in recent years fueled in part because of their many recognized health benefits. Blueberry fruit is very high in anthocyanins, which have been linked to improved night vision, prevention of macular degeneration, anti-cancer activity, and reduced risk of heart disease. Very few genomic resources have been available for blueberry, however. Further development of genomic resources like expressed sequence tags (ESTs), molecular markers, and genetic linkage maps could lead to more rapid genetic improvement. Marker-assisted selection could be used to combine traits for climatic adaptation with fruit and nutritional quality traits. Efforts to sequence the transcriptome of the commercial highbush blueberry (Vaccinium corymbosum) cultivar Bluecrop and use the sequences to identify genes associated with cold acclimation and fruit development and develop SSR markers for mapping studies are presented here. Transcriptome sequences were generated from blueberry fruit at different stages of development, flower buds at different stages of cold acclimation, and leaves by next-generation Roche 454 sequencing. Over 600,000 reads were assembled into approximately 15,000 contigs and 124,000 singletons. The assembled sequences were annotated and functionally mapped to Gene Ontology (GO) terms. Frequency of the most abundant sequences in each of the libraries was compared across all libraries to identify genes that are potentially differentially expressed during cold acclimation and fruit development. Real-time PCR was performed to confirm their differential expression patterns. Overall, 14 out of 17 of the genes examined had differential expression patterns similar to what was predicted from their reads alone. The assembled sequences were also mined for SSRs. From these sequences, 15,886 blueberry EST-SSR loci were identified. Primers were designed from 7,705 of the SSR-containing sequences with adequate flanking sequence. One hundred primer pairs were tested for amplification and polymorphism among parents of two blueberry populations currently being used for genetic linkage map construction. The tetraploid mapping population was based on a cross between the highbush cultivars Draper and Jewel (V. darrowii is also in the background of 'Jewel'). The diploid mapping population was based on a cross between an F1 hybrid of V. darrowii and diploid V. corymbosum and another diploid V. corymbosum. The overall amplification rate of the SSR primers was 68% and the polymorphism rate was 43%. These results indicate that this large collection of 454 ESTs will be a valuable resource for identifying genes that are potentially differentially expressed and play important roles in flower bud development, cold acclimation, chilling unit accumulation, and fruit development in blueberry and related species. In addition, the ESTs have already proved useful for the development of SSR and EST-PCR markers, and are currently being used for construction of genetic linkage maps in blueberry.
2012-01-01
Background There has been increased consumption of blueberries in recent years fueled in part because of their many recognized health benefits. Blueberry fruit is very high in anthocyanins, which have been linked to improved night vision, prevention of macular degeneration, anti-cancer activity, and reduced risk of heart disease. Very few genomic resources have been available for blueberry, however. Further development of genomic resources like expressed sequence tags (ESTs), molecular markers, and genetic linkage maps could lead to more rapid genetic improvement. Marker-assisted selection could be used to combine traits for climatic adaptation with fruit and nutritional quality traits. Results Efforts to sequence the transcriptome of the commercial highbush blueberry (Vaccinium corymbosum) cultivar Bluecrop and use the sequences to identify genes associated with cold acclimation and fruit development and develop SSR markers for mapping studies are presented here. Transcriptome sequences were generated from blueberry fruit at different stages of development, flower buds at different stages of cold acclimation, and leaves by next-generation Roche 454 sequencing. Over 600,000 reads were assembled into approximately 15,000 contigs and 124,000 singletons. The assembled sequences were annotated and functionally mapped to Gene Ontology (GO) terms. Frequency of the most abundant sequences in each of the libraries was compared across all libraries to identify genes that are potentially differentially expressed during cold acclimation and fruit development. Real-time PCR was performed to confirm their differential expression patterns. Overall, 14 out of 17 of the genes examined had differential expression patterns similar to what was predicted from their reads alone. The assembled sequences were also mined for SSRs. From these sequences, 15,886 blueberry EST-SSR loci were identified. Primers were designed from 7,705 of the SSR-containing sequences with adequate flanking sequence. One hundred primer pairs were tested for amplification and polymorphism among parents of two blueberry populations currently being used for genetic linkage map construction. The tetraploid mapping population was based on a cross between the highbush cultivars Draper and Jewel (V. darrowii is also in the background of 'Jewel'). The diploid mapping population was based on a cross between an F1 hybrid of V. darrowii and diploid V. corymbosum and another diploid V. corymbosum. The overall amplification rate of the SSR primers was 68% and the polymorphism rate was 43%. Conclusions These results indicate that this large collection of 454 ESTs will be a valuable resource for identifying genes that are potentially differentially expressed and play important roles in flower bud development, cold acclimation, chilling unit accumulation, and fruit development in blueberry and related species. In addition, the ESTs have already proved useful for the development of SSR and EST-PCR markers, and are currently being used for construction of genetic linkage maps in blueberry. PMID:22471859
HetMappsS: Heterozygous mapping strategy for high resolution Genotyping-by-Sequencing Markers
USDA-ARS?s Scientific Manuscript database
Reduced representation genotyping approaches, such as genotyping-by-sequencing (GBS), provide opportunities to generate high-resolution genetic maps at a low per-sample cost. However, missing data and non-uniform sequence coverage can complicate map creation in highly heterozygous species. To facili...
Time-efficient high-resolution whole-brain three-dimensional macromolecular proton fraction mapping
Yarnykh, Vasily L.
2015-01-01
Purpose Macromolecular proton fraction (MPF) mapping is a quantitative MRI method that reconstructs parametric maps of a relative amount of macromolecular protons causing the magnetization transfer (MT) effect and provides a biomarker of myelination in neural tissues. This study aimed to develop a high-resolution whole-brain MPF mapping technique utilizing a minimal possible number of source images for scan time reduction. Methods The described technique is based on replacement of an actually acquired reference image without MT saturation by a synthetic one reconstructed from R1 and proton density maps, thus requiring only three source images. This approach enabled whole-brain three-dimensional MPF mapping with isotropic 1.25×1.25×1.25 mm3 voxel size and scan time of 20 minutes. The synthetic reference method was validated against standard MPF mapping with acquired reference images based on data from 8 healthy subjects. Results Mean MPF values in segmented white and gray matter appeared in close agreement with no significant bias and small within-subject coefficients of variation (<2%). High-resolution MPF maps demonstrated sharp white-gray matter contrast and clear visualization of anatomical details including gray matter structures with high iron content. Conclusions Synthetic reference method improves resolution of MPF mapping and combines accurate MPF measurements with unique neuroanatomical contrast features. PMID:26102097
Megahertz-resolution programmable microwave shaper.
Li, Jilong; Dai, Yitang; Yin, Feifei; Li, Wei; Li, Ming; Chen, Hongwei; Xu, Kun
2018-04-15
A novel microwave shaper is proposed and demonstrated, of which the microwave spectral transfer function could be fully programmable with high resolution. We achieve this by bandwidth-compressed mapping a programmable optical wave-shaper, which has a lower frequency resolution of tens of gigahertz, to a microwave one with resolution of tens of megahertz. This is based on a novel technology of "bandwidth scaling," which employs bandwidth-stretched electronic-to-optical conversion and bandwidth-compressed optical-to-electronic conversion. We demonstrate the high resolution and full reconfigurability experimentally. Furthermore, we show the group delay variation could be greatly enlarged after mapping; this is then verified by the experiment with an enlargement of 194 times. The resolution improvement and group delay magnification significantly distinguish our proposal from previous optics-to-microwave spectrum mapping.
NASA Astrophysics Data System (ADS)
Van Gordon, M.; Van Gordon, S.; Min, A.; Sullivan, J.; Weiner, Z.; Tappan, G. G.
2017-12-01
Using support vector machine (SVM) learning and high-accuracy hand-classified maps, we have developed a publicly available land cover classification tool for the West African Sahel. Our classifier produces high-resolution and regionally calibrated land cover maps for the Sahel, representing a significant contribution to the data available for this region. Global land cover products are unreliable for the Sahel, and accurate land cover data for the region are sparse. To address this gap, the U.S. Geological Survey and the Regional Center for Agriculture, Hydrology and Meteorology (AGRHYMET) in Niger produced high-quality land cover maps for the region via hand-classification of Landsat images. This method produces highly accurate maps, but the time and labor required constrain the spatial and temporal resolution of the data products. By using these hand-classified maps alongside SVM techniques, we successfully increase the resolution of the land cover maps by 1-2 orders of magnitude, from 2km-decadal resolution to 30m-annual resolution. These high-resolution regionally calibrated land cover datasets, along with the classifier we developed to produce them, lay the foundation for major advances in studies of land surface processes in the region. These datasets will provide more accurate inputs for food security modeling, hydrologic modeling, analyses of land cover change and climate change adaptation efforts. The land cover classification tool we have developed will be publicly available for use in creating additional West Africa land cover datasets with future remote sensing data and can be adapted for use in other parts of the world.
Li, Yun; Liu, Shikai; Qin, Zhenkui; Waldbieser, Geoff; Wang, Ruijia; Sun, Luyang; Bao, Lisui; Danzmann, Roy G.; Dunham, Rex; Liu, Zhanjiang
2015-01-01
Construction of genetic linkage map is essential for genetic and genomic studies. Recent advances in sequencing and genotyping technologies made it possible to generate high-density and high-resolution genetic linkage maps, especially for the organisms lacking extensive genomic resources. In the present work, we constructed a high-density and high-resolution genetic map for channel catfish with three large resource families genotyped using the catfish 250K single-nucleotide polymorphism (SNP) array. A total of 54,342 SNPs were placed on the linkage map, which to our knowledge had the highest marker density among aquaculture species. The estimated genetic size was 3,505.4 cM with a resolution of 0.22 cM for sex-averaged genetic map. The sex-specific linkage maps spanned a total of 4,495.1 cM in females and 2,593.7 cM in males, presenting a ratio of 1.7 : 1 between female and male in recombination fraction. After integration with the previously established physical map, over 87% of physical map contigs were anchored to the linkage groups that covered a physical length of 867 Mb, accounting for ∼90% of the catfish genome. The integrated map provides a valuable tool for validating and improving the catfish whole-genome assembly and facilitates fine-scale QTL mapping and positional cloning of genes responsible for economically important traits. PMID:25428894
Chen, X L; Lui, E Y; Ip, Y Kwong; Lam, S H
2018-06-21
To obtain transcriptomic insights into branchial responses to salinity challenge in Anabas testudineus, this study employed RNA sequencing (RNA-Seq) to analyse the gill transcriptome of A. testudineus exposed to seawater (SW) for 6 days compared with the freshwater (FW) control group. A combined FW and SW gill transcriptome was de novo assembled from 169.9 million 101 bp paired-end reads. In silico validation employing 17 A. testudineus Sanger full-length coding sequences showed that 15/17 of them had greater than 80% of their sequences aligned to the de novo assembled contigs where 5/17 had their full-length (100%) aligned and 9/17 had greater than 90% of their sequences aligned. The combined FW and SW gill transcriptome was mapped to 13780 unique human identifiers at E-value < 1.0E-20 while 952 and 886 identifiers were determined as up and down-regulated by 1.5 fold, respectively, in the gills of A. testudineus in SW when compared with FW. These genes were found to be associated with at least 23 biological processes. A larger proportion of genes encoding enzymes and transporters associated with molecular transport, energy production, metabolisms were up-regulated, while a larger proportion of genes encoding transmembrane receptors, G-protein coupled receptors, kinases and transcription regulators associated with cell cycle, growth, development, signalling, morphology and gene expression were relatively lower in the gills of A. testudineus in SW when compared with FW. High correlation (R = 0.99) was observed between RNA-Seq data and real-time quantitative PCR validation for 13 selected genes. The transcriptomic sequence information will facilitate development of molecular resources and tools while the findings will provide insights for future studies into branchial iono-osmoregulation and related cellular processes in A. testudineus. This article is protected by copyright. All rights reserved. This article is protected by copyright. All rights reserved.
Chen, Xin; Zhang, Jin; Liu, Qingzhong; Guo, Wei; Zhao, Tiantian; Ma, Qinghua; Wang, Guixi
2014-01-01
The genus Corylus is an important woody species in Northeast China. Its products, hazelnuts, constitute one of the most important raw materials for the pastry and chocolate industry. However, limited genetic research has focused on Corylus because of the lack of genomic resources. The advent of high-throughput sequencing technologies provides a turning point for Corylus research. In the present study, we performed de novo transcriptome sequencing for the first time to produce a comprehensive database for the Corylus heterophylla Fisch floral buds. The C. heterophylla Fisch floral buds transcriptome was sequenced using the Illumina paired-end sequencing technology. We produced 28,930,890 raw reads and assembled them into 82,684 contigs. A total of 40,941 unigenes were identified, among which 30,549 were annotated in the NCBI Non-redundant (Nr) protein database and 18,581 were annotated in the Swiss-Prot database. Of these annotated unigenes, 25,311 and 10,514 unigenes were assigned to gene ontology (GO) categories and clusters of orthologous groups (COG), respectively. We could map 17,207 unigenes onto 128 pathways using the Kyoto Encyclopedia of Genes and Genomes Pathway (KEGG) database. Additionally, based on the transcriptome, we constructed a candidate cold tolerance gene set of C. heterophylla Fisch floral buds. The expression patterns of selected genes during four stages of cold acclimation suggested that these genes might be involved in different cold responsive stages in C. heterophylla Fisch floral buds. The transcriptome of C. heterophylla Fisch floral buds was deep sequenced, de novo assembled, and annotated, providing abundant data to better understand the C. heterophylla Fisch floral buds transcriptome. Candidate genes potentially involved in cold tolerance were identified, providing a material basis for future molecular mechanism analysis of C. heterophylla Fisch floral buds tolerant to cold stress.
Li, Wenli; Turner, Amy; Aggarwal, Praful; Matter, Andrea; Storvick, Erin; Arnett, Donna K; Broeckel, Ulrich
2015-12-16
Whole transcriptome sequencing (RNA-seq) represents a powerful approach for whole transcriptome gene expression analysis. However, RNA-seq carries a few limitations, e.g., the requirement of a significant amount of input RNA and complications led by non-specific mapping of short reads. The Ion AmpliSeq Transcriptome Human Gene Expression Kit (AmpliSeq) was recently introduced by Life Technologies as a whole-transcriptome, targeted gene quantification kit to overcome these limitations of RNA-seq. To assess the performance of this new methodology, we performed a comprehensive comparison of AmpliSeq with RNA-seq using two well-established next-generation sequencing platforms (Illumina HiSeq and Ion Torrent Proton). We analyzed standard reference RNA samples and RNA samples obtained from human induced pluripotent stem cell derived cardiomyocytes (hiPSC-CMs). Using published data from two standard RNA reference samples, we observed a strong concordance of log2 fold change for all genes when comparing AmpliSeq to Illumina HiSeq (Pearson's r = 0.92) and Ion Torrent Proton (Pearson's r = 0.92). We used ROC, Matthew's correlation coefficient and RMSD to determine the overall performance characteristics. All three statistical methods demonstrate AmpliSeq as a highly accurate method for differential gene expression analysis. Additionally, for genes with high abundance, AmpliSeq outperforms the two RNA-seq methods. When analyzing four closely related hiPSC-CM lines, we show that both AmpliSeq and RNA-seq capture similar global gene expression patterns consistent with known sources of variations. Our study indicates that AmpliSeq excels in the limiting areas of RNA-seq for gene expression quantification analysis. Thus, AmpliSeq stands as a very sensitive and cost-effective approach for very large scale gene expression analysis and mRNA marker screening with high accuracy.
Global-scale surface spectral variations on Titan seen from Cassini/VIMS
Barnes, J.W.; Brown, R.H.; Soderblom, L.; Buratti, B.J.; Sotin, Christophe; Rodriguez, S.; Le, Mouelic S.; Baines, K.H.; Clark, R.; Nicholson, P.
2007-01-01
We present global-scale maps of Titan from the Visual and Infrared Mapping Spectrometer (VIMS) instrument on Cassini. We map at 64 near-infrared wavelengths simultaneously, covering the atmospheric windows at 0.94, 1.08, 1.28, 1.6, 2.0, 2.8, and 5 ??m with a typical resolution of 50 km/pixel or a typical total integration time of 1 s. Our maps have five to ten times the resolution of ground-based maps, better spectral resolution across most windows, coverage in multiple atmospheric windows, and represent the first spatially resolved maps of Titan at 5 ??m. The VIMS maps provide context and surface spectral information in support of other Cassini instruments. We note a strong latitudinal dependence in the spectral character of Titan's surface, and partition the surface into 9 spectral units that we describe in terms of spectral and spatial characteristics. ?? 2006 Elsevier Inc. All rights reserved.
Given the relatively high cost of mapping impervious surfaces at regional scales, substantial effort is being expended in the development of moderate-resolution, satellite-based methods for estimating impervious surface area (ISA). To rigorously assess the accuracy of these data ...
Leveraging lung tissue transcriptome to uncover candidate causal genes in COPD genetic associations.
Lamontagne, Maxime; Bérubé, Jean-Christophe; Obeidat, Ma'en; Cho, Michael H; Hobbs, Brian D; Sakornsakolpat, Phuwanat; de Jong, Kim; Boezen, H Marike; Nickle, David; Hao, Ke; Timens, Wim; van den Berge, Maarten; Joubert, Philippe; Laviolette, Michel; Sin, Don D; Paré, Peter D; Bossé, Yohan
2018-05-15
Causal genes of chronic obstructive pulmonary disease (COPD) remain elusive. The current study aims at integrating genome-wide association studies (GWAS) and lung expression quantitative trait loci (eQTL) data to map COPD candidate causal genes and gain biological insights into the recently discovered COPD susceptibility loci. Two complementary genomic datasets on COPD were studied. First, the lung eQTL dataset which included whole-genome gene expression and genotyping data from 1038 individuals. Second, the largest COPD GWAS to date from the International COPD Genetics Consortium (ICGC) with 13 710 cases and 38 062 controls. Methods that integrated GWAS with eQTL signals including transcriptome-wide association study (TWAS), colocalization and Mendelian randomization-based (SMR) approaches were used to map causality genes, i.e. genes with the strongest evidence of being the functional effector at specific loci. These methods were applied at the genome-wide level and at COPD risk loci derived from the GWAS literature. Replication was performed using lung data from GTEx. We collated 129 non-overlapping risk loci for COPD from the GWAS literature. At the genome-wide scale, 12 new COPD candidate genes/loci were revealed and six replicated in GTEx including CAMK2A, DMPK, MYO15A, TNFRSF10A, BTN3A2 and TRBV30. In addition, we mapped candidate causal genes for 60 out of the 129 GWAS-nominated loci and 23 of them were replicated in GTEx. Mapping candidate causal genes in lung tissue represents an important contribution to the genetics of COPD, enriches our biological interpretation of GWAS findings, and brings us closer to clinical translation of genetic associations.
The primary transcriptome of the marine diazotroph Trichodesmium erythraeum IMS101
NASA Astrophysics Data System (ADS)
Pfreundt, Ulrike; Kopf, Matthias; Belkin, Natalia; Berman-Frank, Ilana; Hess, Wolfgang R.
2014-08-01
Blooms of the dinitrogen-fixing marine cyanobacterium Trichodesmium considerably contribute to new nitrogen inputs into tropical oceans. Intriguingly, only 60% of the Trichodesmium erythraeum IMS101 genome sequence codes for protein, compared with ~85% in other sequenced cyanobacterial genomes. The extensive non-coding genome fraction suggests space for an unusually high number of unidentified, potentially regulatory non-protein-coding RNAs (ncRNAs). To identify the transcribed fraction of the genome, here we present a genome-wide map of transcriptional start sites (TSS) at single nucleotide resolution, revealing the activity of 6,080 promoters. We demonstrate that T. erythraeum has the highest number of actively splicing group II introns and the highest percentage of TSS yielding ncRNAs of any bacterium examined to date. We identified a highly transcribed retroelement that serves as template repeat for the targeted mutation of at least 12 different genes by mutagenic homing. Our findings explain the non-coding portion of the T. erythraeum genome by the transcription of an unusually high number of non-coding transcripts in addition to the known high incidence of transposable elements. We conclude that riboregulation and RNA maturation-dependent processes constitute a major part of the Trichodesmium regulatory apparatus.
Tabor, Rowland W.; Haugerud, Ralph A.; Haeussler, Peter J.; Clark, Kenneth P.
2011-01-01
This map is an interpretation of a 6-ft-resolution (2-m-resolution) lidar (light detection and ranging) digital elevation model combined with the geology depicted on the Geologic Map of the Wildcat Lake 7.5' quadrangle, Kitsap and Mason Counties, Washington (Haeussler and Clark, 2000). Haeussler and Clark described, interpreted, and located the geology on the 1:24,000-scale topographic map of the Wildcat Lake 7.5' quadrangle. This map, derived from 1951 aerial photographs, has 20-ft contours, nominal horizontal resolution of approximately 40 ft (12 m), and nominal mean vertical accuracy of approximately 10 ft (3 m). Similar to many geologic maps, much of the geology in the Haeussler and Clark (2000) map-especially the distribution of surficial deposits-was interpreted from landforms portrayed on the topographic map. In 2001, the Puget Sound lidar Consortium obtained a lidar-derived digital elevation model (DEM) for Kitsap Peninsula including all of the Wildcat Lake 7.5' quadrangle. This new DEM has a horizontal resolution of 6 ft (2 m) and a mean vertical accuracy of about 1 ft (0.3 m). The greater resolution and accuracy of the lidar DEM compared to topography constructed from air photo stereo models have much improved the interpretation of geology in this heavily vegetated landscape, especially the distribution and relative age of some surficial deposits. Many contacts of surficial deposits are adapted unmodified or slightly modified from Haugerud (2009).
Halvade-RNA: Parallel variant calling from transcriptomic data using MapReduce.
Decap, Dries; Reumers, Joke; Herzeel, Charlotte; Costanza, Pascal; Fostier, Jan
2017-01-01
Given the current cost-effectiveness of next-generation sequencing, the amount of DNA-seq and RNA-seq data generated is ever increasing. One of the primary objectives of NGS experiments is calling genetic variants. While highly accurate, most variant calling pipelines are not optimized to run efficiently on large data sets. However, as variant calling in genomic data has become common practice, several methods have been proposed to reduce runtime for DNA-seq analysis through the use of parallel computing. Determining the effectively expressed variants from transcriptomics (RNA-seq) data has only recently become possible, and as such does not yet benefit from efficiently parallelized workflows. We introduce Halvade-RNA, a parallel, multi-node RNA-seq variant calling pipeline based on the GATK Best Practices recommendations. Halvade-RNA makes use of the MapReduce programming model to create and manage parallel data streams on which multiple instances of existing tools such as STAR and GATK operate concurrently. Whereas the single-threaded processing of a typical RNA-seq sample requires ∼28h, Halvade-RNA reduces this runtime to ∼2h using a small cluster with two 20-core machines. Even on a single, multi-core workstation, Halvade-RNA can significantly reduce runtime compared to using multi-threading, thus providing for a more cost-effective processing of RNA-seq data. Halvade-RNA is written in Java and uses the Hadoop MapReduce 2.0 API. It supports a wide range of distributions of Hadoop, including Cloudera and Amazon EMR.
Han, Rongchun; Takahashi, Hiroki; Nakamura, Michimi; Bunsupa, Somnuk; Yoshimoto, Naoko; Yamamoto, Hirobumi; Suzuki, Hideyuki; Shibata, Daisuke; Yamazaki, Mami; Saito, Kazuki
2015-01-01
Sophora flavescens AITON (kurara) has long been used to treat various diseases. Although several research findings revealed the biosynthetic pathways of its characteristic chemical components as represented by matrine, insufficient analysis of transcriptome data hampered in-depth analysis of the underlying putative genes responsible for the biosynthesis of pharmaceutical chemical components. In this study, more than 200 million fastq format reads were generated by Illumina's next-generation sequencing approach using nine types of tissue from S. flavescens, followed by CLC de novo assembly, ultimately yielding 83,325 contigs in total. By mapping the reads back to the contigs, reads per kilobase of the transcript per million mapped reads values were calculated to demonstrate gene expression levels, and overrepresented gene ontology terms were evaluated using Fisher's exact test. In search of the putative genes relevant to essential metabolic pathways, all 1350 unique enzyme commission numbers were used to map pathways against the Kyoto Encyclopedia of Genes and Genomes. By analyzing expression patterns, we proposed some candidate genes involved in the biosynthesis of isoflavonoids and quinolizidine alkaloids. Adopting RNA-Seq analysis, we obtained substantially credible contigs for downstream work. The preferential expression of the gene for putative lysine/ornithine decarboxylase committed in the initial step of matrine biosynthesis in leaves and stems was confirmed in semi-quantitative polymerase chain reaction (PCR) analysis. The findings in this report may serve as a stepping-stone for further research into this promising medicinal plant.
Identification of Candidate Genes Underlying an Iron Efficiency Quantitative Trait Locus in Soybean1
Peiffer, Gregory A.; King, Keith E.; Severin, Andrew J.; May, Gregory D.; Cianzio, Silvia R.; Lin, Shun Fu; Lauter, Nicholas C.; Shoemaker, Randy C.
2012-01-01
Prevalent on calcareous soils in the United States and abroad, iron deficiency is among the most common and severe nutritional stresses in plants. In soybean (Glycine max) commercial plantings, the identification and use of iron-efficient genotypes has proven to be the best form of managing this soil-related plant stress. Previous studies conducted in soybean identified a significant iron efficiency quantitative trait locus (QTL) explaining more than 70% of the phenotypic variation for the trait. In this research, we identified candidate genes underlying this QTL through molecular breeding, mapping, and transcriptome sequencing. Introgression mapping was performed using two related near-isogenic lines in which a region located on soybean chromosome 3 required for iron efficiency was identified. The region corresponds to the previously reported iron efficiency QTL. The location was further confirmed through QTL mapping conducted in this study. Transcriptome sequencing and quantitative real-time-polymerase chain reaction identified two genes encoding transcription factors within the region that were significantly induced in soybean roots under iron stress. The two induced transcription factors were identified as homologs of the subgroup lb basic helix-loop-helix (bHLH) genes that are known to regulate the strategy I response in Arabidopsis (Arabidopsis thaliana). Resequencing of these differentially expressed genes unveiled a significant deletion within a predicted dimerization domain. We hypothesize that this deletion disrupts the Fe-DEFICIENCY-INDUCED TRANSCRIPTION FACTOR (FIT)/bHLH heterodimer that has been shown to induce known iron acquisition genes. PMID:22319075
Huang, Lin; Li, Guiyang; Mo, Zhaolan; Xiao, Peng; Li, Jie; Huang, Jie
2015-01-01
Background Japanese flounder (Paralichthys olivaceus) is an economically important marine fish in Asia and has suffered from disease outbreaks caused by various pathogens, which requires more information for immune relevant genes on genome background. However, genomic and transcriptomic data for Japanese flounder remain scarce, which limits studies on the immune system of this species. In this study, we characterized the Japanese flounder spleen transcriptome using an Illumina paired-end sequencing platform to identify putative genes involved in immunity. Methodology/Principal Findings A cDNA library from the spleen of P. olivaceus was constructed and randomly sequenced using an Illumina technique. The removal of low quality reads generated 12,196,968 trimmed reads, which assembled into 96,627 unigenes. A total of 21,391 unigenes (22.14%) were annotated in the NCBI Nr database, and only 1.1% of the BLASTx top-hits matched P. olivaceus protein sequences. Approximately 12,503 (58.45%) unigenes were categorized into three Gene Ontology groups, 19,547 (91.38%) were classified into 26 Cluster of Orthologous Groups, and 10,649 (49.78%) were assigned to six Kyoto Encyclopedia of Genes and Genomes pathways. Furthermore, 40,928 putative simple sequence repeats and 47, 362 putative single nucleotide polymorphisms were identified. Importantly, we identified 1,563 putative immune-associated unigenes that mapped to 15 immune signaling pathways. Conclusions/Significance The P. olivaceus transciptome data provides a rich source to discover and identify new genes, and the immune-relevant sequences identified here will facilitate our understanding of the mechanisms involved in the immune response. Furthermore, the plentiful potential SSRs and SNPs found in this study are important resources with respect to future development of a linkage map or marker assisted breeding programs for the flounder. PMID:25723398
Defining the Human Macula Transcriptome and Candidate Retinal Disease Genes UsingEyeSAGE
Rickman, Catherine Bowes; Ebright, Jessica N.; Zavodni, Zachary J.; Yu, Ling; Wang, Tianyuan; Daiger, Stephen P.; Wistow, Graeme; Boon, Kathy; Hauser, Michael A.
2009-01-01
Purpose To develop large-scale, high-throughput annotation of the human macula transcriptome and to identify and prioritize candidate genes for inherited retinal dystrophies, based on ocular-expression profiles using serial analysis of gene expression (SAGE). Methods Two human retina and two retinal pigment epithelium (RPE)/choroid SAGE libraries made from matched macula or midperipheral retina and adjacent RPE/choroid of morphologically normal 28- to 66-year-old donors and a human central retina longSAGE library made from 41- to 66-year-old donors were generated. Their transcription profiles were entered into a relational database, EyeSAGE, including microarray expression profiles of retina and publicly available normal human tissue SAGE libraries. EyeSAGE was used to identify retina- and RPE-specific and -associated genes, and candidate genes for retina and RPE disease loci. Differential and/or cell-type specific expression was validated by quantitative and single-cell RT-PCR. Results Cone photoreceptor-associated gene expression was elevated in the macula transcription profiles. Analysis of the longSAGE retina tags enhanced tag-to-gene mapping and revealed alternatively spliced genes. Analysis of candidate gene expression tables for the identified Bardet-Biedl syndrome disease gene (BBS5) in the BBS5 disease region table yielded BBS5 as the top candidate. Compelling candidates for inherited retina diseases were identified. Conclusions The EyeSAGE database, combining three different gene-profiling platforms including the authors’ multidonor-derived retina/RPE SAGE libraries and existing single-donor retina/RPE libraries, is a powerful resource for definition of the retina and RPE transcriptomes. It can be used to identify retina-specific genes, including alternatively spliced transcripts and to prioritize candidate genes within mapped retinal disease regions. PMID:16723438
Defining the human macula transcriptome and candidate retinal disease genes using EyeSAGE.
Bowes Rickman, Catherine; Ebright, Jessica N; Zavodni, Zachary J; Yu, Ling; Wang, Tianyuan; Daiger, Stephen P; Wistow, Graeme; Boon, Kathy; Hauser, Michael A
2006-06-01
To develop large-scale, high-throughput annotation of the human macula transcriptome and to identify and prioritize candidate genes for inherited retinal dystrophies, based on ocular-expression profiles using serial analysis of gene expression (SAGE). Two human retina and two retinal pigment epithelium (RPE)/choroid SAGE libraries made from matched macula or midperipheral retina and adjacent RPE/choroid of morphologically normal 28- to 66-year-old donors and a human central retina longSAGE library made from 41- to 66-year-old donors were generated. Their transcription profiles were entered into a relational database, EyeSAGE, including microarray expression profiles of retina and publicly available normal human tissue SAGE libraries. EyeSAGE was used to identify retina- and RPE-specific and -associated genes, and candidate genes for retina and RPE disease loci. Differential and/or cell-type specific expression was validated by quantitative and single-cell RT-PCR. Cone photoreceptor-associated gene expression was elevated in the macula transcription profiles. Analysis of the longSAGE retina tags enhanced tag-to-gene mapping and revealed alternatively spliced genes. Analysis of candidate gene expression tables for the identified Bardet-Biedl syndrome disease gene (BBS5) in the BBS5 disease region table yielded BBS5 as the top candidate. Compelling candidates for inherited retina diseases were identified. The EyeSAGE database, combining three different gene-profiling platforms including the authors' multidonor-derived retina/RPE SAGE libraries and existing single-donor retina/RPE libraries, is a powerful resource for definition of the retina and RPE transcriptomes. It can be used to identify retina-specific genes, including alternatively spliced transcripts and to prioritize candidate genes within mapped retinal disease regions.
Improved evidence-based genome-scale metabolic models for maize leaf, embryo, and endosperm
Seaver, Samuel M. D.; Bradbury, Louis M. T.; Frelin, Océane; Zarecki, Raphy; Ruppin, Eytan; Hanson, Andrew D.; Henry, Christopher S.
2015-01-01
There is a growing demand for genome-scale metabolic reconstructions for plants, fueled by the need to understand the metabolic basis of crop yield and by progress in genome and transcriptome sequencing. Methods are also required to enable the interpretation of plant transcriptome data to study how cellular metabolic activity varies under different growth conditions or even within different organs, tissues, and developmental stages. Such methods depend extensively on the accuracy with which genes have been mapped to the biochemical reactions in the plant metabolic pathways. Errors in these mappings lead to metabolic reconstructions with an inflated number of reactions and possible generation of unreliable metabolic phenotype predictions. Here we introduce a new evidence-based genome-scale metabolic reconstruction of maize, with significant improvements in the quality of the gene-reaction associations included within our model. We also present a new approach for applying our model to predict active metabolic genes based on transcriptome data. This method includes a minimal set of reactions associated with low expression genes to enable activity of a maximum number of reactions associated with high expression genes. We apply this method to construct an organ-specific model for the maize leaf, and tissue specific models for maize embryo and endosperm cells. We validate our models using fluxomics data for the endosperm and embryo, demonstrating an improved capacity of our models to fit the available fluxomics data. All models are publicly available via the DOE Systems Biology Knowledgebase and PlantSEED, and our new method is generally applicable for analysis transcript profiles from any plant, paving the way for further in silico studies with a wide variety of plant genomes. PMID:25806041
Stelpflug, Scott C.; Sekhon, Rajandeep S.; Vaillancourt, Brieanne; ...
2015-12-30
Comprehensive and systematic transcriptome profiling provides valuable insight into biological and developmental processes that occur throughout the life cycle of a plant. We have enhanced our previously published microarray-based gene atlas of maize ( Zea mays L.) inbred B73 to now include 79 distinct replicated samples that have been interrogated using RNA sequencing (RNA-seq). The current version of the atlas includes 50 original array-based gene atlas samples, a time-course of 12 stalk and leaf samples postflowering, and an additional set of 17 samples from the maize seedling and adult root system. The entire dataset contains 4.6 billion mapped reads, withmore » an average of 20.5 million mapped reads per biological replicate, allowing for detection of genes with lower transcript abundance. As the new root samples represent key additions to the previously examined tissues, we highlight insights into the root transcriptome, which is represented by 28,894 (73.2%) annotated genes in maize. Additionally, we observed remarkable expression differences across both the longitudinal (four zones) and radial gradients (cortical parenchyma and stele) of the primary root supported by fourfold differential expression of 9353 and 4728 genes, respectively. Among the latter were 1110 genes that encode transcription factors, some of which are orthologs of previously characterized transcription factors known to regulate root development in Arabidopsis thaliana (L.) Heynh., while most are novel, and represent attractive targets for reverse genetics approaches to determine their roles in this important organ. As a result, this comprehensive transcriptome dataset is a powerful tool toward understanding maize development, physiology, and phenotypic diversity.« less
Improved evidence-based genome-scale metabolic models for maize leaf, embryo, and endosperm
Seaver, Samuel M.D.; Bradbury, Louis M.T.; Frelin, Océane; ...
2015-03-10
There is a growing demand for genome-scale metabolic reconstructions for plants, fueled by the need to understand the metabolic basis of crop yield and by progress in genome and transcriptome sequencing. Methods are also required to enable the interpretation of plant transcriptome data to study how cellular metabolic activity varies under different growth conditions or even within different organs, tissues, and developmental stages. Such methods depend extensively on the accuracy with which genes have been mapped to the biochemical reactions in the plant metabolic pathways. Errors in these mappings lead to metabolic reconstructions with an inflated number of reactions andmore » possible generation of unreliable metabolic phenotype predictions. Here we introduce a new evidence-based genome-scale metabolic reconstruction of maize, with significant improvements in the quality of the gene-reaction associations included within our model. We also present a new approach for applying our model to predict active metabolic genes based on transcriptome data. This method includes a minimal set of reactions associated with low expression genes to enable activity of a maximum number of reactions associated with high expression genes. We apply this method to construct an organ-specific model for the maize leaf, and tissue specific models for maize embryo and endosperm cells. We validate our models using fluxomics data for the endosperm and embryo, demonstrating an improved capacity of our models to fit the available fluxomics data. All models are publicly available via the DOE Systems Biology Knowledgebase and PlantSEED, and our new method is generally applicable for analysis transcript profiles from any plant, paving the way for further in silico studies with a wide variety of plant genomes.« less
Ballester, Ana-Rosa; Norelli, John; Burchard, Erik; Abdelfattah, Ahmed; Levin, Elena; González-Candelas, Luis; Droby, Samir; Wisniewski, Michael
2017-01-01
Malus sieversii from Central Asia is a progenitor of the modern domesticated apple (Malus × domestica). Several accessions of M. sieversii are highly resistant to the postharvest pathogen Penicillium expansum. A previous study identified the qM–Pe3.1 QTL on LG3 for resistance to P. expansum in the mapping population GMAL4593, developed using the resistant accession, M. sieversii –PI613981, and the susceptible cultivar “Royal Gala” (RG) (M. domestica), as parents. The goal of the present study was to characterize the transcriptomic response of susceptible RG and resistant PI613981 apple fruit to wounding and inoculation with P. expansum using RNA–Seq. Transcriptomic analyses 0–48 h post inoculation suggest a higher basal level of resistance and a more rapid and intense defense response to wounding and wounding plus inoculation with P. expansum in M. sieversii –PI613981 than in RG. Functional analysis showed that ethylene–related genes and genes involved in “jasmonate” and “MYB–domain transcription factor family” were over–represented in the resistant genotype. It is suggested that the more rapid response in the resistant genotype (Malus sieversii–PI613981) plays a major role in the resistance response. At least twenty DEGs were mapped to the qM–Pe3.1 QTL (M × d v.1: 26,848,396–28,424,055) on LG3, and represent potential candidate genes responsible for the observed resistance QTL in M. sieversii–PI613981. RT–qPCR of several of these genes was used to validate the RNA–Seq data and to confirm their higher expression in MS0. PMID:29201037
DOE Office of Scientific and Technical Information (OSTI.GOV)
Stelpflug, Scott C.; Sekhon, Rajandeep S.; Vaillancourt, Brieanne
Comprehensive and systematic transcriptome profiling provides valuable insight into biological and developmental processes that occur throughout the life cycle of a plant. We have enhanced our previously published microarray-based gene atlas of maize ( Zea mays L.) inbred B73 to now include 79 distinct replicated samples that have been interrogated using RNA sequencing (RNA-seq). The current version of the atlas includes 50 original array-based gene atlas samples, a time-course of 12 stalk and leaf samples postflowering, and an additional set of 17 samples from the maize seedling and adult root system. The entire dataset contains 4.6 billion mapped reads, withmore » an average of 20.5 million mapped reads per biological replicate, allowing for detection of genes with lower transcript abundance. As the new root samples represent key additions to the previously examined tissues, we highlight insights into the root transcriptome, which is represented by 28,894 (73.2%) annotated genes in maize. Additionally, we observed remarkable expression differences across both the longitudinal (four zones) and radial gradients (cortical parenchyma and stele) of the primary root supported by fourfold differential expression of 9353 and 4728 genes, respectively. Among the latter were 1110 genes that encode transcription factors, some of which are orthologs of previously characterized transcription factors known to regulate root development in Arabidopsis thaliana (L.) Heynh., while most are novel, and represent attractive targets for reverse genetics approaches to determine their roles in this important organ. As a result, this comprehensive transcriptome dataset is a powerful tool toward understanding maize development, physiology, and phenotypic diversity.« less
Rangan, Parimalan; Furtado, Agnelo; Henry, Robert J
2017-10-11
Wheat is one of the three major cereals that have been domesticated to feed human populations. The composition of the wheat grain determines the functional properties of wheat including milling efficiency, bread making, and nutritional value. Transcriptome analysis of the developing wheat grain provides key insights into the molecular basis for grain development and quality. The transcriptome of 35 genotypes was analysed by RNA-Seq at two development stages (14 and 30 days-post-anthesis, dpa) corresponding to the mid stage of development (stage Z75) and the almost mature seed (stage Z85). At 14dpa, most of the transcripts were associated with the synthesis of the major seed components including storage proteins and starch. At 30dpa, a diverse range of genes were expressed at low levels with a predominance of genes associated with seed defence and stress tolerance. RNA-Seq analysis of changes in expression between 14dpa and 30dpa stages revealed 26,477 transcripts that were significantly differentially expressed at a FDR corrected p-value cut-off at ≤0.01. Functional annotation and gene ontology mapping was performed and KEGG pathway mapping allowed grouping based upon biochemical linkages. This analysis demonstrated that photosynthesis associated with the pericarp was very active at 14dpa but had ceased by 30dpa. Recently reported genes for flour yield in milling and bread quality were found to influence wheat quality largely due to expression patterns at the earlier seed development stage. This study serves as a resource providing an overview of gene expression during wheat grain development at the early (14dpa) and late (30dpa) grain filling stages for use in studies of grain quality and nutritional value and in understanding seed biology.
RNA-Skim: a rapid method for RNA-Seq quantification at transcript level
Zhang, Zhaojun; Wang, Wei
2014-01-01
Motivation: RNA-Seq technique has been demonstrated as a revolutionary means for exploring transcriptome because it provides deep coverage and base pair-level resolution. RNA-Seq quantification is proven to be an efficient alternative to Microarray technique in gene expression study, and it is a critical component in RNA-Seq differential expression analysis. Most existing RNA-Seq quantification tools require the alignments of fragments to either a genome or a transcriptome, entailing a time-consuming and intricate alignment step. To improve the performance of RNA-Seq quantification, an alignment-free method, Sailfish, has been recently proposed to quantify transcript abundances using all k-mers in the transcriptome, demonstrating the feasibility of designing an efficient alignment-free method for transcriptome quantification. Even though Sailfish is substantially faster than alternative alignment-dependent methods such as Cufflinks, using all k-mers in the transcriptome quantification impedes the scalability of the method. Results: We propose a novel RNA-Seq quantification method, RNA-Skim, which partitions the transcriptome into disjoint transcript clusters based on sequence similarity, and introduces the notion of sig-mers, which are a special type of k-mers uniquely associated with each cluster. We demonstrate that the sig-mer counts within a cluster are sufficient for estimating transcript abundances with accuracy comparable with any state-of-the-art method. This enables RNA-Skim to perform transcript quantification on each cluster independently, reducing a complex optimization problem into smaller optimization tasks that can be run in parallel. As a result, RNA-Skim uses <4% of the k-mers and <10% of the CPU time required by Sailfish. It is able to finish transcriptome quantification in <10 min per sample by using just a single thread on a commodity computer, which represents >100 speedup over the state-of-the-art alignment-based methods, while delivering comparable or higher accuracy. Availability and implementation: The software is available at http://www.csbio.unc.edu/rs. Contact: weiwang@cs.ucla.edu Supplementary information: Supplementary data are available at Bioinformatics online. PMID:24931995
2014-01-01
Background Syntrichia caninervis is a desiccation-tolerant moss and the dominant bryophyte of the Biological Soil Crusts (BSCs) found in the Mojave and Gurbantunggut deserts. Next generation high throughput sequencing technologies offer an efficient and economic choice for characterizing non-model organism transcriptomes with little or no prior molecular information available. Results In this study, we employed next generation, high-throughput, Illumina RNA-Seq to analyze the poly-(A) + mRNA from hydrated, dehydrating and desiccated S. caninervis gametophores. Approximately 58.0 million paired-end short reads were obtained and 92,240 unigenes were assembled with an average size of 493 bp, N50 value of 662 bp and a total size of 45.48 Mbp. Sequence similarity searches against five public databases (NR, Swiss-Prot, COSMOSS, KEGG and COG) found 54,125 unigenes (58.7%) with significant similarity to an existing sequence (E-value ≤ 1e-5) and could be annotated. Gene Ontology (GO) annotation assigned 24,183 unigenes to the three GO terms: Biological Process, Cellular Component or Molecular Function. GO comparison between P. patens and S. caninervis demonstrated similar sequence enrichment across all three GO categories. 29,370 deduced polypeptide sequences were assigned Pfam domain information and categorized into 4,212 Pfam domains/families. Using the PlantTFDB, 778 unigenes were predicted to be involved in the regulation of transcription and were classified into 49 transcription factor families. Annotated unigenes were mapped to the KEGG pathways and further annotated using MapMan. Comparative genomics revealed that 44% of protein families are shared in common by S. caninervis, P. patens and Arabidopsis thaliana and that 80% are shared by both moss species. Conclusions This study is one of the first comprehensive transcriptome analyses of the moss S. caninervis. Our data extends our knowledge of bryophyte transcriptomes, provides an insight to plants adapted to the arid regions of central Asia, and continues the development of S. caninervis as a model for understanding the molecular aspects of desiccation-tolerance. PMID:25086984
Zhang, Senhao; Shi, Yinghua; Cheng, Ningning; Du, Hongqi; Fan, Wenna; Wang, Chengzhang
2015-01-01
Alfalfa (Medicago sativa L.) is one of the most widely cultivated perennial forage legumes worldwide. Fall dormancy is an adaptive character related to the biomass production and winter survival in alfalfa. The physiological, biochemical and molecular mechanisms causing fall dormancy and the related genes have not been well studied. In this study, we sequenced two standard varieties of alfalfa (dormant and non-dormant) at two time points and generated approximately 160 million high quality paired-end sequence reads using sequencing by synthesis (SBS) technology. The de novo transcriptome assembly generated a set of 192,875 transcripts with an average length of 856 bp representing about 165.1 Mb of the alfalfa leaf transcriptome. After assembly, 111,062 (57.6%) transcripts were annotated against the NCBI non-redundant database. A total of 30,165 (15.6%) transcripts were mapped to 323 Kyoto Encyclopedia of Genes and Genomes pathways. We also identified 41,973 simple sequence repeats, which can be used to generate markers for alfalfa, and 1,541 transcription factors were identified across 1,350 transcripts. Gene expression between dormant and non-dormant alfalfa at different time points were performed, and we identified several differentially expressed genes potentially related to fall dormancy. The Gene Ontology and pathways information were also identified. We sequenced and assembled the leaf transcriptome of alfalfa related to fall dormancy, and also identified some genes of interest involved in the fall dormancy mechanism. Thus, our research focused on studying fall dormancy in alfalfa through transcriptome sequencing. The sequencing and gene expression data generated in this study may be used further to elucidate the complete mechanisms governing fall dormancy in alfalfa.
Cheng, Ningning; Du, Hongqi; Fan, Wenna; Wang, Chengzhang
2015-01-01
Alfalfa (Medicago sativa L.) is one of the most widely cultivated perennial forage legumes worldwide. Fall dormancy is an adaptive character related to the biomass production and winter survival in alfalfa. The physiological, biochemical and molecular mechanisms causing fall dormancy and the related genes have not been well studied. In this study, we sequenced two standard varieties of alfalfa (dormant and non-dormant) at two time points and generated approximately 160 million high quality paired-end sequence reads using sequencing by synthesis (SBS) technology. The de novo transcriptome assembly generated a set of 192,875 transcripts with an average length of 856 bp representing about 165.1 Mb of the alfalfa leaf transcriptome. After assembly, 111,062 (57.6%) transcripts were annotated against the NCBI non-redundant database. A total of 30,165 (15.6%) transcripts were mapped to 323 Kyoto Encyclopedia of Genes and Genomes pathways. We also identified 41,973 simple sequence repeats, which can be used to generate markers for alfalfa, and 1,541 transcription factors were identified across 1,350 transcripts. Gene expression between dormant and non-dormant alfalfa at different time points were performed, and we identified several differentially expressed genes potentially related to fall dormancy. The Gene Ontology and pathways information were also identified. We sequenced and assembled the leaf transcriptome of alfalfa related to fall dormancy, and also identified some genes of interest involved in the fall dormancy mechanism. Thus, our research focused on studying fall dormancy in alfalfa through transcriptome sequencing. The sequencing and gene expression data generated in this study may be used further to elucidate the complete mechanisms governing fall dormancy in alfalfa. PMID:25799491
Transcriptome and proteomic analysis of mango (Mangifera indica Linn) fruits.
Wu, Hong-xia; Jia, Hui-min; Ma, Xiao-wei; Wang, Song-biao; Yao, Quan-sheng; Xu, Wen-tian; Zhou, Yi-gang; Gao, Zhong-shan; Zhan, Ru-lin
2014-06-13
Here we used Illumina RNA-seq technology for transcriptome sequencing of a mixed fruit sample from 'Zill' mango (Mangifera indica Linn) fruit pericarp and pulp during the development and ripening stages. RNA-seq generated 68,419,722 sequence reads that were assembled into 54,207 transcripts with a mean length of 858bp, including 26,413 clusters and 27,794 singletons. A total of 42,515(78.43%) transcripts were annotated using public protein databases, with a cut-off E-value above 10(-5), of which 35,198 and 14,619 transcripts were assigned to gene ontology terms and clusters of orthologous groups respectively. Functional annotation against the Kyoto Encyclopedia of Genes and Genomes database identified 23,741(43.79%) transcripts which were mapped to 128 pathways. These pathways revealed many previously unknown transcripts. We also applied mass spectrometry-based transcriptome data to characterize the proteome of ripe fruit. LC-MS/MS analysis of the mango fruit proteome was using tandem mass spectrometry (MS/MS) in an LTQ Orbitrap Velos (Thermo) coupled online to the HPLC. This approach enabled the identification of 7536 peptides that matched 2754 proteins. Our study provides a comprehensive sequence for a systemic view of transcriptome during mango fruit development and the most comprehensive fruit proteome to date, which are useful for further genomics research and proteomic studies. Our study provides a comprehensive sequence for a systemic view of both the transcriptome and proteome of mango fruit, and a valuable reference for further research on gene expression and protein identification. This article is part of a Special Issue entitled: Proteomics of non-model organisms. Copyright © 2014 Elsevier B.V. All rights reserved.
Joyce, Blake L.; Haug-Baltzell, Asher K.; Hulvey, Jonathan P.; McCarthy, Fiona; Devisetty, Upendra Kumar; Lyons, Eric
2017-01-01
This workflow allows novice researchers to leverage advanced computational resources such as cloud computing to carry out pairwise comparative transcriptomics. It also serves as a primer for biologists to develop data scientist computational skills, e.g. executing bash commands, visualization and management of large data sets. All command line code and further explanations of each command or step can be found on the wiki (https://wiki.cyverse.org/wiki/x/dgGtAQ). The Discovery Environment and Atmosphere platforms are connected together through the CyVerse Data Store. As such, once the initial raw sequencing data has been uploaded there is no more need to transfer large data files over an Internet connection, minimizing the amount of time needed to conduct analyses. This protocol is designed to analyze only two experimental treatments or conditions. Differential gene expression analysis is conducted through pairwise comparisons, and will not be suitable to test multiple factors. This workflow is also designed to be manual rather than automated. Each step must be executed and investigated by the user, yielding a better understanding of data and analytical outputs, and therefore better results for the user. Once complete, this protocol will yield de novo assembled transcriptome(s) for underserved (non-model) organisms without the need to map to previously assembled reference genomes (which are usually not available in underserved organism). These de novo transcriptomes are further used in pairwise differential gene expression analysis to investigate genes differing between two experimental conditions. Differentially expressed genes are then functionally annotated to understand the genetic response organisms have to experimental conditions. In total, the data derived from this protocol is used to test hypotheses about biological responses of underserved organisms. PMID:28518075
Gschloessl, B; Dorkeld, F; Berges, H; Beydon, G; Bouchez, O; Branco, M; Bretaudeau, A; Burban, C; Dubois, E; Gauthier, P; Lhuillier, E; Nichols, J; Nidelet, S; Rocha, S; Sauné, L; Streiff, R; Gautier, M; Kerdelhué, C
2018-05-01
The pine processionary moth Thaumetopoea pityocampa (Lepidoptera: Notodontidae) is the main pine defoliator in the Mediterranean region. Its urticating larvae cause severe human and animal health concerns in the invaded areas. This species shows a high phenotypic variability for various traits, such as phenology, fecundity and tolerance to extreme temperatures. This study presents the construction and analysis of extensive genomic and transcriptomic resources, which are an obligate prerequisite to understand their underlying genetic architecture. Using a well-studied population from Portugal with peculiar phenological characteristics, the karyotype was first determined and a first draft genome of 537 Mb total length was assembled into 68,292 scaffolds (N50 = 164 kb). From this genome assembly, 29,415 coding genes were predicted. To circumvent some limitations for fine-scale physical mapping of genomic regions of interest, a 3X coverage BAC library was also developed. In particular, 11 BACs from this library were individually sequenced to assess the assembly quality. Additionally, de novo transcriptomic resources were generated from various developmental stages sequenced with HiSeq and MiSeq Illumina technologies. The reads were de novo assembled into 62,376 and 63,175 transcripts, respectively. Then, a robust subset of the genome-predicted coding genes, the de novo transcriptome assemblies and previously published 454/Sanger data were clustered to obtain a high-quality and comprehensive reference transcriptome consisting of 29,701 bona fide unigenes. These sequences covered 99% of the cegma and 88% of the busco highly conserved eukaryotic genes and 84% of the busco arthropod gene set. Moreover, 90% of these transcripts could be localized on the draft genome. The described information is available via a genome annotation portal (http://bipaa.genouest.org/sp/thaumetopoea_pityocampa/). © 2018 John Wiley & Sons Ltd.
Ma, Jun; Kanakala, S; He, Yehua; Zhang, Junli; Zhong, Xiaolan
2015-01-01
Ananas comosus var. bracteatus (Red Pineapple) is an important ornamental plant for its colorful leaves and decorative red fruits. Because of its complex genome, it is difficult to understand the molecular mechanisms involved in the growth and development. Thus high-throughput transcriptome sequencing of Ananas comosus var. bracteatus is necessary to generate large quantities of transcript sequences for the purpose of gene discovery and functional genomic studies. The Ananas comosus var. bracteatus transcriptome was sequenced by the Illumina paired-end sequencing technology. We obtained a total of 23.5 million high quality sequencing reads, 1,555,808 contigs and 41,052 unigenes. In total 41,052 unigenes of Ananas comosus var. bracteatus, 23,275 unigenes were annotated in the NCBI non-redundant protein database and 23,134 unigenes were annotated in the Swiss-Port database. Out of these, 17,748 and 8,505 unigenes were assigned to gene ontology categories and clusters of orthologous groups, respectively. Functional annotation against Kyoto Encyclopedia of Genes and Genomes Pathway database identified 5,825 unigenes which were mapped to 117 pathways. The assembly predicted many unigenes that were previously unknown. The annotated unigenes were compared against pineapple, rice, maize, Arabidopsis, and sorghum. Unigenes that did not match any of those five sequence datasets are considered to be Ananas comosus var. bracteatus unique. We predicted unigenes encoding enzymes involved in terpenoid and phenylpropanoid biosynthesis. The sequence data provide the most comprehensive transcriptomic resource currently available for Ananas comosus var. bracteatus. To our knowledge; this is the first report on the de novo transcriptome sequencing of the Ananas comosus var. bracteatus. Unigenes obtained in this study, may help improve future gene expression, genetic and genomics studies in Ananas comosus var. bracteatus.
Ma, Jun; Kanakala, S.; He, Yehua; Zhang, Junli; Zhong, Xiaolan
2015-01-01
Background Ananas comosus var. bracteatus (Red Pineapple) is an important ornamental plant for its colorful leaves and decorative red fruits. Because of its complex genome, it is difficult to understand the molecular mechanisms involved in the growth and development. Thus high-throughput transcriptome sequencing of Ananas comosus var. bracteatus is necessary to generate large quantities of transcript sequences for the purpose of gene discovery and functional genomic studies. Results The Ananas comosus var. bracteatus transcriptome was sequenced by the Illumina paired-end sequencing technology. We obtained a total of 23.5 million high quality sequencing reads, 1,555,808 contigs and 41,052 unigenes. In total 41,052 unigenes of Ananas comosus var. bracteatus, 23,275 unigenes were annotated in the NCBI non-redundant protein database and 23,134 unigenes were annotated in the Swiss-Port database. Out of these, 17,748 and 8,505 unigenes were assigned to gene ontology categories and clusters of orthologous groups, respectively. Functional annotation against Kyoto Encyclopedia of Genes and Genomes Pathway database identified 5,825 unigenes which were mapped to 117 pathways. The assembly predicted many unigenes that were previously unknown. The annotated unigenes were compared against pineapple, rice, maize, Arabidopsis, and sorghum. Unigenes that did not match any of those five sequence datasets are considered to be Ananas comosus var. bracteatus unique. We predicted unigenes encoding enzymes involved in terpenoid and phenylpropanoid biosynthesis. Conclusion The sequence data provide the most comprehensive transcriptomic resource currently available for Ananas comosus var. bracteatus. To our knowledge; this is the first report on the de novo transcriptome sequencing of the Ananas comosus var. bracteatus. Unigenes obtained in this study, may help improve future gene expression, genetic and genomics studies in Ananas comosus var. bracteatus. PMID:25769053
A self-trained classification technique for producing 30 m percent-water maps from Landsat data
Rover, Jennifer R.; Wylie, Bruce K.; Ji, Lei
2010-01-01
Small bodies of water can be mapped with moderate-resolution satellite data using methods where water is mapped as subpixel fractions using field measurements or high-resolution images as training datasets. A new method, developed from a regression-tree technique, uses a 30 m Landsat image for training the regression tree that, in turn, is applied to the same image to map subpixel water. The self-trained method was evaluated by comparing the percent-water map with three other maps generated from established percent-water mapping methods: (1) a regression-tree model trained with a 5 m SPOT 5 image, (2) a regression-tree model based on endmembers and (3) a linear unmixing classification technique. The results suggest that subpixel water fractions can be accurately estimated when high-resolution satellite data or intensively interpreted training datasets are not available, which increases our ability to map small water bodies or small changes in lake size at a regional scale.
Spatial reconstruction of single-cell gene expression
Satija, Rahul; Farrell, Jeffrey A.; Gennert, David; Schier, Alexander F.; Regev, Aviv
2015-01-01
Spatial localization is a key determinant of cellular fate and behavior, but spatial RNA assays traditionally rely on staining for a limited number of RNA species. In contrast, single-cell RNA-seq allows for deep profiling of cellular gene expression, but established methods separate cells from their native spatial context. Here we present Seurat, a computational strategy to infer cellular localization by integrating single-cell RNA-seq data with in situ RNA patterns. We applied Seurat to spatially map 851 single cells from dissociated zebrafish (Danio rerio) embryos, inferring a transcriptome-wide map of spatial patterning. We confirmed Seurat’s accuracy using several experimental approaches, and used it to identify a set of archetypal expression patterns and spatial markers. Additionally, Seurat correctly localizes rare subpopulations, accurately mapping both spatially restricted and scattered groups. Seurat will be applicable to mapping cellular localization within complex patterned tissues in diverse systems. PMID:25867923
The National Map - Orthoimagery
Mauck, James; Brown, Kim; Carswell, William J.
2009-01-01
Orthorectified digital aerial photographs and satellite images of 1-meter (m) pixel resolution or finer make up the orthoimagery component of The National Map. The process of orthorectification removes feature displacements and scale variations caused by terrain relief and sensor geometry. The result is a combination of the image characteristics of an aerial photograph or satellite image and the geometric qualities of a map. These attributes allow users to: *Measure distance *Calculate areas *Determine shapes of features *Calculate directions *Determine accurate coordinates *Determine land cover and use *Perform change detection *Update maps The standard digital orthoimage is a 1-m or finer resolution, natural color or color infra-red product. Most are now produced as GeoTIFFs and accompanied by a Federal Geographic Data Committee (FGDC)-compliant metadata file. The primary source for 1-m data is the National Agriculture Imagery Program (NAIP) leaf-on imagery. The U.S. Geological Survey (USGS) utilizes NAIP imagery as the image layer on its 'Digital- Map' - a new generation of USGS topographic maps (http://nationalmap.gov/digital_map). However, many Federal, State, and local governments and organizations require finer resolutions to meet a myriad of needs. Most of these images are leaf-off, natural-color products at resolutions of 1-foot (ft) or finer.
USDA-ARS?s Scientific Manuscript database
Genotyping by sequencing (GBS) provides opportunities to generate high-resolution genetic maps at a low per-sample genotyping cost, but missing data and under-calling of heterozygotes complicate the creation of GBS linkage maps for highly heterozygous species. To overcome these issues, we developed ...
A high-resolution radiation hybrid map of the bovine genome
USDA-ARS?s Scientific Manuscript database
We are building high-resolution radiation hybrid maps of all 29 bovine autosomes and chromosome X, using a 58,000-marker genotyping assay, and a 12,000-rad whole-genome radiation hybrid (RH) panel. To accommodate the large number of markers, and to automate the map building procedure, a software pip...
Ground mapping resolution accuracy of a scanning radiometer from a geostationary satellite.
Stremler, F G; Khalil, M A; Parent, R J
1977-06-01
Measures of the spatial and spatial rate (frequency) mapping of scanned visual imagery from an earth reference system to a spin-scan geostationary satellite are examined. Mapping distortions and coordinate inversions to correct for these distortions are formulated in terms of geometric transformations between earth and satellite frames of reference. Probabilistic methods are used to develop relations for obtainable mapping resolution when coordinate inversions are employed.
NASA Astrophysics Data System (ADS)
Ba, Yu Tao; xian Liu, Bao; Sun, Feng; Wang, Li hua; Tang, Yu jia; Zhang, Da wei
2017-04-01
High-resolution mapping of PM2.5 is the prerequisite for precise analytics and subsequent anti-pollution interventions. Considering the large variances of particulate distribution, urban-scale mapping is challenging either with ground-based fixed stations, with satellites or via models. In this study, a dynamic fusion method between high-density sensor network and MODIS Aerosol Optical Depth (AOD) was introduced. The sensor network was deployed in Beijing ( > 1000 fixed monitors across 16000 km2 area) to provide raw observations with high temporal resolution (sampling interval < 1 hour), high spatial resolution in flat areas ( < 1 km), and low spatial resolution in mountainous areas ( > 5 km). The MODIS AOD was calibrated to provide distribution map with low temporal resolution (daily) and moderate spatial resolution ( = 3 km). By encoding the data quality and defects (e.g. could, reflectance, abnormal), a hybrid interpolation procedure with cross-validation generated PM2.5 distribution with both high temporal and spatial resolution. Several no-pollutant and high-pollution periods were tested to validate the proposed fusion method for capturing the instantaneous patterns of PM2.5 emission.
Jackson, Ryan N.; McCoy, Airlie J.; Terwilliger, Thomas C.; ...
2015-07-30
Structures of multi-subunit macromolecular machines are primarily determined by either electron microscopy (EM) or X-ray crystallography. In many cases, a structure for a complex can be obtained at low resolution (at a coarse level of detail) with EM and at higher resolution (with finer detail) by X-ray crystallography. The integration of these two structural techniques is becoming increasingly important for generating atomic models of macromolecular complexes. A low-resolution EM image can be a powerful tool for obtaining the "phase" information that is missing from an X-ray crystallography experiment, however integration of EM and X-ray diffraction data has been technically challenging.more » Here we show a step-by-step protocol that explains how low-resolution EM maps can be placed in the crystallographic unit cell by molecular replacement, and how initial phases computed from the placed EM density are extended to high resolution by averaging maps over non-crystallographic symmetry. As the resolution gap between EM and Xray crystallography continues to narrow, the use of EM maps to help with X-ray crystal structure determination, as described in this protocol, will become increasingly effective.« less
Urban cover mapping using digital, high-resolution aerial imagery
Soojeong Myeong; David J. Nowak; Paul F. Hopkins; Robert H. Brock
2003-01-01
High-spatial resolution digital color-infrared aerial imagery of Syracuse, NY was analyzed to test methods for developing land cover classifications for an urban area. Five cover types were mapped: tree/shrub, grass/herbaceous, bare soil, water and impervious surface. Challenges in high-spatial resolution imagery such as shadow effect and similarity in spectral...
Jun-Jun Liu; Anna W. Schoettle; Richard A. Sniezko; Rona N. Sturrock; Arezoo Zamany; Holly Williams; Amanda Ha; Danelle Chan; Bob Danchok; Douglas P. Savin; Angelia Kegley
2016-01-01
Linkage of DNA markers with phenotypic traits provides essential information to dissect clustered genes with potential phenotypic contributions in a target genome region. Pinus flexilis E. James (limber pine) is a keystone five-needle pine species in mountain-top ecosystems of North America. White pine blister rust (WPBR), caused by a non-native fungal...
USDA-ARS?s Scientific Manuscript database
Domesticated carrots, Daucus carota subsp. sativus, are the richest source of beta-carotene in the US diet, which when consumed is converted into vitamin A, an essential component of eye health and immunity. The Y2 locus plays a significant role in beta-carotene accumulation in carrot roots, but a c...
USDA-ARS?s Scientific Manuscript database
Apios (Apios americana; “apios”), a tuberous perennial legume in the Phaseoleae tribe, was widely used as a food by Native Americans. Work in the last 40 years has led to several improved breeding lines. Aspects of the pollination biology (complex floral structure and tripping mechanism) have made c...
Identification of hypertension-related genes through an integrated genomic-transcriptomic approach.
Yagil, Chana; Hubner, Norbert; Monti, Jan; Schulz, Herbert; Sapojnikov, Marina; Luft, Friedrich C; Ganten, Detlev; Yagil, Yoram
2005-04-01
In search for the genetic basis of hypertension, we applied an integrated genomic-transcriptomic approach to identify genes involved in the pathogenesis of hypertension in the Sabra rat model of salt-susceptibility. In the genomic arm of the project, we previously detected in male rats two salt-susceptibility QTLs on chromosome 1, SS1a (D1Mgh2-D1Mit11; span 43.1 cM) and SS1b (D1Mit11-D1Mit4; span 18 cM). In the transcriptomic arm, we studied differential gene expression in kidneys of SBH/y and SBN/y rats that had been fed regular diet or salt-loaded. We used the Affymetrix Rat Genome RAE230 GeneChip and probed >30,000 transcripts. The research algorithm called for an initial genome-wide screen for differentially expressed transcripts between the study groups. This step was followed by cluster analysis based on 2x2 ANOVA to identify transcripts that were of relevance specifically to salt-sensitivity and hypertension and to salt-resistance. The two arms of the project were integrated by identifying those differentially expressed transcripts that showed an allele-specific hypertensive effect on salt-loading and that mapped within the defined boundaries of the salt-susceptibility QTLs on chromosome 1. The differentially expressed transcripts were confirmed by RT-PCR. Of the 2933 genes annotated to rat chromosome 1, 1102 genes were identified within the boundaries of the two blood pressure QTLs. The microarray identified 2470 transcripts that were differentially expressed between the study groups. Cluster analysis identified genome-wide 192 genes that were relevant to salt-susceptibility and/or hypertension, 19 of which mapped to chromosome 1. Eight of these genes mapped within the boundaries of QTLs SS1a and SS1b. RT-PCR confirmed 7 genes, leaving TcTex1, Myadm, Lisch7, Axl-like, Fah, PRC1-like, and Serpinh1. None of these genes has been implicated in hypertension before. These genes become henceforth targets for our continuing search for the genetic basis of hypertension.
NASA Astrophysics Data System (ADS)
Gowda, P. H.
2016-12-01
Evapotranspiration (ET) is an important process in ecosystems' water budget and closely linked to its productivity. Therefore, regional scale daily time series ET maps developed at high and medium resolutions have large utility in studying the carbon-energy-water nexus and managing water resources. There are efforts to develop such datasets on a regional to global scale but often faced with the limitations of spatial-temporal resolution tradeoffs in satellite remote sensing technology. In this study, we developed frameworks for generating high and medium resolution daily ET maps from Landsat and MODIS (Moderate Resolution Imaging Spectroradiometer) data, respectively. For developing high resolution (30-m) daily time series ET maps with Landsat TM data, the series version of Two Source Energy Balance (TSEB) model was used to compute sensible and latent heat fluxes of soil and canopy separately. Landsat 5 (2000-2011) and Landsat 8 (2013-2014) imageries for row 28/35 and 27/36 covering central Oklahoma was used. MODIS data (2001-2014) covering Oklahoma and Texas Panhandle was used to develop medium resolution (250-m), time series daily ET maps with SEBS (Surface Energy Balance System) model. An extensive network of weather stations managed by Texas High Plains ET Network and Oklahoma Mesonet was used to generate spatially interpolated inputs of air temperature, relative humidity, wind speed, solar radiation, pressure, and reference ET. A linear interpolation sub-model was used to estimate the daily ET between the image acquisition days. Accuracy assessment of daily ET maps were done against eddy covariance data from two grassland sites at El Reno, OK. Statistical results indicated good performance by modeling frameworks developed for deriving time series ET maps. Results indicated that the proposed ET mapping framework is suitable for deriving daily time series ET maps at regional scale with Landsat and MODIS data.
Bhattarai, Sunil; Aly, Ahmed; Garcia, Kristy; Ruiz, Diandra; Pontarelli, Fabrizio; Dharap, Ashutosh
2018-06-03
Gene expression in cerebral ischemia has been a subject of intense investigations for several years. Studies utilizing probe-based high-throughput methodologies such as microarrays have contributed significantly to our existing knowledge but lacked the capacity to dissect the transcriptome in detail. Genome-wide RNA-sequencing (RNA-seq) enables comprehensive examinations of transcriptomes for attributes such as strandedness, alternative splicing, alternative transcription start/stop sites, and sequence composition, thus providing a very detailed account of gene expression. Leveraging this capability, we conducted an in-depth, genome-wide evaluation of the protein-coding transcriptome of the adult mouse cortex after transient focal ischemia at 6, 12, or 24 h of reperfusion using RNA-seq. We identified a total of 1007 transcripts at 6 h, 1878 transcripts at 12 h, and 1618 transcripts at 24 h of reperfusion that were significantly altered as compared to sham controls. With isoform-level resolution, we identified 23 splice variants arising from 23 genes that were novel mRNA isoforms. For a subset of genes, we detected reperfusion time-point-dependent splice isoform switching, indicating an expression and/or functional switch for these genes. Finally, for 286 genes across all three reperfusion time-points, we discovered multiple, distinct, simultaneously expressed and differentially altered isoforms per gene that were generated via alternative transcription start/stop sites. Of these, 165 isoforms derived from 109 genes were novel mRNAs. Together, our data unravel the protein-coding transcriptome of the cerebral cortex at an unprecedented depth to provide several new insights into the flexibility and complexity of stroke-related gene transcription and transcript organization.
On the influence of zero-padding on the nonlinear operations in Quantitative Susceptibility Mapping
Eskreis-Winkler, Sarah; Zhou, Dong; Liu, Tian; Gupta, Ajay; Gauthier, Susan A.; Wang, Yi; Spincemaille, Pascal
2016-01-01
Purpose Zero padding is a well-studied interpolation technique that improves image visualization without increasing image resolution. This interpolation is often performed as a last step before images are displayed on clinical workstations. Here, we seek to demonstrate the importance of zero padding before rather than after performing non-linear post-processing algorithms, such as Quantitative Susceptibility Mapping (QSM). To do so, we evaluate apparent spatial resolution, relative error and depiction of multiple sclerosis (MS) lesions on images that were zero padded prior to, in the middle of, and after the application of the QSM algorithm. Materials and Methods High resolution gradient echo (GRE) data were acquired on twenty MS patients, from which low resolution data were derived using k-space cropping. Pre-, mid-, and post-zero padded QSM images were reconstructed from these low resolution data by zero padding prior to field mapping, after field mapping, and after susceptibility mapping, respectively. Using high resolution QSM as the gold standard, apparent spatial resolution, relative error, and image quality of the pre-, mid-, and post-zero padded QSM images were measured and compared. Results Both the accuracy and apparent spatial resolution of the pre-zero padded QSM was higher than that of mid-zero padded QSM (p < 0.001; p < 0.001), which was higher than that of post-zero padded QSM (p < 0.001; p < 0.001). The image quality of pre-zero padded reconstructions was higher than that of mid- and post-zero padded reconstructions (p = 0.004; p < 0.001). Conclusion Zero padding of the complex GRE data prior to nonlinear susceptibility mapping improves image accuracy and apparent resolution compared to zero padding afterwards. It also provides better delineation of MS lesion geometry, which may improve lesion subclassification and disease monitoring in MS patients. PMID:27587225
On the influence of zero-padding on the nonlinear operations in Quantitative Susceptibility Mapping.
Eskreis-Winkler, Sarah; Zhou, Dong; Liu, Tian; Gupta, Ajay; Gauthier, Susan A; Wang, Yi; Spincemaille, Pascal
2017-01-01
Zero padding is a well-studied interpolation technique that improves image visualization without increasing image resolution. This interpolation is often performed as a last step before images are displayed on clinical workstations. Here, we seek to demonstrate the importance of zero padding before rather than after performing non-linear post-processing algorithms, such as Quantitative Susceptibility Mapping (QSM). To do so, we evaluate apparent spatial resolution, relative error and depiction of multiple sclerosis (MS) lesions on images that were zero padded prior to, in the middle of, and after the application of the QSM algorithm. High resolution gradient echo (GRE) data were acquired on twenty MS patients, from which low resolution data were derived using k-space cropping. Pre-, mid-, and post-zero padded QSM images were reconstructed from these low resolution data by zero padding prior to field mapping, after field mapping, and after susceptibility mapping, respectively. Using high resolution QSM as the gold standard, apparent spatial resolution, relative error, and image quality of the pre-, mid-, and post-zero padded QSM images were measured and compared. Both the accuracy and apparent spatial resolution of the pre-zero padded QSM was higher than that of mid-zero padded QSM (p<0.001; p<0.001), which was higher than that of post-zero padded QSM (p<0.001; p<0.001). The image quality of pre-zero padded reconstructions was higher than that of mid- and post-zero padded reconstructions (p=0.004; p<0.001). Zero padding of the complex GRE data prior to nonlinear susceptibility mapping improves image accuracy and apparent resolution compared to zero padding afterwards. It also provides better delineation of MS lesion geometry, which may improve lesion subclassification and disease monitoring in MS patients. Copyright © 2016 Elsevier Inc. All rights reserved.
Kumar, Ajay; Seetan, Raed; Mergoum, Mohamed; Tiwari, Vijay K; Iqbal, Muhammad J; Wang, Yi; Al-Azzam, Omar; Šimková, Hana; Luo, Ming-Cheng; Dvorak, Jan; Gu, Yong Q; Denton, Anne; Kilian, Andrzej; Lazo, Gerard R; Kianian, Shahryar F
2015-10-16
The large and complex genome of bread wheat (Triticum aestivum L., ~17 Gb) requires high resolution genome maps with saturated marker scaffolds to anchor and orient BAC contigs/ sequence scaffolds for whole genome assembly. Radiation hybrid (RH) mapping has proven to be an excellent tool for the development of such maps for it offers much higher and more uniform marker resolution across the length of the chromosome compared to genetic mapping and does not require marker polymorphism per se, as it is based on presence (retention) vs. absence (deletion) marker assay. In this study, a 178 line RH panel was genotyped with SSRs and DArT markers to develop the first high resolution RH maps of the entire D-genome of Ae. tauschii accession AL8/78. To confirm map order accuracy, the AL8/78-RH maps were compared with:1) a DArT consensus genetic map constructed using more than 100 bi-parental populations, 2) a RH map of the D-genome of reference hexaploid wheat 'Chinese Spring', and 3) two SNP-based genetic maps, one with anchored D-genome BAC contigs and another with anchored D-genome sequence scaffolds. Using marker sequences, the RH maps were also anchored with a BAC contig based physical map and draft sequence of the D-genome of Ae. tauschii. A total of 609 markers were mapped to 503 unique positions on the seven D-genome chromosomes, with a total map length of 14,706.7 cR. The average distance between any two marker loci was 29.2 cR which corresponds to 2.1 cM or 9.8 Mb. The average mapping resolution across the D-genome was estimated to be 0.34 Mb (Mb/cR) or 0.07 cM (cM/cR). The RH maps showed almost perfect agreement with several published maps with regard to chromosome assignments of markers. The mean rank correlations between the position of markers on AL8/78 maps and the four published maps, ranged from 0.75 to 0.92, suggesting a good agreement in marker order. With 609 mapped markers, a total of 2481 deletions for the whole D-genome were detected with an average deletion size of 42.0 Mb. A total of 520 markers were anchored to 216 Ae. tauschii sequence scaffolds, 116 of which were not anchored earlier to the D-genome. This study reports the development of first high resolution RH maps for the D-genome of Ae. tauschii accession AL8/78, which were then used for the anchoring of unassigned sequence scaffolds. This study demonstrates how RH mapping, which offered high and uniform resolution across the length of the chromosome, can facilitate the complete sequence assembly of the large and complex plant genomes.
Ringler, Max; Mangione, Rosanna; Pašukonis, Andrius; Rainer, Gerhard; Gyimesi, Kristin; Felling, Julia; Kronaus, Hannes; Réjou-Méchain, Maxime; Chave, Jérôme; Reiter, Karl; Ringler, Eva
2015-01-01
For animals with spatially complex behaviours at relatively small scales, the resolution of a global positioning system (GPS) receiver location is often below the resolution needed to correctly map animals’ spatial behaviour. Natural conditions such as canopy cover, canyons or clouds can further degrade GPS receiver reception. Here we present a detailed, high-resolution map of a 4.6 ha Neotropical river island and a 8.3 ha mainland plot with the location of every tree >5 cm DBH and all structures on the forest floor, which are relevant to our study species, the territorial frog Allobates femoralis (Dendrobatidae). The map was derived using distance- and compass-based survey techniques, rooted on dGPS reference points, and incorporates altitudinal information based on a LiDAR survey of the area. PMID:27053943
Visualization for genomics: the Microbial Genome Viewer.
Kerkhoven, Robert; van Enckevort, Frank H J; Boekhorst, Jos; Molenaar, Douwe; Siezen, Roland J
2004-07-22
A Web-based visualization tool, the Microbial Genome Viewer, is presented that allows the user to combine complex genomic data in a highly interactive way. This Web tool enables the interactive generation of chromosome wheels and linear genome maps from genome annotation data stored in a MySQL database. The generated images are in scalable vector graphics (SVG) format, which is suitable for creating high-quality scalable images and dynamic Web representations. Gene-related data such as transcriptome and time-course microarray experiments can be superimposed on the maps for visual inspection. The Microbial Genome Viewer 1.0 is freely available at http://www.cmbi.kun.nl/MGV
Towards a complete map of the human long non-coding RNA transcriptome.
Uszczynska-Ratajczak, Barbara; Lagarde, Julien; Frankish, Adam; Guigó, Roderic; Johnson, Rory
2018-05-23
Gene maps, or annotations, enable us to navigate the functional landscape of our genome. They are a resource upon which virtually all studies depend, from single-gene to genome-wide scales and from basic molecular biology to medical genetics. Yet present-day annotations suffer from trade-offs between quality and size, with serious but often unappreciated consequences for downstream studies. This is particularly true for long non-coding RNAs (lncRNAs), which are poorly characterized compared to protein-coding genes. Long-read sequencing technologies promise to improve current annotations, paving the way towards a complete annotation of lncRNAs expressed throughout a human lifetime.
Leaps and lulls in the developmental transcriptome of Dictyostelium discoideum.
Rosengarten, Rafael David; Santhanam, Balaji; Fuller, Danny; Katoh-Kurasawa, Mariko; Loomis, William F; Zupan, Blaz; Shaulsky, Gad
2015-04-13
Development of the soil amoeba Dictyostelium discoideum is triggered by starvation. When placed on a solid substrate, the starving solitary amoebae cease growth, communicate via extracellular cAMP, aggregate by tens of thousands and develop into multicellular organisms. Early phases of the developmental program are often studied in cells starved in suspension while cAMP is provided exogenously. Previous studies revealed massive shifts in the transcriptome under both developmental conditions and a close relationship between gene expression and morphogenesis, but were limited by the sampling frequency and the resolution of the methods. Here, we combine the superior depth and specificity of RNA-seq-based analysis of mRNA abundance with high frequency sampling during filter development and cAMP pulsing in suspension. We found that the developmental transcriptome exhibits mostly gradual changes interspersed by a few instances of large shifts. For each time point we treated the entire transcriptome as single phenotype, and were able to characterize development as groups of similar time points separated by gaps. The grouped time points represented gradual changes in mRNA abundance, or molecular phenotype, and the gaps represented times during which many genes are differentially expressed rapidly, and thus the phenotype changes dramatically. Comparing developmental experiments revealed that gene expression in filter developed cells lagged behind those treated with exogenous cAMP in suspension. The high sampling frequency revealed many genes whose regulation is reproducibly more complex than indicated by previous studies. Gene Ontology enrichment analysis suggested that the transition to multicellularity coincided with rapid accumulation of transcripts associated with DNA processes and mitosis. Later development included the up-regulation of organic signaling molecules and co-factor biosynthesis. Our analysis also demonstrated a high level of synchrony among the developing structures throughout development. Our data describe D. discoideum development as a series of coordinated cellular and multicellular activities. Coordination occurred within fields of aggregating cells and among multicellular bodies, such as mounds or migratory slugs that experience both cell-cell contact and various soluble signaling regimes. These time courses, sampled at the highest temporal resolution to date in this system, provide a comprehensive resource for studies of developmental gene expression.
NASA Technical Reports Server (NTRS)
Clegg, R. H.; Scherz, J. P.
1975-01-01
Successful aerial photography depends on aerial cameras providing acceptable photographs within cost restrictions of the job. For topographic mapping where ultimate accuracy is required only large format mapping cameras will suffice. For mapping environmental patterns of vegetation, soils, or water pollution, 9-inch cameras often exceed accuracy and cost requirements, and small formats may be better. In choosing the best camera for environmental mapping, relative capabilities and costs must be understood. This study compares resolution, photo interpretation potential, metric accuracy, and cost of 9-inch, 70mm, and 35mm cameras for obtaining simultaneous color and color infrared photography for environmental mapping purposes.
Rey, S; Boltana, S; Vargas, R; Roher, N; Mackenzie, S
2013-12-01
Resolving phenotype variation within a population in response to environmental perturbation is central to understanding biological adaptation. Relating meaningful adaptive changes at the level of the transcriptome requires the identification of processes that have a functional significance for the individual. This remains a major objective towards understanding the complex interactions between environmental demand and an individual's capacity to respond to such demands. The interpretation of such interactions and the significance of biological variation between individuals from the same or different populations remain a difficult and under-addressed question. Here, we provide evidence that variation in gene expression between individuals in a zebrafish population can be partially resolved by a priori screening for animal personality and accounts for >9% of observed variation in the brain transcriptome. Proactive and reactive individuals within a wild-type population exhibit consistent behavioural responses over time and context that relates to underlying differences in regulated gene networks and predicted protein-protein interactions. These differences can be mapped to distinct regions of the brain and provide a foundation towards understanding the coordination of underpinning adaptive molecular events within populations. © 2013 John Wiley & Sons Ltd.
Molecular and physiological responses to titanium dioxide ...
- Changes in tissue transcriptomes and productivity of Arabidopsis thaliana were investigated during exposure of plants to two widely-used engineered metal oxide nanoparticles, titanium dioxide (nano-titanium) and cerium dioxide (nano-cerium). Microarray analyses confirmed that exposure to either nanoparticle altered the transcriptomes of rosette leaves and roots, with comparatively larger numbers of differentially expressed genes (DEGs) found under nano-titania exposure. Nano-titania induced more DEGs in rosette leaves, whereas roots had more DEGs under nano-ceria exposure. MapMan analyses indicated that while nano-titania up-regulated overall and secondary metabolism in both tissues, metabolic processes under nano-ceria remained mostly unchanged. Gene enrichment analysis indicated that both nanoparticles mainly enriched ontology groups such as responses to stress (abiotic and biotic), and defense responses (pathogens), and responses to endogenous stimuli (hormones). Nano-titania specifically induced genes associated with photosynthesis, whereas nano-ceria induced expression of genes related to activating transcription factors, most notably those belonging to the ethylene responsive element binding protein family. Interestingly, there were also increased numbers of rosette leaves and plant biomass under nano-ceria exposure, but not under nano-titania. Other transcriptomic responses did not clearly relate to responses observed at the organism level. This may b
A biochemical landscape of A-to-I RNA editing in the human brain transcriptome
Sakurai, Masayuki; Ueda, Hiroki; Yano, Takanori; Okada, Shunpei; Terajima, Hideki; Mitsuyama, Toutai; Toyoda, Atsushi; Fujiyama, Asao; Kawabata, Hitomi; Suzuki, Tsutomu
2014-01-01
Inosine is an abundant RNA modification in the human transcriptome and is essential for many biological processes in modulating gene expression at the post-transcriptional level. Adenosine deaminases acting on RNA (ADARs) catalyze the hydrolytic deamination of adenosines to inosines (A-to-I editing) in double-stranded regions. We previously established a biochemical method called “inosine chemical erasing” (ICE) to directly identify inosines on RNA strands with high reliability. Here, we have applied the ICE method combined with deep sequencing (ICE-seq) to conduct an unbiased genome-wide screening of A-to-I editing sites in the transcriptome of human adult brain. Taken together with the sites identified by the conventional ICE method, we mapped 19,791 novel sites and newly found 1258 edited mRNAs, including 66 novel sites in coding regions, 41 of which cause altered amino acid assignment. ICE-seq detected novel editing sites in various repeat elements as well as in short hairpins. Gene ontology analysis revealed that these edited mRNAs are associated with transcription, energy metabolism, and neurological disorders, providing new insights into various aspects of human brain functions. PMID:24407955
Park, C Sehwan; Valomon, Amandine; Welzl, Hans
2015-01-01
Environmental enrichment has been reported to delay or restore age-related cognitive deficits, however, a mechanism to account for the cause and progression of normal cognitive decline and its preservation by environmental enrichment is lacking. Using genome-wide SAGE-Seq, we provide a global assessment of differentially expressed genes altered with age and environmental enrichment in the hippocampus. Qualitative and quantitative proteomics in naïve young and aged mice was used to further identify phosphorylated proteins differentially expressed with age. We found that increased expression of endogenous protein phosphatase-1 inhibitors in aged mice may be characteristic of long-term environmental enrichment and improved cognitive status. As such, hippocampus-dependent performances in spatial, recognition, and associative memories, which are sensitive to aging, were preserved by environmental enrichment and accompanied by decreased protein phosphatase activity. Age-associated phosphorylated proteins were also found to correspond to the functional categories of age-associated genes identified through transcriptome analysis. Together, this study provides a comprehensive map of the transcriptome and proteome in the aging brain, and elucidates endogenous protein phosphatase-1 inhibition as a potential means through which environmental enrichment may ameliorate age-related cognitive deficits.
Wei, Jiankai; Zhang, Xiaojun; Yu, Yang; Huang, Hao; Li, Fuhua; Xiang, Jianhai
2014-01-01
Penaeid shrimp has a distinctive metamorphosis stage during early development. Although morphological and biochemical studies about this ontogeny have been developed for decades, researches on gene expression level are still scarce. In this study, we have investigated the transcriptomes of five continuous developmental stages in Pacific white shrimp (Litopenaeus vannamei) with high throughput Illumina sequencing technology. The reads were assembled and clustered into 66,815 unigenes, of which 32,398 have putative homologues in nr database, 14,981 have been classified into diverse functional categories by Gene Ontology (GO) annotation and 26,257 have been associated with 255 pathways by KEGG pathway mapping. Meanwhile, the differentially expressed genes (DEGs) between adjacent developmental stages were identified and gene expression patterns were clustered. By GO term enrichment analysis, KEGG pathway enrichment analysis and functional gene profiling, the physiological changes during shrimp metamorphosis could be better understood, especially histogenesis, diet transition, muscle development and exoskeleton reconstruction. In conclusion, this is the first study that characterized the integrated transcriptomic profiles during early development of penaeid shrimp, and these findings will serve as significant references for shrimp developmental biology and aquaculture research. PMID:25197823
UniVIO: A Multiple Omics Database with Hormonome and Transcriptome Data from Rice
Sakurai, Tetsuya; Sakakibara, Hitoshi
2013-01-01
Plant hormones play important roles as signaling molecules in the regulation of growth and development by controlling the expression of downstream genes. Since the hormone signaling system represents a complex network involving functional cross-talk through the mutual regulation of signaling and metabolism, a comprehensive and integrative analysis of plant hormone concentrations and gene expression is important for a deeper understanding of hormone actions. We have developed a database named Uniformed Viewer for Integrated Omics (UniVIO: http://univio.psc.riken.jp/), which displays hormone-metabolome (hormonome) and transcriptome data in a single formatted (uniformed) heat map. At the present time, hormonome and transcriptome data obtained from 14 organ parts of rice plants at the reproductive stage and seedling shoots of three gibberellin signaling mutants are included in the database. The hormone concentration and gene expression data can be searched by substance name, probe ID, gene locus ID or gene description. A correlation search function has been implemented to enable users to obtain information of correlated substance accumulation and gene expression. In the correlation search, calculation method, range of correlation coefficient and plant samples can be selected freely. PMID:23314752
Epigenomic Reprogramming of Adult Cardiomyocyte-Derived Cardiac Progenitor Cells
Zhang, Yiqiang; Zhong, Jiang F; Qiu, Hongyu; Robb MacLellan, W.; Marbán, Eduardo; Wang, Charles
2015-01-01
It has been believed that mammalian adult cardiomyocytes (ACMs) are terminally-differentiated and are unable to proliferate. Recently, using a bi-transgenic ACM fate mapping mouse model and an in vitro culture system, we demonstrated that adult mouse cardiomyocytes were able to dedifferentiate into cardiac progenitor-like cells (CPCs). However, little is known about the molecular basis of their intrinsic cellular plasticity. Here we integrate single-cell transcriptome and whole-genome DNA methylation analyses to unravel the molecular mechanisms underlying the dedifferentiation and cell cycle reentry of mouse ACMs. Compared to parental cardiomyocytes, dedifferentiated mouse cardiomyocyte-derived CPCs (mCPCs) display epigenomic reprogramming with many differentially-methylated regions, both hypermethylated and hypomethylated, across the entire genome. Correlated well with the methylome, our transcriptomic data showed that the genes encoding cardiac structure and function proteins are remarkably down-regulated in mCPCs, while those for cell cycle, proliferation, and stemness are significantly up-regulated. In addition, implantation of mCPCs into infarcted mouse myocardium improves cardiac function with augmented left ventricular ejection fraction. Our study demonstrates that the cellular plasticity of mammalian cardiomyocytes is the result of a well-orchestrated epigenomic reprogramming and a subsequent global transcriptomic alteration. PMID:26657817
Zhu, Fengjiao; Yang, Zongying; Zhang, Yiliu; Hu, Kun; Fang, Wenhong
2017-01-01
Enrofloxacin is the most commonly used antibiotic to control diseases in aquatic animals caused by A. hydrophila. This study conducted de novo transcriptome sequencing and compared the global transcriptomes of enrofloxacin-resistant and enrofloxacin-susceptible strains. We got a total of 4,714 unigenes were assembled. Of these, 4,122 were annotated. A total of 3,280 unigenes were assigned to GO, 3,388 unigenes were classified into Cluster of Orthologous Groups of proteins (COG) using BLAST and BLAST2GO software, and 2,568 were mapped onto pathways using the Kyoto Encyclopedia of Gene and Genomes Pathway database. Furthermore, 218 unigenes were deemed to be DEGs. After enrofloxacin treatment, 135 genes were upregulated and 83 genes were downregulated. The GO terms biological process (126 genes) and metabolic process (136 genes) were the most enriched, and the terms for protein folding, response to stress, and SOS response were also significantly enriched. This study identified enrofloxacin treatment affects multiple biological functions of A. hydrophila. Enrofloxacin resistance in A. hydrophila is closely related to the reduction of intracellular drug accumulation caused by ABC transporters and increased expression of topoisomerase IV.
Yang, Zongying; Zhang, Yiliu; Hu, Kun; Fang, Wenhong
2017-01-01
Enrofloxacin is the most commonly used antibiotic to control diseases in aquatic animals caused by A. hydrophila. This study conducted de novo transcriptome sequencing and compared the global transcriptomes of enrofloxacin-resistant and enrofloxacin-susceptible strains. We got a total of 4,714 unigenes were assembled. Of these, 4,122 were annotated. A total of 3,280 unigenes were assigned to GO, 3,388 unigenes were classified into Cluster of Orthologous Groups of proteins (COG) using BLAST and BLAST2GO software, and 2,568 were mapped onto pathways using the Kyoto Encyclopedia of Gene and Genomes Pathway database. Furthermore, 218 unigenes were deemed to be DEGs. After enrofloxacin treatment, 135 genes were upregulated and 83 genes were downregulated. The GO terms biological process (126 genes) and metabolic process (136 genes) were the most enriched, and the terms for protein folding, response to stress, and SOS response were also significantly enriched. This study identified enrofloxacin treatment affects multiple biological functions of A. hydrophila. Enrofloxacin resistance in A. hydrophila is closely related to the reduction of intracellular drug accumulation caused by ABC transporters and increased expression of topoisomerase IV. PMID:28708867
Characterization of gonadal transcriptomes from the turbot (Scophthalmus maximus).
Hu, Yulong; Huang, Meng; Wang, Weiji; Guan, Jiantao; Kong, Jie
2016-01-01
The mechanisms underlying sexual reproduction and sex ratio determination remains unclear in turbot, a flatfish of great commercial value. And there is limited information in the turbot database regarding genes related to the reproductive system. Here, we conducted high-throughput transcriptome profiling of turbot gonad tissues to better understand their reproductive functions and to supply essential gene sequence information for marker-assisted selection programs in the turbot industry. In this study, two gonad libraries representing sex differences in Scophthalmus maximus yielded 453 818 high-quality reads that were assembled into 24 611 contigs and 33 713 singletons by using 454 pyrosequencing, 13 936 contigs and singletons (CS) of which were annotated using BLASTx. GO (Gene Ontology) and KEGG (Kyoto Encyclopedia of Genes and Genomes) pathway analyses revealed that various biological functions and processes were associated with many of the annotated CS. Expression analyses showed that 510 genes were differentially expressed in males versus females; 80% of these genes were annotated. In addition, 6484 and 6036 single nucleotide polymorphisms (SNPs) were identified in male and female libraries, respectively. This transcriptome resource will serve as the foundation for cDNA or SNP microarray construction, gene expression characterization, and sex-specific linkage mapping in turbot.
Evaluating kriging as a tool to improve moderate resolution maps of forest biomass
Elizabeth A. Freeman; Gretchen G. Moisen
2007-01-01
The USDA Forest Service, Forest Inventory and Analysis program (FIA) recently produced a nationwide map of forest biomass by modeling biomass collected on forest inventory plots as nonparametric functions of moderate resolution satellite data and other environmental variables using Cubist software. Efforts are underway to develop methods to enhance this initial map. We...
Real Time Monitoring of Flooding from Microwave Satellite Observations
NASA Technical Reports Server (NTRS)
Galantowicz, John F.; Frey, Herb (Technical Monitor)
2002-01-01
We have developed a new method for making high-resolution flood extent maps (e.g., at the 30-100 m scale of digital elevation models) in real-time from low-resolution (20-70 km) passive microwave observations. The method builds a "flood-potential" database from elevations and historic flood imagery and uses it to create a flood-extent map consistent with the observed open water fraction. Microwave radiometric measurements are useful for flood monitoring because they sense surface water in clear-or-cloudy conditions and can provide more timely data (e.g., compared to radars) from relatively wide swath widths and an increasing number of available platforms (DMSP, ADEOS-II, Terra, NPOESS, GPM). The chief disadvantages for flood mapping are the radiometers' low resolution and the need for local calibration of the relationship between radiances and open-water fraction. We present our method for transforming microwave sensor-scale open water fraction estimates into high-resolution flood extent maps and describe 30-day flood map sequences generated during a retrospective study of the 1993 Great Midwest Flood. We discuss the method's potential improvement through as yet unimplemented algorithm enhancements and expected advancements in microwave radiometry (e.g., improved resolution and atmospheric correction).
Avti, Pramod K; Hu, Song; Favazza, Christopher; Mikos, Antonios G; Jansen, John A; Shroyer, Kenneth R; Wang, Lihong V; Sitharaman, Balaji
2012-01-01
In the present study, the efficacy of multi-scale photoacoustic microscopy (PAM) was investigated to detect, map, and quantify trace amounts [nanograms (ng) to micrograms (µg)] of SWCNTs in a variety of histological tissue specimens consisting of cancer and benign tissue biopsies (histological specimens from implanted tissue engineering scaffolds). Optical-resolution (OR) and acoustic-resolution (AR)--Photoacoustic microscopy (PAM) was employed to detect, map and quantify the SWCNTs in a variety of tissue histological specimens and compared with other optical techniques (bright-field optical microscopy, Raman microscopy, near infrared (NIR) fluorescence microscopy). Both optical-resolution and acoustic-resolution PAM, allow the detection and quantification of SWCNTs in histological specimens with scalable spatial resolution and depth penetration. The noise-equivalent detection sensitivity to SWCNTs in the specimens was calculated to be as low as ∼7 pg. Image processing analysis further allowed the mapping, distribution, and quantification of the SWCNTs in the histological sections. The results demonstrate the potential of PAM as a promising imaging technique to detect, map, and quantify SWCNTs in histological specimens, and could complement the capabilities of current optical and electron microscopy techniques in the analysis of histological specimens containing SWCNTs.
Mapping Chinese tallow with color-infrared photography
Ramsey, Elijah W.; Nelson, G.A.; Sapkota, S.K.; Seeger, E.B.; Martella, K.D.
2002-01-01
Airborne color-infrared photography (CIR) (1:12,000 scale) was used to map localized occurrences of the widespread and aggressive Chinese tallow (Sapium sebiferum), an invasive species. Photography was collected during senescence when Chinese tallow's bright red leaves presented a high spectral contrast within the native bottomland hardwood and upland forests and marsh land-cover types. Mapped occurrences were conservative because not all senescing tallow leaves are bright red simultaneously. To simulate low spectral but high spatial resolution satellite/airborne image and digital video data, the CIR photography was transformed into raster images at spatial resolutions approximating 0.5 in and 1.0 m. The image data were then spectrally classified for the occurrence of bright red leaves associated with senescing Chinese tallow. Classification accuracies were greater than 95 percent at both spatial resolutions. There was no significant difference in either forest in the detection of tallow or inclusion of non-tallow trees associated with the two spatial resolutions. In marshes, slightly more tallow occurrences were mapped with the lower spatial resolution, but there were also more misclassifications of native land covers as tallow. Combining all land covers, there was no difference at detecting tallow occurrences (equal omission errors) between the two resolutions, but the higher spatial resolution was associated with less inclusion of non-tallow land covers as tallow (lower commission error). Overall, these results confirm that high spatial (???1 m) but low spectral resolution remote sensing data can be used for mapping Chinese tallow trees in dominant environments found in coastal and adjacent upland landscapes.
High Resolution Global Topography of Eros from NEAR Imaging and LIDAR Data
NASA Technical Reports Server (NTRS)
Gaskell, Robert W.; Konopliv, A.; Barnouin-Jha, O.; Scheeres, D.
2006-01-01
Principal Data Products: Ensemble of L-maps from SPC, Spacecraft state, Asteroid pole and rotation. Secondary Products: Global topography model, inertia tensor, gravity. Composite high resolution topography. Three dimensional image maps.
Anderson, David A; Walz, Marcus E; Weil, Ernesto; Tonellato, Peter; Smith, Matthew C
2016-01-01
Climate change-driven coral disease outbreaks have led to widespread declines in coral populations. Early work on coral genomics established that corals have a complex innate immune system, and whole-transcriptome gene expression studies have revealed mechanisms by which the coral immune system responds to stress and disease. The present investigation expands bioinformatic data available to study coral molecular physiology through the assembly and annotation of a reference transcriptome of the Caribbean reef-building coral, Orbicella faveolata. Samples were collected during a warm water thermal anomaly, coral bleaching event and Caribbean yellow band disease outbreak in 2010 in Puerto Rico. Multiplex sequencing of RNA on the Illumina GAIIx platform and de novo transcriptome assembly by Trinity produced 70,745,177 raw short-sequence reads and 32,463 O. faveolata transcripts, respectively. The reference transcriptome was annotated with gene ontologies, mapped to KEGG pathways, and a predicted proteome of 20,488 sequences was generated. Protein families and signaling pathways that are essential in the regulation of innate immunity across Phyla were investigated in-depth. Results were used to develop models of evolutionarily conserved Wnt, Notch, Rig-like receptor, Nod-like receptor, and Dicer signaling. O. faveolata is a coral species that has been studied widely under climate-driven stress and disease, and the present investigation provides new data on the genes that putatively regulate its immune system.
Walz, Marcus E.; Weil, Ernesto; Smith, Matthew C.
2016-01-01
Climate change-driven coral disease outbreaks have led to widespread declines in coral populations. Early work on coral genomics established that corals have a complex innate immune system, and whole-transcriptome gene expression studies have revealed mechanisms by which the coral immune system responds to stress and disease. The present investigation expands bioinformatic data available to study coral molecular physiology through the assembly and annotation of a reference transcriptome of the Caribbean reef-building coral, Orbicella faveolata. Samples were collected during a warm water thermal anomaly, coral bleaching event and Caribbean yellow band disease outbreak in 2010 in Puerto Rico. Multiplex sequencing of RNA on the Illumina GAIIx platform and de novo transcriptome assembly by Trinity produced 70,745,177 raw short-sequence reads and 32,463 O. faveolata transcripts, respectively. The reference transcriptome was annotated with gene ontologies, mapped to KEGG pathways, and a predicted proteome of 20,488 sequences was generated. Protein families and signaling pathways that are essential in the regulation of innate immunity across Phyla were investigated in-depth. Results were used to develop models of evolutionarily conserved Wnt, Notch, Rig-like receptor, Nod-like receptor, and Dicer signaling. O. faveolata is a coral species that has been studied widely under climate-driven stress and disease, and the present investigation provides new data on the genes that putatively regulate its immune system. PMID:26925311
Aballai, Víctor; Aedo, Jorge E; Maldonado, Jonathan; Bastias-Molina, Macarena; Silva, Herman; Meneses, Claudio; Boltaña, Sebastian; Reyes, Ariel; Molina, Alfredo; Valdés, Juan Antonio
2017-12-01
Stress is a primary contributing factor of fish disease and mortality in aquaculture. We have previously reported that the red cusk-eel (Genypterus chilensis), an important farmed marine fish, demonstrates a handling-stress response that results in increased juvenile mortality, which is mainly associated with skeletal muscle atrophy and liver steatosis. To better understand the systemic effects of stress on red cusk-eel immune-related gene expression, the present study assessed the transcriptomic head-kidney response to handling-stress. The RNA sequencing generated a total of 61,655,525 paired-end reads from control and stressed conditions. De novo assembly using the CLC Genomic Workbench produced 86,840 transcripts and created a reference transcriptome with a N50 of 1426bp. Reads mapped onto the assembled reference transcriptome resulted in the identification of 569 up-regulated and 513 down-regulated transcripts. Gene ontology enrichment analysis revealed a significant up-regulation of the biological processes, like response to stress, response to biotic stimulus, and immune response. Conversely, a significant down-regulation of biological processes is associated with metabolic processes. These results were validated by RT-qPCR analysis for nine candidate genes involved in the immune response. The present data demonstrated that short term stress promotes the immune innate response in the marine teleost G. chilensis. This study is an important step towards understanding the immune adaptive response to stress in non-model teleost species. Copyright © 2017 Elsevier Inc. All rights reserved.
Luck, Ashley N; Slatko, Barton E; Foster, Jeremy M
2017-01-01
Efficient transcriptomic sequencing of microbial mRNA derived from host-microbe associations is often compromised by the much lower relative abundance of microbial RNA in the mixed total RNA sample. One solution to this problem is to perform extensive sequencing until an acceptable level of transcriptome coverage is obtained. More cost-effective methods include use of prokaryotic and/or eukaryotic rRNA depletion strategies, sometimes in conjunction with depletion of polyadenylated eukaryotic mRNA. Here, we report use of Cappable-seq™ to specifically enrich, in a single step, Wolbachia endobacterial mRNA transcripts from total RNA prepared from the parasitic filarial nematode, Brugia malayi. The obligate Wolbachia endosymbiont is a proven drug target for many human filarial infections, yet the precise nature of its symbiosis with the nematode host is poorly understood. Insightful analysis of the expression levels of Wolbachia genes predicted to underpin the mutualistic association and of known drug target genes at different life cycle stages or in response to drug treatments is typically challenged by low transcriptomic coverage. Cappable-seq resulted in up to ~ 5-fold increase in the number of reads mapping to Wolbachia. On average, coverage of Wolbachia transcripts from B. malayi microfilariae was enriched ~40-fold by Cappable-seq. Additionally, this method has an additional benefit of selectively removing abundant prokaryotic ribosomal RNAs.The deeper microbial transcriptome sequencing afforded by Cappable-seq facilitates more detailed characterization of gene expression levels of pathogens and symbionts present in animal tissues.
De Novo Transcriptome Analysis of Allium cepa L. (Onion) Bulb to Identify Allergens and Epitopes
Rajkumar, Hemalatha; Ramagoni, Ramesh Kumar; Anchoju, Vijayendra Chary; Vankudavath, Raju Naik; Syed, Arshi Uz Zaman
2015-01-01
Allium cepa (onion) is a diploid plant with one of the largest nuclear genomes among all diploids. Onion is an example of an under-researched crop which has a complex heterozygous genome. There are no allergenic proteins and genomic data available for onions. This study was conducted to establish a transcriptome catalogue of onion bulb that will enable us to study onion related genes involved in medicinal use and allergies. Transcriptome dataset generated from onion bulb using the Illumina HiSeq 2000 technology showed a total of 99,074,309 high quality raw reads (~20 Gb). Based on sequence homology onion genes were categorized into 49 different functional groups. Most of the genes however, were classified under 'unknown' in all three gene ontology categories. Of the categorized genes, 61.2% showed metabolic functions followed by cellular components such as binding, cellular processes; catalytic activity and cell part. With BLASTx top hit analysis, a total of 2,511 homologous allergenic sequences were found, which had 37–100% similarity with 46 different types of allergens existing in the database. From the 46 contigs or allergens, 521 B-cell linear epitopes were identified using BepiPred linear epitope prediction tool. This is the first comprehensive insight into the transcriptome of onion bulb tissue using the NGS technology, which can be used to map IgE epitopes and prediction of structures and functions of various proteins. PMID:26284934
Ong, Wen Dee; Voo, Lok-Yung Christopher; Kumar, Vijay Subbiah
2012-01-01
Pineapple (Ananas comosus var. comosus), is an important tropical non-climacteric fruit with high commercial potential. Understanding the mechanism and processes underlying fruit ripening would enable scientists to enhance the improvement of quality traits such as, flavor, texture, appearance and fruit sweetness. Although, the pineapple is an important fruit, there is insufficient transcriptomic or genomic information that is available in public databases. Application of high throughput transcriptome sequencing to profile the pineapple fruit transcripts is therefore needed. To facilitate this, we have performed transcriptome sequencing of ripe yellow pineapple fruit flesh using Illumina technology. About 4.7 millions Illumina paired-end reads were generated and assembled using the Velvet de novo assembler. The assembly produced 28,728 unique transcripts with a mean length of approximately 200 bp. Sequence similarity search against non-redundant NCBI database identified a total of 16,932 unique transcripts (58.93%) with significant hits. Out of these, 15,507 unique transcripts were assigned to gene ontology terms. Functional annotation against Kyoto Encyclopedia of Genes and Genomes pathway database identified 13,598 unique transcripts (47.33%) which were mapped to 126 pathways. The assembly revealed many transcripts that were previously unknown. The unique transcripts derived from this work have rapidly increased of the number of the pineapple fruit mRNA transcripts as it is now available in public databases. This information can be further utilized in gene expression, genomics and other functional genomics studies in pineapple.
Ong, Wen Dee; Voo, Lok-Yung Christopher; Kumar, Vijay Subbiah
2012-01-01
Background Pineapple (Ananas comosus var. comosus), is an important tropical non-climacteric fruit with high commercial potential. Understanding the mechanism and processes underlying fruit ripening would enable scientists to enhance the improvement of quality traits such as, flavor, texture, appearance and fruit sweetness. Although, the pineapple is an important fruit, there is insufficient transcriptomic or genomic information that is available in public databases. Application of high throughput transcriptome sequencing to profile the pineapple fruit transcripts is therefore needed. Methodology/Principal Findings To facilitate this, we have performed transcriptome sequencing of ripe yellow pineapple fruit flesh using Illumina technology. About 4.7 millions Illumina paired-end reads were generated and assembled using the Velvet de novo assembler. The assembly produced 28,728 unique transcripts with a mean length of approximately 200 bp. Sequence similarity search against non-redundant NCBI database identified a total of 16,932 unique transcripts (58.93%) with significant hits. Out of these, 15,507 unique transcripts were assigned to gene ontology terms. Functional annotation against Kyoto Encyclopedia of Genes and Genomes pathway database identified 13,598 unique transcripts (47.33%) which were mapped to 126 pathways. The assembly revealed many transcripts that were previously unknown. Conclusions The unique transcripts derived from this work have rapidly increased of the number of the pineapple fruit mRNA transcripts as it is now available in public databases. This information can be further utilized in gene expression, genomics and other functional genomics studies in pineapple. PMID:23091603
FOLD-EM: automated fold recognition in medium- and low-resolution (4-15 Å) electron density maps.
Saha, Mitul; Morais, Marc C
2012-12-15
Owing to the size and complexity of large multi-component biological assemblies, the most tractable approach to determining their atomic structure is often to fit high-resolution radiographic or nuclear magnetic resonance structures of isolated components into lower resolution electron density maps of the larger assembly obtained using cryo-electron microscopy (cryo-EM). This hybrid approach to structure determination requires that an atomic resolution structure of each component, or a suitable homolog, is available. If neither is available, then the amount of structural information regarding that component is limited by the resolution of the cryo-EM map. However, even if a suitable homolog cannot be identified using sequence analysis, a search for structural homologs should still be performed because structural homology often persists throughout evolution even when sequence homology is undetectable, As macromolecules can often be described as a collection of independently folded domains, one way of searching for structural homologs would be to systematically fit representative domain structures from a protein domain database into the medium/low resolution cryo-EM map and return the best fits. Taken together, the best fitting non-overlapping structures would constitute a 'mosaic' backbone model of the assembly that could aid map interpretation and illuminate biological function. Using the computational principles of the Scale-Invariant Feature Transform (SIFT), we have developed FOLD-EM-a computational tool that can identify folded macromolecular domains in medium to low resolution (4-15 Å) electron density maps and return a model of the constituent polypeptides in a fully automated fashion. As a by-product, FOLD-EM can also do flexible multi-domain fitting that may provide insight into conformational changes that occur in macromolecular assemblies.
Electron-Beam Mapping of Vibrational Modes with Nanometer Spatial Resolution.
Dwyer, C; Aoki, T; Rez, P; Chang, S L Y; Lovejoy, T C; Krivanek, O L
2016-12-16
We demonstrate that a focused beam of high-energy electrons can be used to map the vibrational modes of a material with a spatial resolution of the order of one nanometer. Our demonstration is performed on boron nitride, a polar dielectric which gives rise to both localized and delocalized electron-vibrational scattering, either of which can be selected in our off-axial experimental geometry. Our experimental results are well supported by our calculations, and should reconcile current controversy regarding the spatial resolution achievable in vibrational mapping with focused electron beams.
Shi, Xun; Miller, Stephanie; Mwenda, Kevin; Onda, Akikazu; Reese, Judy; Onega, Tracy; Gui, Jiang; Karagas, Margret; Demidenko, Eugene; Moeschler, John
2013-09-06
Limited by data availability, most disease maps in the literature are for relatively large and subjectively-defined areal units, which are subject to problems associated with polygon maps. High resolution maps based on objective spatial units are needed to more precisely detect associations between disease and environmental factors. We propose to use a Restricted and Controlled Monte Carlo (RCMC) process to disaggregate polygon-level location data to achieve mapping aggregate data at an approximated individual level. RCMC assigns a random point location to a polygon-level location, in which the randomization is restricted by the polygon and controlled by the background (e.g., population at risk). RCMC allows analytical processes designed for individual data to be applied, and generates high-resolution raster maps. We applied RCMC to the town-level birth defect data for New Hampshire and generated raster maps at the resolution of 100 m. Besides the map of significance of birth defect risk represented by p-value, the output also includes a map of spatial uncertainty and a map of hot spots. RCMC is an effective method to disaggregate aggregate data. An RCMC-based disease mapping maximizes the use of available spatial information, and explicitly estimates the spatial uncertainty resulting from aggregation.
Geologic map of the Galaxias quadrangle (MTM 35217) of Mars
De Hon, Rene A.; Mouginis-Mark, Peter J.; Brick, Eugene E.
1999-01-01
The Galaxias region (MTM 35217) is one of a series of 1:500,000-scale science study areas on Mars sponsored by NASA's Planetary Geology and Geophysics Program. Situated near the northern limit of lava flows associated with Elysium Mons, this region includes a mixture of volcanic and nonvolcanic terrains. The region is also of interest for the fluvial systems that originate along the distal margins of the Elysium lava flows. Resolution of Viking Orbiter images used to prepare the base map ranges from 40 to 160 m/pixel. High-resolution frames (40 to 80 m/pixel) are found in the southeastern part of the map area and along the north edge of the quadrangle, but over half the quadrangle is included in medium-resolution frames (150 m/pixel). Two 8 m/pixel, very high resolution scenes are available (see fig. 1). Interpretation is complicated by variable resolution and sun angles that vary from east to west illumination on different images. Mapping methods and principles are adapted from those developed for lunar photogeologic mapping by Shoemaker and Hackman (1962), refined by Wilhelms (1972), and successfully applied by many workers to a variety of planetary surfaces. Mapping units are distinguished by topography and texture and are ranked by relative age on the basis of superposition and transection relations. Material units are assigned to time-stratigraphic systems defined by Scott and Carr (1978) and Tanaka (1986). This area is included within earlier maps that used Mariner 9 images at 1:5,000,000 scale (Elston, 1979) and globally at 1:25,000,000 scale (Scott and Carr, 1978). Regional maps based on the much higher resolutions of Viking Orbiter allowed more detailed discrimination of materials by Greeley and Guest (1987) at 1:15,000,000 scale and Tanaka and others (1992) at 1:5,000,000 scale. Some map units on this 1:500,000-scale map correspond to, or are partially equivalent to, units on the larger scale maps of Greeley and Guest (1987) and Tanaka and others (1992). Established terminology is used where feasible, but the scale of this map requires that some new units be introduced and that some previous terminology be redefined. Photogeologic methods are limited; therefore, more than one geologic explanation is given for some material units that do not readily lend themselves to an unequivocal interpretation.
Islam, Md S; Zeng, Linghe; Thyssen, Gregory N; Delhom, Christopher D; Kim, Hee Jin; Li, Ping; Fang, David D
2016-06-01
Three QTL regions controlling three fiber quality traits were validated and further fine-mapped with 27 new single nucleotide polymorphism (SNP) markers. Transcriptome analysis suggests that receptor-like kinases found within the validated QTLs are potential candidate genes responsible for superior fiber strength in cotton line MD52ne. Fiber strength, length, maturity and fineness determine the market value of cotton fibers and the quality of spun yarn. Cotton fiber strength has been recognized as a critical quality attribute in the modern textile industry. Fine mapping along with quantitative trait loci (QTL) validation and candidate gene prediction can uncover the genetic and molecular basis of fiber quality traits. Four previously-identified QTLs (qFBS-c3, qSFI-c14, qUHML-c14 and qUHML-c24) related to fiber bundle strength, short fiber index and fiber length, respectively, were validated using an F3 population that originated from a cross of MD90ne × MD52ne. A group of 27 new SNP markers generated from mapping-by-sequencing (MBS) were placed in QTL regions to improve and validate earlier maps. Our refined QTL regions spanned 4.4, 1.8 and 3.7 Mb of physical distance in the Gossypium raimondii reference genome. We performed RNA sequencing (RNA-seq) of 15 and 20 days post-anthesis fiber cells from MD52ne and MD90ne and aligned reads to the G. raimondii genome. The QTL regions contained 21 significantly differentially expressed genes (DEGs) between the two near-isogenic parental lines. SNPs that result in non-synonymous substitutions to amino acid sequences of annotated genes were identified within these DEGs, and mapped. Taken together, transcriptome and amino acid mutation analysis indicate that receptor-like kinase pathway genes are likely candidates for superior fiber strength and length in MD52ne. MBS along with RNA-seq demonstrated a powerful strategy to elucidate candidate genes for the QTLs that control complex traits in a complex genome like tetraploid upland cotton.
Satellite Snow-Cover Mapping: A Brief Review
NASA Technical Reports Server (NTRS)
Hall, Dorothy K.
1995-01-01
Satellite snow mapping has been accomplished since 1966, initially using data from the reflective part of the electromagnetic spectrum, and now also employing data from the microwave part of the spectrum. Visible and near-infrared sensors can provide excellent spatial resolution from space enabling detailed snow mapping. When digital elevation models are also used, snow mapping can provide realistic measurements of snow extent even in mountainous areas. Passive-microwave satellite data permit global snow cover to be mapped on a near-daily basis and estimates of snow depth to be made, but with relatively poor spatial resolution (approximately 25 km). Dense forest cover limits both techniques and optical remote sensing is limited further by cloudcover conditions. Satellite remote sensing of snow cover with imaging radars is still in the early stages of research, but shows promise at least for mapping wet or melting snow using C-band (5.3 GHz) synthetic aperture radar (SAR) data. Observing System (EOS) Moderate Resolution Imaging Spectroradiometer (MODIS) data beginning with the launch of the first EOS platform in 1998. Digital maps will be produced that will provide daily, and maximum weekly global snow, sea ice and lake ice cover at 1-km spatial resolution. Statistics will be generated on the extent and persistence of snow or ice cover in each pixel for each weekly map, cloudcover permitting. It will also be possible to generate snow- and ice-cover maps using MODIS data at 250- and 500-m resolution, and to study and map snow and ice characteristics such as albedo. been under development. Passive-microwave data offer the potential for determining not only snow cover, but snow water equivalent, depth and wetness under all sky conditions. A number of algorithms have been developed to utilize passive-microwave brightness temperatures to provide information on snow cover and water equivalent. The variability of vegetative Algorithms are being developed to map global snow and ice cover using Earth Algorithms to map global snow cover using passive-microwave data have also cover and of snow grain size, globally, limits the utility of a single algorithm to map global snow cover.
Automated high resolution mapping of coffee in Rwanda using an expert Bayesian network
NASA Astrophysics Data System (ADS)
Mukashema, A.; Veldkamp, A.; Vrieling, A.
2014-12-01
African highland agro-ecosystems are dominated by small-scale agricultural fields that often contain a mix of annual and perennial crops. This makes such systems difficult to map by remote sensing. We developed an expert Bayesian network model to extract the small-scale coffee fields of Rwanda from very high resolution data. The model was subsequently applied to aerial orthophotos covering more than 99% of Rwanda and on one QuickBird image for the remaining part. The method consists of a stepwise adjustment of pixel probabilities, which incorporates expert knowledge on size of coffee trees and fields, and on their location. The initial naive Bayesian network, which is a spectral-based classification, yielded a coffee map with an overall accuracy of around 50%. This confirms that standard spectral variables alone cannot accurately identify coffee fields from high resolution images. The combination of spectral and ancillary data (DEM and a forest map) allowed mapping of coffee fields and associated uncertainties with an overall accuracy of 87%. Aggregated to district units, the mapped coffee areas demonstrated a high correlation with the coffee areas reported in the detailed national coffee census of 2009 (R2 = 0.92). Unlike the census data our map provides high spatial resolution of coffee area patterns of Rwanda. The proposed method has potential for mapping other perennial small scale cropping systems in the East African Highlands and elsewhere.
Navigating 3D electron microscopy maps with EM-SURFER.
Esquivel-Rodríguez, Juan; Xiong, Yi; Han, Xusi; Guang, Shuomeng; Christoffer, Charles; Kihara, Daisuke
2015-05-30
The Electron Microscopy DataBank (EMDB) is growing rapidly, accumulating biological structural data obtained mainly by electron microscopy and tomography, which are emerging techniques for determining large biomolecular complex and subcellular structures. Together with the Protein Data Bank (PDB), EMDB is becoming a fundamental resource of the tertiary structures of biological macromolecules. To take full advantage of this indispensable resource, the ability to search the database by structural similarity is essential. However, unlike high-resolution structures stored in PDB, methods for comparing low-resolution electron microscopy (EM) density maps in EMDB are not well established. We developed a computational method for efficiently searching low-resolution EM maps. The method uses a compact fingerprint representation of EM maps based on the 3D Zernike descriptor, which is derived from a mathematical series expansion for EM maps that are considered as 3D functions. The method is implemented in a web server named EM-SURFER, which allows users to search against the entire EMDB in real-time. EM-SURFER compares the global shapes of EM maps. Examples of search results from different types of query structures are discussed. We developed EM-SURFER, which retrieves structurally relevant matches for query EM maps from EMDB within seconds. The unique capability of EM-SURFER to detect 3D shape similarity of low-resolution EM maps should prove invaluable in structural biology.
Baldrian, Petr; López-Mondéjar, Rubén
2014-02-01
Molecular methods for the analysis of biomolecules have undergone rapid technological development in the last decade. The advent of next-generation sequencing methods and improvements in instrumental resolution enabled the analysis of complex transcriptome, proteome and metabolome data, as well as a detailed annotation of microbial genomes. The mechanisms of decomposition by model fungi have been described in unprecedented detail by the combination of genome sequencing, transcriptomics and proteomics. The increasing number of available genomes for fungi and bacteria shows that the genetic potential for decomposition of organic matter is widespread among taxonomically diverse microbial taxa, while expression studies document the importance of the regulation of expression in decomposition efficiency. Importantly, high-throughput methods of nucleic acid analysis used for the analysis of metagenomes and metatranscriptomes indicate the high diversity of decomposer communities in natural habitats and their taxonomic composition. Today, the metaproteomics of natural habitats is of interest. In combination with advanced analytical techniques to explore the products of decomposition and the accumulation of information on the genomes of environmentally relevant microorganisms, advanced methods in microbial ecophysiology should increase our understanding of the complex processes of organic matter transformation.
Mapping day-of-burning with coarse-resolution satellite fire-detection data
Sean A. Parks
2014-01-01
Evaluating the influence of observed daily weather on observed fire-related effects (e.g. smoke production, carbon emissions and burn severity) often involves knowing exactly what day any given area has burned. As such, several studies have used fire progression maps  in which the perimeter of an actively burning fire is mapped at a fairly high temporal resolution -...
A Modeling Approach to Global Land Surface Monitoring with Low Resolution Satellite Imaging
NASA Technical Reports Server (NTRS)
Hlavka, Christine A.; Dungan, Jennifer; Livingston, Gerry P.; Gore, Warren J. (Technical Monitor)
1998-01-01
The effects of changing land use/land cover on global climate and ecosystems due to greenhouse gas emissions and changing energy and nutrient exchange rates are being addressed by federal programs such as NASA's Mission to Planet Earth (MTPE) and by international efforts such as the International Geosphere-Biosphere Program (IGBP). The quantification of these effects depends on accurate estimates of the global extent of critical land cover types such as fire scars in tropical savannas and ponds in Arctic tundra. To address the requirement for accurate areal estimates, methods for producing regional to global maps with satellite imagery are being developed. The only practical way to produce maps over large regions of the globe is with data of coarse spatial resolution, such as Advanced Very High Resolution Radiometer (AVHRR) weather satellite imagery at 1.1 km resolution or European Remote-Sensing Satellite (ERS) radar imagery at 100 m resolution. The accuracy of pixel counts as areal estimates is in doubt, especially for highly fragmented cover types such as fire scars and ponds. Efforts to improve areal estimates from coarse resolution maps have involved regression of apparent area from coarse data versus that from fine resolution in sample areas, but it has proven difficult to acquire sufficient fine scale data to develop the regression. A method for computing accurate estimates from coarse resolution maps using little or no fine data is therefore needed.
High-Resolution Underwater Mapping Using Side-Scan Sonar
2016-01-01
The goal of this study is to generate high-resolution sea floor maps using a Side-Scan Sonar(SSS). This is achieved by explicitly taking into account the SSS operation as follows. First, the raw sensor data is corrected by means of a physics-based SSS model. Second, the data is projected to the sea-floor. The errors involved in this projection are thoroughfully analysed. Third, a probabilistic SSS model is defined and used to estimate the probability of each sea-floor region to be observed. This probabilistic information is then used to weight the contribution of each SSS measurement to the map. Because of these models, arbitrary map resolutions can be achieved, even beyond the sensor resolution. Finally, a geometric map building method is presented and combined with the probabilistic approach. The resulting map is composed of two layers. The echo intensity layer holds the most likely echo intensities at each point in the sea-floor. The probabilistic layer contains information about how confident can the user or the higher control layers be about the echo intensity layer data. Experimental results have been conducted in a large subsea region. PMID:26821379
NASA Astrophysics Data System (ADS)
Ko, Tony H.; Hartl, Ingmar; Drexler, Wolfgang; Ghanta, Ravi K.; Fujimoto, James G.
2002-06-01
Quantitative, three-dimensional mapping of retinal architectural morphology was achieved using an ultrahigh resolution ophthalmic OCT system. This OCT system utilizes a broad bandwidth titanium-sapphire laser light source generating bandwidths of up to 300 nm near 800 nm center wavelength. The system enables real-time cross-sectional imaging of the retina with ~3 micrometers axial resolution. The macula and the papillomacular axis of a normal human subject were systematically mapped using a series of linear scans. Edge detection and segmentation algorithms were developed to quantify retinal and intraretinal thicknesses. Topographic mapping of the total retinal thickness and the total ganglion cell/inner plexiform layer thickness was achieved around the macula. A topographic mapping quantifying the progressive thickening of the nerve fiber layer (NFL) nasally approaching the optic disk was also demonstrated. The ability to create three-dimensional topographic mapping of retinal architectural morphology at ~3 micrometers axial resolution will be relevant for the diagnosis of many retinal diseases. The topographic quantification of these structures can serve as a powerful tool for developing algorithms and clinical scanning protocols for the screening and staging of ophthalmic diseases such as glaucoma.
Guitet, Stéphane; Hérault, Bruno; Molto, Quentin; Brunaux, Olivier; Couteron, Pierre
2015-01-01
Precise mapping of above-ground biomass (AGB) is a major challenge for the success of REDD+ processes in tropical rainforest. The usual mapping methods are based on two hypotheses: a large and long-ranged spatial autocorrelation and a strong environment influence at the regional scale. However, there are no studies of the spatial structure of AGB at the landscapes scale to support these assumptions. We studied spatial variation in AGB at various scales using two large forest inventories conducted in French Guiana. The dataset comprised 2507 plots (0.4 to 0.5 ha) of undisturbed rainforest distributed over the whole region. After checking the uncertainties of estimates obtained from these data, we used half of the dataset to develop explicit predictive models including spatial and environmental effects and tested the accuracy of the resulting maps according to their resolution using the rest of the data. Forest inventories provided accurate AGB estimates at the plot scale, for a mean of 325 Mg.ha-1. They revealed high local variability combined with a weak autocorrelation up to distances of no more than10 km. Environmental variables accounted for a minor part of spatial variation. Accuracy of the best model including spatial effects was 90 Mg.ha-1 at plot scale but coarse graining up to 2-km resolution allowed mapping AGB with accuracy lower than 50 Mg.ha-1. Whatever the resolution, no agreement was found with available pan-tropical reference maps at all resolutions. We concluded that the combined weak autocorrelation and weak environmental effect limit AGB maps accuracy in rainforest, and that a trade-off has to be found between spatial resolution and effective accuracy until adequate "wall-to-wall" remote sensing signals provide reliable AGB predictions. Waiting for this, using large forest inventories with low sampling rate (<0.5%) may be an efficient way to increase the global coverage of AGB maps with acceptable accuracy at kilometric resolution.
USDA-ARS?s Scientific Manuscript database
Theobroma cacao is a tree cultivated in the tropics around the world for its seeds that are the source of both chocolate and cocoa butter. The cacao genome sequencing project initiated as a collaboration between USDA, Mars, Inc. and IBM has generated a great deal of transcriptome and genome sequenc...
Digital transcriptome analysis of putative sex-determination genes in papaya (Carica papaya).
Urasaki, Naoya; Tarora, Kazuhiko; Shudo, Ayano; Ueno, Hiroki; Tamaki, Moritoshi; Miyagi, Norimichi; Adaniya, Shinichi; Matsumura, Hideo
2012-01-01
Papaya (Carica papaya) is a trioecious plant species that has male, female and hermaphrodite flowers on different plants. The primitive sex chromosomes genetically determine the sex of the papaya. Although draft sequences of the papaya genome are already available, the genes for sex determination have not been identified, likely due to the complicated structure of its sex-chromosome sequences. To identify the candidate genes for sex determination, we conducted a transcriptome analysis of flower samples from male, female and hermaphrodite plants using high-throughput SuperSAGE for digital gene expression analysis. Among the short sequence tags obtained from the transcripts, 312 unique tags were specifically mapped to the primitive sex chromosome (X or Y(h)) sequences. An annotation analysis revealed that retroelements are the most abundant sequences observed in the genes corresponding to these tags. The majority of tags on the sex chromosomes were located on the X chromosome, and only 30 tags were commonly mapped to both the X and Y(h) chromosome, implying a loss of many genes on the Y(h) chromosome. Nevertheless, candidate Y(h) chromosome-specific female determination genes, including a MADS-box gene, were identified. Information on these sex chromosome-specific expressed genes will help elucidating sex determination in the papaya.
Cui, Lipeng; Qiu, Zhengkun; Wang, Zhirong; Gao, Jianchang; Guo, Yanmei; Huang, Zejun; Du, Yongchen; Wang, Xiaoxuan
2017-01-01
The hydrophobic cuticle that covers the surface of tomato (Solanum lycopersicum) fruit plays key roles in development and protection against biotic and abiotic stresses, including water loss, mechanical damage, UV radiation, pathogens, and pests. However, many details of the genes and regulatory mechanisms involved in cuticle biosynthesis in fleshy fruits are not well understood. In this study, we describe a novel tomato fruit phenotype, characterized by epidermal reticulation (ER) of green fruit and a higher water loss rate than wild type (WT) fruit. The ER phenotype is controlled by a single gene, ER4.1, derived from an introgressed chromosomal segment from the wild tomato species S. pennellii (LA0716). We performed fine mapping of the single dominant gene to an ~300 kb region and identified Solyc04g082540, Solyc04g082950, Solyc04g082630, and Solyc04g082910as potential candidate genes for the ER4.1 locus, based on comparative RNA-seq analysis of ER and WT fruit peels. In addition, the transcriptome analysis revealed that the expression levels of genes involved in cutin, wax and flavonoid biosynthesis were altered in the ER fruit compared with WT. This study provides new insights into the regulatory mechanisms and metabolism of the fruit cuticle. PMID:28798753
Digital Transcriptome Analysis of Putative Sex-Determination Genes in Papaya (Carica papaya)
Urasaki, Naoya; Tarora, Kazuhiko; Shudo, Ayano; Ueno, Hiroki; Tamaki, Moritoshi; Miyagi, Norimichi; Adaniya, Shinichi; Matsumura, Hideo
2012-01-01
Papaya (Carica papaya) is a trioecious plant species that has male, female and hermaphrodite flowers on different plants. The primitive sex chromosomes genetically determine the sex of the papaya. Although draft sequences of the papaya genome are already available, the genes for sex determination have not been identified, likely due to the complicated structure of its sex-chromosome sequences. To identify the candidate genes for sex determination, we conducted a transcriptome analysis of flower samples from male, female and hermaphrodite plants using high-throughput SuperSAGE for digital gene expression analysis. Among the short sequence tags obtained from the transcripts, 312 unique tags were specifically mapped to the primitive sex chromosome (X or Yh) sequences. An annotation analysis revealed that retroelements are the most abundant sequences observed in the genes corresponding to these tags. The majority of tags on the sex chromosomes were located on the X chromosome, and only 30 tags were commonly mapped to both the X and Yh chromosome, implying a loss of many genes on the Yh chromosome. Nevertheless, candidate Yh chromosome-specific female determination genes, including a MADS-box gene, were identified. Information on these sex chromosome-specific expressed genes will help elucidating sex determination in the papaya. PMID:22815863
Proteogenomics Dashboard for the Human Proteome Project.
Tabas-Madrid, Daniel; Alves-Cruzeiro, Joao; Segura, Victor; Guruceaga, Elizabeth; Vialas, Vital; Prieto, Gorka; García, Carlos; Corrales, Fernando J; Albar, Juan Pablo; Pascual-Montano, Alberto
2015-09-04
dasHPPboard is a novel proteomics-based dashboard that collects and reports the experiments produced by the Spanish Human Proteome Project consortium (SpHPP) and aims to help HPP to map the entire human proteome. We have followed the strategy of analog genomics projects like the Encyclopedia of DNA Elements (ENCODE), which provides a vast amount of data on human cell lines experiments. The dashboard includes results of shotgun and selected reaction monitoring proteomics experiments, post-translational modifications information, as well as proteogenomics studies. We have also processed the transcriptomics data from the ENCODE and Human Body Map (HBM) projects for the identification of specific gene expression patterns in different cell lines and tissues, taking special interest in those genes having little proteomic evidence available (missing proteins). Peptide databases have been built using single nucleotide variants and novel junctions derived from RNA-Seq data that can be used in search engines for sample-specific protein identifications on the same cell lines or tissues. The dasHPPboard has been designed as a tool that can be used to share and visualize a combination of proteomic and transcriptomic data, providing at the same time easy access to resources for proteogenomics analyses. The dasHPPboard can be freely accessed at: http://sphppdashboard.cnb.csic.es.
NASA Astrophysics Data System (ADS)
Khandelwal, A.; Karpatne, A.; Kumar, V.
2017-12-01
In this paper, we present novel methods for producing surface water maps at 30 meter spatial resolution at a daily temporal resolution. These new methods will make use of the MODIS spectral data from Terra (available daily since 2000) to produce daily maps at 250 meter and 500 meter resolution, and then refine them using the relative elevation ordering of pixels at 30 meter resolution. The key component of these methods is the use of elevation structure (relative elevation ordering) of a water body. Elevation structure is not explicitly available at desired resolution for most water bodies in the world and hence it will be estimated using our previous work that uses the history of imperfect labels. In this paper, we will present a new technique that uses elevation structure (unlike existing pixel based methods) to enforce temporal consistency in surface water extents (lake area on nearby dates is likely to be very similar). This will greatly improve the quality of the MODIS scale land/water labels since daily MODIS data can have a large amount of missing (or poor quality) data due to clouds and other factors. The quality of these maps will be further improved using elevation based resolution refinement approach that will make use of elevation structure estimated at Landsat scale. With the assumption that elevation structure does not change over time, it provides a very effective way to transfer information between datasets even when they are not observed concurrently. In this work, we will derive elevation structure at Landsat scale from monthly water extent maps spanning 1984-2015, publicly available through a joint effort of Google Earth Engine and the European Commission's Joint Research Centre (JRC). This elevation structure will then be used to refine spatial resolution of Modis scale maps from 2000 onwards. We will present the analysis of these methods on a large and diverse set of water bodies across the world.
NASA Astrophysics Data System (ADS)
Broich, Mark
Humid tropical forest cover loss is threatening the sustainability of ecosystem goods and services as vast forest areas are rapidly cleared for industrial scale agriculture and tree plantations. Despite the importance of humid tropical forest in the provision of ecosystem services and economic development opportunities, the spatial and temporal distribution of forest cover loss across large areas is not well quantified. Here I improve the quantification of humid tropical forest cover loss using two remote sensing-based methods: sampling and wall-to-wall mapping. In all of the presented studies, the integration of coarse spatial, high temporal resolution data with moderate spatial, low temporal resolution data enable advances in quantifying forest cover loss in the humid tropics. Imagery from the Moderate Resolution Imaging Spectroradiometer (MODIS) are used as the source of coarse spatial resolution, high temporal resolution data and imagery from the Landsat Enhanced Thematic Mapper Plus (ETM+) sensor are used as the source of moderate spatial, low temporal resolution data. In a first study, I compare the precision of different sampling designs for the Brazilian Amazon using the annual deforestation maps derived by the Brazilian Space Agency for reference. I show that sampling designs can provide reliable deforestation estimates; furthermore, sampling designs guided by MODIS data can provide more efficient estimates than the systematic design used for the United Nations Food and Agricultural Organization Forest Resource Assessment 2010. Sampling approaches, such as the one demonstrated, are viable in regions where data limitations, such as cloud contamination, limit exhaustive mapping methods. Cloud-contaminated regions experiencing high rates of change include Insular Southeast Asia, specifically Indonesia and Malaysia. Due to persistent cloud cover, forest cover loss in Indonesia has only been mapped at a 5-10 year interval using photo interpretation of single best Landsat images. Such an approach does not provide timely results, and cloud cover reduces the utility of map outputs. In a second study, I develop a method to exhaustively mine the recently opened Landsat archive for cloud-free observations and automatically map forest cover loss for Sumatra and Kalimantan for the 2000-2005 interval. In a comparison with a reference dataset consisting of 64 Landsat sample blocks, I show that my method, using per pixel time-series, provides more accurate forest cover loss maps for multiyear intervals than approaches using image composites. In a third study, I disaggregate Landsat-mapped forest cover loss, mapped over a multiyear interval, by year using annual forest cover loss maps generated from coarse spatial, high temporal resolution MODIS imagery. I further disaggregate and analyze forest cover loss by forest land use, and provinces. Forest cover loss trends show high spatial and temporal variability. These results underline the importance of annual mapping for the quantification of forest cover loss in Indonesia, specifically in the light of the developing Reducing Emissions from Deforestation and Forest Degradation in Developing Countries policy framework (REDD). All three studies highlight the advances in quantifying forest cover loss in the humid tropics made by integrating coarse spatial, high temporal resolution data with moderate spatial, low temporal resolution data. The three methods presented can be combined into an integrated monitoring strategy.
Brownstein, John S; Cassa, Christopher A; Kohane, Isaac S; Mandl, Kenneth D
2006-12-08
Widespread availability of geographic information systems software has facilitated the use of disease mapping in academia, government and private sector. Maps that display the address of affected patients are often exchanged in public forums, and published in peer-reviewed journal articles. As previously reported, a search of figure legends in five major medical journals found 19 articles from 1994-2004 that identify over 19,000 patient addresses. In this report, a method is presented to evaluate whether patient privacy is being breached in the publication of low-resolution disease maps. To demonstrate the effect, a hypothetical low-resolution map of geocoded patient addresses was created and the accuracy with which patient addresses can be resolved is described. Through georeferencing and unsupervised classification of the original image, the method precisely re-identified 26% (144/550) of the patient addresses from a presentation quality map and 79% (432/550) from a publication quality map. For the presentation quality map, 99.8% of the addresses were within 70 meters (approximately one city block length) of the predicted patient location, 51.6% of addresses were identified within five buildings, 70.7% within ten buildings and 93% within twenty buildings. For the publication quality map, all addresses were within 14 meters and 11 buildings of the predicted patient location. This study demonstrates that lowering the resolution of a map displaying geocoded patient addresses does not sufficiently protect patient addresses from re-identification. Guidelines to protect patient privacy, including those of medical journals, should reflect policies that ensure privacy protection when spatial data are displayed or published.
Bidard, Frédérique; Coppin, Evelyne; Silar, Philippe
2012-08-01
Transcription pattern during mycelium growth of Podospora anserina was assayed by microarray analysis in wild type and in mutants affected in the MAP kinase genes PaMpk1 and PaMpk2 and in the NADPH oxidase gene PaNox1. 15% of the genes have their expression modified by a factor two or more as growth proceeds in wild type. The genes whose expression is modified during growth in P. anserina are either not conserved or differently regulated in Neurospora crassa and Aspergillus niger, two fungi for which transcriptome data during growth are available. The P. anserina mutants display a similar alteration of their transcriptome profile, with nearly 1000 genes affected similarly in the three mutants, accounting for their similar growth phenotypes. Yet, each mutant has its specific set of modified transcripts, in line with particular phenotypes exhibited by each mutant. Again, there is limited conservation during evolution of the genes regulated at the transcription level by MAP kinases, as indicated by the comparison the P. anserina data, with those of Aspergillus fumigatus and N. crassa, two fungi for which gene expression data are available for mutants of the MAPK pathways. Among the genes regulated in wild type and affected in the mutants, those involved in carbohydrate and secondary metabolisms appear prominent. The vast majority of the genes differentially expressed are of unknown function. Availability of their transcription profile at various stages of development should help to decipher their role in fungal physiology and development. Copyright © 2012 Elsevier Inc. All rights reserved.
Robust isotropic super-resolution by maximizing a Laplace posterior for MRI volumes
NASA Astrophysics Data System (ADS)
Han, Xian-Hua; Iwamoto, Yutaro; Shiino, Akihiko; Chen, Yen-Wei
2014-03-01
Magnetic resonance imaging can only acquire volume data with finite resolution due to various factors. In particular, the resolution in one direction (such as the slice direction) is much lower than others (such as the in-plane direction), yielding un-realistic visualizations. This study explores to reconstruct MRI isotropic resolution volumes from three orthogonal scans. This proposed super- resolution reconstruction is formulated as a maximum a posterior (MAP) problem, which relies on the generation model of the acquired scans from the unknown high-resolution volumes. Generally, the deviation ensemble of the reconstructed high-resolution (HR) volume from the available LR ones in the MAP is represented as a Gaussian distribution, which usually results in some noise and artifacts in the reconstructed HR volume. Therefore, this paper investigates a robust super-resolution by formulating the deviation set as a Laplace distribution, which assumes sparsity in the deviation ensemble based on the possible insight of the appeared large values only around some unexpected regions. In addition, in order to achieve reliable HR MRI volume, we integrates the priors such as bilateral total variation (BTV) and non-local mean (NLM) into the proposed MAP framework for suppressing artifacts and enriching visual detail. We validate the proposed robust SR strategy using MRI mouse data with high-definition resolution in two direction and low-resolution in one direction, which are imaged in three orthogonal scans: axial, coronal and sagittal planes. Experiments verifies that the proposed strategy can achieve much better HR MRI volumes than the conventional MAP method even with very high-magnification factor: 10.
Spatial and spectral resolution necessary for remotely sensed vegetation studies
NASA Technical Reports Server (NTRS)
Rock, B. N.
1982-01-01
An outline is presented of the required spatial and spectral resolution needed for accurate vegetation discrimination and mapping studies as well as for determination of state of health (i.e., detection of stress symptoms) of actively growing vegetation. Good success was achieved in vegetation discrimination and mapping of a heterogeneous forest cover in the ridge and valley portion of the Appalachians using multispectral data acquired with a spatial resolution of 15 m (IFOV). A sensor system delivering 10 to 15 m spatial resolution is needed for both vegetation mapping and detection of stress symptoms. Based on the vegetation discrimination and mapping exercises conducted at the Lost River site, accurate products (vegetation maps) are produced using broad-band spectral data ranging from the .500 to 2.500 micron portion of the spectrum. In order of decreasing utility for vegetation discrimination, the four most valuable TM simulator VNIR bands are: 6 (1.55 to 1.75 microns), 3 (0.63 to 0.69 microns), 5 (1.00 to 1.30 microns) and 4 (0.76 to 0.90 microns).
Savelyev, Alexander; Sugumaran, Ramanathan
2008-01-01
The goal of this project was to map the surface temperature of the University of Northern Iowa campus using high-resolution thermal infrared aerial imageries. A thermal camera with a spectral bandwidth of 3.0-5.0 μm was flown at the average altitude of 600 m, achieving ground resolution of 29 cm. Ground control data was used to construct the pixel- to-temperature conversion model, which was later used to produce temperature maps of the entire campus and also for validation of the model. The temperature map then was used to assess the building rooftop conditions and steam line faults in the study area. Assessment of the temperature map revealed a number of building structures that may be subject to insulation improvement due to their high surface temperatures leaks. Several hot spots were also identified on the campus for steam pipelines faults. High-resolution thermal infrared imagery proved highly effective tool for precise heat anomaly detection on the campus, and it can be used by university facility services for effective future maintenance of buildings and grounds. PMID:27873800
Recent Advancements in DNA Damage-Transcription Crosstalk and High-Resolution Mapping of DNA Breaks.
Vitelli, Valerio; Galbiati, Alessandro; Iannelli, Fabio; Pessina, Fabio; Sharma, Sheetal; d'Adda di Fagagna, Fabrizio
2017-08-31
Until recently, DNA damage arising from physiological DNA metabolism was considered a detrimental by-product for cells. However, an increasing amount of evidence has shown that DNA damage could have a positive role in transcription activation. In particular, DNA damage has been detected in transcriptional elements following different stimuli. These physiological DNA breaks are thought to be instrumental for the correct expression of genomic loci through different mechanisms. In this regard, although a plethora of methods are available to precisely map transcribed regions and transcription start sites, commonly used techniques for mapping DNA breaks lack sufficient resolution and sensitivity to draw a robust correlation between DNA damage generation and transcription. Recently, however, several methods have been developed to map DNA damage at single-nucleotide resolution, thus providing a new set of tools to correlate DNA damage and transcription. Here, we review how DNA damage can positively regulate transcription initiation, the current techniques for mapping DNA breaks at high resolution, and how these techniques can benefit future studies of DNA damage and transcription.
Brodusch, N; Demers, H; Gauvin, R
2013-04-01
A charge-coupled device camera of an electron backscattered diffraction system in a scanning electron microscope was positioned below a thin specimen and transmission Kikuchi patterns were collected. Contrary to electron backscattered diffraction, transmission electron forward scatter diffraction provides phase identification and orientation mapping at the nanoscale. The minimum Pd particle size for which a Kikuchi diffraction pattern was detected and indexed reliably was 5.6 nm. An orientation mapping resolution of 5 nm was measured at 30 kV. The resolution obtained with transmission electron forward scatter diffraction was of the same order of magnitude than that reported in electron nanodiffraction in the transmission electron microscope. An energy dispersive spectrometer X-ray map and a transmission electron forward scatter diffraction orientation map were acquired simultaneously. The high-resolution chemical, phase and orientation maps provided at once information on the chemical form, orientation and coherency of precipitates in an aluminium-lithium 2099 alloy. © 2013 The Authors Journal of Microscopy © 2013 Royal Microscopical Society.
NaviCell Web Service for network-based data visualization.
Bonnet, Eric; Viara, Eric; Kuperstein, Inna; Calzone, Laurence; Cohen, David P A; Barillot, Emmanuel; Zinovyev, Andrei
2015-07-01
Data visualization is an essential element of biological research, required for obtaining insights and formulating new hypotheses on mechanisms of health and disease. NaviCell Web Service is a tool for network-based visualization of 'omics' data which implements several data visual representation methods and utilities for combining them together. NaviCell Web Service uses Google Maps and semantic zooming to browse large biological network maps, represented in various formats, together with different types of the molecular data mapped on top of them. For achieving this, the tool provides standard heatmaps, barplots and glyphs as well as the novel map staining technique for grasping large-scale trends in numerical values (such as whole transcriptome) projected onto a pathway map. The web service provides a server mode, which allows automating visualization tasks and retrieving data from maps via RESTful (standard HTTP) calls. Bindings to different programming languages are provided (Python and R). We illustrate the purpose of the tool with several case studies using pathway maps created by different research groups, in which data visualization provides new insights into molecular mechanisms involved in systemic diseases such as cancer and neurodegenerative diseases. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.
NaviCell Web Service for network-based data visualization
Bonnet, Eric; Viara, Eric; Kuperstein, Inna; Calzone, Laurence; Cohen, David P. A.; Barillot, Emmanuel; Zinovyev, Andrei
2015-01-01
Data visualization is an essential element of biological research, required for obtaining insights and formulating new hypotheses on mechanisms of health and disease. NaviCell Web Service is a tool for network-based visualization of ‘omics’ data which implements several data visual representation methods and utilities for combining them together. NaviCell Web Service uses Google Maps and semantic zooming to browse large biological network maps, represented in various formats, together with different types of the molecular data mapped on top of them. For achieving this, the tool provides standard heatmaps, barplots and glyphs as well as the novel map staining technique for grasping large-scale trends in numerical values (such as whole transcriptome) projected onto a pathway map. The web service provides a server mode, which allows automating visualization tasks and retrieving data from maps via RESTful (standard HTTP) calls. Bindings to different programming languages are provided (Python and R). We illustrate the purpose of the tool with several case studies using pathway maps created by different research groups, in which data visualization provides new insights into molecular mechanisms involved in systemic diseases such as cancer and neurodegenerative diseases. PMID:25958393
Bogun, Frank; Taj, Majid; Ting, Michael; Kim, Hyungjin Myra; Reich, Stephen; Good, Eric; Jongnarangsin, Krit; Chugh, Aman; Pelosi, Frank; Oral, Hakan; Morady, Fred
2008-03-01
Pace mapping has been used to identify the site of origin of focal ventricular arrhythmias. The spatial resolution of pace mapping has not been adequately quantified using currently available three-dimensional mapping systems. The purpose of this study was to determine the spatial resolution of pace mapping in patients with idiopathic ventricular tachycardia or premature ventricular contractions originating in the right ventricular outflow tract. In 16 patients with idiopathic ventricular tachycardia/ectopy from the right ventricular outflow tract, comparisons and classifications of pace maps were performed by two observers (good pace map: match >10/12 leads; inadequate pace map: match < or =10/12 leads) and a customized MATLAB 6.0 program (assessing correlation coefficient and normalized root mean square of the difference (nRMSd) between test and template signals). With an electroanatomic mapping system, the correlation coefficient of each pace map was correlated with the distance between the pacing site and the effective ablation site. The endocardial area within the 10-ms activation isochrone was measured. The ablation procedure was effective in all patients. Sites with good pace maps had a higher correlation coefficient and lower nRMSd than sites with inadequate pace maps (correlation coefficient: 0.96 +/- 0.03 vs 0.76 +/- 0.18, P <.0001; nRMSd: 0.41 +/- 0.16 vs 0.89 +/- 0.39, P <.0001). Using receiver operating characteristic curves, appropriate cutoff values were >0.94 for correlation coefficient (sensitivity 81%, specificity 89%) and < or =0.54 for nRMSd (sensitivity 76%, specificity 80%). Good pace maps were located a mean of 7.3 +/- 5.0 mm from the effective ablation site and had a mean activation time of -24 +/- 7 ms. However, in 3 (18%) of 16 patients, the best pace map was inadequate at the effective ablation site, with an endocardial activation time at these sites of -25 +/- 12 ms. Pace maps with correlation coefficient > or =0.94 were confined to an area of 1.8 +/- 0.6 cm2. The 10-ms isochrone measured 1.2 +/- 0.7 cm2. The spatial resolution of a good pace map for targeting ventricular tachycardia/ectopy is 1.8 cm2 in the right ventricular outflow tract and therefore is inferior to the spatial resolution of activation mapping as assessed by isochronal activation. In approximately 20% of patients, pace mapping is unreliable in identifying the site of origin, possibly due a deeper site of origin and preferential conduction via fibers connecting the focus to the endocardial surface.
Can Satellite Remote Sensing be Applied in Geological Mapping in Tropics?
NASA Astrophysics Data System (ADS)
Magiera, Janusz
2018-03-01
Remote sensing (RS) techniques are based on spectral data registered by RS scanners as energy reflected from the Earth's surface or emitted by it. In "geological" RS the reflectance (or emittence) should come from rock or sediment. The problem in tropical and subtropical areas is a dense vegetation. Spectral response from the rocks and sediments is gathered only from the gaps among the trees and shrubs. Images of high resolution are appreciated here, therefore. New generation of satellites and scanners (Digital Globe WV2, WV3 and WV4) yield imagery of spatial resolution of 2 m and up to 16 spectral bands (WV3). Images acquired by Landsat (TM, ETM+, OLI) and Sentinel 2 have good spectral resolution too (6-12 bands in visible and infrared) and, despite lower spatial resolution (10-60 m of pixel size) are useful in extracting lithological information too. Lithological RS map may reveal good precision (down to a single rock or outcrop of a meter size). Supplemented with the analysis of Digital Elevation Model and high resolution ortophotomaps (Google Maps, Bing etc.) allows for quick and cheap mapping of unsurveyed areas.
GDR (Genome Database for Rosaceae): integrated web-database for Rosaceae genomics and genetics data
Jung, Sook; Staton, Margaret; Lee, Taein; Blenda, Anna; Svancara, Randall; Abbott, Albert; Main, Dorrie
2008-01-01
The Genome Database for Rosaceae (GDR) is a central repository of curated and integrated genetics and genomics data of Rosaceae, an economically important family which includes apple, cherry, peach, pear, raspberry, rose and strawberry. GDR contains annotated databases of all publicly available Rosaceae ESTs, the genetically anchored peach physical map, Rosaceae genetic maps and comprehensively annotated markers and traits. The ESTs are assembled to produce unigene sets of each genus and the entire Rosaceae. Other annotations include putative function, microsatellites, open reading frames, single nucleotide polymorphisms, gene ontology terms and anchored map position where applicable. Most of the published Rosaceae genetic maps can be viewed and compared through CMap, the comparative map viewer. The peach physical map can be viewed using WebFPC/WebChrom, and also through our integrated GDR map viewer, which serves as a portal to the combined genetic, transcriptome and physical mapping information. ESTs, BACs, markers and traits can be queried by various categories and the search result sites are linked to the mapping visualization tools. GDR also provides online analysis tools such as a batch BLAST/FASTA server for the GDR datasets, a sequence assembly server and microsatellite and primer detection tools. GDR is available at http://www.rosaceae.org. PMID:17932055
NASA Astrophysics Data System (ADS)
Gries, Philipp; Funke, Lisa-Marie; Baumann, Frank; Schmidt, Karsten; Behrens, Thorsten; Scholten, Thomas
2016-04-01
Climate change, increase in population and intensification of land use pose a great challenge for sustainable handling of soils. Intelligent landuse systems are able to minimize and/or avoid soil erosion and loss of soil fertility. A successful application of such systems requires area-wide soil information with high resolution. Containing three consecutive steps, the project INE-2-H („innovative sustainable landuse") at the University of Tuebingen is about creating high-resolution soil information using Digital Soil Mapping (DSM) techniques to develop sustainable landuse strategies. Input data includes soil data from fieldwork (texture and carbon content), the official digital soil and geological map (1:50.000) as well as a wide selection of local, complex and combined terrain parameters. First, soil maps have been created using the DSM approach and Random Forest (RF). Due to high resolution (10x10 m pixels), those maps show a more detailed spatial variability of soil information compared to the official maps used. Root mean square errors (RMSE) of the modelled maps vary from 2.11 % to 6.87 % and the coefficients of determination (R²) go from 0.42 to 0.68. Second, soil erosion potentials have been estimated according to the Universal Soil Loss Equation (USLE). Long-term average annual soil loss ranges from 0.56 to 24.23 [t/ha/a]. Third, combining high-resolution erosion potentials with expert-knowledge of local farmers will result in a landuse system adapted to local conditions. This system will include sustainable strategies reducing soil erosion and conserving soil fertility.
NASA Astrophysics Data System (ADS)
Nesbit, P. R.; Hugenholtz, C.; Durkin, P.; Hubbard, S. M.; Kucharczyk, M.; Barchyn, T.
2016-12-01
Remote sensing and digital mapping have started to revolutionize geologic mapping in recent years as a result of their realized potential to provide high resolution 3D models of outcrops to assist with interpretation, visualization, and obtaining accurate measurements of inaccessible areas. However, in stratigraphic mapping applications in complex terrain, it is difficult to acquire information with sufficient detail at a wide spatial coverage with conventional techniques. We demonstrate the potential of a UAV and Structure from Motion (SfM) photogrammetric approach for improving 3D stratigraphic mapping applications within a complex badland topography. Our case study is performed in Dinosaur Provincial Park (Alberta, Canada), mapping late Cretaceous fluvial meander belt deposits of the Dinosaur Park formation amidst a succession of steeply sloping hills and abundant drainages - creating a challenge for stratigraphic mapping. The UAV-SfM dataset (2 cm spatial resolution) is compared directly with a combined satellite and aerial LiDAR dataset (30 cm spatial resolution) to reveal advantages and limitations of each dataset before presenting a unique workflow that utilizes the dense point cloud from the UAV-SfM dataset for analysis. The UAV-SfM dense point cloud minimizes distortion, preserves 3D structure, and records an RGB attribute - adding potential value in future studies. The proposed UAV-SfM workflow allows for high spatial resolution remote sensing of stratigraphy in complex topographic environments. This extended capability can add value to field observations and has the potential to be integrated with subsurface petroleum models.
A multi-temporal analysis approach for land cover mapping in support of nuclear incident response
NASA Astrophysics Data System (ADS)
Sah, Shagan; van Aardt, Jan A. N.; McKeown, Donald M.; Messinger, David W.
2012-06-01
Remote sensing can be used to rapidly generate land use maps for assisting emergency response personnel with resource deployment decisions and impact assessments. In this study we focus on constructing accurate land cover maps to map the impacted area in the case of a nuclear material release. The proposed methodology involves integration of results from two different approaches to increase classification accuracy. The data used included RapidEye scenes over Nine Mile Point Nuclear Power Station (Oswego, NY). The first step was building a coarse-scale land cover map from freely available, high temporal resolution, MODIS data using a time-series approach. In the case of a nuclear accident, high spatial resolution commercial satellites such as RapidEye or IKONOS can acquire images of the affected area. Land use maps from the two image sources were integrated using a probability-based approach. Classification results were obtained for four land classes - forest, urban, water and vegetation - using Euclidean and Mahalanobis distances as metrics. Despite the coarse resolution of MODIS pixels, acceptable accuracies were obtained using time series features. The overall accuracies using the fusion based approach were in the neighborhood of 80%, when compared with GIS data sets from New York State. The classifications were augmented using this fused approach, with few supplementary advantages such as correction for cloud cover and independence from time of year. We concluded that this method would generate highly accurate land maps, using coarse spatial resolution time series satellite imagery and a single date, high spatial resolution, multi-spectral image.
Beyond Population Distribution: Enhancing Sociocultural Resolution from Remote Sensing
NASA Astrophysics Data System (ADS)
Bhaduri, B. L.; Rose, A.
2017-12-01
At Oak Ridge National Laboratory, since late 1990s, we have focused on developing high resolution population distribution and dynamics data from local to global scales. Increasing resolutions of geographic data has been mirrored by population data sets developed across the community. However, attempts to increase temporal and sociocultural resolutions have been limited given the lack of high resolution data on human settlements and activities. While recent advancements in moderate to high resolution earth observation have led to better physiographic data, the approach of exploiting very high resolution (sub-meter resolution) imagery has also proven useful for generating accurate human settlement maps. It allows potential (social and vulnerability) characterization of population from settlement structures by exploiting image texture and spectral features. Our recent research utilizing machine learning and geocomputation has not only validated "poverty mapping from imagery" hypothesis, but has delineated a new paradigm of rapid analysis of high resolution imagery to enhance such "neighborhood" mapping techniques. Such progress in GIScience is allowing us to move towards the goal of creating a global foundation level database for impervious surfaces and "neighborhoods," and holds tremendous promise for key applications focusing on sustainable development including many social science applications.
Li, Gang; Hillier, LaDeana W; Grahn, Robert A; Zimin, Aleksey V; David, Victor A; Menotti-Raymond, Marilyn; Middleton, Rondo; Hannah, Steven; Hendrickson, Sher; Makunin, Alex; O'Brien, Stephen J; Minx, Pat; Wilson, Richard K; Lyons, Leslie A; Warren, Wesley C; Murphy, William J
2016-06-01
High-resolution genetic and physical maps are invaluable tools for building accurate genome assemblies, and interpreting results of genome-wide association studies (GWAS). Previous genetic and physical maps anchored good quality draft assemblies of the domestic cat genome, enabling the discovery of numerous genes underlying hereditary disease and phenotypes of interest to the biomedical science and breeding communities. However, these maps lacked sufficient marker density to order thousands of shorter scaffolds in earlier assemblies, which instead relied heavily on comparative mapping with related species. A high-resolution map would aid in validating and ordering chromosome scaffolds from existing and new genome assemblies. Here, we describe a high-resolution genetic linkage map of the domestic cat genome based on genotyping 453 domestic cats from several multi-generational pedigrees on the Illumina 63K SNP array. The final maps include 58,055 SNP markers placed relative to 6637 markers with unique positions, distributed across all autosomes and the X chromosome. Our final sex-averaged maps span a total autosomal length of 4464 cM, the longest described linkage map for any mammal, confirming length estimates from a previous microsatellite-based map. The linkage map was used to order and orient the scaffolds from a substantially more contiguous domestic cat genome assembly (Felis catus v8.0), which incorporated ∼20 × coverage of Illumina fragment reads. The new genome assembly shows substantial improvements in contiguity, with a nearly fourfold increase in N50 scaffold size to 18 Mb. We use this map to report probable structural errors in previous maps and assemblies, and to describe features of the recombination landscape, including a massive (∼50 Mb) recombination desert (of virtually zero recombination) on the X chromosome that parallels a similar desert on the porcine X chromosome in both size and physical location. Copyright © 2016 Li et al.
High-resolution maps of H2 regions at far-infrared wavelengths. [balloon-borne cassegrain telescope
NASA Technical Reports Server (NTRS)
Fazio, G. G.; Kleinmann, D. E.; Noyes, R. W.; Wright, E. L.; Zeilik, M., II; Low, F. J.
1974-01-01
The first successful flight of a balloon-borne 1-m telescope for far-infrared (40 micron) astronomy occurred on 4 February 1974 (UT), from Palestine, Texas. During 6 h at float altitude, the gyrostabilized telescope mapped the intensity of far-infrared radiation from the H 2 regions Ori A and W3 with a resolution of 1 prime. Partial maps of these regions were made with a resolution of 0.5 prime. These sources were resolved into several components, some of which were previously unknown. Observations of Mars were used for calibration.
Mikos, Antonios G.; Jansen, John A.; Shroyer, Kenneth R.; Wang, Lihong V.; Sitharaman, Balaji
2012-01-01
Aims In the present study, the efficacy of multi-scale photoacoustic microscopy (PAM) was investigated to detect, map, and quantify trace amounts [nanograms (ng) to micrograms (µg)] of SWCNTs in a variety of histological tissue specimens consisting of cancer and benign tissue biopsies (histological specimens from implanted tissue engineering scaffolds). Materials and Methods Optical-resolution (OR) and acoustic-resolution (AR) - Photoacoustic microscopy (PAM) was employed to detect, map and quantify the SWCNTs in a variety of tissue histological specimens and compared with other optical techniques (bright-field optical microscopy, Raman microscopy, near infrared (NIR) fluorescence microscopy). Results Both optical-resolution and acoustic-resolution PAM, allow the detection and quantification of SWCNTs in histological specimens with scalable spatial resolution and depth penetration. The noise-equivalent detection sensitivity to SWCNTs in the specimens was calculated to be as low as ∼7 pg. Image processing analysis further allowed the mapping, distribution, and quantification of the SWCNTs in the histological sections. Conclusions The results demonstrate the potential of PAM as a promising imaging technique to detect, map, and quantify SWCNTs in histological specimens, and could complement the capabilities of current optical and electron microscopy techniques in the analysis of histological specimens containing SWCNTs. PMID:22496892
USDA-ARS?s Scientific Manuscript database
Genotyping by sequencing (GBS) technology was used to identify a set of 9,933 single nucleotide polymorphism (SNP) markers for constructing a high-resolution genetic map of 1,087 cM for watermelon. The genome-wide variation of recombination rate (GWRR) across the map was evaluated and a positive co...
Using High Spatial Resolution Digital Imagery
2005-02-01
digital base maps were high resolution U.S. Geological Survey (USGS) Digital Orthophoto Quarter Quadrangles (DOQQ). The Root Mean Square Errors (RMSE...next step was to assign real world coordinates to the linear im- age. The mosaics were geometrically registered to the panchromatic orthophotos ...useable thematic map from high-resolution imagery. A more practical approach may be to divide the Refuge into a set of smaller areas, or tiles
Gonadal Transcriptome Analysis of Male and Female Olive Flounder (Paralichthys olivaceus)
Fan, Zhaofei; You, Feng; Wang, Lijuan; Weng, Shenda; Wu, Zhihao; Hu, Jinwei; Zou, Yuxia; Tan, Xungang; Zhang, Peijun
2014-01-01
Olive flounder (Paralichthys olivaceus) is an important commercially cultured marine flatfish in China, Korea, and Japan, of which female grows faster than male. In order to explore the molecular mechanism of flounder sex determination and development, we used RNA-seq technology to investigate transcriptomes of flounder gonads. This produced 22,253,217 and 19,777,841 qualified reads from ovary and testes, which were jointly assembled into 97,233 contigs. Among them, 23,223 contigs were mapped to known genes, of which 2,193 were predicted to be differentially expressed in ovary and 887 in testes. According to annotation information, several sex-related biological pathways including ovarian steroidogenesis and estrogen signaling pathways were firstly found in flounder. The dimorphic expression of overall sex-related genes provides further insights into sex determination and gonadal development. Our study also provides an archive for further studies of molecular mechanism of fish sex determination. PMID:25121093
Johard, Helena; Mahdessian, Diana; Fedr, Radek; Marks, Carolyn; Medalová, Jiřina; Souček, Karel; Lundberg, Emma; Linnarsson, Sten; Bryja, Vítězslav; Sekyrova, Petra; Altun, Mikael; Andäng, Michael
2017-01-01
The cell cycle coordinates core functions such as replication and cell division. However, cell-cycle-regulated transcription in the control of non-core functions, such as cell identity maintenance through specific transcription factors (TFs) and signalling pathways remains unclear. Here, we provide a resource consisting of mapped transcriptomes in unsynchronized HeLa and U2OS cancer cells sorted for cell cycle phase by Fucci reporter expression. We developed a novel algorithm for data analysis that enables efficient visualization and data comparisons and identified cell cycle synchronization of Notch signalling and TFs associated with development. Furthermore, the cell cycle synchronizes with the circadian clock, providing a possible link between developmental transcriptional networks and the cell cycle. In conclusion we find that cell cycle synchronized transcriptional patterns are temporally compartmentalized and more complex than previously anticipated, involving genes, which control cell identity and development. PMID:29228002
Neuropathological and transcriptomic characteristics of the aged brain
Miller, Jeremy A; Guillozet-Bongaarts, Angela; Gibbons, Laura E; Postupna, Nadia; Renz, Anne; Beller, Allison E; Sunkin, Susan M; Ng, Lydia; Rose, Shannon E; Smith, Kimberly A; Szafer, Aaron; Barber, Chris; Bertagnolli, Darren; Bickley, Kristopher; Brouner, Krissy; Caldejon, Shiella; Chapin, Mike; Chua, Mindy L; Coleman, Natalie M; Cudaback, Eiron; Cuhaciyan, Christine; Dalley, Rachel A; Dee, Nick; Desta, Tsega; Dolbeare, Tim A; Dotson, Nadezhda I; Fisher, Michael; Gaudreault, Nathalie; Gee, Garrett; Gilbert, Terri L; Goldy, Jeff; Griffin, Fiona; Habel, Caroline; Haradon, Zeb; Hejazinia, Nika; Hellstern, Leanne L; Horvath, Steve; Howard, Kim; Howard, Robert; Johal, Justin; Jorstad, Nikolas L; Josephsen, Samuel R; Kuan, Chihchau L; Lai, Florence; Lee, Eric; Lee, Felix; Lemon, Tracy; Li, Xianwu; Marshall, Desiree A; Melchor, Jose; Mukherjee, Shubhabrata; Nyhus, Julie; Pendergraft, Julie; Potekhina, Lydia; Rha, Elizabeth Y; Rice, Samantha; Rosen, David; Sapru, Abharika; Schantz, Aimee; Shen, Elaine; Sherfield, Emily; Shi, Shu; Sodt, Andy J; Thatra, Nivretta; Tieu, Michael; Wilson, Angela M; Montine, Thomas J; Larson, Eric B; Bernard, Amy; Crane, Paul K; Ellenbogen, Richard G
2017-01-01
As more people live longer, age-related neurodegenerative diseases are an increasingly important societal health issue. Treatments targeting specific pathologies such as amyloid beta in Alzheimer’s disease (AD) have not led to effective treatments, and there is increasing evidence of a disconnect between traditional pathology and cognitive abilities with advancing age, indicative of individual variation in resilience to pathology. Here, we generated a comprehensive neuropathological, molecular, and transcriptomic characterization of hippocampus and two regions cortex in 107 aged donors (median = 90) from the Adult Changes in Thought (ACT) study as a freely-available resource (http://aging.brain-map.org/). We confirm established associations between AD pathology and dementia, albeit with increased, presumably aging-related variability, and identify sets of co-expressed genes correlated with pathological tau and inflammation markers. Finally, we demonstrate a relationship between dementia and RNA quality, and find common gene signatures, highlighting the importance of properly controlling for RNA quality when studying dementia. PMID:29120328
Earth mapping - aerial or satellite imagery comparative analysis
NASA Astrophysics Data System (ADS)
Fotev, Svetlin; Jordanov, Dimitar; Lukarski, Hristo
Nowadays, solving the tasks for revision of existing map products and creation of new maps requires making a choice of the land cover image source. The issue of the effectiveness and cost of the usage of aerial mapping systems versus the efficiency and cost of very-high resolution satellite imagery is topical [1, 2, 3, 4]. The price of any remotely sensed image depends on the product (panchromatic or multispectral), resolution, processing level, scale, urgency of task and on whether the needed image is available in the archive or has to be requested. The purpose of the present work is: to make a comparative analysis between the two approaches for mapping the Earth having in mind two parameters: quality and cost. To suggest an approach for selection of the map information sources - airplane-based or spacecraft-based imaging systems with very-high spatial resolution. Two cases are considered: area that equals approximately one satellite scene and area that equals approximately the territory of Bulgaria.
In vivo correlation mapping microscopy
NASA Astrophysics Data System (ADS)
McGrath, James; Alexandrov, Sergey; Owens, Peter; Subhash, Hrebesh; Leahy, Martin
2016-04-01
To facilitate regular assessment of the microcirculation in vivo, noninvasive imaging techniques such as nailfold capillaroscopy are required in clinics. Recently, a correlation mapping technique has been applied to optical coherence tomography (OCT), which extends the capabilities of OCT to microcirculation morphology imaging. This technique, known as correlation mapping optical coherence tomography, has been shown to extract parameters, such as capillary density and vessel diameter, and key clinical markers associated with early changes in microvascular diseases. However, OCT has limited spatial resolution in both the transverse and depth directions. Here, we extend this correlation mapping technique to other microscopy modalities, including confocal microscopy, and take advantage of the higher spatial resolution offered by these modalities. The technique is achieved as a processing step on microscopy images and does not require any modification to the microscope hardware. Results are presented which show that this correlation mapping microscopy technique can extend the capabilities of conventional microscopy to enable mapping of vascular networks in vivo with high spatial resolution in both the transverse and depth directions.
NASA Astrophysics Data System (ADS)
Molinario, G.; Baraldi, A.; Altstatt, A. L.; Nackoney, J.
2011-12-01
The University of Maryland has been a USAID Central Africa Rregional Program for the Environment (CARPE) cross-cutting partner for many years, providing remote sensing derived information on forest cover and forest cover changes in support of CARPE's objectives of diminishing forest degradation, loss and biodiversity loss as a result of poor or inexistent land use planning strategies. Together with South Dakota State University, Congo Basin-wide maps have been provided that map forest cover loss at a maximum of 60m resolution, using Landsat imagery and higher resolution imagery for algorithm training and validation. However, to better meet the needs within the CARPE Landscapes, which call for higher resolution, more accurate land cover change maps, UMD has been exploring the use of the SIAM automatic spectral -rule classifier together with pan-sharpened Landsat data (15m resolution) and Very High Resolution imagery from various sources. The pilot project is being developed in collaboration with the African Wildlife Foundation in the Maringa Lopori Wamba CARPE Landscape. If successful in the future this methodology will make the creation of high resolution change maps faster and easier, making it accessible to other entities in the Congo Basin that need accurate land cover and land use change maps in order, for example, to create sustainable land use plans, conserve biodiversity and resources and prepare Reducing Emissions from forest Degradation and Deforestation (REDD) Measurement, Reporting and Verification (MRV) projects. The paper describes the need for higher resolution land cover change maps that focus on forest change dynamics such as the cycling between primary forests, secondary forest, agriculture and other expanding and intensifying land uses in the Maringa Lopori Wamba CARPE Landscape in the Equateur Province of the Democratic Republic of Congo. The Methodology uses the SIAM remote sensing imagery automatic spectral rule classifier, together with pan-sharpened Landsat imagery with 15m resolution and Very High Resolution imagery from different sensors, obtained from the Department of Defense database that was recently opened to NASA and its Earth Observation partners. Particular emphasis is placed on the detection of agricultural fields and their expansion in primary forests or intensification in secondary forests and fallow fields, as this is the primary driver of deforestation in this area. Fields in this area area also of very small size and irregular shapes, often partly obscured by neighboring forest canopy, hence the technical challenge of correctly detecting them and tracking them through time. Finally, the potential for use of this methodology in other regions where information on land cover changes is needed for land use sustainability planning, is also addressed.
Serin, Elise A. R.; Snoek, L. B.; Nijveen, Harm; Willems, Leo A. J.; Jiménez-Gómez, Jose M.; Hilhorst, Henk W. M.; Ligterink, Wilco
2017-01-01
High-density genetic maps are essential for high resolution mapping of quantitative traits. Here, we present a new genetic map for an Arabidopsis Bayreuth × Shahdara recombinant inbred line (RIL) population, built on RNA-seq data. RNA-seq analysis on 160 RILs of this population identified 30,049 single-nucleotide polymorphisms (SNPs) covering the whole genome. Based on a 100-kbp window SNP binning method, 1059 bin-markers were identified, physically anchored on the genome. The total length of the RNA-seq genetic map spans 471.70 centimorgans (cM) with an average marker distance of 0.45 cM and a maximum marker distance of 4.81 cM. This high resolution genotyping revealed new recombination breakpoints in the population. To highlight the advantages of such high-density map, we compared it to two publicly available genetic maps for the same population, comprising 69 PCR-based markers and 497 gene expression markers derived from microarray data, respectively. In this study, we show that SNP markers can effectively be derived from RNA-seq data. The new RNA-seq map closes many existing gaps in marker coverage, saturating the previously available genetic maps. Quantitative trait locus (QTL) analysis for published phenotypes using the available genetic maps showed increased QTL mapping resolution and reduced QTL confidence interval using the RNA-seq map. The new high-density map is a valuable resource that facilitates the identification of candidate genes and map-based cloning approaches. PMID:29259624
A Prototype MODI- SSM/I Snow Mapping Algorithm
NASA Technical Reports Server (NTRS)
Tait, Andrew B.; Barton, Jonathan S.; Hall, Dorothy K.
1999-01-01
Data in the wavelength range 0.545 - 1.652 microns from the Moderate Resolution Imaging Spectroradiometer (MODIS), to be launched aboard the Earth Observing System (EOS) Terra in the fall of 1999, will be used to map daily global snow cover at 500m resolution. However, during darkness, or when the satellite's view of the surface is obscured by cloud, snow cover cannot be mapped using MODIS data. We show that during these conditions, it is possible to supplement the MODIS product by mapping the snow cover using passive microwave data from the Special Sensor Microwave Imager (SSM/I), albeit with much poorer resolution. For a 7-day time period in March 1999, a prototype MODIS snow-cover product was compared with a prototype MODIS-SSM/I product for the same area in the mid-western United States. The combined MODIS-SSM/I product mapped 9% more snow cover than the MODIS-only product.
Cartographic quality of ERTS-1 images
NASA Technical Reports Server (NTRS)
Welch, R. I.
1973-01-01
Analyses of simulated and operational ERTS images have provided initial estimates of resolution, ground resolution, detectability thresholds and other measures of image quality of interest to earth scientists and cartographers. Based on these values, including an approximate ground resolution of 250 meters for both RBV and MSS systems, the ERTS-1 images appear suited to the production and/or revision of planimetric and photo maps of 1:500,000 scale and smaller for which map accuracy standards are compatible with the imaged detail. Thematic mapping, although less constrained by map accuracy standards, will be influenced by measurement thresholds and errors which have yet to be accurately determined for ERTS images. This study also indicates the desirability of establishing a quantitative relationship between image quality values and map products which will permit both engineers and cartographers/earth scientists to contribute to the design requirements of future satellite imaging systems.
aMAP is a validated pipeline for registration and segmentation of high-resolution mouse brain data
Niedworok, Christian J.; Brown, Alexander P. Y.; Jorge Cardoso, M.; Osten, Pavel; Ourselin, Sebastien; Modat, Marc; Margrie, Troy W.
2016-01-01
The validation of automated image registration and segmentation is crucial for accurate and reliable mapping of brain connectivity and function in three-dimensional (3D) data sets. While validation standards are necessarily high and routinely met in the clinical arena, they have to date been lacking for high-resolution microscopy data sets obtained from the rodent brain. Here we present a tool for optimized automated mouse atlas propagation (aMAP) based on clinical registration software (NiftyReg) for anatomical segmentation of high-resolution 3D fluorescence images of the adult mouse brain. We empirically evaluate aMAP as a method for registration and subsequent segmentation by validating it against the performance of expert human raters. This study therefore establishes a benchmark standard for mapping the molecular function and cellular connectivity of the rodent brain. PMID:27384127
Spotlight-Mode Synthetic Aperture Radar Processing for High-Resolution Lunar Mapping
NASA Technical Reports Server (NTRS)
Harcke, Leif; Weintraub, Lawrence; Yun, Sang-Ho; Dickinson, Richard; Gurrola, Eric; Hensley, Scott; Marechal, Nicholas
2010-01-01
During the 2008-2009 year, the Goldstone Solar System Radar was upgraded to support radar mapping of the lunar poles at 4 m resolution. The finer resolution of the new system and the accompanying migration through resolution cells called for spotlight, rather than delay-Doppler, imaging techniques. A new pre-processing system supports fast-time Doppler removal and motion compensation to a point. Two spotlight imaging techniques which compensate for phase errors due to i) out of focus-plane motion of the radar and ii) local topography, have been implemented and tested. One is based on the polar format algorithm followed by a unique autofocus technique, the other is a full bistatic time-domain backprojection technique. The processing system yields imagery of the specified resolution. Products enabled by this new system include topographic mapping through radar interferometry, and change detection techniques (amplitude and coherent change) for geolocation of the NASA LCROSS mission impact site.
The use of Sentinel-2 imagery for seagrass mapping: Kalloni Gulf (Lesvos Island, Greece) case study
NASA Astrophysics Data System (ADS)
Topouzelis, Konstantinos; Charalampis Spondylidis, Spyridon; Papakonstantinou, Apostolos; Soulakellis, Nikolaos
2016-08-01
Seagrass meadows play a significant role in ecosystems by stabilizing sediment and improving water clarity, which enhances seagrass growing conditions. It is high on the priority of EU legislation to map and protect them. The traditional use of medium spatial resolution satellite imagery e.g. Landsat-8 (30m) is very useful for mapping seagrass meadows on a regional scale. However, the availability of Sentinel-2 data, the recent ESA's satellite with its payload Multi-Spectral Instrument (MSI) is expected to improve the mapping accuracy. MSI designed to improve coastline studies due to its enhanced spatial and spectral capabilities e.g. optical bands with 10m spatial resolution. The present work examines the quality of Sentinel-2 images for seagrass mapping, the ability of each band in detection and discrimination of different habitats and estimates the accuracy of seagrass mapping. After pre-processing steps, e.g. radiometric calibration and atmospheric correction, image classified into four classes. Classification classes included sub-bottom composition e.g. seagrass, soft bottom, and hard bottom. Concrete vectors describing the areas covered by seagrass extracted from the high-resolution satellite image and used as in situ measurements. The developed methodology applied in the Gulf of Kalloni, (Lesvos Island - Greece). Results showed that Sentinel-2 images can be robustly used for seagrass mapping due to their spatial resolution, band availability and radiometric accuracy.
Geologic map of Ophir and central Candor Chasmata (MTM -05072) of Mars
Lucchitta, Baerbel K.
1999-01-01
The geologic map of Ophir and central Candor Chasmata is one of a series of 1:500,000 scale maps prepared for areas on Mars that are of particular scientific interest and may serve as potential future landing sites. This map is also part of a set that includes east Candor Chasma, west Candor Chasma, and Melas Chasma. The geologic interpretations are based dominantly on medium- and high-resolution Viking images, many of them stereoscopic, and supplemented by lower resolution apoapsis and other color images. A strip of very high resolution stereoscopic images (~20 m/pixel) crosses the central part of the quadrangle from northwest to southeast and served to clarify detailed relations not obvious on other images. A topographic map with contour intervals of 200 m was also used, as were multidirectional oblique images derived from merged image mosaics and topography (see fig. 1) (Bertolini and McEwen, 1990). Geologic relations and interpretations are based on the entire central Valles Marineris map set. The map area is included in the Valles Marineris map of Witbeck and others (1991), but units were defined independently. Age assignments, however, were integrated with those by Witbeck and others and Scott and Tanaka (1986).
Development and characterization of a 3D high-resolution terrain database
NASA Astrophysics Data System (ADS)
Wilkosz, Aaron; Williams, Bryan L.; Motz, Steve
2000-07-01
A top-level description of methods used to generate elements of a high resolution 3D characterization database is presented. The database elements are defined as ground plane elevation map, vegetation height elevation map, material classification map, discrete man-made object map, and temperature radiance map. The paper will cover data collection by means of aerial photography, techniques of soft photogrammetry used to derive the elevation data, and the methodology followed to generate the material classification map. The discussion will feature the development of the database elements covering Fort Greely, Alaska. The developed databases are used by the US Army Aviation and Missile Command to evaluate the performance of various missile systems.
NASA Astrophysics Data System (ADS)
Hendrickx, J. M. H.; Allen, R. G.; Myint, S. W.; Ogden, F. L.
2015-12-01
Large scale mapping of evapotranspiration and root zone soil moisture is only possible when satellite images are used. The spatial resolution of this imagery typically depends on its temporal resolution or the satellite overpass time. For example, the Landsat satellite acquires images at 30 m resolution every 16 days while the MODIS satellite acquires images at 250 m resolution every day. In this study we deal with optical/thermal imagery that is impacted by cloudiness contrary to radar imagery that penetrates through clouds. Due to cloudiness, the temporal resolution of Landsat drops from 16 days to about one clear sky Landsat image per month in the southwestern USA and about one every ten years in the humid tropics of Panama. Only by launching additional satellites can the temporal resolution be improved. Since this is too costly, an alternative is found by using ground measurements with high temporal resolution (from minutes to days) but poor spatial resolution. The challenge for large-scale evapotranspiration and root zone soil moisture mapping is to construct a layer stack consisting of N time layers covering the period of interest each containing M pixels covering the region of interest. We will present examples of the Phoenix Active Management Area in AZ (14,600 km2), Green River Basin in WY (44,000 km2), the Kishwaukee Watershed in IL (3,150 km2), the area covered by Landsat Path 28/Row 35 in OK (30,000 km2) and the Agua Salud Watershed in Panama (200 km2). In these regions we used Landsat or MODIS imagery for mapping evapotranspiration and root zone soil moisture by the algorithm Mapping EvapoTranspiration at high Resolution with Internalized Calibration (METRIC) together with meteorological measurements and sometimes either Large Aperture Scintillometers (LAS) or Eddy Covariance (EC). We conclude with lessons learned for future large-scale hydrological studies.
Yang, Qi; Meng, Fan-Rui; Bourque, Charles P-A; Zhao, Zhengyong
2017-09-08
Forest ecosite reflects the local site conditions that are meaningful to forest productivity as well as basic ecological functions. Field assessments of vegetation and soil types are often used to identify forest ecosites. However, the production of high-resolution ecosite maps for large areas from interpolating field data is difficult because of high spatial variation and associated costs and time requirements. Indices of soil moisture and nutrient regimes (i.e., SMR and SNR) introduced in this study reflect the combined effects of biogeochemical and topographic factors on forest growth. The objective of this research is to present a method for creating high-resolution forest ecosite maps based on computer-generated predictions of SMR and SNR for an area in Atlantic Canada covering about 4.3 × 10 6 hectares (ha) of forestland. Field data from 1,507 forest ecosystem classification plots were used to assess the accuracy of the ecosite maps produced. Using model predictions of SMR and SNR alone, ecosite maps were 61 and 59% correct in identifying 10 Acadian- and Maritime-Boreal-region ecosite types, respectively. This method provides an operational framework for the production of high-resolution maps of forest ecosites over large areas without the need for data from expensive, supplementary field surveys.
Jacquemin, Julie; Bhatia, Dharminder; Singh, Kuldeep; Wing, Rod A
2013-05-01
The wild relatives of rice contain a virtually untapped reservoir of traits that can be used help drive the 21st century green revolution aimed at solving world food security issues by 2050. To better understand and exploit the 23 species of the Oryza genus the rice research community is developing foundational resources composed of: 1) reference genomes and transcriptomes for all 23 species; 2) advanced mapping populations for functional and breeding studies; and 3) in situ conservation sites for ecological, evolutionary and population genomics. To this end, 16 genome sequencing projects are currently underway, and all completed assemblies have been annotated; and several advanced mapping populations have been developed, and more will be generated, mapped, and phenotyped, to uncover useful alleles. As wild Oryza populations are threatened by human activity and climate change, we also discuss the urgent need for sustainable in situ conservation of the genus. Copyright © 2013 Elsevier Ltd. All rights reserved.
Okamura-Oho, Yuko; Shimokawa, Kazuro; Nishimura, Masaomi; Takemoto, Satoko; Sato, Akira; Furuichi, Teiichi; Yokota, Hideo
2014-01-01
Using a recently invented technique for gene expression mapping in the whole-anatomy context, termed transcriptome tomography, we have generated a dataset of 36,000 maps of overall gene expression in the adult-mouse brain. Here, using an informatics approach, we identified a broad co-expression network that follows an inverse power law and is rich in functional interaction and gene-ontology terms. Our framework for the integrated analysis of expression maps and graphs of co-expression networks revealed that groups of combinatorially expressed genes, which regulate cell differentiation during development, were present in the adult brain and each of these groups was associated with a discrete cell types. These groups included non-coding genes of unknown function. We found that these genes specifically linked developmentally conserved groups in the network. A previously unrecognized robust expression pattern covering the whole brain was related to the molecular anatomy of key biological processes occurring in particular areas. PMID:25382412
Pathik, Bhupesh; Lee, Geoffrey; Sacher, Frédéric; Jaïs, Pierre; Massoullié, Grégoire; Derval, Nicolas; Bates, Matthew G; Lipton, Jonathan; Joseph, Stephen; Morton, Joseph; Sparks, Paul; Kistler, Peter; Kalman, Jonathan M
2017-09-01
Using high-resolution 3-dimensional (3D) mapping, the aim of this study was to further characterize right atrial macro-re-entrant tachycardias and answer unresolved questions in the understanding of this arrhythmia. Despite advances in understanding of the mechanisms of right atrial macro-re-entrant tachycardias, many questions lack definitive answers. The advent of high-resolution 3D mapping provides an opportunity to gain further insights into the nature of these common circuits. A total of 25 patients with right atrial macro-re-entrant tachycardia were studied. High-resolution 3D mapping (Rhythmia mapping system, Boston Scientific, Natick, Massachusetts) was performed. Regional voltage and conduction velocity were determined. Maps were analyzed to characterize wave front propagation patterns in all atrial regions. The relationship between substrate and conduction was evaluated. A total of 42 right atrial macro-re-entrant circuits were observed. The most common location of the posterior line of block was the posteromedial right atrium (73%). This line of block continued superiorly into the superior vena cava, taking an oblique course to finish on the anterior superior vena cava aspect in 73%. Conduction delay at the crista terminalis was less common (23%). Conduction slowing or block was seen at the limbus of the fossa ovalis (73%) and Eustachian ridge (77%). Highly variable and localized areas of slow conduction were also observed in the inferior septum (45%), superior septum (27%), anterosuperior right atrium (23%), and lateral right atrium (23%). Localized conduction slowing was seen in the cavotricuspid isthmus in 50% of patients, but there was no generalized conduction slowing in this isthmus. The voltage in regions of slow conduction was significantly lower compared with areas of normal conduction velocity (p < 0.001). Conduction channels were observed in 55% of patients. High-resolution 3D mapping has provided new insights into the nature of right atrial macro-re-entrant tachycardias. Variable regions of abnormal atrial substrate were associated with conduction slowing and block. Individual variation in propagation patterns was observed in association with this variable substrate. (Mapping of Atrial Arrhythmias Using High Spatial Resolution Mapping Catheters and the Rhythmia Mapping System; ACTRN12615000544572). Copyright © 2017 American College of Cardiology Foundation. Published by Elsevier Inc. All rights reserved.
Single Image Super-Resolution Using Global Regression Based on Multiple Local Linear Mappings.
Choi, Jae-Seok; Kim, Munchurl
2017-03-01
Super-resolution (SR) has become more vital, because of its capability to generate high-quality ultra-high definition (UHD) high-resolution (HR) images from low-resolution (LR) input images. Conventional SR methods entail high computational complexity, which makes them difficult to be implemented for up-scaling of full-high-definition input images into UHD-resolution images. Nevertheless, our previous super-interpolation (SI) method showed a good compromise between Peak-Signal-to-Noise Ratio (PSNR) performances and computational complexity. However, since SI only utilizes simple linear mappings, it may fail to precisely reconstruct HR patches with complex texture. In this paper, we present a novel SR method, which inherits the large-to-small patch conversion scheme from SI but uses global regression based on local linear mappings (GLM). Thus, our new SR method is called GLM-SI. In GLM-SI, each LR input patch is divided into 25 overlapped subpatches. Next, based on the local properties of these subpatches, 25 different local linear mappings are applied to the current LR input patch to generate 25 HR patch candidates, which are then regressed into one final HR patch using a global regressor. The local linear mappings are learned cluster-wise in our off-line training phase. The main contribution of this paper is as follows: Previously, linear-mapping-based conventional SR methods, including SI only used one simple yet coarse linear mapping to each patch to reconstruct its HR version. On the contrary, for each LR input patch, our GLM-SI is the first to apply a combination of multiple local linear mappings, where each local linear mapping is found according to local properties of the current LR patch. Therefore, it can better approximate nonlinear LR-to-HR mappings for HR patches with complex texture. Experiment results show that the proposed GLM-SI method outperforms most of the state-of-the-art methods, and shows comparable PSNR performance with much lower computational complexity when compared with a super-resolution method based on convolutional neural nets (SRCNN15). Compared with the previous SI method that is limited with a scale factor of 2, GLM-SI shows superior performance with average 0.79 dB higher in PSNR, and can be used for scale factors of 3 or higher.
Roth, Melissa S; Cokus, Shawn J; Gallaher, Sean D; Walter, Andreas; Lopez, David; Erickson, Erika; Endelman, Benjamin; Westcott, Daniel; Larabell, Carolyn A; Merchant, Sabeeha S; Pellegrini, Matteo; Niyogi, Krishna K
2017-05-23
Microalgae have potential to help meet energy and food demands without exacerbating environmental problems. There is interest in the unicellular green alga Chromochloris zofingiensis , because it produces lipids for biofuels and a highly valuable carotenoid nutraceutical, astaxanthin. To advance understanding of its biology and facilitate commercial development, we present a C. zofingiensis chromosome-level nuclear genome, organelle genomes, and transcriptome from diverse growth conditions. The assembly, derived from a combination of short- and long-read sequencing in conjunction with optical mapping, revealed a compact genome of ∼58 Mbp distributed over 19 chromosomes containing 15,274 predicted protein-coding genes. The genome has uniform gene density over chromosomes, low repetitive sequence content (∼6%), and a high fraction of protein-coding sequence (∼39%) with relatively long coding exons and few coding introns. Functional annotation of gene models identified orthologous families for the majority (∼73%) of genes. Synteny analysis uncovered localized but scrambled blocks of genes in putative orthologous relationships with other green algae. Two genes encoding beta-ketolase ( BKT ), the key enzyme synthesizing astaxanthin, were found in the genome, and both were up-regulated by high light. Isolation and molecular analysis of astaxanthin-deficient mutants showed that BKT1 is required for the production of astaxanthin. Moreover, the transcriptome under high light exposure revealed candidate genes that could be involved in critical yet missing steps of astaxanthin biosynthesis, including ABC transporters, cytochrome P450 enzymes, and an acyltransferase. The high-quality genome and transcriptome provide insight into the green algal lineage and carotenoid production.
Chung, Oksung; Jin, Seondeok; Cho, Yun Sung; Lim, Jeongheui; Kim, Hyunho; Jho, Sungwoong; Kim, Hak-Min; Jun, JeHoon; Lee, HyeJin; Chon, Alvin; Ko, Junsu; Edwards, Jeremy; Weber, Jessica A; Han, Kyudong; O'Brien, Stephen J; Manica, Andrea; Bhak, Jong; Paek, Woon Kee
2015-10-21
The cinereous vulture, Aegypius monachus, is the largest bird of prey and plays a key role in the ecosystem by removing carcasses, thus preventing the spread of diseases. Its feeding habits force it to cope with constant exposure to pathogens, making this species an interesting target for discovering functionally selected genetic variants. Furthermore, the presence of two independently evolved vulture groups, Old World and New World vultures, provides a natural experiment in which to investigate convergent evolution due to obligate scavenging. We sequenced the genome of a cinereous vulture, and mapped it to the bald eagle reference genome, a close relative with a divergence time of 18 million years. By comparing the cinereous vulture to other avian genomes, we find positively selected genetic variations in this species associated with respiration, likely linked to their ability of immune defense responses and gastric acid secretion, consistent with their ability to digest carcasses. Comparisons between the Old World and New World vulture groups suggest convergent gene evolution. We assemble the cinereous vulture blood transcriptome from a second individual, and annotate genes. Finally, we infer the demographic history of the cinereous vulture which shows marked fluctuations in effective population size during the late Pleistocene. We present the first genome and transcriptome analyses of the cinereous vulture compared to other avian genomes and transcriptomes, revealing genetic signatures of dietary and environmental adaptations accompanied by possible convergent evolution between the Old World and New World vultures.
Transcriptome sequencing of rhizome tissue of Sinopodophyllum hexandrum at two temperatures.
Kumari, Anita; Singh, Heikham Russiachand; Jha, Ashwani; Swarnkar, Mohit Kumar; Shankar, Ravi; Kumar, Sanjay
2014-10-07
Sinopodophyllum hexandrum is an endangered medicinal herb, which is commonly present in elevations ranging between 2,400-4,500 m and is sensitive to temperature. Medicinal property of the species is attributed to the presence of podophyllotoxin in the rhizome tissue. The present work analyzed transcriptome of rhizome tissue of S. hexandrum exposed to 15°C and 25°C to understand the temperature mediated molecular responses including those associated with podophyllotoxin biosynthesis. Deep sequencing of transcriptome with an average coverage of 88.34X yielded 60,089 assembled transcript sequences representing 20,387 unique genes having homology to known genes. Fragments per kilobase of exon per million fragments mapped (FPKM) based expression analysis revealed genes related to growth and development were over-expressed at 15°C, whereas genes involved in stress response were over-expressed at 25°C. There was a decreasing trend of podophyllotoxin accumulation at 25°C; data was well supported by the expression of corresponding genes of the pathway. FPKM data was validated by quantitative real-time polymerase chain reaction data using a total of thirty four genes and a positive correlation between the two platforms of gene expression was obtained. Also, detailed analyses yielded cytochrome P450s, methyltransferases and glycosyltransferases which could be the potential candidate hitherto unidentified genes of podophyllotoxin biosynthesis pathway. The present work revealed temperature responsive transcriptome of S. hexandrum on Illumina platform. Data suggested expression of genes for growth and development and podophyllotoxin biosynthesis at 15°C, and prevalence of those associated with stress response at 25°C.
Roth, Melissa S.; Cokus, Shawn J.; Gallaher, Sean D.; ...
2017-05-08
Microalgae have potential to help meet energy and food demands without exacerbating environmental problems. There is interest in the unicellular green alga Chromochloris zofingiensis, because it produces lipids for biofuels and a highly valuable carotenoid nutraceutical, astaxanthin. Here, to advance understanding of its biology and facilitate commercial development, we present a C. zofingiensis chromosome-level nuclear genome, organelle genomes, and transcriptome from diverse growth conditions. The assembly, derived from a combination of short- and long-read sequencing in conjunction with optical mapping, revealed a compact genome of ~58 Mbp distributed over 19 chromosomes containing 15,274 predicted protein-coding genes. The genome has uniformmore » gene density over chromosomes, low repetitive sequence content (~6%), and a high fraction of protein-coding sequence (~39%) with relatively long coding exons and few coding introns. Functional annotation of gene models identified orthologous families for the majority (~73%) of genes. Synteny analysis uncovered localized but scrambled blocks of genes in putative orthologous relationships with other green algae. Two genes encoding beta-ketolase (BKT), the key enzyme synthesizing astaxanthin, were found in the genome, and both were up-regulated by high light. Isolation and molecular analysis of astaxanthin-deficient mutants showed that BKT1 is required for the production of astaxanthin. Moreover, the transcriptome under high light exposure revealed candidate genes that could be involved in critical yet missing steps of astaxanthin biosynthesis, including ABC transporters, cytochrome P450 enzymes, and an acyltransferase. Finally, the high-quality genome and transcriptome provide insight into the green algal lineage and carotenoid production.« less
Multi-Omics Driven Assembly and Annotation of the Sandalwood (Santalum album) Genome.
Mahesh, Hirehally Basavarajegowda; Subba, Pratigya; Advani, Jayshree; Shirke, Meghana Deepak; Loganathan, Ramya Malarini; Chandana, Shankara Lingu; Shilpa, Siddappa; Chatterjee, Oishi; Pinto, Sneha Maria; Prasad, Thottethodi Subrahmanya Keshava; Gowda, Malali
2018-04-01
Indian sandalwood ( Santalum album ) is an important tropical evergreen tree known for its fragrant heartwood-derived essential oil and its valuable carving wood. Here, we applied an integrated genomic, transcriptomic, and proteomic approach to assemble and annotate the Indian sandalwood genome. Our genome sequencing resulted in the establishment of a draft map of the smallest genome for any woody tree species to date (221 Mb). The genome annotation predicted 38,119 protein-coding genes and 27.42% repetitive DNA elements. In-depth proteome analysis revealed the identities of 72,325 unique peptides, which confirmed 10,076 of the predicted genes. The addition of transcriptomic and proteogenomic approaches resulted in the identification of 53 novel proteins and 34 gene-correction events that were missed by genomic approaches. Proteogenomic analysis also helped in reassigning 1,348 potential noncoding RNAs as bona fide protein-coding messenger RNAs. Gene expression patterns at the RNA and protein levels indicated that peptide sequencing was useful in capturing proteins encoded by nuclear and organellar genomes alike. Mass spectrometry-based proteomic evidence provided an unbiased approach toward the identification of proteins encoded by organellar genomes. Such proteins are often missed in transcriptome data sets due to the enrichment of only messenger RNAs that contain poly(A) tails. Overall, the use of integrated omic approaches enhanced the quality of the assembly and annotation of this nonmodel plant genome. The availability of genomic, transcriptomic, and proteomic data will enhance genomics-assisted breeding, germplasm characterization, and conservation of sandalwood trees. © 2018 American Society of Plant Biologists. All Rights Reserved.
Pfalzer, Anna C; Kamanu, Frederick K; Parnell, Laurence D; Tai, Albert K; Liu, Zhenhua; Mason, Joel B; Crott, Jimmy W
2016-08-01
Obesity is a significant risk factor for colorectal cancer (CRC); however, the relative contribution of high-fat (HF) consumption and excess adiposity remains unclear. It is becoming apparent that obesity perturbs both the intestinal microbiome and metabolome, and each has the potential to induce protumorigenic changes in the epithelial transcriptome. The physiological consequences and the degree to which these different biologic systems interact remain poorly defined. To understand the mechanisms by which obesity drives colonic tumorigenesis, we profiled the colonic epithelial transcriptome of HF-fed and genetically obese (DbDb) mice with a genetic predisposition to intestinal tumorigenesis (Apc(1638N)); 266 and 584 genes were differentially expressed in the colonic mucosa of HF and DbDb mice, respectively. These genes mapped to pathways involved in immune function, and cellular proliferation and cancer. Furthermore, Akt was central within the networks of interacting genes identified in both gene sets. Regression analyses of coexpressed genes with the abundance of bacterial taxa identified three taxa, previously correlated with tumor burden, to be significantly correlated with a gene module enriched for Akt-related genes. Similarly, regression of coexpressed genes with metabolites found that adenosine, which was negatively associated with inflammatory markers and tumor burden, was also correlated with a gene module enriched with Akt regulators. Our findings provide evidence that HF consumption and excess adiposity result in changes in the colonic transcriptome that, although distinct, both appear to converge on Akt signaling. Such changes could be mediated by alterations in the colonic microbiome and metabolome.
Ramos, Miguel Jesus Nunes; Coito, João Lucas; Fino, Joana; Cunha, Jorge; Silva, Helena; de Almeida, Patrícia Gomes; Costa, Maria Manuela Ribeiro; Amâncio, Sara; Paulo, Octávio S; Rocheta, Margarida
2017-01-01
RNA-seq of Vitis during early stages of bud development, in male, female and hermaphrodite flowers, identified new loci outside of annotated gene models, suggesting their involvement in sex establishment. The molecular mechanisms responsible for flower sex specification remain unclear for most plant species. In the case of V. vinifera ssp. vinifera, it is not fully understood what determines hermaphroditism in the domesticated subspecies and male or female flowers in wild dioecious relatives (Vitis vinifera ssp. sylvestris). Here, we describe a de novo assembly of the transcriptome of three flower developmental stages from the three Vitis vinifera flower types. The validation of de novo assembly showed a correlation of 0.825. The main goals of this work were the identification of V. v. sylvestris exclusive transcripts and the characterization of differential gene expression during flower development. RNA from several flower developmental stages was used previously to generate Illumina sequence reads. Through a sequential de novo assembly strategy one comprehensive transcriptome comprising 95,516 non-redundant transcripts was assembled. From this dataset 81,064 transcripts were annotated to V. v. vinifera reference transcriptome and 11,084 were annotated against V. v. vinifera reference genome. Moreover, we found 3368 transcripts that could not be mapped to Vitis reference genome. From all the non-redundant transcripts that were assembled, bioinformatics analysis identified 133 specific of V. v. sylvestris and 516 transcripts differentially expressed among the three flower types. The detection of transcription from areas of the genome not currently annotated suggests active transcription of previously unannotated genomic loci during early stages of bud development.
Dissecting the Root Nodule Transcriptome of Chickpea (Cicer arietinum L.)
Kant, Chandra; Pradhan, Seema; Bhatia, Sabhyata
2016-01-01
A hallmark trait of chickpea (Cicer arietinum L.), like other legumes, is the capability to convert atmospheric nitrogen (N2) into ammonia (NH3) in symbiotic association with Mesorhizobium ciceri. However, the complexity of molecular networks associated with the dynamics of nodule development in chickpea need to be analyzed in depth. Hence, in order to gain insights into the chickpea nodule development, the transcriptomes of nodules at early, middle and late stages of development were sequenced using the Roche 454 platform. This generated 490.84 Mb sequence data comprising 1,360,251 reads which were assembled into 83,405 unigenes. Transcripts were annotated using Gene Ontology (GO), Cluster of Orthologous Groups (COG) and Kyoto Encyclopedia of Genes and Genomes (KEGG) metabolic pathways analysis. Differential expression analysis revealed that a total of 3760 transcripts were differentially expressed in at least one of three stages, whereas 935, 117 and 2707 transcripts were found to be differentially expressed in the early, middle and late stages of nodule development respectively. MapMan analysis revealed enrichment of metabolic pathways such as transport, protein synthesis, signaling and carbohydrate metabolism during root nodulation. Transcription factors were predicted and analyzed for their differential expression during nodule development. Putative nodule specific transcripts were identified and enriched for GO categories using BiNGO which revealed many categories to be enriched during nodule development, including transcription regulators and transporters. Further, the assembled transcriptome was also used to mine for genic SSR markers. In conclusion, this study will help in enriching the transcriptomic resources implicated in understanding of root nodulation events in chickpea. PMID:27348121
DOE Office of Scientific and Technical Information (OSTI.GOV)
Roth, Melissa S.; Cokus, Shawn J.; Gallaher, Sean D.
Microalgae have potential to help meet energy and food demands without exacerbating environmental problems. There is interest in the unicellular green alga Chromochloris zofingiensis, because it produces lipids for biofuels and a highly valuable carotenoid nutraceutical, astaxanthin. Here, to advance understanding of its biology and facilitate commercial development, we present a C. zofingiensis chromosome-level nuclear genome, organelle genomes, and transcriptome from diverse growth conditions. The assembly, derived from a combination of short- and long-read sequencing in conjunction with optical mapping, revealed a compact genome of ~58 Mbp distributed over 19 chromosomes containing 15,274 predicted protein-coding genes. The genome has uniformmore » gene density over chromosomes, low repetitive sequence content (~6%), and a high fraction of protein-coding sequence (~39%) with relatively long coding exons and few coding introns. Functional annotation of gene models identified orthologous families for the majority (~73%) of genes. Synteny analysis uncovered localized but scrambled blocks of genes in putative orthologous relationships with other green algae. Two genes encoding beta-ketolase (BKT), the key enzyme synthesizing astaxanthin, were found in the genome, and both were up-regulated by high light. Isolation and molecular analysis of astaxanthin-deficient mutants showed that BKT1 is required for the production of astaxanthin. Moreover, the transcriptome under high light exposure revealed candidate genes that could be involved in critical yet missing steps of astaxanthin biosynthesis, including ABC transporters, cytochrome P450 enzymes, and an acyltransferase. Finally, the high-quality genome and transcriptome provide insight into the green algal lineage and carotenoid production.« less
Roth, Melissa S.; Cokus, Shawn J.; Gallaher, Sean D.; Walter, Andreas; Lopez, David; Erickson, Erika; Endelman, Benjamin; Westcott, Daniel; Larabell, Carolyn A.; Merchant, Sabeeha S.; Pellegrini, Matteo
2017-01-01
Microalgae have potential to help meet energy and food demands without exacerbating environmental problems. There is interest in the unicellular green alga Chromochloris zofingiensis, because it produces lipids for biofuels and a highly valuable carotenoid nutraceutical, astaxanthin. To advance understanding of its biology and facilitate commercial development, we present a C. zofingiensis chromosome-level nuclear genome, organelle genomes, and transcriptome from diverse growth conditions. The assembly, derived from a combination of short- and long-read sequencing in conjunction with optical mapping, revealed a compact genome of ∼58 Mbp distributed over 19 chromosomes containing 15,274 predicted protein-coding genes. The genome has uniform gene density over chromosomes, low repetitive sequence content (∼6%), and a high fraction of protein-coding sequence (∼39%) with relatively long coding exons and few coding introns. Functional annotation of gene models identified orthologous families for the majority (∼73%) of genes. Synteny analysis uncovered localized but scrambled blocks of genes in putative orthologous relationships with other green algae. Two genes encoding beta-ketolase (BKT), the key enzyme synthesizing astaxanthin, were found in the genome, and both were up-regulated by high light. Isolation and molecular analysis of astaxanthin-deficient mutants showed that BKT1 is required for the production of astaxanthin. Moreover, the transcriptome under high light exposure revealed candidate genes that could be involved in critical yet missing steps of astaxanthin biosynthesis, including ABC transporters, cytochrome P450 enzymes, and an acyltransferase. The high-quality genome and transcriptome provide insight into the green algal lineage and carotenoid production. PMID:28484037
Whole transcriptome analysis of the poultry red mite Dermanyssus gallinae (De Geer, 1778).
Schicht, Sabine; Qi, Weihong; Poveda, Lucy; Strube, Christina
2014-03-01
SUMMARY Although the poultry red mite Dermanyssus gallinae (De Geer, 1778) is the major parasitic pest in poultry farming causing substantial economic losses every year, nucleotide data are rare in the public databases. Therefore, de novo sequencing covering the transcriptome of D. gallinae was carried out resulting in a dataset of 232 097 singletons and 42 130 contiguous sequences (contigs) which were subsequently clustered into 24 140 isogroups consisting of 35 788 isotigs. After removal of sequences possibly originating from bacteria or the chicken host, 267 464 sequences (231 657 singletons, 56 contigs and 35 751 isotigs) remained, of which 10·3% showed homology to proteins derived from other organisms. The most significant Blast top-hit species was the mite Metaseiulus occidentalis followed by the tick Ixodes scapularis. To gain functional knowledge of D. gallinae transcripts, sequences were mapped to Gene Ontology terms, Kyoto Encyclopedia of Gene and Genomes (KEGG) pathways and parsed to InterProScan. The transcriptome dataset provides new insights in general mite genetics and lays a foundation for future studies on stage-specific transcriptomics as well as genomic, proteomic, and metabolomic explorations and might provide new perspectives to control this parasitic mite by identifying possible drug targets or vaccine candidates. It is also worth noting that in different tested species of the class Arachnida no 28S rRNA was detectable in the rRNA profile, indicating that 28S rRNA might consists of two separate, hydrogen-bonded fragments, whose (heat-induced) disruption may led to co-migration with 18S rRNA.
Using next generation transcriptome sequencing to predict an ectomycorrhizal metablome.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Larsen, P. E.; Sreedasyam, A.; Trivedi, G
Mycorrhizae, symbiotic interactions between soil fungi and tree roots, are ubiquitous in terrestrial ecosystems. The fungi contribute phosphorous, nitrogen and mobilized nutrients from organic matter in the soil and in return the fungus receives photosynthetically-derived carbohydrates. This union of plant and fungal metabolisms is the mycorrhizal metabolome. Understanding this symbiotic relationship at a molecular level provides important contributions to the understanding of forest ecosystems and global carbon cycling. We generated next generation short-read transcriptomic sequencing data from fully-formed ectomycorrhizae between Laccaria bicolor and aspen (Populus tremuloides) roots. The transcriptomic data was used to identify statistically significantly expressed gene models usingmore » a bootstrap-style approach, and these expressed genes were mapped to specific metabolic pathways. Integration of expressed genes that code for metabolic enzymes and the set of expressed membrane transporters generates a predictive model of the ectomycorrhizal metabolome. The generated model of mycorrhizal metabolome predicts that the specific compounds glycine, glutamate, and allantoin are synthesized by L. bicolor and that these compounds or their metabolites may be used for the benefit of aspen in exchange for the photosynthetically-derived sugars fructose and glucose. The analysis illustrates an approach to generate testable biological hypotheses to investigate the complex molecular interactions that drive ectomycorrhizal symbiosis. These models are consistent with experimental environmental data and provide insight into the molecular exchange processes for organisms in this complex ecosystem. The method used here for predicting metabolomic models of mycorrhizal systems from deep RNA sequencing data can be generalized and is broadly applicable to transcriptomic data derived from complex systems.« less
Gu, Li; Zhang, Zhong-Yi; Quan, Hong; Li, Ming-Jie; Zhao, Fang-Yu; Xu, Yuan-Jiang; Liu, Jiang; Sai, Man; Zheng, Wei-Lie; Lan, Xiao-Zhong
2018-06-01
Mirabilis himalaica (Edgew.) Heimerl is among the most important genuine medicinal plants in Tibet. However, the biosynthesis mechanisms of the active compounds in this species are unclear, severely limiting its application. To clarify the molecular biosynthesis mechanism of the key representative active compounds, specifically rotenoid, which is of special medicinal value for M. himalaica, RNA sequencing and TOF-MS technologies were used to construct transcriptomic and metabolomic libraries from the roots, stems, and leaves of M. himalaica plants collected from their natural habitat. As a result, each of the transcriptomic libraries from the different tissues was sequenced, generating more than 10 Gb of clean data ultimately assembled into 147,142 unigenes. In the three tissues, metabolomic analysis identified 522 candidate compounds, of which 170 metabolites involved in 114 metabolic pathways were mapped to the KEGG. Of these genes, 61 encoding enzymes were identified to function at key steps of the pathways related to rotenoid biosynthesis, where 14 intermediate metabolites were also located. An integrated analysis of metabolic and transcriptomic data revealed that most of the intermediate metabolites and enzymes related to rotenoid biosynthesis were synthesized in the roots, stems and leaves of M. himalaica, which suggested that the use of non-medicinal tissues to extract compounds was feasible. In addition, the CHS and CHI genes were found to play important roles in rotenoid biosynthesis, especially, since CHS might be an important rate-limiting enzyme. This study provides a hypothetical basis for the screening of new active metabolites and the metabolic engineering of rotenoid in M. himalaica.
Aircraft Detection in High-Resolution SAR Images Based on a Gradient Textural Saliency Map.
Tan, Yihua; Li, Qingyun; Li, Yansheng; Tian, Jinwen
2015-09-11
This paper proposes a new automatic and adaptive aircraft target detection algorithm in high-resolution synthetic aperture radar (SAR) images of airport. The proposed method is based on gradient textural saliency map under the contextual cues of apron area. Firstly, the candidate regions with the possible existence of airport are detected from the apron area. Secondly, directional local gradient distribution detector is used to obtain a gradient textural saliency map in the favor of the candidate regions. In addition, the final targets will be detected by segmenting the saliency map using CFAR-type algorithm. The real high-resolution airborne SAR image data is used to verify the proposed algorithm. The results demonstrate that this algorithm can detect aircraft targets quickly and accurately, and decrease the false alarm rate.
Robb, Paul D; Craven, Alan J
2008-12-01
An image processing technique is presented for atomic resolution high-angle annular dark-field (HAADF) images that have been acquired using scanning transmission electron microscopy (STEM). This technique is termed column ratio mapping and involves the automated process of measuring atomic column intensity ratios in high-resolution HAADF images. This technique was developed to provide a fuller analysis of HAADF images than the usual method of drawing single intensity line profiles across a few areas of interest. For instance, column ratio mapping reveals the compositional distribution across the whole HAADF image and allows a statistical analysis and an estimation of errors. This has proven to be a very valuable technique as it can provide a more detailed assessment of the sharpness of interfacial structures from HAADF images. The technique of column ratio mapping is described in terms of a [110]-oriented zinc-blende structured AlAs/GaAs superlattice using the 1 angstroms-scale resolution capability of the aberration-corrected SuperSTEM 1 instrument.
Molecular dynamics-based refinement and validation for sub-5 Å cryo-electron microscopy maps.
Singharoy, Abhishek; Teo, Ivan; McGreevy, Ryan; Stone, John E; Zhao, Jianhua; Schulten, Klaus
2016-07-07
Two structure determination methods, based on the molecular dynamics flexible fitting (MDFF) paradigm, are presented that resolve sub-5 Å cryo-electron microscopy (EM) maps with either single structures or ensembles of such structures. The methods, denoted cascade MDFF and resolution exchange MDFF, sequentially re-refine a search model against a series of maps of progressively higher resolutions, which ends with the original experimental resolution. Application of sequential re-refinement enables MDFF to achieve a radius of convergence of ~25 Å demonstrated with the accurate modeling of β-galactosidase and TRPV1 proteins at 3.2 Å and 3.4 Å resolution, respectively. The MDFF refinements uniquely offer map-model validation and B-factor determination criteria based on the inherent dynamics of the macromolecules studied, captured by means of local root mean square fluctuations. The MDFF tools described are available to researchers through an easy-to-use and cost-effective cloud computing resource on Amazon Web Services.
Combined Landsat-8 and Sentinel-2 Burned Area Mapping
NASA Astrophysics Data System (ADS)
Huang, H.; Roy, D. P.; Zhang, H.; Boschetti, L.; Yan, L.; Li, Z.
2017-12-01
Fire products derived from coarse spatial resolution satellite data have become an important source of information for the multiple user communities involved in fire science and applications. The advent of the MODIS on NASA's Terra and Aqua satellites enabled systematic production of 500m global burned area maps. There is, however, an unequivocal demand for systematically generated higher spatial resolution burned area products, in particular to examine the role of small-fires for various applications. Moderate spatial resolution contemporaneous satellite data from Landsat-8 and the Sentinel-2A and -2B sensors provide the opportunity for detailed spatial mapping of burned areas. Combined, these polar-orbiting systems provide 10m to 30m multi-spectral global coverage more than once every three days. This NASA funded research presents results to prototype a combined Landsat-8 Sentinel-2 burned area product. The Landsat-8 and Sentinel-2 pre-processing, the time-series burned area mapping algorithm, and preliminary results and validation using high spatial resolution commercial satellite data over Africa are presented.
NASA Astrophysics Data System (ADS)
Tang, Wei; Liao, Mingsheng; Zhang, Lu; Li, Wei; Yu, Weimin
2016-09-01
A high spatial and temporal resolution of the precipitable water vapour (PWV) in the atmosphere is a key requirement for the short-scale weather forecasting and climate research. The aim of this work is to derive temporally differenced maps of the spatial distribution of PWV by analysing the tropospheric delay "noise" in interferometric synthetic aperture radar (InSAR). Time series maps of differential PWV were obtained by processing a set of ENVISAT ASAR (Advanced Synthetic Aperture Radar) images covering the area of southern California, USA from 6 October 2007 to 29 November 2008. To get a more accurate PWV, the component of hydrostatic delay was calculated and subtracted by using ERA-Interim reanalysis products. In addition, the ERA-Interim was used to compute the conversion factors required to convert the zenith wet delay to water vapour. The InSAR-derived differential PWV maps were calibrated by means of the GPS PWV measurements over the study area. We validated our results against the measurements of PWV derived from the Medium Resolution Imaging Spectrometer (MERIS) which was located together with the ASAR sensor on board the ENVISAT satellite. Our comparative results show strong spatial correlations between the two data sets. The difference maps have Gaussian distributions with mean values close to zero and standard deviations below 2 mm. The advantage of the InSAR technique is that it provides water vapour distribution with a spatial resolution as fine as 20 m and an accuracy of ˜ 2 mm. Such high-spatial-resolution maps of PWV could lead to much greater accuracy in meteorological understanding and quantitative precipitation forecasts. With the launch of Sentinel-1A and Sentinel-1B satellites, every few days (6 days) new SAR images can be acquired with a wide swath up to 250 km, enabling a unique operational service for InSAR-based water vapour maps with unprecedented spatial and temporal resolution.
Gao, Shuai; Yan, Liying; Wang, Rui; Li, Jingyun; Yong, Jun; Zhou, Xin; Wei, Yuan; Wu, Xinglong; Wang, Xiaoye; Fan, Xiaoying; Yan, Jie; Zhi, Xu; Gao, Yun; Guo, Hongshan; Jin, Xiao; Wang, Wendong; Mao, Yunuo; Wang, Fengchao; Wen, Lu; Fu, Wei; Ge, Hao; Qiao, Jie; Tang, Fuchou
2018-06-01
The development of the digestive tract is critical for proper food digestion and nutrient absorption. Here, we analyse the main organs of the digestive tract, including the oesophagus, stomach, small intestine and large intestine, from human embryos between 6 and 25 weeks of gestation as well as the large intestine from adults using single-cell RNA-seq analyses. In total, 5,227 individual cells are analysed and 40 cell types clearly identified. Their crucial biological features, including developmental processes, signalling pathways, cell cycle, nutrient digestion and absorption metabolism, and transcription factor networks, are systematically revealed. Moreover, the differentiation and maturation processes of the large intestine are thoroughly investigated by comparing the corresponding transcriptome profiles between embryonic and adult stages. Our work offers a rich resource for investigating the gene regulation networks of the human fetal digestive tract and adult large intestine at single-cell resolution.
Pluto Topography and Composition Map
2017-09-28
These maps are from New Horizons' data on the topography (top) and composition (bottom) of Pluto's surface. In the high-resolution topographical map, the highlighted red region is high in elevation. The map below, showing the composition, indicates the same section also contains methane, color-coded in orange. One can see the orange features spread into the fuzzier, lower-resolution data that covers the rest of the globe, meaning those areas, too, are high in methane, and therefore likely to be high in elevation. https://photojournal.jpl.nasa.gov/catalog/PIA22036
DOE Office of Scientific and Technical Information (OSTI.GOV)
BIRKEL, GARRETT; GARCIA MARTIN, HECTOR; MORRELL, WILLIAM
"Arrowland" is a web-based software application primarily for mapping, integrating and visualizing a variety of metabolism data of living organisms, including but not limited to metabolomics, proteomics, transcriptomics and fluxomics. This software application makes multi-omics data analysis intuitive and interactive. It improves data sharing and communication by enabling users to visualize their omics data using a web browser (on a PC or mobile device). It increases user's productivity by simplifying multi-omics data analysis using well developed maps as a guide. Users using this tool can gain insights into their data sets that would be difficult or even impossible to teasemore » out by looking at raw number, or using their currently existing toolchains to generate static single-use maps. Arrowland helps users save time by visualizing relative changes in different conditions or over time, and helps users to produce more significant insights faster. Preexisting maps decrease the learning curve for beginners in the omics field. Sets of multi-omics data are presented in the browser, as a two-dimensional flowchart resembling a map, with varying levels of detail information, based on the scaling of the map. Users can pan and zoom to explore different maps, compare maps, upload their own research data sets onto desired maps, alter map appearance in ways that facilitate interpretation, visualization and analysis of the given data, and export data, reports and actionable items to help the user initiative.« less
2011-01-01
Background Addition of sugar syrups to the basic wort is a popular technique to achieve higher gravity in beer fermentations, but it results in dilution of the free amino nitrogen (FAN) content in the medium. The multicomponent protease enzyme Flavourzyme has beneficial effect on the brewer's yeast fermentation performance during high gravity fermentations as it increases the initial FAN value and results in higher FAN uptake, higher specific growth rate, higher ethanol yield and improved flavour profile. Results In the present study, transcriptome and metabolome analysis were used to elucidate the effect on the addition of the multicomponent protease enzyme Flavourzyme and its influence on the metabolism of the brewer's yeast strain Weihenstephan 34/70. The study underlines the importance of sufficient nitrogen availability during the course of beer fermentation. The applied metabolome and transcriptome analysis allowed mapping the effect of the wort sugar composition on the nitrogen uptake. Conclusion Both the transcriptome and the metabolome analysis revealed that there is a significantly higher impact of protease addition for maltose syrup supplemented fermentations, while addition of glucose syrup to increase the gravity in the wort resulted in increased glucose repression that lead to inhibition of amino acid uptake and hereby inhibited the effect of the protease addition. PMID:21513553
Genome-wide proteomics analysis on longissimus muscles in Qinchuan beef cattle.
He, Hua; Chen, Si; Liang, Wei; Liu, Xiaolin
2017-04-01
To gain further insight into the molecular mechanism of bovine muscle development, we combined mass spectrometry characterization of proteins with Illumina deep sequencing of RNAs obtained from bovine longissimus muscle (LD) at prenatal and postnatal stages. For the proteomic study, each group of LD proteins was extracted and labeled using isobaric tags for relative and absolute quantitation (iTRAQ) method. Among the 1321 proteins identified from six samples, 390 proteins were differentially expressed in embryos at day 135 post-fertilization (Emb135d) vs. 30-month-old adult cattle (Emb135d vs. 30M) samples. Gene Ontology, Cluster of Orthologous Groups and Kyoto Encyclopedia of Genes and Genomes analyses were further conducted to better understand the different functions. Furthermore, we analyzed the relationship between transcript and protein regulation between samples by direct comparison of expression levels from transcriptomic and iTRAQ-based proteomics. Association results indicated that 1295 of 1321 proteins could be mapped to transcriptome sequencing data. This study provides the most comprehensive, targeted survey of bovine LD proteins to date and has shown the power of combining transcriptomic and proteomic approaches to provide molecular insights for understanding the developmental characteristics in bovine muscle, and even in other mammals. © 2016 Stichting International Foundation for Animal Genetics.
Transcription Profiling Analysis of Mango–Fusarium mangiferae Interaction
Liu, Feng; Wu, Jing-bo; Zhan, Ru-lin; Ou, Xiong-chang
2016-01-01
Malformation caused by Fusarium mangiferae is one of the most destructive mango diseases affecting the canopy and floral development, leading to dramatic reduction in fruit yield. To further understand the mechanism of interaction between mango and F. mangiferae, we monitored the transcriptome profiles of buds from susceptible mango plants, which were challenged with F. mangiferae. More than 99 million reads were deduced by RNA-sequencing and were assembled into 121,267 unigenes. Based on the sequence similarity searches, 61,706 unigenes were identified, of which 21,273 and 50,410 were assigned to gene ontology categories and clusters of orthologous groups, respectively, and 33,243 were mapped to 119 KEGG pathways. The differentially expressed genes of mango were detected, having 15,830, 26,061, and 20,146 DEGs respectively, after infection for 45, 75, and 120 days. The analysis of the comparative transcriptome suggests that basic defense mechanisms play important roles in disease resistance. The data also show the transcriptional responses of interactions between mango and the pathogen and more drastic changes in the host transcriptome in response to the pathogen. These results could be used to develop new methods to broaden the resistance of mango to malformation, including the over-expression of key mango genes. PMID:27683574
Cândido, Elizabete de Souza; Fernandes, Gabriel da Rocha; de Alencar, Sérgio Amorim; Cardoso, Marlon Henrique e Silva; Lima, Stella Maris de Freitas; Miranda, Vívian de Jesus; Porto, William Farias; Nolasco, Diego Oliveira; de Oliveira-Júnior, Nelson Gomes; Barbosa, Aulus Estevão Anjos de Deus; Pogue, Robert Edward; Rezende, Taia Maria Berto; Dias, Simoni Campos; Franco, Octávio Luiz
2014-01-01
Zantedeschia aethiopica is an evergreen perennial plant cultivated worldwide and commonly used for ornamental and medicinal purposes including the treatment of bacterial infections. However, the current understanding of molecular and physiological mechanisms in this plant is limited, in comparison to other non-model plants. In order to improve understanding of the biology of this botanical species, RNA-Seq technology was used for transcriptome assembly and characterization. Following Z. aethiopica spathe tissue RNA extraction, high-throughput RNA sequencing was performed with the aim of obtaining both abundant and rare transcript data. Functional profiling based on KEGG Orthology (KO) analysis highlighted contigs that were involved predominantly in genetic information (37%) and metabolism (34%) processes. Predicted proteins involved in the plant circadian system, hormone signal transduction, secondary metabolism and basal immunity are described here. In silico screening of the transcriptome data set for antimicrobial peptide (AMP) –encoding sequences was also carried out and three lipid transfer proteins (LTP) were identified as potential AMPs involved in plant defense. Spathe predicted protein maps were drawn, and suggested that major plant efforts are expended in guaranteeing the maintenance of cell homeostasis, characterized by high investment in carbohydrate, amino acid and energy metabolism as well as in genetic information. PMID:24614014
Shemesh-Mayer, Einat; Ben-Michael, Tomer; Rotem, Neta; Rabinowitch, Haim D.; Doron-Faigenboim, Adi; Kosmala, Arkadiusz; Perlikowski, Dawid; Sherman, Amir; Kamenetsky, Rina
2015-01-01
Commercial cultivars of garlic, a popular condiment, are sterile, making genetic studies and breeding of this plant challenging. However, recent fertility restoration has enabled advanced physiological and genetic research and hybridization in this important crop. Morphophysiological studies, combined with transcriptome and proteome analyses and quantitative PCR validation, enabled the identification of genes and specific processes involved in gametogenesis in fertile and male-sterile garlic genotypes. Both genotypes exhibit normal meiosis at early stages of anther development, but in the male-sterile plants, tapetal hypertrophy after microspore release leads to pollen degeneration. Transcriptome analysis and global gene-expression profiling showed that >16,000 genes are differentially expressed in the fertile vs. male-sterile developing flowers. Proteome analysis and quantitative comparison of 2D-gel protein maps revealed 36 significantly different protein spots, 9 of which were present only in the male-sterile genotype. Bioinformatic and quantitative PCR validation of 10 candidate genes exhibited significant expression differences between male-sterile and fertile flowers. A comparison of morphophysiological and molecular traits of fertile and male-sterile garlic flowers suggests that respiratory restrictions and/or non-regulated programmed cell death of the tapetum can lead to energy deficiency and consequent pollen abortion. Potential molecular markers for male fertility and sterility in garlic are proposed. PMID:25972879
The genomic and transcriptomic architecture of 2,000 breast tumours reveals novel subgroups.
Curtis, Christina; Shah, Sohrab P; Chin, Suet-Feung; Turashvili, Gulisa; Rueda, Oscar M; Dunning, Mark J; Speed, Doug; Lynch, Andy G; Samarajiwa, Shamith; Yuan, Yinyin; Gräf, Stefan; Ha, Gavin; Haffari, Gholamreza; Bashashati, Ali; Russell, Roslin; McKinney, Steven; Langerød, Anita; Green, Andrew; Provenzano, Elena; Wishart, Gordon; Pinder, Sarah; Watson, Peter; Markowetz, Florian; Murphy, Leigh; Ellis, Ian; Purushotham, Arnie; Børresen-Dale, Anne-Lise; Brenton, James D; Tavaré, Simon; Caldas, Carlos; Aparicio, Samuel
2012-04-18
The elucidation of breast cancer subgroups and their molecular drivers requires integrated views of the genome and transcriptome from representative numbers of patients. We present an integrated analysis of copy number and gene expression in a discovery and validation set of 997 and 995 primary breast tumours, respectively, with long-term clinical follow-up. Inherited variants (copy number variants and single nucleotide polymorphisms) and acquired somatic copy number aberrations (CNAs) were associated with expression in ~40% of genes, with the landscape dominated by cis- and trans-acting CNAs. By delineating expression outlier genes driven in cis by CNAs, we identified putative cancer genes, including deletions in PPP2R2A, MTAP and MAP2K4. Unsupervised analysis of paired DNA–RNA profiles revealed novel subgroups with distinct clinical outcomes, which reproduced in the validation cohort. These include a high-risk, oestrogen-receptor-positive 11q13/14 cis-acting subgroup and a favourable prognosis subgroup devoid of CNAs. Trans-acting aberration hotspots were found to modulate subgroup-specific gene networks, including a TCR deletion-mediated adaptive immune response in the ‘CNA-devoid’ subgroup and a basal-specific chromosome 5 deletion-associated mitotic network. Our results provide a novel molecular stratification of the breast cancer population, derived from the impact of somatic CNAs on the transcriptome.
Young, Neil D.; Jex, Aaron R.; Cantacessi, Cinzia; Hall, Ross S.; Campbell, Bronwyn E.; Spithill, Terence W.; Tangkawattana, Sirikachorn; Tangkawattana, Prasarn; Laha, Thewarach; Gasser, Robin B.
2011-01-01
Fasciola gigantica (Digenea) is an important foodborne trematode that causes liver fluke disease (fascioliasis) in mammals, including ungulates and humans, mainly in tropical climatic zones of the world. Despite its socioeconomic impact, almost nothing is known about the molecular biology of this parasite, its interplay with its hosts, and the pathogenesis of fascioliasis. Modern genomic technologies now provide unique opportunities to rapidly tackle these exciting areas. The present study reports the first transcriptome representing the adult stage of F. gigantica (of bovid origin), defined using a massively parallel sequencing-coupled bioinformatic approach. From >20 million raw sequence reads, >30,000 contiguous sequences were assembled, of which most were novel. Relative levels of transcription were determined for individual molecules, which were also characterized (at the inferred amino acid level) based on homology, gene ontology, and/or pathway mapping. Comparisons of the transcriptome of F. gigantica with those of other trematodes, including F. hepatica, revealed similarities in transcription for molecules inferred to have key roles in parasite-host interactions. Overall, the present dataset should provide a solid foundation for future fundamental genomic, proteomic, and metabolomic explorations of F. gigantica, as well as a basis for applied outcomes such as the development of novel methods of intervention against this neglected parasite. PMID:21408104
2013-01-01
Background Previous experiments have shown that the reduced gravity aboard the International Space Station (ISS) causes important alterations in Drosophila gene expression. These changes were shown to be intimately linked to environmental space-flight related constraints. Results Here, we use an array of different techniques for ground-based simulation of microgravity effects to assess the effect of suboptimal environmental conditions on the gene expression of Drosophila in reduced gravity. A global and integrative analysis, using “gene expression dynamics inspector” (GEDI) self-organizing maps, reveals different degrees in the responses of the transcriptome when using different environmental conditions or microgravity/hypergravity simulation devices. Although the genes that are affected are different in each simulation technique, we find that the same gene ontology groups, including at least one large multigene family related with behavior, stress response or organogenesis, are over represented in each case. Conclusions These results suggest that the transcriptome as a whole can be finely tuned to gravity force. In optimum environmental conditions, the alteration of gravity has only mild effects on gene expression but when environmental conditions are far from optimal, the gene expression must be tuned greatly and effects become more robust, probably linked to the lack of experience of organisms exposed to evolutionary novel environments such as a gravitational free one. PMID:23806134
Yang, Mei; Zhu, Lingping; Pan, Cheng; Xu, Liming; Liu, Yanling; Ke, Weidong; Yang, Pingfang
2015-08-17
Rhizome is the storage organ of lotus derived from modified stems. The development of rhizome is a complex process and depends on the balanced expression of the genes that is controlled by environmental and endogenous factors. However, little is known about the mechanism that regulates rhizome girth enlargement. In this study, using RNA-seq, transcriptomic analyses were performed at three rhizome developmental stages-the stolon, middle swelling and later swelling stage -in the cultivars 'ZO' (temperate lotus with enlarged rhizome) and 'RL' (tropical lotus with stolon). About 348 million high-quality reads were generated, and 88.5% of the data were mapped to the reference genome. Of 26783 genes identified, 24069 genes were previously predicted in the reference, and 2714 genes were novel transcripts. Moreover, 8821 genes were differentially expressed between the cultivars at the three stages. Functional analysis identified that these genes were significantly enriched in pathways carbohydrate metabolism and plant hormone signal transduction. Twenty-two genes involved in photoperiod pathway, starch metabolism and hormone signal transduction were candidate genes inducing rhizome girth enlargement. Comparative transcriptomic analysis detected several differentially expressed genes and potential candidate genes required for rhizome girth enlargement, which lay a foundation for future studies on molecular mechanisms underlying rhizome formation.
Yang, Mei; Zhu, Lingping; Pan, Cheng; Xu, Liming; Liu, Yanling; Ke, Weidong; Yang, Pingfang
2015-01-01
Rhizome is the storage organ of lotus derived from modified stems. The development of rhizome is a complex process and depends on the balanced expression of the genes that is controlled by environmental and endogenous factors. However, little is known about the mechanism that regulates rhizome girth enlargement. In this study, using RNA-seq, transcriptomic analyses were performed at three rhizome developmental stages—the stolon, middle swelling and later swelling stage —in the cultivars ‘ZO’ (temperate lotus with enlarged rhizome) and ‘RL’ (tropical lotus with stolon). About 348 million high-quality reads were generated, and 88.5% of the data were mapped to the reference genome. Of 26783 genes identified, 24069 genes were previously predicted in the reference, and 2714 genes were novel transcripts. Moreover, 8821 genes were differentially expressed between the cultivars at the three stages. Functional analysis identified that these genes were significantly enriched in pathways carbohydrate metabolism and plant hormone signal transduction. Twenty-two genes involved in photoperiod pathway, starch metabolism and hormone signal transduction were candidate genes inducing rhizome girth enlargement. Comparative transcriptomic analysis detected several differentially expressed genes and potential candidate genes required for rhizome girth enlargement, which lay a foundation for future studies on molecular mechanisms underlying rhizome formation. PMID:26279185
Wang, Zegang; Tang, Kai; Zhang, Dayong; Wan, Yizhen; Wen, Yan; Lu, Quanyou; Wang, Lei
2017-01-01
This study is the first to comprehensively characterize m6A patterns in the Arabidopsis chloroplast and mitochondria transcriptomes based on our open accessible data deposited in NCBI's Gene Expression Omnibus with GEO Series accession number of GSE72706. Over 86% of the transcripts were methylated by m6A in the two organelles. Over 550 and 350 m6A sites were mapped, with ~5.6 to ~5.8 and ~4.6 to ~4.9 m6A sites per transcript, to the chloroplast and mitochondria genome, respectively. The overall m6A methylation extent in the two organelles was greatly higher than that in the nucleus. The m6A motif sequences in the transcriptome of two organelles were similar to the nuclear motifs, suggesting that selection of the m6A motifs for RNA methylation was conserved between the nucleus and organelle transcriptomes. The m6A patterns of rRNAs and tRNAs in the organelle were similar to those in the nucleus. However, the m6A patterns in coding RNAs were distinct between the nucleus and the organelle, suggesting that that regulation of the m6A methylation patterns may be different between the nuclei and the organelles. The extensively methylated transcripts in the two organelles were mainly associated with rRNA, ribosomal proteins, photosystem reaction proteins, tRNA, NADH dehydrogenase and redox. On average, 64% and 79% of the transcripts in the two organelles showed differential m6A methylation across three organs of the leaves, flowers and roots. The m6A methylation extent in the chloroplast was higher than that in the mitochondria. This study provides deep insights into the m6A methylation topology and differentiation in the plant organelle transcriptomes.
Rossouw, Debra; Næs, Tormod; Bauer, Florian F
2008-01-01
Background 'Omics' tools provide novel opportunities for system-wide analysis of complex cellular functions. Secondary metabolism is an example of a complex network of biochemical pathways, which, although well mapped from a biochemical point of view, is not well understood with regards to its physiological roles and genetic and biochemical regulation. Many of the metabolites produced by this network such as higher alcohols and esters are significant aroma impact compounds in fermentation products, and different yeast strains are known to produce highly divergent aroma profiles. Here, we investigated whether we can predict the impact of specific genes of known or unknown function on this metabolic network by combining whole transcriptome and partial exo-metabolome analysis. Results For this purpose, the gene expression levels of five different industrial wine yeast strains that produce divergent aroma profiles were established at three different time points of alcoholic fermentation in synthetic wine must. A matrix of gene expression data was generated and integrated with the concentrations of volatile aroma compounds measured at the same time points. This relatively unbiased approach to the study of volatile aroma compounds enabled us to identify candidate genes for aroma profile modification. Five of these genes, namely YMR210W, BAT1, AAD10, AAD14 and ACS1 were selected for overexpression in commercial wine yeast, VIN13. Analysis of the data show a statistically significant correlation between the changes in the exo-metabome of the overexpressing strains and the changes that were predicted based on the unbiased alignment of transcriptomic and exo-metabolomic data. Conclusion The data suggest that a comparative transcriptomics and metabolomics approach can be used to identify the metabolic impacts of the expression of individual genes in complex systems, and the amenability of transcriptomic data to direct applications of biotechnological relevance. PMID:18990252
NASA Astrophysics Data System (ADS)
Macander, M. J.; Frost, G. V., Jr.
2015-12-01
Regional-scale mapping of vegetation and other ecosystem properties has traditionally relied on medium-resolution remote sensing such as Landsat (30 m) and MODIS (250 m). Yet, the burgeoning availability of high-resolution (<=2 m) imagery and ongoing advances in computing power and analysis tools raises the prospect of performing ecosystem mapping at fine spatial scales over large study domains. Here we demonstrate cutting-edge mapping approaches over a ~35,000 km² study area on Alaska's North Slope using calibrated and atmospherically-corrected mosaics of high-resolution WorldView-2 and GeoEye-1 imagery: (1) an a priori spectral approach incorporating the Satellite Imagery Automatic Mapper (SIAM) algorithms; (2) image segmentation techniques; and (3) texture metrics. The SIAM spectral approach classifies radiometrically-calibrated imagery to general vegetation density categories and non-vegetated classes. The SIAM classes were developed globally and their applicability in arctic tundra environments has not been previously evaluated. Image segmentation, or object-based image analysis, automatically partitions high-resolution imagery into homogeneous image regions that can then be analyzed based on spectral, textural, and contextual information. We applied eCognition software to delineate waterbodies and vegetation classes, in combination with other techniques. Texture metrics were evaluated to determine the feasibility of using high-resolution imagery to algorithmically characterize periglacial surface forms (e.g., ice-wedge polygons), which are an important physical characteristic of permafrost-dominated regions but which cannot be distinguished by medium-resolution remote sensing. These advanced mapping techniques provide products which can provide essential information supporting a broad range of ecosystem science and land-use planning applications in northern Alaska and elsewhere in the circumpolar Arctic.
Two-component Thermal Dust Emission Model: Application to the Planck HFI Maps
NASA Astrophysics Data System (ADS)
Meisner, Aaron M.; Finkbeiner, Douglas P.
2014-06-01
We present full-sky, 6.1 arcminute resolution maps of dust optical depth and temperature derived by fitting the Finkbeiner et al. (1999) two-component dust emission model to the Planck HFI and IRAS 100 micron maps. This parametrization of the far infrared thermal dust SED as the sum of two modified blackbodies serves as an important alternative to the commonly adopted single modified blackbody dust emission model. We expect our Planck-based maps of dust temperature and optical depth to form the basis for a next-generation, high-resolution extinction map which will additionally incorporate small-scale detail from WISE imaging.
Abdallah, Abdallah M.; Hill-Cawthorne, Grant A.; Otto, Thomas D.; Coll, Francesc; Guerra-Assunção, José Afonso; Gao, Ge; Naeem, Raeece; Ansari, Hifzur; Malas, Tareq B.; Adroub, Sabir A.; Verboom, Theo; Ummels, Roy; Zhang, Huoming; Panigrahi, Aswini Kumar; McNerney, Ruth; Brosch, Roland; Clark, Taane G.; Behr, Marcel A.; Bitter, Wilbert; Pain, Arnab
2015-01-01
Although Bacillus Calmette-Guérin (BCG) vaccines against tuberculosis have been available for more than 90 years, their effectiveness has been hindered by variable protective efficacy and a lack of lasting memory responses. One factor contributing to this variability may be the diversity of the BCG strains that are used around the world, in part from genomic changes accumulated during vaccine production and their resulting differences in gene expression. We have compared the genomes and transcriptomes of a global collection of fourteen of the most widely used BCG strains at single base-pair resolution. We have also used quantitative proteomics to identify key differences in expression of proteins across five representative BCG strains of the four tandem duplication (DU) groups. We provide a comprehensive map of single nucleotide polymorphisms (SNPs), copy number variation and insertions and deletions (indels) across fourteen BCG strains. Genome-wide SNP characterization allowed the construction of a new and robust phylogenic genealogy of BCG strains. Transcriptional and proteomic profiling revealed a metabolic remodeling in BCG strains that may be reflected by altered immunogenicity and possibly vaccine efficacy. Together, these integrated-omic data represent the most comprehensive catalogue of genetic variation across a global collection of BCG strains. PMID:26487098
A high-resolution network model for global gene regulation in Mycobacterium tuberculosis
Peterson, Eliza J.R.; Reiss, David J.; Turkarslan, Serdar; Minch, Kyle J.; Rustad, Tige; Plaisier, Christopher L.; Longabaugh, William J.R.; Sherman, David R.; Baliga, Nitin S.
2014-01-01
The resilience of Mycobacterium tuberculosis (MTB) is largely due to its ability to effectively counteract and even take advantage of the hostile environments of a host. In order to accelerate the discovery and characterization of these adaptive mechanisms, we have mined a compendium of 2325 publicly available transcriptome profiles of MTB to decipher a predictive, systems-scale gene regulatory network model. The resulting modular organization of 98% of all MTB genes within this regulatory network was rigorously tested using two independently generated datasets: a genome-wide map of 7248 DNA-binding locations for 143 transcription factors (TFs) and global transcriptional consequences of overexpressing 206 TFs. This analysis has discovered specific TFs that mediate conditional co-regulation of genes within 240 modules across 14 distinct environmental contexts. In addition to recapitulating previously characterized regulons, we discovered 454 novel mechanisms for gene regulation during stress, cholesterol utilization and dormancy. Significantly, 183 of these mechanisms act uniquely under conditions experienced during the infection cycle to regulate diverse functions including 23 genes that are essential to host-pathogen interactions. These and other insights underscore the power of a rational, model-driven approach to unearth novel MTB biology that operates under some but not all phases of infection. PMID:25232098
Chintapalli, Venkateswara R; Al Bratty, Mohammed; Korzekwa, Dominika; Watson, David G; Dow, Julian A T
2013-01-01
Metabolomics can provide exciting insights into organismal function, but most work on simple models has focussed on the whole organism metabolome, so missing the contributions of individual tissues. Comprehensive metabolite profiles for ten tissues from adult Drosophila melanogaster were obtained here by two chromatographic methods, a hydrophilic interaction (HILIC) method for polar metabolites and a lipid profiling method also based on HILIC, in combination with an Orbitrap Exactive instrument. Two hundred and forty two polar metabolites were putatively identified in the various tissues, and 251 lipids were observed in positive ion mode and 61 in negative ion mode. Although many metabolites were detected in all tissues, every tissue showed characteristically abundant metabolites which could be rationalised against specific tissue functions. For example, the cuticle contained high levels of glutathione, reflecting a role in oxidative defence; the alimentary canal (like vertebrate gut) had high levels of acylcarnitines for fatty acid metabolism, and the head contained high levels of ether lipids. The male accessory gland uniquely contained decarboxylated S-adenosylmethionine. These data thus both provide valuable insights into tissue function, and a reference baseline, compatible with the FlyAtlas.org transcriptomic resource, for further metabolomic analysis of this important model organism, for example in the modelling of human inborn errors of metabolism, aging or metabolic imbalances such as diabetes.
A high-resolution cattle CNV map by population-scale genome sequencing
USDA-ARS?s Scientific Manuscript database
Copy Number Variations (CNVs) are common genomic structural variations that have been linked to human diseases and phenotypic traits. CNVs represent an important type of genetic variation among cattle breeds and even individual animals; however, only low-resolution maps of cattle CNVs currently exis...
Large-Scale, High-Resolution Neurophysiological Maps Underlying fMRI of Macaque Temporal Lobe
Papanastassiou, Alex M.; DiCarlo, James J.
2013-01-01
Maps obtained by functional magnetic resonance imaging (fMRI) are thought to reflect the underlying spatial layout of neural activity. However, previous studies have not been able to directly compare fMRI maps to high-resolution neurophysiological maps, particularly in higher level visual areas. Here, we used a novel stereo microfocal x-ray system to localize thousands of neural recordings across monkey inferior temporal cortex (IT), construct large-scale maps of neuronal object selectivity at subvoxel resolution, and compare those neurophysiology maps with fMRI maps from the same subjects. While neurophysiology maps contained reliable structure at the sub-millimeter scale, fMRI maps of object selectivity contained information at larger scales (>2.5 mm) and were only partly correlated with raw neurophysiology maps collected in the same subjects. However, spatial smoothing of neurophysiology maps more than doubled that correlation, while a variety of alternative transforms led to no significant improvement. Furthermore, raw spiking signals, once spatially smoothed, were as predictive of fMRI maps as local field potential signals. Thus, fMRI of the inferior temporal lobe reflects a spatially low-passed version of neurophysiology signals. These findings strongly validate the widespread use of fMRI for detecting large (>2.5 mm) neuronal domains of object selectivity but show that a complete understanding of even the most pure domains (e.g., faces vs nonface objects) requires investigation at fine scales that can currently only be obtained with invasive neurophysiological methods. PMID:24048850
Guitet, Stéphane; Hérault, Bruno; Molto, Quentin; Brunaux, Olivier; Couteron, Pierre
2015-01-01
Precise mapping of above-ground biomass (AGB) is a major challenge for the success of REDD+ processes in tropical rainforest. The usual mapping methods are based on two hypotheses: a large and long-ranged spatial autocorrelation and a strong environment influence at the regional scale. However, there are no studies of the spatial structure of AGB at the landscapes scale to support these assumptions. We studied spatial variation in AGB at various scales using two large forest inventories conducted in French Guiana. The dataset comprised 2507 plots (0.4 to 0.5 ha) of undisturbed rainforest distributed over the whole region. After checking the uncertainties of estimates obtained from these data, we used half of the dataset to develop explicit predictive models including spatial and environmental effects and tested the accuracy of the resulting maps according to their resolution using the rest of the data. Forest inventories provided accurate AGB estimates at the plot scale, for a mean of 325 Mg.ha-1. They revealed high local variability combined with a weak autocorrelation up to distances of no more than10 km. Environmental variables accounted for a minor part of spatial variation. Accuracy of the best model including spatial effects was 90 Mg.ha-1 at plot scale but coarse graining up to 2-km resolution allowed mapping AGB with accuracy lower than 50 Mg.ha-1. Whatever the resolution, no agreement was found with available pan-tropical reference maps at all resolutions. We concluded that the combined weak autocorrelation and weak environmental effect limit AGB maps accuracy in rainforest, and that a trade-off has to be found between spatial resolution and effective accuracy until adequate “wall-to-wall” remote sensing signals provide reliable AGB predictions. Waiting for this, using large forest inventories with low sampling rate (<0.5%) may be an efficient way to increase the global coverage of AGB maps with acceptable accuracy at kilometric resolution. PMID:26402522
Orbital-science investigation: Part C: photogrammetry of Apollo 15 photography
Wu, Sherman S.C.; Schafer, Francis J.; Jordan, Raymond; Nakata, Gary M.; Derick, James L.
1972-01-01
Mapping of large areas of the Moon by photogrammetric methods was not seriously considered until the Apollo 15 mission. In this mission, a mapping camera system and a 61-cm optical-bar high-resolution panoramic camera, as well as a laser altimeter, were used. The mapping camera system comprises a 7.6-cm metric terrain camera and a 7.6-cm stellar camera mounted in a fixed angular relationship (an angle of 96° between the two camera axes). The metric camera has a glass focal-plane plate with reseau grids. The ground-resolution capability from an altitude of 110 km is approximately 20 m. Because of the auxiliary stellar camera and the laser altimeter, the resulting metric photography can be used not only for medium- and small-scale cartographic or topographic maps, but it also can provide a basis for establishing a lunar geodetic network. The optical-bar panoramic camera has a 135- to 180-line resolution, which is approximately 1 to 2 m of ground resolution from an altitude of 110 km. Very large scale specialized topographic maps for supporting geologic studies of lunar-surface features can be produced from the stereoscopic coverage provided by this camera.
Theoretical considerations for mapping activation in human cardiac fibrillation
NASA Astrophysics Data System (ADS)
Rappel, Wouter-Jan; Narayan, Sanjiv M.
2013-06-01
Defining mechanisms for cardiac fibrillation is challenging because, in contrast to other arrhythmias, fibrillation exhibits complex non-repeatability in spatiotemporal activation but paradoxically exhibits conserved spatial gradients in rate, dominant frequency, and electrical propagation. Unlike animal models, in which fibrillation can be mapped at high spatial and temporal resolution using optical dyes or arrays of contact electrodes, mapping of cardiac fibrillation in patients is constrained practically to lower resolutions or smaller fields-of-view. In many animal models, atrial fibrillation is maintained by localized electrical rotors and focal sources. However, until recently, few studies had revealed localized sources in human fibrillation, so that the impact of mapping constraints on the ability to identify rotors or focal sources in humans was not described. Here, we determine the minimum spatial and temporal resolutions theoretically required to detect rigidly rotating spiral waves and focal sources, then extend these requirements for spiral waves in computer simulations. Finally, we apply our results to clinical data acquired during human atrial fibrillation using a novel technique termed focal impulse and rotor mapping (FIRM). Our results provide theoretical justification and clinical demonstration that FIRM meets the spatio-temporal resolution requirements to reliably identify rotors and focal sources for human atrial fibrillation.
High Resolution IRAS Maps and IR Emission of M31 -- II. Diffuse Component and Interstellar Dust
NASA Technical Reports Server (NTRS)
Xu, C.; Helou, G.
1995-01-01
Large-scale dust heating and cooling in the diffuse medium of M31 is studied using the high resolution (HiRes) IRAS maps in conjunction with UV, optical (UBV), and the HI maps. A dust heating/cooling model is developed based on a radiative transfer model which assumes a 'Sandwich' configuration of dust and stars takes account of the effect of dust grain scattering.
Salivary Polytene Chromosome Map of Anopheles darlingi, the Main Vector of Neotropical Malaria
Rafael, Míriam S.; Rohde, Cláudia; Bridi, Letícia C.; da Silva Valente Gaiesky, Vera Lúcia; Tadei, Wanderli P.
2010-01-01
New photomap of Anopheles (Nyssorhynchus) darlingi Root, 1926, is described for a population from Guajará-Mirim, State of Rondonia, Brazil. The number of sections in the previous A. darlingi reference map was maintained and new subsections were added to the five chromosome arms. Breakage points of paracentric inversions had been previously incorporated into the photomap of this species. An additional inversion is reported, called 3Lc, totaling 14 inversions in the A. darlingi chromosome arms. The proposed photomap is potentially useful for further evolutionary studies in addition to physical and in silico chromosome mapping using A. darlingi genomic and transcriptome sequences. Furthermore, in our attempt to compare sections of the 2R chromosome arm of A. darlingi with Anopheles funestus, Anopheles stephensi, and Anopheles gambiae, we found great differences in the arrangement of the polytene chromosome bands, which are consistent with the known phylogenetic divergence of these species. PMID:20682862
2013-01-01
Background Although banana (Musa sp.) is an important edible crop, contributing towards poverty alleviation and food security, limited transcriptome datasets are available for use in accelerated molecular-based breeding in this genus. 454 GS-FLX Titanium technology was employed to determine the sequence of gene transcripts in genotypes of Musa acuminata ssp. burmannicoides Calcutta 4 and M. acuminata subgroup Cavendish cv. Grande Naine, contrasting in resistance to the fungal pathogen Mycosphaerella musicola, causal organism of Sigatoka leaf spot disease. To enrich for transcripts under biotic stress responses, full length-enriched cDNA libraries were prepared from whole plant leaf materials, both uninfected and artificially challenged with pathogen conidiospores. Results The study generated 846,762 high quality sequence reads, with an average length of 334 bp and totalling 283 Mbp. De novo assembly generated 36,384 and 35,269 unigene sequences for M. acuminata Calcutta 4 and Cavendish Grande Naine, respectively. A total of 64.4% of the unigenes were annotated through Basic Local Alignment Search Tool (BLAST) similarity analyses against public databases. Assembled sequences were functionally mapped to Gene Ontology (GO) terms, with unigene functions covering a diverse range of molecular functions, biological processes and cellular components. Genes from a number of defense-related pathways were observed in transcripts from each cDNA library. Over 99% of contig unigenes mapped to exon regions in the reference M. acuminata DH Pahang whole genome sequence. A total of 4068 genic-SSR loci were identified in Calcutta 4 and 4095 in Cavendish Grande Naine. A subset of 95 potential defense-related gene-derived simple sequence repeat (SSR) loci were validated for specific amplification and polymorphism across M. acuminata accessions. Fourteen loci were polymorphic, with alleles per polymorphic locus ranging from 3 to 8 and polymorphism information content ranging from 0.34 to 0.82. Conclusions A large set of unigenes were characterized in this study for both M. acuminata Calcutta 4 and Cavendish Grande Naine, increasing the number of public domain Musa ESTs. This transcriptome is an invaluable resource for furthering our understanding of biological processes elicited during biotic stresses in Musa. Gene-based markers will facilitate molecular breeding strategies, forming the basis of genetic linkage mapping and analysis of quantitative trait loci. PMID:23379821
Panagopoulos, Ioannis; Gorunova, Ludmila; Bjerkehagen, Bodil; Heim, Sverre
2014-01-01
Whole transcriptome sequencing was used to study a small round cell tumor in which a t(4;19)(q35;q13) was part of the complex karyotype but where the initial reverse transcriptase PCR (RT-PCR) examination did not detect a CIC-DUX4 fusion transcript previously described as the crucial gene-level outcome of this specific translocation. The RNA sequencing data were analysed using the FusionMap, FusionFinder, and ChimeraScan programs which are specifically designed to identify fusion genes. FusionMap, FusionFinder, and ChimeraScan identified 1017, 102, and 101 fusion transcripts, respectively, but CIC-DUX4 was not among them. Since the RNA sequencing data are in the fastq text-based format, we searched the files using the "grep" command-line utility. The "grep" command searches the text for specific expressions and displays, by default, the lines where matches occur. The "specific expression" was a sequence of 20 nucleotides from the coding part of the last exon 20 of CIC (Reference Sequence: NM_015125.3) chosen since all the so far reported CIC breakpoints have occurred here. Fifteen chimeric CIC-DUX4 cDNA sequences were captured and the fusion between the CIC and DUX4 genes was mapped precisely. New primer combinations were constructed based on these findings and were used together with a polymerase suitable for amplification of GC-rich DNA templates to amplify CIC-DUX4 cDNA fragments which had the same fusion point found with "grep". In conclusion, FusionMap, FusionFinder, and ChimeraScan generated a plethora of fusion transcripts but did not detect the biologically important CIC-DUX4 chimeric transcript; they are generally useful but evidently suffer from imperfect both sensitivity and specificity. The "grep" command is an excellent tool to capture chimeric transcripts from RNA sequencing data when the pathological and/or cytogenetic information strongly indicates the presence of a specific fusion gene.
Chang, Yaqing; Zhao, Wenming; Du, Zhenlin; Hao, Zhenlin
2015-01-01
Shell color is an important trait that is used in breeding the Japanese scallop Patinopecten yessoensis, the most economically important scallop species in China. We constructed four transcriptome libraries from different shell color lines of P. yessoensis: the left and right shell mantles of ordinary strains of P. yessoensis and the left shell mantles of the ‘Ivory’ and ‘Maple’ strains. These four libraries were paired-end sequenced using the Illumina HiSeq 2000 platform and contained 54,802,692 sequences, 40,798,962 sequences, 74,019,262 sequences, and 44,466,166 sequences, respectively. A total of 214,087,082 expressed sequence tags were assembled into 73,522 unigenes with an average size of 1,163 bp. When the data were compared against the public Nr and Swiss-Prot databases using BlastX, nearly 30.55% (22,458) of the unigenes were significantly matched to known unique proteins. Gene Ontology annotation and pathway mapping analysis using the Kyoto Encyclopedia of Genes and Genomes categorized unigenes according to their diverse biological functions and processes and identified candidate genes that were potentially involved in growth, pigmentation, metal transcription, and immunity. Expression profile analysis was performed on all four libraries and many differentially expressed genes were identified. In addition, 5,772 simple sequence repeats were obtained from the P. yessoensis transcriptomes, and 464,197, 395,646, and 310,649 single nucleotide polymorphisms were revealed in the ordinary strains, the ‘Ivory’ strain, and the ‘Maple’ strain, respectively. These results provide valuable information for future genomic studies on P. yessoensis and improve our understanding of the molecular mechanisms involved in the growth, immunity, shell coloring, and shell biomineralization of this species. These resources also may be used in a variety of applications, such as trait mapping, marker-assisted breeding, studies of population genetics and genomics, and work on functional genomics. PMID:25680107
A SNP resource for Douglas-fir: de novo transcriptome assembly and SNP detection and validation.
Howe, Glenn T; Yu, Jianbin; Knaus, Brian; Cronn, Richard; Kolpak, Scott; Dolan, Peter; Lorenz, W Walter; Dean, Jeffrey F D
2013-02-28
Douglas-fir (Pseudotsuga menziesii), one of the most economically and ecologically important tree species in the world, also has one of the largest tree breeding programs. Although the coastal and interior varieties of Douglas-fir (vars. menziesii and glauca) are native to North America, the coastal variety is also widely planted for timber production in Europe, New Zealand, Australia, and Chile. Our main goal was to develop a SNP resource large enough to facilitate genomic selection in Douglas-fir breeding programs. To accomplish this, we developed a 454-based reference transcriptome for coastal Douglas-fir, annotated and evaluated the quality of the reference, identified putative SNPs, and then validated a sample of those SNPs using the Illumina Infinium genotyping platform. We assembled a reference transcriptome consisting of 25,002 isogroups (unique gene models) and 102,623 singletons from 2.76 million 454 and Sanger cDNA sequences from coastal Douglas-fir. We identified 278,979 unique SNPs by mapping the 454 and Sanger sequences to the reference, and by mapping four datasets of Illumina cDNA sequences from multiple seed sources, genotypes, and tissues. The Illumina datasets represented coastal Douglas-fir (64.00 and 13.41 million reads), interior Douglas-fir (80.45 million reads), and a Yakima population similar to interior Douglas-fir (8.99 million reads). We assayed 8067 SNPs on 260 trees using an Illumina Infinium SNP genotyping array. Of these SNPs, 5847 (72.5%) were called successfully and were polymorphic. Based on our validation efficiency, our SNP database may contain as many as ~200,000 true SNPs, and as many as ~69,000 SNPs that could be genotyped at ~20,000 gene loci using an Infinium II array-more SNPs than are needed to use genomic selection in tree breeding programs. Ultimately, these genomic resources will enhance Douglas-fir breeding and allow us to better understand landscape-scale patterns of genetic variation and potential responses to climate change.
SPARTA: Simple Program for Automated reference-based bacterial RNA-seq Transcriptome Analysis.
Johnson, Benjamin K; Scholz, Matthew B; Teal, Tracy K; Abramovitch, Robert B
2016-02-04
Many tools exist in the analysis of bacterial RNA sequencing (RNA-seq) transcriptional profiling experiments to identify differentially expressed genes between experimental conditions. Generally, the workflow includes quality control of reads, mapping to a reference, counting transcript abundance, and statistical tests for differentially expressed genes. In spite of the numerous tools developed for each component of an RNA-seq analysis workflow, easy-to-use bacterially oriented workflow applications to combine multiple tools and automate the process are lacking. With many tools to choose from for each step, the task of identifying a specific tool, adapting the input/output options to the specific use-case, and integrating the tools into a coherent analysis pipeline is not a trivial endeavor, particularly for microbiologists with limited bioinformatics experience. To make bacterial RNA-seq data analysis more accessible, we developed a Simple Program for Automated reference-based bacterial RNA-seq Transcriptome Analysis (SPARTA). SPARTA is a reference-based bacterial RNA-seq analysis workflow application for single-end Illumina reads. SPARTA is turnkey software that simplifies the process of analyzing RNA-seq data sets, making bacterial RNA-seq analysis a routine process that can be undertaken on a personal computer or in the classroom. The easy-to-install, complete workflow processes whole transcriptome shotgun sequencing data files by trimming reads and removing adapters, mapping reads to a reference, counting gene features, calculating differential gene expression, and, importantly, checking for potential batch effects within the data set. SPARTA outputs quality analysis reports, gene feature counts and differential gene expression tables and scatterplots. SPARTA provides an easy-to-use bacterial RNA-seq transcriptional profiling workflow to identify differentially expressed genes between experimental conditions. This software will enable microbiologists with limited bioinformatics experience to analyze their data and integrate next generation sequencing (NGS) technologies into the classroom. The SPARTA software and tutorial are available at sparta.readthedocs.org.
Kandpal, Raj P; Rajasimha, Harsha K; Brooks, Matthew J; Nellissery, Jacob; Wan, Jun; Qian, Jiang; Kern, Timothy S; Swaroop, Anand
2012-01-01
To define gene expression changes associated with diabetic retinopathy in a mouse model using next generation sequencing, and to utilize transcriptome signatures to assess molecular pathways by which pharmacological agents inhibit diabetic retinopathy. We applied a high throughput RNA sequencing (RNA-seq) strategy using Illumina GAIIx to characterize the entire retinal transcriptome from nondiabetic and from streptozotocin-treated mice 32 weeks after induction of diabetes. Some of the diabetic mice were treated with inhibitors of receptor for advanced glycation endproducts (RAGE) and p38 mitogen activated protein (MAP) kinase, which have previously been shown to inhibit diabetic retinopathy in rodent models. The transcripts and alternatively spliced variants were determined in all experimental groups. Next generation sequencing-based RNA-seq profiles provided comprehensive signatures of transcripts that are altered in early stages of diabetic retinopathy. These transcripts encoded proteins involved in distinct yet physiologically relevant disease-associated pathways such as inflammation, microvasculature formation, apoptosis, glucose metabolism, Wnt signaling, xenobiotic metabolism, and photoreceptor biology. Significant upregulation of crystallin transcripts was observed in diabetic animals, and the diabetes-induced upregulation of these transcripts was inhibited in diabetic animals treated with inhibitors of either RAGE or p38 MAP kinase. These two therapies also showed dissimilar regulation of some subsets of transcripts that included alternatively spliced versions of arrestin, neutral sphingomyelinase activation associated factor (Nsmaf), SH3-domain GRB2-like interacting protein 1 (Sgip1), and axin. Diabetes alters many transcripts in the retina, and two therapies that inhibit the vascular pathology similarly inhibit a portion of these changes, pointing to possible molecular mechanisms for their beneficial effects. These therapies also changed the abundance of various alternatively spliced versions of signaling transcripts, suggesting a possible role of alternative splicing in disease etiology. Our studies clearly demonstrate RNA-seq as a comprehensive strategy for identifying disease-specific transcripts, and for determining comparative profiles of molecular changes mediated by candidate drugs.
Das, Pranab J; McCarthy, Fiona; Vishnoi, Monika; Paria, Nandina; Gresham, Cathy; Li, Gang; Kachroo, Priyanka; Sudderth, A Kendrick; Teague, Sheila; Love, Charles C; Varner, Dickson D; Chowdhary, Bhanu P; Raudsepp, Terje
2013-01-01
Mature mammalian sperm contain a complex population of RNAs some of which might regulate spermatogenesis while others probably play a role in fertilization and early development. Due to this limited knowledge, the biological functions of sperm RNAs remain enigmatic. Here we report the first characterization of the global transcriptome of the sperm of fertile stallions. The findings improved understanding of the biological significance of sperm RNAs which in turn will allow the discovery of sperm-based biomarkers for stallion fertility. The stallion sperm transcriptome was interrogated by analyzing sperm and testes RNA on a 21,000-element equine whole-genome oligoarray and by RNA-seq. Microarray analysis revealed 6,761 transcripts in the sperm, of which 165 were sperm-enriched, and 155 were differentially expressed between the sperm and testes. Next, 70 million raw reads were generated by RNA-seq of which 50% could be aligned with the horse reference genome. A total of 19,257 sequence tags were mapped to all horse chromosomes and the mitochondrial genome. The highest density of mapped transcripts was in gene-rich ECA11, 12 and 13, and the lowest in gene-poor ECA9 and X; 7 gene transcripts originated from ECAY. Structural annotation aligned sperm transcripts with 4,504 known horse and/or human genes, rRNAs and 82 miRNAs, whereas 13,354 sequence tags remained anonymous. The data were aligned with selected equine gene models to identify additional exons and splice variants. Gene Ontology annotations showed that sperm transcripts were associated with molecular processes (chemoattractant-activated signal transduction, ion transport) and cellular components (membranes and vesicles) related to known sperm functions at fertilization, while some messenger and micro RNAs might be critical for early development. The findings suggest that the rich repertoire of coding and non-coding RNAs in stallion sperm is not a random remnant from spermatogenesis in testes but a selectively retained and functionally coherent collection of RNAs.
Das, Pranab J.; McCarthy, Fiona; Vishnoi, Monika; Paria, Nandina; Gresham, Cathy; Li, Gang; Kachroo, Priyanka; Sudderth, A. Kendrick; Teague, Sheila; Love, Charles C.; Varner, Dickson D.; Chowdhary, Bhanu P.; Raudsepp, Terje
2013-01-01
Mature mammalian sperm contain a complex population of RNAs some of which might regulate spermatogenesis while others probably play a role in fertilization and early development. Due to this limited knowledge, the biological functions of sperm RNAs remain enigmatic. Here we report the first characterization of the global transcriptome of the sperm of fertile stallions. The findings improved understanding of the biological significance of sperm RNAs which in turn will allow the discovery of sperm-based biomarkers for stallion fertility. The stallion sperm transcriptome was interrogated by analyzing sperm and testes RNA on a 21,000-element equine whole-genome oligoarray and by RNA-seq. Microarray analysis revealed 6,761 transcripts in the sperm, of which 165 were sperm-enriched, and 155 were differentially expressed between the sperm and testes. Next, 70 million raw reads were generated by RNA-seq of which 50% could be aligned with the horse reference genome. A total of 19,257 sequence tags were mapped to all horse chromosomes and the mitochondrial genome. The highest density of mapped transcripts was in gene-rich ECA11, 12 and 13, and the lowest in gene-poor ECA9 and X; 7 gene transcripts originated from ECAY. Structural annotation aligned sperm transcripts with 4,504 known horse and/or human genes, rRNAs and 82 miRNAs, whereas 13,354 sequence tags remained anonymous. The data were aligned with selected equine gene models to identify additional exons and splice variants. Gene Ontology annotations showed that sperm transcripts were associated with molecular processes (chemoattractant-activated signal transduction, ion transport) and cellular components (membranes and vesicles) related to known sperm functions at fertilization, while some messenger and micro RNAs might be critical for early development. The findings suggest that the rich repertoire of coding and non-coding RNAs in stallion sperm is not a random remnant from spermatogenesis in testes but a selectively retained and functionally coherent collection of RNAs. PMID:23409192
A SNP resource for Douglas-fir: de novo transcriptome assembly and SNP detection and validation
2013-01-01
Background Douglas-fir (Pseudotsuga menziesii), one of the most economically and ecologically important tree species in the world, also has one of the largest tree breeding programs. Although the coastal and interior varieties of Douglas-fir (vars. menziesii and glauca) are native to North America, the coastal variety is also widely planted for timber production in Europe, New Zealand, Australia, and Chile. Our main goal was to develop a SNP resource large enough to facilitate genomic selection in Douglas-fir breeding programs. To accomplish this, we developed a 454-based reference transcriptome for coastal Douglas-fir, annotated and evaluated the quality of the reference, identified putative SNPs, and then validated a sample of those SNPs using the Illumina Infinium genotyping platform. Results We assembled a reference transcriptome consisting of 25,002 isogroups (unique gene models) and 102,623 singletons from 2.76 million 454 and Sanger cDNA sequences from coastal Douglas-fir. We identified 278,979 unique SNPs by mapping the 454 and Sanger sequences to the reference, and by mapping four datasets of Illumina cDNA sequences from multiple seed sources, genotypes, and tissues. The Illumina datasets represented coastal Douglas-fir (64.00 and 13.41 million reads), interior Douglas-fir (80.45 million reads), and a Yakima population similar to interior Douglas-fir (8.99 million reads). We assayed 8067 SNPs on 260 trees using an Illumina Infinium SNP genotyping array. Of these SNPs, 5847 (72.5%) were called successfully and were polymorphic. Conclusions Based on our validation efficiency, our SNP database may contain as many as ~200,000 true SNPs, and as many as ~69,000 SNPs that could be genotyped at ~20,000 gene loci using an Infinium II array—more SNPs than are needed to use genomic selection in tree breeding programs. Ultimately, these genomic resources will enhance Douglas-fir breeding and allow us to better understand landscape-scale patterns of genetic variation and potential responses to climate change. PMID:23445355
USDA-ARS?s Scientific Manuscript database
Ultra high resolution digital aerial photography has great potential to complement or replace ground measurements of vegetation cover for rangeland monitoring and assessment. We investigated object-based image analysis (OBIA) techniques for classifying vegetation in southwestern U.S. arid rangelands...
False-color infrared aerial photography of the Yaquina Bay Estuary, Oregon was acquired at extreme low tides and digitally orthorectified with a ground pixel resolution of 20 cm to provide data for intertidal vegetation mapping. Submerged, semi-exposed and exposed eelgrass mead...
WebGIVI: a web-based gene enrichment analysis and visualization tool.
Sun, Liang; Zhu, Yongnan; Mahmood, A S M Ashique; Tudor, Catalina O; Ren, Jia; Vijay-Shanker, K; Chen, Jian; Schmidt, Carl J
2017-05-04
A major challenge of high throughput transcriptome studies is presenting the data to researchers in an interpretable format. In many cases, the outputs of such studies are gene lists which are then examined for enriched biological concepts. One approach to help the researcher interpret large gene datasets is to associate genes and informative terms (iTerm) that are obtained from the biomedical literature using the eGIFT text-mining system. However, examining large lists of iTerm and gene pairs is a daunting task. We have developed WebGIVI, an interactive web-based visualization tool ( http://raven.anr.udel.edu/webgivi/ ) to explore gene:iTerm pairs. WebGIVI was built via Cytoscape and Data Driven Document JavaScript libraries and can be used to relate genes to iTerms and then visualize gene and iTerm pairs. WebGIVI can accept a gene list that is used to retrieve the gene symbols and corresponding iTerm list. This list can be submitted to visualize the gene iTerm pairs using two distinct methods: a Concept Map or a Cytoscape Network Map. In addition, WebGIVI also supports uploading and visualization of any two-column tab separated data. WebGIVI provides an interactive and integrated network graph of gene and iTerms that allows filtering, sorting, and grouping, which can aid biologists in developing hypothesis based on the input gene lists. In addition, WebGIVI can visualize hundreds of nodes and generate a high-resolution image that is important for most of research publications. The source code can be freely downloaded at https://github.com/sunliang3361/WebGIVI . The WebGIVI tutorial is available at http://raven.anr.udel.edu/webgivi/tutorial.php .
High-Resolution Global Geologic Map of Ceres from NASA Dawn Mission
NASA Astrophysics Data System (ADS)
Williams, D. A.; Buczkowski, D. L.; Crown, D. A.; Frigeri, A.; Hughson, K.; Kneissl, T.; Krohn, K.; Mest, S. C.; Pasckert, J. H.; Platz, T.; Ruesch, O.; Schulzeck, F.; Scully, J. E. C.; Sizemore, H. G.; Nass, A.; Jaumann, R.; Raymond, C. A.; Russell, C. T.
2018-06-01
This presentation will discuss the completed 1:4,000,000 global geologic map of dwarf planet Ceres derived from Dawn Framing Camera Low Altitude Mapping Orbit (LAMo) images, combining 15 quadrangle maps.
Wang, Guizhou; Liu, Jianbo; He, Guojin
2013-01-01
This paper presents a new classification method for high-spatial-resolution remote sensing images based on a strategic mechanism of spatial mapping and reclassification. The proposed method includes four steps. First, the multispectral image is classified by a traditional pixel-based classification method (support vector machine). Second, the panchromatic image is subdivided by watershed segmentation. Third, the pixel-based multispectral image classification result is mapped to the panchromatic segmentation result based on a spatial mapping mechanism and the area dominant principle. During the mapping process, an area proportion threshold is set, and the regional property is defined as unclassified if the maximum area proportion does not surpass the threshold. Finally, unclassified regions are reclassified based on spectral information using the minimum distance to mean algorithm. Experimental results show that the classification method for high-spatial-resolution remote sensing images based on the spatial mapping mechanism and reclassification strategy can make use of both panchromatic and multispectral information, integrate the pixel- and object-based classification methods, and improve classification accuracy. PMID:24453808
MBARI Mapping AUV: A High-Resolution Deep Ocean Seafloor Mapping Capability
NASA Astrophysics Data System (ADS)
Caress, D. W.; Kirkwood, W. J.; Thomas, H.; McEwen, R.; Henthorn, R.; McGill, P.; Thompson, D.; Sibenac, M.; Jensen, S.; Shane, F.; Hamilton, A.
2005-05-01
The Monterey Bay Aquarium Research Institute (MBARI) is developing an autonomous seafloor mapping capability for deep ocean science applications. The MBARI Mapping AUV is a 0.53 m (21 in) diameter, 5.1 m (16.7 ft) long, Dorado-class vehicle designed to carry four mapping sonars. The primary sensor is a 200 kHz multibeam sonar producing swath bathymetry and sidescan. In addition, the vehicle carries 100 kHz and 410 kHz chirp sidescan sonars, and a 2-16 kHz sweep chirp subbottom profiler. Navigation and attitude data are obtained from an inertial navigation system (INS) incorporating a ring laser gyro and a 300 kHz Doppler velocity log (DVL). The vehicle also includes acoustic modem, ultra-short baseline navigation, and long-baseline navigation systems. The Mapping AUV is powered by 6 kWhr of Li-polymer batteries, providing expected mission duration of 12 hours at a typical speed of 1.5 m/s. All components of the vehicle are rated to 6000 m depth, allowing MBARI to conduct high-resolution mapping of the deep-ocean seafloor. The sonar package is also be mountable on ROV Ventana, allowing surveys at altitudes less than 20 m at topographically challenging sites. The vehicle was assembled and extensively tested during 2004; this year we are commencing operations for MBARI science projects while continuing the process of testing and integrating the complete suite of sensors and systems. MBARI is beginning to use this capability to observe the changing morphology of dynamic systems such as submarine canyons and active slumps, to map deep-water benthic habitats at resolutions comparable to ROV and submersible observations, to provide basemaps for ROV dives, and to provide high resolution bathymetry and subbottom profiles as part of a variety of projects requiring knowledge of the seafloor. We will present initial results from surveys in and around Monterey Canyon, including high resolution repeat surveys of four sites along the canyon axis.
Dong, Jinwei; Xiao, Xiangming; Sheldon, Sage; Biradar, Chandrashekhar; Zhang, Geli; Duong, Nguyen Dinh; Hazarika, Manzul; Wikantika, Ketut; Takeuhci, Wataru; Moore, Berrien
2014-01-01
Southeast Asia experienced higher rates of deforestation than other continents in the 1990s and still was a hotspot of forest change in the 2000s. Biodiversity conservation planning and accurate estimation of forest carbon fluxes and pools need more accurate information about forest area, spatial distribution and fragmentation. However, the recent forest maps of Southeast Asia were generated from optical images at spatial resolutions of several hundreds of meters, and they do not capture well the exceptionally complex and dynamic environments in Southeast Asia. The forest area estimates from those maps vary substantially, ranging from 1.73×10(6) km(2) (GlobCover) to 2.69×10(6) km(2) (MCD12Q1) in 2009; and their uncertainty is constrained by frequent cloud cover and coarse spatial resolution. Recently, cloud-free imagery from the Phased Array Type L-band Synthetic Aperture Radar (PALSAR) onboard the Advanced Land Observing Satellite (ALOS) became available. We used the PALSAR 50-m orthorectified mosaic imagery in 2009 to generate a forest cover map of Southeast Asia at 50-m spatial resolution. The validation, using ground-reference data collected from the Geo-Referenced Field Photo Library and high-resolution images in Google Earth, showed that our forest map has a reasonably high accuracy (producer's accuracy 86% and user's accuracy 93%). The PALSAR-based forest area estimates in 2009 are significantly correlated with those from GlobCover and MCD12Q1 at national and subnational scales but differ in some regions at the pixel scale due to different spatial resolutions, forest definitions, and algorithms. The resultant 50-m forest map was used to quantify forest fragmentation and it revealed substantial details of forest fragmentation. This new 50-m map of tropical forests could serve as a baseline map for forest resource inventory, deforestation monitoring, reducing emissions from deforestation and forest degradation (REDD+) implementation, and biodiversity.
Dong, Jinwei; Xiao, Xiangming; Sheldon, Sage; Biradar, Chandrashekhar; Zhang, Geli; Dinh Duong, Nguyen; Hazarika, Manzul; Wikantika, Ketut; Takeuhci, Wataru; Moore, Berrien
2014-01-01
Southeast Asia experienced higher rates of deforestation than other continents in the 1990s and still was a hotspot of forest change in the 2000s. Biodiversity conservation planning and accurate estimation of forest carbon fluxes and pools need more accurate information about forest area, spatial distribution and fragmentation. However, the recent forest maps of Southeast Asia were generated from optical images at spatial resolutions of several hundreds of meters, and they do not capture well the exceptionally complex and dynamic environments in Southeast Asia. The forest area estimates from those maps vary substantially, ranging from 1.73×106 km2 (GlobCover) to 2.69×106 km2 (MCD12Q1) in 2009; and their uncertainty is constrained by frequent cloud cover and coarse spatial resolution. Recently, cloud-free imagery from the Phased Array Type L-band Synthetic Aperture Radar (PALSAR) onboard the Advanced Land Observing Satellite (ALOS) became available. We used the PALSAR 50-m orthorectified mosaic imagery in 2009 to generate a forest cover map of Southeast Asia at 50-m spatial resolution. The validation, using ground-reference data collected from the Geo-Referenced Field Photo Library and high-resolution images in Google Earth, showed that our forest map has a reasonably high accuracy (producer's accuracy 86% and user's accuracy 93%). The PALSAR-based forest area estimates in 2009 are significantly correlated with those from GlobCover and MCD12Q1 at national and subnational scales but differ in some regions at the pixel scale due to different spatial resolutions, forest definitions, and algorithms. The resultant 50-m forest map was used to quantify forest fragmentation and it revealed substantial details of forest fragmentation. This new 50-m map of tropical forests could serve as a baseline map for forest resource inventory, deforestation monitoring, reducing emissions from deforestation and forest degradation (REDD+) implementation, and biodiversity. PMID:24465714
Udhane, Sameer S; Legeza, Balazs; Marti, Nesa; Hertig, Damian; Diserens, Gaëlle; Nuoffer, Jean-Marc; Vermathen, Peter; Flück, Christa E
2017-08-17
Metformin is an antidiabetic drug, which inhibits mitochondrial respiratory-chain-complex I and thereby seems to affect the cellular metabolism in many ways. It is also used for the treatment of the polycystic ovary syndrome (PCOS), the most common endocrine disorder in women. In addition, metformin possesses antineoplastic properties. Although metformin promotes insulin-sensitivity and ameliorates reproductive abnormalities in PCOS, its exact mechanisms of action remain elusive. Therefore, we studied the transcriptome and the metabolome of metformin in human adrenal H295R cells. Microarray analysis revealed changes in 693 genes after metformin treatment. Using high resolution magic angle spinning nuclear magnetic resonance spectroscopy (HR-MAS-NMR), we determined 38 intracellular metabolites. With bioinformatic tools we created an integrated pathway analysis to understand different intracellular processes targeted by metformin. Combined metabolomics and transcriptomics data analysis showed that metformin affects a broad range of cellular processes centered on the mitochondrium. Data confirmed several known effects of metformin on glucose and androgen metabolism, which had been identified in clinical and basic studies previously. But more importantly, novel links between the energy metabolism, sex steroid biosynthesis, the cell cycle and the immune system were identified. These omics studies shed light on a complex interplay between metabolic pathways in steroidogenic systems.
Time of day determines Arabidopsis transcriptome and growth dynamics under mild drought.
Dubois, Marieke; Claeys, Hannes; Van den Broeck, Lisa; Inzé, Dirk
2017-02-01
Drought stress is a major problem for agriculture worldwide, causing significant yield losses. Plants have developed highly flexible mechanisms to deal with drought, including organ- and developmental stage-specific responses. In young leaves, growth is repressed as an active mechanism to save water and energy, increasing the chances of survival but decreasing yield. Despite its importance, the molecular basis for this growth inhibition is largely unknown. Here, we present a novel approach to explore early molecular mechanisms controlling Arabidopsis leaf growth inhibition following mild drought. We found that growth and transcriptome responses to drought are highly dynamic. Growth was only repressed by drought during the day, and our evidence suggests that this may be due to gating by the circadian clock. Similarly, time of day strongly affected the extent, specificity, and in certain cases even direction of drought-induced changes in gene expression. These findings underscore the importance of taking into account diurnal patterns to understand stress responses, as only a small core of drought-responsive genes are affected by drought at all times of the day. Finally, we leveraged our high-resolution data to demonstrate that phenotypic and transcriptome responses can be matched to identify putative novel regulators of growth under mild drought. © 2016 The Authors Plant, Cell & Environment Published by John Wiley & Sons Ltd.
Zeng, Liping; Zhang, Ning; Zhang, Qiang; Endress, Peter K; Huang, Jie; Ma, Hong
2017-05-01
Explosive diversification is widespread in eukaryotes, making it difficult to resolve phylogenetic relationships. Eudicots contain c. 75% of extant flowering plants, are important for human livelihood and terrestrial ecosystems, and have probably experienced explosive diversifications. The eudicot phylogenetic relationships, especially among those of the Pentapetalae, remain unresolved. Here, we present a highly supported eudicot phylogeny and diversification rate shifts using 31 newly generated transcriptomes and 88 other datasets covering 70% of eudicot orders. A highly supported eudicot phylogeny divided Pentapetalae into two groups: one with rosids, Saxifragales, Vitales and Santalales; the other containing asterids, Caryophyllales and Dilleniaceae, with uncertainty for Berberidopsidales. Molecular clock analysis estimated that crown eudicots originated c. 146 Ma, considerably earlier than earliest tricolpate pollen fossils and most other molecular clock estimates, and Pentapetalae sequentially diverged into eight major lineages within c. 15 Myr. Two identified increases of diversification rate are located in the stems leading to Pentapetalae and asterids, and lagged behind the gamma hexaploidization. The nuclear genes from newly generated transcriptomes revealed a well-resolved eudicot phylogeny, sequential separation of major core eudicot lineages and temporal mode of diversifications, providing new insights into the evolutionary trend of morphologies and contributions to the diversification of eudicots. © 2017 The Authors. New Phytologist © 2017 New Phytologist Trust.
A Reassessment of the Mars Ocean Hypothesis
NASA Technical Reports Server (NTRS)
Parker, T. J.
2004-01-01
Initial work on the identification and mapping of potential ancient shorelines on Mars was based on Viking Orbiter image data (Parker et al., 1987, 1989, 1993). The Viking Orbiters were designed to locate landing site for the two landers and were not specifically intended to map the entire planet. Fortunately, they mapped the entire planet. Unfortunately, they did so at an average resolution of greater than 200m/pixel. Higher resolution images, even mosaics of interesting regions, are available, but relatively sparse. Mapping of shorelines on Earth requires both high-resolution aerial photos or satellite images and good topographic information. Three significant sources of additional data from missions subsequent to Viking are useful for reassessing the ocean hypothesis. These are: MGS MOC images; MGS MOLA topography; Odyssey THEMIS IR and VIS images; and MER surface geology at Meridiani and Gusev. Okay, my mistake: Four.
Aircraft Detection in High-Resolution SAR Images Based on a Gradient Textural Saliency Map
Tan, Yihua; Li, Qingyun; Li, Yansheng; Tian, Jinwen
2015-01-01
This paper proposes a new automatic and adaptive aircraft target detection algorithm in high-resolution synthetic aperture radar (SAR) images of airport. The proposed method is based on gradient textural saliency map under the contextual cues of apron area. Firstly, the candidate regions with the possible existence of airport are detected from the apron area. Secondly, directional local gradient distribution detector is used to obtain a gradient textural saliency map in the favor of the candidate regions. In addition, the final targets will be detected by segmenting the saliency map using CFAR-type algorithm. The real high-resolution airborne SAR image data is used to verify the proposed algorithm. The results demonstrate that this algorithm can detect aircraft targets quickly and accurately, and decrease the false alarm rate. PMID:26378543
Skakun, Sergii; Vermote, Eric; Roger, Jean-Claude; Franch, Belen
2018-01-01
Timely and accurate information on crop yield is critical to many applications within agriculture monitoring. Thanks to its coverage and temporal resolution, coarse spatial resolution satellite imagery has always been a source of valuable information for yield forecasting and assessment at national and regional scales. With availability of free images acquired by Landsat-8 and Sentinel-2 remote sensing satellites, it becomes possible to enable temporal resolution of an image every 3–5 days, and therefore, to develop next generation agriculture products at higher spatial resolution (30 m). This paper explores the combined use of Landsat-8 and Sentinel-2A for winter crop mapping and winter wheat assessment at regional scale. For the former, we adapt a previously developed approach for Moderate Resolution Imaging Spectroradiometer (MODIS) at 250 m resolution that allows automatic mapping of winter crops taking into account knowledge on crop calendar and without ground truth data. For the latter, we use a generalized winter wheat yield model that is based on NDVI-peak estimation and MODIS data, and further downscaled to be applicable at 30 m resolution. We show that integration of Landsat-8 and Sentinel-2A has a positive impact both for winter crop mapping and winter wheat yield assessment. In particular, the error of winter wheat yield estimates can be reduced up to 1.8 times comparing to the single satellite usage. PMID:29888751
Skakun, Sergii; Vermote, Eric; Roger, Jean-Claude; Franch, Belen
2017-01-01
Timely and accurate information on crop yield is critical to many applications within agriculture monitoring. Thanks to its coverage and temporal resolution, coarse spatial resolution satellite imagery has always been a source of valuable information for yield forecasting and assessment at national and regional scales. With availability of free images acquired by Landsat-8 and Sentinel-2 remote sensing satellites, it becomes possible to enable temporal resolution of an image every 3-5 days, and therefore, to develop next generation agriculture products at higher spatial resolution (30 m). This paper explores the combined use of Landsat-8 and Sentinel-2A for winter crop mapping and winter wheat assessment at regional scale. For the former, we adapt a previously developed approach for Moderate Resolution Imaging Spectroradiometer (MODIS) at 250 m resolution that allows automatic mapping of winter crops taking into account knowledge on crop calendar and without ground truth data. For the latter, we use a generalized winter wheat yield model that is based on NDVI-peak estimation and MODIS data, and further downscaled to be applicable at 30 m resolution. We show that integration of Landsat-8 and Sentinel-2A has a positive impact both for winter crop mapping and winter wheat yield assessment. In particular, the error of winter wheat yield estimates can be reduced up to 1.8 times comparing to the single satellite usage.
NASA Technical Reports Server (NTRS)
Skakun, Sergii; Vermote, Eric; Roger, Jean-Claude; Franch, Belen
2017-01-01
Timely and accurate information on crop yield and production is critical to many applications within agriculture monitoring. Thanks to its coverage and temporal resolution, coarse spatial resolution satellite imagery has always been a source of valuable information for yield forecasting and assessment at national and regional scales. With availability of free images acquired by Landsat-8 and Sentinel-2 remote sensing satellites, it becomes possible to provide temporal resolution of an image every 3-5 days, and therefore, to develop next generation agriculture products at higher spatial resolution (10-30 m). This paper explores the combined use of Landsat-8 and Sentinel-2A for winter crop mapping and winter wheat yield assessment at regional scale. For the former, we adapt a previously developed approach for the Moderate Resolution Imaging Spectroradiometer (MODIS) instrument at 250 m resolution that allows automatic mapping of winter crops taking into account a priori knowledge on crop calendar. For the latter, we use a generalized winter wheat yield forecasting model that is based on estimation of the peak Normalized Difference Vegetation Index (NDVI) from MODIS image time-series, and further downscaled to be applicable at 30 m resolution. We show that integration of Landsat-8 and Sentinel-2A improves both winter crop mapping and winter wheat yield assessment. In particular, the error of winter wheat yield estimates can be reduced up to 1.8 times compared to using a single satellite.
Structure-aware depth super-resolution using Gaussian mixture model
NASA Astrophysics Data System (ADS)
Kim, Sunok; Oh, Changjae; Kim, Youngjung; Sohn, Kwanghoon
2015-03-01
This paper presents a probabilistic optimization approach to enhance the resolution of a depth map. Conventionally, a high-resolution color image is considered as a cue for depth super-resolution under the assumption that the pixels with similar color likely belong to similar depth. This assumption might induce a texture transferring from the color image into the depth map and an edge blurring artifact to the depth boundaries. In order to alleviate these problems, we propose an efficient depth prior exploiting a Gaussian mixture model in which an estimated depth map is considered to a feature for computing affinity between two pixels. Furthermore, a fixed-point iteration scheme is adopted to address the non-linearity of a constraint derived from the proposed prior. The experimental results show that the proposed method outperforms state-of-the-art methods both quantitatively and qualitatively.
Sánchez-Sevilla, José F; Vallarino, José G; Osorio, Sonia; Bombarely, Aureliano; Posé, David; Merchante, Catharina; Botella, Miguel A; Amaya, Iraida; Valpuesta, Victoriano
2017-10-23
RNA-seq has been used to perform global expression analysis of the achene and the receptacle at four stages of fruit ripening, and of the roots and leaves of strawberry (Fragaria × ananassa). About 967 million reads and 191 Gb of sequence were produced, using Illumina sequencing. Mapping the reads in the related genome of the wild diploid Fragaria vesca revealed differences between the achene and receptacle development program, and reinforced the role played by ethylene in the ripening receptacle. For the strawberry transcriptome assembly, a de novo strategy was followed, generating separate assemblies for each of the ten tissues and stages sampled. The Trinity program was used for these assemblies, resulting in over 1.4 M isoforms. Filtering by a threshold of 0.3 FPKM, and doing Blastx (E-value < 1 e-30) against the UniProt database of plants reduced the number to 472,476 isoforms. Their assembly with the MIRA program (90% homology) resulted in 26,087 contigs. From these, 91.34 percent showed high homology to Fragaria vesca genes and 87.30 percent Fragaria iinumae (BlastN E-value < 1 e-100). Mapping back the reads on the MIRA contigs identified polymorphisms at nucleotide level, using FREEBAYES, as well as estimate their relative abundance in each sample.
Krishna, Srikar; Nair, Aparna; Cheedipudi, Sirisha; Poduval, Deepak; Dhawan, Jyotsna; Palakodeti, Dasaradhi; Ghanekar, Yashoda
2013-01-07
Small non-coding RNAs such as miRNAs, piRNAs and endo-siRNAs fine-tune gene expression through post-transcriptional regulation, modulating important processes in development, differentiation, homeostasis and regeneration. Using deep sequencing, we have profiled small non-coding RNAs in Hydra magnipapillata and investigated changes in small RNA expression pattern during head regeneration. Our results reveal a unique repertoire of small RNAs in hydra. We have identified 126 miRNA loci; 123 of these miRNAs are unique to hydra. Less than 50% are conserved across two different strains of Hydra vulgaris tested in this study, indicating a highly diverse nature of hydra miRNAs in contrast to bilaterian miRNAs. We also identified siRNAs derived from precursors with perfect stem-loop structure and that arise from inverted repeats. piRNAs were the most abundant small RNAs in hydra, mapping to transposable elements, the annotated transcriptome and unique non-coding regions on the genome. piRNAs that map to transposable elements and the annotated transcriptome display a ping-pong signature. Further, we have identified several miRNAs and piRNAs whose expression is regulated during hydra head regeneration. Our study defines different classes of small RNAs in this cnidarian model system, which may play a role in orchestrating gene expression essential for hydra regeneration.
Krishna, Srikar; Nair, Aparna; Cheedipudi, Sirisha; Poduval, Deepak; Dhawan, Jyotsna; Palakodeti, Dasaradhi; Ghanekar, Yashoda
2013-01-01
Small non-coding RNAs such as miRNAs, piRNAs and endo-siRNAs fine-tune gene expression through post-transcriptional regulation, modulating important processes in development, differentiation, homeostasis and regeneration. Using deep sequencing, we have profiled small non-coding RNAs in Hydra magnipapillata and investigated changes in small RNA expression pattern during head regeneration. Our results reveal a unique repertoire of small RNAs in hydra. We have identified 126 miRNA loci; 123 of these miRNAs are unique to hydra. Less than 50% are conserved across two different strains of Hydra vulgaris tested in this study, indicating a highly diverse nature of hydra miRNAs in contrast to bilaterian miRNAs. We also identified siRNAs derived from precursors with perfect stem–loop structure and that arise from inverted repeats. piRNAs were the most abundant small RNAs in hydra, mapping to transposable elements, the annotated transcriptome and unique non-coding regions on the genome. piRNAs that map to transposable elements and the annotated transcriptome display a ping–pong signature. Further, we have identified several miRNAs and piRNAs whose expression is regulated during hydra head regeneration. Our study defines different classes of small RNAs in this cnidarian model system, which may play a role in orchestrating gene expression essential for hydra regeneration. PMID:23166307
An atomic model of brome mosaic virus using direct electron detection and real-space optimization.
Wang, Zhao; Hryc, Corey F; Bammes, Benjamin; Afonine, Pavel V; Jakana, Joanita; Chen, Dong-Hua; Liu, Xiangan; Baker, Matthew L; Kao, Cheng; Ludtke, Steven J; Schmid, Michael F; Adams, Paul D; Chiu, Wah
2014-09-04
Advances in electron cryo-microscopy have enabled structure determination of macromolecules at near-atomic resolution. However, structure determination, even using de novo methods, remains susceptible to model bias and overfitting. Here we describe a complete workflow for data acquisition, image processing, all-atom modelling and validation of brome mosaic virus, an RNA virus. Data were collected with a direct electron detector in integrating mode and an exposure beyond the traditional radiation damage limit. The final density map has a resolution of 3.8 Å as assessed by two independent data sets and maps. We used the map to derive an all-atom model with a newly implemented real-space optimization protocol. The validity of the model was verified by its match with the density map and a previous model from X-ray crystallography, as well as the internal consistency of models from independent maps. This study demonstrates a practical approach to obtain a rigorously validated atomic resolution electron cryo-microscopy structure.
Kuenstner, J Todd; Chamberlin, William; Naser, Saleh A; Collins, Michael T; Dow, Coad Thomas; Aitken, John M; Weg, Stuart; Telega, Grzegorz; John, Kuruvilla; Haas, David; Eckstein, Torsten M; Kali, Maher; Welch, Christine; Petrie, Thomas
2015-01-01
A cohort of family members with various chronic diseases including Crohn’s disease, asthma, complex regional pain syndrome, hypothyroidism, type 1 diabetes mellitus, and lymphangiomatosis and/or evidence of infection by Mycobacterium avium subsp. paratuberculosis (MAP) are described in this series of case reports. MAP was cultured from the blood of three members affected by the first five diseases and there was accompanying elevated anti-MAP IgG in two members. The patient affected by the sixth disease has a markedly elevated anti-MAP titer. The two patients affected by the first four diseases have been treated with a combination of anti-MAP antibiotics and ultraviolet blood irradiation therapy with resolution of the disease symptomatology and inability to culture MAP in post treatment blood samples. These case reports of patients with MAP infections provide supportive evidence of a pathogenic role of MAP in humans. PMID:25852293
Comparison of an Atomic Model and Its Cryo-EM Image at the Central Axis of a Helix
He, Jing; Zeil, Stephanie; Hallak, Hussam; McKaig, Kele; Kovacs, Julio; Wriggers, Willy
2016-01-01
Cryo-electron microscopy (cryo-EM) is an important biophysical technique that produces three-dimensional (3D) density maps at different resolutions. Because more and more models are being produced from cryo-EM density maps, validation of the models is becoming important. We propose a method for measuring local agreement between a model and the density map using the central axis of the helix. This method was tested using 19 helices from cryo-EM density maps between 5.5 Å and 7.2 Å resolution and 94 helices from simulated density maps. This method distinguished most of the well-fitting helices, although challenges exist for shorter helices. PMID:27280059
California State Waters Map Series-Offshore of Point Reyes, California
Watt, Janet T.; Dartnell, Peter; Golden, Nadine E.; Greene, H. Gary; Erdey, Mercedes D.; Cochrane, Guy R.; Johnson, Samuel Y.; Hartwell, Stephen R.; Kvitek, Rikk G.; Manson, Michael W.; Endris, Charles A.; Dieter, Bryan E.; Sliter, Ray W.; Krigsman, Lisa M.; Lowe, Erik; Chinn, John L.; Watt, Janet T.; Cochran, Susan A.
2015-01-01
This publication about the Offshore of Point Reyes map area includes ten map sheets that contain explanatory text, in addition to this descriptive pamphlet and a data catalog of geographic information system (GIS) files. Sheets 1, 2, and 3 combine data from four different sonar surveys to generate comprehensive high-resolution bathymetry and acoustic-backscatter coverage of the map area. These data reveal a range of physiographic features (highlighted in the perspective views on sheet 4) such as the flat, sediment-covered seafloor in Drakes Bay, as well as abundant “scour depressions” on the Bodega Head–Tomales Point shelf (see sheet 9) and local, tectonically controlled bedrock uplifts. To validate geological and biological interpretations of the sonar data shown in sheets 1, 2, and 3, the U.S. Geological Survey towed a camera sled over specific offshore locations, collecting both video and photographic imagery; these “ground-truth” surveying data are summarized on sheet 6. Sheet 5 is a “seafloor character” map, which classifies the seafloor on the basis of depth, slope, rugosity (ruggedness), and backscatter intensity and which is further informed by the ground-truth-survey imagery. Sheet 7 is a map of “potential habitats,” which are delineated on the basis of substrate type, geomorphology, seafloor process, or other attributes that may provide a habitat for a specific species or assemblage of organisms. Sheet 8 compiles representative seismic-reflection profiles from the map area, providing information on the subsurface stratigraphy and structure of the map area. Sheet 9 shows the distribution and thickness of young sediment (deposited over the last about 21,000 years, during the most recent sea-level rise) in both the map area and the larger Salt Point to Drakes Bay region, interpreted on the basis of the seismic-reflection data, and it identifies the Offshore of Point Reyes map area as lying within the Bodega Head–Tomales Point shelf, Point Reyes bar, and Bolinas shelf domains. Sheet 10 is a geologic map that merges onshore geologic mapping (compiled from existing maps by the California Geological Survey) and new offshore geologic mapping that is based on integration of high-resolution bathymetry and backscatter imagery (sheets 1, 2, 3), seafloor-sediment and rock samples (Reid and others, 2006), digital camera and video imagery (sheet 6), and high-resolution seismic-reflection profiles (sheet 8), as well as aerial-photographic interpretation of nearshore areas. The information provided by the map sheets, pamphlet, and data catalog have a broad range of applications. High-resolution bathymetry, acoustic backscatter, ground-truth-surveying imagery, and habitat mapping all contribute to habitat characterization and ecosystem-based management by providing essential data for delineation of marine protected areas and ecosystem restoration. Many of the maps provide high-resolution baselines that will be critical for monitoring environmental change associated with climate change, coastal development, or other forcings. High-resolution bathymetry is a critical component for modeling coastal flooding caused by storms and tsunamis, as well as inundation associated with longer term sea-level rise. Seismic-reflection and bathymetric data help characterize earthquake and tsunami sources, critical for natural-hazard assessments of coastal zones. Information on sediment distribution and thickness is essential to the understanding of local and regional sediment transport, as well as the development of regional sediment-management plans. In addition, siting of any new offshore infrastructure (for example, pipelines, cables, or renewable-energy facilities) will depend on high-resolution mapping. Finally, this mapping will both stimulate and enable new scientific research and also raise public awareness of, and education about, coastal environments and issues.
NASA Technical Reports Server (NTRS)
Doggett, T.; Figueredo, P.; Greeley, R.; Hare, T.; Kolb, E.; Mullins, K.; Senske, D.; Tanaka, K.; Weiser, S.
2008-01-01
Europa, with its indications of a sub-ice ocean, is of keen interest to astrobiology and planetary geology. Knowledge of the global distribution and timing of Europan geologic units is a key step for the synthesis of data from the Galileo mission, and for the planning of future missions to the satellite. The first geologic map of Europa was produced at a hemisphere scale with low resolution Voyager data. Following the acquisition of higher resolution data by the Galileo mission, researchers have identified surface units and determined sequences of events in relatively small areas of Europa through geologic mapping using images at various resolutions acquired by Galileo's Solid State Imaging camera. These works provided a local to subregional perspective and employed different criteria for the determination and naming of units. Unified guidelines for the identification, mapping and naming of Europan geologic units were put forth by and employed in regional-to-hemispheric scale mapping which is now being expanded into a global geologic map. A global photomosaic of Galileo and Voyager data was used as a basemap for mapping in ArcGIS, following suggested methodology of all-stratigraphy for planetary mapping. The following units have been defined in global mapping and are listed in stratigraphic order from oldest to youngest: ridged plains material, Argadnel Regio unit, dark plains material, lineaments, disrupted plains material, lenticulated plains material and Chaos material.
Ecology and space: A case study in mapping harmful invasive species
David T. Barnett,; Jarnevich, Catherine S.; Chong, Geneva W.; Stohlgren, Thomas J.; Sunil Kumar,; Holcombe, Tracy R.; Brunn, Stanley D.; Dodge, Martin
2017-01-01
The establishment and invasion of non-native plant species have the ability to alter the composition of native species and functioning of ecological systems with financial costs resulting from mitigation and loss of ecological services. Spatially documenting invasions has applications for management and theory, but the utility of maps is challenged by availability and uncertainty of data, and the reliability of extrapolating mapped data in time and space. The extent and resolution of projections also impact the ability to inform invasive species science and management. Early invasive species maps were coarse-grained representations that underscored the phenomena, but had limited capacity to direct management aside from development of watch lists for priorities for prevention and containment. Integrating mapped data sets with fine-resolution environmental variables in the context of species-distribution models allows a description of species-environment relationships and an understanding of how, why, and where invasions may occur. As with maps, the extent and resolution of models impact the resulting insight. Models of cheatgrass (Bromus tectorum) across a variety of spatial scales and grain result in divergent species-environment relationships. New data can improve models and efficiently direct further inventories. Mapping can target areas of greater model uncertainty or the bounds of modeled distribution to efficiently refine models and maps. This iterative process results in dynamic, living maps capable of describing the ongoing process of species invasions.
Rapid mapping of hurricane damage to forests
Erik M. Nielsen
2009-01-01
The prospects for producing rapid, accurate delineations of the spatial extent of forest wind damage were evaluated using Hurricane Katrina as a test case. A damage map covering the full spatial extent of Katrina?s impact was produced from Moderate Resolution Imaging Spectroradiometer (MODIS) satellite imagery using higher resolution training data. Forest damage...
USDA-ARS?s Scientific Manuscript database
This study demonstrated a new method for mapping high-resolution (spatial: 1 m, and temporal: 1 h) soil moisture by assimilating distributed temperature sensing (DTS) observed soil temperatures at intermediate scales. In order to provide robust soil moisture and property estimates, we first proposed...
High-resolution carbon mapping on the million-hectare Island of Hawaii
Gregory P. Asner; R. Flint Hughes; Joseph Mascaro; Amanda L. Uowolo; David E. Knapp; James Jacobson; Ty Kennedy-Bowdoin; John K . Clark
2011-01-01
Current markets and international agreements for reducing emissions from deforestation and forest degradation (REDD) rely on carbon (C) monitoring techniques. Combining field measurements, airborne light detection and ranging (LiDAR)-based observations, and satellite-based imagery, we developed a 30-meter-resolution map of aboveground C density spanning 40 vegetation...
A Global Map of Thermal Inertia from Mars Global Surveyor Mapping-Mission Data
NASA Technical Reports Server (NTRS)
Mellon, M. T.; Kretke, K. A.; Smith, M. D.; Pelkey, S. M.
2002-01-01
TES (thermal emission spectrometry) has obtained high spatial resolution surface temperature observations from which thermal inertia has been derived. Seasonal coverage of these data now provides a nearly global view of Mars, including the polar regions, at high resolution. Additional information is contained in the original extended abstract.
NASA Astrophysics Data System (ADS)
Nazari, B.; Seo, D.; Cannon, A.
2013-12-01
With many diverse features such as channels, pipes, culverts, buildings, etc., hydraulic modeling in urban areas for inundation mapping poses significant challenges. Identifying the practical extent of the details to be modeled in order to obtain sufficiently accurate results in a timely manner for effective emergency management is one of them. In this study we assess the tradeoffs between model complexity vs. information content for decision making in applying high-resolution hydrologic and hydraulic models for real-time flash flood forecasting and inundation mapping in urban areas. In a large urban area such as the Dallas-Fort Worth Metroplex (DFW), there exists very large spatial variability in imperviousness depending on the area of interest. As such, one may expect significant sensitivity of hydraulic model results to the resolution and accuracy of hydrologic models. In this work, we present the initial results from coupling of high-resolution hydrologic and hydraulic models for two 'hot spots' within the City of Fort Worth for real-time inundation mapping.
Molecular dynamics-based refinement and validation for sub-5 Å cryo-electron microscopy maps
Singharoy, Abhishek; Teo, Ivan; McGreevy, Ryan; Stone, John E; Zhao, Jianhua; Schulten, Klaus
2016-01-01
Two structure determination methods, based on the molecular dynamics flexible fitting (MDFF) paradigm, are presented that resolve sub-5 Å cryo-electron microscopy (EM) maps with either single structures or ensembles of such structures. The methods, denoted cascade MDFF and resolution exchange MDFF, sequentially re-refine a search model against a series of maps of progressively higher resolutions, which ends with the original experimental resolution. Application of sequential re-refinement enables MDFF to achieve a radius of convergence of ~25 Å demonstrated with the accurate modeling of β-galactosidase and TRPV1 proteins at 3.2 Å and 3.4 Å resolution, respectively. The MDFF refinements uniquely offer map-model validation and B-factor determination criteria based on the inherent dynamics of the macromolecules studied, captured by means of local root mean square fluctuations. The MDFF tools described are available to researchers through an easy-to-use and cost-effective cloud computing resource on Amazon Web Services. DOI: http://dx.doi.org/10.7554/eLife.16105.001 PMID:27383269
A Basketball Court-Size Global Map of Mars for Education and Public Outreach
NASA Astrophysics Data System (ADS)
Hill, J. R.; Christensen, P. R.
2017-12-01
The Thermal Emission Imaging System (THEMIS) onboard the 2001 Mars Odyssey spacecraft has acquired over 220,000 infrared images of the Martian surface at a resolution of 100 m/pixel since the start of science operations in February 2002. A global map was previously developed by mosaicking together over 24,000 high-quality full-resolution THEMIS daytime infrared images. Although the resulting map has been extremely valuable for scientific and mission operations applications, it has been difficult to communicate this value to students, citizen scientists and the general public, since their interactions with the map have been limited to computer-based geographic information system (GIS) interfaces. We determined that, in order to better communicate the value and importance of mapping the entire Martian surface at this resolution, people need to be able to physically interact with the map and experience its full scale. Therefore, the THEMIS Day IR Global Mosaic with Colorized MOLA Elevation will be printed on a 45ft x 90ft vinyl mat, which will allow observers to walk across and physically experience the map at approximately full resolution (printed at 200 pixels per inch). The size of the map was chosen to fit on a standard high school basketball court, so that a large number of schools will have a sufficiently large indoor surface on which to display the map for education events. The vinyl material and printing process selected for the map have been proven to be wear-resistant in similar applications, as long as everyone who walks on the map wears socks or similarly soft foot coverings. In order to make transportation easier, the map will be printed in two 45ft x 45ft sections, which will be joined together at events to create the full 45ft x 90ft map. The final stages of the map production will take place in early fall 2017, followed by initial education events at Arizona State University and local schools to test the educational activities associated with the map. This project was partially inspired by the National Geographic Society's Giant Traveling Maps Program, was completed with the assistance of the Arizona Geographic Alliance, and was largely funded through the Arizona State University School of Earth and Space Exploration (SESE) Summer Exploration Graduate Fellowship program.
Assumption-versus data-based approaches to summarizing species' ranges.
Peterson, A Townsend; Navarro-Sigüenza, Adolfo G; Gordillo, Alejandro
2018-06-01
For conservation decision making, species' geographic distributions are mapped using various approaches. Some such efforts have downscaled versions of coarse-resolution extent-of-occurrence maps to fine resolutions for conservation planning. We examined the quality of the extent-of-occurrence maps as range summaries and the utility of refining those maps into fine-resolution distributional hypotheses. Extent-of-occurrence maps tend to be overly simple, omit many known and well-documented populations, and likely frequently include many areas not holding populations. Refinement steps involve typological assumptions about habitat preferences and elevational ranges of species, which can introduce substantial error in estimates of species' true areas of distribution. However, no model-evaluation steps are taken to assess the predictive ability of these models, so model inaccuracies are not noticed. Whereas range summaries derived by these methods may be useful in coarse-grained, global-extent studies, their continued use in on-the-ground conservation applications at fine spatial resolutions is not advisable in light of reliance on assumptions, lack of real spatial resolution, and lack of testing. In contrast, data-driven techniques that integrate primary data on biodiversity occurrence with remotely sensed data that summarize environmental dimensions (i.e., ecological niche modeling or species distribution modeling) offer data-driven solutions based on a minimum of assumptions that can be evaluated and validated quantitatively to offer a well-founded, widely accepted method for summarizing species' distributional patterns for conservation applications. © 2016 Society for Conservation Biology.
Reporting of quantitative oxygen mapping in EPR imaging
NASA Astrophysics Data System (ADS)
Subramanian, Sankaran; Devasahayam, Nallathamby; McMillan, Alan; Matsumoto, Shingo; Munasinghe, Jeeva P.; Saito, Keita; Mitchell, James B.; Chandramouli, Gadisetti V. R.; Krishna, Murali C.
2012-01-01
Oxygen maps derived from electron paramagnetic resonance spectral-spatial imaging (EPRI) are based upon the relaxivity of molecular oxygen with paramagnetic spin probes. This technique can be combined with MRI to facilitate mapping of pO 2 values in specific anatomic locations with high precision. The co-registration procedure, which matches the physical and digital dimensions of EPR and MR images, may present the pO 2 map at the higher MRI resolution, exaggerating the spatial resolution of oxygen, making it difficult to precisely distinguish hypoxic regions from normoxic regions. The latter distinction is critical in monitoring the treatment of cancer by radiation and chemotherapy, since it is well-established that hypoxic regions are three or four times more resistant to treatment compared to normoxic regions. The aim of this article is to describe pO 2 maps based on the intrinsic resolution of EPRI. A spectral parameter that affects the intrinsic spatial resolution of EPRI is the full width at half maximum (FWHM) height of the gradient-free EPR absorption line in frequency-encoded imaging. In single point imaging too, the transverse relaxation times (T2∗) limit the resolution since the signal decays by exp(-tp/T2∗) where the delay time after excitation pulse, t p, is related to the resolution. Although the spin densities of two point objects may be resolved at this separation, it is inadequate to evaluate quantitative changes of pO 2 levels since the linewidths are proportionately affected by pO 2. A spatial separation of at least twice this resolution is necessary to correctly identify a change in pO 2 level. In addition, the pO 2 values are blurred by uncertainties arising from spectral dimensions. Blurring due to noise and low resolution modulates the pO 2 levels at the boundaries of hypoxic and normoxic regions resulting in higher apparent pO 2 levels in hypoxic regions. Therefore, specification of intrinsic resolution and pO 2 uncertainties are necessary to interpret digitally processed pO 2 illustrations.
Kalra, Shikha; Puniya, Bhanwar Lal; Kulshreshtha, Deepika; Kumar, Sunil; Kaur, Jagdeep; Ramachandran, Srinivasan; Singh, Kashmir
2013-01-01
Chlorophytum borivilianum, an endangered medicinal plant species is highly recognized for its aphrodisiac properties provided by saponins present in the plant. The transcriptome information of this species is limited and only few hundred expressed sequence tags (ESTs) are available in the public databases. To gain molecular insight of this plant, high throughput transcriptome sequencing of leaf RNA was carried out using Illumina's HiSeq 2000 sequencing platform. A total of 22,161,444 single end reads were retrieved after quality filtering. Available (e.g., De-Bruijn/Eulerian graph) and in-house developed bioinformatics tools were used for assembly and annotation of transcriptome. A total of 101,141 assembled transcripts were obtained, with coverage size of 22.42 Mb and average length of 221 bp. Guanine-cytosine (GC) content was found to be 44%. Bioinformatics analysis, using non-redundant proteins, gene ontology (GO), enzyme commission (EC) and kyoto encyclopedia of genes and genomes (KEGG) databases, extracted all the known enzymes involved in saponin and flavonoid biosynthesis. Few genes of the alkaloid biosynthesis, along with anticancer and plant defense genes, were also discovered. Additionally, several cytochrome P450 (CYP450) and glycosyltransferase unique sequences were also found. We identified simple sequence repeat motifs in transcripts with an abundance of di-nucleotide simple sequence repeat (SSR; 43.1%) markers. Large scale expression profiling through Reads per Kilobase per Million mapped reads (RPKM) showed major genes involved in different metabolic pathways of the plant. Genes, expressed sequence tags (ESTs) and unique sequences from this study provide an important resource for the scientific community, interested in the molecular genetics and functional genomics of C. borivilianum. PMID:24376689
Comparative transcriptome analysis of microsclerotia development in Nomuraea rileyi.
Song, Zhangyong; Yin, Youping; Jiang, Shasha; Liu, Juanjuan; Chen, Huan; Wang, Zhongkang
2013-06-19
Nomuraea rileyi is used as an environmental-friendly biopesticide. However, mass production and commercialization of this organism are limited due to its fastidious growth and sporulation requirements. When cultured in amended medium, we found that N. rileyi could produce microsclerotia bodies, replacing conidiophores as the infectious agent. However, little is known about the genes involved in microsclerotia development. In the present study, the transcriptomes were analyzed using next-generation sequencing technology to find the genes involved in microsclerotia development. A total of 4.69 Gb of clean nucleotides comprising 32,061 sequences was obtained, and 20,919 sequences were annotated (about 65%). Among the annotated sequences, only 5928 were annotated with 34 gene ontology (GO) functional categories, and 12,778 sequences were mapped to 165 pathways by searching against the Kyoto Encyclopedia of Genes and Genomes pathway (KEGG) database. Furthermore, we assessed the transcriptomic differences between cultures grown in minimal and amended medium. In total, 4808 sequences were found to be differentially expressed; 719 differentially expressed unigenes were assigned to 25 GO classes and 1888 differentially expressed unigenes were assigned to 161 KEGG pathways, including 25 enrichment pathways. Subsequently, we examined the up-regulation or uniquely expressed genes following amended medium treatment, which were also expressed on the enrichment pathway, and found that most of them participated in mediating oxidative stress homeostasis. To elucidate the role of oxidative stress in microsclerotia development, we analyzed the diversification of unigenes using quantitative reverse transcription-PCR (RT-qPCR). Our findings suggest that oxidative stress occurs during microsclerotia development, along with a broad metabolic activity change. Our data provide the most comprehensive sequence resource available for the study of N. rileyi. We believe that the transcriptome datasets will serve as an important public information platform to accelerate studies on N. rileyi microsclerotia.
The De Novo Transcriptome and Its Functional Annotation in the Seed Beetle Callosobruchus maculatus.
Sayadi, Ahmed; Immonen, Elina; Bayram, Helen; Arnqvist, Göran
2016-01-01
Despite their unparalleled biodiversity, the genomic resources available for beetles (Coleoptera) remain relatively scarce. We present an integrative and high quality annotated transcriptome of the beetle Callosobruchus maculatus, an important and cosmopolitan agricultural pest as well as an emerging model species in ecology and evolutionary biology. Using Illumina sequencing technology, we sequenced 492 million read pairs generated from 51 samples of different developmental stages (larvae, pupae and adults) of C. maculatus. Reads were de novo assembled using the Trinity software, into a single combined assembly as well as into three separate assemblies based on data from the different developmental stages. The combined assembly generated 218,192 transcripts and 145,883 putative genes. Putative genes were annotated with the Blast2GO software and the Trinotate pipeline. In total, 33,216 putative genes were successfully annotated using Blastx against the Nr (non-redundant) database and 13,382 were assigned to 34,100 Gene Ontology (GO) terms. We classified 5,475 putative genes into Clusters of Orthologous Groups (COG) and 116 metabolic pathways maps were predicted based on the annotation. Our analyses suggested that the transcriptional specificity increases with ontogeny. For example, out of 33,216 annotated putative genes, 51 were only expressed in larvae, 63 only in pupae and 171 only in adults. Our study illustrates the importance of including samples from several developmental stages when the aim is to provide an integrative and high quality annotated transcriptome. Our results will represent an invaluable resource for those working with the ecology, evolution and pest control of C. maculatus, as well for comparative studies of the transcriptomics and genomics of beetles more generally.
The De Novo Transcriptome and Its Functional Annotation in the Seed Beetle Callosobruchus maculatus
Sayadi, Ahmed; Immonen, Elina; Bayram, Helen
2016-01-01
Despite their unparalleled biodiversity, the genomic resources available for beetles (Coleoptera) remain relatively scarce. We present an integrative and high quality annotated transcriptome of the beetle Callosobruchus maculatus, an important and cosmopolitan agricultural pest as well as an emerging model species in ecology and evolutionary biology. Using Illumina sequencing technology, we sequenced 492 million read pairs generated from 51 samples of different developmental stages (larvae, pupae and adults) of C. maculatus. Reads were de novo assembled using the Trinity software, into a single combined assembly as well as into three separate assemblies based on data from the different developmental stages. The combined assembly generated 218,192 transcripts and 145,883 putative genes. Putative genes were annotated with the Blast2GO software and the Trinotate pipeline. In total, 33,216 putative genes were successfully annotated using Blastx against the Nr (non-redundant) database and 13,382 were assigned to 34,100 Gene Ontology (GO) terms. We classified 5,475 putative genes into Clusters of Orthologous Groups (COG) and 116 metabolic pathways maps were predicted based on the annotation. Our analyses suggested that the transcriptional specificity increases with ontogeny. For example, out of 33,216 annotated putative genes, 51 were only expressed in larvae, 63 only in pupae and 171 only in adults. Our study illustrates the importance of including samples from several developmental stages when the aim is to provide an integrative and high quality annotated transcriptome. Our results will represent an invaluable resource for those working with the ecology, evolution and pest control of C. maculatus, as well for comparative studies of the transcriptomics and genomics of beetles more generally. PMID:27442123