comparative expressed sequence: Topics by Science.gov

Sample records for comparative expressed sequence

Expressed sequence tags from heat-shocked seagrass Zostera noltii (Hornemann) from its southern distribution range.

PubMed

Massa, Sónia I; Pearson, Gareth A; Aires, Tânia; Kube, Michael; Olsen, Jeanine L; Reinhardt, Richard; Serrão, Ester A; Arnaud-Haond, Sophie

2011-09-01

Predicted global climate change threatens the distributional ranges of species worldwide. We identified genes expressed in the intertidal seagrass Zostera noltii during recovery from a simulated low tide heat-shock exposure. Five Expressed Sequence Tag (EST) libraries were compared, corresponding to four recovery times following sub-lethal temperature stress, and a non-stressed control. We sequenced and analyzed 7009 sequence reads from 30 min, 2 h, 4 h and 24 h after the beginning of the heat-shock (AHS), and 1585 from the control library, for a total of 8594 sequence reads. Among 51 Tentative UniGenes (TUGs) exhibiting significantly different expression between libraries, 19 (37.3%) were identified as 'molecular chaperones' and were over-expressed following heat-shock, while 12 (23.5%) were 'photosynthesis TUGs' generally under-expressed in heat-shocked plants. A time course analysis showed a rapid increase in expression of the molecular chaperone class, most of which were heat-shock proteins: reads rose from 2 in the control library to almost 230 in the 30 min AHS library, followed by a slow decrease during further recovery. In contrast, 'photosynthesis TUGs' were under-expressed 30 min AHS compared with the control library, and declined progressively with recovery time in the stress libraries, with a total of 29 sequence reads 24 h AHS, compared with 125 in the control. A total of 4734 TUGs were screened for EST-Simple Sequence Repeats (EST-SSRs) and 86 microsatellites were identified. Copyright © 2011 Elsevier B.V. All rights reserved.
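The abstract above reports screening TUGs for EST-SSRs but does not state the detection criteria. The following is a minimal sketch of one common way to find perfect microsatellites with a regular expression; the function name `find_ssrs` and the minimum repeat counts (6 for dinucleotide, 5 for trinucleotide, 4 for tetranucleotide motifs) are illustrative assumptions, not the thresholds used by the authors.

```python
import re


def _is_repeat_of_shorter_unit(motif):
    """True if motif is itself built from a shorter repeated unit (e.g. 'AA', 'ATAT')."""
    n = len(motif)
    return any(n % k == 0 and motif == motif[:k] * (n // k) for k in range(1, n))


def find_ssrs(seq, min_repeats=None):
    """Return (start, motif, repeat_count) tuples for perfect SSRs in seq.

    min_repeats maps motif length to the minimum number of tandem copies.
    The defaults are hypothetical thresholds chosen for illustration only.
    """
    if min_repeats is None:
        min_repeats = {2: 6, 3: 5, 4: 4}
    seq = seq.upper()
    hits = []
    for motif_len, min_rep in sorted(min_repeats.items()):
        # One motif copy followed by at least (min_rep - 1) further copies.
        pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (motif_len, min_rep - 1))
        for match in pattern.finditer(seq):
            motif = match.group(1)
            # Skip homopolymers and motifs that are really shorter repeats.
            if _is_repeat_of_shorter_unit(motif):
                continue
            hits.append((match.start(), motif, len(match.group(0)) // motif_len))
    return hits
```

Applied to every sequence in a unigene set, the number of sequences with at least one hit gives a count comparable in kind to the 86 microsatellites reported, although the actual figure depends entirely on the thresholds chosen.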
Increased complexity of circRNA expression during species evolution.

PubMed

Dong, Rui; Ma, Xu-Kai; Chen, Ling-Ling; Yang, Li

2017-08-03

Circular RNAs (circRNAs) are broadly identified from precursor mRNA (pre-mRNA) back-splicing across various species. Recent studies have suggested a cell-/tissue- specific manner of circRNA expression. However, the distinct expression pattern of circRNAs among species and its underlying mechanism still remain to be explored. Here, we systematically compared circRNA expression from human and mouse, and found that only a small portion of human circRNAs could be determined in parallel mouse samples. The conserved circRNA expression between human and mouse is correlated with the existence of orientation-opposite complementary sequences in introns that flank back-spliced exons in both species, but not the circRNA sequences themselves. Quantification of RNA pairing capacity of orientation-opposite complementary sequences across circRNA-flanking introns by Complementary Sequence Index (CSI) identifies that among all types of complementary sequences, SINEs, especially Alu elements in human, contribute the most for circRNA formation and that their diverse distribution across species leads to the increased complexity of circRNA expression during species evolution. Together, our integrated and comparative reference catalog of circRNAs in different species reveals a species-specific pattern of circRNA expression and suggests a previously under-appreciated impact of fast-evolved SINEs on the regulation of (circRNA) gene expression.
Developing expressed sequence tag libraries and the discovery of simple sequence repeat markers for two species of raspberry (Rubus L.)

USDA-ARS's Scientific Manuscript database

Background: Due to a relatively high level of codominant inheritance and transferability within and among taxonomic groups, simple sequence repeat (SSR) markers are important elements in comparative mapping and delineation of genomic regions associated with traits of economic importance. Expressed S...
Analysis of codon usage in beta-tubulin sequences of helminths.

PubMed

von Samson-Himmelstjerna, G; Harder, A; Failing, K; Pape, M; Schnieder, T

2003-07-01

Codon usage bias has been shown to be correlated with gene expression levels in many organisms, including the nematode Caenorhabditis elegans. Here, the codon usage (cu) characteristics for a set of currently available beta-tubulin coding sequences of helminths were assessed by calculating several indices, including the effective codon number (Nc), the intrinsic codon deviation index (ICDI), the P2 value and the mutational response index (MRI). The P2 value gives a measure of translational pressure, which has been shown to be correlated to high gene expression levels in some organisms, but it has not yet been analysed in that respect in helminths. For all but two of the C. elegans beta-tubulin coding sequences investigated, the P2 value was the only index that indicated the presence of codon usage bias. Therefore, we propose that in general the helminth beta-tubulin sequences investigated here are not expressed at high levels. Furthermore, we calculated the correlation coefficients for the cu patterns of the helminth beta-tubulin sequences compared with those of highly expressed genes in organisms such as Escherichia coli and C. elegans. It was found that beta-tubulin cu patterns for all sequences of members of the Strongylida were significantly correlated to those for highly expressed C. elegans genes. This approach provides a new measure for comparing the adaptation of cu of a particular coding sequence with that of highly expressed genes in possible expression systems. Finally, using the cu patterns of the sequences studied, a phylogenetic tree was constructed. The topology of this tree was very much in concordance with that of a phylogeny based on small subunit ribosomal DNA sequence alignments.
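The comparison of codon usage patterns described above can be illustrated with a short sketch. The abstract does not say which correlation coefficient was used or how the patterns were encoded; the example below assumes a 64-element vector of relative codon frequencies and a Pearson coefficient, and the function names `codon_frequencies` and `pearson` are hypothetical.

```python
from collections import Counter
from itertools import product
from math import sqrt

# All 64 codons in a fixed order, so that vectors from different genes align.
CODONS = ["".join(c) for c in product("ACGT", repeat=3)]
CODON_SET = set(CODONS)


def codon_frequencies(cds):
    """Relative frequency of each codon in an in-frame coding sequence."""
    cds = cds.upper().replace("U", "T")
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    codons = (cds[i:i + 3] for i in range(0, usable, 3))
    counts = Counter(c for c in codons if c in CODON_SET)
    total = sum(counts.values())
    return [counts[c] / total if total else 0.0 for c in CODONS]


def pearson(x, y):
    """Pearson correlation coefficient between two equal-length vectors."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sd_x = sqrt(sum((a - mean_x) ** 2 for a in x))
    sd_y = sqrt(sum((b - mean_y) ** 2 for b in y))
    if sd_x == 0 or sd_y == 0:
        raise ValueError("correlation is undefined for a constant vector")
    return cov / (sd_x * sd_y)
```

A coefficient near 1 between a query coding sequence and a reference set of highly expressed genes would indicate similar codon preferences under these assumptions; it says nothing on its own about statistical significance.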
Differential gene expression in the siphonophore Nanomia bijuga (Cnidaria) assessed with multiple next-generation sequencing workflows.

PubMed

Siebert, Stefan; Robinson, Mark D; Tintori, Sophia C; Goetz, Freya; Helm, Rebecca R; Smith, Stephen A; Shaner, Nathan; Haddock, Steven H D; Dunn, Casey W

2011-01-01

We investigated differential gene expression between functionally specialized feeding polyps and swimming medusae in the siphonophore Nanomia bijuga (Cnidaria) with a hybrid long-read/short-read sequencing strategy. We assembled a set of partial gene reference sequences from long-read data (Roche 454), and generated short-read sequences from replicated tissue samples that were mapped to the references to quantify expression. We collected and compared expression data with three short-read expression workflows that differ in sample preparation, sequencing technology, and mapping tools. These workflows were Illumina mRNA-Seq, which generates sequence reads from random locations along each transcript, and two tag-based approaches, SOLiD SAGE and Helicos DGE, which generate reads from particular tag sites. Differences in expression results across workflows were mostly due to the differential impact of missing data in the partial reference sequences. When all 454-derived gene reference sequences were considered, Illumina mRNA-Seq detected more than twice as many differentially expressed (DE) reference sequences as the tag-based workflows. This discrepancy was largely due to missing tag sites in the partial reference that led to false negatives in the tag-based workflows. When only the subset of reference sequences that unambiguously have tag sites was considered, we found broad congruence across workflows, and they all identified a similar set of DE sequences. Our results are promising in several regards for gene expression studies in non-model organisms. First, we demonstrate that a hybrid long-read/short-read sequencing strategy is an effective way to collect gene expression data when an annotated genome sequence is not available. Second, our replicated sampling indicates that expression profiles are highly consistent across field-collected animals in this case. 
Third, the impacts of partial reference sequences on the ability to detect DE can be mitigated through workflow choice and deeper reference sequencing.
Differential Gene Expression in the Siphonophore Nanomia bijuga (Cnidaria) Assessed with Multiple Next-Generation Sequencing Workflows

PubMed Central

Siebert, Stefan; Robinson, Mark D.; Tintori, Sophia C.; Goetz, Freya; Helm, Rebecca R.; Smith, Stephen A.; Shaner, Nathan; Haddock, Steven H. D.; Dunn, Casey W.

2011-01-01

We investigated differential gene expression between functionally specialized feeding polyps and swimming medusae in the siphonophore Nanomia bijuga (Cnidaria) with a hybrid long-read/short-read sequencing strategy. We assembled a set of partial gene reference sequences from long-read data (Roche 454), and generated short-read sequences from replicated tissue samples that were mapped to the references to quantify expression. We collected and compared expression data with three short-read expression workflows that differ in sample preparation, sequencing technology, and mapping tools. These workflows were Illumina mRNA-Seq, which generates sequence reads from random locations along each transcript, and two tag-based approaches, SOLiD SAGE and Helicos DGE, which generate reads from particular tag sites. Differences in expression results across workflows were mostly due to the differential impact of missing data in the partial reference sequences. When all 454-derived gene reference sequences were considered, Illumina mRNA-Seq detected more than twice as many differentially expressed (DE) reference sequences as the tag-based workflows. This discrepancy was largely due to missing tag sites in the partial reference that led to false negatives in the tag-based workflows. When only the subset of reference sequences that unambiguously have tag sites was considered, we found broad congruence across workflows, and they all identified a similar set of DE sequences. Our results are promising in several regards for gene expression studies in non-model organisms. First, we demonstrate that a hybrid long-read/short-read sequencing strategy is an effective way to collect gene expression data when an annotated genome sequence is not available. Second, our replicated sampling indicates that expression profiles are highly consistent across field-collected animals in this case. 
Third, the impacts of partial reference sequences on the ability to detect DE can be mitigated through workflow choice and deeper reference sequencing. PMID:21829563
Music performance and the perception of key.

PubMed

Thompson, W F; Cuddy, L L

1997-02-01

The effect of music performance on perceived key movement was examined. Listeners judged key movement in sequences presented without performance expression (mechanical) in Experiment 1 and with performance expression in Experiment 2. Modulation distance varied. Judgments corresponded to predictions based on the cycle of fifths and toroidal models of key relatedness, with the highest correspondence for performed versions with the toroidal model. In Experiment 3, listeners compared mechanical sequences with either performed sequences or modifications of performed sequences. Modifications preserved expressive differences between chords, but not between voices. Predictions from Experiments 1 and 2 held only for performed sequences, suggesting that differences between voices are informative of key movement. Experiment 4 confirmed that modifications did not disrupt musicality. Analyses of performances further suggested a link between performance expression and key.
Gene discovery in the hamster: a comparative genomics approach for gene annotation by sequencing of hamster testis cDNAs

PubMed Central

Oduru, Sreedhar; Campbell, Janee L; Karri, SriTulasi; Hendry, William J; Khan, Shafiq A; Williams, Simon C

2003-01-01

Background: Complete genome annotation will likely be achieved through a combination of computer-based analysis of available genome sequences combined with direct experimental characterization of expressed regions of individual genomes. We have utilized a comparative genomics approach involving the sequencing of randomly selected hamster testis cDNAs to begin to identify genes not previously annotated on the human, mouse, rat and Fugu (pufferfish) genomes. Results: 735 distinct sequences were analyzed for their relatedness to known sequences in public databases. Eight of these sequences were derived from previously unidentified genes and expression of these genes in testis was confirmed by Northern blotting. The genomic locations of each sequence were mapped in human, mouse, rat and pufferfish, where applicable, and the structure of their cognate genes was derived using computer-based predictions, genomic comparisons and analysis of uncharacterized cDNA sequences from human and macaque. Conclusion: The use of a comparative genomics approach resulted in the identification of eight cDNAs that correspond to previously uncharacterized genes in the human genome. The proteins encoded by these genes included a new member of the kinesin superfamily, a SET/MYND-domain protein, and six proteins for which no specific function could be predicted. Each gene was expressed primarily in testis, suggesting that they may play roles in the development and/or function of testicular cells. PMID:12783626
Cloning and expression of cDNA coding for bouganin.

PubMed

den Hartog, Marcel T; Lubelli, Chiara; Boon, Louis; Heerkens, Sijmie; Ortiz Buijsse, Antonio P; de Boer, Mark; Stirpe, Fiorenzo

2002-03-01

Bouganin is a ribosome-inactivating protein that recently was isolated from Bougainvillea spectabilis Willd. In this work, the cloning and expression of the cDNA encoding bouganin is described. From the cDNA, the amino-acid sequence was deduced, which correlated with the primary sequence data obtained by amino-acid sequencing on the native protein. Bouganin is synthesized as a pro-peptide consisting of 305 amino acids, the first 26 of which act as a leader signal while the 29 C-terminal amino acids are cleaved during processing of the molecule. The mature protein consists of 250 amino acids. Using the cDNA sequence encoding the mature protein of 250 amino acids, a recombinant protein was expressed, purified and characterized. The recombinant molecule had similar activity in a cell-free protein synthesis assay and comparable toxicity to living cells when compared with the isolated native bouganin.
A detailed gene expression study of the Miscanthus genus reveals changes in the transcriptome associated with the rejuvenation of spring rhizomes.

PubMed

Barling, Adam; Swaminathan, Kankshita; Mitros, Therese; James, Brandon T; Morris, Juliette; Ngamboma, Ornella; Hall, Megan C; Kirkpatrick, Jessica; Alabady, Magdy; Spence, Ashley K; Hudson, Matthew E; Rokhsar, Daniel S; Moose, Stephen P

2013-12-09

The Miscanthus genus of perennial C4 grasses contains promising biofuel crops for temperate climates. However, few genomic resources exist for Miscanthus, which limits understanding of its interesting biology and future genetic improvement. A comprehensive catalog of expressed sequences was generated from a variety of Miscanthus species and tissue types, with an emphasis on characterizing gene expression changes in spring compared to fall rhizomes. Illumina short read sequencing technology was used to produce transcriptome sequences from different tissues and organs during distinct developmental stages for multiple Miscanthus species, including Miscanthus sinensis, Miscanthus sacchariflorus, and their interspecific hybrid Miscanthus × giganteus. More than fifty billion base-pairs of Miscanthus transcript sequence were produced. Overall, 26,230 Sorghum gene models (i.e., ~ 96% of predicted Sorghum genes) had at least five Miscanthus reads mapped to them, suggesting that a large portion of the Miscanthus transcriptome is represented in this dataset. The Miscanthus × giganteus data was used to identify genes preferentially expressed in a single tissue, such as the spring rhizome, using Sorghum bicolor as a reference. Quantitative real-time PCR was used to verify examples of preferential expression predicted via RNA-Seq. Contiguous consensus transcript sequences were assembled for each species and annotated using InterProScan. Sequences from the assembled transcriptome were used to amplify genomic segments from a doubled haploid Miscanthus sinensis and from Miscanthus × giganteus to further disentangle the allelic and paralogous variations in genes. This large expressed sequence tag collection creates a valuable resource for the study of Miscanthus biology by providing detailed gene sequence information and tissue preferred expression patterns. We have successfully generated a database of transcriptome assemblies and demonstrated its use in the study of genes of interest. Analysis of gene expression profiles revealed biological pathways that exhibit altered regulation in spring compared to fall rhizomes, which are consistent with their different physiological functions. The expression profiles of the subterranean rhizome provide a better understanding of the biological activities of the underground stem structures that are essential for perenniality and the storage or remobilization of carbon and nutrient resources.
RNA sequencing confirms similarities between PPI-responsive oesophageal eosinophilia and eosinophilic oesophagitis.

PubMed

Peterson, K A; Yoshigi, M; Hazel, M W; Delker, D A; Lin, E; Krishnamurthy, C; Consiglio, N; Robson, J; Yandell, M; Clayton, F

2018-06-04

Although current American guidelines distinguish proton pump inhibitor-responsive oesophageal eosinophilia (PPI-REE) from eosinophilic oesophagitis (EoE), these entities are broadly similar. While two microarray studies showed that they have similar transcriptomes, more extensive RNA sequencing studies have not been done previously. We aimed to determine whether RNA sequencing identifies genetic markers distinguishing PPI-REE from EoE. We retrospectively examined 13 PPI-REE and 14 EoE biopsies, matched for tissue eosinophil content, and 14 normal controls. Patients and controls were not PPI-treated at the time of biopsy. We performed RNA sequencing on formalin-fixed, paraffin-embedded tissue, with differential expression confirmation by quantitative polymerase chain reaction (PCR). We validated the use of formalin-fixed, paraffin-embedded vs RNAlater-preserved tissue, and compared our formalin-fixed, paraffin-embedded EoE results to a prior EoE study. By RNA sequencing, no genes were differentially expressed between the EoE and PPI-REE groups at the false discovery rate (FDR) ≤0.01 level. Compared to normal controls, 1996 genes were differentially expressed in the PPI-REE group and 1306 genes in the EoE group. By less stringent criteria, only MAPK8IP2 was differentially expressed between PPI-REE and EoE (FDR = 0.029, 2.2-fold less in EoE than in PPI-REE), with similar results by PCR. KCNJ2, which was differentially expressed in a prior study, was similar in the EoE and PPI-REE groups by both RNA sequencing and real-time PCR. Eosinophilic oesophagitis and PPI-REE have comparable transcriptomes, confirming that they are part of the same disease continuum. © 2018 John Wiley & Sons Ltd.
RNA Sequencing Reveals Differential Expression of Mitochondrial and Oxidation Reduction Genes in the Long-Lived Naked Mole-Rat When Compared to Mice

PubMed Central

Holmes, Andrew; Szafranski, Karol; Faulkes, Chris G.; Coen, Clive W.; Buffenstein, Rochelle; Platzer, Matthias; de Magalhães, João Pedro; Church, George M.

2011-01-01

The naked mole-rat (Heterocephalus glaber) is a long-lived, cancer resistant rodent and there is a great interest in identifying the adaptations responsible for these and other of its unique traits. We employed RNA sequencing to compare liver gene expression profiles between naked mole-rats and wild-derived mice. Our results indicate that genes associated with oxidoreduction and mitochondria were expressed at higher relative levels in naked mole-rats. The largest effect is nearly 300-fold higher expression of epithelial cell adhesion molecule (Epcam), a tumour-associated protein. Also of interest are the protease inhibitor, alpha2-macroglobulin (A2m), and the mitochondrial complex II subunit Sdhc, both ageing-related genes found strongly over-expressed in the naked mole-rat. These results hint at possible candidates for specifying species differences in ageing and cancer, and in particular suggest complex alterations in mitochondrial and oxidation reduction pathways in the naked mole-rat. Our differential gene expression analysis obviated the need for a reference naked mole-rat genome by employing a combination of Illumina/Solexa and 454 platforms for transcriptome sequencing and assembling transcriptome contigs of the non-sequenced species. Overall, our work provides new research foci and methods for studying the naked mole-rat's fascinating characteristics. PMID:22073188
Single-Cell RNA-Sequencing: Assessment of Differential Expression Analysis Methods.

PubMed

Dal Molin, Alessandra; Baruzzo, Giacomo; Di Camillo, Barbara

2017-01-01

The sequencing of the transcriptomes of single cells, or single-cell RNA-sequencing, has now become the dominant technology for the identification of novel cell types and for the study of stochastic gene expression. In recent years, various tools for analyzing single-cell RNA-sequencing data have been proposed, many of them with the purpose of performing differential expression analysis. In this work, we compare four different tools for single-cell RNA-sequencing differential expression, together with two popular methods originally developed for the analysis of bulk RNA-sequencing data, but largely applied to single-cell data. We discuss results obtained on two real and one synthetic dataset, along with considerations about the perspectives of single-cell differential expression analysis. In particular, we explore the methods' performance in four different scenarios, mimicking different unimodal or bimodal distributions of the data, as characteristic of single-cell transcriptomics. We observed marked differences between the selected methods in terms of precision and recall, the number of detected differentially expressed genes and the overall performance. Globally, the results obtained in our study suggest that it is difficult to identify a best performing tool and that efforts are needed to improve the methodologies for single-cell RNA-sequencing data analysis and gain better accuracy of results.
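Precision and recall, the two measures named in the abstract above, can be computed from the set of genes a tool calls differentially expressed and the set known to be truly differential (available, for example, in a synthetic dataset). The sketch below uses the standard definitions; the function name `precision_recall` and the gene identifiers are illustrative, and the study's own evaluation code is not shown in the source.

```python
def precision_recall(called, truth):
    """Return (precision, recall) for a set of called genes against a truth set.

    precision = true positives / all called genes
    recall    = true positives / all truly differential genes
    An empty denominator yields 0.0 by convention in this sketch.
    """
    called, truth = set(called), set(truth)
    true_positives = len(called & truth)
    precision = true_positives / len(called) if called else 0.0
    recall = true_positives / len(truth) if truth else 0.0
    return precision, recall
```

For example, a tool that calls four genes of which two are among three truly differential genes has a precision of 0.5 and a recall of about 0.67.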
A draft of the genome and four transcriptomes of a medicinal and pesticidal angiosperm Azadirachta indica

PubMed Central

2012-01-01

Background: The Azadirachta indica (neem) tree is a source of a wide range of natural products, including the potent biopesticide azadirachtin. In spite of its widespread applications in agriculture and medicine, the molecular aspects of the biosynthesis of neem terpenoids remain largely unexplored. The current report describes the draft genome and four transcriptomes of A. indica and attempts to contextualise the sequence information in terms of its molecular phylogeny, transcript expression and terpenoid biosynthesis pathways. A. indica is the first member of the family Meliaceae to be sequenced using a next-generation sequencing approach. Results: The genome and transcriptomes of A. indica were sequenced using multiple sequencing platforms and libraries. The A. indica genome is AT-rich, bears few repetitive DNA elements and comprises about 20,000 genes. The molecular phylogenetic analyses grouped A. indica together with Citrus sinensis from the Rutaceae family, validating its conventional taxonomic classification. Comparative transcript expression analysis showed either exclusive or enhanced expression of known genes involved in neem terpenoid biosynthesis pathways compared to other sequenced angiosperms. Genome and transcriptome analyses in A. indica led to the identification of repeat elements, nucleotide composition and expression profiles of genes in various organs. Conclusions: This study on A. indica genome and transcriptomes will provide a model for characterization of metabolic pathways involved in synthesis of bioactive compounds, comparative evolutionary studies among various Meliaceae family members and help annotate their genomes. A better understanding of molecular pathways involved in the azadirachtin synthesis in A. indica will pave the way for bulk production of environmentally friendly biopesticides. PMID:22958331
Characteristics of the Lotus japonicus gene repertoire deduced from large-scale expressed sequence tag (EST) analysis.

PubMed

Asamizu, Erika; Nakamura, Yasukazu; Sato, Shusei; Tabata, Satoshi

2004-02-01

To perform a comprehensive analysis of genes expressed in a model legume, Lotus japonicus, a total of 74472 3'-end expressed sequence tags (EST) were generated from cDNA libraries produced from six different organs. Clustering of sequences was performed with an identity criterion of 95% for 50 bases, and a total of 20457 non-redundant sequences, 8503 contigs and 11954 singletons were generated. EST sequence coverage was analyzed by using the annotated L. japonicus genomic sequence and 1093 of the 1889 predicted protein-encoding genes (57.9%) were hit by the EST sequence(s). Gene content was compared to several plant species. Among the 8503 contigs, 471 were identified as sequences conserved only in leguminous species and these included several disease resistance-related genes. This suggested that in legumes, these genes may have evolved specifically to resist pathogen attack. The rate of gene sequence divergence was assessed by comparing similarity level and functional category based on the Gene Ontology (GO) annotation of Arabidopsis genes. This revealed that genes encoding ribosomal proteins, as well as those related to translation, photosynthesis, and cellular structure were more abundantly represented in the highly conserved class, and that genes encoding transcription factors and receptor protein kinases were abundantly represented in the less conserved class. To make the sequence information and the cDNA clones available to the research community, a Web database with useful services was created at http://www.kazusa.or.jp/en/plant/lotus/EST/.
Improving RNA-Seq expression estimation by modeling isoform- and exon-specific read sequencing rate.

PubMed

Liu, Xuejun; Shi, Xinxin; Chen, Chunlin; Zhang, Li

2015-10-16

The high-throughput sequencing technology, RNA-Seq, has been widely used to quantify gene and isoform expression in the study of transcriptome in recent years. Accurate expression measurement from the millions or billions of short generated reads is obstructed by difficulties. One is ambiguous mapping of reads to reference transcriptome caused by alternative splicing. This increases the uncertainty in estimating isoform expression. The other is non-uniformity of read distribution along the reference transcriptome due to positional, sequencing, mappability and other undiscovered sources of biases. This violates the uniform assumption of read distribution for many expression calculation approaches, such as the direct RPKM calculation and Poisson-based models. Many methods have been proposed to address these difficulties. Some approaches employ latent variable models to discover the underlying pattern of read sequencing. However, most of these methods make bias correction based on surrounding sequence contents and share the bias models by all genes. They therefore cannot estimate gene- and isoform-specific biases as revealed by recent studies. We propose a latent variable model, NLDMseq, to estimate gene and isoform expression. Our method adopts latent variables to model the unknown isoforms, from which reads originate, and the underlying percentage of multiple spliced variants. The isoform- and exon-specific read sequencing biases are modeled to account for the non-uniformity of read distribution, and are identified by utilizing the replicate information of multiple lanes of a single library run. We employ simulation and real data to verify the performance of our method in terms of accuracy in the calculation of gene and isoform expression. Results show that NLDMseq obtains competitive gene and isoform expression compared to popular alternatives. 
Finally, the proposed method is applied to the detection of differential expression (DE) to show its usefulness in downstream analysis. The proposed NLDMseq method provides an approach to accurately estimate gene and isoform expression from RNA-Seq data by modeling the isoform- and exon-specific read sequencing biases. It makes use of a latent variable model to discover the hidden pattern of read sequencing. We have shown that it works well in both simulations and real datasets, and has competitive performance compared to popular methods. The method has been implemented as freely available software, which can be found at https://github.com/PUGEA/NLDMseq.
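The "direct RPKM calculation" that the abstract above contrasts with model-based approaches is the standard reads-per-kilobase-per-million normalisation. A minimal sketch follows; the function name `rpkm` and its parameter names are illustrative, and this is the textbook formula rather than any part of NLDMseq.

```python
def rpkm(read_count, transcript_length_bp, total_mapped_reads):
    """Reads Per Kilobase of transcript per Million mapped reads.

    RPKM = read_count * 1e9 / (total_mapped_reads * transcript_length_bp)
    The formula implicitly assumes reads are sampled uniformly along the
    transcript, which is the assumption the abstract describes as violated.
    """
    if transcript_length_bp <= 0 or total_mapped_reads <= 0:
        raise ValueError("length and total mapped reads must be positive")
    return read_count * 1e9 / (total_mapped_reads * transcript_length_bp)
```

For instance, 500 reads on a 2,000 bp transcript in a library of 10 million mapped reads gives an RPKM of 25.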
openSputnik--a database to ESTablish comparative plant genomics using unsaturated sequence collections.

PubMed

Rudd, Stephen

2005-01-01

The public expressed sequence tag collections are continually being enriched with high-quality sequences that represent an ever-expanding range of taxonomically diverse plant species. While these sequence collections provide biased insight into the populations of expressed genes available within individual species and their associated tissues, the information is conceivably of wider relevance in a comparative context. When we consider the available expressed sequence tag (EST) collections of summer 2004, most of the major plant taxonomic clades are at least superficially represented. Investigation of the five million available plant ESTs provides a wealth of information that has applications in modelling the routes of plant genome evolution and the identification of lineage-specific genes and gene families. Over four million ESTs from over 50 distinct plant species have been collated within an EST analysis pipeline called openSputnik. The ESTs were resolved down into approximately one million unigene sequences. These have been annotated using orthology-based annotation transfer from reference plant genomes and using a variety of contemporary bioinformatics methods to assign peptide, structural and functional attributes. The openSputnik database is available at http://sputnik.btk.fi.
Microsatellite DNA in genomic survey sequences and UniGenes of loblolly pine

Treesearch

Craig S Echt; Surya Saha; Dennis L Deemer; C Dana Nelson

2011-01-01

Genomic DNA sequence databases are a potential and growing resource for simple sequence repeat (SSR) marker development in loblolly pine (Pinus taeda L.). Loblolly pine also has many expressed sequence tags (ESTs) available for microsatellite (SSR) marker development. We compared loblolly pine SSR densities in genome survey sequences (GSSs) to those in non-redundant...
Genetic analysis of tumorigenesis: XXXII. Localization of constitutionally amplified KRAS sequences to Chinese hamster chromosomes X and Y by in situ hybridization.

PubMed

Stenman, G; Anisowicz, A; Sager, R

1988-11-01

The KRAS gene is constitutionally amplified in the Chinese hamster. We have mapped the amplified sequences by in situ hybridization to two major sites on the X and Y chromosomes, Xq4 and Yp2. No autosomal site was detected despite a search under relaxed hybridization conditions. KRAS DNA is amplified about 50-fold compared to a human cell line known to have a diploid number of KRAS sequences, whereas mRNA expression is 5- to 10-fold lower than in normal human cells. While mRNA expression levels do not necessarily parallel gene copy number, the low expression level strongly suggests that the amplified sequences are transcriptionally silent. It is suggested that the amplified sequences arose from the original KRAS gene on chromosome 8 and that the KRAS sequences on the Y chromosome arose by X-Y recombination.
Identification, characterization and expression analysis of lineage-specific genes within sweet orange (Citrus sinensis).

PubMed

Xu, Yuantao; Wu, Guizhi; Hao, Baohai; Chen, Lingling; Deng, Xiuxin; Xu, Qiang

2015-11-23

With the availability of a rapidly increasing number of genome and transcriptome sequences, lineage-specific genes (LSGs) can be identified and characterized. Like other conserved functional genes, LSGs play important roles in biological evolution and functions. Two sets of citrus LSGs, 296 citrus-specific genes (CSGs) and 1039 orphan genes specific to sweet orange, were identified by comparative analysis between the sweet orange genome sequences and 41 genomes and 273 transcriptomes. With the two sets of genes, gene structure and gene expression pattern were investigated. On average, both the CSGs and orphan genes have fewer exons, shorter gene length and higher GC content when compared with evolutionarily conserved genes (ECs). Expression profiling indicated that most of the LSGs were expressed in various tissues of sweet orange and some of them exhibited distinct temporal and spatial expression patterns. In particular, the orphan genes were preferentially expressed in callus, which is an important pluripotent tissue of citrus. In addition, some of the CSGs and orphan genes were expressed in response to abiotic stress, indicating their potential functions during interaction with the environment. This study identified and characterized two sets of LSGs in citrus, dissected their sequence features and expression patterns, and provided valuable clues for future functional analysis of the LSGs in sweet orange.

Positive Selection Underlies Faster-Z Evolution of Gene Expression in Birds

PubMed Central

Dean, Rebecca; Harrison, Peter W.; Wright, Alison E.; Zimmer, Fabian; Mank, Judith E.

2015-01-01

The elevated rate of evolution for genes on sex chromosomes compared with autosomes (Fast-X or Fast-Z evolution) can result either from positive selection in the heterogametic sex or from nonadaptive consequences of reduced relative effective population size. Recent work in birds suggests that Fast-Z of coding sequence is primarily due to relaxed purifying selection resulting from reduced relative effective population size. However, gene sequence and gene expression are often subject to distinct evolutionary pressures; therefore, we tested for Fast-Z in gene expression using next-generation RNA-sequencing data from multiple avian species. Similar to studies of Fast-Z in coding sequence, we recover clear signatures of Fast-Z in gene expression; however, in contrast to coding sequence, our data indicate that Fast-Z in expression is due to positive selection acting primarily in females. In the soma, where gene expression is highly correlated between the sexes, we detected Fast-Z in both sexes, although at a higher rate in females, suggesting that many positively selected expression changes in females are also expressed in males. In the gonad, where intersexual correlations in expression are much lower, we detected Fast-Z for female gene expression, but crucially, not males. This suggests that a large amount of expression variation is sex-specific in its effects within the gonad. Taken together, our results indicate that Fast-Z evolution of gene expression is the product of positive selection acting on recessive beneficial alleles in the heterogametic sex. More broadly, our analysis suggests that the adaptive potential of Z chromosome gene expression may be much greater than that of gene sequence, results which have important implications for the role of sex chromosomes in speciation and sexual selection. PMID:26067773
Differential effects of simple repeating DNA sequences on gene expression from the SV40 early promoter.

PubMed

Amirhaeri, S; Wohlrab, F; Wells, R D

1995-02-17

The influence of simple repeat sequences, cloned into different positions relative to the SV40 early promoter/enhancer, on the transient expression of the chloramphenicol acetyltransferase (CAT) gene was investigated. Insertion of (G)29.(C)29 in either orientation into the 5'-untranslated region of the CAT gene reduced expression in CV-1 cells 50-100 fold when compared with controls with random sequence inserts. Analysis of CAT-specific mRNA levels demonstrated that the effect was due to a reduction of CAT mRNA production rather than to posttranscriptional events. In contrast, insertion of the same insert in either orientation upstream of the promoter-enhancer or downstream of the gene stimulated gene expression 2-3-fold. These effects could be reversed by cotransfection of a competitor plasmid carrying (G)25.(C)25 sequences. The results suggest that a G.C-binding transcription factor modulates gene expression in this system and that promoter strength can be regulated by providing protein-binding sites in trans. Although constructs containing longer tracts of alternating (C-G), (T-G), or (A-T) sequences inhibited CAT expression when inserted in the 5'-untranslated region of the CAT gene, the amount of CAT mRNA was unaffected. Hence, these inhibitions must be due to posttranscriptional events, presumably at the level of translation. These effects of microsatellite sequences on gene expression are discussed with respect to recent data on related simple repeat sequences which cause several human genetic diseases.
Characterization of the glutathione S-transferase gene family through ESTs and expression analyses within common and pigmented cultivars of Citrus sinensis (L.) Osbeck.

PubMed

Licciardello, Concetta; D'Agostino, Nunzio; Traini, Alessandra; Recupero, Giuseppe Reforgiato; Frusciante, Luigi; Chiusano, Maria Luisa

2014-02-03

Glutathione S-transferases (GSTs) represent a ubiquitous gene family encoding detoxification enzymes able to recognize reactive electrophilic xenobiotic molecules as well as compounds of endogenous origin. Anthocyanin pigments require GSTs for their transport into the vacuole since their cytoplasmic retention is toxic to the cell. Anthocyanin accumulation in Citrus sinensis (L.) Osbeck fruit flesh determines different phenotypes affecting the typical pigmentation of Sicilian blood oranges. In this paper we describe: i) the characterization of the GST gene family in C. sinensis through a systematic EST analysis; ii) the validation of the EST assembly by exploiting the genome sequences of C. sinensis and C. clementina and their genome annotations; iii) GST gene expression profiling in six tissues/organs and in two different sweet orange cultivars, Cadenera (common) and Moro (pigmented). We identified 61 GST transcripts, described the full- or partial-length nature of the sequences and assigned to each sequence the GST class membership exploiting a comparative approach and the classification scheme proposed for plant species. A total of 23 full-length sequences were defined. Fifty-four of the 61 transcripts were successfully aligned to the C. sinensis and C. clementina genomes. Tissue specific expression profiling demonstrated that the expression of some GST transcripts was 'tissue-affected' and cultivar specific. A comparative analysis of C. sinensis GSTs with those from other plant species was also considered. Data from the current analysis are accessible at http://biosrv.cab.unina.it/citrusGST/, with the aim to provide a reference resource for C. sinensis GSTs. This study aimed at the characterization of the GST gene family in C. sinensis. Based on expression patterns from two different cultivars and on sequence-comparative analyses, we also highlighted that two sequences, a Phi class GST and a Mapeg class GST, could be involved in the conjugation of anthocyanin pigments and in their transport into the vacuole, specifically in fruit flesh of the pigmented cultivar.
The use of a viral 2A sequence for the simultaneous over-expression of both the vgf gene and enhanced green fluorescent protein (eGFP) in vitro and in vivo

PubMed Central

Lewis, Jo E.; Brameld, John M.; Hill, Phil; Barrett, Perry; Ebling, Francis J.P.; Jethwa, Preeti H.

2015-01-01

Introduction The viral 2A sequence has become an attractive alternative to the traditional internal ribosomal entry site (IRES) for simultaneous over-expression of two genes and in combination with recombinant adeno-associated viruses (rAAV) has been used to manipulate gene expression in vitro. New method To develop a rAAV construct in combination with the viral 2A sequence to allow long-term over-expression of the vgf gene and fluorescent marker gene for tracking of the transfected neurones in vivo. Results Transient transfection of the AAV plasmid containing the vgf gene, viral 2A sequence and eGFP into SH-SY5Y cells resulted in eGFP fluorescence comparable to a commercially available reporter construct. This increase in fluorescent cells was accompanied by an increase in VGF mRNA expression. Infusion of the rAAV vector containing the vgf gene, viral 2A sequence and eGFP resulted in eGFP fluorescence in the hypothalamus of both mice and Siberian hamsters, 32 weeks post infusion. In situ hybridisation confirmed that the location of VGF mRNA expression in the hypothalamus corresponded to the eGFP pattern of fluorescence. Comparison with old method The viral 2A sequence is much smaller than the traditional IRES and therefore allowed over-expression of the vgf gene with fluorescent tracking without compromising viral capacity. Conclusion The use of the viral 2A sequence in the AAV plasmid allowed the simultaneous expression of both genes in vitro. When used in combination with rAAV it resulted in long-term over-expression of both genes at equivalent locations in the hypothalamus of both Siberian hamsters and mice, without any adverse effects. PMID:26300182
Generation, annotation and analysis of ESTs from Trichoderma harzianum CECT 2413

PubMed Central

Vizcaíno, Juan Antonio; González, Francisco Javier; Suárez, M Belén; Redondo, José; Heinrich, Julian; Delgado-Jarana, Jesús; Hermosa, Rosa; Gutiérrez, Santiago; Monte, Enrique; Llobell, Antonio; Rey, Manuel

2006-01-01

Background The filamentous fungus Trichoderma harzianum is used as a biological control agent of several plant-pathogenic fungi. In order to study the genome of this fungus, a functional genomics project called "TrichoEST" was developed to give insights into genes involved in biological control activities using an approach based on the generation of expressed sequence tags (ESTs). Results Eight different cDNA libraries from T. harzianum strain CECT 2413 were constructed. Different growth conditions involving mainly different nutrient conditions and/or stresses were used. We here present the analysis of the 8,710 ESTs generated. A total of 3,478 unique sequences were identified, of which 81.4% had sequence similarity with GenBank entries, using the BLASTX algorithm. Using the Gene Ontology hierarchy, we annotated 51.1% of the unique sequences and compared their distribution among the gene libraries. Additionally, the InterProScan algorithm was used in order to further characterize the sequences. The identification of the putatively secreted proteins was also carried out. Later, based on the EST abundance, we examined the highly expressed genes, and a hydrophobin was identified as the gene expressed at the highest level. We compared our collection of ESTs with the previous collections obtained from Trichoderma species and we also compared our sequence set with different complete eukaryotic genomes from several animals, plants and fungi. Accordingly, the presence of similar sequences in different kingdoms was also studied. Conclusion This EST collection and its annotation provide a significant resource for basic and applied research on T. harzianum, a fungus of high biotechnological interest. PMID:16872539
Comparative analysis of the feline immunoglobulin repertoire.

PubMed

Steiniger, Sebastian C J; Glanville, Jacob; Harris, Douglas W; Wilson, Thomas L; Ippolito, Gregory C; Dunham, Steven A

2017-03-01

Next-Generation Sequencing combined with bioinformatics is a powerful tool for analyzing the large number of DNA sequences present in the expressed antibody repertoire and these data sets can be used to advance a number of research areas including antibody discovery and engineering. The accurate measurement of the immune repertoire sequence composition, diversity and abundance is important for understanding the repertoire response in infections, vaccinations and cancer immunology and could also be useful for elucidating novel molecular targets. In this study, 4 individual domestic cats (Felis catus) were subjected to antibody repertoire sequencing, generating 1,079,863 sequences for IgG VH, 1,050,824 for IgM VH, 569,518 for VK and 450,195 for VL. Our analysis suggests that similar VDJ expression patterns exist across all cats. Similar to the canine repertoire, the feline repertoire is dominated by a single subgroup, namely VH3. The antibody paratope of felines showed similar amino acid variation when compared to human, mouse and canine counterparts. All animals show a similarly skewed VH CDR-H3 profile and, when compared to canine, human and mouse, distinct differences are observed. Our study represents the first attempt to characterize sequence diversity in the expressed feline antibody repertoire and this demonstrates the utility of using NGS to elucidate entire antibody repertoires from individual animals. These data provide significant insight into understanding the feline immune system function. Copyright © 2017 International Alliance for Biological Standardization. Published by Elsevier Ltd. All rights reserved.
A distributed system for fast alignment of next-generation sequencing data.

PubMed

Srimani, Jaydeep K; Wu, Po-Yen; Phan, John H; Wang, May D

2010-12-01

We developed a scalable distributed computing system using the Berkeley Open Infrastructure for Network Computing (BOINC) to align next-generation sequencing (NGS) data quickly and accurately. NGS technology is emerging as a promising platform for gene expression analysis due to its high sensitivity compared to traditional genomic microarray technology. However, despite the benefits, NGS datasets can be prohibitively large, requiring significant computing resources to obtain sequence alignment results. Moreover, as the data and alignment algorithms become more prevalent, it will become necessary to examine the effect of the multitude of alignment parameters on various NGS systems. We validate the distributed software system by (1) computing simple timing results to show the speed-up gained by using multiple computers, (2) optimizing alignment parameters using simulated NGS data, and (3) computing NGS expression levels for a single biological sample using optimal parameters and comparing these expression levels to those of a microarray sample. Results indicate that the distributed alignment system achieves approximately a linear speed-up and correctly distributes sequence data to and gathers alignment results from multiple compute clients.
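The "approximately linear speed-up" reported above can be made concrete with the usual definitions of speed-up and parallel efficiency. The sketch below is illustrative only: the timing values are hypothetical placeholders, not measurements from the study, and the function names are assumptions.

```python
def speedup(t_single, t_parallel):
    """Speed-up: run time on one client divided by run time on n clients."""
    return t_single / t_parallel


def efficiency(t_single, t_parallel, n_clients):
    """Parallel efficiency: 1.0 means perfectly linear speed-up."""
    return speedup(t_single, t_parallel) / n_clients


def summarize(timings):
    """Map each client count to (speed-up, efficiency) relative to 1 client.

    `timings` maps number of clients to wall-clock seconds and must
    contain an entry for a single client.
    """
    base = timings[1]
    return {
        n: (speedup(base, t), efficiency(base, t, n))
        for n, t in sorted(timings.items())
    }


if __name__ == "__main__":
    # Hypothetical wall-clock times in seconds, for illustration only.
    example_timings = {1: 1200.0, 2: 610.0, 4: 315.0, 8: 165.0}
    for n, (s, e) in summarize(example_timings).items():
        print(f"{n} clients: speed-up {s:.2f}, efficiency {e:.2f}")
```

An efficiency that stays close to 1.0 as clients are added is what "approximately linear" means here; a steady decline would point to distribution or result-gathering overhead.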
Optimized invertase expression and secretion cassette for improving Yarrowia lipolytica growth on sucrose for industrial applications.

PubMed

Lazar, Zbigniew; Rossignol, Tristan; Verbeke, Jonathan; Crutz-Le Coq, Anne-Marie; Nicaud, Jean-Marc; Robak, Małgorzata

2013-11-01

Yarrowia lipolytica requires the expression of a heterologous invertase to grow on a sucrose-based substrate. This work reports the construction of an optimized invertase expression cassette composed of the Saccharomyces cerevisiae Suc2p secretion signal sequence followed by the SUC2 sequence and under the control of the strong Y. lipolytica pTEF promoter. This new construction allows a fast and optimal cleavage of sucrose into glucose and fructose and allows cells to reach the maximum growth rate. Contrary to pre-existing constructions, the expression of SUC2 is not sensitive to medium composition in this context. The strain JMY2593, expressing this new cassette with an optimized secretion signal sequence and a strong promoter, produces 4,519 U/l of extracellular invertase in bioreactor experiments compared to 597 U/l in a strain expressing the former invertase construction. The expression of this cassette strongly improved production of invertase and is suitable for simultaneous high-level production of citric acid from sucrose-based media.
Metal resistance sequences and transgenic plants

DOEpatents

Meagher, Richard Brian; Summers, Anne O.; Rugh, Clayton L.

1999-10-12

The present invention provides nucleic acid sequences encoding a metal ion resistance protein, which are expressible in plant cells. The metal resistance protein provides for the enzymatic reduction of metal ions including but not limited to divalent Cu, divalent mercury, trivalent gold, divalent cadmium, lead ions and monovalent silver ions. Transgenic plants which express these coding sequences exhibit increased resistance to metal ions in the environment as compared with plants which have not been so genetically modified. Transgenic plants with improved resistance to organometals including alkylmercury compounds, among others, are provided by the further inclusion of plant-expressible organometal lyase coding sequences, as specifically exemplified by the plant-expressible merB coding sequence. Furthermore, these transgenic plants which have been genetically modified to express the metal resistance coding sequences of the present invention can participate in the bioremediation of metal contamination via the enzymatic reduction of metal ions. Transgenic plants resistant to organometals can further mediate remediation of organic metal compounds, for example, alkylmetal compounds including but not limited to methyl mercury, methyl lead compounds, methyl cadmium and methyl arsenic compounds, in the environment by causing the freeing of mercuric or other metal ions and the reduction of the ionic mercury or other metal ions to the less toxic elemental mercury or other metals.
Error propagation in eigenimage filtering.

PubMed

Soltanian-Zadeh, H; Windham, J P; Jenkins, J M

1990-01-01

Mathematical derivation of error (noise) propagation in eigenimage filtering is presented. Based on the mathematical expressions, a method for decreasing the propagated noise given a sequence of images is suggested. The signal-to-noise ratio (SNR) and contrast-to-noise ratio (CNR) of the final composite image are compared to the SNRs and CNRs of the images in the sequence. The consistency of the assumptions and accuracy of the mathematical expressions are investigated using sequences of simulated and real magnetic resonance (MR) images of an agarose phantom and a human brain.
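To illustrate the kind of noise propagation discussed above, the sketch below uses the standard result for a weighted sum of images with uncorrelated noise: the composite noise variance is the sum of the squared weights times the individual variances. This is a generic textbook relation under that assumption, not the paper's derivation; the SNR and CNR definitions (mean signal, or mean signal difference, divided by noise standard deviation) and all function names are also assumptions.

```python
import math


def propagated_noise_sd(weights, noise_sds):
    """Noise SD of a weighted sum of images with uncorrelated noise."""
    return math.sqrt(sum((w * s) ** 2 for w, s in zip(weights, noise_sds)))


def composite_snr(weights, signals, noise_sds):
    """SNR of the composite: |weighted signal| / propagated noise SD."""
    signal = sum(w * s for w, s in zip(weights, signals))
    return abs(signal) / propagated_noise_sd(weights, noise_sds)


def composite_cnr(weights, signals_a, signals_b, noise_sds):
    """CNR between tissues a and b in the composite image."""
    contrast = sum(
        w * (a - b) for w, a, b in zip(weights, signals_a, signals_b)
    )
    return abs(contrast) / propagated_noise_sd(weights, noise_sds)
```

For example, averaging two images that each have signal 10 and noise SD 2 (weights 0.5 and 0.5) gives a composite SNR of about 7.07, compared with 5 for either image alone.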
Minimal doses of a sequence-optimized transgene mediate high-level and long-term EPO expression in vivo: challenging CpG-free gene design.

PubMed

Kosovac, D; Wild, J; Ludwig, C; Meissner, S; Bauer, A P; Wagner, R

2011-02-01

Advanced gene delivery techniques can be combined with rational gene design to further improve the efficiency of plasmid DNA (pDNA)-mediated transgene expression in vivo. Herein, we analyzed the influence of intragenic sequence modifications on transgene expression in vitro and in vivo using murine erythropoietin (mEPO) as a transgene model. A single electro-gene transfer of an RNA- and codon-optimized mEPOopt gene into skeletal muscle resulted in a 3- to 4-fold increase of mEPO production sustained for >1 year and triggered a significant increase in hematocrit and hemoglobin without causing adverse effects. mEPO expression and hematologic levels were significantly lower when using comparable amounts of the wild type (mEPOwt) gene and only marginal effects were induced by mEPOΔCpG lacking intragenic CpG dinucleotides, even at high pDNA amounts. Corresponding with these observations, in vitro analysis of transfected cells revealed a 2- to 3-fold increased (mEPOopt) and 50% decreased (mEPOΔCpG) erythropoietin expression compared with mEPOwt, respectively. RNA analyses demonstrated that the specific design of the transgene sequence influenced expression levels by modulating transcriptional activity and nuclear plus cytoplasmic RNA amounts rather than translation. In sum, whereas CpG depletion negatively interferes with efficient expression in postmitotic tissues, mEPOopt doses <0.5 μg were sufficient to trigger optimal long-term hematologic effects encouraging the use of sequence-optimized transgenes to further reduce effective pDNA amounts.
Identification and Characterization of MicroRNAs in Ovary and Testis of Nile Tilapia (Oreochromis niloticus) by Using Solexa Sequencing Technology

PubMed Central

Zhou, Yi; Yu, Fan; Gao, Yun; Luo, Yongju; Tang, Zhanyang; Guo, Zhongbao; Guo, Enyan; Gan, Xi; Zhang, Ming; Zhang, Yaping

2014-01-01

MicroRNAs (miRNAs) are endogenous non-coding small RNAs which play important roles in the regulation of gene expression by cleaving or inhibiting the translation of target gene transcripts. Among them, some specific miRNAs show regulatory activities in gonad development via translational control. In order to further understand the role of miRNA-mediated posttranscriptional regulation in Nile tilapia (Oreochromis niloticus) ovary and testis, two small RNA libraries of Nile tilapia were sequenced by Solexa small RNA deep sequencing methods. A total of 9,731,431 and 8,880,497 raw reads, representing 5,407,800 and 4,396,281 unique sequences, were obtained from the sexually mature ovaries and testes, respectively. After comparing the small RNA sequences with the Rfam database, 1,432,210 reads in ovaries and 984,146 reads in testes were matched to the genome sequence of Nile tilapia. Bioinformatic analysis identified 764 mature miRNAs, 209 miRNA-5p and 202 miRNA-3p in the two libraries, of which 525 known miRNAs are expressed in both the ovary and testis of Nile tilapia. Compared with the expression profiles of the testis, the miR-727, miR-129 and miR-29 families were highly expressed in tilapia ovary. Additionally, the miR-132, miR-212, miR-33a and miR-135b families showed significantly higher expression in testis compared with ovary. Furthermore, the expression patterns of the miRNAs were analyzed in different developmental stages of the gonad. The results showed that different expression patterns were observed during development of testis and ovary. In addition, the identification and characterization of differentially expressed miRNAs in the ovaries and testes of Nile tilapia provide important information on the role of miRNA in the regulation of ovarian and testicular development and function. These data will help to facilitate studies on the regulation of miRNAs during teleost reproduction. PMID:24466258
Analysis of xylem formation in pine by cDNA sequencing

NASA Technical Reports Server (NTRS)

Allona, I.; Quinn, M.; Shoop, E.; Swope, K.; St Cyr, S.; Carlis, J.; Riedl, J.; Retzel, E.; Campbell, M. M.; Sederoff, R.

1998-01-01

Secondary xylem (wood) formation is likely to involve some genes expressed rarely or not at all in herbaceous plants. Moreover, environmental and developmental stimuli influence secondary xylem differentiation, producing morphological and chemical changes in wood. To increase our understanding of xylem formation, and to provide material for comparative analysis of gymnosperm and angiosperm sequences, ESTs were obtained from immature xylem of loblolly pine (Pinus taeda L.). A total of 1,097 single-pass sequences were obtained from 5' ends of cDNAs made from gravistimulated tissue from bent trees. Cluster analysis detected 107 groups of similar sequences, ranging in size from 2 to 20 sequences. A total of 361 sequences fell into these groups, whereas 736 sequences were unique. About 55% of the pine EST sequences show similarity to previously described sequences in public databases. About 10% of the recognized genes encode factors involved in cell wall formation. Sequences similar to cell wall proteins, most known lignin biosynthetic enzymes, and several enzymes of carbohydrate metabolism were found. A number of putative regulatory proteins also are represented. Expression patterns of several of these genes were studied in various tissues and organs of pine. Sequencing novel genes expressed during xylem formation will provide a powerful means of identifying mechanisms controlling this important differentiation pathway.

Small RNA Analysis in Sindbis Virus Infected Human HEK293 Cells

PubMed Central

Dalmay, Tamas; Powell, Penny P.

2013-01-01

Introduction In contrast to the defence mechanism of RNA interference (RNAi) in plants and invertebrates, its role in the innate response to virus infection of mammals is a matter of debate. Since RNAi has a well-established role in controlling infection of the alphavirus Sindbis virus (SINV) in insects, we have used this virus to investigate the role of RNAi in SINV infection of human cells. Results SINV AR339 and TR339-GFP were adapted to grow in HEK293 cells. Deep sequencing of small RNAs (sRNAs) early in SINV infection (4 and 6 hpi) showed low abundance (0.8%) of viral sRNAs (vsRNAs), with no size-, sequence- or location-specific patterns characteristic of Dicer products, nor did they possess any discernible pattern to ascribe to a specific RNAi biogenesis pathway. This was supported by multiple variants for each sequence, and lack of hot spots along the viral genome sequence. The abundance of the best defined vsRNAs was below the limit of Northern blot detection. The adaptation of the virus to HEK293 cells showed few sequence changes compared to the reference; however, a SNP in the E1 gene with a preference from G to C was found. Deep sequencing results showed little variation in the expression of cellular microRNAs (miRNAs) at 4 and 6 hpi compared to uninfected cells. Twelve miRNAs exhibiting some minor differential expression by sequencing showed no difference in expression by Northern blot analysis. Conclusions We show that, unlike SINV infection of invertebrates, generation of Dicer-dependent vsRNAs and change in expression of cellular miRNAs were not detected as part of the human response to SINV. PMID:24391886
Sequence divergence in the 3'-untranslated region has an effect on the subfunctionalization of duplicate genes.

PubMed

Tong, Ying; Zheng, Kang; Zhao, Shufang; Xiao, Guanxiu; Luo, Chen

2012-11-01

Recent studies demonstrated that sequence divergence in both the transcriptional regulatory region and the coding region contributes to the subfunctionalization of duplicate genes. However, whether sequence divergence in the 3'-untranslated region (3'-UTR) has an impact on the subfunctionalization of duplicate genes remains unclear. Here, we identified two diverging duplicate vsx1 (visual system homeobox-1) loci in goldfish, named vsx1A1 and vsx1A2. Phylogenetic analysis suggests that vsx1A1 and vsx1A2 may have arisen from a duplication of vsx1 after the separation of goldfish and zebrafish. Sequence comparison revealed that divergence in both transcriptional and translational regulatory regions is higher than divergence in the introns. vsx1A2 is expressed during blastula and gastrula stages and in adult retina but is silenced from segmentation stage to hatching stage, whereas vsx1A1 is expressed from segmentation onward. Given that zebrafish vsx1 is expressed in all developmental stages and in the adult retina, it appears that goldfish vsx1A1 and vsx1A2 are in the process of sharing the functions of ancestral vsx1. The different but overlapping temporal expression patterns of vsx1A1 and vsx1A2 suggest that sequence divergence in the promoter region of duplicate vsx1 is not sufficient for partitioning the functions of ancestral vsx1. By comparing vsx1A1 and vsx1A2 3'-UTR-linked green fluorescent protein gene expression patterns, we demonstrated that the 3'-UTR of vsx1A1 retains, but the 3'-UTR of vsx1A2 has lost, the capability of mediating bipolar cell-specific expression during retina development. These results indicate that sequence divergence in the 3'-UTRs has a clear effect on subfunctionalization of the duplicate genes. © 2012 WILEY PERIODICALS, INC.
Positive Selection Underlies Faster-Z Evolution of Gene Expression in Birds.

PubMed

Dean, Rebecca; Harrison, Peter W; Wright, Alison E; Zimmer, Fabian; Mank, Judith E

2015-10-01

The elevated rate of evolution for genes on sex chromosomes compared with autosomes (Fast-X or Fast-Z evolution) can result either from positive selection in the heterogametic sex or from nonadaptive consequences of reduced relative effective population size. Recent work in birds suggests that Fast-Z of coding sequence is primarily due to relaxed purifying selection resulting from reduced relative effective population size. However, gene sequence and gene expression are often subject to distinct evolutionary pressures; therefore, we tested for Fast-Z in gene expression using next-generation RNA-sequencing data from multiple avian species. Similar to studies of Fast-Z in coding sequence, we recover clear signatures of Fast-Z in gene expression; however, in contrast to coding sequence, our data indicate that Fast-Z in expression is due to positive selection acting primarily in females. In the soma, where gene expression is highly correlated between the sexes, we detected Fast-Z in both sexes, although at a higher rate in females, suggesting that many positively selected expression changes in females are also expressed in males. In the gonad, where intersexual correlations in expression are much lower, we detected Fast-Z for female gene expression, but crucially, not males. This suggests that a large amount of expression variation is sex-specific in its effects within the gonad. Taken together, our results indicate that Fast-Z evolution of gene expression is the product of positive selection acting on recessive beneficial alleles in the heterogametic sex. More broadly, our analysis suggests that the adaptive potential of Z chromosome gene expression may be much greater than that of gene sequence, results which have important implications for the role of sex chromosomes in speciation and sexual selection. © The Author 2015. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Impact of sequencing depth and read length on single cell RNA sequencing data of T cells.

PubMed

Rizzetto, Simone; Eltahla, Auda A; Lin, Peijie; Bull, Rowena; Lloyd, Andrew R; Ho, Joshua W K; Venturi, Vanessa; Luciani, Fabio

2017-10-06

Single cell RNA sequencing (scRNA-seq) provides great potential in measuring the gene expression profiles of heterogeneous cell populations. In immunology, scRNA-seq allowed the characterisation of transcript sequence diversity of functionally relevant T cell subsets, and the identification of the full length T cell receptor (TCRαβ), which defines the specificity against cognate antigens. Several factors, e.g. RNA library capture, cell quality, and sequencing output, affect the quality of scRNA-seq data. We studied the effects of read length and sequencing depth on the quality of gene expression profiles, cell type identification, and TCRαβ reconstruction, utilising 1,305 single cells from 8 publicly available scRNA-seq datasets, and simulation-based analyses. Gene expression was characterised by an increased number of unique genes identified with short read lengths (<50 bp), but these featured higher technical variability compared to profiles from longer reads. Successful TCRαβ reconstruction was achieved for 6 datasets (81% - 100%) with at least 0.25 million paired-end (PE) reads of length >50 bp, while it failed for datasets with <30 bp reads. Sufficient read length and sequencing depth can control technical noise to enable accurate identification of TCRαβ and gene expression profiles from scRNA-seq data of T cells.
The Medicago sativa gene index 1.2: a web-accessible gene expression atlas for investigating expression differences between Medicago sativa subspecies.

PubMed

O'Rourke, Jamie A; Fu, Fengli; Bucciarelli, Bruna; Yang, S Sam; Samac, Deborah A; Lamb, JoAnn F S; Monteros, Maria J; Graham, Michelle A; Gronwald, John W; Krom, Nick; Li, Jun; Dai, Xinbin; Zhao, Patrick X; Vance, Carroll P

2015-07-07

Alfalfa (Medicago sativa L.) is the primary forage legume crop species in the United States and plays essential economic and ecological roles in agricultural systems across the country. Modern alfalfa is the result of hybridization between tetraploid M. sativa ssp. sativa and M. sativa ssp. falcata. Due to its large and complex genome, there are few genomic resources available for alfalfa improvement. A de novo transcriptome assembly from two alfalfa subspecies, M. sativa ssp. sativa (B47) and M. sativa ssp. falcata (F56), was developed using Illumina RNA-seq technology. Transcripts from roots, nitrogen-fixing root nodules, leaves, flowers, elongating stem internodes, and post-elongation stem internodes were assembled into the Medicago sativa Gene Index 1.2 (MSGI 1.2) representing 112,626 unique transcript sequences. Nodule-specific transcripts and transcripts involved in cell wall biosynthesis were identified. Statistical analyses identified 20,447 transcripts differentially expressed between the two subspecies. Pair-wise comparisons of each tissue combination identified 58,932 sequences differentially expressed in B47 and 69,143 sequences differentially expressed in F56. Comparing transcript abundance in floral tissues of B47 and F56 identified expression differences in sequences involved in anthocyanin and carotenoid synthesis, which determine flower pigmentation. Single nucleotide polymorphisms (SNPs) unique to each M. sativa subspecies (110,241) were identified. The Medicago sativa Gene Index 1.2 increases the expressed sequence data available for alfalfa by ninefold and can be expanded as additional experiments are performed. The MSGI 1.2 transcriptome sequences, annotations, expression profiles, and SNPs were assembled into the Alfalfa Gene Index and Expression Database (AGED) at http://plantgrn.noble.org/AGED/ , a publicly available genomic resource for alfalfa improvement and legume research.
Molecular phenotype of zebrafish ovarian follicle by serial analysis of gene expression and proteomic profiling, and comparison with the transcriptomes of other animals

PubMed Central

Knoll-Gellida, Anja; André, Michèle; Gattegno, Tamar; Forgue, Jean; Admon, Arie; Babin, Patrick J

2006-01-01

Background The ability of an oocyte to develop into a viable embryo depends on the accumulation of specific maternal information and molecules, such as RNAs and proteins. A serial analysis of gene expression (SAGE) was carried out in parallel with proteomic analysis on fully-grown ovarian follicles from zebrafish (Danio rerio). The data obtained were compared with ovary/follicle/egg molecular phenotypes of other animals, published or available in public sequence databases. Results Sequencing of 27,486 SAGE tags identified 11,399 different ones, including 3,329 tags with an occurrence greater than one. Fifty-eight genes were expressed at over 0.15% of the total population and represented 17.34% of the mRNA population identified. The three most expressed transcripts were a rhamnose-binding lectin, beta-actin 2, and a transcribed locus similar to the H2B histone family. Comparison with the large-scale expressed sequence tags sequencing approach revealed highly expressed transcripts that were not previously known to be expressed at high levels in fish ovaries, like the short-sized polarized metallothionein 2 transcript. A higher sensitivity for the detection of transcripts with a characterized maternal genetic contribution was also demonstrated compared to large-scale sequencing of cDNA libraries. Ferritin heavy polypeptide 1, heat shock protein 90-beta, lactate dehydrogenase B4, beta-actin isoforms, tubulin beta 2, ATP synthase subunit 9, together with 40S ribosomal protein S27a, were common highly-expressed transcripts of vertebrate ovary/unfertilized egg. Comparison of transcriptome and proteome data revealed that transcript levels provide little predictive value with respect to the extent of protein abundance. All the proteins identified by proteomic analysis of fully-grown zebrafish follicles had at least one transcript counterpart, with two exceptions: eosinophil chemotactic cytokine and nothepsin.
Conclusion This study provides a complete sequence data set of maternal mRNA stored in zebrafish germ cells at the end of oogenesis. This catalogue contains highly-expressed transcripts that are part of a vertebrate ovarian expressed gene signature. Comparison of transcriptome and proteome data identified downregulated transcripts or proteins potentially incorporated in the oocyte by endocytosis. The molecular phenotype described provides groundwork for future experimental approaches aimed at identifying functionally important stored maternal transcripts and proteins involved in oogenesis and early stages of embryo development. PMID:16526958
Generation and Analysis of a Large-Scale Expressed Sequence Tag Database from a Full-Length Enriched cDNA Library of Developing Leaves of Gossypium hirsutum L

PubMed Central

Pang, Chaoyou; Fan, Shuli; Song, Meizhen; Yu, Shuxun

2013-01-01

Background Cotton (Gossypium hirsutum L.) is one of the world’s most economically-important crops. However, its entire genome has not been sequenced, and limited resources are available in GenBank for understanding the molecular mechanisms underlying leaf development and senescence. Methodology/Principal Findings In this study, 9,874 high-quality ESTs were generated from a normalized, full-length cDNA library derived from pooled RNA isolated from throughout leaf development during the plant blooming stage. After clustering and assembly of these ESTs, 5,191 unique sequences, representing 1,652 contigs and 3,539 singletons, were obtained. The average unique sequence length was 682 bp. Annotation of these unique sequences revealed that 84.4% showed significant homology to sequences in the NCBI non-redundant protein database, and 57.3% had significant hits to known proteins in the Swiss-Prot database. Comparative analysis indicated that our library added 2,400 ESTs and 991 unique sequences to those known for cotton. The unigenes were functionally characterized by gene ontology annotation. We identified 1,339 and 200 unigenes as potential leaf senescence-related genes and transcription factors, respectively. Moreover, nine genes related to leaf senescence and eleven MYB transcription factors were randomly selected for quantitative real-time PCR (qRT-PCR), which revealed that these genes were regulated differentially during senescence. The qRT-PCR for three GhYLSs revealed that these genes are expressed preferentially in senescent leaves. Conclusions/Significance These EST resources will provide valuable sequence information for gene expression profiling analyses and functional genomics studies to elucidate their roles, as well as for studying the mechanisms of leaf development and senescence in cotton and discovering candidate genes related to important agronomic traits of cotton.
These data will also facilitate future whole-genome sequence assembly and annotation in G. hirsutum and comparative genomics among Gossypium species. PMID:24146870

De novo Transcriptome Assembly of Common Wild Rice (Oryza rufipogon Griff.) and Discovery of Drought-Response Genes in Root Tissue Based on Transcriptomic Data.

PubMed

Tian, Xin-Jie; Long, Yan; Wang, Jiao; Zhang, Jing-Wen; Wang, Yan-Yan; Li, Wei-Min; Peng, Yu-Fa; Yuan, Qian-Hua; Pei, Xin-Wu

2015-01-01

The perennial O. rufipogon (common wild rice), which is considered to be the ancestor of Asian cultivated rice species, contains many useful genetic resources, including drought resistance genes. However, few studies have identified the drought resistance and tissue-specific genes in common wild rice. In this study, transcriptome sequencing libraries were constructed, including drought-treated roots (DR) and control leaves (CL) and roots (CR). Using Illumina sequencing technology, we generated 16.75 million bases of high-quality sequence data for common wild rice and conducted de novo assembly and annotation of genes without prior genome information. These reads were assembled into 119,332 unigenes with an average length of 715 bp. A total of 88,813 distinct sequences (74.42% of unigenes) significantly matched known genes in the NCBI NT database. Differentially expressed gene (DEG) analysis showed that 3617 genes were up-regulated and 4171 genes were down-regulated in the CR library compared with the CL library. Among the DEGs, 535 genes were expressed in roots but not in shoots. A similar comparison between the DR and CR libraries showed that 1393 genes were up-regulated and 315 genes were down-regulated in the DR library compared with the CR library. Finally, 37 genes that were specifically expressed in roots were screened after comparing the DEGs identified in the above-described analyses. This study provides a transcriptome sequence resource for common wild rice plants and establishes a digital gene expression profile of wild rice plants under drought conditions using the assembled transcriptome data as a reference. Several tissue-specific and drought-stress-related candidate genes were identified, representing a fully characterized transcriptome and providing a valuable resource for genetic and genomic studies in plants.
Cloning and baculovirus expression of a desiccation stress gene from the beetle, Tenebrio molitor.

PubMed

Graham, L A; Bendena, W G; Walker, V K

1996-02-01

The cDNA sequence encoding a novel desiccation stress protein (dsp28) found in the hemolymph of the common yellow mealworm beetle, Tenebrio molitor, has been determined. The sequence encodes a 225 amino acid protein containing a 20 amino acid signal peptide. Dsp28 shows no significant similarity to any known nucleic acid or protein sequence. Levels of dsp28 mRNA were found to increase approx 5-fold following desiccation. Dsp28 cDNA has been cloned into a baculovirus expression vector and the expressed protein was compared to native dsp28. Both dsp28 expressed by recombinant baculovirus and native dsp28 are glycosylated and N-terminally processed. Although dsp28 is induced by cold in addition to desiccation stress, it does not contribute to the freezing point depression (thermal hysteresis) observed in Tenebrio hemolymph.
High-throughput sequencing of small RNAs and analysis of differentially expressed microRNAs associated with pistil development in Japanese apricot

PubMed Central

2012-01-01

Background MicroRNAs (miRNAs) are a class of endogenous, small, non-coding RNAs that regulate gene expression by mediating gene silencing at transcriptional and post-transcriptional levels in higher plants. However, the diversity of miRNAs and their roles in floral development in Japanese apricot (Prunus mume Sieb. et Zucc) remain largely unexplored. Imperfect flowers with pistil abortion seriously decrease production yields. To understand the role of miRNAs in pistil development, pistil development-related miRNAs were identified by Solexa sequencing in Japanese apricot. Results Solexa sequencing was used to identify and quantitatively profile small RNAs from perfect and imperfect flower buds of Japanese apricot. A total of 22,561,972 and 24,952,690 reads were sequenced from two small RNA libraries constructed from perfect and imperfect flower buds, respectively. Sixty-one known miRNAs, belonging to 24 families, were identified. Comparative profiling revealed that seven known miRNAs exhibited significant differential expression between perfect and imperfect flower buds. A total of 61 potentially novel miRNAs/new members of known miRNA families were also identified by the presence of mature miRNAs and corresponding miRNA*s in the sRNA libraries. Comparative analysis showed that six potentially novel miRNAs were differentially expressed between perfect and imperfect flower buds. Target predictions of the 13 differentially expressed miRNAs resulted in 212 target genes. Gene ontology (GO) annotation revealed that high-ranking miRNA target genes are those implicated in the developmental process, the regulation of transcription and response to stress. Conclusions This study represents the first comparative identification of miRNAomes between perfect and imperfect Japanese apricot flowers. Seven known miRNAs and six potentially novel miRNAs associated with pistil development were identified, using high-throughput sequencing of small RNAs.
The findings, both computational and experimental, provide valuable information for further functional characterisation of miRNAs associated with pistil development in plants. PMID:22863067
The use of a viral 2A sequence for the simultaneous over-expression of both the vgf gene and enhanced green fluorescent protein (eGFP) in vitro and in vivo.

PubMed

Lewis, Jo E; Brameld, John M; Hill, Phil; Barrett, Perry; Ebling, Francis J P; Jethwa, Preeti H

2015-12-30

The viral 2A sequence has become an attractive alternative to the traditional internal ribosomal entry site (IRES) for simultaneous over-expression of two genes and in combination with recombinant adeno-associated viruses (rAAV) has been used to manipulate gene expression in vitro. The aim was to develop a rAAV construct in combination with the viral 2A sequence to allow long-term over-expression of the vgf gene and a fluorescent marker gene for tracking of the transfected neurones in vivo. Transient transfection of the AAV plasmid containing the vgf gene, viral 2A sequence and eGFP into SH-SY5Y cells resulted in eGFP fluorescence comparable to a commercially available reporter construct. This increase in fluorescent cells was accompanied by an increase in VGF mRNA expression. Infusion of the rAAV vector containing the vgf gene, viral 2A sequence and eGFP resulted in eGFP fluorescence in the hypothalamus of both mice and Siberian hamsters, 32 weeks post infusion. In situ hybridisation confirmed that the location of VGF mRNA expression in the hypothalamus corresponded to the eGFP pattern of fluorescence. The viral 2A sequence is much smaller than the traditional IRES and therefore allowed over-expression of the vgf gene with fluorescent tracking without compromising viral capacity. The use of the viral 2A sequence in the AAV plasmid allowed the simultaneous expression of both genes in vitro. When used in combination with rAAV it resulted in long-term over-expression of both genes at equivalent locations in the hypothalamus of both Siberian hamsters and mice, without any adverse effects. Copyright © 2015 The Authors. Published by Elsevier B.V. All rights reserved.
Identification and expression of the tig gene coding for trigger factor from psychrophilic bacteria with no information of genome sequence available.

PubMed

Lee, Kyunghee; Choi, Hyojung; Im, Hana

2009-08-01

Trigger factor (TF) plays a key role as a molecular chaperone with a peptidyl-prolyl cis-trans isomerase (PPIase) activity by which cells promote folding of newly synthesized proteins coming out of ribosomes. Since psychrophilic bacteria grow at quite low temperatures, between 4 and 15 degrees C, TF from such bacteria was investigated and compared with that of the mesophilic bacterium E. coli in order to offer an explanation of cold-adaptation at a molecular level. Using a combination of gradient PCRs with homologous primers and LA PCR in vitro cloning technology, the tig gene was fully identified from Psychromonas arctica, whose genome sequence is not yet available. The resulting amino acid sequence of the TF was compared with other homologous TFs using sequence alignments to search for common domains. In addition, we have developed a protein expression system, by which TF proteins from P. arctica (PaTF) were produced by IPTG induction upon cloning the tig gene on expression vectors, such as pAED4. We have further examined the role of expressed psychrophilic PaTF on survival against cold treatment at 4 degrees C. Finally, we have attempted the in vitro biochemical characterization of TF proteins with His-tags expressed in a pET system, such as the PPIase activity of PaTF protein. Our results demonstrate that the expressed PaTF proteins helped cells survive cold environments in vivo and that the purified PaTF displays functional PPIase activity in vitro in a concentration-dependent manner.
Comparative Analysis of Gene Expression for Convergent Evolution of Camera Eye Between Octopus and Human

PubMed Central

Ogura, Atsushi; Ikeo, Kazuho; Gojobori, Takashi

2004-01-01

Although the camera eye of the octopus is very similar to that of humans, phylogenetic and embryological analyses have suggested that their camera eyes have been acquired independently. It has been known as a typical example of convergent evolution. To study the molecular basis of convergent evolution of camera eyes, we conducted a comparative analysis of gene expression in octopus and human camera eyes. We sequenced 16,432 ESTs of the octopus eye, leading to 1052 nonredundant genes that have matches in the protein database. Comparing these 1052 genes with 13,303 already-known ESTs of the human eye, 729 (69.3%) genes were commonly expressed between the human and octopus eyes. On the contrary, when we compared octopus eye ESTs with human connective tissue ESTs, the expression similarity was quite low. To trace the evolutionary changes that are potentially responsible for camera eye formation, we also compared octopus-eye ESTs with the completed genome sequences of other organisms. We found that 1019 out of the 1052 genes had already existed at the common ancestor of bilateria, and 875 genes were conserved between humans and octopuses. It suggests that a larger number of conserved genes and their similar gene expression may be responsible for the convergent evolution of the camera eye. PMID:15289475
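The shared-expression figure quoted above is a simple proportion. A minimal sketch of the arithmetic, using only the counts given in the abstract (the function name is illustrative and not taken from the paper):

```python
def percent_shared(shared: int, total: int) -> float:
    """Return the percentage of `total` genes that fall in the shared set."""
    if total <= 0:
        raise ValueError("total must be positive")
    return 100.0 * shared / total


# 729 of the 1052 nonredundant octopus-eye genes were also found among human-eye ESTs.
print(round(percent_shared(729, 1052), 1))  # 69.3
```

The same function can be applied to the other counts in the abstract, for example the 875 genes conserved between humans and octopuses out of the same 1052.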
Characterization of the glutathione S-transferase gene family through ESTs and expression analyses within common and pigmented cultivars of Citrus sinensis (L.) Osbeck

PubMed Central

2014-01-01

Background Glutathione S-transferases (GSTs) represent a ubiquitous gene family encoding detoxification enzymes able to recognize reactive electrophilic xenobiotic molecules as well as compounds of endogenous origin. Anthocyanin pigments require GSTs for their transport into the vacuole since their cytoplasmic retention is toxic to the cell. Anthocyanin accumulation in Citrus sinensis (L.) Osbeck fruit flesh determines different phenotypes affecting the typical pigmentation of Sicilian blood oranges. In this paper we describe: i) the characterization of the GST gene family in C. sinensis through a systematic EST analysis; ii) the validation of the EST assembly by exploiting the genome sequences of C. sinensis and C. clementina and their genome annotations; iii) GST gene expression profiling in six tissues/organs and in two different sweet orange cultivars, Cadenera (common) and Moro (pigmented). Results We identified 61 GST transcripts, described the full- or partial-length nature of the sequences and assigned to each sequence the GST class membership exploiting a comparative approach and the classification scheme proposed for plant species. A total of 23 full-length sequences were defined. Fifty-four of the 61 transcripts were successfully aligned to the C. sinensis and C. clementina genomes. Tissue specific expression profiling demonstrated that the expression of some GST transcripts was 'tissue-affected' and cultivar specific. A comparative analysis of C. sinensis GSTs with those from other plant species was also considered. Data from the current analysis are accessible at http://biosrv.cab.unina.it/citrusGST/, with the aim to provide a reference resource for C. sinensis GSTs. Conclusions This study aimed at the characterization of the GST gene family in C. sinensis. 
Based on expression patterns from two different cultivars and on sequence-comparative analyses, we also highlighted that two sequences, a Phi class GST and a Mapeg class GST, could be involved in the conjugation of anthocyanin pigments and in their transport into the vacuole, specifically in fruit flesh of the pigmented cultivar. PMID:24490620
A Systematic Analysis of the Structures of Heterologously Expressed Proteins and Those from Their Native Hosts in the RCSB PDB Archive.

PubMed

Zhou, Ren-Bin; Lu, Hui-Meng; Liu, Jie; Shi, Jian-Yu; Zhu, Jing; Lu, Qin-Qin; Yin, Da-Chuan

2016-01-01

Recombinant expression of proteins has become an indispensable tool in modern day research. The large yields of recombinantly expressed proteins accelerate the structural and functional characterization of proteins. Nevertheless, some reports in the literature indicate that recombinant proteins show some differences in structure and function compared with the native ones. More than 100,000 structures (from both recombinant and native sources) are now publicly available in the Protein Data Bank (PDB) archive, which makes it possible to investigate whether any proteins in the RCSB PDB archive have identical sequences but differ in structure. In this paper, we present the results of a systematic comparative study of the 3D structures of identical naturally purified versus recombinantly expressed proteins. The structural data and sequence information of the proteins were mined from the RCSB PDB archive. The combinatorial extension (CE), FATCAT-flexible and TM-Align methods were employed to align the protein structures. The root-mean-square distance (RMSD), TM-score, P-value, Z-score, secondary structural elements and hydrogen bonds were used to assess the structure similarity. A thorough analysis of the PDB archive generated 517 pairs of native and recombinant proteins that have identical sequences. There were no pairs of proteins that had the same sequence and a significantly different structural fold, which supports the hypothesis that proteins expressed in a heterologous host usually fold correctly into their native forms.
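Of the similarity measures listed above, the root-mean-square distance (RMSD) has the most compact definition: the square root of the mean squared distance between matched atoms. A minimal sketch follows, assuming the two structures are already superimposed and their atoms are paired one to one (the alignment methods named in the abstract would handle that step). The function and the sample coordinates are hypothetical and are not taken from the study.

```python
import math


def rmsd(coords_a, coords_b):
    """RMSD between two equal-length lists of (x, y, z) coordinates.

    Assumes the structures are already superimposed and atoms are paired by index.
    """
    if len(coords_a) != len(coords_b) or not coords_a:
        raise ValueError("coordinate lists must be non-empty and of equal length")
    squared_sum = 0.0
    for (xa, ya, za), (xb, yb, zb) in zip(coords_a, coords_b):
        squared_sum += (xa - xb) ** 2 + (ya - yb) ** 2 + (za - zb) ** 2
    return math.sqrt(squared_sum / len(coords_a))


# Hypothetical coordinates (in angstroms) for three matched atoms.
native = [(0.0, 0.0, 0.0), (1.5, 0.0, 0.0), (3.0, 0.0, 0.0)]
recombinant = [(1.0, 0.0, 0.0), (2.5, 0.0, 0.0), (4.0, 0.0, 0.0)]

print(rmsd(native, native))       # identical structures give 0.0
print(rmsd(native, recombinant))  # every atom shifted by 1 angstrom gives 1.0
```

A low RMSD on its own does not establish the same fold for large or flexible proteins, which is presumably why the study combines it with TM-score and other measures.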
Sugarcane transgenics expressing MYB transcription factors show improved glucose release

DOE PAGES

Poovaiah, Charleson R.; Bewg, William P.; Lan, Wu; ...

2016-07-15

Sugarcane, a tropical C4 perennial crop, is capable of producing 30-100 tons or more of biomass per hectare annually. The lignocellulosic residue remaining after sugar extraction is currently underutilized and can provide a significant source of biomass for the production of second-generation bioethanol. In this study, MYB31 and MYB42 were cloned from maize and expressed in sugarcane with and without the UTR sequences. The cloned sequences were 98 and 99 % identical to the published nucleotide sequences. The inclusion of the UTR sequences did not affect any of the parameters tested. There was little difference in plant height and the number of internodes of the MYB-overexpressing sugarcane plants when compared with controls. MYB transgene expression determined by qPCR exhibited continued expression in young and maturing internodes. MYB31 downregulated more genes within the lignin biosynthetic pathway than MYB42. MYB31 and MYB42 expression resulted in decreased lignin content in some lines. All MYB42 plants further analyzed showed significant increases in glucose release by enzymatic hydrolysis in 72 h, whereas only two MYB31 plants released more glucose than control plants. This correlated directly with a significant decrease in acid-insoluble lignin. Soluble sucrose content of the MYB42 transgenic plants did not vary compared to control plants. In conclusion, this study demonstrates the use of MYB transcription factors to improve the production of bioethanol from sugarcane bagasse remaining after sugar extraction.
A Systematic Analysis of the Structures of Heterologously Expressed Proteins and Those from Their Native Hosts in the RCSB PDB Archive

PubMed Central

Zhou, Ren-Bin; Lu, Hui-Meng; Liu, Jie; Shi, Jian-Yu; Zhu, Jing; Lu, Qin-Qin; Yin, Da-Chuan

2016-01-01

Recombinant expression of proteins has become an indispensable tool in modern day research. The large yields of recombinantly expressed proteins accelerate the structural and functional characterization of proteins. Nevertheless, some reports in the literature indicate that recombinant proteins show some differences in structure and function compared with the native ones. More than 100,000 structures (from both recombinant and native sources) are now publicly available in the Protein Data Bank (PDB) archive, which makes it possible to investigate whether any proteins in the RCSB PDB archive have identical sequences but differ in structure. In this paper, we present the results of a systematic comparative study of the 3D structures of identical naturally purified versus recombinantly expressed proteins. The structural data and sequence information of the proteins were mined from the RCSB PDB archive. The combinatorial extension (CE), FATCAT-flexible and TM-Align methods were employed to align the protein structures. The root-mean-square distance (RMSD), TM-score, P-value, Z-score, secondary structural elements and hydrogen bonds were used to assess the structure similarity. A thorough analysis of the PDB archive generated 517 pairs of native and recombinant proteins that have identical sequences. There were no pairs of proteins that had the same sequence and a significantly different structural fold, which supports the hypothesis that proteins expressed in a heterologous host usually fold correctly into their native forms. PMID:27517583
Sugarcane transgenics expressing MYB transcription factors show improved glucose release

DOE Office of Scientific and Technical Information (OSTI.GOV)

Poovaiah, Charleson R.; Bewg, William P.; Lan, Wu

Sugarcane, a tropical C4 perennial crop, is capable of producing 30-100 tons or more of biomass per hectare annually. The lignocellulosic residue remaining after sugar extraction is currently underutilized and can provide a significant source of biomass for the production of second-generation bioethanol. In this study, MYB31 and MYB42 were cloned from maize and expressed in sugarcane with and without the UTR sequences. The cloned sequences were 98 and 99 % identical to the published nucleotide sequences. The inclusion of the UTR sequences did not affect any of the parameters tested. There was little difference in plant height and the number of internodes of the MYB-overexpressing sugarcane plants when compared with controls. MYB transgene expression determined by qPCR exhibited continued expression in young and maturing internodes. MYB31 downregulated more genes within the lignin biosynthetic pathway than MYB42. MYB31 and MYB42 expression resulted in decreased lignin content in some lines. All MYB42 plants further analyzed showed significant increases in glucose release by enzymatic hydrolysis in 72 h, whereas only two MYB31 plants released more glucose than control plants. This correlated directly with a significant decrease in acid-insoluble lignin. Soluble sucrose content of the MYB42 transgenic plants did not vary compared to control plants. In conclusion, this study demonstrates the use of MYB transcription factors to improve the production of bioethanol from sugarcane bagasse remaining after sugar extraction.
RNA sequencing: current and prospective uses in metabolic research.

PubMed

Vikman, Petter; Fadista, Joao; Oskolkov, Nikolay

2014-10-01

Previous global RNA analysis was restricted to known transcripts in species with a defined transcriptome. Next generation sequencing has transformed transcriptomics by making it possible to analyse expressed genes with an exon level resolution from any tissue in any species without any a priori knowledge of which genes are being expressed, their splice patterns or their nucleotide sequences. In addition, RNA sequencing is a more sensitive technique compared with microarrays with a larger dynamic range, and it also allows for investigation of imprinting and allele-specific expression. This can be done for a cost that is able to compete with that of a microarray, making RNA sequencing a technique available to most researchers. Therefore RNA sequencing has recently become the state of the art with regard to large-scale RNA investigations and has to a large extent replaced microarrays. The only drawback is the large amount of data produced, which together with the complexity of the data can make a researcher spend far more time on analysis than on performing the actual experiment. © 2014 Society for Endocrinology.
Comparative Analysis of Expressed Genes from Cacao Meristems Infected by Moniliophthora perniciosa

PubMed Central

Gesteira, Abelmon S.; Micheli, Fabienne; Carels, Nicolas; Da Silva, Aline C.; Gramacho, Karina P.; Schuster, Ivan; Macêdo, Joci N.; Pereira, Gonçalo A. G.; Cascardo, Júlio C. M.

2007-01-01

Background and Aims Witches' broom disease is caused by the hemibiotrophic basidiomycete Moniliophthora perniciosa, and is one of the most important diseases of cacao in the western hemisphere. Because very little is known about the global process of such disease development, expressed sequence tags (ESTs) were used to identify genes expressed during the Theobroma cacao–Moniliophthora perniciosa interaction. Methods Two cDNA libraries corresponding to the resistant (RT) and susceptible (SP) cacao–M. perniciosa interactions were constructed from total RNA, using the DB SMART Creator cDNA library kit (Clontech). Clones were randomly selected, sequenced from the 5′ end and analysed using bioinformatics tools including in silico analysis of the differential gene expression. Key Results A total of 6884 ESTs were generated from the RT and SP cDNA libraries. These ESTs were composed of 2585 singlets and 341 contigs for a total of 2926 non-redundant sequences. The redundancy of the libraries was low and their specificity high when compared with the few other cacao libraries already published. Sequence analysis allowed the assignment of a putative functional category for 54 % of sequences, whereas approx. 22 % of sequences corresponded to unknown function and approx. 24 % of sequences did not show any significant similarity with other proteins present in the database. Despite the similar overall distribution of the sequences in functional categories between the two libraries, qualitative differences were observed. Genes involved during the defence response to pathogen infection or in programmed cell death were identified, such as pathogenesis-related proteins, trypsin inhibitor or oxalate oxidase, and some of them showed an in silico differential expression between the resistant and the susceptible interactions. Conclusions As far as is known this is the first EST resource from the cacao–M. perniciosa interaction and it is believed that it will provide a significant contribution to the understanding of the molecular mechanisms of the resistance and susceptibility of cacao to M. perniciosa, to develop strategies to control witches' broom, and as a source of polymorphism for molecular marker development and marker-assisted selection. PMID:17557832
Inhibition of hepatitis B virus replication with linear DNA sequences expressing antiviral micro-RNA shuttles

DOE Office of Scientific and Technical Information (OSTI.GOV)

Chattopadhyay, Saket; Ely, Abdullah; Bloom, Kristie

2009-11-20

RNA interference (RNAi) may be harnessed to inhibit viral gene expression and this approach is being developed to counter chronic infection with hepatitis B virus (HBV). Compared to synthetic RNAi activators, DNA expression cassettes that generate silencing sequences have advantages of sustained efficacy and ease of propagation in plasmid DNA (pDNA). However, the large size of pDNAs and inclusion of sequences conferring antibiotic resistance and immunostimulation limit delivery efficiency and safety. To develop use of alternative DNA templates that may be applied for therapeutic gene silencing, we assessed the usefulness of PCR-generated linear expression cassettes that produce anti-HBV micro-RNA (miR) shuttles. We found that silencing of HBV markers of replication was efficient (>75%) in cell culture and in vivo. miR shuttles were processed to form anti-HBV guide strands and there was no evidence of induction of the interferon response. Modification of terminal sequences to include flanking human adenoviral type-5 inverted terminal repeats was easily achieved and did not compromise silencing efficacy. These linear DNA sequences should have utility in the development of gene silencing applications where modifications of terminal elements with elimination of potentially harmful and non-essential sequences are required.
Comparative venom gland transcriptomics of Naja kaouthia (monocled cobra) from Malaysia and Thailand: elucidating geographical venom variation and insights into sequence novelty

PubMed Central

Chanhome, Lawan; Tan, Nget Hong

2017-01-01

Background The monocled cobra (Naja kaouthia) is a medically important venomous snake in Southeast Asia. Its venom has been shown to vary geographically in relation to venom composition and neurotoxic activity, indicating vast diversity of the toxin genes within the species. To investigate the polygenic trait of the venom and its locale-specific variation, we profiled and compared the venom gland transcriptomes of N. kaouthia from Malaysia (NK-M) and Thailand (NK-T) applying next-generation sequencing (NGS) technology. Methods The transcriptomes were sequenced on the Illumina HiSeq platform, assembled and followed by transcript clustering and annotations for gene expression and function. Pairwise or multiple sequence alignments were conducted on the toxin genes expressed. Substitution rates were studied for the major toxins co-expressed in NK-M and NK-T. Results and discussion The toxin transcripts showed high redundancy (41–82% of the total mRNA expression) and comprised 23 gene families expressed in NK-M and NK-T, respectively (22 gene families were co-expressed). Among the venom genes, three-finger toxins (3FTxs) predominated in the expression, with multiple sequences noted. Comparative analysis and selection study revealed that 3FTxs are genetically conserved between the geographical specimens whilst demonstrating distinct differential expression patterns, implying gene up-regulation for selected principal toxins, or alternatively, enhanced transcript degradation or lack of transcription of certain traits. One of the striking features that elucidates the inter-geographical venom variation is the up-regulation of α-neurotoxins (constitutes ∼80.0% of toxin’s fragments per kilobase of exon model per million mapped reads (FPKM)), particularly the long-chain α-elapitoxin-Nk2a (48.3%) in NK-T but only 1.7% was noted in NK-M. Instead, short neurotoxin isoforms were up-regulated in NK-M (46.4%). 
Another distinct transcriptional pattern observed is the exclusively and abundantly expressed cytotoxin CTX-3 in NK-T. The findings suggested correlation with the geographical variation in proteome and toxicity of the venom, and support the call for optimising antivenom production and use in the region. Besides, the current study uncovered full and partial sequences of numerous toxin genes from N. kaouthia which have not been reported hitherto; these include N. kaouthia-specific l-amino acid oxidase (LAAO), snake venom serine protease (SVSP), cystatin, acetylcholinesterase (AChE), hyaluronidase (HYA), waprin, phospholipase B (PLB), aminopeptidase (AP), neprilysin, etc. Taken together, the findings further enrich the snake toxin database and provide deeper insights into the genetic diversity of cobra venom toxins. PMID:28392982
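The expression shares quoted in the abstract above are given in FPKM (fragments per kilobase of exon model per million mapped reads). As a minimal sketch of how such values are conventionally derived, the Python below computes FPKM from a fragment count, a transcript length and a library size, then converts a set of FPKM values into percentages of the total. The function names and all numbers are hypothetical illustrations and are not taken from the study.

```python
def fpkm(fragments: int, length_bp: int, total_mapped_fragments: int) -> float:
    """Fragments per kilobase of exon model per million mapped fragments."""
    if length_bp <= 0 or total_mapped_fragments <= 0:
        raise ValueError("length and library size must be positive")
    # 1e9 = 1e3 (bases per kilobase) * 1e6 (fragments per million)
    return fragments * 1e9 / (length_bp * total_mapped_fragments)


def percent_of_total(fpkm_by_transcript: dict[str, float]) -> dict[str, float]:
    """Express each transcript's FPKM as a percentage of the summed FPKM."""
    total = sum(fpkm_by_transcript.values())
    if total <= 0:
        raise ValueError("total FPKM must be positive")
    return {name: 100.0 * value / total
            for name, value in fpkm_by_transcript.items()}


if __name__ == "__main__":
    # Hypothetical transcript: 500 fragments, 1 kb long, 10 million mapped fragments.
    print(fpkm(500, 1000, 10_000_000))
    # Hypothetical toxin panel, expressed as shares of total toxin FPKM.
    print(percent_of_total({"toxin_a": 30.0, "toxin_b": 10.0}))
```

Normalising by both transcript length and library size is what makes values comparable across transcripts and across the two specimens; the percentage step only rescales within one library.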
Study of cnidarian-algal symbiosis in the "omics" age.

PubMed

Meyer, Eli; Weis, Virginia M

2012-08-01

The symbiotic associations between cnidarians and dinoflagellate algae (Symbiodinium) support productive and diverse ecosystems in coral reefs. Many aspects of this association, including the mechanistic basis of host-symbiont recognition and metabolic interaction, remain poorly understood. The first completed genome sequence for a symbiotic anthozoan is now available (the coral Acropora digitifera), and extensive expressed sequence tag resources are available for a variety of other symbiotic corals and anemones. These resources make it possible to profile gene expression, protein abundance, and protein localization associated with the symbiotic state. Here we review the history of "omics" studies of cnidarian-algal symbiosis and the current availability of sequence resources for corals and anemones, identifying genes putatively involved in symbiosis across 10 anthozoan species. The public availability of candidate symbiosis-associated genes leaves the field of cnidarian-algal symbiosis poised for in-depth comparative studies of sequence diversity and gene expression and for targeted functional studies of genes associated with symbiosis. Reviewing the progress to date suggests directions for future investigations of cnidarian-algal symbiosis that include (i) sequencing of Symbiodinium, (ii) proteomic analysis of the symbiosome membrane complex, (iii) glycomic analysis of Symbiodinium cell surfaces, and (iv) expression profiling of the gastrodermal cells hosting Symbiodinium.
CORNAS: coverage-dependent RNA-Seq analysis of gene expression data without biological replicates.

PubMed

Low, Joel Z B; Khang, Tsung Fei; Tammi, Martti T

2017-12-28

In current statistical methods for calling differentially expressed genes in RNA-Seq experiments, the assumption is that an adjusted observed gene count represents an unknown true gene count. This adjustment usually consists of a normalization step to account for heterogeneous sample library sizes, and then the resulting normalized gene counts are used as input for parametric or non-parametric differential gene expression tests. A distribution of true gene counts, each with a different probability, can result in the same observed gene count. Importantly, sequencing coverage information is currently not explicitly incorporated into any of the statistical models used for RNA-Seq analysis. We developed a fast Bayesian method which uses the sequencing coverage information determined from the concentration of an RNA sample to estimate the posterior distribution of a true gene count. Our method has better or comparable performance compared to NOISeq and GFOLD, according to the results from simulations and experiments with real unreplicated data. We incorporated a previously unused sequencing coverage parameter into a procedure for differential gene expression analysis with RNA-Seq data. Our results suggest that our method can be used to overcome analytical bottlenecks in experiments with limited number of replicates and low sequencing coverage. The method is implemented in CORNAS (Coverage-dependent RNA-Seq), and is available at https://github.com/joel-lzb/CORNAS .
Sexual selection drives evolution and rapid turnover of male gene expression.

PubMed

Harrison, Peter W; Wright, Alison E; Zimmer, Fabian; Dean, Rebecca; Montgomery, Stephen H; Pointer, Marie A; Mank, Judith E

2015-04-07

The profound and pervasive differences in gene expression observed between males and females, and the unique evolutionary properties of these genes in many species, have led to the widespread assumption that they are the product of sexual selection and sexual conflict. However, we still lack a clear understanding of the connection between sexual selection and transcriptional dimorphism, often termed sex-biased gene expression. Moreover, the relative contribution of sexual selection vs. drift in shaping broad patterns of expression, divergence, and polymorphism remains unknown. To assess the role of sexual selection in shaping these patterns, we assembled transcriptomes from an avian clade representing the full range of sexual dimorphism and sexual selection. We use these species to test the links between sexual selection and sex-biased gene expression evolution in a comparative framework. Through ancestral reconstruction of sex bias, we demonstrate a rapid turnover of sex bias across this clade driven by sexual selection and show it to be primarily the result of expression changes in males. We use phylogenetically controlled comparative methods to demonstrate that phenotypic measures of sexual selection predict the proportion of male-biased but not female-biased gene expression. Although male-biased genes show elevated rates of coding sequence evolution, consistent with previous reports in a range of taxa, there is no association between sexual selection and rates of coding sequence evolution, suggesting that expression changes may be more important than coding sequence in sexual selection. Taken together, our results highlight the power of sexual selection to act on gene expression differences and shape genome evolution.
Transcription Factor Map Alignment of Promoter Regions

PubMed Central

Blanco, Enrique; Messeguer, Xavier; Smith, Temple F; Guigó, Roderic

2006-01-01

We address the problem of comparing and characterizing the promoter regions of genes with similar expression patterns. This remains a challenging problem in sequence analysis, because often the promoter regions of co-expressed genes do not show discernible sequence conservation. In our approach, thus, we have not directly compared the nucleotide sequence of promoters. Instead, we have obtained predictions of transcription factor binding sites, annotated the predicted sites with the labels of the corresponding binding factors, and aligned the resulting sequences of labels—to which we refer here as transcription factor maps (TF-maps). To obtain the global pairwise alignment of two TF-maps, we have adapted an algorithm initially developed to align restriction enzyme maps. We have optimized the parameters of the algorithm in a small, but well-curated, collection of human–mouse orthologous gene pairs. Results in this dataset, as well as in an independent much larger dataset from the CISRED database, indicate that TF-map alignments are able to uncover conserved regulatory elements, which cannot be detected by the typical sequence alignments. PMID:16733547
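To illustrate the idea of aligning sequences of binding-factor labels rather than nucleotides, the sketch below runs a plain Needleman-Wunsch global alignment over two lists of labels. This is not the authors' algorithm: their method is adapted from restriction-map alignment, whereas this sketch ignores site positions and prediction scores entirely. The scoring values and the factor labels are assumptions chosen for illustration.

```python
def align_labels(a, b, match=2, mismatch=-1, gap=-1):
    """Global alignment of two label sequences; returns (score, aligned pairs)."""
    n, m = len(a), len(b)
    score = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        score[i][0] = i * gap
    for j in range(1, m + 1):
        score[0][j] = j * gap

    def sub(i, j):
        # Substitution score for aligning a[i-1] with b[j-1].
        return match if a[i - 1] == b[j - 1] else mismatch

    for i in range(1, n + 1):
        for j in range(1, m + 1):
            score[i][j] = max(score[i - 1][j - 1] + sub(i, j),
                              score[i - 1][j] + gap,
                              score[i][j - 1] + gap)

    # Trace back from the bottom-right corner to recover one optimal alignment.
    pairs, i, j = [], n, m
    while i > 0 or j > 0:
        if i > 0 and j > 0 and score[i][j] == score[i - 1][j - 1] + sub(i, j):
            pairs.append((a[i - 1], b[j - 1]))
            i, j = i - 1, j - 1
        elif i > 0 and score[i][j] == score[i - 1][j] + gap:
            pairs.append((a[i - 1], None))  # label in a aligned to a gap
            i -= 1
        else:
            pairs.append((None, b[j - 1]))  # label in b aligned to a gap
            j -= 1
    pairs.reverse()
    return score[n][m], pairs


if __name__ == "__main__":
    # Hypothetical TF-maps for two orthologous promoters.
    human = ["SP1", "TATA", "CREB"]
    mouse = ["SP1", "CREB"]
    print(align_labels(human, mouse))
```

Because only labels are compared, two promoters with no detectable nucleotide conservation can still align well if they carry the same factors in the same order.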
A 5′ Noncoding Exon Containing Engineered Intron Enhances Transgene Expression from Recombinant AAV Vectors in vivo

PubMed Central

Lu, Jiamiao; Williams, James A.; Luke, Jeremy; Zhang, Feijie; Chu, Kirk; Kay, Mark A.

2017-01-01

We previously developed a mini-intronic plasmid (MIP) expression system in which the essential bacterial elements for plasmid replication and selection are placed within an engineered intron contained within a universal 5′ UTR noncoding exon. Like minicircle DNA plasmids (devoid of bacterial backbone sequences), MIP plasmids overcome transcriptional silencing of the transgene. In addition, however, MIP plasmids increase transgene expression 2-fold and often >10-fold relative to minicircle vectors in vivo and in vitro. Based on these findings, we examined the effects of the MIP intronic sequences in a recombinant adeno-associated virus (AAV) vector system. Recombinant AAV vectors containing an intron with a bacterial replication origin and bacterial selectable marker increased transgene expression by 40 to 100 times in vivo when compared with conventional AAV vectors. Therefore, inclusion of this noncoding exon/intron sequence upstream of the coding region can substantially enhance AAV-mediated gene expression in vivo. PMID:27903072

[Cloning and characterization of genes differentially expressed in human dental pulp cells and gingival fibroblasts].

PubMed

Wang, Zhong-dong; Wu, Ji-nan; Zhou, Lin; Ling, Jun-qi; Guo, Xi-min; Xiao, Ming-zhen; Zhu, Feng; Pu, Qin; Chai, Yu-bo; Zhao, Zhong-liang

2007-02-01

To study the biological properties of human dental pulp cells (HDPC) by cloning and analysing genes differentially expressed in HDPC in comparison with human gingival fibroblasts (HGF). HDPC and HGF were cultured and identified by immunocytochemistry. An HDPC and HGF subtractive cDNA library was established by PCR-based modified subtractive hybridization; genes differentially expressed by HDPC were cloned, sequenced and compared by BLAST to find homologous sequences in GenBank. Cloning and sequencing analysis yielded 12 differentially expressed genes, of which two were unknown. Among the 10 known genes, 4 were related to signal transduction, 2 were related to trans-membrane transportation (both cell membrane and nuclear membrane), and 2 were related to RNA splicing mechanisms. The biological properties of HDPC are determined by the differential expression of some genes, and the growth and differentiation of HDPC are associated with the dynamic protein synthesis and secretion activities of the cell.
Rapid in silico cloning of genes using expressed sequence tags (ESTs).

PubMed

Gill, R W; Sanseau, P

2000-01-01

Expressed sequence tags (ESTs) are short single-pass DNA sequences obtained from either end of cDNA clones. These ESTs are derived from a vast number of cDNA libraries obtained from different species. Human ESTs make up the bulk of the data and have been widely used to identify new members of gene families, as markers on the human chromosomes, to discover polymorphism sites and to compare expression patterns in different tissues or pathological states. Information strategies have been devised to query EST databases. Since most of the analysis is performed with a computer, the term "in silico" strategy has been coined. In this chapter we will review the current status of EST databases, the pros and cons of EST-type data and describe possible strategies to retrieve meaningful information.
Gene discovery in Eimeria tenella by immunoscreening cDNA expression libraries of sporozoites and schizonts with chicken intestinal antibodies.

PubMed

Réfega, Susana; Girard-Misguich, Fabienne; Bourdieu, Christiane; Péry, Pierre; Labbé, Marie

2003-04-02

Specific antibodies were produced ex vivo from intestinal culture of Eimeria tenella infected chickens. The specificity of these intestinal antibodies was tested against different parasite stages. These antibodies were used to immunoscreen first generation schizont and sporozoite cDNA libraries permitting the identification of new E. tenella antigens. We obtained a total of 119 cDNA clones which were subjected to sequence analysis. The sequences coding for the proteins inducing local immune responses were compared with nucleotide or protein databases and with expressed sequence tags (ESTs) databases. We identified new Eimeria genes coding for heat shock proteins, a ribosomal protein, a pyruvate kinase and a pyridoxine kinase. Specific features of other sequences are discussed.
Bat Accelerated Regions Identify a Bat Forelimb Specific Enhancer in the HoxD Locus

PubMed Central

Mason, Mandy K.; VanderMeer, Julia E.; Zhao, Jingjing; Eckalbar, Walter L.; Logan, Malcolm; Illing, Nicola; Pollard, Katherine S.; Ahituv, Nadav

2016-01-01

The molecular events leading to the development of the bat wing remain largely unknown, and are thought to be caused, in part, by changes in gene expression during limb development. These expression changes could be instigated by variations in gene regulatory enhancers. Here, we used a comparative genomics approach to identify regions that evolved rapidly in the bat ancestor, but are highly conserved in other vertebrates. We discovered 166 bat accelerated regions (BARs) that overlap H3K27ac and p300 ChIP-seq peaks in developing mouse limbs. Using a mouse enhancer assay, we show that five Myotis lucifugus BARs drive gene expression in the developing mouse limb, with the majority showing differential enhancer activity compared to the mouse orthologous BAR sequences. These include BAR116, which is located telomeric to the HoxD cluster and had robust forelimb expression for the M. lucifugus sequence and no activity for the mouse sequence at embryonic day 12.5. Developing limb expression analysis of Hoxd10-Hoxd13 in Miniopterus natalensis bats showed a high-forelimb weak-hindlimb expression for Hoxd10-Hoxd11, similar to the expression trend observed for M. lucifugus BAR116 in mice, suggesting that it could be involved in the regulation of the bat HoxD complex. Combined, our results highlight novel regulatory regions that could be instrumental for the morphological differences leading to the development of the bat wing. PMID:27019019
Quantitative analysis of a deeply sequenced marine microbial metatranscriptome.

PubMed

Gifford, Scott M; Sharma, Shalabh; Rinta-Kanto, Johanna M; Moran, Mary Ann

2011-03-01

The potential of metatranscriptomic sequencing to provide insights into the environmental factors that regulate microbial activities depends on how fully the sequence libraries capture community expression (that is, sample-sequencing depth and coverage depth), and the sensitivity with which expression differences between communities can be detected (that is, statistical power for hypothesis testing). In this study, we use an internal standard approach to make absolute (per liter) estimates of transcript numbers, a significant advantage over proportional estimates that can be biased by expression changes in unrelated genes. Coastal waters of the southeastern United States contain 1 × 10^12 bacterioplankton mRNA molecules per liter of seawater (~200 mRNA molecules per bacterial cell). Even for the large bacterioplankton libraries obtained in this study (~500,000 possible protein-encoding sequences in each of two libraries after discarding rRNAs and small RNAs from >1 million 454 FLX pyrosequencing reads), sample-sequencing depth was only 0.00001%. Expression levels of 82 genes diagnostic for transformations in the marine nitrogen, phosphorus and sulfur cycles ranged from below detection (<1 × 10^6 transcripts per liter) for 36 genes (for example, phosphonate metabolism gene phnH, dissimilatory nitrate reductase subunit napA) to >2.7 × 10^9 transcripts per liter (ammonia transporter amt and ammonia monooxygenase subunit amoC). Half of the categories for which expression was detected, however, had too few copy numbers for robust statistical resolution, as would be required for comparative (experimental or time-series) expression studies. By representing whole community gene abundance and expression in absolute units (per volume or mass of environment), 'omics' data can be better leveraged to improve understanding of microbially mediated processes in the ocean.
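The internal standard approach mentioned above can be sketched as simple arithmetic: a known number of standard molecules is added to the sample, and the ratio of molecules added to standard reads recovered converts read counts into absolute transcript numbers. The Python below is a minimal illustration under that assumption; the function names and every number in the example are hypothetical and are not the study's data or exact procedure.

```python
def transcripts_per_liter(gene_reads: int,
                          standard_reads: int,
                          standard_molecules_added: float,
                          liters_filtered: float) -> float:
    """Scale a gene's read count to absolute transcripts per liter."""
    if standard_reads <= 0 or liters_filtered <= 0:
        raise ValueError("standard reads and volume must be positive")
    # Each recovered read is taken to represent this many molecules in the sample.
    molecules_per_read = standard_molecules_added / standard_reads
    return gene_reads * molecules_per_read / liters_filtered


def sampling_depth_percent(sequences_obtained: float,
                           molecules_in_sample: float) -> float:
    """Percentage of the sample's mRNA molecules represented in the library."""
    if molecules_in_sample <= 0:
        raise ValueError("molecule count must be positive")
    return 100.0 * sequences_obtained / molecules_in_sample


if __name__ == "__main__":
    # Hypothetical: 200 reads for a gene, 1000 standard reads recovered from
    # 1e9 standard molecules added, 2 liters of seawater filtered.
    print(transcripts_per_liter(200, 1000, 1e9, 2.0))
    # Hypothetical: 500,000 sequences from a sample holding 5e12 mRNA molecules.
    print(sampling_depth_percent(500_000, 5e12))
```

Unlike proportional estimates, the per-liter value for one gene does not change when unrelated genes become more or less abundant, because the scaling factor comes from the standard alone.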
Sequestration of cAMP response element-binding proteins by transcription factor decoys causes collateral elaboration of regenerating Aplysia motor neuron axons.

PubMed

Dash, P K; Tian, L M; Moore, A N

1998-07-07

Axonal injury increases intracellular Ca2+ and cAMP and has been shown to induce gene expression, which is thought to be a key event for regeneration. Increases in intracellular Ca2+ and/or cAMP can alter gene expression via activation of a family of transcription factors that bind to and modulate the expression of CRE (Ca2+/cAMP response element) sequence-containing genes. We have used Aplysia motor neurons to examine the role of CRE-binding proteins in axonal regeneration after injury. We report that axonal injury increases the binding of proteins to a CRE sequence-containing probe. In addition, Western blot analysis revealed that the level of ApCREB2, a CRE sequence-binding repressor, was enhanced as a result of axonal injury. The sequestration of CRE-binding proteins by microinjection of CRE sequence-containing plasmids enhanced axon collateral formation (both number and length) as compared with control plasmid injections. These findings show that Ca2+/cAMP-mediated gene expression via CRE-binding transcription factors participates in the regeneration of motor neuron axons.
Bioinformatic Analysis of the Human Recombinant Iduronate 2-Sulfate Sulfatase

PubMed Central

Morales-Álvarez, Edwin D.; Rivera-Hoyos, Claudia M.; Landázuri, Patricia; Poutou-Piñales, Raúl A.; Pedroza-Rodríguez, Aura M.

2016-01-01

Mucopolysaccharidosis type II is a human recessive disease linked to the X chromosome caused by deficiency of the lysosomal enzyme Iduronate 2-Sulfate Sulfatase (IDS), which leads to accumulation of glycosaminoglycans in tissues and organs. The human enzyme has been expressed in Escherichia coli and Pichia pastoris in an attempt to develop more successful expression systems that allow the production of recombinant IDS for Enzyme Replacement Therapy (ERT). However, the preservation of the native signal peptide in the sequence has caused conflicts in processing and recognition in the past, which led to problems in expression and enzyme activity. With the main objective of improving the expression system, we eliminated the native signal peptide of human recombinant IDS. The resulting sequence showed two modified codons; our study therefore aimed to analyze computationally the nucleotide sequence of the IDSnh without signal peptide in order to determine the 3D structure and other biochemical properties and to compare them with those of the native human IDS (IDSnh). Results showed that there are no significant differences between the two molecules in spite of the two-codon modifications detected in the recombinant DNA sequence. PMID:27335624
Analysis of temporal transcription expression profiles reveal links between protein function and developmental stages of Drosophila melanogaster.

PubMed

Wan, Cen; Lees, Jonathan G; Minneci, Federico; Orengo, Christine A; Jones, David T

2017-10-01

Accurate gene or protein function prediction is a key challenge in the post-genome era. Most current methods perform well on molecular function prediction, but struggle to provide useful annotations relating to biological process functions due to the limited power of sequence-based features in that functional domain. In this work, we systematically evaluate the predictive power of temporal transcription expression profiles for protein function prediction in Drosophila melanogaster. Our results show significantly better performance on predicting protein function when transcription expression profile-based features are integrated with sequence-derived features, compared with the sequence-derived features alone. We also observe that the combination of expression-based and sequence-based features leads to further improvement of accuracy on predicting all three domains of gene function. Based on the optimal feature combinations, we then propose a novel multi-classifier-based function prediction method for Drosophila melanogaster proteins, FFPred-fly+. Interpreting our machine learning models also allows us to identify some of the underlying links between biological processes and developmental stages of Drosophila melanogaster.
Optimized Probe Masking for Comparative Transcriptomics of Closely Related Species

PubMed Central

Poeschl, Yvonne; Delker, Carolin; Trenner, Jana; Ullrich, Kristian Karsten; Quint, Marcel; Grosse, Ivo

2013-01-01

Microarrays are commonly applied to study the transcriptome of specific species. However, many available microarrays are restricted to model organisms, and the design of custom microarrays for other species is often not feasible. Hence, transcriptomics approaches for non-model organisms, as well as comparative transcriptomics studies among two or more species, often either rely on cost-intensive RNAseq studies or, alternatively, hybridize transcripts of a query species to a microarray of a closely related species. When analyzing these cross-species microarray expression data, differences in the transcriptome of the query species can cause problems, such as the following: (i) lower hybridization accuracy of probes due to mismatches or deletions, (ii) probes binding multiple transcripts of different genes, and (iii) probes binding transcripts of non-orthologous genes. So far, methods for (i) exist, but these neglect (ii) and (iii). Here, we propose an approach for comparative transcriptomics addressing problems (i) to (iii), which retains only transcript-specific probes binding transcripts of orthologous genes. We apply this approach to an Arabidopsis lyrata expression data set measured on a microarray designed for Arabidopsis thaliana, and compare it to two alternative approaches, a sequence-based approach and a genomic DNA hybridization-based approach. We investigate the number of retained probe sets, and we validate the resulting expression responses by qRT-PCR. We find that the proposed approach combines the benefit of sequence-based stringency and accuracy while allowing the expression analysis of many more genes than the alternative sequence-based approach. As an added benefit, the proposed approach requires probes to detect transcripts of orthologous genes only, which provides a superior base for biological interpretation of the measured expression responses. PMID:24260119
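The three filtering criteria described above can be sketched as a simple set-based filter. The Python below is a hypothetical illustration, not the published implementation: it assumes that each probe's hits on query-species genes have already been computed (for example by sequence alignment, with poorly matching probes yielding no hit), and that a table of reference-to-query orthologs is available. All identifiers are invented for the example.

```python
def retain_probes(probe_target: dict[str, str],
                  probe_hits: dict[str, set[str]],
                  orthologs: dict[str, str]) -> list[str]:
    """Keep probes that bind exactly one query gene, the ortholog of their target.

    probe_target: probe id -> reference-species gene the probe was designed for
    probe_hits:   probe id -> query-species genes whose transcripts it binds
    orthologs:    reference-species gene -> orthologous query-species gene
    """
    kept = []
    for probe, target in probe_target.items():
        hits = probe_hits.get(probe, set())
        # (i) no acceptable hit, or (ii) transcripts of several genes: discard.
        if len(hits) != 1:
            continue
        (hit,) = hits
        # (iii) the single hit must be the ortholog of the designed target.
        if orthologs.get(target) == hit:
            kept.append(probe)
    return kept


if __name__ == "__main__":
    # Hypothetical probes on a reference array and their hits in the query species.
    probe_target = {"p1": "refA", "p2": "refA", "p3": "refB", "p4": "refC"}
    probe_hits = {
        "p1": {"qryA"},          # specific, orthologous: kept
        "p2": {"qryA", "qryX"},  # binds two genes: discarded
        "p3": {"qryX"},          # binds a non-orthologous gene: discarded
        "p4": set(),             # no acceptable hit: discarded
    }
    orthologs = {"refA": "qryA", "refB": "qryB", "refC": "qryC"}
    print(retain_probes(probe_target, probe_hits, orthologs))
```

Filtering at the probe level rather than the probe-set level means a probe set can survive with fewer probes, which is one plausible reason such an approach retains more genes than a stricter sequence-only filter.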
An upstream sequence modulates phenazine production at the level of transcription and translation in the biological control strain Pseudomonas chlororaphis 30-84

PubMed Central

Wang, Dongping; Ries, Tessa R.; Pierson, Leland S.; Pierson, Elizabeth A.

2018-01-01

Phenazines are bacterial secondary metabolites and play important roles in the antagonistic activity of the biological control strain P. chlororaphis 30–84 against take-all disease of wheat. The expression of the P. chlororaphis 30–84 phenazine biosynthetic operon (phzXYFABCD) is dependent on the PhzR/PhzI quorum sensing system located immediately upstream of the biosynthetic operon as well as other regulatory systems including Gac/Rsm. Bioinformatic analysis of the sequence between the divergently oriented phzR and phzX promoters identified features within the 5’-untranslated region (5’-UTR) of phzX that are conserved only among 2OHPCA producing Pseudomonas. The conserved sequence features are potentially capable of producing secondary structures that negatively modulate one or both promoters. Transcriptional and translational fusion assays revealed that deletion of 90-bp of sequence at the 5’-UTR of phzX led to up to 4-fold greater expression of the reporters with the deletion compared to the controls, which indicated this sequence negatively modulates phenazine gene expression both transcriptionally and translationally. This 90-bp sequence was deleted from the P. chlororaphis 30–84 chromosome, resulting in 30-84Enh, which produces significantly more phenazine than the wild-type while retaining quorum sensing control. The transcriptional expression of phzR/phzI and amount of AHL signal produced by 30-84Enh also were significantly greater than for the wild-type, suggesting this 90-bp sequence also negatively affects expression of the quorum sensing genes. In addition, deletion of the 90-bp partially relieved RsmE-mediated translational repression, indicating a role for Gac/RsmE interaction. 
Compared to the wild-type, enhanced phenazine production by 30-84Enh resulted in improvement in fungal inhibition, biofilm formation, extracellular DNA release and suppression of take-all disease of wheat in soil without negative consequences on growth or rhizosphere persistence. This work provides greater insight into the regulation of phenazine biosynthesis with potential applications for improved biological control. PMID:29451920
Genomic resources for songbird research and their use in characterizing gene expression during brain development

PubMed Central

Li, XiaoChing; Wang, Xiu-Jie; Tannenhauser, Jonathan; Podell, Sheila; Mukherjee, Piali; Hertel, Moritz; Biane, Jeremy; Masuda, Shoko; Nottebohm, Fernando; Gaasterland, Terry

2007-01-01

Vocal learning and neuronal replacement have been studied extensively in songbirds, but until recently, few molecular and genomic tools for songbird research existed. Here we describe new molecular/genomic resources developed in our laboratory. We made cDNA libraries from zebra finch (Taeniopygia guttata) brains at different developmental stages. A total of 11,000 cDNA clones from these libraries, representing 5,866 unique gene transcripts, were randomly picked and sequenced from the 3′ ends. A web-based database was established for clone tracking, sequence analysis, and functional annotations. Our cDNA libraries were not normalized. Sequencing ESTs without normalization produced many developmental stage-specific sequences, yielding insights into patterns of gene expression at different stages of brain development. In particular, the cDNA library made from brains at posthatching day 30–50, corresponding to the period of rapid song system development and song learning, has the most diverse and richest set of genes expressed. We also identified five microRNAs whose sequences are highly conserved between zebra finch and other species. We printed cDNA microarrays and profiled gene expression in the high vocal center of both adult male zebra finches and canaries (Serinus canaria). Genes differentially expressed in the high vocal center were identified from the microarray hybridization results. Selected genes were validated by in situ hybridization. Networks among the regulated genes were also identified. These resources provide songbird biologists with tools for genome annotation, comparative genomics, and microarray gene expression analysis. PMID:17426146
Analysis of expressed sequence tags from Prunus mume flower and fruit and development of simple sequence repeat markers

PubMed Central

2010-01-01

Background Expressed sequence tags (ESTs) have been a cost-effective tool in molecular biology and represent an abundant, valuable resource for genome annotation, gene expression, and comparative genomics in plants. Results In this study, we constructed a cDNA library of Prunus mume flower and fruit, sequenced 10,123 clones of the library, and obtained 8,656 high-quality EST sequences. The ESTs were assembled into 4,473 unigenes composed of 1,492 contigs and 2,981 singletons, which have been deposited in NCBI (accession IDs: GW868575 - GW873047), and among which 1,294 unique ESTs had known or putative functions. Furthermore, we found 1,233 putative simple sequence repeats (SSRs) in the P. mume unigene dataset. We randomly tested 42 pairs of PCR primers flanking potential SSRs, and 14 pairs were identified as true-to-type SSR loci and could amplify polymorphic bands from 20 individual plants of P. mume. We further used the 14 EST-SSR primer pairs to test the transferability on peach and plum. The result showed that nearly 89% of the primer pairs produced target PCR bands in the two species. A high level of marker polymorphism was observed in the plum species (65%) and a low level in the peach (46%), and the clustering analysis of the three species indicated that these SSR markers were useful in the evaluation of genetic relationships and diversity between and within the Prunus species. Conclusions We have constructed the first cDNA library of P. mume flower and fruit, and our data provide sets of molecular biology resources for P. mume and other Prunus species. These resources will be useful for further study such as genome annotation, new gene discovery, gene functional analysis, molecular breeding, evolution and comparative genomics between Prunus species. PMID:20626882
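Screening unigenes for putative SSRs, as described above, amounts to searching each sequence for short motifs repeated in tandem. The sketch below does this with a regular expression. The motif length range (2 to 6 bases) and the minimum of four repeats are assumptions chosen for illustration; the abstract does not state the thresholds or the software the authors used.

```python
import re


def find_ssrs(seq: str, min_repeats: int = 4) -> list[tuple[str, int, int]]:
    """Return (motif, repeat count, 0-based start) for tandem repeats in seq."""
    # Lazy {2,6}? prefers the shortest motif, so ATATATAT is reported as (AT)4
    # rather than (ATAT)2. The backreference requires min_repeats copies in total.
    pattern = re.compile(r"([ACGT]{2,6}?)\1{%d,}" % (min_repeats - 1))
    found = []
    for match in pattern.finditer(seq.upper()):
        motif = match.group(1)
        if len(set(motif)) == 1:
            continue  # skip homopolymer runs such as AAAAAAAA
        repeats = len(match.group(0)) // len(motif)
        found.append((motif, repeats, match.start()))
    return found


if __name__ == "__main__":
    # Hypothetical EST fragment containing an (AT)5 microsatellite.
    est = "GGC" + "AT" * 5 + "CCG"
    print(find_ssrs(est))
```

Primers would then be designed in the flanking sequence on either side of each reported start position, which is why the start offset is returned along with the motif.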
Alteration of gene expression in human hepatocellular carcinoma with integrated hepatitis B virus DNA.

PubMed

Tamori, Akihiro; Yamanishi, Yoshihiro; Kawashima, Shuichi; Kanehisa, Minoru; Enomoto, Masaru; Tanaka, Hiromu; Kubo, Shoji; Shiomi, Susumu; Nishiguchi, Shuhei

2005-08-15

Integration of hepatitis B virus (HBV) DNA into the human genome is one of the most important steps in HBV-related carcinogenesis. This study attempted to find the link between HBV DNA, the adjoining cellular sequence, and altered gene expression in hepatocellular carcinoma (HCC) with integrated HBV DNA. We examined 15 cases of HCC infected with HBV by cassette ligation-mediated PCR. The human DNA adjacent to the integrated HBV DNA was sequenced. Protein coding sequences were searched for in the human sequence. In five cases with HBV DNA integration, from which good quality RNA was extracted, gene expression was examined by cDNA microarray analysis. The human DNA sequence successive to integrated HBV DNA was determined in the 15 HCCs. Eight protein-coding regions were involved: ras-responsive element binding protein 1, calmodulin 1, mixed lineage leukemia 2 (MLL2), FLJ333655, LOC220272, LOC255345, LOC220220, and LOC168991. The MLL2 gene was expressed in three cases with HBV DNA integrated into exon 3 of MLL2 and in one case with HBV DNA integrated into intron 3 of MLL2. Gene expression analysis suggested that two HCCs with HBV integrated into MLL2 had similar patterns of gene expression compared with three HCCs with HBV integrated into other loci of human chromosomes. HBV DNA was integrated at random sites of human DNA, and the MLL2 gene was one of the targets for integration. Our results suggest that HBV DNA might modulate human genes near integration sites, followed by integration site-specific expression of such genes during hepatocarcinogenesis.
Inferring gene expression from ribosomal promoter sequences, a crowdsourcing approach

PubMed Central

Meyer, Pablo; Siwo, Geoffrey; Zeevi, Danny; Sharon, Eilon; Norel, Raquel; Segal, Eran; Stolovitzky, Gustavo; Siwo, Geoffrey; Rider, Andrew K.; Tan, Asako; Pinapati, Richard S.; Emrich, Scott; Chawla, Nitesh; Ferdig, Michael T.; Tung, Yi-An; Chen, Yong-Syuan; Chen, Mei-Ju May; Chen, Chien-Yu; Knight, Jason M.; Sahraeian, Sayed Mohammad Ebrahim; Esfahani, Mohammad Shahrokh; Dreos, Rene; Bucher, Philipp; Maier, Ezekiel; Saeys, Yvan; Szczurek, Ewa; Myšičková, Alena; Vingron, Martin; Klein, Holger; Kiełbasa, Szymon M.; Knisley, Jeff; Bonnell, Jeff; Knisley, Debra; Kursa, Miron B.; Rudnicki, Witold R.; Bhattacharjee, Madhuchhanda; Sillanpää, Mikko J.; Yeung, James; Meysman, Pieter; Rodríguez, Aminael Sánchez; Engelen, Kristof; Marchal, Kathleen; Huang, Yezhou; Mordelet, Fantine; Hartemink, Alexander; Pinello, Luca; Yuan, Guo-Cheng

2013-01-01

The Gene Promoter Expression Prediction challenge consisted of predicting gene expression from promoter sequences in a previously unknown experimentally generated data set. The challenge was presented to the community in the framework of the sixth Dialogue for Reverse Engineering Assessments and Methods (DREAM6), a community effort to evaluate the status of systems biology modeling methodologies. Nucleotide-specific promoter activity was obtained by measuring fluorescence from promoter sequences fused upstream of a gene for yellow fluorescence protein and inserted in the same genomic site of yeast Saccharomyces cerevisiae. Twenty-one teams submitted results predicting the expression levels of 53 different promoters from yeast ribosomal protein genes. Analysis of participant predictions shows that accurate values for low-expressed and mutated promoters were difficult to obtain, although in the latter case, only when the mutation induced a large change in promoter activity compared to the wild-type sequence. As in previous DREAM challenges, we found that aggregation of participant predictions provided robust results, but did not fare better than the three best algorithms. Finally, this study not only provides a benchmark for the assessment of methods predicting activity of a specific set of promoters from their sequence, but it also shows that the top performing algorithm, which used machine-learning approaches, can be improved by the addition of biological features such as transcription factor binding sites. PMID:23950146
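The abstract reports that aggregating participant predictions gave robust results but does not describe the aggregation scheme. One simple possibility, shown here purely as an assumption, is to average each promoter's predicted activity across teams. The team and promoter names and the values are hypothetical.

```python
from statistics import mean


def aggregate_predictions(team_predictions):
    """Average predicted activity per promoter across teams.

    team_predictions maps a team name to a dict of promoter -> predicted value.
    Only promoters predicted by every team are aggregated.
    """
    if not team_predictions:
        raise ValueError("at least one team is required")
    shared = set.intersection(*(set(p) for p in team_predictions.values()))
    return {
        name: mean(p[name] for p in team_predictions.values())
        for name in sorted(shared)
    }


# Hypothetical predictions from three teams for two promoters.
teams = {
    "team_a": {"promoter_1": 1.0, "promoter_2": 3.0},
    "team_b": {"promoter_1": 2.0, "promoter_2": 5.0},
    "team_c": {"promoter_1": 3.0, "promoter_2": 4.0},
}
print(aggregate_predictions(teams))
```

A plain mean is only one option; rank-based or weighted aggregation would be equally plausible readings of the abstract.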
[Effect of human oviductal embryotrophic factors on gene expression of mouse preimplantation embryos].

PubMed

Yao, Yuan-Qing; Lee, Kai-Fai; Xu, Jia-Seng; Ho, Pak-Chung; Yeung, Shu-Biu

2007-09-01

To investigate the effect of embryotrophic factors (ETF) from human oviductal cells on gene expression of mouse early developmental embryos and discuss the role of fallopian tube in early development of embryos. ETF was isolated from conditioned medium of human oviductal cell line by sequential liquid chromatographic systems. Mouse embryos were treated by ETF in vitro. Using differential display RT-PCR, the gene expression of embryos treated by ETF was compared with embryos without ETF treatment. The differentially expressed genes were separated, re-amplified, cloned and sequenced. Gene expression profiles of embryos with ETF treatment were different from those of embryos without this treatment. Eight differentially expressed genes were cloned and sequenced. These genes functioned in RNA degradation, synthesis, splicing, protein trafficking, cellular differentiation and embryo development. Embryotrophic factors from human oviductal cells affect gene expression of early developmental embryos. The human oviductal cells play wide roles in early developmental stages of embryos.
Gut transcriptome of replete adult female cattle ticks, Rhipicephalus (Boophilus) microplus, feeding upon a Babesia bovis-infected bovine host.

PubMed

Heekin, Andrew M; Guerrero, Felix D; Bendele, Kylie G; Saldivar, Leo; Scoles, Glen A; Dowd, Scot E; Gondro, Cedric; Nene, Vishvanath; Djikeng, Appolinaire; Brayton, Kelly A

2013-09-01

As it feeds upon cattle, Rhipicephalus (Boophilus) microplus is capable of transmitting a number of pathogenic organisms, including the apicomplexan hemoparasite Babesia bovis, a causative agent of bovine babesiosis. The R. microplus female gut transcriptome was studied for two cohorts: adult females feeding on a bovine host infected with B. bovis and adult females feeding on an uninfected bovine. RNA was purified and used to generate a subtracted cDNA library from B. bovis-infected female gut, and 4,077 expressed sequence tags (ESTs) were sequenced. Gene expression was also measured by a microarray designed from the publicly available R. microplus gene index: BmiGI Version 2. We compared gene expression in the tick gut from females feeding upon an uninfected bovine to gene expression in tick gut from females feeding upon a splenectomized bovine infected with B. bovis. Thirty-three ESTs represented on the microarray were expressed at a higher level in female gut samples from the ticks feeding upon a B. bovis-infected calf compared to expression levels in female gut samples from ticks feeding on an uninfected calf. Forty-three transcripts were expressed at a lower level in the ticks feeding upon B. bovis-infected female guts compared with expression in female gut samples from ticks feeding on the uninfected calf. These array data were used as initial characterization of gene expression associated with the infection of R. microplus by B. bovis.
Expression of myostatin is not altered in lines of poultry exhibiting myofiber hyper- and hypoplasia.

PubMed

Mott, I; Ivarie, R

2002-06-01

Decades of selective breeding have yielded lines of poultry with substantial myofiber hyperplasia, yet little is known about what genes have been altered during the course of selection. Myostatin is a strong negative regulator of muscle mass in mice and cattle and could have been one of many genetic factors contributing to increased myofiber deposition in growth-selected lines of poultry. To test this hypothesis, the sequence and expression patterns of myostatin were analyzed in growth-selected lines of chickens and quail. The sequence of broiler myostatin cDNA, amplified via reverse transcription (RT)-PCR from embryonic muscle RNA, contained no missense mutations in the coding sequence when compared to that of White Leghorn layers, although two silent single nucleotide polymorphisms (SNP) were found. Northern analysis of myostatin transcripts from embryonic pectoralis and quadriceps showed no significant differences in expression levels between broiler and layer muscle RNA. However, levels of myostatin transcripts were greatly reduced in muscles of posthatch chicks compared to embryonic muscle. Myostatin protein was also present in broiler and layer embryonic muscle at similar levels. No significant polymorphisms or differences in RNA expression levels were found in embryonic muscles of divergently selected lines of Japanese quail. These results indicate that intense artificial selection in these growth-selected lines of poultry has neither silenced the expression of myostatin nor created null alleles via mutation in the lines analyzed.
OP17 MICRORNA PROFILING USING SMALL RNA-SEQ IN PAEDIATRIC LOW GRADE GLIOMAS

PubMed Central

Jeyapalan, Jennie N.; Jones, Tania A.; Tatevossian, Ruth G.; Qaddoumi, Ibrahim; Ellison, David W.; Sheer, Denise

2014-01-01

INTRODUCTION: MicroRNAs regulate gene expression by targeting mRNAs for translational repression or degradation at the post-transcriptional level. In paediatric low-grade gliomas a few key genetic mutations have been identified, including BRAF fusions, FGFR1 duplications and MYB rearrangements. Our aim in the current study is to profile aberrant microRNA expression in paediatric low-grade gliomas and determine the role of epigenetic changes in the aetiology and behaviour of these tumours. METHOD: MicroRNA profiling of tumour samples (6 pilocytic, 2 diffuse, 2 pilomyxoid astrocytomas) and normal brain controls (4 adult normal brain samples and a primary glial progenitor cell-line) was performed using small RNA sequencing. Bioinformatic analysis included sequence alignment, analysis of the number of reads (CPM, counts per million) and differential expression. RESULTS: Sequence alignment identified 695 microRNAs, whose expression was compared in tumours v. normal brain. PCA and hierarchical clustering showed separate groups for tumours and normal brain. Computational analysis identified approximately 400 differentially expressed microRNAs in the tumours compared to matched location controls. Our findings will then be validated and integrated with extensive genetic and epigenetic information we have previously obtained for the full tumour cohort. CONCLUSION: We have identified microRNAs that are differentially expressed in paediatric low-grade gliomas. As microRNAs are known to target genes involved in the initiation and progression of cancer, they provide critical information on tumour pathogenesis and are an important class of biomarkers.
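The CPM (counts per million) normalization mentioned in the method can be illustrated with a short Python sketch. The microRNA names and read counts below are hypothetical, and the function name is an assumption for this example; the abstract does not describe the actual pipeline.

```python
def counts_per_million(counts):
    """Scale raw read counts so that each sample's counts sum to one million."""
    total = sum(counts.values())
    if total == 0:
        raise ValueError("library has no mapped reads")
    return {name: count * 1_000_000 / total for name, count in counts.items()}


# Hypothetical library with 1,000 mapped reads in total.
library = {"miR-a": 50, "miR-b": 150, "miR-c": 800}
print(counts_per_million(library))
```

Because CPM divides by the library total, values from libraries of different sequencing depth become comparable before differential expression is tested.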
Reference genome sequence of the model plant Setaria

DOE Office of Scientific and Technical Information (OSTI.GOV)

Bennetzen, Jeffrey L; Schmutz, Jeremy; Wang, Hao

We generated a high-quality reference genome sequence for foxtail millet (Setaria italica). The ~400-Mb assembly covers ~80% of the genome and >95% of the gene space. The assembly was anchored to a 992-locus genetic map and was annotated by comparison with >1.3 million expressed sequence tag reads. We produced more than 580 million RNA-Seq reads to facilitate expression analyses. We also sequenced Setaria viridis, the ancestral wild relative of S. italica, and identified regions of differential single-nucleotide polymorphism density, distribution of transposable elements, small RNA content, chromosomal rearrangement and segregation distortion. The genus Setaria includes natural and cultivated species that demonstrate a wide capacity for adaptation. The genetic basis of this adaptation was investigated by comparing five sequenced grass genomes. We also used the diploid Setaria genome to evaluate the ongoing genome assembly of a related polyploid, switchgrass (Panicum virgatum).
Reference genome sequence of the model plant Setaria.

PubMed

Bennetzen, Jeffrey L; Schmutz, Jeremy; Wang, Hao; Percifield, Ryan; Hawkins, Jennifer; Pontaroli, Ana C; Estep, Matt; Feng, Liang; Vaughn, Justin N; Grimwood, Jane; Jenkins, Jerry; Barry, Kerrie; Lindquist, Erika; Hellsten, Uffe; Deshpande, Shweta; Wang, Xuewen; Wu, Xiaomei; Mitros, Therese; Triplett, Jimmy; Yang, Xiaohan; Ye, Chu-Yu; Mauro-Herrera, Margarita; Wang, Lin; Li, Pinghua; Sharma, Manoj; Sharma, Rita; Ronald, Pamela C; Panaud, Olivier; Kellogg, Elizabeth A; Brutnell, Thomas P; Doust, Andrew N; Tuskan, Gerald A; Rokhsar, Daniel; Devos, Katrien M

2012-05-13

We generated a high-quality reference genome sequence for foxtail millet (Setaria italica). The ∼400-Mb assembly covers ∼80% of the genome and >95% of the gene space. The assembly was anchored to a 992-locus genetic map and was annotated by comparison with >1.3 million expressed sequence tag reads. We produced more than 580 million RNA-Seq reads to facilitate expression analyses. We also sequenced Setaria viridis, the ancestral wild relative of S. italica, and identified regions of differential single-nucleotide polymorphism density, distribution of transposable elements, small RNA content, chromosomal rearrangement and segregation distortion. The genus Setaria includes natural and cultivated species that demonstrate a wide capacity for adaptation. The genetic basis of this adaptation was investigated by comparing five sequenced grass genomes. We also used the diploid Setaria genome to evaluate the ongoing genome assembly of a related polyploid, switchgrass (Panicum virgatum).

The opportunities and challenges of large-scale molecular approaches to songbird neurobiology

PubMed Central

Mello, C.V.; Clayton, D.F.

2014-01-01

High-throughput methods for analyzing genome structure and function are having a large impact in songbird neurobiology. Methods include genome sequencing and annotation, comparative genomics, DNA microarrays and transcriptomics, and the development of a brain atlas of gene expression. Key emerging findings include the identification of complex transcriptional programs active during singing, the robust brain expression of non-coding RNAs, evidence of profound variations in gene expression across brain regions, and the identification of molecular specializations within song production and learning circuits. Current challenges include the statistical analysis of large datasets, effective genome curation, the efficient localization of gene expression changes to specific neuronal circuits and cells, and the dissection of behavioral and environmental factors that influence brain gene expression. The field requires efficient methods for comparisons with organisms like chicken, which offer important anatomical, functional and behavioral contrasts. As sequencing costs plummet, opportunities emerge for comparative approaches that may help reveal evolutionary transitions contributing to vocal learning, social behavior and other properties that make songbirds such compelling research subjects. PMID:25280907
Comparative transcriptome analysis of microsclerotia development in Nomuraea rileyi.

PubMed

Song, Zhangyong; Yin, Youping; Jiang, Shasha; Liu, Juanjuan; Chen, Huan; Wang, Zhongkang

2013-06-19

Nomuraea rileyi is used as an environmental-friendly biopesticide. However, mass production and commercialization of this organism are limited due to its fastidious growth and sporulation requirements. When cultured in amended medium, we found that N. rileyi could produce microsclerotia bodies, replacing conidiophores as the infectious agent. However, little is known about the genes involved in microsclerotia development. In the present study, the transcriptomes were analyzed using next-generation sequencing technology to find the genes involved in microsclerotia development. A total of 4.69 Gb of clean nucleotides comprising 32,061 sequences was obtained, and 20,919 sequences were annotated (about 65%). Among the annotated sequences, only 5928 were annotated with 34 gene ontology (GO) functional categories, and 12,778 sequences were mapped to 165 pathways by searching against the Kyoto Encyclopedia of Genes and Genomes pathway (KEGG) database. Furthermore, we assessed the transcriptomic differences between cultures grown in minimal and amended medium. In total, 4808 sequences were found to be differentially expressed; 719 differentially expressed unigenes were assigned to 25 GO classes and 1888 differentially expressed unigenes were assigned to 161 KEGG pathways, including 25 enrichment pathways. Subsequently, we examined the up-regulation or uniquely expressed genes following amended medium treatment, which were also expressed on the enrichment pathway, and found that most of them participated in mediating oxidative stress homeostasis. To elucidate the role of oxidative stress in microsclerotia development, we analyzed the diversification of unigenes using quantitative reverse transcription-PCR (RT-qPCR). Our findings suggest that oxidative stress occurs during microsclerotia development, along with a broad metabolic activity change. Our data provide the most comprehensive sequence resource available for the study of N. rileyi. 
We believe that the transcriptome datasets will serve as an important public information platform to accelerate studies on N. rileyi microsclerotia.
Cloning of a MADS box gene (GhMADS3) from cotton and analysis of its homeotic role in transgenic tobacco.

PubMed

Guo, Yulong; Zhu, Qinlong; Zheng, Shangyong; Li, Mingyang

2007-06-01

A MADS box gene (GhMADS3) was cloned from cotton (Gossypium hirsutum L.) based on EST sequences. The predicted protein sequence of GhMADS3 showed 85%, 73%, and 62% identity with Theobroma cacao TcAG, Antirrhinum majus FAR, and Arabidopsis thaliana AG, respectively, and was grouped with AG homologues when the full length sequences excluding N-extensions were compared. GhMADS3 was expressed in the wild type cotton flower primarily in stamens and carpels, which was comparable to AG in Arabidopsis. However, it was not expressed in floral buds of a homeotic cotton variant chv1. Ectopic expression of GhMADS3 in tobacco (Nicotiana tabacum L.) resulted in flowers with sepal-to-carpel and petal-to-stamen transformation. The carpelloid first whorl organs, with stigmatic tissue on their upper edges, had a white appearance when compared with the dark green color of the wild type sepals. At times, long filaments were observed at the fusion site of the carpelloid first whorl organs. The staminoid second whorl organs were usually smaller than the wild type and the color was changed from pink to white. These results suggest that GhMADS3 has a homeotic role in flower development.
Porcine MYF6 gene: sequence, homology analysis, and variation in the promoter region.

PubMed

Wyszyńska-Koko, J; Kurył, J

2004-01-01

The MYF6 gene codes for the bHLH transcription factor belonging to the MyoD family. Its expression accompanies the processes of differentiation and maturation of myotubes during embryogenesis and continues at a relatively high level after birth, affecting the muscle phenotype. The porcine MYF6 gene was amplified and sequenced and compared with MYF6 gene sequences of other species. The amino acid sequence was deduced and an interspecies homology analysis was performed. Myf-6 protein shows a high conservation among species of 99 and 97% identity when comparing pig with cow and human, respectively, and of 93% when comparing pig with mouse and rat. A single nucleotide polymorphism (SNP) was revealed within the promoter region, which appeared to be a T --> C transition recognized by the MspI restriction enzyme.
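To illustrate how a T --> C transition can be detected by restriction digestion, the Python sketch below compares MspI digests of two alleles. MspI recognizes CCGG and cuts after the first C. The two short sequences are made up for this example, and the assumption that the C allele is the one completing the site is ours; the abstract does not give the flanking sequence.

```python
MSPI_SITE = "CCGG"  # MspI recognition site; cleavage occurs as C^CGG


def mspi_fragment_lengths(seq):
    """Return the fragment lengths produced by cutting a linear sequence with MspI."""
    seq = seq.upper()
    cuts = []
    pos = seq.find(MSPI_SITE)
    while pos != -1:
        cuts.append(pos + 1)  # cut between the first and second C
        pos = seq.find(MSPI_SITE, pos + 1)
    bounds = [0] + cuts + [len(seq)]
    return [end - start for start, end in zip(bounds, bounds[1:])]


# Hypothetical amplicons differing by a single T/C at index 6.
t_allele = "ATGCACTGGTTAGC"  # CTGG: no MspI site, amplicon stays intact
c_allele = "ATGCACCGGTTAGC"  # CCGG: MspI site present, amplicon is cut

print(mspi_fragment_lengths(t_allele))
print(mspi_fragment_lengths(c_allele))
```

In a PCR-RFLP assay the two alleles would then be distinguished by the number of bands seen after digestion.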
Genome-wide analyses of long noncoding RNA expression profiles correlated with radioresistance in nasopharyngeal carcinoma via next-generation deep sequencing.

PubMed

Li, Guo; Liu, Yong; Liu, Chao; Su, Zhongwu; Ren, Shuling; Wang, Yunyun; Deng, Tengbo; Huang, Donghai; Tian, Yongquan; Qiu, Yuanzheng

2016-09-06

Radioresistance is one of the major factors limiting the therapeutic efficacy and prognosis of patients with nasopharyngeal carcinoma (NPC). Accumulating evidence has suggested that aberrant expression of long noncoding RNAs (lncRNAs) contributes to cancer progression. Therefore, here we identified lncRNAs associated with radioresistance in NPC. The differential expression profiles of lncRNAs associated with NPC radioresistance were constructed by next-generation deep sequencing by comparing radioresistant NPC cells with their parental cells. LncRNA-related mRNAs were predicted and analyzed using bioinformatics algorithms compared with the mRNA profiles related to radioresistance obtained in our previous study. Several lncRNAs and associated mRNAs were validated in established NPC radioresistant cell models and NPC tissues. By comparison between radioresistant CNE-2-Rs and parental CNE-2 cells by next-generation deep sequencing, a total of 781 known lncRNAs and 2054 novel lncRNAs were annotated. The top five upregulated and downregulated known/novel lncRNAs were detected using quantitative real-time reverse transcription-polymerase chain reaction, and 7/10 known lncRNAs and 3/10 novel lncRNAs were demonstrated to have significant differential expression trends that were the same as those predicted by deep sequencing. From the prediction process, 13 pairs of lncRNAs and their associated genes were acquired, and the prediction trends of three pairs were validated in both radioresistant CNE-2-Rs and 6-10B-Rs cell lines, including lncRNA n373932 and SLITRK5, n409627 and PRSS12, and n386034 and RIMKLB. LncRNA n373932 and its related SLITRK5 showed dramatic expression changes in post-irradiation radioresistant cells and a negative expression correlation in NPC tissues (R = -0.595, p < 0.05). 
Our study provides an overview of the expression profiles of radioresistant lncRNAs and potentially related mRNAs, which will facilitate future investigations into the function of lncRNAs in NPC radioresistance.
EXP-PAC: providing comparative analysis and storage of next generation gene expression data.

PubMed

Church, Philip C; Goscinski, Andrzej; Lefèvre, Christophe

2012-07-01

Microarrays and, more recently, RNA sequencing have led to an increase in available gene expression data. How to manage and store this data is becoming a key issue. In response we have developed EXP-PAC, a web based software package for storage, management and analysis of gene expression and sequence data. Unique to this package is SQL based querying of gene expression data sets, distributed normalization of raw gene expression data and analysis of gene expression data across experiments and species. This package has been populated with lactation data in the international milk genomic consortium web portal (http://milkgenomics.org/). Source code is also available which can be hosted on a Windows, Linux or Mac APACHE server connected to a private or public network (http://mamsap.it.deakin.edu.au/~pcc/Release/EXP_PAC.html). Copyright © 2012 Elsevier Inc. All rights reserved.
Resources and Recommendations for Using Transcriptomics to Address Grand Challenges in Comparative Biology

PubMed Central

Mykles, Donald L.; Burnett, Karen G.; Durica, David S.; Joyce, Blake L.; McCarthy, Fiona M.; Schmidt, Carl J.; Stillman, Jonathon H.

2016-01-01

High-throughput RNA sequencing (RNA-seq) technology has become an important tool for studying physiological responses of organisms to changes in their environment. De novo assembly of RNA-seq data has allowed researchers to create a comprehensive catalog of genes expressed in a tissue and to quantify their expression without a complete genome sequence. The contributions from the “Tapping the Power of Crustacean Transcriptomics to Address Grand Challenges in Comparative Biology” symposium in this issue show the successes and limitations of using RNA-seq in the study of crustaceans. In conjunction with the symposium, the Animal Genome to Phenome Research Coordination Network collated comments from participants at the meeting regarding the challenges encountered when using transcriptomics in their research. Input came from novices and experts ranging from graduate students to principal investigators. Many were unaware of the bioinformatics analysis resources currently available on the CyVerse platform. Our analysis of community responses led to three recommendations for advancing the field: (1) integration of genomic and RNA-seq sequence assemblies for crustacean gene annotation and comparative expression; (2) development of methodologies for the functional analysis of genes; and (3) information and training exchange among laboratories for transmission of best practices. The field lacks the methods for manipulating tissue-specific gene expression. The decapod crustacean research community should consider the cherry shrimp, Neocaridina denticulata, as a decapod model for the application of transgenic tools for functional genomics. This would require a multi-investigator effort. PMID:27639274
Tissue-Specific Transcriptomics in the Field Cricket Teleogryllus oceanicus

PubMed Central

Bailey, Nathan W.; Veltsos, Paris; Tan, Yew-Foon; Millar, A. Harvey; Ritchie, Michael G.; Simmons, Leigh W.

2013-01-01

Field crickets (family Gryllidae) frequently are used in studies of behavioral genetics, sexual selection, and sexual conflict, but there have been no studies of transcriptomic differences among different tissue types. We evaluated transcriptome variation among testis, accessory gland, and the remaining whole-body preparations from males of the field cricket, Teleogryllus oceanicus. Non-normalized cDNA libraries from each tissue were sequenced on the Roche 454 platform, and a master assembly was constructed using testis, accessory gland, and whole-body preparations. A total of 940,200 reads were assembled into 41,962 contigs, to which 36,856 singletons (reads not assembled into a contig) were added to provide a total of 78,818 sequences used in annotation analysis. A total of 59,072 sequences (75%) were unique to one of the three tissues. Testis tissue had the greatest proportion of tissue-specific sequences (62.6%), followed by general body (56.43%) and accessory gland tissue (44.16%). We tested the hypothesis that tissues expressing gene products expected to evolve rapidly as a result of sexual selection—testis and accessory gland—would yield a smaller proportion of BLASTx matches to homologous genes in the model organism Drosophila melanogaster compared with whole-body tissue. Uniquely expressed sequences in both testis and accessory gland showed a significantly lower rate of matching to annotated D. melanogaster genes compared with those from general body tissue. These results correspond with empirical evidence that genes expressed in testis and accessory gland tissue are rapidly evolving targets of selection. PMID:23390599
Tissue-specific transcriptomics in the field cricket Teleogryllus oceanicus.

PubMed

Bailey, Nathan W; Veltsos, Paris; Tan, Yew-Foon; Millar, A Harvey; Ritchie, Michael G; Simmons, Leigh W

2013-02-01

Field crickets (family Gryllidae) frequently are used in studies of behavioral genetics, sexual selection, and sexual conflict, but there have been no studies of transcriptomic differences among different tissue types. We evaluated transcriptome variation among testis, accessory gland, and the remaining whole-body preparations from males of the field cricket, Teleogryllus oceanicus. Non-normalized cDNA libraries from each tissue were sequenced on the Roche 454 platform, and a master assembly was constructed using testis, accessory gland, and whole-body preparations. A total of 940,200 reads were assembled into 41,962 contigs, to which 36,856 singletons (reads not assembled into a contig) were added to provide a total of 78,818 sequences used in annotation analysis. A total of 59,072 sequences (75%) were unique to one of the three tissues. Testis tissue had the greatest proportion of tissue-specific sequences (62.6%), followed by general body (56.43%) and accessory gland tissue (44.16%). We tested the hypothesis that tissues expressing gene products expected to evolve rapidly as a result of sexual selection--testis and accessory gland--would yield a smaller proportion of BLASTx matches to homologous genes in the model organism Drosophila melanogaster compared with whole-body tissue. Uniquely expressed sequences in both testis and accessory gland showed a significantly lower rate of matching to annotated D. melanogaster genes compared with those from general body tissue. These results correspond with empirical evidence that genes expressed in testis and accessory gland tissue are rapidly evolving targets of selection.
Comparison of Five Major Trichome Regulatory Genes in Brassica villosa with Orthologues within the Brassicaceae

PubMed Central

Nayidu, Naghabushana K.; Kagale, Sateesh; Taheri, Ali; Withana-Gamage, Thushan S.; Parkin, Isobel A. P.; Sharpe, Andrew G.; Gruber, Margaret Y.

2014-01-01

Coding sequences for major trichome regulatory genes, including the positive regulators GLABRA 1 (GL1), GLABRA 2 (GL2), ENHANCER OF GLABRA 3 (EGL3), and TRANSPARENT TESTA GLABRA 1 (TTG1) and the negative regulator TRIPTYCHON (TRY), were cloned from wild Brassica villosa, which is characterized by dense trichome coverage over most of the plant. Transcript (FPKM) levels from RNA sequencing indicated much higher expression of the GL2 and TTG1 regulatory genes in B. villosa leaves compared with expression levels of GL1 and EGL3 genes in either B. villosa or the reference genome species, glabrous B. oleracea; however, cotyledon TTG1 expression was high in both species. RNA sequencing and Q-PCR also revealed an unusual expression pattern for the negative regulators TRY and CPC, which were much more highly expressed in trichome-rich B. villosa leaves than in glabrous B. oleracea leaves and in glabrous cotyledons from both species. The B. villosa TRY expression pattern also contrasted with TRY expression patterns in two diploid Brassica species, and with the Arabidopsis model for expression of negative regulators of trichome development. Further unique sequence polymorphisms, protein characteristics, and gene evolution studies highlighted specific amino acids in GL1 and GL2 coding sequences that distinguished glabrous species from hairy species and several variants that were specific for each B. villosa gene. Positive selection was observed for GL1 between hairy and non-hairy plants, and as expected the origin of the four expressed positive trichome regulatory genes in B. villosa was predicted to be from B. oleracea. In particular the unpredicted expression patterns for TRY and CPC in B. villosa suggest additional characterization is needed to determine the function of the expanded families of trichome regulatory genes in more complex polyploid species within the Brassicaceae. PMID:24755905
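The FPKM values cited above follow the standard definition: fragments mapped to a gene, per kilobase of transcript length, per million mapped fragments in the library. A minimal Python sketch of that formula follows; the counts and gene length are hypothetical and are not taken from the study.

```python
def fpkm(gene_fragments, gene_length_bp, total_mapped_fragments):
    """Fragments per kilobase of transcript per million mapped fragments."""
    if gene_length_bp <= 0 or total_mapped_fragments <= 0:
        raise ValueError("length and library size must be positive")
    # 1e9 = 1e3 (bases per kilobase) * 1e6 (fragments per million)
    return gene_fragments * 1e9 / (gene_length_bp * total_mapped_fragments)


# Hypothetical gene: 500 fragments on a 2,000-bp transcript
# in a library of 10 million mapped fragments.
print(fpkm(500, 2000, 10_000_000))
```

Normalizing by length and depth is what allows expression of different genes, such as GL2 and GL1, to be compared within and between libraries.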
Gene expression analysis of flax seed development

PubMed Central

2011-01-01

Background Flax, Linum usitatissimum L., is an important crop whose seed oil and stem fiber have multiple industrial applications. Flax seeds are also well-known for their nutritional attributes, viz., omega-3 fatty acids in the oil and lignans and mucilage from the seed coat. In spite of the importance of this crop, there are few molecular resources that can be utilized toward improving seed traits. Here, we describe flax embryo and seed development and generation of comprehensive genomic resources for the flax seed. Results We describe a large-scale generation and analysis of expressed sequences in various tissues. Collectively, the 13 libraries we have used provide a broad representation of genes active in developing embryos (globular, heart, torpedo, cotyledon and mature stages) seed coats (globular and torpedo stages) and endosperm (pooled globular to torpedo stages) and genes expressed in flowers, etiolated seedlings, leaves, and stem tissue. A total of 261,272 expressed sequence tags (EST) (GenBank accessions LIBEST_026995 to LIBEST_027011) were generated. These EST libraries included transcription factor genes that are typically expressed at low levels, indicating that the depth is adequate for in silico expression analysis. Assembly of the ESTs resulted in 30,640 unigenes and 82% of these could be identified on the basis of homology to known and hypothetical genes from other plants. When compared with fully sequenced plant genomes, the flax unigenes resembled poplar and castor bean more than grape, sorghum, rice or Arabidopsis. Nearly one-fifth of these (5,152) had no homologs in sequences reported for any organism, suggesting that this category represents genes that are likely unique to flax. Digital analyses revealed gene expression dynamics for the biosynthesis of a number of important seed constituents during seed development. 
Conclusions We have developed a foundational database of expressed sequences and collection of plasmid clones that comprise even low-expressed genes such as those encoding transcription factors. This has allowed us to delineate the spatio-temporal aspects of gene expression underlying the biosynthesis of a number of important seed constituents in flax. Flax belongs to a taxonomic group of diverse plants and the large sequence database will allow for evolutionary studies as well. PMID:21529361
High-throughput sequencing of natively paired antibody chains provides evidence for original antigenic sin shaping the antibody response to influenza vaccination.

PubMed

Tan, Yann-Chong; Blum, Lisa K; Kongpachith, Sarah; Ju, Chia-Hsin; Cai, Xiaoyong; Lindstrom, Tamsin M; Sokolove, Jeremy; Robinson, William H

2014-03-01

We developed a DNA barcoding method to enable high-throughput sequencing of the cognate heavy- and light-chain pairs of the antibodies expressed by individual B cells. We used this approach to elucidate the plasmablast antibody response to influenza vaccination. We show that >75% of the rationally selected plasmablast antibodies bind and neutralize influenza, and that antibodies from clonal families, defined by sharing both heavy-chain VJ and light-chain VJ sequence usage, do so most effectively. Vaccine-induced heavy-chain VJ regions contained on average >20 nucleotide mutations as compared to their predicted germline gene sequences, and some vaccine-induced antibodies exhibited higher binding affinities for hemagglutinins derived from prior years' seasonal influenza as compared to their affinities for the immunization strains. Our results show that influenza vaccination induces the recall of memory B cells that express antibodies that previously underwent affinity maturation against prior years' seasonal influenza, suggesting that 'original antigenic sin' shapes the antibody response to influenza vaccination. Published by Elsevier Inc.
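The abstract defines a clonal family as antibodies that share both heavy-chain VJ and light-chain VJ usage. As a minimal sketch of that grouping step only, the Python below buckets paired records by a four-part key. The record fields (`id`, `heavy_v`, `heavy_j`, `light_v`, `light_j`), the gene names in the usage note, and the rule that a family needs at least two members are assumptions made for illustration; they are not taken from the paper.

```python
from collections import defaultdict


def group_clonal_families(antibodies, min_members=2):
    """Group paired antibodies by shared heavy-chain VJ and light-chain VJ usage.

    `antibodies` is an iterable of dicts with the hypothetical keys
    "id", "heavy_v", "heavy_j", "light_v" and "light_j".
    """
    families = defaultdict(list)
    for ab in antibodies:
        key = (ab["heavy_v"], ab["heavy_j"], ab["light_v"], ab["light_j"])
        families[key].append(ab["id"])
    # Assumption: a "family" needs at least `min_members` antibodies.
    return {k: ids for k, ids in families.items() if len(ids) >= min_members}
```

For example, two records sharing `("IGHV3-23", "IGHJ4", "IGKV1-39", "IGKJ1")` would land in one family, while a record with a different light-chain V gene would be left out as a singleton.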
Comparative 454 pyrosequencing of transcripts from two olive genotypes during fruit development

PubMed Central

Alagna, Fiammetta; D'Agostino, Nunzio; Torchia, Laura; Servili, Maurizio; Rao, Rosa; Pietrella, Marco; Giuliano, Giovanni; Chiusano, Maria Luisa; Baldoni, Luciana; Perrotta, Gaetano

2009-01-01

Background Despite its primary economic importance, genomic information on olive tree is still lacking. 454 pyrosequencing was used to enrich the very few sequence data currently available for the Olea europaea species and to identify genes involved in expression of fruit quality traits. Results Fruits of Coratina, a widely cultivated variety characterized by a very high phenolic content, and Tendellone, an oleuropein-lacking natural variant, were used as starting material for monitoring the transcriptome. Four different cDNA libraries were sequenced, respectively at the beginning and at the end of drupe development. A total of 261,485 reads were obtained, for an output of about 58 Mb. Raw sequence data were processed using a four step pipeline procedure and data were stored in a relational database with a web interface. Conclusion Massively parallel sequencing of different fruit cDNA collections has provided large scale information about the structure and putative function of gene transcripts accumulated during fruit development. Comparative transcript profiling allowed the identification of differentially expressed genes with potential relevance in regulating the fruit metabolism and phenolic content during ripening. PMID:19709400
Ribosomal binding site sequences and promoters for expressing glutamate decarboxylase and producing γ-aminobutyrate in Corynebacterium glutamicum.

PubMed

Shi, Feng; Luan, Mingyue; Li, Yongfu

2018-04-18

Glutamate decarboxylase (GAD) converts L-glutamate (Glu) into γ-aminobutyric acid (GABA). Corynebacterium glutamicum that expresses an exogenous GAD gene, gadB2 or gadB1, can synthesize GABA from the Glu it produces. To enhance GABA production in C. glutamicum, ribosomal binding site (RBS) sequences and promoters were searched and optimized for increasing the expression efficiency of gadB2. R4 exhibited the highest strength among the RBS sequences tested, with 6 nt as the optimal aligned spacing (AS) between RBS and start codon. This combination of RBS sequence and AS contributed to gadB2 expression, increasing GAD activity by 156% and GABA production by 82% compared to the normal strong RBS and AS combination. Then, a series of native promoters were selected for transcribing gadB2 under the optimal RBS and AS combination. PdnaK, PdtsR, PodhI and PclgR expressed gadB2 and produced GABA as effectively as the widely applied Ptuf and PcspB promoters and more effectively than the Psod promoter. However, none of the native promoters worked as well as the synthetic strong promoter PtacM, which produced 20.2 ± 0.3 g/L GABA. Even with prolonged length and bicistronic architecture, the strength of PdnaK did not increase. Finally, gadB2 and mutant gadB1 were co-expressed under the optimal promoter and RBS combination, which converted Glu into GABA completely and improved GABA production to more than 25 g/L. This study provides useful promoters and RBS sequences for gene expression in C. glutamicum.
Decreased expression of cell adhesion genes in cancer stem-like cells isolated from primary oral squamous cell carcinomas.

PubMed

Mishra, Amrendra; Sriram, Harshini; Chandarana, Pinal; Tanavde, Vivek; Kumar, Rekha V; Gopinath, Ashok; Govindarajan, Raman; Ramaswamy, S; Sadasivam, Subhashini

2018-05-01

The goal of this study was to isolate cancer stem-like cells marked by high expression of CD44, a putative cancer stem cell marker, from primary oral squamous cell carcinomas and identify distinctive gene expression patterns in these cells. From 1 October 2013 to 4 September 2015, 76 stage III-IV primary oral squamous cell carcinomas of the gingivobuccal sulcus were resected. In all, 13 tumours were analysed by immunohistochemistry to visualise CD44-expressing cells. Expression of CD44 within The Cancer Genome Atlas-Head and Neck Squamous Cell Carcinoma RNA-sequencing data was also assessed. Seventy resected tumours were dissociated into single cells and stained with antibodies to CD44 as well as CD45 and CD31 (together referred to as Lineage/Lin). From 45 of these, CD44+Lin- and CD44-Lin- subpopulations were successfully isolated using fluorescence-activated cell sorting, and good-quality RNA was obtained from 14 such sorted pairs. Libraries from five pairs were sequenced and the results analysed using bioinformatics tools. Reverse transcription quantitative polymerase chain reaction was performed to experimentally validate the differential expression of selected candidate genes identified from the transcriptome sequencing in the same 5 and an additional 9 tumours. CD44 was expressed on the surface of poorly differentiated tumour cells, and within The Cancer Genome Atlas-Head and Neck Squamous Cell Carcinoma samples, its messenger RNA levels were higher in tumours compared to normal. Transcriptomics revealed that 102 genes were upregulated and 85 genes were downregulated in CD44+Lin- compared to CD44-Lin- cells in at least 3 of the 5 tumours sequenced. The upregulated genes included those involved in immune regulation, while the downregulated genes were enriched for genes involved in cell adhesion. Decreased expression of PCDH18, MGP, SPARCL1 and KRTDAP was confirmed by reverse transcription quantitative polymerase chain reaction.
Lower expression of the cell-cell adhesion molecule PCDH18 correlated with poorer overall survival in The Cancer Genome Atlas-Head and Neck Squamous Cell Carcinoma data, highlighting it as a potential negative prognostic factor in this cancer.
[Expression of human-mouse chimeric antibody directed against Chikungunya virus with site-specific integration system].

PubMed

Li, Jian-min; Chen, Wei; Jia, Xiu-jie; An, Xiao-ping; Li, Bing; Fan, Ying-ru; Tong, Yi-gang

2005-05-01

The aim was to obtain a CHO/dhfr(-) cell line with an FRT sequence integrated at a transcriptionally active chromosomal site, and to use that cell line to express a human-mouse chimeric antibody directed against Chikungunya virus. A fusion gene of FRT and HBsAg was constructed by PCR and cloned into the MCS of pCI-neo to construct pCI-FRT-HBsAg. pCI-FRT-HBsAg was transfected into CHO/dhfr(-) cells, and cell clones with high expression of HBsAg were screened by measuring the amount of HBsAg with ELISA. The CHO cell clone with the highest expression was chosen and named CHO/dhfr(-) FRT(+). pAFRT HFLF, an expression plasmid for the chimeric antibody carrying the FRT sequence, was transfected into CHO/dhfr(-) FRT(+) cells, and cell clones with high expression of the chimeric antibody were screened by increasing concentrations of MTX. A CHO cell clone with high expression of the chimeric antibody was cultured at large scale and the supernatant was collected, from which the chimeric antibody was purified. The purified chimeric antibody was analyzed by SDS-PAGE, Western blot and IFA. A CHO/dhfr(-) cell line with an FRT sequence integrated at a transcriptionally active chromosomal site was obtained successfully. A cell clone with a yield of 5 mg/L of chimeric antibody was obtained, compared with a yield of 2 mg/L for the routine CHO cell expression system. With this cell line, a human-mouse chimeric antibody directed against Chikungunya virus was expressed. This system lays a solid foundation for expressing antibodies and other proteins.
Comparative immunogenomics of molluscs.

PubMed

Schultz, Jonathan H; Adema, Coen M

2017-10-01

Comparative immunology, studying both vertebrates and invertebrates, provided the earliest descriptions of phagocytosis as a general immune mechanism. However, the large scale of animal diversity challenges all-inclusive investigations and the field of immunology has developed by mostly emphasizing study of a few vertebrate species. In addressing the lack of comprehensive understanding of animal immunity, especially that of invertebrates, comparative immunology helps toward management of invertebrates that are food sources, agricultural pests, pathogens, or transmit diseases, and helps interpret the evolution of animal immunity. Initial studies showed that the Mollusca (second largest animal phylum), and invertebrates in general, possess innate defenses but lack the lymphocytic immune system that characterizes vertebrate immunology. Recognizing the reality of both common and taxon-specific immune features, and applying up-to-date cell and molecular research capabilities, in-depth studies of a select number of bivalve and gastropod species continue to reveal novel aspects of molluscan immunity. The genomics era heralded a new stage of comparative immunology; large-scale efforts yielded an initial set of full molluscan genome sequences that is available for analyses of full complements of immune genes and regulatory sequences. Next-generation sequencing (NGS), due to lower cost and effort required, allows individual researchers to generate large sequence datasets for growing numbers of molluscs. RNAseq provides expression profiles that enable discovery of immune genes and genome sequences reveal distribution and diversity of immune factors across molluscan phylogeny. Although computational de novo sequence assembly will benefit from continued development and automated annotation may require some experimental validation, NGS is a powerful tool for comparative immunology, especially increasing coverage of the extensive molluscan diversity. 
To date, immunogenomics revealed new levels of complexity of molluscan defense by indicating sequence heterogeneity in individual snails and bivalves, and members of expanded immune gene families are expressed differentially to generate pathogen-specific defense responses. Copyright © 2017 Elsevier Ltd. All rights reserved.
Comparative transcriptome analysis of lufenuron-resistant and susceptible strains of Spodoptera frugiperda (Lepidoptera: Noctuidae).

PubMed

do Nascimento, Antonio Rogério Bezerra; Fresia, Pablo; Cônsoli, Fernando Luis; Omoto, Celso

2015-11-21

The evolution of insecticide resistance in Spodoptera frugiperda (Lepidoptera: Noctuidae) has resulted in large economic losses and disturbances to the environment and agroecosystems. Resistance to lufenuron, a chitin biosynthesis inhibitor insecticide, was recently documented in Brazilian populations of S. frugiperda. Thus, we utilized large-scale cDNA sequencing (RNA-Seq analysis) to compare the pattern of gene expression between lufenuron-resistant (LUF-R) and susceptible (LUF-S) S. frugiperda larvae in an attempt to identify the molecular basis behind the resistance mechanism(s) of S. frugiperda to this insecticide. A transcriptome was assembled using approximately 19.6 million 100 bp-long single-end reads, which generated 18,506 transcripts with an N50 of 996 bp. A search against the NCBI non-redundant database generated 51.1% (9,457) functionally annotated transcripts. A large portion of the alignments were homologous to insects, with the majority (45%) being similar to sequences of Bombyx mori (Lepidoptera: Bombycidae). Moreover, 10% of the alignments were similar to sequences of various species of Spodoptera (Lepidoptera: Noctuidae), with 3% of them being similar to sequences of S. frugiperda. A comparative analysis of the gene expression between LUF-R and LUF-S S. frugiperda larvae identified 940 differentially expressed transcripts (p ≤ 0.05, t-test; fold change ≥ 4). Six of them were associated with cuticle metabolism. Of those, four were overexpressed in LUF-R larvae. The machinery involved with the detoxification process was represented by 35 differentially expressed transcripts; 24 of them belonging to P450 monooxygenases, four to glutathione-S-transferases, six to carboxylases and one to sulfotransferases. RNA-Seq analysis was validated for a number of selected candidate transcripts by using quantitative real time PCR (qPCR). The gene expression profile of LUF-R larvae of S. frugiperda differs from LUF-S larvae.
In general, gene expression is much higher in resistant larvae when compared to the susceptible ones, particularly for those genes involved with pathways for xenobiotic detoxification, mainly represented by P450 monooxygenases transcripts. Our data indicate that enzymes involved with the detoxification process, and mostly the P450, are one of the resistance mechanisms employed by the LUF-R S. frugiperda larvae against lufenuron.
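The abstract selects differentially expressed transcripts with two thresholds (p ≤ 0.05, fold change ≥ 4). The following Python is a minimal sketch of such a two-threshold filter, not the authors' pipeline. It assumes the p-values have already been computed elsewhere, that expression values are positive, and that the fold-change cutoff applies in both directions; the tuple layout and the function name `filter_differential` are hypothetical.

```python
def filter_differential(transcripts, max_p=0.05, min_fold=4.0):
    """Keep transcripts passing both a p-value and a fold-change threshold.

    `transcripts` is an iterable of hypothetical tuples:
    (name, mean_expression_resistant, mean_expression_susceptible, p_value).
    """
    selected = []
    for name, resistant, susceptible, p_value in transcripts:
        if resistant <= 0 or susceptible <= 0:
            # Ratio is undefined; a real pipeline would handle this case explicitly.
            continue
        # Assumption: the threshold is symmetric (up- or down-regulated).
        fold = max(resistant, susceptible) / min(resistant, susceptible)
        if p_value <= max_p and fold >= min_fold:
            direction = "up in LUF-R" if resistant > susceptible else "down in LUF-R"
            selected.append((name, fold, direction))
    return selected
```

A transcript with means of 80 and 10 and p = 0.01 would pass as an 8-fold increase in LUF-R, whereas one with a 3-fold difference or with p = 0.2 would be dropped.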
Comparative mapping in the Pinaceae

Treesearch

Konstantin V. Krutovsky; Michela Troggio; Garth R. Brown; Kathleen D. Jermstad; David B. Neale

2004-01-01

A comparative genetic map was constructed between two important genera of the family Pinaceae. Ten homologous linkage groups in loblolly pine (Pinus taeda L.) and Douglas fir (Pseudotsuga menziesii [Mirb.] Franco) were identified using orthologous expressed sequence tag polymorphism (ESTP) and restriction fragment length polymorphism (RFLP) markers. The comparative...
Phylum-Level Conservation of Regulatory Information in Nematodes despite Extensive Non-coding Sequence Divergence

PubMed Central

Gordon, Kacy L.; Arthur, Robert K.; Ruvinsky, Ilya

2015-01-01

Gene regulatory information guides development and shapes the course of evolution. To test conservation of gene regulation within the phylum Nematoda, we compared the functions of putative cis-regulatory sequences of four sets of orthologs (unc-47, unc-25, mec-3 and elt-2) from distantly-related nematode species. These species, Caenorhabditis elegans, its congeneric C. briggsae, and three parasitic species Meloidogyne hapla, Brugia malayi, and Trichinella spiralis, represent four of the five major clades in the phylum Nematoda. Despite the great phylogenetic distances sampled and the extensive sequence divergence of nematode genomes, all but one of the regulatory elements we tested are able to drive at least a subset of the expected gene expression patterns. We show that functionally conserved cis-regulatory elements have no more extended sequence similarity to their C. elegans orthologs than would be expected by chance, but they do harbor motifs that are important for proper expression of the C. elegans genes. These motifs are too short to be distinguished from the background level of sequence similarity, and while identical in sequence they are not conserved in orientation or position. Functional tests reveal that some of these motifs contribute to proper expression. Our results suggest that conserved regulatory circuitry can persist despite considerable turnover within cis elements. PMID:26020930

Multiple regulatory mechanisms of hepatocyte growth factor expression in malignant cells with a short poly(dA) sequence in the HGF gene promoter.

PubMed

Sakai, Kazuko; Takeda, Masayuki; Okamoto, Isamu; Nakagawa, Kazuhiko; Nishio, Kazuto

2015-01-01

Hepatocyte growth factor (HGF) expression is a poor prognostic factor in various types of cancer. Expression levels of HGF have been reported to be regulated by shorter poly(dA) sequences in the promoter region. In the present study, the poly(dA) mononucleotide tract in various types of human cancer cell lines was examined and compared with the HGF expression levels in those cells. Short deoxyadenosine repeat sequences were detected in five of the 55 cell lines used in the present study. The H69, IM95, CCK-81, Sui73 and H28 cells exhibited a truncated poly(dA) sequence in which the number of poly(dA) repeats was reduced by ≥5 bp. Two of the cell lines exhibited high HGF expression, determined by reverse transcription quantitative polymerase chain reaction and enzyme-linked immunosorbent assay. The CCK-81, Sui73 and H28 cells with shorter poly(dA) sequences exhibited low HGF expression. The cause of the suppression of HGF expression in the CCK-81, Sui73 and H28 cells was clarified by two approaches, suppression by methylation and single nucleotide polymorphisms in the HGF gene. Exposure to 5-Aza-dC, an inhibitor of DNA methyltransferase 1, induced an increased expression of HGF in the CCK-81 cells, but not in the other cells. Single-nucleotide polymorphism (SNP) rs72525097 in intron 1 was detected in the Sui73 and H28 cells. Taken together, it was found that the defect of poly(dA) in the HGF promoter was present in various types of cancer, including lung, stomach, colorectal, pancreas and mesothelioma. The present study proposes the negative regulation mechanisms by methylation and SNP in intron 1 of HGF for HGF expression in cancer cells with short poly(dA).
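The abstract classifies a poly(dA) tract as truncated when it is shortened by at least 5 bp. As an illustrative sketch only, the Python below measures the longest run of A in a sequence and compares it with a reference tract length. The reference length is not stated in the abstract, so `reference_length` is a caller-supplied assumption, and both function names are hypothetical.

```python
def longest_a_run(sequence):
    """Return the length of the longest uninterrupted run of 'A' in a DNA string."""
    best = current = 0
    for base in sequence.upper():
        current = current + 1 if base == "A" else 0
        best = max(best, current)
    return best


def is_truncated(sequence, reference_length, min_loss=5):
    """True if the longest A-run is at least `min_loss` bp shorter than the reference.

    `reference_length` is an assumed wild-type tract length supplied by the caller.
    """
    return reference_length - longest_a_run(sequence) >= min_loss
```

This treats any interruption as the end of the tract; real genotyping from sequencing traces would also need to tolerate read errors within the repeat.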
Transcriptome Sequencing of Gracilariopsis lemaneiformis to Analyze the Genes Related to Optically Active Phycoerythrin Synthesis.

PubMed

Huang, Xiaoyun; Zang, Xiaonan; Wu, Fei; Jin, Yuming; Wang, Haitao; Liu, Chang; Ding, Yating; He, Bangxiang; Xiao, Dongfang; Song, Xinwei; Liu, Zhu

2017-01-01

Gracilariopsis lemaneiformis (aka Gracilaria lemaneiformis) is a red macroalga rich in phycoerythrin, which can capture light efficiently and transfer it to photosystem II. However, little is known about the synthesis of optically active phycoerythrin in G. lemaneiformis at the molecular level. With the advent of high-throughput sequencing technology, analysis of genetic information for G. lemaneiformis by transcriptome sequencing is an effective means to get a deeper insight into the molecular mechanism of phycoerythrin synthesis. Illumina technology was employed to sequence the transcriptome of two strains of G. lemaneiformis: the wild type and a green-pigmented mutant. We obtained a total of 86,915 assembled unigenes as a reference gene set, and 42,884 unigenes were annotated in at least one public database. Taking the above transcriptome sequencing as a reference gene set, 4,041 differentially expressed genes were screened to analyze and compare the gene expression profiles of the wild type and green mutant. By GO and KEGG pathway analysis, we concluded that three factors, including a reduction in the expression level of apo-phycoerythrin, an increase of chlorophyll light-harvesting complex synthesis, and reduction of phycoerythrobilin by competitive inhibition, caused the reduction of optically active phycoerythrin in the green-pigmented mutant.
Molecular evolution of adiponectin in Carnivora and its mRNA expression in relation to hepatic lipidosis.

PubMed

Nieminen, Petteri; Rouvinen-Watt, Kirsti; Kapiainen, Suvi; Harris, Lora; Mustonen, Anne-Mari

2010-09-15

Adiponectin is a novel adipocyte-derived hormone with low circulating concentrations and/or mRNA expression in obesity and non-alcoholic fatty liver disease (NAFLD). The adiponectin mRNA of several Carnivora species was sequenced to enable further gene expression studies in this clade with potential experimental species to examine the connections of hypoadiponectinemia to hepatic lipidosis. In addition, adiponectin mRNA expression was studied in the retroperitoneal fat of the American mink (Neovison vison), as hepatic lipidosis with close similarities to NAFLD can be rapidly induced in the species by fasting. The mRNA expression was determined after overnight-7d of food deprivation and 28d of re-feeding and correlated to the liver fat %. The homologies between the determined carnivoran mRNA sequences and that of the domestic dog were 92.2-99.1%. As the mRNA expression was not affected by short-term fasting and did not correlate with the liver fat %, there seems to be no clear connection between adiponectin and the development of lipidosis in the American mink. In the future, the obtained sequences can be utilized in further studies of adiponectin expression in comparative endocrinology. Copyright (c) 2010 Elsevier Inc. All rights reserved.
De novo transcriptome sequencing of axolotl blastema for identification of differentially expressed genes during limb regeneration

PubMed Central

2013-01-01

Background Salamanders are unique among vertebrates in their ability to completely regenerate amputated limbs through the mediation of blastema cells located at the stump ends. This regeneration is nerve-dependent because blastema formation and regeneration does not occur after limb denervation. To obtain the genomic information of blastema tissues, de novo transcriptomes from both blastema tissues and denervated stump ends of Ambystoma mexicanum (axolotls) 14 days post-amputation were sequenced and compared using Solexa DNA sequencing. Results The sequencing done for this study produced 40,688,892 reads that were assembled into 307,345 transcribed sequences. The N50 of transcribed sequence length was 562 bases. A similarity search with known proteins identified 39,200 different genes to be expressed during limb regeneration with a cut-off E-value exceeding 10^-5. We annotated assembled sequences by using gene descriptions, gene ontology, and clusters of orthologous group terms. Targeted searches using these annotations showed that the majority of the genes were in the categories of essential metabolic pathways, transcription factors and conserved signaling pathways, and novel candidate genes for regenerative processes. We discovered and confirmed numerous sequences of the candidate genes by using quantitative polymerase chain reaction and in situ hybridization. Conclusion The results of this study demonstrate that de novo transcriptome sequencing allows gene expression analysis in a species lacking genome information and provides the most comprehensive mRNA sequence resources for axolotls. The characterization of the axolotl transcriptome can help elucidate the molecular mechanisms underlying blastema formation during limb regeneration. PMID:23815514
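The N50 reported above is a standard assembly summary statistic: the sequence length at which the sequences of that length or longer together cover at least half of the total assembled bases. A minimal Python sketch under that common definition follows; the function name `n50` is illustrative, and assemblers may differ in small details such as minimum-length filtering.

```python
def n50(lengths):
    """Return the N50 of a collection of sequence lengths.

    Sequences are sorted from longest to shortest; N50 is the length of the
    sequence at which the running total first reaches half of all bases.
    """
    ordered = sorted(lengths, reverse=True)
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half_total:
            return length
    raise ValueError("n50() requires at least one sequence length")
```

For lengths 2, 3, 4, 5 and 6 (20 bases in total), the two longest sequences cover 11 bases, which passes the halfway mark of 10, so the N50 is 5.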
The Eimeria Transcript DB: an integrated resource for annotated transcripts of protozoan parasites of the genus Eimeria

PubMed Central

Rangel, Luiz Thibério; Novaes, Jeniffer; Durham, Alan M.; Madeira, Alda Maria B. N.; Gruber, Arthur

2013-01-01

Parasites of the genus Eimeria infect a wide range of vertebrate hosts, including chickens. We have recently reported a comparative analysis of the transcriptomes of Eimeria acervulina, Eimeria maxima and Eimeria tenella, integrating ORESTES data produced by our group and publicly available Expressed Sequence Tags (ESTs). All cDNA reads have been assembled, and the reconstructed transcripts have been submitted to a comprehensive functional annotation pipeline. Additional studies included orthology assignment across apicomplexan parasites and clustering analyses of gene expression profiles among different developmental stages of the parasites. To make all this body of information publicly available, we constructed the Eimeria Transcript Database (EimeriaTDB), a web repository that provides access to sequence data, annotation and comparative analyses. Here, we describe the web interface, available sequence data sets and query tools implemented on the site. The main goal of this work is to offer a public repository of sequence and functional annotation data of reconstructed transcripts of parasites of the genus Eimeria. We believe that EimeriaTDB will represent a valuable and complementary resource for the Eimeria scientific community and for those researchers interested in comparative genomics of apicomplexan parasites. Database URL: http://www.coccidia.icb.usp.br/eimeriatdb/ PMID:23411718
Resources and Recommendations for Using Transcriptomics to Address Grand Challenges in Comparative Biology.

PubMed

Mykles, Donald L; Burnett, Karen G; Durica, David S; Joyce, Blake L; McCarthy, Fiona M; Schmidt, Carl J; Stillman, Jonathon H

2016-12-01

High-throughput RNA sequencing (RNA-seq) technology has become an important tool for studying physiological responses of organisms to changes in their environment. De novo assembly of RNA-seq data has allowed researchers to create a comprehensive catalog of genes expressed in a tissue and to quantify their expression without a complete genome sequence. The contributions from the "Tapping the Power of Crustacean Transcriptomics to Address Grand Challenges in Comparative Biology" symposium in this issue show the successes and limitations of using RNA-seq in the study of crustaceans. In conjunction with the symposium, the Animal Genome to Phenome Research Coordination Network collated comments from participants at the meeting regarding the challenges encountered when using transcriptomics in their research. Input came from novices and experts ranging from graduate students to principal investigators. Many were unaware of the bioinformatics analysis resources currently available on the CyVerse platform. Our analysis of community responses led to three recommendations for advancing the field: (1) integration of genomic and RNA-seq sequence assemblies for crustacean gene annotation and comparative expression; (2) development of methodologies for the functional analysis of genes; and (3) information and training exchange among laboratories for transmission of best practices. The field lacks the methods for manipulating tissue-specific gene expression. The decapod crustacean research community should consider the cherry shrimp, Neocaridina denticulata, as a decapod model for the application of transgenic tools for functional genomics. This would require a multi-investigator effort. © The Author 2016. Published by Oxford University Press on behalf of the Society for Integrative and Comparative Biology. All rights reserved. For permissions please email: journals.permissions@oup.com.
Identification of tissue-specific, abiotic stress-responsive gene expression patterns in wine grape (Vitis vinifera L.) based on curation and mining of large-scale EST data sets

PubMed Central

2011-01-01

Background Abiotic stresses, such as water deficit and soil salinity, result in changes in physiology, nutrient use, and vegetative growth in vines, and ultimately, yield and flavor in berries of wine grape, Vitis vinifera L. Large-scale expressed sequence tags (ESTs) were generated, curated, and analyzed to identify major genetic determinants responsible for stress-adaptive responses. Although roots serve as the first site of perception and/or injury for many types of abiotic stress, EST sequencing in root tissues of wine grape exposed to abiotic stresses has been extremely limited to date. To overcome this limitation, large-scale EST sequencing was conducted from root tissues exposed to multiple abiotic stresses. Results A total of 62,236 expressed sequence tags (ESTs) were generated from leaf, berry, and root tissues from vines subjected to abiotic stresses and compared with 32,286 ESTs sequenced from 20 public cDNA libraries. Curation to correct annotation errors, clustering and assembly of the berry and leaf ESTs with currently available V. vinifera full-length transcripts and ESTs yielded a total of 13,278 unique sequences, with 2,302 singletons and 10,976 mapped to V. vinifera gene models. Of these, 739 transcripts were found to have significant differential expression in stressed leaves and berries including 250 genes not described previously as being abiotic stress responsive. In a second analysis of 16,452 ESTs from a normalized root cDNA library derived from roots exposed to multiple, short-term, abiotic stresses, 135 genes with root-enriched expression patterns were identified on the basis of their EST abundance in roots relative to other tissues.
Conclusions The large-scale analysis of relative EST frequency counts among a diverse collection of 23 different cDNA libraries from leaf, berry, and root tissues of wine grape exposed to a variety of abiotic stress conditions revealed distinct, tissue-specific expression patterns, previously unrecognized stress-induced genes, and many novel genes with root-enriched mRNA expression for improving our understanding of root biology and manipulation of rootstock traits in wine grape. mRNA abundance estimates based on EST library-enriched expression patterns showed only modest correlations between microarray and quantitative, real-time reverse transcription-polymerase chain reaction (qRT-PCR) methods highlighting the need for deep-sequencing expression profiling methods. PMID:21592389
[Blue-light induced expression of S-adenosyl-L-homocysteine hydrolase-like gene in Mucor amphibiorum RCS1].

PubMed

Gao, Ya; Wang, Shu; Fu, Mingjia; Zhong, Guolin

2013-09-04

The aim was to determine blue-light-induced expression of the S-adenosyl-L-homocysteine hydrolase-like (sahhl) gene in the fungus Mucor amphibiorum RCS1. A sequence of 555 bp was obtained from M. amphibiorum RCS1 during random PCR amplification. The 555 bp sequence was labeled with digoxin to prepare the probe for northern hybridization. By northern hybridization, the transcription of the sahhl gene was analyzed in M. amphibiorum RCS1 mycelia during a culture process from darkness to blue light to darkness. Real-time PCR was used in parallel to analyze sahhl gene expression. Comparison with sahh gene sequences from Homo sapiens, Mus musculus and some fungal species confirmed high homology of the 555 bp sequence. This provides preliminary support that the 555 bp sequence is the sahhl gene of M. amphibiorum RCS1. After 24 h of dark pre-culture, large amounts of sahhl transcript were detected in the mycelia by northern hybridization and real-time PCR following 24 h of blue light. However, after 48 h of dark pre-culture, large amounts of sahhl transcript were not detected, even when the M. amphibiorum RCS1 mycelia were exposed to blue light. Blue light can induce expression of the sahhl gene in vigorously growing M. amphibiorum RCS1 mycelia.
Heterologous Array Analysis in Pinaceae: Hybridization of Pinus Taeda cDNA Arrays With cDNA From Needles and Embryogenic Cultures of P. Taeda, P. Sylvestris or Picea Abies

PubMed Central

van Zyl, Leonel; von Arnold, Sara; Bozhkov, Peter; Chen, Yongzhong; Egertsdotter, Ulrika; MacKay, John; Sederoff, Ronald R.; Shen, Jing; Zelena, Lyubov

2002-01-01

Hybridization of labelled cDNA from various cell types with high-density arrays of expressed sequence tags is a powerful technique for investigating gene expression. Few conifer cDNA libraries have been sequenced. Because of the high level of sequence conservation between Pinus and Picea we have investigated the use of arrays from one genus for studies of gene expression in the other. The partial cDNAs from 384 identifiable genes expressed in differentiating xylem of Pinus taeda were printed on nylon membranes in randomized replicates. These were hybridized with labelled cDNA from needles or embryogenic cultures of Pinus taeda, P. sylvestris and Picea abies, and with labelled cDNA from leaves of Nicotiana tabacum. The Spearman correlation of gene expression for pairs of conifer species was high for needles (r² = 0.78 to 0.86), and somewhat lower for embryogenic cultures (r² = 0.68 to 0.83). The correlation of gene expression for tobacco leaves and needles of each of the three conifer species was lower but sufficiently high (r² = 0.52 to 0.63) to suggest that many partial gene sequences are conserved in angiosperms and gymnosperms. Heterologous probing was further used to identify tissue-specific gene expression over species boundaries. To evaluate the significance of differences in gene expression, conventional parametric tests were compared with permutation tests after four methods of normalization. Permutation tests after Z-normalization provide the highest degree of discrimination but may enhance the probability of type I errors. It is concluded that arrays of cDNA from loblolly pine are useful for studies of gene expression in other pines or spruces. PMID:18629264
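The cross-species comparison above rests on Spearman rank correlation of expression values. The Python below is a minimal, dependency-free sketch of that statistic: values are converted to ranks (ties receive the average rank) and the Pearson correlation of the ranks is returned. The function names are illustrative and nothing here reproduces the authors' normalization or permutation tests; squaring the result gives a quantity comparable to the r² values quoted in the abstract.

```python
def average_ranks(values):
    """Return 1-based ranks for `values`, giving tied values their average rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the block of values tied with the value at position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks


def pearson(x, y):
    """Pearson correlation of two equal-length sequences with non-zero variance."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / (var_x * var_y) ** 0.5


def spearman(x, y):
    """Spearman rank correlation: Pearson correlation of the average ranks."""
    return pearson(average_ranks(x), average_ranks(y))
```

Because only ranks are used, the statistic is insensitive to monotone differences in signal intensity between arrays, which is one reason it suits comparisons across species with different hybridization efficiencies.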
Drug resistance is conferred on the model yeast Saccharomyces cerevisiae by expression of full-length melanoma-associated human ATP-binding cassette transporter ABCB5.

PubMed

Keniya, Mikhail V; Holmes, Ann R; Niimi, Masakazu; Lamping, Erwin; Gillet, Jean-Pierre; Gottesman, Michael M; Cannon, Richard D

2014-10-06

ABCB5, an ATP-binding cassette (ABC) transporter, is highly expressed in melanoma cells, and may contribute to the extreme resistance of melanomas to chemotherapy by efflux of anti-cancer drugs. Our goal was to determine whether we could functionally express human ABCB5 in the model yeast Saccharomyces cerevisiae, in order to demonstrate an efflux function for ABCB5 in the absence of background pump activity from other human transporters. Heterologous expression would also facilitate drug discovery for this important target. DNAs encoding ABCB5 sequences were cloned into the chromosomal PDR5 locus of a S. cerevisiae strain in which seven endogenous ABC transporters have been deleted. Protein expression in the yeast cells was monitored by immunodetection using both a specific anti-ABCB5 antibody and a cross-reactive anti-ABCB1 antibody. ABCB5 function in recombinant yeast cells was measured by determining whether the cells possessed increased resistance to known pump substrates, compared to the host yeast strain, in assays of yeast growth. Three ABCB5 constructs were made in yeast. One was derived from the ABCB5-β mRNA, which is highly expressed in human tissues but is a truncation of a canonical full-size ABC transporter. Two constructs contained full-length ABCB5 sequences: either a native sequence from cDNA or a synthetic sequence codon-harmonized for S. cerevisiae. Expression of all three constructs in yeast was confirmed by immunodetection. Expression of the codon-harmonized full-length ABCB5 DNA conferred increased resistance, relative to the host yeast strain, to the putative substrates rhodamine 123, daunorubicin, tetramethylrhodamine, FK506, or clorgyline. We conclude that full-length ABCB5 can be functionally expressed in S. cerevisiae and confers drug resistance.
Single-cell RNA-sequencing reveals a distinct population of proglucagon-expressing cells specific to the mouse upper small intestine.

PubMed

Glass, Leslie L; Calero-Nieto, Fernando J; Jawaid, Wajid; Larraufie, Pierre; Kay, Richard G; Göttgens, Berthold; Reimann, Frank; Gribble, Fiona M

2017-10-01

To identify sub-populations of intestinal preproglucagon-expressing (PPG) cells producing Glucagon-like Peptide-1, and their associated expression profiles of sensory receptors, thereby enabling the discovery of therapeutic strategies that target these cell populations for the treatment of diabetes and obesity. We performed single cell RNA sequencing of PPG-cells purified by flow cytometry from the upper small intestine of 3 GLU-Venus mice. Cells from 2 mice were sequenced at low depth, and from the third mouse at high depth. High quality sequencing data from 234 PPG-cells were used to identify clusters by tSNE analysis. qPCR was performed to compare the longitudinal and crypt/villus locations of cluster-specific genes. Immunofluorescence and mass spectrometry were used to confirm protein expression. PPG-cells formed 3 major clusters: a group with typical characteristics of classical L-cells, including high expression of Gcg and Pyy (comprising 51% of all PPG-cells); a cell type overlapping with Gip-expressing K-cells (14%); and a unique cluster expressing Tph1 and Pzp that was predominantly located in proximal small intestine villi and co-produced 5-HT (35%). Expression of G-protein coupled receptors differed between clusters, suggesting the cell types are differentially regulated and would be differentially targetable. Our findings support the emerging concept that many enteroendocrine cell populations are highly overlapping, with individual cells producing a range of peptides previously assigned to distinct cell types. Different receptor expression profiles across the clusters highlight potential drug targets to increase gut hormone secretion for the treatment of diabetes and obesity. Copyright © 2017 The Authors. Published by Elsevier GmbH. All rights reserved.
Comparative transcriptome analysis reveals differentially expressed genes associated with sex expression in garden asparagus (Asparagus officinalis).

PubMed

Li, Shu-Fen; Zhang, Guo-Jun; Zhang, Xue-Jin; Yuan, Jin-Hong; Deng, Chuan-Liang; Gao, Wu-Jun

2017-08-22

Garden asparagus (Asparagus officinalis) is a highly valuable vegetable crop of commercial and nutritional interest. It is also commonly used to investigate the mechanisms of sex determination and differentiation in plants. However, the sex expression mechanisms in asparagus remain poorly understood. De novo transcriptome sequencing via Illumina paired-end sequencing revealed more than 26 billion bases of high-quality sequence data from male and female asparagus flower buds. A total of 72,626 unigenes with an average length of 979 bp were assembled. In comparative transcriptome analysis, 4876 differentially expressed genes (DEGs) were identified in the possible sex-determining stage of female and male/supermale flower buds. Of these DEGs, 433, including 285 male/supermale-biased and 149 female-biased genes, were annotated as flower related. Of the male/supermale-biased flower-related genes, 102 were probably involved in anther development. In addition, 43 DEGs implicated in hormone response and biosynthesis putatively associated with sex expression and reproduction were discovered. Moreover, 128 transcription factor (TF)-related genes belonging to various families were found to be differentially expressed, and this finding implied the essential roles of TF in sex determination or differentiation in asparagus. Correlation analysis indicated that miRNA-DEG pairs were also implicated in asparagus sexual development. Our study identified a large number of DEGs involved in the sex expression and reproduction of asparagus, including known genes participating in plant reproduction, plant hormone signaling, TF encoding, and genes with unclear functions. We also found that miRNAs might be involved in the sex differentiation process. Our study could provide a valuable basis for further investigations on the regulatory networks of sex determination and differentiation in asparagus and facilitate further genetic and genomic studies on this dioecious species.
Phylogenetic and comparative gene expression analysis of barley (Hordeum vulgare) WRKY transcription factor family reveals putatively retained functions between monocots and dicots

DOE Office of Scientific and Technical Information (OSTI.GOV)

Mangelsen, Elke; Kilian, Joachim; Berendzen, Kenneth W.

2008-02-01

WRKY proteins belong to the WRKY-GCM1 superfamily of zinc finger transcription factors that have been subject to a large plant-specific diversification. For the cereal crop barley (Hordeum vulgare), three different WRKY proteins have been characterized so far, as regulators in sucrose signaling, in pathogen defense, and in response to cold and drought, respectively. However, their phylogenetic relationship remained unresolved. In this study, we used the available sequence information to identify a minimum number of 45 barley WRKY transcription factor (HvWRKY) genes. According to their structural features the HvWRKY factors were classified into the previously defined polyphyletic WRKY subgroups 1 to 3. Furthermore, we could assign putative orthologs of the HvWRKY proteins in Arabidopsis and rice. While in most cases clades of orthologous proteins were formed within each group or subgroup, other clades were composed of paralogous proteins for the grasses and Arabidopsis only, which is indicative of specific gene radiation events. To gain insight into their putative functions, we examined expression profiles of WRKY genes from publicly available microarray data resources and found group specific expression patterns. While putative orthologs of the HvWRKY transcription factors have been inferred from phylogenetic sequence analysis, we performed a comparative expression analysis of WRKY genes in Arabidopsis and barley. Indeed, highly correlative expression profiles were found between some of the putative orthologs. HvWRKY genes have not only undergone radiation in monocot or dicot species, but exhibit evolutionary traits specific to grasses. HvWRKY proteins exhibited not only sequence similarities between orthologs with Arabidopsis, but also relatedness in their expression patterns. This correlative expression is indicative for a putative conserved function of related WRKY proteins in mono- and dicot species.
Gene amplification of 5-enol-pyruvylshikimate-3-phosphate synthase in glyphosate-resistant Kochia scoparia.

PubMed

Wiersma, Andrew T; Gaines, Todd A; Preston, Christopher; Hamilton, John P; Giacomini, Darci; Robin Buell, C; Leach, Jan E; Westra, Philip

2015-02-01

Field-evolved resistance to the herbicide glyphosate is due to amplification of one of two EPSPS alleles, increasing transcription and protein with no splice variants or effects on other pathway genes. The widely used herbicide glyphosate inhibits the shikimate pathway enzyme 5-enolpyruvylshikimate-3-phosphate synthase (EPSPS). Globally, the intensive use of glyphosate for weed control has selected for glyphosate resistance in 31 weed species. Populations of suspected glyphosate-resistant Kochia scoparia were collected from fields located in the US central Great Plains. Glyphosate dose response verified glyphosate resistance in nine populations. The mechanism of resistance to glyphosate was investigated using targeted sequencing, quantitative PCR, immunoblotting, and whole transcriptome de novo sequencing to characterize the sequence and expression of EPSPS. Sequence analysis showed no mutation of the EPSPS Pro106 codon in glyphosate-resistant K. scoparia, whereas EPSPS genomic copy number and transcript abundance were elevated three- to ten-fold in resistant individuals relative to susceptible individuals. Glyphosate-resistant individuals with increased relative EPSPS copy numbers had consistently lower shikimate accumulation in leaf disks treated with 100 μM glyphosate and EPSPS protein levels were higher in glyphosate-resistant individuals with increased gene copy number compared to glyphosate-susceptible individuals. RNA sequence analysis revealed seven nucleotide positions with two different expressed alleles in glyphosate-susceptible reads. However, one nucleotide at the seven positions was predominant in glyphosate-resistant sequences, suggesting that only one of two EPSPS alleles was amplified in glyphosate-resistant individuals. No alternatively spliced EPSPS transcripts were detected. Expression of five other genes in the chorismate pathway was unaffected in glyphosate-resistant individuals with increased EPSPS expression. These results indicate increased EPSPS expression is a mechanism for glyphosate resistance in these K. scoparia populations.
Gene expression profiling of adult female tissues in feeding Rhipicephalus microplus cattle ticks.

PubMed

Stutzer, Christian; van Zyl, Willem A; Olivier, Nicholas A; Richards, Sabine; Maritz-Olivier, Christine

2013-06-01

The southern cattle tick, Rhipicephalus microplus, is an economically important pest, especially for resource-poor countries, both as a highly adaptive invasive species and prominent vector of disease. The increasing prevalence of resistance to chemical acaricides and variable efficacy of current tick vaccine candidates highlight the need for more effective control methods. In the absence of a fully annotated genome, the wealth of available expressed sequence tag sequence data for this species presents a unique opportunity to study the genes that are expressed in tissues involved in blood meal acquisition, digestion and reproduction during feeding. Utilising a custom oligonucleotide microarray designed from available singletons (BmiGI Version 2.1) and expressed sequence tag sequences of R. microplus, the expression profiles in feeding adult female midgut, salivary glands and ovarian tissues were compared. From 13,456 assembled transcripts, 588 genes expressed in all three tissues were identified from fed adult females 20 days post infestation. The greatest complement of genes relate to translation and protein turnover. Additionally, a number of unique transcripts were identified for each tissue that relate well to their respective physiological/biological function/role(s). These transcripts include secreted anti-hemostatics and defense proteins from the salivary glands for acquisition of a blood meal, proteases as well as enzymes and transporters for digestion and nutrient acquisition from ingested blood in the midgut, and finally proteins and associated factors involved in DNA replication and cell-cycle control for oogenesis in the ovaries. Comparative analyses of adult female tissues during feeding enabled the identification of a catalogue of transcripts that may be essential for successful feeding and reproduction in the cattle tick, R. microplus. Future studies will increase our understanding of basic tick biology, allowing the identification of shared proteins/pathways among different tissues that may offer novel targets for the development of new tick control strategies. Copyright © 2013 Australian Society for Parasitology Inc. Published by Elsevier Ltd. All rights reserved.
RNA expression in a cartilaginous fish cell line reveals ancient 3′ noncoding regions highly conserved in vertebrates

PubMed Central

Forest, David; Nishikawa, Ryuhei; Kobayashi, Hiroshi; Parton, Angela; Bayne, Christopher J.; Barnes, David W.

2007-01-01

We have established a cartilaginous fish cell line [Squalus acanthias embryo cell line (SAE)], a mesenchymal stem cell line derived from the embryo of an elasmobranch, the spiny dogfish shark S. acanthias. Elasmobranchs (sharks and rays) first appeared >400 million years ago, and existing species provide useful models for comparative vertebrate cell biology, physiology, and genomics. Comparative vertebrate genomics among evolutionarily distant organisms can provide sequence conservation information that facilitates identification of critical coding and noncoding regions. Although these genomic analyses are informative, experimental verification of functions of genomic sequences depends heavily on cell culture approaches. Using ESTs defining mRNAs derived from the SAE cell line, we identified lengthy and highly conserved gene-specific nucleotide sequences in the noncoding 3′ UTRs of eight genes involved in the regulation of cell growth and proliferation. Conserved noncoding 3′ mRNA regions detected by using the shark nucleotide sequences as a starting point were found in a range of other vertebrate orders, including bony fish, birds, amphibians, and mammals. Nucleotide identity of shark and human in these regions was remarkably well conserved. Our results indicate that highly conserved gene sequences dating from the appearance of jawed vertebrates and representing potential cis-regulatory elements can be identified through the use of cartilaginous fish as a baseline. Because the expression of genes in the SAE cell line was prerequisite for their identification, this cartilaginous fish culture system also provides a physiologically valid tool to test functional hypotheses on the role of these ancient conserved sequences in comparative cell biology. PMID:17227856
Differential gene and transcript expression analysis of RNA-seq experiments with TopHat and Cufflinks

PubMed Central

Trapnell, Cole; Roberts, Adam; Goff, Loyal; Pertea, Geo; Kim, Daehwan; Kelley, David R; Pimentel, Harold; Salzberg, Steven L; Rinn, John L; Pachter, Lior

2012-01-01

Recent advances in high-throughput cDNA sequencing (RNA-seq) can reveal new genes and splice variants and quantify expression genome-wide in a single assay. The volume and complexity of data from RNA-seq experiments necessitate scalable, fast and mathematically principled analysis software. TopHat and Cufflinks are free, open-source software tools for gene discovery and comprehensive expression analysis of high-throughput mRNA sequencing (RNA-seq) data. Together, they allow biologists to identify new genes and new splice variants of known ones, as well as compare gene and transcript expression under two or more conditions. This protocol describes in detail how to use TopHat and Cufflinks to perform such analyses. It also covers several accessory tools and utilities that aid in managing data, including CummeRbund, a tool for visualizing RNA-seq analysis results. Although the procedure assumes basic informatics skills, these tools assume little to no background with RNA-seq analysis and are meant for novices and experts alike. The protocol begins with raw sequencing reads and produces a transcriptome assembly, lists of differentially expressed and regulated genes and transcripts, and publication-quality visualizations of analysis results. The protocol's execution time depends on the volume of transcriptome sequencing data and available computing resources but takes less than 1 d of computer time for typical experiments and ~1 h of hands-on time. PMID:22383036
Efficient gusA transient expression in Porphyra yezoensis protoplasts mediated by endogenous beta-tubulin flanking sequences

NASA Astrophysics Data System (ADS)

Gong, Qianhong; Yu, Wengong; Dai, Jixun; Liu, Hongquan; Xu, Rifu; Guan, Huashi; Pan, Kehou

2007-01-01

Endogenous tubulin promoter has been widely used for expressing foreign genes in green algae, but the efficiency and feasibility of endogenous tubulin promoter in the economically important Porphyra yezoensis (Rhodophyta) are unknown. In this study, the flanking sequences of beta-tubulin gene from P. yezoensis were amplified and two transient expression vectors were constructed to determine their transcription promoting feasibility for foreign gene gusA. The testing vector pATubGUS was constructed by inserting 5'- and 3'-flanking regions (Tub5' and Tub3') up- and downstream of the β-glucuronidase (GUS) gene (gusA), respectively, into pA, a derivative of pCAT®3-enhancer vector. The control construct, pAGUSTub3, contains only gusA and Tub3'. These constructs were electroporated into P. yezoensis protoplasts and the GUS activities were quantitatively analyzed by spectrometry. The results demonstrated that gusA gene was efficiently expressed in P. yezoensis protoplasts under the regulation of 5'-flanking sequence of the beta-tubulin gene. More interestingly, the pATubGUS produced stronger GUS activity in P. yezoensis protoplasts when compared to the result from pBI221, in which the gusA gene was directed by a constitutive CaMV 35S promoter. The data suggest that the integration of P. yezoensis protoplast and its endogenous beta-tubulin flanking sequences is a potential novel system for foreign gene expression.
Modulation of c-fms proto-oncogene in an ovarian carcinoma cell line by a hammerhead ribozyme.

PubMed Central

Yokoyama, Y.; Morishita, S.; Takahashi, Y.; Hashimoto, M.; Tamaya, T.

1997-01-01

Co-expression of macrophage colony-stimulating factor (M-CSF) and its receptor (c-fms) is often found in ovarian epithelial carcinoma, suggesting the existence of autocrine regulation of cell growth by M-CSF. To block this autocrine loop, we have developed hammerhead ribozymes against c-fms mRNA. As target sites of the ribozyme, we chose the GUC sequence in codon 18 and codon 27 of c-fms mRNA. Two kinds of ribozymes were able to cleave an artificial c-fms RNA substrate in a cell-free system, although the ribozyme against codon 18 was much more efficient than that against codon 27. We next constructed an expression vector carrying a ribozyme sequence that targeted the GUC sequence in codon 18 of c-fms mRNA. It was introduced into TYK-nu cells that expressed M-CSF and its receptor. Its transfectant showed a reduced growth potential. The expression levels of c-fms protein and mRNA in the transfectant were clearly decreased with the expression of ribozyme RNA compared with that of an untransfected control or a transfectant with the vector without the ribozyme sequence. These results suggest that the ribozyme against GUC in codon 18 of c-fms mRNA is a promising tool for blocking the autocrine loop of M-CSF in ovarian epithelial carcinoma. PMID:9376277
A dehydration-inducible gene in the truffle Tuber borchii identifies a novel group of dehydrins

PubMed Central

Abba', Simona; Ghignone, Stefano; Bonfante, Paola

2006-01-01

Background The expressed sequence tag M6G10 was originally isolated from a screening for differentially expressed transcripts during the reproductive stage of the white truffle Tuber borchii. mRNA levels for M6G10 increased dramatically during fruiting body maturation compared to the vegetative mycelial stage. Results Bioinformatics tools, phylogenetic analysis and expression studies were used to support the hypothesis that this sequence, named TbDHN1, is the first dehydrin (DHN)-like coding gene isolated in fungi. Homologs of this gene, all defined as "coding for hypothetical proteins" in public databases, were exclusively found in ascomycetous fungi and in plants. Although complete (or almost complete) fungal genomes and EST collections of some Basidiomycota and Glomeromycota are already available, DHN-like proteins appear to be represented only in Ascomycota. A new and previously uncharacterized conserved signature pattern was identified and proposed to Uniprot database as the main distinguishing feature of this new group of DHNs. Expression studies provide experimental evidence of a transcript induction of TbDHN1 during cellular dehydration. Conclusion Expression pattern and sequence similarities to known plant DHNs indicate that TbDHN1 is the first characterized DHN-like protein in fungi. The high similarity of TbDHN1 with homolog coding sequences implies the existence of a novel fungal/plant group of LEA Class II proteins characterized by a previously undescribed signature pattern. PMID:16512918

Characterizing the heterogeneity of triple-negative breast cancers using microdissected normal ductal epithelium and RNA-sequencing

PubMed Central

Radovich, Milan; Clare, Susan E.; Atale, Rutuja; Pardo, Ivanesa; Hancock, Bradley A.; Solzak, Jeffrey P.; Kassem, Nawal; Mathieson, Theresa; Storniolo, Anna Maria V.; Rufenbarger, Connie; Lillemoe, Heather A.; Blosser, Rachel J.; Choi, Mi Ran; Sauder, Candice A.; Doxey, Diane; Henry, Jill E.; Hilligoss, Eric E.; Sakarya, Onur; Hyland, Fiona C.; Hickenbotham, Matthew; Zhu, Jin; Glasscock, Jarret; Badve, Sunil; Ivan, Mircea; Liu, Yunlong; Sledge, George W.; Schneider, Bryan P.

2014-01-01

Triple-negative breast cancers (TNBCs) are a heterogeneous set of tumors defined by an absence of actionable therapeutic targets (ER−, PR−, HER2−). Microdissected normal ductal epithelium from healthy volunteers represents a novel comparator to reveal insights into TNBC heterogeneity and to inform drug development. Using RNA-sequencing data from our institution and The Cancer Genome Atlas (TCGA) we compared the transcriptomes of 94 TNBCs, 20 microdissected normal breast tissues from healthy volunteers from the Susan G. Komen for the Cure Tissue Bank, and 10 histologically normal tissues adjacent to tumor. Pathway analysis comparing TNBCs to optimized normal controls of microdissected normal epithelium versus classic controls composed of adjacent normal tissue revealed distinct molecular signatures. Differential gene expression of TNBC compared with normal comparators demonstrated important findings for TNBC-specific clinical trials testing targeted agents; lack of over-expression for negative studies and over-expression in studies with drug activity. Next, by comparing each individual TNBC to the set of microdissected normals, we demonstrate that TNBC heterogeneity is attributable to transcriptional chaos, is associated with non-silent DNA mutational load, and explains transcriptional heterogeneity in addition to known molecular subtypes. Finally, chaos analysis identified 146 core genes dysregulated in >90% of TNBCs revealing an over-expressed central network. In conclusion, use of microdissected normal ductal epithelium from healthy volunteers enables an optimized approach for studying TNBC and uncovers biological heterogeneity mediated by transcriptional chaos. PMID:24292813
De novo assembled expressed gene catalog of a fast-growing Eucalyptus tree produced by Illumina mRNA-Seq

PubMed Central

2010-01-01

Background De novo assembly of transcript sequences produced by short-read DNA sequencing technologies offers a rapid approach to obtain expressed gene catalogs for non-model organisms. A draft genome sequence will be produced in 2010 for a Eucalyptus tree species (E. grandis) representing the most important hardwood fibre crop in the world. Genome annotation of this valuable woody plant and genetic dissection of its superior growth and productivity will be greatly facilitated by the availability of a comprehensive collection of expressed gene sequences from multiple tissues and organs. Results We present an extensive expressed gene catalog for a commercially grown E. grandis × E. urophylla hybrid clone constructed using only Illumina mRNA-Seq technology and de novo assembly. A total of 18,894 transcript-derived contigs, a large proportion of which represent full-length protein coding genes, were assembled and annotated. Analysis of assembly quality, length and diversity shows that this dataset represents the most comprehensive expressed gene catalog for any Eucalyptus tree. mRNA-Seq analysis furthermore allowed digital expression profiling of all of the assembled transcripts across diverse xylogenic and non-xylogenic tissues, which is invaluable for ascribing putative gene functions. Conclusions De novo assembly of Illumina mRNA-Seq reads is an efficient approach for transcriptome sequencing and profiling in Eucalyptus and other non-model organisms. The transcriptome resource (Eucspresso, http://eucspresso.bi.up.ac.za/) generated by this study will be of value for genomic analysis of woody biomass production in Eucalyptus and for comparative genomic analysis of growth and development in woody and herbaceous plants. PMID:21122097
Deep sequencing-based transcriptome analysis of Plutella xylostella larvae parasitized by Diadegma semiclausum

PubMed Central

2011-01-01

Background Parasitoid insects manipulate their hosts' physiology by injecting various factors into their host upon parasitization. Transcriptomic methods provide a powerful approach to study insect host-parasitoid interactions at the molecular level. In order to investigate the effects of parasitization by an ichneumonid wasp (Diadegma semiclausum) on the host (Plutella xylostella), the larval transcriptome profile was analyzed using a short-read deep sequencing method (Illumina). Symbiotic polydnaviruses (PDVs) associated with ichneumonid parasitoids, known as ichnoviruses, play significant roles in host immune suppression and developmental regulation. In the current study, D. semiclausum ichnovirus (DsIV) genes expressed in P. xylostella were identified and their sequences compared with other reported PDVs. Five of these genes encode proteins of unknown identity that have not previously been reported. Results De novo assembly of cDNA sequence data generated 172,660 contigs between 100 and 10000 bp in length, with 35% of them > 200 bp in length. Parasitization had significant impacts on expression levels of 928 identified insect host transcripts. Gene ontology data illustrated that the majority of the differentially expressed genes are involved in binding, catalytic activity, and metabolic and cellular processes. In addition, the results show that transcription levels of antimicrobial peptides, such as gloverin, cecropin E and lysozyme, were up-regulated after parasitism. Expression of ichnovirus genes was detected in parasitized larvae with 19 unique sequences identified from five PDV gene families including vankyrin, viral innexin, repeat elements, a cysteine-rich motif, and polar residue rich protein. Vankyrin 1 and repeat element 1 genes showed the highest transcription levels among the DsIV genes. Conclusion This study provides detailed information on differential expression of P. xylostella larval genes following parasitization, DsIV genes expressed in the host and also improves our current understanding of this host-parasitoid interaction. PMID:21906285
Analysis of 10,000 ESTs from lymphocytes of the cynomolgus monkey to improve our understanding of its immune system

PubMed Central

Chen, Wei-Hua; Wang, Xue-Xia; Lin, Wei; He, Xiao-Wei; Wu, Zhen-Qiang; Lin, Ying; Hu, Song-Nian; Wang, Xiao-Ning

2006-01-01

Background The cynomolgus monkey (Macaca fascicularis) is one of the most widely used surrogate animal models for an increasing number of human diseases and vaccines, especially immune-system-related ones. Towards a better understanding of the gene expression background upon its immunogenetics, we constructed a cDNA library from Epstein-Barr virus (EBV)-transformed B lymphocytes of a cynomolgus monkey and sequenced 10,000 randomly picked clones. Results After processing, 8,312 high-quality expressed sequence tags (ESTs) were generated and assembled into 3,728 unigenes. Annotations of these uniquely expressed transcripts demonstrated that out of the 2,524 open reading frame (ORF) positive unigenes (mitochondrial and ribosomal sequences were not included), 98.8% shared significant similarities (E-value less than 1e-10) with the NCBI nucleotide (nt) database, while only 67.7% (E-value less than 1e-5) did so with the NCBI non-redundant protein (nr) database. Further analysis revealed that 90.0% of the unigenes that shared no similarities to the nr database could be assigned to human chromosomes, in which 75 did not match significantly to any cynomolgus monkey and human ESTs. The mapping regions to known human genes on the human genome were described in detail. The protein family and domain analysis revealed that the first, second and fourth of the most abundantly expressed protein families were all assigned to immunoglobulin and major histocompatibility complex (MHC)-related proteins. The expression profiles of these genes were compared with that of homologous genes in human blood, lymph nodes and a RAMOS cell line, which demonstrated expression changes after transformation with EBV. The degree of sequence similarity of the MHC class I and II genes to the human reference sequences was evaluated. The results indicated that class I molecules showed weak amino acid identities (<90%), while class II showed slightly higher ones. Conclusion These results indicated that the genes expressed in the cynomolgus monkey could be used to identify novel protein-coding genes and revise those incomplete or incorrect annotations in the human genome by comparative methods, since the old world monkeys and humans share high similarities at the molecular level, especially within coding regions. The identification of multiple genes involved in the immune response, their sequence variations to the human homologues, and their responses to EBV infection could provide useful information to improve our understanding of the cynomolgus monkey immune system. PMID:16618371
Generation of expressed sequence tags for discovery of genes responsible for floral traits of Chrysanthemum morifolium by next-generation sequencing technology.

PubMed

Sasaki, Katsutomo; Mitsuda, Nobutaka; Nashima, Kenji; Kishimoto, Kyutaro; Katayose, Yuichi; Kanamori, Hiroyuki; Ohmiya, Akemi

2017-09-04

Chrysanthemum morifolium is one of the most economically valuable ornamental plants worldwide. Chrysanthemum is an allohexaploid plant with a large genome that is commercially propagated by vegetative reproduction. New cultivars with different floral traits, such as color, morphology, and scent, have been generated mainly by classical cross-breeding and mutation breeding. However, only limited genetic resources and their genome information are available for the generation of new floral traits. To obtain useful information about molecular bases for floral traits of chrysanthemums, we read expressed sequence tags (ESTs) of chrysanthemums by high-throughput sequencing using the 454 pyrosequencing technology. We constructed normalized cDNA libraries, consisting of full-length, 3'-UTR, and 5'-UTR cDNAs derived from various tissues of chrysanthemums. These libraries produced a total number of 3,772,677 high-quality reads, which were assembled into 213,204 contigs. By comparing the data obtained with those of full genome-sequenced species, we confirmed that our chrysanthemum contig set contained the majority of all expressed genes, which was sufficient for further molecular analysis in chrysanthemums. We confirmed that our chrysanthemum EST set (contigs) contained a number of contigs that encoded transcription factors and enzymes involved in pigment and aroma compound metabolism that was comparable to that of other species. This information can serve as an informative resource for identifying genes involved in various biological processes in chrysanthemums. Moreover, the findings of our study will contribute to a better understanding of the floral characteristics of chrysanthemums including the myriad cultivars at the molecular level.
Genomic Heat Shock Element Sequences Drive Cooperative Human Heat Shock Factor 1 DNA Binding and Selectivity

PubMed Central

Jaeger, Alex M.; Makley, Leah N.; Gestwicki, Jason E.; Thiele, Dennis J.

2014-01-01

The heat shock transcription factor 1 (HSF1) activates expression of a variety of genes involved in cell survival, including protein chaperones, the protein degradation machinery, anti-apoptotic proteins, and transcription factors. Although HSF1 activation has been linked to amelioration of neurodegenerative disease, cancer cells exhibit a dependence on HSF1 for survival. Indeed, HSF1 drives a program of gene expression in cancer cells that is distinct from that activated in response to proteotoxic stress, and HSF1 DNA binding activity is elevated in cycling cells as compared with arrested cells. Active HSF1 homotrimerizes and binds to a DNA sequence consisting of inverted repeats of the pentameric sequence nGAAn, known as heat shock elements (HSEs). Recent comprehensive ChIP-seq experiments demonstrated that the architecture of HSEs is very diverse in the human genome, with deviations from the consensus sequence in the spacing, orientation, and extent of HSE repeats that could influence HSF1 DNA binding efficacy and the kinetics and magnitude of target gene expression. To understand the mechanisms that dictate binding specificity, HSF1 was purified as either a monomer or trimer and used to evaluate DNA-binding site preferences in vitro using fluorescence polarization and thermal denaturation profiling. These results were compared with quantitative chromatin immunoprecipitation assays in vivo. We demonstrate a role for specific orientations of extended HSE sequences in driving preferential HSF1 DNA binding to target loci in vivo. These studies provide a biochemical basis for understanding differential HSF1 target gene recognition and transcription in neurodegenerative disease and in cancer. PMID:25204655
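The HSE architecture described in this abstract (inverted repeats of the pentamer nGAAn) can be illustrated with a small sequence scan. This is only a sketch and not the authors' method: it assumes a simplified definition in which a site is a run of at least three contiguous 5-bp units alternating between nGAAn and its reverse complement nTTCn. The function names and the example sequence are hypothetical.

```python
def classify_pentamer(pentamer):
    """Return '+' for nGAAn, '-' for nTTCn (its reverse complement), else None."""
    if len(pentamer) != 5:
        return None
    core = pentamer[1:4]
    if core == "GAA":
        return "+"
    if core == "TTC":
        return "-"
    return None


def find_hse_sites(seq, min_units=3):
    """Find runs of contiguous pentamers that alternate in orientation.

    Returns a list of (start, end, n_units) tuples with 0-based, end-exclusive
    coordinates. The threshold of three units is an illustrative assumption.
    """
    seq = seq.upper()
    sites = []
    i = 0
    while i + 5 <= len(seq):
        kind = classify_pentamer(seq[i:i + 5])
        if kind is None:
            i += 1
            continue
        # Extend the run one pentamer at a time while orientation alternates.
        j = i + 5
        units = 1
        last = kind
        while j + 5 <= len(seq):
            nxt = classify_pentamer(seq[j:j + 5])
            if nxt is None or nxt == last:
                break
            units += 1
            last = nxt
            j += 5
        if units >= min_units:
            sites.append((i, j, units))
            i = j
        else:
            i += 1
    return sites


if __name__ == "__main__":
    # Hypothetical sequence: three alternating units flanked by CCC.
    example = "CCC" + "AGAAC" + "GTTCT" + "AGAAG" + "CCC"
    print(find_hse_sites(example))
```

A scan like this ignores the spacing and orientation variants that the abstract reports in genomic HSEs; relaxing the contiguity requirement would be the natural next step.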
Cloning, characterization, expression and comparative analysis of pig Golgi membrane sphingomyelin synthase 1.

PubMed

Guillén, Natalia; Navarro, María A; Surra, Joaquín C; Arnal, Carmen; Fernández-Juan, Marta; Cebrián-Pérez, Jose Alvaro; Osada, Jesús

2007-02-15

Pig sphingomyelin synthase 1 (SMS1) cDNA was cloned, characterized and compared to the human ortholog. The porcine protein consists of 413 amino acids and displays 97% sequence identity with the human protein. A phylogenetic tree of proteins reveals that porcine SMS1 is more closely related to bovine and rodent proteins than to human. The measured protein mass was higher than the theoretical prediction based on the amino acid sequence, suggesting some kind of posttranslational modification. Quantitative representation of tissue distribution obtained by real-time RT-PCR showed that it was widely expressed, although important variations in levels were observed among organs. Thus, the cardiovascular system, especially the heart, showed the highest value of all the tissues studied. Regional differences of expression were observed in the central nervous system and intestinal tract. Analysis of the hepatic mRNA and protein expression of SMS1 following turpentine treatment revealed a progressive decrease in the former paralleled by a decrease in the protein concentration. These findings indicate that the variation in expression among tissues may reflect a different requirement of Golgi sphingomyelin for the specific function of each organ, and that the enzyme is regulated in response to turpentine-induced hepatic injury.
DSAP: deep-sequencing small RNA analysis pipeline.

PubMed

Huang, Po-Jung; Liu, Yi-Chung; Lee, Chi-Ching; Lin, Wei-Chen; Gan, Richie Ruei-Chi; Lyu, Ping-Chiang; Tang, Petrus

2010-07-01

DSAP is an automated multiple-task web service designed to provide a total solution to analyzing deep-sequencing small RNA datasets generated by next-generation sequencing technology. DSAP uses a tab-delimited file as an input format, which holds the unique sequence reads (tags) and their corresponding number of copies generated by the Solexa sequencing platform. The input data will go through four analysis steps in DSAP: (i) cleanup: removal of adaptors and poly-A/T/C/G/N nucleotides; (ii) clustering: grouping of cleaned sequence tags into unique sequence clusters; (iii) non-coding RNA (ncRNA) matching: sequence homology mapping against a transcribed sequence library from the ncRNA database Rfam (http://rfam.sanger.ac.uk/); and (iv) known miRNA matching: detection of known miRNAs in miRBase (http://www.mirbase.org/) based on sequence homology. The expression levels corresponding to matched ncRNAs and miRNAs are summarized in multi-color clickable bar charts linked to external databases. DSAP is also capable of displaying miRNA expression levels from different jobs using a log(2)-scaled color matrix. Furthermore, a cross-species comparative function is also provided to show the distribution of identified miRNAs in different species as deposited in miRBase. DSAP is available at http://dsap.cgu.edu.tw.
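The first two steps listed in the DSAP abstract (cleanup and clustering of a tab-delimited tag/count file) can be sketched in a few lines. This is not DSAP's code: the adaptor string, the minimum overlap, the minimum tag length, and all function names are assumptions made for illustration, and "removal of poly-A/T/C/G/N" is interpreted here as discarding tags made of a single repeated nucleotide.

```python
from collections import Counter

# Placeholder 3' adaptor; an assumption for this sketch, not taken from DSAP.
ADAPTOR = "TCGTATGCCGTCTTCTGCTTG"


def parse_tags(text):
    """Parse 'sequence<TAB>count' lines into (sequence, count) pairs."""
    pairs = []
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        seq, count = line.split("\t")
        pairs.append((seq.upper(), int(count)))
    return pairs


def trim_adaptor(seq, adaptor=ADAPTOR, min_overlap=6):
    """Remove a full adaptor, or an adaptor prefix found at the 3' end."""
    idx = seq.find(adaptor)
    if idx != -1:
        return seq[:idx]
    for k in range(min(len(adaptor), len(seq)), min_overlap - 1, -1):
        if seq.endswith(adaptor[:k]):
            return seq[:-k]
    return seq


def is_homopolymer(seq):
    """True for empty tags or tags made of one repeated nucleotide (incl. N)."""
    return len(set(seq)) <= 1


def cluster_tags(pairs, min_len=15):
    """Clean each tag, then sum the copy numbers of identical cleaned tags."""
    clusters = Counter()
    for seq, count in pairs:
        cleaned = trim_adaptor(seq)
        if len(cleaned) < min_len or is_homopolymer(cleaned):
            continue
        clusters[cleaned] += count
    return clusters


if __name__ == "__main__":
    tag = "TGAGGTAGTAGGTTGTATAGTT"  # example 22-nt tag
    text = "\n".join([
        tag + ADAPTOR[:10] + "\t5",  # tag with partial adaptor at the 3' end
        tag + "\t3",                 # the same tag, already clean
        "A" * 20 + "\t7",            # poly-A tag, discarded
    ])
    for seq, count in cluster_tags(parse_tags(text)).items():
        print(seq, count)
```

Steps (iii) and (iv), matching clusters against Rfam and miRBase, are left out because they depend on external databases and on alignment settings the abstract does not specify.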
A combination of LongSAGE with Solexa sequencing is well suited to explore the depth and the complexity of transcriptome

PubMed Central

Hanriot, Lucie; Keime, Céline; Gay, Nadine; Faure, Claudine; Dossat, Carole; Wincker, Patrick; Scoté-Blachon, Céline; Peyron, Christelle; Gandrillon, Olivier

2008-01-01

Background "Open" transcriptome analysis methods allow the study of gene expression without a priori knowledge of the transcript sequences. As of now, SAGE (Serial Analysis of Gene Expression), LongSAGE and MPSS (Massively Parallel Signature Sequencing) are the most widely used methods for "open" transcriptome analysis. Both LongSAGE and MPSS rely on the isolation of 21 bp tag sequences from each transcript. In contrast to LongSAGE, the high throughput sequencing method used in MPSS enables the rapid sequencing of very large libraries containing several million tags, allowing deep transcriptome analysis. However, a bias in the complexity of the transcriptome representation obtained by MPSS was recently uncovered. Results In order to perform a deep analysis of the mouse hypothalamus transcriptome while avoiding the limitation introduced by MPSS, we combined LongSAGE with the Solexa sequencing technology and obtained a library of more than 11 million tags. We then compared it to a LongSAGE library of mouse hypothalamus sequenced with the Sanger method. Conclusion We found that Solexa sequencing technology combined with LongSAGE is well suited for deep transcriptome analysis. In contrast to MPSS, it gives a complex representation of the transcriptome that is as reliable as a LongSAGE library sequenced by the Sanger method. PMID:18796152
Comparative analysis of gene regulatory networks: from network reconstruction to evolution.

PubMed

Thompson, Dawn; Regev, Aviv; Roy, Sushmita

2015-01-01

Regulation of gene expression is central to many biological processes. Although reconstruction of regulatory circuits from genomic data alone is therefore desirable, this remains a major computational challenge. Comparative approaches that examine the conservation and divergence of circuits and their components across strains and species can help reconstruct circuits as well as provide insights into the evolution of gene regulatory processes and their adaptive contribution. In recent years, advances in genomic and computational tools have led to a wealth of methods for such analysis at the sequence, expression, pathway, module, and entire network level. Here, we review computational methods developed to study transcriptional regulatory networks using comparative genomics, from sequence to functional data. We highlight how these methods use evolutionary conservation and divergence to reliably detect regulatory components as well as estimate the extent and rate of divergence. Finally, we discuss the promise and open challenges in linking regulatory divergence to phenotypic divergence and adaptation.
Aquaporin 4 is a Ubiquitously Expressed Isoform in the Dogfish (Squalus acanthias) Shark.

PubMed

Cutler, Christopher P; Maciver, Bryce; Cramb, Gordon; Zeidel, Mark

2011-01-01

The dogfish ortholog of aquaporin 4 (AQP4) was amplified from cDNA using degenerate PCR followed by cloning and sequencing. The complete coding region was then obtained using 5' and 3' RACE techniques. Alignment of the sequence with AQP4 amino acid sequences from other species showed that dogfish AQP4 has high levels (up to 65.3%) of homology with higher vertebrate sequences but lower levels of homology to Agnathan (38.2%) or teleost (57.5%) fish sequences. Northern blotting indicated that the dogfish mRNA was approximately 3.2 kb and was highly expressed in the rectal gland (a shark fluid secretory organ). Semi-quantitative PCR further indicates that AQP4 is ubiquitous, being expressed in all tissues measured but at low levels in certain tissues, where the level in liver > gill > intestine. Manipulation of the external environmental salinity of groups of dogfish showed that when fish were acclimated in stages to 120% seawater (SW) or 75% SW, there was no change in AQP4 mRNA expression in either rectal gland, kidney, or esophagus/cardiac stomach. Whereas quantitative PCR experiments using the RNA samples from the same experiment, showed a significant 63.1% lower abundance of gill AQP4 mRNA expression in 120% SW-acclimated dogfish. The function of dogfish AQP4 was also determined by measuring the effect of the AQP4 expression in Xenopus laevis oocytes. Dogfish AQP4 expressing-oocytes, exhibited significantly increased osmotic water permeability (P(f)) compared to controls, and this was invariant with pH. Permeability was not significantly reduced by treatment of oocytes with mercury chloride, as is also the case with AQP4 in other species. Similarly AQP4 expressing-oocytes did not exhibit enhanced urea or glycerol permeability, which is also consistent with the water-selective property of AQP4 in other species.
Aquaporin 4 is a Ubiquitously Expressed Isoform in the Dogfish (Squalus acanthias) Shark

PubMed Central

Cutler, Christopher P; MacIver, Bryce; Cramb, Gordon; Zeidel, Mark

2012-01-01

The dogfish ortholog of aquaporin 4 (AQP4) was amplified from cDNA using degenerate PCR followed by cloning and sequencing. The complete coding region was then obtained using 5′ and 3′ RACE techniques. Alignment of the sequence with AQP4 amino acid sequences from other species showed that dogfish AQP4 has high levels (up to 65.3%) of homology with higher vertebrate sequences but lower levels of homology to Agnathan (38.2%) or teleost (57.5%) fish sequences. Northern blotting indicated that the dogfish mRNA was approximately 3.2 kb and was highly expressed in the rectal gland (a shark fluid secretory organ). Semi-quantitative PCR further indicates that AQP4 is ubiquitous, being expressed in all tissues measured but at low levels in certain tissues, where the level in liver > gill > intestine. Manipulation of the external environmental salinity of groups of dogfish showed that when fish were acclimated in stages to 120% seawater (SW) or 75% SW, there was no change in AQP4 mRNA expression in either rectal gland, kidney, or esophagus/cardiac stomach. Whereas quantitative PCR experiments using the RNA samples from the same experiment, showed a significant 63.1% lower abundance of gill AQP4 mRNA expression in 120% SW-acclimated dogfish. The function of dogfish AQP4 was also determined by measuring the effect of the AQP4 expression in Xenopus laevis oocytes. Dogfish AQP4 expressing-oocytes, exhibited significantly increased osmotic water permeability (Pf) compared to controls, and this was invariant with pH. Permeability was not significantly reduced by treatment of oocytes with mercury chloride, as is also the case with AQP4 in other species. Similarly AQP4 expressing-oocytes did not exhibit enhanced urea or glycerol permeability, which is also consistent with the water-selective property of AQP4 in other species. PMID:22291652
DOE Office of Scientific and Technical Information (OSTI.GOV)

Bennetzen, Jeffrey L; Yang, Xiaohan; Ye, Chuyu

We generated a high-quality reference genome sequence for foxtail millet (Setaria italica). The ~400-Mb assembly covers ~80% of the genome and >95% of the gene space. The assembly was anchored to a 992-locus genetic map and was annotated by comparison with >1.3 million expressed sequence tag reads. We produced more than 580 million RNA-Seq reads to facilitate expression analyses. We also sequenced Setaria viridis, the ancestral wild relative of S. italica, and identified regions of differential single-nucleotide polymorphism density, distribution of transposable elements, small RNA content, chromosomal rearrangement and segregation distortion. The genus Setaria includes natural and cultivated species that demonstrate a wide capacity for adaptation. The genetic basis of this adaptation was investigated by comparing five sequenced grass genomes. We also used the diploid Setaria genome to evaluate the ongoing genome assembly of a related polyploid, switchgrass (Panicum virgatum).
Nucleotide sequencing analysis of a LEU gene of Candida maltosa which complements leuB mutation of Escherichia coli and leu2 mutation of Saccharomyces cerevisiae.

PubMed

Takagi, M; Kobayashi, N; Sugimoto, M; Fujii, T; Watari, J; Yano, K

1987-01-01

The expression of a LEU gene from Candida maltosa (designated as C-LEU2) isolated previously (Kawamura et al. 1983) was shown to be regulated, when transferred into Saccharomyces cerevisiae, by leucine and threonine in the medium, as in the case of LEU2 gene of S. cerevisiae. The coding region together with the regulatory region was subcloned and the nucleotide sequence was determined. When the sequence of the coding region was compared with that of LEU2, the homology was 72% for base pairs and 76% for deduced amino acids. Comparison of the regulatory region of C-LEU2 with those of LEU1 and LEU2 suggested a few short consensus sequences which are involved in regulation of gene expression by leucine and threonine in the medium.
Issues with RNA-seq analysis in non-model organisms: A salmonid example.

PubMed

Sundaram, Arvind; Tengs, Torstein; Grimholt, Unni

2017-10-01

High throughput sequencing (HTS) is useful for many purposes as exemplified by the other topics included in this special issue. The purpose of this paper is to look into the unique challenges of using this technology in non-model organisms where resources such as genomes, functional genome annotations or genome complexity provide obstacles not met in model organisms. To describe these challenges, we narrow our scope to RNA sequencing used to study differential gene expression in response to pathogen challenge. As a demonstration species we chose Atlantic salmon, which has a sequenced genome with poor annotation and an added complexity due to many duplicated genes. We find that our RNA-seq analysis pipeline deciphers between duplicates despite high sequence identity. However, annotation issues provide problems in linking differentially expressed genes to pathways. Also, comparing results between approaches and species are complicated due to lack of standardized annotation. Copyright © 2017 Elsevier Ltd. All rights reserved.
Construction and Evaluation of Normalized cDNA Libraries Enriched with Full-Length Sequences for Rapid Discovery of New Genes from Sisal (Agave sisalana Perr.) Different Developmental Stages

PubMed Central

Zhou, Wen-Zhao; Zhang, Yan-Mei; Lu, Jun-Ying; Li, Jun-Feng

2012-01-01

To provide a resource of sisal-specific expressed sequence data and facilitate this powerful approach in new gene research, the preparation of normalized cDNA libraries enriched with full-length sequences is necessary. Four libraries were produced with RNA pooled from Agave sisalana multiple tissues to increase efficiency of normalization and maximize the number of independent genes by SMART™ method and the duplex-specific nuclease (DSN). This procedure kept the proportion of full-length cDNAs in the subtracted/normalized libraries and dramatically enhanced the discovery of new genes. Sequencing of 3875 cDNA clones of libraries revealed 3320 unigenes with an average insert length about 1.2 kb, indicating that the non-redundancy of libraries was about 85.7%. These unigene functions were predicted by comparing their sequences to functional domain databases and extensively annotated with Gene Ontology (GO) terms. Comparative analysis of sisal unigenes and other plant genomes revealed that four putative MADS-box genes and knotted-like homeobox (knox) gene were obtained from a total of 1162 full-length transcripts. Furthermore, real-time PCR showed that the characteristics of their transcripts mainly depended on the tight expression regulation of a number of genes during the leaf and flower development. Analysis of individual library sequence data indicated that the pooled-tissue approach was highly effective in discovering new genes and preparing libraries for efficient deep sequencing. PMID:23202944
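The non-redundancy figure quoted in this abstract can be reproduced with one line of arithmetic. The sketch below assumes that "non-redundancy" means the number of unigenes divided by the number of sequenced clones; the abstract does not state the formula, and the function name is hypothetical.

```python
def non_redundancy_percent(unigenes, clones):
    """Share of sequenced clones that represent distinct unigenes, in percent."""
    return 100.0 * unigenes / clones


if __name__ == "__main__":
    # 3320 unigenes from 3875 sequenced cDNA clones, as reported in the abstract.
    print(f"{non_redundancy_percent(3320, 3875):.1f}%")
```

Under that assumption the result rounds to the 85.7% reported in the abstract.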
Identification and validation of differentially expressed transcripts by RNA-sequencing of formalin-fixed, paraffin-embedded (FFPE) lung tissue from patients with Idiopathic Pulmonary Fibrosis.

PubMed

Vukmirovic, Milica; Herazo-Maya, Jose D; Blackmon, John; Skodric-Trifunovic, Vesna; Jovanovic, Dragana; Pavlovic, Sonja; Stojsic, Jelena; Zeljkovic, Vesna; Yan, Xiting; Homer, Robert; Stefanovic, Branko; Kaminski, Naftali

2017-01-12

Idiopathic Pulmonary Fibrosis (IPF) is a lethal lung disease of unknown etiology. A major limitation in transcriptomic profiling of lung tissue in IPF has been a dependence on snap-frozen fresh tissues (FF). In this project we sought to determine whether genome scale transcript profiling using RNA Sequencing (RNA-Seq) could be applied to archived Formalin-Fixed Paraffin-Embedded (FFPE) IPF tissues. We isolated total RNA from 7 IPF and 5 control FFPE lung tissues and performed 50 base pair paired-end sequencing on Illumina HiSeq 2000. TopHat2 was used to map sequencing reads to the human genome. On average ~62 million reads (53.4% of ~116 million reads) were mapped per sample. 4,131 genes were differentially expressed between IPF and controls (1,920 increased and 2,211 decreased, FDR < 0.05). We compared our results to differentially expressed genes calculated from a previously published dataset generated from FF tissues analyzed on Agilent microarrays (GSE47460). The overlap of differentially expressed genes was very high (760 increased and 1,413 decreased, FDR < 0.05). Only 92 differentially expressed genes changed in opposite directions. Pathway enrichment analysis performed using MetaCore confirmed numerous IPF relevant genes and pathways including extracellular remodeling, TGF-beta, and WNT. Gene network analysis of MMP7, a highly differentially expressed gene in both datasets, revealed the same canonical pathways and gene network candidates in RNA-Seq and microarray data. For validation by NanoString nCounter® we selected 35 genes that had a fold change of 2 in at least one dataset (10 discordant, 10 significantly differentially expressed in one dataset only and 15 concordant genes). High concordance of fold change and FDR was observed for each type of the samples (FF vs FFPE) with both microarrays (r = 0.92) and RNA-Seq (r = 0.90) and the number of discordant genes was reduced to four.
Our results demonstrate that RNA sequencing of RNA obtained from archived FFPE lung tissues is feasible. The results obtained from FFPE tissue are highly comparable to FF tissues. The ability to perform RNA-Seq on archived FFPE IPF tissues should greatly enhance the availability of tissue biopsies for research in IPF.
Cloning, analysis and functional annotation of expressed sequence tags from the Earthworm Eisenia fetida

PubMed Central

Pirooznia, Mehdi; Gong, Ping; Guan, Xin; Inouye, Laura S; Yang, Kuan; Perkins, Edward J; Deng, Youping

2007-01-01

Background Eisenia fetida, commonly known as red wiggler or compost worm, belongs to the Lumbricidae family of the Annelida phylum. Little is known about its genome sequence although it has been extensively used as a test organism in terrestrial ecotoxicology. In order to understand its gene expression response to environmental contaminants, we cloned 4032 cDNAs or expressed sequence tags (ESTs) from two E. fetida libraries enriched with genes responsive to ten ordnance related compounds using suppressive subtractive hybridization-PCR. Results A total of 3144 good quality ESTs (GenBank dbEST accession number EH669363–EH672369 and EL515444–EL515580) were obtained from the raw clone sequences after cleaning. Clustering analysis yielded 2231 unique sequences including 448 contigs (from 1361 ESTs) and 1783 singletons. Comparative genomic analysis showed that 743 or 33% of the unique sequences shared high similarity with existing genes in the GenBank nr database. Provisional function annotation assigned 830 Gene Ontology terms to 517 unique sequences based on their homology with the annotated genomes of four model organisms Drosophila melanogaster, Mus musculus, Saccharomyces cerevisiae, and Caenorhabditis elegans. Seven percent of the unique sequences were further mapped to 99 Kyoto Encyclopedia of Genes and Genomes pathways based on their matching Enzyme Commission numbers. All the information is stored and retrievable at a highly performed, web-based and user-friendly relational database called EST model database or ESTMD version 2. Conclusion The ESTMD containing the sequence and annotation information of 4032 E. fetida ESTs is publicly accessible at . PMID:18047730
Analysis of Microbe-Associated Molecular Pattern-Responsive Synthetic Promoters with the Parsley Protoplast System.

PubMed

Kanofsky, Konstantin; Lehmeyer, Mona; Schulze, Jutta; Hehl, Reinhard

2016-01-01

Plants recognize pathogens by microbe-associated molecular patterns (MAMPs) and subsequently induce an immune response. The regulation of gene expression during the immune response depends largely on cis-sequences conserved in promoters of MAMP-responsive genes. These cis-sequences can be analyzed by constructing synthetic promoters linked to a reporter gene and by testing these constructs in transient expression systems. Here, the use of the parsley (Petroselinum crispum) protoplast system for analyzing MAMP-responsive synthetic promoters is described. The synthetic promoter consists of four copies of a potential MAMP-responsive cis-sequence cloned upstream of a minimal promoter and the uidA reporter gene. The reporter plasmid contains a second reporter gene, which is constitutively expressed and hence eliminates the requirement of a second plasmid used as a transformation control. The reporter plasmid is transformed into parsley protoplasts that are elicited by the MAMP Pep25. The MAMP responsiveness is validated by comparing the reporter gene activity from MAMP-treated and untreated cells and by normalizing reporter gene activity using the constitutively expressed reporter gene.

Global Gene Expression Patterns and Somatic Mutations in Sporadic Intracranial Aneurysms.

PubMed

Li, Zhili; Tan, Haibin; Shi, Yi; Huang, Guangfu; Wang, Zhenyu; Liu, Ling; Yin, Cheng; Wang, Qi

2017-04-01

High-throughput sequencing technologies can expand our understanding of the pathologic basis of intracranial aneurysms (IAs). Our study aimed to decipher the gene expression signature and genetic factors associated with IAs. We determined the gene expression levels of 3 cases of IAs by RNA sequencing. Bioinformatics analysis was conducted to identify the differentially expressed genes (DEGs) and uncover their biological function. In addition, whole genome sequencing was performed on an additional 6 cases of IAs to detect the potential somatic alterations in DEGs. Compared with the normal arterial tissue, 1709 genes were differentially expressed in IAs arterial tissue. The most significantly up-regulated gene and down-regulated gene, H19 and HIST1H3J, may be essential for tumorigenesis of IAs. Hub protein of IKBKG in protein-protein interaction network was probably involved in the inflammation process in aneurysms. Another 2 hub proteins, ACTB and MKI67IP, as well as up-regulated genes, might be abnormally activated in aneurysms and involved in the pathogenesis of IAs. Further whole genome sequencing and filtering yielded 4 candidate somatic single nucleotide variants including MUC3B, and BLM may be involved in the pathogenesis of IAs. Even so, our results do not support the hypothesis that somatic mutations occurred in the DEGs. Two-dimensional genomic data from transcriptome and whole genome sequencing indicated that no somatic mutations occurred in DEGs. In addition, 3 DEGs (IKBKG, ACTB, and MKI67IP) and 2 mutant genes (MUC3B and BLM) were essential in IAs. Copyright © 2017 Elsevier Inc. All rights reserved.
Characterization and expression profiles of MaACS and MaACO genes from mulberry (Morus alba L.)

PubMed Central

Liu, Chang-ying; Lü, Rui-hua; Li, Jun; Zhao, Ai-chun; Wang, Xi-ling; Diane, Umuhoza; Wang, Xiao-hong; Wang, Chuan-hong; Yu, Ya-sheng; Han, Shu-mei; Lu, Cheng; Yu, Mao-de

2014-01-01

1-Aminocyclopropane-1-carboxylic acid synthase (ACS) and 1-aminocyclopropane-1-carboxylic acid oxidase (ACO) are encoded by multigene families and are involved in fruit ripening by catalyzing the production of ethylene throughout the development of fruit. However, there are no reports on ACS or ACO genes in mulberry, partly because of the limited molecular research background. In this study, we have obtained five ACS gene sequences and two ACO gene sequences from Morus Genome Database. Sequence alignment and phylogenetic analysis of MaACO1 and MaACO2 showed that their amino acids are conserved compared with ACO proteins from other species. MaACS1 and MaACS2 are type I, MaACS3 and MaACS4 are type II, and MaACS5 is type III, with different C-terminal sequences. Quantitative reverse transcriptase polymerase chain reaction (qRT-PCR) expression analysis showed that the transcripts of MaACS genes were strongly expressed in fruit, and more weakly in other tissues. The expression of MaACO1 and MaACO2 showed different patterns in various mulberry tissues. MaACS and MaACO genes demonstrated two patterns throughout the development of mulberry fruit, and both of them were strongly up-regulated by abscisic acid (ABA) and ethephon. PMID:25001221
Frame-Insensitive Expression Cloning of Fluorescent Protein from Scolionema suvaense.

PubMed

Horiuchi, Yuki; Laskaratou, Danai; Sliwa, Michel; Ruckebusch, Cyril; Hatori, Kuniyuki; Mizuno, Hideaki; Hotta, Jun-Ichi

2018-01-26

Expression cloning from cDNA is an important technique for acquiring genes encoding novel fluorescent proteins. However, the probability of in-frame cDNA insertion following the first start codon of the vector is normally only 1/3, which is a cause of low cloning efficiency. To overcome this issue, we developed a new expression plasmid vector, pRSET-TriEX, in which transcriptional slippage was induced by introducing a DNA sequence of (dT)14 next to the first start codon of pRSET. The effectiveness of frame-insensitive cloning was validated by inserting the gene encoding eGFP with all three possible frames to the vector. After transformation with one of these plasmids, E. coli cells expressed eGFP with no significant difference in the expression level. The pRSET-TriEX vector was then used for expression cloning of a novel fluorescent protein from Scolionema suvaense. We screened 3658 E. coli colonies transformed with pRSET-TriEX containing Scolionema suvaense cDNA, and found one colony expressing a novel green fluorescent protein, ScSuFP. The highest score in protein sequence similarity was 42% with the chain c of multi-domain green fluorescent protein like protein "ember" from Anthoathecata sp. Variations in the N- and/or C-terminal sequence of ScSuFP compared to other fluorescent proteins indicate that the expression cloning, rather than the sequence similarity-based methods, was crucial for acquiring the gene encoding ScSuFP. The absorption maximum was at 498 nm, with an extinction efficiency of 1.17 × 10⁵ M⁻¹·cm⁻¹. The emission maximum was at 511 nm and the fluorescence quantum yield was determined to be 0.6. Pseudo-native gel electrophoresis showed that the protein forms obligatory homodimers.
Generation and analysis of expressed sequence tags in the extreme large genomes Lilium and Tulipa.

PubMed

Shahin, Arwa; van Kaauwen, Martijn; Esselink, Danny; Bargsten, Joachim W; van Tuyl, Jaap M; Visser, Richard G F; Arens, Paul

2012-11-20

Bulbous flowers such as lily and tulip (Liliaceae family) are monocot perennial herbs that are economically very important ornamental plants worldwide. However, hardly any genetic studies have been performed and genomic resources are lacking. To build genomic resources and develop tools to speed up the breeding in both crops, next generation sequencing was implemented. We sequenced and assembled transcriptomes of four lily and five tulip genotypes using 454 pyro-sequencing technology. We developed the first set of 81,791 contigs with an average length of 514 bp for tulip, and enriched the very limited number of 3,329 available ESTs (Expressed Sequence Tags) for lily with 52,172 contigs with an average length of 555 bp. The contigs together with singletons covered on average 37% of lily and 39% of tulip estimated transcriptome. Mining lily and tulip sequence data for SSRs (Simple Sequence Repeats) showed that di-nucleotide repeats were twice as abundant in UTRs (UnTranslated Regions) as in coding regions, while tri-nucleotide repeats were equally spread over coding and UTR regions. Two sets of single nucleotide polymorphism (SNP) markers suitable for high throughput genotyping were developed. In the first set, no SNPs flanking the target SNP (50 bp on either side) were allowed. In the second set, one SNP in the flanking regions was allowed, which resulted in a 2- to 3-fold increase in SNP marker numbers compared with the first set. Orthologous groups between the two flower bulbs: lily and tulip (12,017 groups) and among the three monocot species: lily, tulip, and rice (6,900 groups) were determined using OrthoMCL. Orthologous groups were screened for common SNP markers and EST-SSRs to study synteny between lily and tulip, which resulted in 113 common SNP markers and 292 common EST-SSRs. Lily and tulip contigs generated were annotated and described according to Gene Ontology terminology.
Two transcriptome sets were built that are valuable resources for marker development, comparative genomic studies and candidate gene approaches. Next generation sequencing of leaf transcriptome is very effective; however, deeper sequencing and using more tissues and stages is advisable for extended comparative studies.
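The two SNP marker sets described above differ only in how many neighbouring SNPs are tolerated within 50 bp of the target SNP. A minimal Python sketch of that filter follows, assuming SNPs are given as unique integer positions on a single contig; the function name `select_snp_markers` and the example positions are hypothetical and are not taken from the study.

```python
from bisect import bisect_left, bisect_right


def select_snp_markers(positions, flank=50, max_flanking=0):
    """Keep SNPs that have at most `max_flanking` other SNPs within `flank` bp.

    `positions` are SNP coordinates on one contig (illustrative input format).
    """
    ordered = sorted(set(positions))
    selected = []
    for pos in ordered:
        # Index range of all SNPs inside the window [pos - flank, pos + flank].
        lo = bisect_left(ordered, pos - flank)
        hi = bisect_right(ordered, pos + flank)
        neighbours = hi - lo - 1  # exclude the target SNP itself
        if neighbours <= max_flanking:
            selected.append(pos)
    return selected


# Hypothetical SNP positions on a single contig.
snps = [100, 130, 400, 700, 720, 745, 1000]
strict = select_snp_markers(snps, max_flanking=0)   # first set: no flanking SNPs
relaxed = select_snp_markers(snps, max_flanking=1)  # second set: one flanking SNP allowed
print(strict)
print(relaxed)
```

Relaxing `max_flanking` from 0 to 1 can only add markers, which is consistent with the larger second set reported in the abstract; the size of the increase depends on the real SNP density, which this toy input does not model.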
Transcript profiling reveals expression differences in wild-type and glabrous soybean lines

PubMed Central

2011-01-01

Background Trichome hairs affect diverse agronomic characters such as seed weight and yield, prevent insect damage and reduce loss of water but their molecular control has not been extensively studied in soybean. Several detailed models for trichome development have been proposed for Arabidopsis thaliana, but their applicability to important crops such as cotton and soybean is not fully known. Results Two high throughput transcript sequencing methods, Digital Gene Expression (DGE) Tag Profiling and RNA-Seq, were used to compare the transcriptional profiles in wild-type (cv. Clark standard, CS) and a mutant (cv. Clark glabrous, i.e., trichomeless or hairless, CG) soybean isoline that carries the dominant P1 allele. DGE data and RNA-Seq data were mapped to the cDNAs (Glyma models) predicted from the reference soybean genome, Williams 82. Extending the model length by 250 bp at both ends resulted in significantly more matches of authentic DGE tags, indicating that many of the predicted gene models are prematurely truncated at the 5' and 3' UTRs. The genome-wide comparative study of the transcript profiles of the wild-type versus mutant line revealed a number of differentially expressed genes. One gene highly expressed in wild-type soybean, Glyma04g35130, was of interest as it has high homology to the cotton gene GhRDL1, which has been identified as being involved in cotton fiber initiation and is a member of the BURP protein family. Sequence comparison of Glyma04g35130 between Williams 82 and our sequences derived from the CS and CG isolines revealed various SNPs and indels, including addition of one nucleotide C in CG and insertion of ~60 bp in the third exon of CS, that cause a frameshift mutation and premature truncation of peptides in both lines as compared to Williams 82.
Conclusion Although not a candidate for the P1 locus, a BURP family member (Glyma04g35130) from soybean has been shown to be abundantly expressed in the CS line and very weakly expressed in the glabrous CG line. RNA-Seq and DGE data are compared and provide experimental data on the expression of predicted soybean gene models as well as an overview of the genes expressed in young shoot tips of two closely related isolines. PMID:22029708
Comparative transcriptome analysis of microsclerotia development in Nomuraea rileyi

PubMed Central

2013-01-01

Background Nomuraea rileyi is used as an environmentally friendly biopesticide. However, mass production and commercialization of this organism are limited due to its fastidious growth and sporulation requirements. When cultured in amended medium, we found that N. rileyi could produce microsclerotia bodies, replacing conidiophores as the infectious agent. However, little is known about the genes involved in microsclerotia development. In the present study, the transcriptomes were analyzed using next-generation sequencing technology to find the genes involved in microsclerotia development. Results A total of 4.69 Gb of clean nucleotides comprising 32,061 sequences was obtained, and 20,919 sequences were annotated (about 65%). Among the annotated sequences, only 5928 were annotated with 34 gene ontology (GO) functional categories, and 12,778 sequences were mapped to 165 pathways by searching against the Kyoto Encyclopedia of Genes and Genomes pathway (KEGG) database. Furthermore, we assessed the transcriptomic differences between cultures grown in minimal and amended medium. In total, 4808 sequences were found to be differentially expressed; 719 differentially expressed unigenes were assigned to 25 GO classes and 1888 differentially expressed unigenes were assigned to 161 KEGG pathways, including 25 enrichment pathways. Subsequently, we examined the up-regulated or uniquely expressed genes following amended medium treatment, which were also expressed on the enrichment pathway, and found that most of them participated in mediating oxidative stress homeostasis. To elucidate the role of oxidative stress in microsclerotia development, we analyzed the diversification of unigenes using quantitative reverse transcription-PCR (RT-qPCR). Conclusion Our findings suggest that oxidative stress occurs during microsclerotia development, along with a broad metabolic activity change. Our data provide the most comprehensive sequence resource available for the study of N. rileyi. We believe that the transcriptome datasets will serve as an important public information platform to accelerate studies on N. rileyi microsclerotia. PMID:23777366
Transcriptome Assembly, Gene Annotation and Tissue Gene Expression Atlas of the Rainbow Trout

PubMed Central

Salem, Mohamed; Paneru, Bam; Al-Tobasei, Rafet; Abdouni, Fatima; Thorgaard, Gary H.; Rexroad, Caird E.; Yao, Jianbo

2015-01-01

Efforts to obtain a comprehensive genome sequence for rainbow trout are ongoing and will be complemented by transcriptome information that will enhance genome assembly and annotation. Previously, transcriptome reference sequences were reported using data from different sources. Although the previous work added a great wealth of sequences, a complete and well-annotated transcriptome is still needed. In addition, gene expression in different tissues was not completely addressed in the previous studies. In this study, non-normalized cDNA libraries were sequenced from 13 different tissues of a single doubled haploid rainbow trout from the same source used for the rainbow trout genome sequence. A total of ~1.167 billion paired-end reads were de novo assembled using the Trinity RNA-Seq assembler, yielding 474,524 contigs > 500 base-pairs. Of them, 287,593 had homologies to the NCBI non-redundant protein database. The longest contig of each cluster was selected as a reference, yielding 44,990 representative contigs. A total of 4,146 contigs (9.2%), including 710 full-length sequences, did not match any mRNA sequences in the current rainbow trout genome reference. Mapping reads to the reference genome identified an additional 11,843 transcripts not annotated in the genome. A digital gene expression atlas revealed 7,678 housekeeping and 4,021 tissue-specific genes. Expression of about 16,000–32,000 genes (35–71% of the identified genes) accounted for basic and specialized functions of each tissue. White muscle and stomach had the least complex transcriptomes, with high percentages of their total mRNA contributed by a small number of genes. Brain, testis and intestine, in contrast, had complex transcriptomes, with large numbers of genes involved in their expression patterns. This study provides comprehensive de novo transcriptome information that is suitable for functional and comparative genomics studies in rainbow trout, including annotation of the genome. PMID:25793877
Sooty mangabey genome sequence provides insight into AIDS resistance in a natural SIV host.

PubMed

Palesch, David; Bosinger, Steven E; Tharp, Gregory K; Vanderford, Thomas H; Paiardini, Mirko; Chahroudi, Ann; Johnson, Zachary P; Kirchhoff, Frank; Hahn, Beatrice H; Norgren, Robert B; Patel, Nirav B; Sodora, Donald L; Dawoud, Reem A; Stewart, Caro-Beth; Seepo, Sara M; Harris, R Alan; Liu, Yue; Raveendran, Muthuswamy; Han, Yi; English, Adam; Thomas, Gregg W C; Hahn, Matthew W; Pipes, Lenore; Mason, Christopher E; Muzny, Donna M; Gibbs, Richard A; Sauter, Daniel; Worley, Kim; Rogers, Jeffrey; Silvestri, Guido

2018-01-03

In contrast to infections with human immunodeficiency virus (HIV) in humans and simian immunodeficiency virus (SIV) in macaques, SIV infection of a natural host, sooty mangabeys (Cercocebus atys), is non-pathogenic despite high viraemia. Here we sequenced and assembled the genome of a captive sooty mangabey. We conducted genome-wide comparative analyses of transcript assemblies from C. atys and AIDS-susceptible species, such as humans and macaques, to identify candidates for host genetic factors that influence susceptibility. We identified several immune-related genes in the genome of C. atys that show substantial sequence divergence from macaques or humans. One of these sequence divergences, a C-terminal frameshift in the toll-like receptor-4 (TLR4) gene of C. atys, is associated with a blunted in vitro response to TLR-4 ligands. In addition, we found a major structural change in exons 3-4 of the immune-regulatory protein intercellular adhesion molecule 2 (ICAM-2); expression of this variant leads to reduced cell surface expression of ICAM-2. These data provide a resource for comparative genomic studies of HIV and/or SIV pathogenesis and may help to elucidate the mechanisms by which SIV-infected sooty mangabeys avoid AIDS.
Sooty mangabey genome sequence provides insight into AIDS resistance in a natural SIV host

PubMed Central

Palesch, David; Bosinger, Steven E.; Tharp, Gregory K.; Vanderford, Thomas H.; Paiardini, Mirko; Chahroudi, Ann; Johnson, Zachary P.; Kirchhoff, Frank; Hahn, Beatrice H.; Norgren, Robert B.; Patel, Nirav B.; Sodora, Donald L.; Dawoud, Reem A.; Stewart, Caro-Beth; Seepo, Sara M.; Harris, R. Alan; Liu, Yue; Raveendran, Muthuswamy; Han, Yi; English, Adam; Thomas, Gregg W. C.; Hahn, Matthew W.; Pipes, Lenore; Mason, Christopher E.; Muzny, Donna M.; Gibbs, Richard A.; Sauter, Daniel; Worley, Kim; Rogers, Jeffrey; Silvestri, Guido

2018-01-01

In contrast to infections with human immunodeficiency virus (HIV) in humans and simian immunodeficiency virus (SIV) in macaques, SIV infection of a natural host, sooty mangabeys (Cercocebus atys), is non-pathogenic despite high viraemia. Here we sequenced and assembled the genome of a captive sooty mangabey. We conducted genome-wide comparative analyses of transcript assemblies from C. atys and AIDS-susceptible species, such as humans and macaques, to identify candidates for host genetic factors that influence susceptibility. We identified several immune-related genes in the genome of C. atys that show substantial sequence divergence from macaques or humans. One of these sequence divergences, a C-terminal frameshift in the toll-like receptor-4 (TLR4) gene of C. atys, is associated with a blunted in vitro response to TLR-4 ligands. In addition, we found a major structural change in exons 3–4 of the immune-regulatory protein intercellular adhesion molecule 2 (ICAM-2); expression of this variant leads to reduced cell surface expression of ICAM-2. These data provide a resource for comparative genomic studies of HIV and/or SIV pathogenesis and may help to elucidate the mechanisms by which SIV-infected sooty mangabeys avoid AIDS. PMID:29300007
Expression Differentiation Is Constrained to Low-Expression Proteins over Ecological Timescales

PubMed Central

Margres, Mark J.; Wray, Kenneth P.; Seavy, Margaret; McGivern, James J.; Herrera, Nathanael D.; Rokyta, Darin R.

2016-01-01

Protein expression level is one of the strongest predictors of protein sequence evolutionary rate, with high-expression protein sequences evolving at slower rates than low-expression protein sequences largely because of constraints on protein folding and function. Expression evolutionary rates also have been shown to be negatively correlated with expression level across human and mouse orthologs over relatively long divergence times (i.e., ∼100 million years). Long-term evolutionary patterns, however, often cannot be extrapolated to microevolutionary processes (and vice versa), and whether this relationship holds for traits evolving under directional selection within a single species over ecological timescales (i.e., <5000 years) is unknown and not necessarily expected. Expression is a metabolically costly process, and the expression level of a particular protein is predicted to be a tradeoff between the benefit of its function and the costs of its expression. Selection should drive the expression level of all proteins close to values that maximize fitness, particularly for high-expression proteins because of the increased energetic cost of production. Therefore, stabilizing selection may reduce the amount of standing expression variation for high-expression proteins, and in combination with physiological constraints that may place an upper bound on the range of beneficial expression variation, these constraints could severely limit the availability of beneficial expression variants. To determine whether rapid-expression evolution was restricted to low-expression proteins owing to these constraints on highly expressed proteins over ecological timescales, we compared venom protein expression levels across mainland and island populations for three species of pit vipers. We detected significant differentiation in protein expression levels in two of the three species and found that rapid-expression differentiation was restricted to low-expression proteins. 
Our results suggest that various constraints on high-expression proteins reduce the availability of beneficial expression variants relative to low-expression proteins, enabling low-expression proteins to evolve and potentially lead to more rapid adaptation. PMID:26546003
Second generation sequencing of microRNA in Human Bone Cells treated with Parathyroid Hormone or Dexamethasone.

PubMed

Laxman, Navya; Rubin, Carl-Johan; Mallmin, Hans; Nilsson, Olle; Tellgren-Roth, Christian; Kindmark, Andreas

2016-03-01

We investigated the impact of treatment with parathyroid hormone (PTH) and dexamethasone (DEX) for 2 and 24h by RNA sequencing of miRNAs in primary human bone (HOB) cells. A total of 207 million reads were obtained, and normalized absolute expression was retrieved for the 373 most abundant miRNAs. In naïve control cells, 7 miRNAs were differentially expressed (FDR<0.05) between the two time points. Ten miRNAs exhibited differential expression (FDR<0.05) across the two time points and treatments after adjusting for expression in controls and were selected for downstream analyses. Results show significant effects on miRNA expression when comparing PTH with DEX at 2h, with even more pronounced effects at 24h. Interestingly, several miRNAs exhibiting differences in expression are predicted to target genes involved in bone metabolism, e.g. miR-30c2, miR-203 and miR-205 targeting RUNX2, and miR-320 targeting β-catenin (CTNNB1) mRNA expression. CTNNB1 and RUNX2 levels were decreased after DEX treatment and increased after PTH treatment. Our analysis also identified 2 putative novel miRNAs in PTH and DEX treated cells at 24h. RNA sequencing showed that PTH and DEX treatment affect miRNA expression in HOB cells and that regulated miRNAs in turn are correlated with expression levels of key genes involved in bone metabolism. Copyright © 2016 Elsevier Inc. All rights reserved.
Comparative Transcriptomes and EVO-DEVO Studies Depending on Next Generation Sequencing.

PubMed

Liu, Tiancheng; Yu, Lin; Liu, Lei; Li, Hong; Li, Yixue

2015-01-01

High-throughput technology has driven progress in omics studies, including genomics and transcriptomics. We review the improvements in comparative omics studies attributable to the high-throughput measurements of next generation sequencing technology. Comparative genomics has been successfully applied to evolutionary analysis, while comparative transcriptomics is used to compare expression profiles from two subjects by differential expression or differential coexpression, which enables its application in evolutionary developmental biology (EVO-DEVO) studies. EVO-DEVO studies focus on the evolutionary pressure affecting morphogenesis during development, and previous work has sought to identify the most conserved stages of embryonic development. Earlier measurements in these studies were based on morphological similarity at the macroscopic level, whereas new technology enables the detection of similarity in molecular mechanisms. Evolutionary models of embryonic development, including the "funnel-like" model and the "hourglass" model, have been evaluated by combining these new comparative transcriptomic methods with prior comparative genomic information. Although the technology has brought EVO-DEVO studies into a new era, technological and material limitations still exist, and further investigations require more careful study design and procedures.
Generation and analysis of expressed sequence tags from the bone marrow of Chinese Sika deer.

PubMed

Yao, Baojin; Zhao, Yu; Zhang, Mei; Li, Juan

2012-03-01

Sika deer is one of the best-known and most highly valued animals of China. Despite its economic, cultural, and biological importance, there has not been a large-scale sequencing project for Sika deer to date. With the ultimate goal of sequencing the complete genome of this organism, we first established a bone marrow cDNA library for Sika deer and generated a total of 2,025 reads. After processing the sequences, 2,017 high-quality expressed sequence tags (ESTs) were obtained. These ESTs were assembled into 1,157 unigenes, including 238 contigs and 919 singletons. Comparative analyses indicated that 888 (76.75%) of the unigenes had significant matches to sequences in the non-redundant protein database. In addition to highly expressed genes, such as stearoyl-CoA desaturase, cytochrome c oxidase, adipocyte-type fatty acid-binding protein, adiponectin and thymosin beta-4, we also obtained vascular endothelial growth factor-A and heparin-binding growth-associated molecule, both of which are of great importance for angiogenesis research. There were 244 (21.09%) unigenes with no significant match to any sequence in current protein or nucleotide databases, and these sequences may represent genes with unknown function in Sika deer. Open reading frame analysis of the sequences was performed using the getorf program. In addition, the sequences were functionally classified using the gene ontology hierarchy, clusters of orthologous groups of proteins and Kyoto encyclopedia of genes and genomes databases. Analysis of ESTs described in this paper provides an important resource for the transcriptome exploration of Sika deer, and will also facilitate further studies on functional genomics, gene discovery and genome annotation of Sika deer.
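The unigene counts and percentages quoted in the abstract above can be cross-checked with a few lines of arithmetic. The Python sketch below uses only the figures from the abstract; the variable names and the `pct` helper are illustrative.

```python
# Figures quoted in the abstract.
contigs = 238
singletons = 919
unigenes = contigs + singletons  # reported as 1,157

protein_matches = 888  # unigenes with a significant non-redundant protein match
no_match = 244         # unigenes with no match in protein or nucleotide databases


def pct(part, whole):
    """Percentage of `whole` represented by `part`, rounded to two decimals."""
    return round(100 * part / whole, 2)


print(unigenes)                        # total unigenes
print(pct(protein_matches, unigenes))  # share with a protein match
print(pct(no_match, unigenes))         # share with no match at all
print(unigenes - protein_matches - no_match)  # unigenes in neither category
```

Both percentages reproduce the reported 76.75% and 21.09%. The two categories sum to 1,132, so 25 unigenes fall into neither; the abstract does not say how these were classified.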
Brassica ASTRA: an integrated database for Brassica genomic research.

PubMed

Love, Christopher G; Robinson, Andrew J; Lim, Geraldine A C; Hopkins, Clare J; Batley, Jacqueline; Barker, Gary; Spangenberg, German C; Edwards, David

2005-01-01

Brassica ASTRA is a public database for genomic information on Brassica species. The database incorporates expressed sequences with Swiss-Prot and GenBank comparative sequence annotation as well as secondary Gene Ontology (GO) annotation derived from the comparison with Arabidopsis TAIR GO annotations. Simple sequence repeat molecular markers are identified within resident sequences and mapped onto the closely related Arabidopsis genome sequence. Bacterial artificial chromosome (BAC) end sequences derived from the Multinational Brassica Genome Project are also mapped onto the Arabidopsis genome sequence enabling users to identify candidate Brassica BACs corresponding to syntenic regions of Arabidopsis. This information is maintained in a MySQL database with a web interface providing the primary means of interrogation. The database is accessible at http://hornbill.cspp.latrobe.edu.au.
Comparative RNA-sequencing of the acarbose producer Actinoplanes sp. SE50/110 cultivated in different growth media.

PubMed

Schwientek, Patrick; Wendler, Sergej; Neshat, Armin; Eirich, Christina; Rückert, Christian; Klein, Andreas; Wehmeier, Udo F; Kalinowski, Jörn; Stoye, Jens; Pühler, Alfred

2013-08-20

Actinoplanes sp. SE50/110 is known as the producer of the alpha-glucosidase inhibitor acarbose, a potent drug in the treatment of type-2 diabetes mellitus. We conducted the first whole transcriptome analysis of Actinoplanes sp. SE50/110, using RNA-sequencing technology for comparative gene expression studies between cells grown in maltose minimal medium, maltose minimal medium with trace elements, and glucose complex medium. We first studied the behavior of Actinoplanes sp. SE50/110 cultivations in these three media and found that the different media had a significant impact on growth rate and in particular on acarbose production. It was demonstrated that Actinoplanes sp. SE50/110 grew well in all three media, but acarbose biosynthesis was only observed in cultures grown in maltose minimal medium with and without trace elements. When comparing the expression profiles between the maltose minimal media with and without trace elements, only a few significantly differentially expressed genes were found, which mainly code for uptake systems of metal ions provided in the trace element solution. In contrast, the comparison of expression profiles from maltose minimal medium and glucose complex medium revealed a large number of differentially expressed genes, of which the most conspicuous genes account for iron storage and uptake. Furthermore, the acarbose gene cluster was found to be highly expressed in maltose-containing media and almost silent in the glucose-containing medium. In addition, a putative antibiotic biosynthesis gene cluster was found to be expressed similarly to the acarbose cluster. Copyright © 2012 Elsevier B.V. All rights reserved.
Gene expression analysis of E. coli strains provides insights into the role of gene regulation in diversification

PubMed Central

Vital, Marius; Chai, Benli; Østman, Bjørn; Cole, James; Konstantinidis, Konstantinos T; Tiedje, James M

2015-01-01

Escherichia coli spans a genetic continuum from enteric strains to several phylogenetically distinct, atypical lineages that are rare in humans, but more common in extra-intestinal environments. To investigate the link between gene regulation, phylogeny and diversification in this species, we analyzed global gene expression profiles of four strains representing distinct evolutionary lineages, including a well-studied laboratory strain, a typical commensal (enteric) strain and two environmental strains. RNA-Seq was employed to compare the whole transcriptomes of strains grown under batch, chemostat and starvation conditions. Highly differentially expressed genes showed a significantly lower nucleotide sequence identity compared with other genes, indicating that gene regulation and coding sequence conservation are directly connected. Overall, distances between the strains based on gene expression profiles were largely dependent on the culture condition and did not reflect phylogenetic relatedness. Expression differences of commonly shared genes (all four strains) and E. coli core genes were consistently smaller between strains characterized by more similar primary habitats. For instance, environmental strains exhibited increased expression of stress defense genes under carbon-limited growth and entered a more pronounced survival-like phenotype during starvation compared with other strains, which stayed more alert for substrate scavenging and catabolism during no-growth conditions. Since those environmental strains show similar genetic distance to each other and to the other two strains, these findings cannot be simply attributed to genetic relatedness but suggest physiological adaptations. Our study provides new insights into ecologically relevant gene-expression and underscores the role of (differential) gene regulation for the diversification of the model bacterial species. PMID:25343512
Gene expression profiling in the hippocampus of learned helpless and nonhelpless rats.

PubMed

Kohen, R; Kirov, S; Navaja, G P; Happe, H Kevin; Hamblin, M W; Snoddy, J R; Neumaier, J F; Petty, F

2005-01-01

In the learned helplessness (LH) animal model of depression, failure to attempt escape from avoidable environmental stress, LH, indicates behavioral despair, whereas nonhelpless (NH) behavior reflects behavioral resilience to the effects of environmental stress. Comparing hippocampal gene expression with large-scale oligonucleotide microarrays, we found that stress-resilient (NH) rats, although behaviorally indistinguishable from controls, showed a distinct gene expression profile compared to LH, sham stressed, and naïve control animals. Genes that were confirmed as differentially expressed in the NH group by quantitative PCR strongly correlated in their levels of expression across all four animal groups. Differential expression could not be confirmed at the protein level. We identified several shared degenerate sequence motifs in the 3' untranslated region (3'UTR) of differentially expressed genes that could be a factor in this tight correlation of expression levels among differentially expressed genes.
Preparing and Analyzing Expressed Sequence Tags (ESTs) Library for the Mammary Tissue of Local Turkish Kivircik Sheep

PubMed Central

Omeroglu Ulu, Zehra; Ulu, Salih; Un, Cemal; Ozdem Oztabak, Kemal; Altunatmaz, Kemal

2017-01-01

Kivircik sheep is an important local Turkish breed, valued for its meat quality and milk productivity. The aim of this study was to analyze gene expression profiles at both prenatal and postnatal stages in the Kivircik sheep. To this end, two different cDNA libraries were constructed from mammary gland tissue taken from the same Kivircik sheep at prenatal and postnatal stages. A total of 3072 colonies randomly selected from the two libraries were sequenced to develop a sheep EST collection. We used the Phred/Phrap programs to analyze the raw ESTs, and readable EST sequences were assembled with the CAP3 software. Putative functions of all unique sequences were determined, and statistical analysis was performed, with Geneious software. A total of 422 ESTs with over 80% similarity to known sequences of other organisms in NCBI were classified by Gene Ontology (GO) category using the Panther database. By comparing gene expression profiles, we observed some putative genes that may be related to reproductive performance or play important roles in milk synthesis and secretion. A total of 2414 ESTs have been deposited in the NCBI GenBank database (GW996847–GW999260). The EST data in this study provide a new source of information for functional genomic studies of sheep. PMID:28239610
Delimiting regulatory sequences of the Drosophila melanogaster Ddc gene.

PubMed Central

Hirsh, J; Morgan, B A; Scholnick, S B

1986-01-01

We delimited sequences necessary for in vivo expression of the Drosophila melanogaster dopa decarboxylase gene Ddc. The expression of in vitro-altered genes was assayed following germ line integration via P-element vectors. Sequences between -209 and -24 were necessary for normally regulated expression, although genes lacking these sequences could be expressed at 10 to 50% of wild-type levels at specific developmental times. These genes showed components of normal developmental expression, which suggests that they retain some regulatory elements. All Ddc genes lacking the normal immediate 5'-flanking sequences were grossly deficient in larval central nervous system expression. Thus, this upstream region must contain at least one element necessary for this expression. A mutated Ddc gene without a normal TATA box-like sequence used the normal RNA start points, indicating that this sequence is not required for start point specificity. PMID:3099170
Reducing DNA context dependence in bacterial promoters

PubMed Central

Carr, Swati B.; Densmore, Douglas M.

2017-01-01

Variation in the DNA sequence upstream of bacterial promoters is known to affect the expression levels of the products they regulate, sometimes dramatically. While neutral synthetic insulator sequences have been found to buffer promoters from upstream DNA context, there are no established methods for designing effective insulator sequences with predictable effects on expression levels. We address this problem with Degenerate Insulation Screening (DIS), a novel method based on a randomized 36-nucleotide insulator library and a simple, high-throughput, flow-cytometry-based screen that randomly samples from a library of 4^36 potential insulated promoters. The results of this screen can then be compared against a reference uninsulated device to select a set of insulated promoters providing a precise level of expression. We verify this method by insulating the constitutive, inducible, and repressible promoters of a four-transcriptional-unit inverter (NOT-gate) circuit, finding both that order dependence is largely eliminated by insulation and that circuit performance is also significantly improved, with a 5.8-fold mean improvement in on/off ratio. PMID:28422998
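As a brief illustration of why such a screen can only sample the library, a fully randomized 36-nucleotide region with four possible bases per position has 4^36 distinct sequences. The sketch below computes that number in Python; the function name `library_size` is illustrative and not from the paper.

```python
def library_size(n_positions, alphabet_size=4):
    """Number of distinct sequences in a fully degenerate region."""
    return alphabet_size ** n_positions


size = library_size(36)
print(size)           # exact integer count of possible 36-nt insulators
print(f"{size:.2e}")  # the same count in scientific notation
```

The count is roughly 4.7 × 10^21, far beyond what any flow-cytometry screen can enumerate, which is consistent with the abstract's description of random sampling from the library.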

miRNAs involved in the development and differentiation of fertile and sterile flowers in Viburnum macrocephalum f. keteleeri.

PubMed

Li, Weixing; He, Zhichong; Zhang, Li; Lu, Zhaogeng; Xu, Jing; Cui, Jiawen; Wang, Li; Jin, Biao

2017-10-13

Sterile and fertile flowers are important evolutionary developmental phenotypes in angiosperm flowers. The development of floral organs, critical in angiosperm reproduction, is regulated by microRNAs (miRNAs). However, the mechanisms underpinning the miRNA regulation of the differentiation and development of sterile and fertile flowers remain unclear. Here, based on investigations of the morphological differences between fertile and sterile flowers, we used high-throughput sequencing to characterize the miRNAs in the differentiated floral organs of Viburnum macrocephalum f. keteleeri. We identified 49 known miRNAs and 67 novel miRNAs by small RNA (sRNA) sequencing and bioinformatics analysis, and 17 of these known and novel miRNA precursors were validated by polymerase chain reaction (PCR) and Sanger sequencing. Furthermore, by comparing the sequencing results of two sRNA libraries, we found that 30 known and 39 novel miRNA sequences were differentially expressed, and 35 were upregulated and 34 downregulated in sterile compared with fertile flowers. Combined with their predicted targets, the potential roles of miRNAs in V. macrocephalum f. keteleeri flowers include involvement in floral organogenesis, cell proliferation, hormonal pathways, and stress responses. miRNA precursors and targets were further validated by quantitative real-time PCR (qRT-PCR). Specifically, miR156a-5p, miR156g, and miR156j expression levels were significantly higher in fertile flowers than in sterile flowers, while SPL genes displayed the opposite expression pattern. Considering that the targets of miR156 are predicted to be SPL genes, we propose that miR156 may be involved in the regulation of stamen development in V. macrocephalum f. keteleeri. We identified miRNAs differentially expressed between fertile and sterile flowers in V. macrocephalum f. keteleeri and provided new insights into the important regulatory roles of miRNAs in the differentiation and development of fertile and sterile flowers.
Grizzly bear corticosteroid binding globulin: Cloning and serum protein expression.

PubMed

Chow, Brian A; Hamilton, Jason; Alsop, Derek; Cattet, Marc R L; Stenhouse, Gordon; Vijayan, Mathilakath M

2010-06-01

Serum corticosteroid levels are routinely measured as markers of stress in wild animals. However, corticosteroid levels rise rapidly in response to the acute stress of capture and restraint for sampling, limiting its use as an indicator of chronic stress. We hypothesized that serum corticosteroid binding globulin (CBG), the primary transport protein for corticosteroids in circulation, may be a better marker of the stress status prior to capture in grizzly bears (Ursus arctos). To test this, a full-length CBG cDNA was cloned and sequenced from grizzly bear testis and polyclonal antibodies were generated for detection of this protein in bear sera. The deduced nucleotide and protein sequences were 1218 bp and 405 amino acids, respectively. Multiple sequence alignments showed that grizzly bear CBG (gbCBG) was 90% and 83% identical to the dog CBG nucleotide and amino acid sequences, respectively. The affinity purified rabbit gbCBG antiserum detected grizzly bear but not human CBG. There were no sex differences in serum total cortisol concentration, while CBG expression was significantly higher in adult females compared to males. Serum cortisol levels were significantly higher in bears captured by leg-hold snare compared to those captured by remote drug delivery from helicopter. However, serum CBG expression between these two groups did not differ significantly. Overall, serum CBG levels may be a better marker of chronic stress, especially because this protein is not modulated by the stress of capture and restraint in grizzly bears. Copyright 2010 Elsevier Inc. All rights reserved.
Insights into rubber biosynthesis from transcriptome analysis of Hevea brasiliensis latex.

PubMed

Chow, Keng-See; Wan, Kiew-Lian; Isa, Mohd Noor Mat; Bahari, Azlina; Tan, Siang-Hee; Harikrishna, K; Yeang, Hoong-Yeet

2007-01-01

Hevea brasiliensis is the most widely cultivated species for commercial production of natural rubber (cis-polyisoprene). In this study, 10,040 expressed sequence tags (ESTs) were generated from the latex of the rubber tree, which represents the cytoplasmic content of a single cell type, in order to analyse the latex transcription profile with emphasis on rubber biosynthesis-related genes. A total of 3,441 unique transcripts (UTs) were obtained after quality editing and assembly of EST sequences. Functional classification of UTs according to the Gene Ontology convention showed that 73.8% were related to genes of unknown function. Among highly expressed ESTs, a significant proportion encoded proteins related to rubber biosynthesis and stress or defence responses. Sequences encoding rubber particle membrane proteins (RPMPs) belonging to three protein families accounted for 12% of the ESTs. Characterization of these ESTs revealed nine RPMP variants (7.9-27 kDa) including the 14 kDa REF (rubber elongation factor) and 22 kDa SRPP (small rubber particle protein). The expression of multiple RPMP isoforms in latex was shown using antibodies against REF and SRPP. Both EST and quantitative reverse transcription-PCR (QRT-PCR) analyses demonstrated REF and SRPP to be the most abundant transcripts in latex. Besides rubber biosynthesis, comparative sequence analysis showed that the RPMPs are highly similar to sequences in the plant kingdom having stress-related functions. Implications of the RPMP function in cis-polyisoprene biosynthesis in the context of transcript abundance and differential gene expression are discussed.
Identification of photoactivated adenylyl cyclases in Naegleria australiensis and BLUF-containing protein in Naegleria fowleri.

PubMed

Yasukawa, Hiro; Sato, Aya; Kita, Ayaka; Kodaira, Ken-Ichi; Iseki, Mineo; Takahashi, Tetsuo; Shibusawa, Mami; Watanabe, Masakatsu; Yagita, Kenji

2013-01-01

Complete genome sequencing of Naegleria gruberi has revealed that the organism encodes polypeptides similar to photoactivated adenylyl cyclases (PACs). Screening in the N. australiensis genome showed that the organism also encodes polypeptides similar to PACs. Each of the Naegleria proteins consists of a "sensors of blue-light using FAD" domain (BLUF domain) and an adenylyl cyclase domain (AC domain). PAC activity of the Naegleria proteins was assayed by comparing sensitivities of Escherichia coli cells heterologously expressing the proteins to antibiotics in a dark condition and a blue light-irradiated condition. Antibiotics used in the assays were fosfomycin and fosmidomycin. E. coli cells expressing the Naegleria proteins showed increased fosfomycin sensitivity and fosmidomycin sensitivity when incubated under blue light, indicating that the proteins functioned as PACs in the bacterial cells. Analysis of the N. fowleri genome revealed that the organism encodes a protein bearing an amino acid sequence similar to that of BLUF. A plasmid expressing a chimeric protein consisting of the BLUF-like sequence found in N. fowleri and the adenylyl cyclase domain of N. gruberi PAC was constructed to determine whether the BLUF-like sequence functioned as a sensor of blue light. E. coli cells expressing a chimeric protein showed increased fosfomycin sensitivity and fosmidomycin sensitivity when incubated under blue light. These experimental results indicated that the sequence similar to the BLUF domain found in N. fowleri functioned as a sensor of blue light.
Characterization of a novel ADAM protease expressed by Pneumocystis carinii.

PubMed

Kennedy, Cassie C; Kottom, Theodore J; Limper, Andrew H

2009-08-01

Pneumocystis species are opportunistic fungal pathogens that cause severe pneumonia in immunocompromised hosts. Recent evidence has suggested that unidentified proteases are involved in Pneumocystis life cycle regulation. Proteolytically active ADAM (named for "a disintegrin and metalloprotease") family molecules have been identified in some fungal organisms, such as Aspergillus fumigatus and Schizosaccharomyces pombe, and some have been shown to participate in life cycle regulation. Accordingly, we sought to characterize ADAM-like molecules in the fungal opportunistic pathogen, Pneumocystis carinii (PcADAM). After an in silico search of the P. carinii genomic sequencing project identified a 329-bp partial sequence with homology to known ADAM proteins, the full-length PcADAM sequence was obtained by PCR extension cloning, yielding a final coding sequence of 1,650 bp. Sequence analysis detected the presence of a typical ADAM catalytic active site (HEXXHXXGXXHD). Expression of PcADAM over the Pneumocystis life cycle was analyzed by Northern blot. Southern and contour-clamped homogenous electronic field blot analysis demonstrated its presence in the P. carinii genome. Expression of PcADAM was observed to be increased in Pneumocystis cysts compared to trophic forms. The full-length gene was subsequently cloned and heterologously expressed in Saccharomyces cerevisiae. Purified PcADAMp protein was proteolytically active in casein zymography, requiring divalent zinc. Furthermore, native PcADAMp extracted directly from freshly isolated Pneumocystis organisms also exhibited protease activity. This is the first report of protease activity attributable to a specific, characterized protein in the clinically important opportunistic fungal pathogen Pneumocystis.
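The catalytic-site consensus quoted above (HEXXHXXGXXHD, where X stands for any residue) can be searched for with a regular expression. This is a minimal sketch of such a scan, not the sequence analysis the authors ran; the toy sequence is hypothetical and is not the PcADAM protein.

```python
import re

# Consensus quoted in the abstract; X = any amino acid.
ADAM_MOTIF = "HEXXHXXGXXHD"


def motif_to_regex(motif: str) -> re.Pattern:
    """Turn a consensus string with X wildcards into a compiled regex."""
    return re.compile(motif.replace("X", "[A-Z]"))


def find_motif(protein: str, motif: str = ADAM_MOTIF) -> list[tuple[int, str]]:
    """Return (1-based start position, matched residues) for each hit."""
    pattern = motif_to_regex(motif)
    return [(m.start() + 1, m.group()) for m in pattern.finditer(protein.upper())]


# Hypothetical toy sequence used only to exercise the function.
toy = "MKTAHELGHNFGMEHDGSW"
print(find_motif(toy))
```

`finditer` reports non-overlapping hits only, which is sufficient for a 12-residue site that is expected once per protein.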
Genomic identification of regulatory elements by evolutionary sequence comparison and functional analysis.

PubMed

Loots, Gabriela G

2008-01-01

Despite remarkable recent advances in genomics that have enabled us to identify most of the genes in the human genome, comparable efforts to define transcriptional cis-regulatory elements that control gene expression are lagging behind. The difficulty of this task stems from two equally important problems: our knowledge of how regulatory elements are encoded in genomes remains elementary, and there is a vast genomic search space for regulatory elements, since most of mammalian genomes are noncoding. Comparative genomic approaches are having a remarkable impact on the study of transcriptional regulation in eukaryotes and currently represent the most efficient and reliable methods of predicting noncoding sequences likely to control the patterns of gene expression. By subjecting eukaryotic genomic sequences to computational comparisons and subsequent experimentation, we are inching our way toward a more comprehensive catalog of common regulatory motifs that lie behind fundamental biological processes. We are still far from comprehending how the transcriptional regulatory code is encrypted in the human genome and providing an initial global view of regulatory gene networks, but collectively, the continued development of comparative and experimental approaches will rapidly expand our knowledge of the transcriptional regulome.
Hybrid Sequencing of Full-Length cDNA Transcripts of Stems and Leaves in Dendrobium officinale

PubMed Central

He, Liu; Fu, Shuhua; Xu, Zhichao; Yan, Jun; Xu, Jiang; Zhou, Hong; Zhou, Jianguo; Chen, Xinlian; Li, Ying; Au, Kin Fai; Yao, Hui

2017-01-01

Dendrobium officinale is an extremely valuable orchid used in traditional Chinese medicine, so sought after that it has a higher market value than gold. Although the expression profiles of some genes involved in the polysaccharide synthesis have previously been investigated, little research has been carried out on their alternatively spliced isoforms in D. officinale. In addition, information regarding the translocation of sugars from leaves to stems in D. officinale also remains limited. We analyzed the polysaccharide content of D. officinale leaves and stems, and completed in-depth transcriptome sequencing of these two diverse tissue types using second-generation sequencing (SGS) and single-molecule real-time (SMRT) sequencing technology. The results of this study yielded a digital inventory of gene and mRNA isoform expressions. A comparative analysis of both transcriptomes uncovered a total of 1414 differentially expressed genes, including 844 that were up-regulated and 570 that were down-regulated in stems. Of these genes, one sugars will eventually be exported transporter (SWEET) and one sucrose transporter (SUT) are expressed to a greater extent in D. officinale stems than in leaves. Two glycosyltransferase (GT) and four cellulose synthase (Ces) genes undergo a distinct degree of alternative splicing. In the stems, the content of polysaccharides is twice as much as that in the leaves. The differentially expressed GT and transcription factor (TF) genes will be the focus of further study. The genes DoSWEET4 and DoSUT1 are significantly expressed in the stem, and are likely to be involved in sugar loading in the phloem. PMID:28981454
Bovine mammary gene expression profiling during the onset of lactation.

PubMed

Gao, Yuanyuan; Lin, Xueyan; Shi, Kerong; Yan, Zhengui; Wang, Zhonghua

2013-01-01

Lactogenesis includes two stages. Stage I begins a few weeks before parturition. Stage II is initiated around the time of parturition and extends for several days afterwards. To better understand the molecular events underlying these changes, genome-wide gene expression profiling was conducted using digital gene expression (DGE) on bovine mammary tissue at three time points: approximately day 35 before parturition (-35 d), day 7 before parturition (-7 d) and day 3 after parturition (+3 d). Approximately 6.2, 5.8 and 6.1 million 21-nt cDNA tags were sequenced in the three cDNA libraries (-35 d, -7 d and +3 d), respectively. After aligning to the reference sequences, the three cDNA libraries included 8,662, 8,363 and 8,359 genes, respectively. With a fold change cutoff criterion of ≥ 2 or ≤ -2 and a false discovery rate (FDR) of ≤ 0.001, a total of 812 genes were significantly differentially expressed at -7 d compared with -35 d (stage I). Gene ontology analysis showed that those significantly differentially expressed genes were mainly associated with cell cycle, lipid metabolism, immune response and biological adhesion. A total of 1,189 genes were significantly differentially expressed at +3 d compared with -7 d (stage II), and these genes were mainly associated with the immune response and cell cycle. Moreover, there were 1,672 genes significantly differentially expressed at +3 d compared with -35 d. Gene ontology analysis showed that the main differentially expressed genes were those associated with metabolic processes. The results suggest that the mammary gland begins to lactate not only by a gain of function but also by a broad suppression of function to effectively push most of the cell's resources towards lactation.
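The selection rule stated above (fold change of at least 2 in either direction, FDR of at most 0.001) can be written as a simple filter. The sketch below assumes a signed fold-change convention in which a halving is reported as -2; the abstract does not define its convention, and the gene names and values are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class GeneResult:
    gene: str
    fold_change: float  # signed: +2 means doubled, -2 means halved (assumed convention)
    fdr: float


def signed_fold_change(treated: float, control: float) -> float:
    """Ratio treated/control, reported as -1/ratio when the ratio is below 1."""
    ratio = treated / control
    return ratio if ratio >= 1 else -1.0 / ratio


def is_significant(r: GeneResult, min_abs_fc: float = 2.0, max_fdr: float = 0.001) -> bool:
    """Apply the cutoffs quoted in the abstract."""
    return abs(r.fold_change) >= min_abs_fc and r.fdr <= max_fdr


# Hypothetical records, for illustration only.
results = [
    GeneResult("geneA", signed_fold_change(40, 10), 0.0001),   # up 4-fold, passes
    GeneResult("geneB", signed_fold_change(10, 40), 0.0005),   # down 4-fold, passes
    GeneResult("geneC", signed_fold_change(15, 10), 0.00001),  # 1.5-fold, fails on fold change
    GeneResult("geneD", signed_fold_change(30, 10), 0.01),     # 3-fold, fails on FDR
]
selected = [r.gene for r in results if is_significant(r)]
print(selected)
```

Both conditions must hold at once: a large fold change with a weak FDR is excluded, as is a highly significant but small change.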
Quantification of differential gene expression by multiplexed targeted resequencing of cDNA

PubMed Central

Arts, Peer; van der Raadt, Jori; van Gestel, Sebastianus H.C.; Steehouwer, Marloes; Shendure, Jay; Hoischen, Alexander; Albers, Cornelis A.

2017-01-01

Whole-transcriptome or RNA sequencing (RNA-Seq) is a powerful and versatile tool for functional analysis of different types of RNA molecules, but sample reagent and sequencing cost can be prohibitive for hypothesis-driven studies where the aim is to quantify differential expression of a limited number of genes. Here we present an approach for quantification of differential mRNA expression by targeted resequencing of complementary DNA using single-molecule molecular inversion probes (cDNA-smMIPs) that enable highly multiplexed resequencing of cDNA target regions of ∼100 nucleotides and counting of individual molecules. We show that accurate estimates of differential expression can be obtained from molecule counts for hundreds of smMIPs per reaction and that smMIPs are also suitable for quantification of relative gene expression and allele-specific expression. Compared with low-coverage RNA-Seq and a hybridization-based targeted RNA-Seq method, cDNA-smMIPs are a cost-effective high-throughput tool for hypothesis-driven expression analysis in large numbers of genes (10 to 500) and samples (hundreds to thousands). PMID:28474677
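Counting individual molecules, as described above, amounts to collapsing reads that derive from the same captured molecule before comparing samples. The sketch below assumes each read has already been assigned to a probe and carries a random molecular tag; the abstract does not describe the data layout, so the tuple format and probe names are hypothetical.

```python
from collections import defaultdict
from typing import Iterable


def count_molecules(reads: Iterable[tuple[str, str]]) -> dict[str, int]:
    """Count distinct molecular tags per probe.

    reads: (probe_id, molecular_tag) pairs. Reads sharing both values are
    treated as amplification duplicates of one original molecule.
    """
    tags_by_probe: dict[str, set[str]] = defaultdict(set)
    for probe_id, tag in reads:
        tags_by_probe[probe_id].add(tag)
    return {probe: len(tags) for probe, tags in tags_by_probe.items()}


# Hypothetical reads: two duplicates of one molecule plus two other molecules.
reads = [
    ("GENE_A_p1", "ACGT"),
    ("GENE_A_p1", "ACGT"),
    ("GENE_A_p1", "TTGA"),
    ("GENE_B_p1", "CCCC"),
]
print(count_molecules(reads))
```

Differential expression would then be estimated from these molecule counts rather than from raw read counts, so uneven amplification does not inflate a target.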
Selection of differently temporally regulated African swine fever virus promoters with variable expression activities and their application for transient and recombinant virus mediated gene expression.

PubMed

Portugal, Raquel S; Bauer, Anja; Keil, Guenther M

2017-08-01

African swine fever virus (ASFV) threatens pig production worldwide due to the lack of vaccines, for which generation of both deletion and insertion mutants is considered. For development of the latter, operational ASFV promoters of different temporal regulation and strengths are desirable. We therefore compared the capacities of putative promoter sequences from p72, CD2v, p30, viral DNA polymerase and U104L genes to mediate expression of luciferase from transfected plasmids after activation in trans, or p30-, DNA polymerase- and U104L promoters in cis, using respective ASFV recombinants. We identified sequences with promoter activities upstream of the viral ORFs, and showed that they differ both in expression intensity and in temporal regulation. In summary, the p30 and DNA polymerase promoters are recommended for high-level, early-regulated transgene expression. For late expression, the p72, CD2v and U104L promoters are suitable, the last of these only if low-level transgene expression is intended. Copyright © 2017 Elsevier Inc. All rights reserved.
Usefulness of heterologous promoters in the Pseudozyma flocculosa gene expression system.

PubMed

Avis, Tyler J; Anguenot, Raphaël; Neveu, Bertrand; Bolduc, Sébastien; Zhao, Yingyi; Cheng, Yali; Labbé, Caroline; Belzile, François; Bélanger, Richard R

2008-02-01

The basidiomycetous fungus Pseudozyma flocculosa represents a promising new host for the expression of complex recombinant proteins. Two novel heterologous promoter sequences, the Ustilago maydis glyceraldehyde-3-phosphate dehydrogenase (GPD) and Pseudozyma tsukubaensis alpha-glucosidase promoters, were tested for their ability to provide expression in P. flocculosa. In liquid medium, these two promoters produced lower levels of intracellular green fluorescent protein (GFP) as compared to the U. maydis hsp70 promoter. However, GPD and alpha-glucosidase sequences behaved as constitutive promoters whereas the hsp70 promoter appeared to be morphology-dependent. When using the hsp70 promoter, the expression of GFP increased proportionally to the concentration of hygromycin in the culture medium, indicating possible induction of the promoter by the antibiotic. Optimal solid-state culture conditions were designed for high throughput screening of hygromycin-resistant transformants with the hsp70 promoter in P. flocculosa.
Transcriptome Profiling of Chironomus kiinensis under Phenol Stress Using Solexa Sequencing Technology

PubMed Central

Cao, Chuanwang; Wang, Zhiying; Niu, Changying; Desneux, Nicolas; Gao, Xiwu

2013-01-01

Phenol is a major pollutant in aquatic ecosystems due to its chemical stability, water solubility and environmental mobility. To date, little is known about the molecular modifications of invertebrates under phenol stress. In the present study, we used Solexa sequencing technology to investigate the transcriptome and differentially expressed genes (DEGs) of midges (Chironomus kiinensis) in response to phenol stress. A total of 51,518,972 and 51,150,832 clean reads in the phenol-treated and control libraries, respectively, were obtained and assembled into 51,014 non-redundant (Nr) consensus sequences. A total of 6,032 unigenes were classified by Gene Ontology (GO), and 18,366 unigenes were categorized into 238 Kyoto Encyclopedia of Genes and Genomes (KEGG) categories. These genes included representatives from almost all functional categories. A total of 10,724 differentially expressed genes (P value <0.05) were detected in a comparative analysis of the expression profiles between phenol-treated and control C. kiinensis including 8,390 upregulated and 2,334 downregulated genes. The expression levels of 20 differentially expressed genes were confirmed by real-time RT-PCR, and the trends in gene expression that were observed matched the Solexa expression profiles, although the magnitude of the variations was different. Through pathway enrichment analysis, significantly enriched pathways were identified for the DEGs, including metabolic pathways, aryl hydrocarbon receptor (AhR), pancreatic secretion and neuroactive ligand-receptor interaction pathways, which may be associated with the phenol responses of C. kiinensis. Using Solexa sequencing technology, we identified several groups of key candidate genes as well as important biological pathways involved in the molecular modifications of chironomids under phenol stress. PMID:23527048
Bacterial expression of self-assembling peptide hydrogelators

NASA Astrophysics Data System (ADS)

Sonmez, Cem

For tissue regeneration and drug delivery applications, various architectures are explored to serve as biomaterial tools. Via de novo design, functional peptide hydrogel materials have been developed as scaffolds for biomedical applications. The objective of this study is to investigate bacterial expression as an alternative method to chemical synthesis for the recombinant production of self-assembling peptides that can form rigid hydrogels under physiological conditions. The Schneider and Pochan Labs have designed and characterized a 20 amino acid beta-hairpin forming amphiphilic peptide containing a D-residue in its turn region (MAX1). As a result, this peptide must be prepared chemically. Peptide engineering, using the sequence of MAX1 as a template, afforded a small family of peptides for expression (EX peptides) that have different turn sequences consisting of natural amino acids and are amenable to bacterial expression. Each sequence was initially chemically synthesized to quickly assess the material properties of its corresponding gel. One model peptide, EX1, was chosen to start the bacterial expression studies. DNA constructs facilitating the expression of EX1 were designed such that the peptide could be expressed with different fusion partners and subsequently cleaved by enzymatic or chemical means to afford the free peptide. Optimization studies were performed to increase the yield of pure peptide, which ultimately allowed 50 mg of pure peptide to be harvested from one liter of culture, providing an alternate means to produce this hydrogel-forming peptide. Recombinant production of other self-assembling hairpins with different turn sequences was also successful using this optimized protocol. The studies demonstrate that new beta-hairpin self-assembling peptides that are amenable to bacterial production and form rigid hydrogels at physiological conditions can be designed and produced by fermentation in good yield at significantly reduced cost when compared to chemical synthesis.
Molecular cloning and sequence analysis of two carbonic anhydrase in the swimming crab Portunus trituberculatus and its expression in response to salinity and pH stress.

PubMed

Pan, Luqing; Hu, Dongxu; Liu, Maoqi; Hu, Yanyan; Liu, Shengnan

2016-01-15

Carbonic anhydrase (CA) is involved in ion transport, acid-base balance and pH regulation by catalyzing the interconversion of CO2 and HCO3(-). In this study, full-length cDNA sequences of two CA isoforms were identified from Portunus trituberculatus. One was Portunus trituberculatus cytoplasmic carbonic anhydrase (PtCAc) and the other was Portunus trituberculatus glycosyl-phosphatidylinositol-linked carbonic anhydrase (PtCAg). The PtCAc sequence contained an ORF of 816 bp, encoding a protein of 30.18 kDa. The PtCAg sequence contained an ORF of 927 bp, encoding a protein of 34.09 kDa. The deduced amino acid sequences of the two CA isoforms were compared to other crustacean CA sequences. Both showed high conservation of the residues and domains essential to the function of the two enzymes. Expression of PtCAc and PtCAg was detected in gill, muscle, hepatopancreas, hemocytes and gonad. PtCAc and PtCAg gene expression was studied under salinity and pH challenge. The results showed that when salinity decreased (30 to 20 ppt), the mRNA expression of PtCAc increased significantly at 24 and 48 h, and the highest value appeared at 24 h. The mRNA expression of PtCAg followed the same pattern as that of PtCAc. However, when salinity increased (30 to 35 ppt), only the mRNA expression of PtCAc increased significantly at 48 h. When pH changed, the mRNA expression of PtCAc increased significantly only at 12 h, and only under low pH. The mRNA expression of PtCAg increased significantly at 12-48 h, and at the other experimental times there was no significant difference in expression between the pH-challenged group and the control group. The results provide a basis for understanding CA function and the mechanism underlying its response to environmental changes in crustaceans. Copyright © 2015 Elsevier B.V. All rights reserved.
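The reported ORF lengths and protein masses can be sanity-checked with a rule-of-thumb estimate. The sketch below assumes each ORF length includes the stop codon and uses an average residue mass of about 110 Da; both are assumptions of this sketch rather than values from the abstract, and the real mass depends on the actual amino acid composition.

```python
AVG_RESIDUE_MASS_DA = 110.0  # rule-of-thumb average; an assumption of this sketch


def orf_to_residues(orf_bp: int) -> int:
    """Number of encoded residues, assuming the ORF length includes the stop codon."""
    if orf_bp % 3 != 0:
        raise ValueError("ORF length must be a multiple of 3")
    return orf_bp // 3 - 1


def approx_mass_kda(orf_bp: int) -> float:
    """Approximate protein mass in kDa from ORF length."""
    return orf_to_residues(orf_bp) * AVG_RESIDUE_MASS_DA / 1000.0


for name, orf_bp, reported_kda in [("PtCAc", 816, 30.18), ("PtCAg", 927, 34.09)]:
    print(name, orf_to_residues(orf_bp), round(approx_mass_kda(orf_bp), 2), reported_kda)
```

Under these assumptions the estimates are 271 residues and about 29.8 kDa for PtCAc, and 308 residues and about 33.9 kDa for PtCAg, both within roughly 1-2% of the reported masses.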
Characterization of the Structural Gene Promoter of Aedes aegypti Densovirus

PubMed Central

Ward, Todd W.; Kimmick, Michael W.; Afanasiev, Boris N.; Carlson, Jonathan O.

2001-01-01

Aedes aegypti densonucleosis virus (AeDNV) has two promoters that have been shown to be active by reporter gene expression analysis (B. N. Afanasiev, Y. V. Koslov, J. O. Carlson, and B. J. Beaty, Exp. Parasitol. 79:322–339, 1994). Northern blot analysis of cells infected with AeDNV revealed two transcripts 1,200 and 3,500 nucleotides in length that are assumed to express the structural protein (VP) gene and nonstructural protein genes, respectively. Primer extension was used to map the transcriptional start site of the structural protein gene. Surprisingly, the structural protein gene transcript began at an initiator consensus sequence, CAGT, 60 nucleotides upstream from the map unit 61 TATAA sequence previously thought to define the promoter. Constructs with the β-galactosidase gene fused to the structural protein gene were used to determine elements necessary for promoter function. Deletion or mutation of the initiator sequence, CAGT, reduced protein expression by 93%, whereas mutation of the TATAA sequence at map unit 61 had little effect. An additional open reading frame was observed upstream of the structural protein gene that can express β-galactosidase at a low level (20% of that of VP fusions). Expression of the AeDNV structural protein gene was shown to be stimulated by the major nonstructural protein NS1 (Afanasiev et al., Exp. parasitol., 1994). To determine the sequences required for transactivation, expression of structural protein gene–β-galactosidase gene fusion constructs differing in AeDNV genome content was measured with and without NS1. The presence of NS1 led to an 8- to 10-fold increase in expression when either genomic end was present, compared to a 2-fold increase with a construct lacking the genomic ends. An even higher (37-fold) increase in expression occurred with both genomic ends present; however, this was in part due to template replication as shown by Southern blot analysis. 
These data indicate the location and importance of various elements necessary for efficient protein expression and transactivation from the structural protein gene promoter of AeDNV. PMID:11152505
Genome‑wide identification of long noncoding RNAs in CCl4‑induced liver fibrosis via RNA sequencing.

PubMed

Gong, Zhenghua; Tang, Jialin; Xiang, Tianxin; Lin, Jiayu; Deng, Chaowen; Peng, Yanzhong; Zheng, Jie; Hu, Guoxin

2018-05-07

Liver fibrosis occurs as a result of chronic liver lesions, which may subsequently develop into liver cirrhosis and hepatocellular carcinoma. The involvement of long noncoding RNAs (lncRNAs) in liver fibrosis is being increasingly recognized. However, the exact mechanisms and functions of the majority of lncRNAs are poorly characterized. In the present study, the hepatotoxic substance carbon tetrachloride (CCl4) was employed to induce liver fibrosis in an animal model and a genome‑wide identification of lncRNAs in fibrotic liver tissues compared with CCl4 untreated liver tissues was performed using RNA sequencing. Sprague‑Dawley rats were treated with CCl4 for 8 weeks. Histopathological alterations were observed in liver tissues, and serum levels of alanine aminotransferase, aspartate aminotransferase, transforming growth factor‑β1 and tumor necrosis factor‑α were significantly higher in the CCl4‑treated group compared with the CCl4 untreated group. RNA sequencing of liver tissues demonstrated that 231 lncRNAs and 1,036 mRNAs were differentially expressed between the two groups. Furthermore, bioinformatics analysis demonstrated that the differentially expressed mRNAs were predominantly enriched in 'ECM‑receptor interaction', 'PI3K‑Akt signaling pathway' and 'focal adhesion' pathways, all of which are essential for liver fibrosis development. Validation of 12 significantly aberrant lncRNAs by reverse transcription‑quantitative polymerase chain reaction indicated that the expression patterns of 11 lncRNAs were consistent with the sequencing data. Furthermore, overexpression of lncRNA NR_002155.1, which was markedly downregulated in CCl4‑treated liver tissues, was demonstrated to inhibit HSC‑T6 cell proliferation in vitro. In conclusion, the present study determined the expression patterns of mRNAs and lncRNAs in fibrotic liver tissue induced by CCl4. The identified differentially expressed lncRNAs may serve as novel diagnostic biomarkers and therapeutic targets for liver fibrosis.
Young infants' generalization of emotional expressions: effects of familiarity.

PubMed

Walker-Andrews, Arlene S; Krogh-Jespersen, Sheila; Mayhew, Estelle M Y; Coffield, Caroline N

2011-08-01

From birth, infants are exposed to a wealth of emotional information in their interactions. Much research has been done to investigate the development of emotion perception, and factors influencing that development. The current study investigates the role of familiarity on 3.5-month-old infants' generalization of emotional expressions. Infants were assigned to one of two habituation sequences: in one sequence, infants were visually habituated to parental expressions of happy or sad. At test, infants viewed either a continuation of the habituation sequence, their mother depicting a novel expression, an unfamiliar female depicting the habituated expression, or an unfamiliar female depicting a novel expression. In the second sequence, a new sample of infants was matched to the infants in the first sequence. These infants viewed the same habituation and test sequences, but the actors were unfamiliar to them. Only those infants who viewed their own mothers and fathers during the habituation sequence increased looking. They dishabituated looking to maternal novel expressions, the unfamiliar female's novel expression, and the unfamiliar female depicting the habituated expression, especially when sad parental expressions were followed by an expression change to happy or to a change in person. Infants are guided in their recognition of emotional expressions by the familiarity of their parents, before generalizing to others. 2011 APA, all rights reserved
Sequence diversity and differential expression of major phenylpropanoid-flavonoid biosynthetic genes among three mango varieties.

PubMed

Hoang, Van L T; Innes, David J; Shaw, P Nicholas; Monteith, Gregory R; Gidley, Michael J; Dietzgen, Ralf G

2015-07-30

Mango fruits contain a broad spectrum of phenolic compounds which impart potential health benefits; their biosynthesis is catalysed by enzymes in the phenylpropanoid-flavonoid (PF) pathway. The aim of this study was to reveal the variability in genes involved in the PF pathway in three different mango varieties Mangifera indica L., a member of the family Anacardiaceae: Kensington Pride (KP), Irwin (IW) and Nam Doc Mai (NDM) and to determine associations with gene expression and mango flavonoid profiles. A close evolutionary relationship between mango genes and those from the woody species poplar of the Salicaceae family (Populus trichocarpa) and grape of the Vitaceae family (Vitis vinifera), was revealed through phylogenetic analysis of PF pathway genes. We discovered 145 SNPs in total within coding sequences with an average frequency of one SNP every 316 bp. Variety IW had the highest SNP frequency (one SNP every 258 bp) while KP and NDM had similar frequencies (one SNP every 369 bp and 360 bp, respectively). The position in the PF pathway appeared to influence the extent of genetic diversity of the encoded enzymes. The entry point enzymes phenylalanine lyase (PAL), cinnamate 4-mono-oxygenase (C4H) and chalcone synthase (CHS) had low levels of SNP diversity in their coding sequences, whereas anthocyanidin reductase (ANR) showed the highest SNP frequency followed by flavonoid 3'-hydroxylase (F3'H). Quantitative PCR revealed characteristic patterns of gene expression that differed between mango peel and flesh, and between varieties. The combination of mango expressed sequence tags and availability of well-established reference PF biosynthetic genes from other plant species allowed the identification of coding sequences of genes that may lead to the formation of important flavonoid compounds in mango fruits and facilitated characterisation of single nucleotide polymorphisms between varieties. 
We discovered an association between the extent of sequence variation and position in the pathway for up-stream genes. The high expression of PAL, C4H and CHS genes in mango peel compared to flesh is associated with high amounts of total phenolic contents in peels, which suggest that these genes have an influence on total flavonoid levels in mango fruit peel and flesh. In addition, the particularly high expression levels of ANR in KP and NDM peels compared to IW peel and the significant accumulation of its product epicatechin gallate (ECG) in those extracts reflects the rate-limiting role of ANR on ECG biosynthesis in mango.
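The SNP frequencies quoted above are simple ratios of sequence length to SNP count. As a small worked check, the overall figures (145 SNPs, one SNP every 316 bp) imply roughly 45.8 kb of coding sequence surveyed; that total is derived here and is not stated in the abstract.

```python
def bp_per_snp(total_bp: float, n_snps: int) -> float:
    """Average spacing between SNPs, in bp."""
    return total_bp / n_snps


def implied_length_bp(n_snps: int, spacing_bp: float) -> float:
    """Total sequence length implied by a SNP count and an average spacing."""
    return n_snps * spacing_bp


def snps_per_kb(spacing_bp: float) -> float:
    """Convert 'one SNP every N bp' into SNPs per kilobase."""
    return 1000.0 / spacing_bp


total = implied_length_bp(145, 316)  # derived, not reported in the abstract
print(total, round(snps_per_kb(316), 2), round(snps_per_kb(258), 2))
```

Expressed per kilobase, the spacing of 258 bp reported for IW corresponds to about 3.9 SNPs per kb, compared with about 3.2 per kb overall.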
Response of heat shock protein genes of the oriental fruit moth under diapause and thermal stress reveals multiple patterns dependent on the nature of stress exposure.

PubMed

Zhang, Bo; Peng, Yu; Zheng, Jincheng; Liang, Lina; Hoffmann, Ary A; Ma, Chun-Sen

2016-07-01

Heat shock protein gene (Hsp) families are thought to be important in thermal adaptation, but their expression patterns under various thermal stresses have still been poorly characterized outside of model systems. We have therefore characterized Hsp genes and their stress responses in the oriental fruit moth (OFM), Grapholita molesta, a widespread global orchard pest, and compared patterns of expression in this species to that of other insects. Genes from four Hsp families showed variable expression levels among tissues and developmental stages. Members of the Hsp40, 70, and 90 families were highly expressed under short exposures to heat and cold. Expression of Hsp40, 70, and Hsc70 family members increased in OFM undergoing diapause, while Hsp90 was downregulated. We found that there was strong sequence conservation of members of large Hsp families (Hsp40, Hsp60, Hsp70, Hsc70) across taxa, but this was not always matched by conservation of expression patterns. When the large Hsps as well as small Hsps from OFM were compared under acute and ramping heat stress, two groups of sHsps expression patterns were apparent, depending on whether expression increased or decreased immediately after stress exposure. These results highlight potential differences in conservation of function as opposed to sequence in this gene family and also point to Hsp genes potentially useful as bioindicators of diapause and thermal stress in OFM.
Bacillus anthracis genome organization in light of whole transcriptome sequencing

DOE Office of Scientific and Technical Information (OSTI.GOV)

Martin, Jeffrey; Zhu, Wenhan; Passalacqua, Karla D.

2010-03-22

Emerging knowledge of whole prokaryotic transcriptomes could validate a number of theoretical concepts introduced in the early days of genomics. What are the rules connecting gene expression levels with sequence determinants such as quantitative scores of promoters and terminators? Are translation efficiency measures, e.g. codon adaptation index and RBS score, related to gene expression? We used the whole transcriptome shotgun sequencing of a bacterial pathogen Bacillus anthracis to assess correlation of gene expression level with promoter, terminator and RBS scores, codon adaptation index, as well as with a new measure of gene translational efficiency, average translation speed. We compared computational predictions of operon topologies with the transcript borders inferred from RNA-Seq reads. Transcriptome mapping may also improve existing gene annotation. Upon assessment of accuracy of current annotation of protein-coding genes in the B. anthracis genome we have shown that the transcriptome data indicate existence of more than a hundred genes missing in the annotation though predicted by an ab initio gene finder. Interestingly, we observed that many pseudogenes possess not only a sequence with detectable coding potential but also promoters that maintain transcriptional activity.
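
The codon adaptation index named in this abstract is conventionally defined as the geometric mean of per-codon relative adaptiveness weights. The Python sketch below illustrates that standard definition only; it is not the authors' code. The two-amino-acid synonymous table, the reference codon counts and both function names are hypothetical, and skipping codons with no weight is a simplifying assumption.

```python
import math

# Hypothetical synonymous codon families (two amino acids only, for illustration).
SYNONYMOUS = {
    "K": ["AAA", "AAG"],
    "E": ["GAA", "GAG"],
}


def relative_adaptiveness(reference_counts):
    """Weight each codon by its count relative to its most used synonym."""
    weights = {}
    for codons in SYNONYMOUS.values():
        top = max(reference_counts.get(c, 0) for c in codons)
        if top == 0:
            continue  # family absent from the reference set
        for c in codons:
            weights[c] = reference_counts.get(c, 0) / top
    return weights


def cai(sequence, weights):
    """Geometric mean of the weights of the codons in `sequence`.

    Codons without a positive weight are skipped (an assumption of this sketch).
    """
    codons = [sequence[i:i + 3] for i in range(0, len(sequence) - 2, 3)]
    logs = [math.log(weights[c]) for c in codons if weights.get(c, 0) > 0]
    return math.exp(sum(logs) / len(logs)) if logs else 0.0


# Made-up codon counts from a hypothetical set of highly expressed genes.
reference = {"AAA": 80, "AAG": 20, "GAA": 50, "GAG": 50}
w = relative_adaptiveness(reference)
score_preferred = cai("AAAGAA", w)  # uses only the most frequent codons
score_rare = cai("AAGGAA", w)       # uses one rarely used lysine codon
```

A per-gene score of this kind could then be correlated with expression level, which is the type of comparison the abstract describes.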

Identification of the full-length β-actin sequence and expression profiles in the tree shrew (Tupaia belangeri).

PubMed

Zheng, Yu; Yun, Chenxia; Wang, Qihui; Smith, Wanli W; Leng, Jing

2015-02-01

The tree shrew (Tupaia belangeri) diverges from the primate order (Primates) and is classified as a separate taxonomic group of mammals - Scandentia. It has been suggested that the tree shrew can be used as an animal model for studying human diseases; however, the genomic sequence of the tree shrew is largely unidentified. In the present study, we reported the full-length cDNA sequence of the housekeeping gene, β-actin, in the tree shrew. The amino acid sequence of β-actin in the tree shrew was compared to that of humans and other species; a simple phylogenetic relationship was discovered. Quantitative polymerase chain reaction (qPCR) and western blot analysis further demonstrated that the expression profiles of β-actin, as a general conservative housekeeping gene, in the tree shrew were similar to those in humans, although the expression levels varied among different types of tissue in the tree shrew. Our data provide evidence that the tree shrew has a close phylogenetic association with humans. These findings further enhance the potential that the tree shrew, as a species, may be used as an animal model for studying human disorders.
Genomic deletion of a long-range bone enhancer misregulates sclerostin in Van Buchem disease

DOE Office of Scientific and Technical Information (OSTI.GOV)

Loots, Gabriela G.; Kneissel, Michaela; Keller, Hansjoerg

2005-04-15

Mutations in distant regulatory elements can negatively impact human development and health, yet due to the difficulty of detecting these critical sequences we predominantly focus on coding sequences for diagnostic purposes. We have undertaken a comparative sequence-based approach to characterize a large noncoding region deleted in patients affected by Van Buchem disease (VB), a severe sclerosing bone dysplasia. Using BAC recombination and transgenesis we characterized the expression of human sclerostin (sost) from normal (hSOSTwt) or Van Buchem (hSOSTvbΔ) alleles. Only the hSOSTwt allele faithfully expressed high levels of human sost in the adult bone and impacted bone metabolism, consistent with the model that the VB noncoding deletion removes a sost specific regulatory element. By exploiting cross-species sequence comparisons with in vitro and in vivo enhancer assays we were able to identify a candidate enhancer element that drives human sost expression in osteoblast-like cell lines in vitro and in the skeletal anlage of the E14.5 mouse embryo, and discovered a novel function for sclerostin during limb development. Our approach represents a framework for characterizing distant regulatory elements associated with abnormal human phenotypes.
MicroRNA-200c Modulates the Expression of MUC4 and MUC16 by Directly Targeting Their Coding Sequences in Human Pancreatic Cancer

PubMed Central

Radhakrishnan, Prakash; Mohr, Ashley M.; Grandgenett, Paul M.; Steele, Maria M.; Batra, Surinder K.; Hollingsworth, Michael A.

2013-01-01

Transmembrane mucins, MUC4 and MUC16 are associated with tumor progression and metastatic potential in human pancreatic adenocarcinoma. We discovered that miR-200c interacts with specific sequences within the coding sequence of MUC4 and MUC16 mRNAs, and evaluated the regulatory nature of this association. Pancreatic cancer cell lines S2.028 and T3M-4 transfected with miR-200c showed a 4.18 and 8.50 fold down regulation of MUC4 mRNA, and 4.68 and 4.82 fold down regulation of MUC16 mRNA compared to mock-transfected cells, respectively. A significant reduction of glycoprotein expression was also observed. These results indicate that miR-200c overexpression regulates MUC4 and MUC16 mucins in pancreatic cancer cells by directly targeting the mRNA coding sequence of each, resulting in reduced levels of MUC4 and MUC16 mRNA and protein. These data suggest that, in addition to regulating proteins that modulate EMT, miR-200c influences expression of cell surface mucins in pancreatic cancer. PMID:24204560
Comparative transcriptome analysis of the Asteraceae halophyte Karelinia caspica under salt stress.

PubMed

Zhang, Xia; Liao, Maoseng; Chang, Dan; Zhang, Fuchun

2014-12-17

Much attention has been given to the potential of halophytes as sources of tolerance traits for introduction into cereals. However, a great deal remains unknown about the diverse mechanisms employed by halophytes to cope with salinity. To characterize salt tolerance mechanisms underlying Karelinia caspica, an Asteraceae halophyte, we performed large-scale transcriptomic analysis using a high-throughput Illumina sequencing platform. Comparative gene expression analysis was performed to correlate the effects of salt stress and ABA regulation at the molecular level. Total sequence reads generated by pyrosequencing were assembled into 287,185 non-redundant transcripts with an average length of 652 bp. Using the BLAST function in the Swiss-Prot, NCBI nr, GO, KEGG, and KOG databases, a total of 216,416 coding sequences associated with known proteins were annotated. Among these, 35,533 unigenes were classified into 69 gene ontology categories, and 18,378 unigenes were classified into 202 known pathways. Based on the fold changes observed when comparing the salt stress and control samples, 60,127 unigenes were differentially expressed, with 38,122 and 22,005 up- and down-regulated, respectively. Several of the differentially expressed genes are known to be involved in the signaling pathway of the plant hormone ABA, including ABA metabolism, transport, and sensing as well as the ABA signaling cascade. Transcriptome profiling of K. caspica contributes to a comprehensive understanding of K. caspica at the molecular level. Moreover, the global survey of differentially expressed genes in this species under salt stress and analyses of the effects of salt stress and ABA regulation will contribute to the identification and characterization of genes and molecular mechanisms underlying salt stress responses in Asteraceae plants.
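
The abstract reports unigenes split into up- and down-regulated sets by fold change (38,122 + 22,005 = 60,127). The following Python sketch shows one common way such a split can be made from expression values. The function name, the log2 threshold of 1.0 and the pseudocount are assumptions chosen for illustration; the abstract does not state the cutoff or statistical test the authors used.

```python
import math


def classify_unigenes(control, treated, min_abs_log2fc=1.0, pseudocount=1.0):
    """Label each unigene 'up', 'down' or 'unchanged' from its log2 fold change.

    `control` and `treated` map unigene IDs to expression values. The
    pseudocount (an assumption) avoids division by zero for unexpressed genes.
    """
    labels = {}
    for gene in control:
        ratio = (treated[gene] + pseudocount) / (control[gene] + pseudocount)
        log2fc = math.log2(ratio)
        if log2fc >= min_abs_log2fc:
            labels[gene] = "up"
        elif log2fc <= -min_abs_log2fc:
            labels[gene] = "down"
        else:
            labels[gene] = "unchanged"
    return labels


# Made-up expression values for three hypothetical unigenes.
control = {"unigene_1": 9, "unigene_2": 39, "unigene_3": 10}
salt = {"unigene_1": 39, "unigene_2": 9, "unigene_3": 11}
labels = classify_unigenes(control, salt)
```

In practice a fold-change cutoff is usually combined with a significance test across replicates; this sketch covers only the fold-change step.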
Computational identification of developmental enhancers: conservation and function of transcription factor binding-site clusters in Drosophila melanogaster and Drosophila pseudoobscura

DOE Office of Scientific and Technical Information (OSTI.GOV)

Berman, Benjamin P.; Pfeiffer, Barret D.; Laverty, Todd R.

2004-08-06

The identification of sequences that control transcription in metazoans is a major goal of genome analysis. In a previous study, we demonstrated that searching for clusters of predicted transcription factor binding sites could discover active regulatory sequences, and identified 37 regions of the Drosophila melanogaster genome with high densities of predicted binding sites for five transcription factors involved in anterior-posterior embryonic patterning. Nine of these clusters overlapped known enhancers. Here, we report the results of in vivo functional analysis of 27 remaining clusters. We generated transgenic flies carrying each cluster attached to a basal promoter and reporter gene, and assayed embryos for reporter gene expression. Six clusters are enhancers of adjacent genes: giant, fushi tarazu, odd-skipped, nubbin, squeeze and pdm2; three drive expression in patterns unrelated to those of neighboring genes; the remaining 18 do not appear to have enhancer activity. We used the Drosophila pseudoobscura genome to compare patterns of evolution in and around the 15 positive and 18 false-positive predictions. Although conservation of primary sequence cannot distinguish true from false positives, conservation of binding-site clustering accurately discriminates functional binding-site clusters from those with no function. We incorporated conservation of binding-site clustering into a new genome-wide enhancer screen, and predict several hundred new regulatory sequences, including 85 adjacent to genes with embryonic patterns. Measuring conservation of sequence features closely linked to function--such as binding-site clustering--makes better use of comparative sequence data than commonly used methods that examine only sequence identity.
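
Searching for regions with a high density of predicted binding sites, as described in this abstract, can be sketched as a sliding-window scan over site coordinates. The Python example below is an illustration only and not the authors' pipeline: the function name, the 500 bp window, the three-site minimum and the coordinates are all hypothetical, and the conservation step in D. pseudoobscura is not modelled.

```python
def find_site_clusters(site_positions, window=500, min_sites=3):
    """Return (start, end, n_sites) tuples for regions dense in predicted sites.

    A region qualifies when at least `min_sites` positions fall within fewer
    than `window` bp of each other; overlapping regions are merged.
    """
    positions = sorted(site_positions)
    clusters = []
    right = 0
    for left in range(len(positions)):
        right = max(right, left)
        # Extend the window while the next site still lies within `window` bp.
        while (right + 1 < len(positions)
               and positions[right + 1] - positions[left] < window):
            right += 1
        count = right - left + 1
        if count < min_sites:
            continue
        start, end = positions[left], positions[right]
        if clusters and start <= clusters[-1][1]:
            # Overlaps the previous cluster: merge and recount the sites.
            prev_start, prev_end, _ = clusters[-1]
            new_end = max(prev_end, end)
            n = sum(1 for p in positions if prev_start <= p <= new_end)
            clusters[-1] = (prev_start, new_end, n)
        else:
            clusters.append((start, end, count))
    return clusters


# Made-up coordinates of predicted binding sites on one sequence.
sites = [100, 200, 350, 5000, 9000, 9100, 9150, 9600]
clusters = find_site_clusters(sites)
```

To approximate the conservation test the abstract emphasizes, the same scan could be run on the aligned region of a second genome and the two cluster lists compared.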
The p40 Subunit of Interleukin (IL)-12 Promotes Stabilization and Export of the p35 Subunit

PubMed Central

Jalah, Rashmi; Rosati, Margherita; Ganneru, Brunda; Pilkington, Guy R.; Valentin, Antonio; Kulkarni, Viraj; Bergamaschi, Cristina; Chowdhury, Bhabadeb; Zhang, Gen-Mu; Beach, Rachel Kelly; Alicea, Candido; Broderick, Kate E.; Sardesai, Niranjan Y.; Pavlakis, George N.; Felber, Barbara K.

2013-01-01

IL-12 is a 70-kDa heterodimeric cytokine composed of the p35 and p40 subunits. To maximize cytokine production from plasmid DNA, molecular steps controlling IL-12p70 biosynthesis at the posttranscriptional and posttranslational levels were investigated. We show that the combination of RNA/codon-optimized gene sequences and fine-tuning of the relative expression levels of the two subunits within a cell resulted in increased production of the IL-12p70 heterodimer. We found that the p40 subunit plays a critical role in enhancing the stability, intracellular trafficking, and export of the p35 subunit. This posttranslational regulation mediated by the p40 subunit is conserved in mammals. Based on these findings, dual gene expression vectors were generated, producing an optimal ratio of the two subunits, resulting in a ∼1 log increase in human, rhesus, and murine IL-12p70 production compared with vectors expressing the wild type sequences. Such optimized DNA plasmids also produced significantly higher levels of systemic bioactive IL-12 upon in vivo DNA delivery in mice compared with plasmids expressing the wild type sequences. A single therapeutic injection of an optimized murine IL-12 DNA plasmid showed significantly more potent control of tumor development in the B16 melanoma cancer model in mice. Therefore, the improved IL-12p70 DNA vectors have promising potential for in vivo use as molecular vaccine adjuvants and in cancer immunotherapy. PMID:23297419
The Eucalyptus terpene synthase gene family.

PubMed

Külheim, Carsten; Padovan, Amanda; Hefer, Charles; Krause, Sandra T; Köllner, Tobias G; Myburg, Alexander A; Degenhardt, Jörg; Foley, William J

2015-06-11

Terpenoids are abundant in the foliage of Eucalyptus, providing the characteristic smell as well as being valuable economically and influencing ecological interactions. Quantitative and qualitative inter- and intra- specific variation of terpenes is common in eucalypts. The genome sequences of Eucalyptus grandis and E. globulus were mined for terpene synthase genes (TPS) and compared to other plant species. We investigated the relative expression of TPS in seven plant tissues and functionally characterized five TPS genes from E. grandis. Compared to other sequenced plant genomes, Eucalyptus grandis has the largest number of putative functional TPS genes of any sequenced plant. We discovered 113 and 106 putative functional TPS genes in E. grandis and E. globulus, respectively. All but one TPS from E. grandis were expressed in at least one of seven plant tissues examined. Genomic clusters of up to 20 genes were identified. Many TPS are expressed in tissues other than leaves which invites a re-evaluation of the function of terpenes in Eucalyptus. Our data indicate that terpenes in Eucalyptus may play a wider role in biotic and abiotic interactions than previously thought. Tissue specific expression is common and the possibility of stress induction needs further investigation. Phylogenetic comparison of the two investigated Eucalyptus species gives insight about recent evolution of different clades within the TPS gene family. While the majority of TPS genes occur in orthologous pairs some clades show evidence of recent gene duplication, as well as loss of function.
Combined hairpin-antisense compositions and methods for modulating expression

DOEpatents

Shanklin, John; Nguyen, Tam

2014-08-05

A nucleotide construct comprising a nucleotide sequence that forms a stem and a loop, wherein the loop comprises a nucleotide sequence that modulates expression of a target, wherein the stem comprises a nucleotide sequence that modulates expression of a target, and wherein the target modulated by the nucleotide sequence in the loop and the target modulated by the nucleotide sequence in the stem may be the same or different. Vectors, methods of regulating target expression, methods of providing a cell, and methods of treating conditions comprising the nucleotide sequence are also disclosed.
Combined hairpin-antisense compositions and methods for modulating expression

DOEpatents

Shanklin, John; Nguyen, Tam Huu

2015-11-24

A nucleotide construct comprising a nucleotide sequence that forms a stem and a loop, wherein the loop comprises a nucleotide sequence that modulates expression of a target, wherein the stem comprises a nucleotide sequence that modulates expression of a target, and wherein the target modulated by the nucleotide sequence in the loop and the target modulated by the nucleotide sequence in the stem may be the same or different. Vectors, methods of regulating target expression, methods of providing a cell, and methods of treating conditions comprising the nucleotide sequence are also disclosed.
Transcriptome de novo assembly from next-generation sequencing and comparative analyses in the hexaploid salt marsh species Spartina maritima and Spartina alterniflora (Poaceae)

PubMed Central

Ferreira de Carvalho, J; Poulain, J; Da Silva, C; Wincker, P; Michon-Coudouel, S; Dheilly, A; Naquin, D; Boutte, J; Salmon, A; Ainouche, M

2013-01-01

Spartina species have a critical ecological role in salt marshes and represent an excellent system to investigate recurrent polyploid speciation. Using the 454 GS-FLX pyrosequencer, we assembled and annotated the first reference transcriptome (from roots and leaves) for two related hexaploid Spartina species that hybridize in Western Europe, the East American invasive Spartina alterniflora and the Euro-African S. maritima. The de novo read assembly generated 38 478 consensus sequences and 99% found an annotation using Poaceae databases, representing a total of 16 753 non-redundant genes. Spartina expressed sequence tags were mapped onto the Sorghum bicolor genome, where they were distributed among the subtelomeric arms of the 10 S. bicolor chromosomes, with high gene density correlation. Normalization of the complementary DNA library improved the number of annotated genes. Ecologically relevant genes were identified among GO biological function categories in salt and heavy metal stress response, C4 photosynthesis and in lignin and cellulose metabolism. Expression of some of these genes had been found to be altered by hybridization and genome duplication in a previous microarray-based study in Spartina. As these species are hexaploid, up to three duplicated homoeologs may be expected per locus. When analyzing sequence polymorphism at four different loci in S. maritima and S. alterniflora, we found up to four haplotypes per locus, suggesting the presence of two expressed homoeologous sequences with one or two allelic variants each. This reference transcriptome will allow analysis of specific Spartina genes of ecological or evolutionary interest, estimation of homoeologous gene expression variation using RNA-seq and further gene expression evolution analyses in natural populations. PMID:23149455
The Vigna unguiculata Gene Expression Atlas (VuGEA) from de novo assembly and quantification of RNA-seq data provides insights into seed maturation mechanisms.

PubMed

Yao, Shaolun; Jiang, Chuan; Huang, Ziyue; Torres-Jerez, Ivone; Chang, Junil; Zhang, Heng; Udvardi, Michael; Liu, Renyi; Verdier, Jerome

2016-10-01

Legume research and cultivar development are important for sustainable food production, especially of high-protein seed. Thanks to the development of deep-sequencing technologies, crop species have been taken to the front line, even without completion of their genome sequences. Black-eyed pea (Vigna unguiculata) is a legume species widely grown in semi-arid regions, which has high potential to provide stable seed protein production in a broad range of environments, including drought conditions. The black-eyed pea reference genotype has been used to generate a gene expression atlas of the major plant tissues (i.e. leaf, root, stem, flower, pod and seed), with a developmental time series for pods and seeds. From these various organs, 27 cDNA libraries were generated and sequenced, resulting in more than one billion reads. Following filtering, these reads were de novo assembled into 36 529 transcript sequences that were annotated and quantified across the different tissues. A set of 24 866 unique transcript sequences, called Unigenes, was identified. All the information related to transcript identification, annotation and quantification were stored into a gene expression atlas webserver (http://vugea.noble.org), providing a user-friendly interface and necessary tools to analyse transcript expression in black-eyed pea organs and to compare data with other legume species. Using this gene expression atlas, we inferred details of molecular processes that are active during seed development, and identified key putative regulators of seed maturation. Additionally, we found evidence for conservation of regulatory mechanisms involving miRNA in plant tissues subjected to drought and seeds undergoing desiccation. © 2016 The Authors. The Plant Journal published by Society for Experimental Biology and John Wiley & Sons Ltd.
Dose-Response Analysis of RNA-Seq Profiles in Archival ...

EPA Pesticide Factsheets

Use of archival resources has been limited to date by inconsistent methods for genomic profiling of degraded RNA from formalin-fixed paraffin-embedded (FFPE) samples. RNA-sequencing offers a promising way to address this problem. Here we evaluated transcriptomic dose responses using RNA-sequencing in paired FFPE and frozen (FROZ) samples from two archival studies in mice, one 20 years old. Experimental treatments included 3 different doses of di(2-ethylhexyl)phthalate or dichloroacetic acid for the recently archived and older studies, respectively. Total RNA was ribo-depleted and sequenced using the Illumina HiSeq platform. In the recently archived study, FFPE samples had 35% lower total counts compared to FROZ samples but high concordance in fold-change values of differentially expressed genes (DEGs) (r2 = 0.99), highly enriched pathways (90% overlap with FROZ), and benchmark dose estimates for preselected target genes (2% difference vs FROZ). In contrast, older FFPE samples had markedly lower total counts (3% of FROZ) and poor concordance in global DEGs and pathways. However, counts from FFPE and FROZ samples still positively correlated (r2 = 0.84 across all transcripts) and showed comparable dose responses for more highly expressed target genes. These findings highlight potential applications and issues in using RNA-sequencing data from FFPE samples. Recently archived FFPE samples were highly similar to FROZ samples in sequencing q
Comparative Temporal Transcriptome Profiling of Wheat near Isogenic Line Carrying Lr57 under Compatible and Incompatible Interactions

PubMed Central

Yadav, Inderjit S.; Sharma, Amandeep; Kaur, Satinder; Nahar, Natasha; Bhardwaj, Subhash C.; Sharma, Tilak R.; Chhuneja, Parveen

2016-01-01

Leaf rust caused by Puccinia triticina (Pt) is one of the most important diseases of bread wheat globally. Recent advances in sequencing technologies have provided opportunities to analyse the complete transcriptomes of the host as well as pathogen for studying differential gene expression during infection. Pathogen induced differential gene expression was characterized in a near isogenic line carrying leaf rust resistance gene Lr57 and susceptible recipient genotype WL711. RNA samples were collected at five different time points 0, 12, 24, 48, and 72 h post inoculation (HPI) with Pt 77-5. A total of 3020 transcripts were differentially expressed with 1458 and 2692 transcripts in WL711 and WL711+Lr57, respectively. The highest number of differentially expressed transcripts was detected at 12 HPI. Functional categorization using Blast2GO classified the genes into biological processes, molecular function and cellular components. WL711+Lr57 showed a much higher number of differentially expressed nucleotide binding and leucine rich repeat genes and expressed more protein kinases and pathogenesis related proteins such as chitinases, glucanases and other PR proteins as compared to the susceptible genotype. Pathway annotation with KEGG categorized genes into 13 major classes with carbohydrate metabolism being the most prominent followed by amino acid, secondary metabolites, and nucleotide metabolism. Gene co-expression network analysis identified four and eight clusters of highly correlated genes in WL711 and WL711+Lr57, respectively. Comparative analysis of the differentially expressed transcripts led to the identification of some transcripts which were specifically expressed only in WL711+Lr57. It was apparent from the whole transcriptome sequencing that the resistance gene Lr57 directed the expression of different genes involved in building the resistance response in the host to combat invading pathogen. The RNAseq data and differentially expressed transcripts identified in the present study are a genomic resource which can be used for further studying the host pathogen interaction for Lr57 and wheat transcriptome in general. PMID:28066494
Comparative expression profiling in grape (Vitis vinifera) berries derived from frequency analysis of ESTs and MPSS signatures.

PubMed

Iandolino, Alberto; Nobuta, Kan; da Silva, Francisco Goes; Cook, Douglas R; Meyers, Blake C

2008-05-12

Vitis vinifera (V. vinifera) is the primary grape species cultivated for wine production, with an industry valued annually in the billions of dollars worldwide. In order to sustain and increase grape production, it is necessary to understand the genetic makeup of grape species. Here we performed mRNA profiling using Massively Parallel Signature Sequencing (MPSS) and combined it with available Expressed Sequence Tag (EST) data. These tag-based technologies, which do not require a priori knowledge of genomic sequence, are well-suited for transcriptional profiling. The sequence depth of MPSS allowed us to capture and quantify almost all the transcripts at a specific stage in the development of the grape berry. The number and relative abundance of transcripts from stage II grape berries was defined using Massively Parallel Signature Sequencing (MPSS). A total of 2,635,293 17-base and 2,259,286 20-base signatures were obtained, representing at least 30,737 and 26,878 distinct sequences. The average normalized abundance per signature was approximately 49 TPM (Transcripts Per Million). Comparisons of the MPSS signatures with available Vitis species' ESTs and a unigene set demonstrated that 6,430 distinct contigs and 2,190 singletons have a perfect match to at least one MPSS signature. Among the matched sequences, ESTs were identified from tissues other than berries or from berries at different developmental stages. Additional MPSS signatures not matching to known grape ESTs can extend our knowledge of the V. vinifera transcriptome, particularly when these data are used to assist in annotation of whole genome sequences from Vitis vinifera. The MPSS data presented here not only achieved a higher level of saturation than previous EST based analyses, but in doing so, expanded the known set of transcripts of grape berries during the unique stage in development that immediately precedes the onset of ripening. The MPSS dataset also revealed evidence of antisense expression not previously reported in grapes but comparable to that reported in other plant species. Finally, we developed a novel web-based, public resource for utilization of the grape MPSS data [1].
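
The abstract expresses signature abundance in TPM (Transcripts Per Million). For tag-based counts this is commonly a simple rescaling of each signature's raw count to a library of one million tags, which the Python sketch below illustrates. The function name and the signature counts are hypothetical, and the sketch assumes no length correction or filtering, which the authors' actual normalization may have included.

```python
def to_tpm(signature_counts):
    """Scale raw signature counts so that the library sums to one million."""
    total = sum(signature_counts.values())
    if total == 0:
        raise ValueError("library contains no signatures")
    return {sig: count * 1_000_000 / total
            for sig, count in signature_counts.items()}


# Made-up raw counts for three hypothetical 17-base signatures.
raw = {"GATCAAAAAAAAAAAAA": 50, "GATCCCCCCCCCCCCCC": 150, "GATCGGGGGGGGGGGGG": 800}
tpm = to_tpm(raw)
```

Normalizing this way makes abundances comparable between libraries of different sequencing depth, which is what allows signatures from different tissues or stages to be compared.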
Complete Genome Sequence of Sporisorium scitamineum and Biotrophic Interaction Transcriptome with Sugarcane

PubMed Central

Benevenuto, Juliana; Peters, Leila P.; Carvalho, Giselle; Palhares, Alessandra; Quecine, Maria C.; Nunes, Filipe R. S.; Kmit, Maria C. P.; Wai, Alvan; Hausner, Georg; Aitken, Karen S.; Berkman, Paul J.; Fraser, James A.; Moolhuijzen, Paula M.; Coutinho, Luiz L.; Creste, Silvana; Vieira, Maria L. C.; Kitajima, João P.; Monteiro-Vitorello, Claudia B.

2015-01-01

Sporisorium scitamineum is a biotrophic fungus responsible for the sugarcane smut, a worldwide spread disease. This study provides the complete sequence of individual chromosomes of S. scitamineum from telomere to telomere achieved by a combination of PacBio long reads and Illumina short reads sequence data, as well as a draft sequence of a second fungal strain. Comparative analysis to previous available sequences of another strain detected few polymorphisms among the three genomes. The novel complete sequence described herein allowed us to identify and annotate extended subtelomeric regions, repetitive elements and the mitochondrial DNA sequence. The genome comprises 19,979,571 bases, 6,677 genes encoding proteins, 111 tRNAs and 3 assembled copies of rDNA, out of our estimated number of copies as 130. Chromosomal reorganizations were detected when comparing to sequences of S. reilianum, the closest smut relative, potentially influenced by repeats of transposable elements. Repetitive elements may have also directed the linkage of the two mating-type loci. The fungal transcriptome profiling from in vitro and from interaction with sugarcane at two time points (early infection and whip emergence) revealed that 13.5% of the genes were differentially expressed in planta and particular to each developmental stage. Among them are plant cell wall degrading enzymes, proteases, lipases, chitin modification and lignin degradation enzymes, sugar transporters and transcriptional factors. The fungus also modulates transcription of genes related to surviving against reactive oxygen species and other toxic metabolites produced by the plant. Previously described effectors in smut/plant interactions were detected but some new candidates are proposed. Ten genomic islands harboring some of the candidate genes unique to S. scitamineum were expressed only in planta. RNAseq data was also used to reassure gene predictions. PMID:26065709
Wheat EST resources for functional genomics of abiotic stress

PubMed Central

Houde, Mario; Belcaid, Mahdi; Ouellet, François; Danyluk, Jean; Monroy, Antonio F; Dryanova, Ani; Gulick, Patrick; Bergeron, Anne; Laroche, André; Links, Matthew G; MacCarthy, Luke; Crosby, William L; Sarhan, Fathey

2006-01-01

Background Wheat is an excellent species to study freezing tolerance and other abiotic stresses. However, the sequence of the wheat genome has not been completely characterized due to its complexity and large size. To circumvent this obstacle and identify genes involved in cold acclimation and associated stresses, a large scale EST sequencing approach was undertaken by the Functional Genomics of Abiotic Stress (FGAS) project. Results We generated 73,521 quality-filtered ESTs from eleven cDNA libraries constructed from wheat plants exposed to various abiotic stresses and at different developmental stages. In addition, 196,041 ESTs for which tracefiles were available from the National Science Foundation wheat EST sequencing program and DuPont were also quality-filtered and used in the analysis. Clustering of the combined ESTs with d2_cluster and TGICL yielded a few large clusters containing several thousand ESTs that were refractory to routine clustering techniques. To resolve this problem, the sequence proximity and "bridges" were identified by an e-value distance graph to manually break clusters into smaller groups. Assembly of the resolved ESTs generated a 75,488 unique sequence set (31,580 contigs and 43,908 singletons/singlets). Digital expression analyses indicated that the FGAS dataset is enriched in stress-regulated genes compared to the other public datasets. Over 43% of the unique sequence set was annotated and classified into functional categories according to Gene Ontology. Conclusion We have annotated 29,556 different sequences, an almost 5-fold increase in annotated sequences compared to the available wheat public databases. Digital expression analysis combined with gene annotation helped in the identification of several pathways associated with abiotic stress. The genomic resources and knowledge developed by this project will contribute to a better understanding of the different mechanisms that govern stress tolerance in wheat and other cereals. PMID:16772040
Expression of the histone chaperone SET/TAF-Iβ during the strobilation process of Mesocestoides corti (Platyhelminthes, Cestoda).

PubMed

Costa, Caroline B; Monteiro, Karina M; Teichmann, Aline; da Silva, Edileuza D; Lorenzatto, Karina R; Cancela, Martín; Paes, Jéssica A; Benitz, André de N D; Castillo, Estela; Margis, Rogério; Zaha, Arnaldo; Ferreira, Henrique B

2015-08-01

The histone chaperone SET/TAF-Iβ is implicated in processes of chromatin remodelling and gene expression regulation. It has been associated with the control of developmental processes, but little is known about its function in helminth parasites. In Mesocestoides corti, a partial cDNA sequence related to SET/TAF-Iβ was isolated in a screening for genes differentially expressed in larvae (tetrathyridia) and adult worms. Here, the full-length coding sequence of the M. corti SET/TAF-Iβ gene was analysed and the encoded protein (McSET/TAF) was compared with orthologous sequences, showing that McSET/TAF can be regarded as a SET/TAF-Iβ family member, with a typical nucleosome-assembly protein (NAP) domain and an acidic tail. The expression patterns of the McSET/TAF gene and protein were investigated during the strobilation process by RT-qPCR, using a set of five reference genes, and by immunoblot and immunofluorescence, using monospecific polyclonal antibodies. A gradual increase in McSET/TAF transcripts and McSET/TAF protein was observed upon development induction by trypsin, demonstrating McSET/TAF differential expression during strobilation. These results provided the first evidence for the involvement of a protein from the NAP family of epigenetic effectors in the regulation of cestode development.
Inferring the expression variability of human transposable element-derived exons by linear model analysis of deep RNA sequencing data.

PubMed

Zhang, Wensheng; Edwards, Andrea; Fan, Wei; Fang, Zhide; Deininger, Prescott; Zhang, Kun

2013-08-28

The exonization of transposable elements (TEs) has proven to be a significant mechanism for the creation of novel exons. Existing knowledge of the retention patterns of TE exons in mRNAs was mainly established by the analysis of Expressed Sequence Tag (EST) data and microarray data. This study seeks to validate and extend previous studies on the expression of TE exons by an integrative statistical analysis of high throughput RNA sequencing data. We collected 26 RNA-seq datasets spanning multiple tissues and cancer types. The exon-level digital expressions (indicating retention rates in mRNAs) were quantified by a double normalized measure, called the rescaled RPKM (Reads Per Kilobase of exon model per Million mapped reads). We analyzed the distribution profiles and the variability (across samples and between tissue/disease groups) of TE exon expressions, and compared them with those of other constitutive or cassette exons. We inferred the effects of four genomic factors, including the location, length, cognate TE family and TE nucleotide proportion (RTE, see Methods section) of a TE exon, on the exons' expression level and expression variability. We also investigated the biological implications of an assembly of highly-expressed TE exons. Our analysis confirmed prior studies in the following four respects. First, with relatively high expression variability, most TE exons in mRNAs, especially those without exact counterparts in the UCSC RefSeq (Reference Sequence) gene tables, demonstrate low but still detectable expression levels in most tissue samples. Second, the TE exons in coding DNA sequences (CDSs) are less highly expressed than those in 3' (5') untranslated regions (UTRs). Third, the exons derived from chronologically ancient repeat elements, such as MIRs, tend to be highly expressed in comparison with those derived from younger TEs. Fourth, the previously observed negative relationship between the lengths of exons and the inclusion levels in transcripts is also true for exonized TEs. Furthermore, our study resulted in several novel findings. They include: (1) for the TE exons with non-zero expression and as shown in most of the studied biological samples, a high TE nucleotide proportion leads to their lower retention rates in mRNAs; (2) the considered genomic features (i.e. a continuous variable such as the exon length or a category indicator such as 3'UTR) influence the expression level and the expression variability (CV) of TE exons in an inverse manner; (3) not only the exons derived from Alu elements but also the exons from the TEs of other families were preferentially established in zinc finger (ZNF) genes.
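The two quantities this abstract relies on, RPKM and the coefficient of variation (CV), can be illustrated with a minimal sketch. The abstract does not specify how the "rescaled" (double normalized) RPKM is computed, so only the standard RPKM definition is shown; the function names and the read counts are hypothetical and chosen purely for illustration.

```python
from statistics import mean, pstdev


def rpkm(read_count, exon_length_bp, total_mapped_reads):
    """Standard RPKM: reads per kilobase of exon per million mapped reads."""
    return read_count * 1e9 / (exon_length_bp * total_mapped_reads)


def coefficient_of_variation(values):
    """CV = population standard deviation / mean (0.0 if the mean is zero)."""
    m = mean(values)
    return pstdev(values) / m if m else 0.0


# Hypothetical data: (reads on one 150-bp exon, total mapped reads) per sample.
samples = [(30, 20_000_000), (45, 25_000_000), (12, 10_000_000)]
expression = [rpkm(reads, 150, total) for reads, total in samples]
cv = coefficient_of_variation(expression)
```

Under these made-up counts the exon has RPKM values of about 10, 12 and 8, so the CV summarizes how much its expression varies across samples relative to its mean level.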
Molecular analysis of two phytohemagglutinin genes and their expression in Phaseolus vulgaris cv. Pinto, a lectin-deficient cultivar of the bean.

PubMed

Voelker, T A; Staswick, P; Chrispeels, M J

1986-12-01

Phytohemagglutinin (PHA), the seed lectin of the common bean, Phaseolus vulgaris, is encoded by two highly homologous, tandemly linked genes, dlec1 and dlec2, which are coordinately expressed at high levels in developing cotyledons. Their respective transcripts translate into closely related polypeptides, PHA-E and PHA-L, constituents of the tetrameric lectin which accumulates at high levels in developing seeds. In the bean cultivar Pinto UI111, PHA-E is not detectable, and PHA-L accumulates at very reduced levels. To investigate the cause of the Pinto phenotype, we cloned and sequenced the two PHA genes of Pinto, called Pdlec1 and Pdlec2, and determined the abundance of their respective mRNAs in developing cotyledons. Both genes are more than 90% homologous to the normal PHA genes found in other cultivars. Pdlec1 carries a 1-bp frameshift mutation close to the 5' end of its coding sequence. Only very truncated polypeptides could be made from its mRNA. The gene Pdlec2 encodes a polypeptide, which resembles PHA-L and its predicted amino acid sequence agrees with the available Pinto PHA amino acid sequence data. Analysis of the mRNA of developing cotyledons revealed that the Pdlec1 message is reduced 600-fold, and Pdlec2 mRNA is reduced 20-fold with respect to mRNA levels in normal cultivars. A comparison of the sequences which are upstream from the coding sequence shows that Pdlec2 has a 100-bp deletion compared to the other genes (dlec1, dlec2 and Pdlec1). This deletion which contains a large tandem repeat may be responsible for the low level of expression of Pdlec2. The very low expression of Pdlec1 is as yet unexplained.

IDENTIFICATION AND EXPRESSION OF MACROPHAGE MIGRATION INHIBITORY FACTOR IN SARCOPTES SCABIEI

PubMed Central

COTE’, N.M.; JAWORSKI, D.C.; WASALA, N.B.; MORGAN, M.S.; ARLIAN, L. G.

2013-01-01

Macrophage migration inhibitory factor (MIF) is a pleiotropic proinflammatory cytokine produced by many mammalian tissues including skin. It is also found in many invertebrate parasites of mammals including ticks and may function to aid the parasite to evade the innate and adaptive immune responses in the host. In this study, the cDNA for a MIF gene was sequenced from Sarcoptes scabiei, the scabies mite, using RT-PCR and RACE molecular techniques. The resulting nucleotide sequence had a length of 405 base pairs and the putative amino acid sequences for the mite and tick (Dermacentor variabilis) proteins were identical. The initial steps for the project resulted in the production of expressed scabies mite cDNAs. A real time (qPCR) assay was performed with MIF from scabies mites and various tick species. Results show that mRNA encoding MIF homologues was three times more abundant in the mite samples when compared to RNA prepared from D. variabilis salivary glands and 1.3 times more abundant when compared with RNA prepared from D. variabilis midgut. PMID:23831036
Whole-exome sequencing in a single proband reveals a mutation in the CHST8 gene in autosomal recessive peeling skin syndrome

PubMed Central

Cabral, Rita M.; Kurban, Mazen; Wajid, Muhammad; Shimomura, Yutaka; Petukhova, Lynn; Christiano, Angela M.

2015-01-01

Generalized peeling skin syndrome (PSS) is an autosomal recessive genodermatosis characterized by lifelong, continuous shedding of the upper epidermis. Using whole-genome homozygosity mapping and whole-exome sequencing, we identified a novel homozygous missense mutation (c.229C>T, R77W) within the CHST8 gene, in a large consanguineous family with non-inflammatory PSS type A. CHST8 encodes a Golgi transmembrane N-acetylgalactosamine-4-O-sulfotransferase (GalNAc4-ST1), which we show by immunofluorescence staining to be expressed throughout normal epidermis. A colorimetric assay for total sulfated glycosaminoglycan (GAG) quantification, comparing human keratinocytes (CCD1106 KERTr) expressing wild type and mutant recombinant GalNAc4-ST1, revealed decreased levels of total sulfated GAGs in cells expressing mutant GalNAc4-ST1, suggesting loss of function. Western blotting revealed lower expression levels of mutant recombinant GalNAc4-ST1 compared to wild type, suggesting that accelerated degradation may result in loss of function, leading to PSS type A. This is the first report describing a mutation as the cause of PSS type A. PMID:22289416
Whole-exome sequencing in a single proband reveals a mutation in the CHST8 gene in autosomal recessive peeling skin syndrome.

PubMed

Cabral, Rita M; Kurban, Mazen; Wajid, Muhammad; Shimomura, Yutaka; Petukhova, Lynn; Christiano, Angela M

2012-04-01

Generalized peeling skin syndrome (PSS) is an autosomal recessive genodermatosis characterized by lifelong, continuous shedding of the upper epidermis. Using whole-genome homozygosity mapping and whole-exome sequencing, we identified a novel homozygous missense mutation (c.229C>T, R77W) within the CHST8 gene, in a large consanguineous family with non-inflammatory PSS type A. CHST8 encodes a Golgi transmembrane N-acetylgalactosamine-4-O-sulfotransferase (GalNAc4-ST1), which we show by immunofluorescence staining to be expressed throughout normal epidermis. A colorimetric assay for total sulfated glycosaminoglycan (GAG) quantification, comparing human keratinocytes (CCD1106 KERTr) expressing wild type and mutant recombinant GalNAc4-ST1, revealed decreased levels of total sulfated GAGs in cells expressing mutant GalNAc4-ST1, suggesting loss of function. Western blotting revealed lower expression levels of mutant recombinant GalNAc4-ST1 compared to wild type, suggesting that accelerated degradation may result in loss of function, leading to PSS type A. This is the first report describing a mutation as the cause of PSS type A. Copyright © 2012 Elsevier Inc. All rights reserved.
Genomic organization, expression, and chromosome localization of a third aurora-related kinase gene, Aie1.

PubMed

Hu, H M; Chuang, C K; Lee, M J; Tseng, T C; Tang, T K

2000-11-01

We previously reported two novel testis-specific serine/threonine kinases, Aie1 (mouse) and AIE2 (human), that share high amino acid identities with the kinase domains of fly aurora and yeast Ipl1. Here, we report the entire intron-exon organization of the Aie1 gene and analyze the expression patterns of Aie1 mRNA during testis development. The mouse Aie1 gene spans approximately 14 kb and contains seven exons. The sequences of the exon-intron boundaries of the Aie1 gene conform to the consensus sequences (GT/AG) of the splicing donor and acceptor sites of most eukaryotic genes. Comparative genomic sequencing revealed that the gene structure is highly conserved between mouse Aie1 and human AIE2. However, much less homology was found in the sequence outside the kinase-coding domains. The Aie1 locus was mapped to mouse chromosome 7A2-A3 by fluorescent in situ hybridization. Northern blot analysis indicates that Aie1 mRNA likely is expressed at a low level on day 14 and reaches its plateau on day 21 in the developing postnatal testis. RNA in situ hybridization indicated that the expression of the Aie1 transcript was restricted to meiotically active germ cells, with the highest levels detected in spermatocytes at the late pachytene stage. These findings suggest that Aie1 plays a role in spermatogenesis.
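The GT/AG rule mentioned above (introns begin with GT at the splice donor site and end with AG at the acceptor site) can be checked mechanically. The following is a minimal sketch; the function name and the example intron strings are hypothetical and are not taken from the Aie1 sequence.

```python
def conforms_to_gt_ag(intron_seq):
    """Return True if an intron starts with GT (donor) and ends with AG (acceptor)."""
    seq = intron_seq.upper()
    return len(seq) >= 4 and seq.startswith("GT") and seq.endswith("AG")


# Hypothetical intron sequences, for illustration only.
canonical = "GTAAGTCCTTTTCTCTCAG"
noncanonical = "ATATCCTTTTCTCTCAC"
```

Here `conforms_to_gt_ag(canonical)` is true and `conforms_to_gt_ag(noncanonical)` is false; applying such a check to each of a gene's introns is one simple way to confirm that all boundaries follow the consensus.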
Molecular Cloning, Characterization, and Differential Expression of a Glucoamylase Gene from the Basidiomycetous Fungus Lentinula edodes

PubMed Central

Zhao, J.; Chen, Y. H.; Kwan, H. S.

2000-01-01

The complete nucleotide sequence of putative glucoamylase gene gla1 from the basidiomycetous fungus Lentinula edodes strain L54 is reported. The coding region of the genomic glucoamylase sequence, which is preceded by eukaryotic promoter elements CAAT and TATA, spans 2,076 bp. The gla1 gene sequence codes for a putative polypeptide of 571 amino acids and is interrupted by seven introns. The open reading frame sequence of the gla1 gene shows strong homology with those of other fungal glucoamylase genes and encodes a protein with an N-terminal catalytic domain and a C-terminal starch-binding domain. The similarity between the Gla1 protein and other fungal glucoamylases is from 45 to 61%, with the region of highest conservation found in catalytic domains and starch-binding domains. We compared the kinetics of glucoamylase activity and levels of gene expression in L. edodes strain L54 grown on different carbon sources (glucose, starch, cellulose, and potato extract) and in various developmental stages (mycelium growth, primordium appearance, and fruiting body formation). Quantitative reverse transcription PCR utilizing pairs of primers specific for gla1 gene expression shows that expression of gla1 was induced by starch and increased during the process of fruiting body formation, which indicates that glucoamylases may play an important role in the morphogenesis of the basidiomycetous fungus. PMID:10831434
Reduced expression of APC-1B but not APC-1A by the deletion of promoter 1B is responsible for familial adenomatous polyposis.

PubMed

Yamaguchi, Kiyoshi; Nagayama, Satoshi; Shimizu, Eigo; Komura, Mitsuhiro; Yamaguchi, Rui; Shibuya, Tetsuo; Arai, Masami; Hatakeyama, Seira; Ikenoue, Tsuneo; Ueno, Masashi; Miyano, Satoru; Imoto, Seiya; Furukawa, Yoichi

2016-05-24

Germline mutations in the tumor suppressor gene APC are associated with familial adenomatous polyposis (FAP). Here we applied whole-genome sequencing (WGS) to the DNA of a sporadic FAP patient in whom we did not find any pathological APC mutations by direct sequencing. WGS identified a promoter deletion of approximately 10 kb encompassing promoter 1B and exon 1B of APC. Additional allele-specific expression analysis by deep cDNA sequencing revealed that the deletion reduced the expression of the mutated APC allele to as low as 11.2% of the total APC transcripts, suggesting that the residual mutant transcripts were driven by other promoter(s). Furthermore, cap analysis of gene expression (CAGE) demonstrated that the deleted promoter 1B region is responsible for the great majority of APC transcription in many tissues except the brain. The deletion decreased the transcripts of APC-1B to 39-45% in the patient compared to the healthy controls, but it did not decrease those of APC-1A. Different deletions including promoter 1B have been reported in FAP patients. Taken together, our results strengthen the evidence that analysis of structural variations in promoter 1B should be considered for the FAP patients whose pathological mutations are not identified by conventional direct sequencing.
Cloning of Russian sturgeon (Acipenser gueldenstaedtii) growth hormone and insulin-like growth factor I and their expression in male and female fish during the first period of growth.

PubMed

Yom Din, S; Hurvitz, A; Goldberg, D; Jackson, K; Levavi-Sivan, B; Degani, G

2008-03-01

In this study, the GH and IGF-I of the Russian sturgeon (rs), Acipenser gueldenstaedtii, were cloned and sequenced, and their mRNA gene expression determined. In addition, to improve our understanding of the GH function, the expression of this hormone was assessed in young males and females. Moreover, IGF-I expression was quantified in young males and compared to that in older ones. The nucleotide sequence of the rsGH cDNA was 980 bp long and had an open reading frame of 642 bp, beginning with the first ATG codon at position 39 and ending with the stop codon at position 683. A putative polyadenylation signal, AATAAA, was recognized 42 bp upstream of the poly (A) tail. The position of the signal-peptide cleavage site was predicted to be at position 111, yielding a signal peptide of 24 amino-acids (aa) and a mature peptide of 190 aa. When the rsGH aa sequence was compared with other species, the highest degree of identity was found to be with mammalians (66-70% identity), followed by anguilliformes and amphibia (61%) and other fish (39-47%). The level of rsGH mRNA was discovered to be similar in pituitaries of females and males of 5 age groups (1, 2, 3, 4, and 5-yr-old). In females and males, the levels did not change dramatically during the first 5 yr of growth. The partial nucleotide sequence of the rsIGF-I was 445 bp long and had an open reading frame of 396 bp, beginning with the ATG codon at position 50. The position of the signal-peptide cleavage site was predicted to be at position 187, yielding a signal peptide of 44 aa. The highest level of IGF-I mRNA expression was recorded in the kidney of adult sturgeons. The IGF-I mRNA expression levels in the intestine, pituitary gland, and liver were not significantly different. Low levels of expression were found in the brain, heart, and muscle. In most tissues, there was no significant difference between mRNA levels of one and 5-yr-old fish. In conclusion, based on the GH-sequence analysis, A. gueldenstaedtii is genetically distant from other teleosts. The expression of the GH mRNA was similar in males and females, and its level remained constant during the first 5 yr of growth. While the IGF-I mRNA expression differed amongst various tissues, the level in each tissue was similar in 1 and 5-yr-old fish.
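The coordinates reported for the rsGH cDNA can be cross-checked with simple arithmetic. The sketch below assumes, as an interpretation not stated in the abstract, that the 642-bp open reading frame excludes the stop codon, that position 683 is the last base of the stop codon, and that the cleavage position is the first nucleotide of the mature peptide; the function name is hypothetical.

```python
def orf_summary(start_pos, orf_length_bp, signal_peptide_aa):
    """Derive ORF landmarks from a 1-based start position and ORF length.

    Assumes the ORF length excludes the stop codon (an interpretation,
    not something the abstract states explicitly).
    """
    codons = orf_length_bp // 3
    last_coding_nt = start_pos + orf_length_bp - 1
    return {
        "total_aa": codons,
        "stop_codon_end": last_coding_nt + 3,
        "cleavage_nt": start_pos + 3 * signal_peptide_aa,
        "mature_aa": codons - signal_peptide_aa,
    }


# Values reported for rsGH: ATG at position 39, 642-bp ORF, 24-aa signal peptide.
rs_gh = orf_summary(start_pos=39, orf_length_bp=642, signal_peptide_aa=24)
```

Under these assumptions the derived values (214 codons, stop codon ending at 683, cleavage at nucleotide 111, mature peptide of 190 aa) agree with the figures given for rsGH.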
Porcine transcriptome analysis based on 97 non-normalized cDNA libraries and assembly of 1,021,891 expressed sequence tags

PubMed Central

Gorodkin, Jan; Cirera, Susanna; Hedegaard, Jakob; Gilchrist, Michael J; Panitz, Frank; Jørgensen, Claus; Scheibye-Knudsen, Karsten; Arvin, Troels; Lumholdt, Steen; Sawera, Milena; Green, Trine; Nielsen, Bente J; Havgaard, Jakob H; Rosenkilde, Carina; Wang, Jun; Li, Heng; Li, Ruiqiang; Liu, Bin; Hu, Songnian; Dong, Wei; Li, Wei; Yu, Jun; Wang, Jian; Stærfeldt, Hans-Henrik; Wernersson, Rasmus; Madsen, Lone B; Thomsen, Bo; Hornshøj, Henrik; Bujie, Zhan; Wang, Xuegang; Wang, Xuefei; Bolund, Lars; Brunak, Søren; Yang, Huanming; Bendixen, Christian; Fredholm, Merete

2007-01-01

Background Knowledge of the structure of gene expression is essential for mammalian transcriptomics research. We analyzed a collection of more than one million porcine expressed sequence tags (ESTs), of which two-thirds were generated in the Sino-Danish Pig Genome Project and one-third are from public databases. The Sino-Danish ESTs were generated from one normalized and 97 non-normalized cDNA libraries representing 35 different tissues and three developmental stages. Results Using the Distiller package, the ESTs were assembled to roughly 48,000 contigs and 73,000 singletons, of which approximately 25% have a high confidence match to UniProt. Approximately 6,000 new porcine gene clusters were identified. Expression analysis based on the non-normalized libraries resulted in the following findings. The distribution of cluster sizes is scaling invariant. Brain and testes are among the tissues with the greatest number of different expressed genes, whereas tissues with more specialized function, such as developing liver, have fewer expressed genes. There are at least 65 high confidence housekeeping gene candidates and 876 cDNA library-specific gene candidates. We identified differential expression of genes between different tissues, in particular brain/spinal cord, and found patterns of correlation between genes that share expression in pairs of libraries. Finally, there was remarkable agreement in expression between specialized tissues according to Gene Ontology categories. Conclusion This EST collection, the largest to date in pig, represents an essential resource for annotation, comparative genomics, assembly of the pig genome sequence, and further porcine transcription studies. PMID:17407547
The Intolerance of Regulatory Sequence to Genetic Variation Predicts Gene Dosage Sensitivity

PubMed Central

Wang, Quanli; Halvorsen, Matt; Han, Yujun; Weir, William H.; Allen, Andrew S.; Goldstein, David B.

2015-01-01

Noncoding sequence contains pathogenic mutations. Yet, compared with mutations in protein-coding sequence, pathogenic regulatory mutations are notoriously difficult to recognize. Most fundamentally, we are not yet adept at recognizing the sequence stretches in the human genome that are most important in regulating the expression of genes. For this reason, it is difficult to apply to the regulatory regions the same kinds of analytical paradigms that are being successfully applied to identify mutations among protein-coding regions that influence risk. To determine whether dosage sensitive genes have distinct patterns among their noncoding sequence, we present two primary approaches that focus solely on a gene’s proximal noncoding regulatory sequence. The first approach is a regulatory sequence analogue of the recently introduced residual variation intolerance score (RVIS), termed noncoding RVIS, or ncRVIS. The ncRVIS compares observed and predicted levels of standing variation in the regulatory sequence of human genes. The second approach, termed ncGERP, reflects the phylogenetic conservation of a gene’s regulatory sequence using GERP++. We assess how well these two approaches correlate with four gene lists that use different ways to identify genes known or likely to cause disease through changes in expression: 1) genes that are known to cause disease through haploinsufficiency, 2) genes curated as dosage sensitive in ClinGen’s Genome Dosage Map, 3) genes judged likely to be under purifying selection for mutations that change expression levels because they are statistically depleted of loss-of-function variants in the general population, and 4) genes judged unlikely to cause disease based on the presence of copy number variants in the general population. We find that both noncoding scores are highly predictive of dosage sensitivity using any of these criteria. 
In a similar way to ncGERP, we assess two ensemble-based predictors of regional noncoding importance, ncCADD and ncGWAVA, and find both scores are significantly predictive of human dosage sensitive genes and appear to carry information beyond conservation, as assessed by ncGERP. These results highlight that the intolerance of noncoding sequence stretches in the human genome can provide a critical complementary tool to other genome annotation approaches to help identify the parts of the human genome increasingly likely to harbor mutations that influence risk of disease. PMID:26332131
Identification of immunity-related genes in the larvae of Protaetia brevitarsis seulensis (Coleoptera: Cetoniidae) by a next-generation sequencing-based transcriptome analysis.

PubMed

Bang, Kyeongrin; Hwang, Sejung; Lee, Jiae; Cho, Saeyoull

2015-01-01

To identify immune-related genes in the larvae of white-spotted flower chafers, next-generation sequencing was conducted with an Illumina HiSeq2000, resulting in 100 million cDNA reads with sequence information from over 10 billion base pairs (bp) and >50× transcriptome coverage. A subset of 77,336 contigs was created, and ∼35,532 sequences matched entries against the NCBI nonredundant database (cutoff, e < 10^-5). Statistical analysis was performed on the 35,532 contigs. For profiling of the immune response, samples were analyzed by aligning 42 base sequence tags to the de novo reference assembly, comparing levels in immunized larvae to control levels of expression. Of the differentially expressed genes, 3,440 transcripts were upregulated and 3,590 transcripts were downregulated. Many of these genes were confirmed as immune-related genes such as pattern recognition proteins, immune-related signal transduction proteins, antimicrobial peptides, and cellular response proteins, by comparison to published data. © The Author 2015. Published by Oxford University Press on behalf of the Entomological Society of America.
Single-cell full-length total RNA sequencing uncovers dynamics of recursive splicing and enhancer RNAs.

PubMed

Hayashi, Tetsutaro; Ozaki, Haruka; Sasagawa, Yohei; Umeda, Mana; Danno, Hiroki; Nikaido, Itoshi

2018-02-12

Total RNA sequencing has been used to reveal poly(A) and non-poly(A) RNA expression, RNA processing and enhancer activity. To date, no method for full-length total RNA sequencing of single cells has been developed despite the potential of this technology for single-cell biology. Here we describe random displacement amplification sequencing (RamDA-seq), the first full-length total RNA-sequencing method for single cells. Compared with other methods, RamDA-seq shows high sensitivity to non-poly(A) RNA and near-complete full-length transcript coverage. Using RamDA-seq with differentiation time course samples of mouse embryonic stem cells, we reveal hundreds of dynamically regulated non-poly(A) transcripts, including histone transcripts and long noncoding RNA Neat1. Moreover, RamDA-seq profiles recursive splicing in >300-kb introns. RamDA-seq also detects enhancer RNAs and their cell type-specific activity in single cells. Taken together, we demonstrate that RamDA-seq could help investigate the dynamics of gene expression, RNA-processing events and transcriptional regulation in single cells.
Developmental Considerations of Sperm Protein 17 Gene Expression in Rheumatoid Arthritis Synoviocytes

PubMed Central

Takeoka, Yuichi; Kenny, Thomas P.; Yago, Hisashi; Naiki, Mitsuru; Gershwin, M. Eric; Robbins, Dick L.

2002-01-01

Rheumatoid arthritis (RA) is an autoimmune disease characterized by proliferative synovial tissue. We used mRNA differential display and library subtraction to compare mRNA expression in RA and osteoarthritis (OA) synoviocytes. We initially compared the mRNA expression patterns in 1 female RA and 1 OA synovia and found a differentially expressed 350 bp transcript in the RA synoviocytes which was, by sequence analysis, 100% homologous to sperm protein 17 (Sp17). Moreover, the Sp17 transcript was found differentially expressed in a RA synovial library that was subtracted with an OA synovial library. Using specific primers for full length Sp17, a 1.1 kb transcript was amplified from the synoviocytes of 7 additional female RA patients, sequenced and found to be 100% homologous to Sp17. Thus, we found the unexpected expression of Sp17, a protein thought to be gamete-specific, in the synoviocytes of 8/8 female RA patients in contrast to control OA synoviocytes. Interestingly, Sp17's structural relationship with cell-binding and recognition proteins suggests that Sp17 may function in cell-cell recognition and signaling in the RA synoviocyte. Further, Sp17 could have a significant regulatory role in RA synoviocyte gene transcription and/or signal transduction. Thus, Sp17 could have an important role in RA synoviocyte proliferation or defective apoptosis. Finally, the presence of Sp17 in synoviocytes has interesting developmental considerations. PMID:12739786
Exploring codon context bias for synthetic gene design of a thermostable invertase in Escherichia coli.

PubMed

Pek, Han Bin; Klement, Maximilian; Ang, Kok Siong; Chung, Bevan Kai-Sheng; Ow, Dave Siak-Wei; Lee, Dong-Yup

2015-01-01

Various isoforms of invertases from prokaryotes, fungi, and higher plants have been expressed in Escherichia coli, and codon optimization is a widely adopted strategy for improvement of heterologous enzyme expression. Successful synthetic gene design for recombinant protein expression can be done by matching its translational elongation rate against heterologous host organisms via codon optimization. Amongst the various design parameters considered for gene synthesis, codon context bias has been relatively overlooked compared to individual codon usage, which is commonly adopted in most codon optimization tools. In addition, matching the rates of transcription and translation based on secondary structure may lead to enhanced protein folding. In this study, we evaluated codon context fitness as a design criterion for improving the expression of thermostable invertase from Thermotoga maritima in Escherichia coli and explored the relevance of secondary structure regions for folding and expression. We designed three coding sequences by using (1) a commercial vendor optimized gene algorithm, (2) codon context for the whole gene, and (3) codon context based on the secondary structure regions. Then, the codon optimized sequences were transformed and expressed in E. coli. From the resultant enzyme activities and protein yield data, the codon context fitness design yielded the highest activity compared with the wild-type control and the other criteria, while the secondary structure-based strategy was comparable to the control. Codon context bias was shown to be a relevant parameter for enhancing enzyme production in Escherichia coli by codon optimization. Thus, we can effectively design synthetic genes within heterologous host organisms using this criterion. Copyright © 2015 Elsevier Inc. All rights reserved.
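Codon context refers to the identity of adjacent codon pairs, as opposed to the frequency of individual codons. The abstract does not describe how codon context fitness was scored, so the sketch below only shows the underlying bookkeeping: counting adjacent codon pairs in a coding sequence. The function name and the toy sequence are hypothetical.

```python
from collections import Counter


def codon_pair_counts(cds):
    """Count adjacent codon pairs (codon context) in an in-frame coding sequence."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("coding sequence length must be a multiple of 3")
    codons = [cds[i:i + 3] for i in range(0, len(cds), 3)]
    # Pair each codon with its immediate downstream neighbour.
    return Counter(zip(codons, codons[1:]))


# Toy sequence (not the invertase gene): ATG GCT GCT GCT TAA
pairs = codon_pair_counts("ATGGCTGCTGCTTAA")
```

Comparing such pair counts for a candidate design against pair frequencies observed in highly expressed host genes is one plausible basis for a context-based score, though the scoring actually used in the study may differ.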
Sequencing of mRNA identifies re-expression of fetal splice variants in cardiac hypertrophy

PubMed Central

Ames, EG; Lawson, MJ; Mackey, AJ; Holmes, JW

2013-01-01

Cardiac hypertrophy has been well-characterized at the level of transcription. During cardiac hypertrophy, genes normally expressed primarily during fetal heart development are reexpressed, and this fetal gene program is believed to be a critical component of the hypertrophic process. Recently, alternative splicing of mRNA transcripts has been shown to be temporally regulated during heart development, leading us to consider whether fetal patterns of splicing also reappear during hypertrophy. We hypothesized that patterns of alternative splicing occurring during heart development are recapitulated during cardiac hypertrophy. Here we present a study of isoform expression during pressure-overload cardiac hypertrophy induced by 10 days of transverse aortic constriction (TAC) in rats and in developing fetal rat hearts compared to sham-operated adult rat hearts, using high-throughput sequencing of poly(A) tail mRNA. We find a striking degree of overlap between the isoforms expressed differentially in fetal and pressure-overloaded hearts compared to control: forty-four percent of the isoforms with significantly altered expression in TAC hearts are also expressed at significantly different levels in fetal hearts compared to control (P < 0.001). The isoforms that are shared between hypertrophy and fetal heart development are significantly enriched for genes involved in cytoskeletal organization, RNA processing, developmental processes, and metabolic enzymes. Our data strongly support the concept that mRNA splicing patterns normally associated with heart development recur as part of the hypertrophic response to pressure overload. These findings suggest that cardiac hypertrophy shares post-transcriptional as well as transcriptional regulatory mechanisms with fetal heart development. PMID:23688780
Large-scale identification and comparative analysis of miRNA expression profile in the respiratory tree of the sea cucumber Apostichopus japonicus during aestivation.

PubMed

Chen, Muyan; Storey, Kenneth B

2014-02-01

The sea cucumber Apostichopus japonicus withstands high water temperatures in the summer by suppressing its metabolic rate and entering a state of aestivation. We hypothesized that changes in the expression of miRNAs could provide important post-transcriptional regulation of gene expression during hypometabolism via control over mRNA translation. The present study analyzed profiles of miRNA expression in the sea cucumber respiratory tree using Solexa deep sequencing technology. We identified 279 sea cucumber miRNAs, including 15 novel miRNAs specific to sea cucumber. Animals sampled during deep aestivation (DA; after at least 15 days of continuous torpor) were compared with animals from a non-aestivation (NA) state (animals that had passed through aestivation and returned to an active state). We identified 30 differentially expressed miRNAs ([RPM (reads per million) >10, |FC| (|fold change|)≥1, FDR (false discovery rate)<0.01]) during aestivation, which were validated by two other miRNA profiling methods: miRNA microarray and real-time PCR. Among the most prominent miRNA species, miR-124, miR-124-3p, miR-79, miR-9 and miR-2010 were significantly over-expressed during deep aestivation compared with non-aestivation animals, suggesting that these miRNAs may play important roles in metabolic rate suppression during aestivation. High-throughput sequencing data and microarray data have been submitted to the GEO database with accession number: 16902695. Copyright © 2014 Elsevier B.V. All rights reserved.
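The selection criteria quoted above (RPM > 10, |FC| ≥ 1, FDR < 0.01) amount to a three-way threshold filter. The sketch below applies those thresholds to hypothetical records; the class name, field names and all values are invented for illustration, and the scale on which fold change is expressed is not stated in the abstract.

```python
from dataclasses import dataclass


@dataclass
class MirnaResult:
    name: str
    rpm: float          # reads per million
    fold_change: float  # scale (e.g. log2) is not stated in the abstract
    fdr: float          # false discovery rate


def is_differentially_expressed(r, min_rpm=10, min_abs_fc=1, max_fdr=0.01):
    """Apply the thresholds quoted in the abstract: RPM > 10, |FC| >= 1, FDR < 0.01."""
    return r.rpm > min_rpm and abs(r.fold_change) >= min_abs_fc and r.fdr < max_fdr


# Hypothetical records; the values are made up and not taken from the study.
results = [
    MirnaResult("miR-A", rpm=250.0, fold_change=2.3, fdr=0.001),
    MirnaResult("miR-B", rpm=8.0, fold_change=3.0, fdr=0.001),    # fails RPM
    MirnaResult("miR-C", rpm=120.0, fold_change=-1.5, fdr=0.005),
    MirnaResult("miR-D", rpm=90.0, fold_change=0.4, fdr=0.0001),  # fails |FC|
    MirnaResult("miR-E", rpm=60.0, fold_change=1.8, fdr=0.05),    # fails FDR
]
selected = [r.name for r in results if is_differentially_expressed(r)]
```

Only records passing all three thresholds are retained, so a miRNA with a large fold change but low abundance or weak statistical support is excluded.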
Comparative transcriptome analyses of three medicinal Forsythia species and prediction of candidate genes involved in secondary metabolisms.

PubMed

Sun, Luchao; Rai, Amit; Rai, Megha; Nakamura, Michimi; Kawano, Noriaki; Yoshimatsu, Kayo; Suzuki, Hideyuki; Kawahara, Nobuo; Saito, Kazuki; Yamazaki, Mami

2018-05-07

The three Forsythia species, F. suspensa, F. viridissima and F. koreana, have been used as herbal medicines in China, Japan and Korea for centuries and they are known to be rich sources of numerous pharmaceutical metabolites, forsythin, forsythoside A, arctigenin, rutin and other phenolic compounds. In this study, de novo transcriptome sequencing and assembly was performed on these species. Using leaf and flower tissues of F. suspensa, F. viridissima and F. koreana, 1.28-2.45-Gbp sequences of Illumina based pair-end reads were obtained and assembled into 81,913, 88,491 and 69,458 unigenes, respectively. Classification of the annotated unigenes in gene ontology terms and KEGG pathways was used to compare the transcriptome of three Forsythia species. The expression analysis of orthologous genes across all three species showed the expression in leaf tissues being highly correlated. The candidate genes presumably involved in the biosynthetic pathway of lignans and phenylethanoid glycosides were screened as co-expressed genes. They express highly in the leaves of F. viridissima and F. koreana. Furthermore, the three unigenes annotated as acyltransferase were predicted to be associated with the biosynthesis of acteoside and forsythoside A from the expression pattern and phylogenetic analysis. This study is the first report on comparative transcriptome analyses of medicinally important Forsythia genus and will serve as an important resource to facilitate further studies on biosynthesis and regulation of therapeutic compounds in Forsythia species.
Reversible second-order conditional sequences in incidental sequence learning tasks.

PubMed

Pasquali, Antoine; Cleeremans, Axel; Gaillard, Vinciane

2018-06-01

In sequence learning tasks, participants' sensitivity to the sequential structure of a series of events often overshoots their ability to express relevant knowledge intentionally, as in generation tasks that require participants to produce either the next element of a sequence (inclusion) or a different element (exclusion). Comparing generation performance under inclusion and exclusion conditions makes it possible to assess the respective influences of conscious and unconscious learning. Recently, two main concerns have been expressed concerning such tasks. First, it is often difficult to design control sequences in such a way that they enable clear comparisons with the training material. Second, it is challenging to ask participants to perform appropriately under exclusion instructions, for the requirement to exclude familiar responses often leads them to adopt degenerate strategies (e.g., pushing on the same key all the time), which then need to be specifically singled out as invalid. To overcome both concerns, we introduce reversible second-order conditional (RSOC) sequences and show (a) that they elicit particularly strong transfer effects, (b) that dissociation of implicit and explicit influences becomes possible thanks to the removal of salient transitions in RSOCs, and (c) that exclusion instructions can be greatly simplified without losing sensitivity.
Too much data, but little inter-changeability: a lesson learned from mining public data on tissue specificity of gene expression.

PubMed

Li, Shuyu; Li, Yiqun Helen; Wei, Tao; Su, Eric Wen; Duffin, Kevin; Liao, Birong

2006-10-25

The tissue expression pattern of a gene often provides an important clue to its potential role in a biological process. A vast amount of gene expression data have been and are being accumulated in public repository through different technology platforms. However, exploitations of these rich data sources remain limited in part due to issues of technology standardization. Our objective is to test the data comparability between SAGE and microarray technologies, through examining the expression pattern of genes under normal physiological states across variety of tissues. There are 42-54% of genes showing significant correlations in tissue expression patterns between SAGE and GeneChip, with 30-40% of genes whose expression patterns are positively correlated and 10-15% of genes whose expression patterns are negatively correlated at a statistically significant level (p = 0.05). Our analysis suggests that the discrepancy on the expression patterns derived from technology platforms is not likely from the heterogeneity of tissues used in these technologies, or other spurious correlations resulting from microarray probe design, abundance of genes, or gene function. The discrepancy can be partially explained by errors in the original assignment of SAGE tags to genes due to the evolution of sequence databases. In addition, sequence analysis has indicated that many SAGE tags and Affymetrix array probe sets are mapped to different splice variants or different sequence regions although they represent the same gene, which also contributes to the observed discrepancies between SAGE and array expression data. To our knowledge, this is the first report attempting to mine gene expression patterns across tissues using public data from different technology platforms. Unlike previous similar studies that only demonstrated the discrepancies between the two gene expression platforms, we carried out in-depth analysis to further investigate the cause for such discrepancies. 
Our study shows that the exploitation of rich public expression resource requires extensive knowledge about the technologies, and experiment. Informatic methodologies for better interoperability among platforms still remain a gap. One of the areas that can be improved practically is the accurate sequence mapping of SAGE tags and array probes to full-length genes.
Expression profiling of snoRNAs in normal hematopoiesis and AML

PubMed Central

Warner, Wayne A.; Spencer, David H.; Trissal, Maria; White, Brian S.; Helton, Nichole; Ley, Timothy J.

2018-01-01

Small nucleolar RNAs (snoRNAs) are noncoding RNAs that contribute to ribosome biogenesis and RNA splicing by modifying ribosomal RNA and spliceosome RNAs, respectively. We optimized a next-generation sequencing approach and a custom analysis pipeline to identify and quantify expression of snoRNAs in acute myeloid leukemia (AML) and normal hematopoietic cell populations. We show that snoRNAs are expressed in a lineage- and development-specific fashion during hematopoiesis. The most striking examples involve snoRNAs located in 2 imprinted loci, which are highly expressed in hematopoietic progenitors and downregulated during myeloid differentiation. Although most snoRNAs are expressed at similar levels in AML cells compared with CD34+, a subset of snoRNAs showed consistent differential expression, with the great majority of these being decreased in the AML samples. Analysis of host gene expression, splicing patterns, and whole-genome sequence data for mutational events did not identify transcriptional patterns or genetic alterations that account for these expression differences. These data provide a comprehensive analysis of the snoRNA transcriptome in normal and leukemic cells and should be helpful in the design of studies to define the contribution of snoRNAs to normal and malignant hematopoiesis. PMID:29365324
microRNA expression profiling in fetal single ventricle malformation identified by deep sequencing.

PubMed

Yu, Zhang-Bin; Han, Shu-Ping; Bai, Yun-Fei; Zhu, Chun; Pan, Ya; Guo, Xi-Rong

2012-01-01

microRNAs (miRNAs) have emerged as key regulators in many biological processes, particularly cardiac growth and development, although the specific miRNA expression profile associated with this process remains to be elucidated. This study aimed to characterize the cellular microRNA profile involved in the development of congenital heart malformation, through the investigation of single ventricle (SV) defects. Comprehensive miRNA profiling in human fetal SV cardiac tissue was performed by deep sequencing. Differential expression of 48 miRNAs was revealed by sequencing by oligonucleotide ligation and detection (SOLiD) analysis. Of these, 38 were down-regulated and 10 were up-regulated in differentiated SV cardiac tissue, compared to control cardiac tissue. This was confirmed by real-time quantitative reverse transcription-polymerase chain reaction (qRT-PCR) analysis. Predicted target genes of the 48 differentially expressed miRNAs were analyzed by gene ontology and categorized according to cellular process, regulation of biological process and metabolic process. Pathway-Express analysis identified the WNT and mTOR signaling pathways as the most significant processes putatively affected by the differential expression of these miRNAs. The candidate genes involved in cardiac development were identified as potential targets for these differentially expressed microRNAs and the collaborative network of microRNAs and cardiac development related-mRNAs was constructed. These data provide the basis for future investigation of the mechanism of the occurrence and development of fetal SV malformations.

Expressed sequences tags of the anther smut fungus, Microbotryum violaceum, identify mating and pathogenicity genes

PubMed Central

Yockteng, Roxana; Marthey, Sylvain; Chiapello, Hélène; Gendrault, Annie; Hood, Michael E; Rodolphe, François; Devier, Benjamin; Wincker, Patrick; Dossat, Carole; Giraud, Tatiana

2007-01-01

Background The basidiomycete fungus Microbotryum violaceum is responsible for the anther-smut disease in many plants of the Caryophyllaceae family and is a model in genetics and evolutionary biology. Infection is initiated by dikaryotic hyphae produced after the conjugation of two haploid sporidia of opposite mating type. This study describes M. violaceum ESTs corresponding to nuclear genes expressed during conjugation and early hyphal production. Results A normalized cDNA library generated 24,128 sequences, which were assembled into 7,765 unique genes; 25.2% of them displayed significant similarity to annotated proteins from other organisms, 74.3% a weak similarity to the same set of known proteins, and 0.5% were orphans. We identified putative pheromone receptors and genes that in other fungi are involved in the mating process. We also identified many sequences similar to genes known to be involved in pathogenicity in other fungi. The M. violaceum EST database, MICROBASE, is available on the Web and provides access to the sequences, assembled contigs, annotations and programs to compare similarities against MICROBASE. Conclusion This study provides a basis for cloning the mating type locus, for further investigation of pathogenicity genes in the anther smut fungi, and for comparative genomics. PMID:17692127
Sox2 regulatory region 2 sequence works as a DNA nuclear targeting sequence enhancing the efficiency of an exogenous gene expression in ES cells

DOE Office of Scientific and Technical Information (OSTI.GOV)

Funabashi, Hisakage; Takatsu, Makoto; Saito, Mikako

2010-10-01

Research highlights: SV40-DTS worked as a DTS in ES cells as well as in other types of cells. Sox2 regulatory region 2 worked as a DTS in ES cells and thus was termed SRR2-DTS. SRR2-DTS was suggested as an ES cell-specific DTS. Abstract: In this report, the effects of two DNA nuclear targeting sequence (DTS) candidates on the gene expression efficiency in ES cells were investigated. Reporter plasmids containing the simian virus 40 (SV40) promoter/enhancer sequence (SV40-DTS), a DTS for various types of cells but not yet reported for ES cells, and the 81 base pairs of Sox2 regulatory region 2 (SRR2) where two transcriptional factors in ES cells, Oct3/4 and Sox2, are bound (SRR2-DTS), were introduced into the cytoplasm of living cells by femtoinjection. The gene expression efficiencies of each plasmid in mouse insulinoma cell line MIN6 cells and mouse ES cells were then evaluated. Plasmids including SV40-DTS and SRR2-DTS exhibited higher gene expression efficiency compared with plasmids without these DTSs, and thus it was concluded that both sequences work as a DTS in ES cells. In addition, it was suggested that SRR2-DTS works as an ES cell-specific DTS. To the best of our knowledge, this is the first report to confirm the function of DTSs in ES cells.
Characterization and safety evaluation of HPPD W336, a modified 4-hydroxyphenylpyruvate dioxygenase protein, and the impact of its expression on plant metabolism in herbicide-tolerant MST-FGØ72-2 soybean.

PubMed

Dreesen, Rozemarijn; Capt, Annabelle; Oberdoerfer, Regina; Coats, Isabelle; Pallett, Kenneth Edward

2018-06-09

By transgenic expression technology, a modified 4-hydroxyphenylpyruvate dioxygenase enzyme (HPPD W336) originating from Pseudomonas fluorescens is expressed in MST-FGØ72-2 soybean to confer tolerance to 4-benzoyl isoxazole and triketone type of herbicides. Characterization and safety assessment of HPPD W336 were performed. No relevant sequence homologies were found with known allergens or toxins. Although sequence identity to known toxins showed identity to HPPD proteins annotated as hemolysins, the absence of hemolytic activity of HPPD W336 was demonstrated in vitro. HPPD W336 degrades rapidly in simulated gastric fluid. The absence of toxicity and hemolytic potential of HPPD W336 was confirmed by in vivo studies. The substrate spectrum of HPPD W336 was compared with wild type HPPD proteins, demonstrating that its expression is unlikely to induce any metabolic shifts in soybean. The potential effect of expression of HPPD W336 on metabolic pathways related to tyrosine was investigated by comparing seed composition of MST-FGØ72-2 soybean with non-genetically modified varieties, demonstrating that expression of HPPD W336 does not change aromatic amino acid, homogentisate and tocochromanol levels. In conclusion, HPPD W336 was demonstrated to be as safe as other food proteins. No adverse metabolic effects were identified related to HPPD W336 expression in MST-FGØ72-2 soybean. Copyright © 2018. Published by Elsevier Inc.
Characterization and Comparative Profiling of MiRNA Transcriptomes in Bighead Carp and Silver Carp

PubMed Central

Chi, Wei; Tong, Chaobo; Gan, Xiaoni; He, Shunping

2011-01-01

MicroRNAs (miRNAs) are small non-coding RNA molecules that are processed from large ‘hairpin’ precursors and function as post-transcriptional regulators of target genes. Although many individual miRNAs have recently been extensively studied, there has been very little research on miRNA transcriptomes in teleost fishes. By using high throughput sequencing technology, we have identified 167 and 166 conserved miRNAs (belonging to 108 families) in bighead carp (Hypophthalmichthys nobilis) and silver carp (Hypophthalmichthys molitrix), respectively. We compared the expression patterns of conserved miRNAs by means of hierarchical clustering analysis and log2 ratio. Results indicated that there is not a strong correlation between sequence conservation and expression conservation; most of these miRNAs have similar expression patterns. However, high expression differences were also identified for several individual miRNAs. Several miRNA* sequences were also found in our dataset and some of them may have regulatory functions. Two computational strategies were used to identify novel miRNAs from un-annotated data in the two carps. The first strategy, based on the zebrafish genome, identified 8 and 22 novel miRNAs in bighead carp and silver carp, respectively. We postulate that these miRNAs should also exist in the zebrafish, but the methodologies used have not allowed for their detection. In the second strategy we obtained several carp-specific miRNAs, 31 in bighead carp and 32 in silver carp, which showed low expression. Gain and loss of family members were observed in several miRNA families, which suggests that duplication of animal miRNA genes may occur through evolutionary processes similar to those of protein-coding genes. PMID:21858165
Grouping and characterization of putative glycosyltransferase genes from Panax ginseng Meyer.

PubMed

Khorolragchaa, Altanzul; Kim, Yu-Jin; Rahimi, Shadi; Sukweenadhi, Johan; Jang, Moon-Gi; Yang, Deok-Chun

2014-02-15

Glycosyltransferases are members of the multigene family of plants that can transfer single or multiple activated sugars to a range of plant molecules, resulting in the glycosylation of plant compounds. Although the activities of many glycosyltransferases and their products have been recognized for a long time, only in recent years were some glycosyltransferase genes identified and few have been functionally characterized in detail. Korean ginseng (Panax ginseng Meyer), belonging to Araliaceae, has been well known as a popular mysterious medicinal herb in East Asia for over 2,000 years. A total of 704 glycosyltransferase unique sequences have been found from a ginseng expressed sequence tag (EST) library, and these sequences encode enzymes responsible for the secondary metabolite biosynthesis. Finally, twelve UDP glycosyltransferases (UGTs) were selected as the candidates most likely to be involved in triterpenoid synthesis. In this study, we classified the candidate P. ginseng UGTs (PgUGTs) into proper families and groups, which resulted in eight UGT families and six UGT groups. We also investigated those gene candidates encoding for glycosyltransferases by analysis of gene expression in methyl jasmonate (MeJA)-treated ginseng adventitious roots and different tissues from four-year-old ginseng using quantitative reverse transcriptase-polymerase chain reaction (RT-PCR). For organ-specific expression, most of PgUGT transcription levels were higher in leaves and roots compared with flower buds and stems. The transcription of PgUGTs in adventitious roots treated with MeJA increased as compared with the control. PgUGT1 and PgUGT2, which belong to the UGT71 family genes expressed in MeJA-treated adventitious roots, were especially sensitive, showing 33.32 and 38.88-fold expression increases upon 24h post-treatments, respectively. © 2013 Elsevier B.V. All rights reserved.
Genome and transcriptome sequencing in prospective metastatic triple-negative breast cancer uncovers therapeutic vulnerabilities.

PubMed

Craig, David W; O'Shaughnessy, Joyce A; Kiefer, Jeffrey A; Aldrich, Jessica; Sinari, Shripad; Moses, Tracy M; Wong, Shukmei; Dinh, Jennifer; Christoforides, Alexis; Blum, Joanne L; Aitelli, Cristi L; Osborne, Cynthia R; Izatt, Tyler; Kurdoglu, Ahmet; Baker, Angela; Koeman, Julie; Barbacioru, Catalin; Sakarya, Onur; De La Vega, Francisco M; Siddiqui, Asim; Hoang, Linh; Billings, Paul R; Salhia, Bodour; Tolcher, Anthony W; Trent, Jeffrey M; Mousses, Spyro; Von Hoff, Daniel; Carpten, John D

2013-01-01

Triple-negative breast cancer (TNBC) is characterized by the absence of expression of estrogen receptor, progesterone receptor, and HER-2. Thirty percent of patients recur after first-line treatment, and metastatic TNBC (mTNBC) has a poor prognosis with median survival of one year. Here, we present initial analyses of whole genome and transcriptome sequencing data from 14 prospective mTNBC. We have cataloged the collection of somatic genomic alterations in these advanced tumors, particularly those that may inform targeted therapies. Genes mutated in multiple tumors included TP53, LRP1B, HERC1, CDH5, RB1, and NF1. Notable genes involved in focal structural events were CTNNA1, PTEN, FBXW7, BRCA2, WT1, FGFR1, KRAS, HRAS, ARAF, BRAF, and PGCP. Homozygous deletion of CTNNA1 was detected in 2 of 6 African Americans. RNA sequencing revealed consistent overexpression of the FOXM1 gene when tumor gene expression was compared with nonmalignant breast samples. Using an outlier analysis of gene expression comparing one cancer with all the others, we detected expression patterns unique to each patient's tumor. Integrative DNA/RNA analysis provided evidence for deregulation of mutated genes, including the monoallelic expression of TP53 mutations. Finally, molecular alterations in several cancers supported targeted therapeutic intervention on clinical trials with known inhibitors, particularly for alterations in the RAS/RAF/MEK/ERK and PI3K/AKT/mTOR pathways. In conclusion, whole genome and transcriptome profiling of mTNBC have provided insights into somatic events occurring in this difficult to treat cancer. These genomic data have guided patients to investigational treatment trials and provide hypotheses for future trials in this irremediable cancer.
Noncoding RNA Expression and Targeted Next-Generation Sequencing Distinguish Tubulocystic Renal Cell Carcinoma (TC-RCC) from Other Renal Neoplasms.

PubMed

Lawrie, Charles H; Armesto, María; Fernandez-Mercado, Marta; Arestín, María; Manterola, Lorea; Goicoechea, Ibai; Larrea, Erika; Caffarel, María M; Araujo, Angela M; Sole, Carla; Sperga, Maris; Alvarado-Cabrero, Isabel; Michal, Michal; Hes, Ondrej; López, José I

2018-01-01

Tubulocystic renal cell carcinoma (TC-RCC) is a rare recently described renal neoplasm characterized by gross, microscopic, and immunohistochemical differences from other renal tumor types and was recently classified as a distinct entity. However, this distinction remains controversial particularly because some genetic studies suggest a close relationship with papillary RCC (PRCC). The molecular basis of this disease remains largely unexplored. We therefore performed noncoding (nc) RNA/miRNA expression analysis and targeted next-generation sequencing mutational profiling on 13 TC-RCC cases (11 pure, two mixed TC-RCC/PRCC) and compared them with other renal neoplasms. The expression profile of miRNAs and other ncRNAs in TC-RCC was distinct, and we validated 10 differentially expressed miRNAs by quantitative RT-PCR, including miR-155 and miR-34a, which were significantly down-regulated compared with PRCC cases (n = 22). With the use of targeted next-generation sequencing we identified mutations in 14 different genes, most frequently (>60% of TC-RCC cases) in ABL1 and PDGFRA genes. These mutations were present in <5% of clear cell RCC, PRCC, or chromophobe RCC cases (n > 600) of The Cancer Genome Atlas database. In summary, this study is by far the largest molecular study of TC-RCC cases and the first to investigate either ncRNA expression or their genomic profile. These results add molecular evidence that TC-RCC is indeed a distinct entity from PRCC and other renal neoplasms. Copyright © 2018 American Society for Investigative Pathology and the Association for Molecular Pathology. Published by Elsevier Inc. All rights reserved.
High-Throughput Sequencing Reveals Differential Expression of miRNAs in Intestine from Sea Cucumber during Aestivation

PubMed Central

Chen, Muyan; Zhang, Xiumei; Liu, Jianning; Storey, Kenneth B.

2013-01-01

The regulatory role of miRNA in gene expression is an emerging topic in the control of hypometabolism. Sea cucumber aestivation is a complicated physiological process that includes obvious hypometabolism as evidenced by a decrease in the rates of oxygen consumption and ammonia nitrogen excretion, as well as a serious degeneration of the intestine into a very tiny filament. To determine whether miRNAs play regulatory roles in this process, the present study analyzed profiles of miRNA expression in the intestine of the sea cucumber (Apostichopus japonicus), using Solexa deep sequencing technology. We identified 308 sea cucumber miRNAs, including 18 novel miRNAs specific to sea cucumber. Animals sampled during deep aestivation (DA; after at least 15 days of continuous torpor) were compared with animals from a non-aestivation (NA) state (animals that had passed through aestivation and returned to the active state). We identified 42 differentially expressed miRNAs [RPM (reads per million) >10, |FC| (|fold change|) ≥1, FDR (false discovery rate) <0.01] during aestivation, which were validated by two other miRNA profiling methods: miRNA microarray and real-time PCR. Among the most prominent miRNA species, miR-200-3p, miR-2004, miR-2010, miR-22, miR-252a, miR-252a-3p and miR-92 were significantly over-expressed during deep aestivation compared with non-aestivation animals. Preliminary analyses of their putative target genes and GO analysis suggest that these miRNAs could play important roles in global transcriptional depression and cell differentiation during aestivation. High-throughput sequencing data and microarray data have been submitted to the GEO database. PMID:24143179
Elephant Transcriptome Provides Insights into the Evolution of Eutherian Placentation

PubMed Central

Hou, Zhuo-Cheng; Sterner, Kirstin N.; Romero, Roberto; Than, Nandor Gabor; Gonzalez, Juan M.; Weckle, Amy; Xing, Jun; Benirschke, Kurt; Goodman, Morris; Wildman, Derek E.

2012-01-01

The chorioallantoic placenta connects mother and fetus in eutherian pregnancies. In order to understand the evolution of the placenta and provide further understanding of placenta biology, we sequenced the transcriptome of a term placenta of an African elephant (Loxodonta africana) and compared these data with RNA sequence and microarray data from other eutherian placentas including human, mouse, and cow. We characterized the composition of 55,910 expressed sequence tag (i.e., cDNA) contigs using our custom annotation pipeline. A Markov algorithm was used to cluster orthologs of human, mouse, cow, and elephant placenta transcripts. We found 2,963 genes are commonly expressed in the placentas of these eutherian mammals. Gene ontology categories previously suggested to be important for placenta function (e.g., estrogen receptor signaling pathway, cell motion and migration, and adherens junctions) were significantly enriched in these eutherian placenta–expressed genes. Genes duplicated in different lineages and also specifically expressed in the placenta contribute to the great diversity observed in mammalian placenta anatomy. We identified 1,365 human lineage–specific, 1,235 mouse lineage–specific, 436 cow lineage–specific, and 904 elephant-specific placenta-expressed (PE) genes. The most enriched clusters of human-specific PE genes are signal/glycoprotein and immunoglobulin, and humans possess a deeply invasive human hemochorial placenta that comes into direct contact with maternal immune cells. Inference of phylogenetically conserved and derived transcripts demonstrates the power of comparative transcriptomics to trace placenta evolution and variation across mammals and identified candidate genes that may be important in the normal function of the human placenta, and their dysfunction may be related to human pregnancy complications. PMID:22546564
RNA Sequencing Analysis of the Gametophyte Transcriptome from the Liverwort, Marchantia polymorpha

PubMed Central

Sharma, Niharika; Jung, Chol-Hee; Bhalla, Prem L.; Singh, Mohan B.

2014-01-01

The liverwort Marchantia polymorpha is a member of the most basal lineage of land plants (embryophytes) and likely retains many ancestral morphological, physiological and molecular characteristics. Despite its phylogenetic importance and the availability of previous EST studies, M. polymorpha’s lack of economic importance limits accessible genomic resources for this species. We employed Illumina RNA-Seq technology to sequence the gametophyte transcriptome of M. polymorpha. cDNA libraries from 6 different male and female developmental tissues were sequenced to delineate a global view of the M. polymorpha transcriptome. Approximately 80 million short reads were obtained and assembled into a non-redundant set of 46,533 transcripts (≥ 200 bp) from 46,070 loci. The average length and the N50 length of the transcripts were 757 bp and 471 bp, respectively. Sequence comparison of assembled transcripts with non-redundant proteins from embryophytes resulted in the annotation of 43% of the transcripts. The transcripts were also compared with M. polymorpha expressed sequence tags (ESTs), and approximately 69.5% of the transcripts appeared to be novel. Twenty-one percent of the transcripts were assigned GO terms to improve annotation. In addition, 6,112 simple sequence repeats (SSRs) were identified as potential molecular markers, which may be useful in studies of genetic diversity. A comparative genomics approach revealed that a substantial proportion of the genes (35.5%) expressed in M. polymorpha were conserved across phylogenetically related species, such as Selaginella and Physcomitrella, and identified 580 genes that are potentially unique to liverworts. Our study presents an extensive amount of novel sequence information for M. polymorpha. 
This information will serve as a valuable genomics resource for further molecular, developmental and comparative evolutionary studies, as well as for the isolation and characterization of functional genes that are involved in sex differentiation and sexual reproduction in this liverwort. PMID:24841988
cisprimertool: software to implement a comparative genomics strategy for the development of conserved intron scanning (CIS) markers.

PubMed

Jayashree, B; Jagadeesh, V T; Hoisington, D

2008-05-01

The availability of complete, annotated genomic sequence information in model organisms is a rich resource that can be extended to understudied orphan crops through comparative genomic approaches. We report here a software tool (cisprimertool) for the identification of conserved intron scanning regions using expressed sequence tag alignments to a completely sequenced model crop genome. The method used is based on earlier studies reporting the assessment of conserved intron scanning primers (called CISP) within relatively conserved exons located near exon-intron boundaries from onion, banana, sorghum and pearl millet alignments with rice. The tool is freely available to academic users at http://www.icrisat.org/gt-bt/CISPTool.htm. © 2007 ICRISAT.
Functional analysis and transcriptional output of the Göttingen minipig genome.

PubMed

Heckel, Tobias; Schmucki, Roland; Berrera, Marco; Ringshandl, Stephan; Badi, Laura; Steiner, Guido; Ravon, Morgane; Küng, Erich; Kuhn, Bernd; Kratochwil, Nicole A; Schmitt, Georg; Kiialainen, Anna; Nowaczyk, Corinne; Daff, Hamina; Khan, Azinwi Phina; Lekolool, Isaac; Pelle, Roger; Okoth, Edward; Bishop, Richard; Daubenberger, Claudia; Ebeling, Martin; Certa, Ulrich

2015-11-14

In the past decade the Göttingen minipig has gained increasing recognition as animal model in pharmaceutical and safety research because it recapitulates many aspects of human physiology and metabolism. Genome-based comparison of drug targets together with quantitative tissue expression analysis allows rational prediction of pharmacology and cross-reactivity of human drugs in animal models thereby improving drug attrition which is an important challenge in the process of drug development. Here we present a new chromosome level based version of the Göttingen minipig genome together with a comparative transcriptional analysis of tissues with pharmaceutical relevance as basis for translational research. We relied on mapping and assembly of WGS (whole-genome-shotgun sequencing) derived reads to the reference genome of the Duroc pig and predict 19,228 human orthologous protein-coding genes. Genome-based prediction of the sequence of human drug targets enables the prediction of drug cross-reactivity based on conservation of binding sites. We further support the finding that the genome of Sus scrofa contains about ten-times less pseudogenized genes compared to other vertebrates. Among the functional human orthologs of these minipig pseudogenes we found HEPN1, a putative tumor suppressor gene. The genomes of Sus scrofa, the Tibetan boar, the African Bushpig, and the Warthog show sequence conservation of all inactivating HEPN1 mutations suggesting disruption before the evolutionary split of these pig species. We identify 133 Sus scrofa specific, conserved long non-coding RNAs (lncRNAs) in the minipig genome and show that these transcripts are highly conserved in the African pigs and the Tibetan boar suggesting functional significance. Using a new minipig specific microarray we show high conservation of gene expression signatures in 13 tissues with biomedical relevance between humans and adult minipigs. 
We underline this relationship for minipig and human liver where we could demonstrate similar expression levels for most phase I drug-metabolizing enzymes. Higher expression levels and metabolic activities were found for FMO1, AKR/CRs and for phase II drug metabolizing enzymes in minipig as compared to human. The variability of gene expression in equivalent human and minipig tissues is considerably higher in minipig organs, which is important for study design in case a human target belongs to this variable category in the minipig. The first analysis of gene expression in multiple tissues during development from young to adult shows that the majority of transcriptional programs are concluded four weeks after birth. This finding is in line with the advanced state of human postnatal organ development at comparative age categories and further supports the minipig as model for pediatric drug safety studies. Genome based assessment of sequence conservation combined with gene expression data in several tissues improves the translational value of the minipig for human drug development. The genome and gene expression data presented here are important resources for researchers using the minipig as model for biomedical research or commercial breeding. Potential impact of our data for comparative genomics, translational research, and experimental medicine are discussed.
Generation and analysis of expressed sequence tags in the extreme large genomes Lilium and Tulipa

PubMed Central

2012-01-01

Background Bulbous flowers such as lily and tulip (Liliaceae family) are monocot perennial herbs that are economically very important ornamental plants worldwide. However, there are hardly any genetic studies performed and genomic resources are lacking. To build genomic resources and develop tools to speed up the breeding in both crops, next generation sequencing was implemented. We sequenced and assembled transcriptomes of four lily and five tulip genotypes using 454 pyro-sequencing technology. Results Successfully, we developed the first set of 81,791 contigs with an average length of 514 bp for tulip, and enriched the very limited number of 3,329 available ESTs (Expressed Sequence Tags) for lily with 52,172 contigs with an average length of 555 bp. The contigs together with singletons covered on average 37% of lily and 39% of tulip estimated transcriptome. Mining lily and tulip sequence data for SSRs (Simple Sequence Repeats) showed that di-nucleotide repeats were twice more abundant in UTRs (UnTranslated Regions) compared to coding regions, while tri-nucleotide repeats were equally spread over coding and UTR regions. Two sets of single nucleotide polymorphism (SNP) markers suitable for high throughput genotyping were developed. In the first set, no SNPs flanking the target SNP (50 bp on either side) were allowed. In the second set, one SNP in the flanking regions was allowed, which resulted in a 2 to 3 fold increase in SNP marker numbers compared with the first set. Orthologous groups between the two flower bulbs: lily and tulip (12,017 groups) and among the three monocot species: lily, tulip, and rice (6,900 groups) were determined using OrthoMCL. Orthologous groups were screened for common SNP markers and EST-SSRs to study synteny between lily and tulip, which resulted in 113 common SNP markers and 292 common EST-SSR. Lily and tulip contigs generated were annotated and described according to Gene Ontology terminology. 
Conclusions Two transcriptome sets were built that are valuable resources for marker development, comparative genomic studies and candidate gene approaches. Next generation sequencing of leaf transcriptome is very effective; however, deeper sequencing and using more tissues and stages is advisable for extended comparative studies. PMID:23167289
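The two SNP marker sets described in the abstract above differ only in how many neighbouring SNPs are tolerated within 50 bp of the target SNP. The following is a minimal Python sketch of that selection rule, not the authors' pipeline. The function name select_snp_markers, its parameters, and the example positions are hypothetical, and the sketch assumes unique SNP positions on a single contig and ignores how close a SNP lies to the contig ends.

```python
from bisect import bisect_left, bisect_right


def select_snp_markers(positions, flank=50, max_flanking=0):
    """Keep SNPs that have at most `max_flanking` other SNPs within `flank` bp.

    `positions` are SNP coordinates on one contig (hypothetical input format).
    """
    ordered = sorted(positions)
    selected = []
    for pos in ordered:
        # Index range of all SNPs falling in [pos - flank, pos + flank].
        lo = bisect_left(ordered, pos - flank)
        hi = bisect_right(ordered, pos + flank)
        neighbours = hi - lo - 1  # subtract the target SNP itself
        if neighbours <= max_flanking:
            selected.append(pos)
    return selected


# Made-up SNP positions on one contig, for illustration only.
contig_snps = [120, 150, 400, 430, 445, 900]
strict = select_snp_markers(contig_snps, max_flanking=0)   # first set: no flanking SNPs
relaxed = select_snp_markers(contig_snps, max_flanking=1)  # second set: one flanking SNP allowed
print(strict, relaxed)
```

Relaxing max_flanking from 0 to 1 can only add markers, which is consistent with the larger second set reported in the abstract; the actual fold increase depends on the SNP density of the real data.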
Comparative high-throughput transcriptome sequencing and development of SiESTa, the Silene EST annotation database

PubMed Central

2011-01-01

Background The genus Silene is widely used as a model system for addressing ecological and evolutionary questions in plants, but advances in using the genus as a model system are impeded by the lack of available resources for studying its genome. Massively parallel sequencing cDNA has recently developed into an efficient method for characterizing the transcriptomes of non-model organisms, generating massive amounts of data that enable the study of multiple species in a comparative framework. The sequences generated provide an excellent resource for identifying expressed genes, characterizing functional variation and developing molecular markers, thereby laying the foundations for future studies on gene sequence and gene expression divergence. Here, we report the results of a comparative transcriptome sequencing study of eight individuals representing four Silene and one Dianthus species as outgroup. All sequences and annotations have been deposited in a newly developed and publicly available database called SiESTa, the Silene EST annotation database. Results A total of 1,041,122 EST reads were generated in two runs on a Roche GS-FLX 454 pyrosequencing platform. EST reads were analyzed separately for all eight individuals sequenced and were assembled into contigs using TGICL. These were annotated with results from BLASTX searches and Gene Ontology (GO) terms, and thousands of single-nucleotide polymorphisms (SNPs) were characterized. Unassembled reads were kept as singletons and together with the contigs contributed to the unigenes characterized in each individual. The high quality of unigenes is evidenced by the proportion (49%) that have significant hits in similarity searches with the A. thaliana proteome. The SiESTa database is accessible at http://www.siesta.ethz.ch. 
Conclusion The sequence collections established in the present study provide an important genomic resource for four Silene and one Dianthus species and will help to further develop Silene as a plant model system. The genes characterized will be useful for future research not only in the species included in the present study, but also in related species for which no genomic resources are yet available. Our results demonstrate the efficiency of massively parallel transcriptome sequencing in a comparative framework as an approach for developing genomic resources in diverse groups of non-model organisms. PMID:21791039
Massive Collection of Full-Length Complementary DNA Clones and Microarray Analyses: Keys to Rice Transcriptome Analysis

NASA Astrophysics Data System (ADS)

Kikuchi, Shoshi

2009-02-01

Completion of the high-precision genome sequence analysis of rice led to the collection of about 35,000 full-length cDNA clones and the determination of their complete sequences. Mapping of these full-length cDNA sequences has given us information on (1) the number of genes expressed in the rice genome; (2) the start and end positions and exon-intron structures of rice genes; (3) alternative transcripts; (4) possible encoded proteins; (5) non-protein-coding (np) RNAs; (6) the density of gene localization on the chromosome; (7) setting the parameters of gene prediction programs; and (8) the construction of a microarray system that monitors global gene expression. Manual curation for rice gene annotation by using mapping information on full-length cDNA and EST assemblies has revealed about 32,000 expressed genes in the rice genome. Analysis of major gene families, such as those encoding membrane transport proteins (pumps, ion channels, and secondary transporters), along with the evolution from bacteria to higher animals and plants, reveals how gene numbers have increased through adaptation to circumstances. Family-based gene annotation also gives us a new way of comparing organisms. Massive amounts of data on gene expression under many kinds of physiological conditions are being accumulated in rice oligoarrays (22K and 44K) based on full-length cDNA sequences. Cluster analyses of genes that have the same promoter cis-elements, that have similar expression profiles, or that encode enzymes in the same metabolic pathways or signal transduction cascades give us clues to understanding the networks of gene expression in rice. As a tool for that purpose, we recently developed "RiCES", a tool for searching for cis-elements in the promoter regions of clustered genes.
Comparative bioinformatics, temporal and spatial expression analyses of Ixodes scapularis organic anion transporting polypeptides

PubMed Central

Radulović, Željko; Porter, Lindsay M.; Kim, Tae K.; Mulenga, Albert

2015-01-01

Organic anion-transporting polypeptides (Oatps) are an integral part of the detoxification mechanism in vertebrates and invertebrates. These cell surface proteins are involved in mediating the sodium-independent uptake and/or distribution of a broad array of organic amphipathic compounds and xenobiotic drugs. This study describes bioinformatics and biological characterization of 9 Oatp sequences in the Ixodes scapularis genome. These sequences have been annotated on the basis of 12 transmembrane domains, consensus motif D-X-RW-(I,V)-GAWW-X-G-(F,L)-L, and 11 conserved cysteine amino acid residues in the large extracellular loop 5 that characterize the Oatp superfamily. Ixodes scapularis Oatps may regulate non-redundant cross-tick species conserved functions in that they did not cluster as a monolithic group on the phylogeny tree and that they have orthologs in other ticks. Phylogeny clustering patterns also suggest that some tick Oatp sequences transport substrates that are similar to those of body louse, mosquito, eye worm, and filarial worm Oatps. Semi-quantitative RT-PCR analysis demonstrated that all 9 I. scapularis Oatp sequences were expressed during tick feeding. Ixodes scapularis Oatp genes potentially regulate functions during early and/or late-stage tick feeding as revealed by normalized mRNA profiles. Normalized transcript abundance indicates that I. scapularis Oatp genes are strongly expressed in unfed ticks during the first 24 h of feeding and/or at the end of the tick feeding process. Except for 2 I. scapularis Oatps, which were expressed in the salivary glands and ovaries, all other genes were expressed in all tested organs, suggesting the significance of I. scapularis Oatps in maintaining tick homeostasis. Different I. scapularis Oatp mRNA expression patterns were detected and discussed with reference to different physiological states of unfed and feeding ticks. PMID:24582512
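The consensus motif quoted in the abstract above, D-X-RW-(I,V)-GAWW-X-G-(F,L)-L, can be expressed as a regular expression for scanning protein sequences. The sketch below is illustrative only: it assumes that X stands for any single residue and that parenthesised residues are alternatives at one position. The function name find_oatp_motif and the toy sequence are hypothetical and do not come from the study.

```python
import re

# Assumed reading of D-X-RW-(I,V)-GAWW-X-G-(F,L)-L:
# X = any one residue, (I,V) and (F,L) = alternatives at a single position.
OATP_MOTIF = re.compile(r"D.RW[IV]GAWW.G[FL]L")


def find_oatp_motif(protein):
    """Return (0-based start, matched text) for each motif hit in a protein sequence."""
    return [(m.start(), m.group()) for m in OATP_MOTIF.finditer(protein.upper())]


# Toy sequence invented for illustration; not a real Ixodes scapularis Oatp.
toy = "MKTDARWIGAWWSGFLAAAC"
hits = find_oatp_motif(toy)
print(hits)
```

A real annotation would combine such a motif scan with the other criteria named in the abstract (12 transmembrane domains and the conserved cysteines), which this sketch does not attempt.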
Peanut gene expression profiling in developing seeds at different reproduction stages during Aspergillus parasiticus infection

PubMed Central

Guo, Baozhu; Chen, Xiaoping; Dang, Phat; Scully, Brian T; Liang, Xuanqiang; Holbrook, C Corley; Yu, Jiujiang; Culbreath, Albert K

2008-01-01

Background Peanut (Arachis hypogaea L.) is an important crop economically and nutritionally, and is one of the most susceptible host crops to colonization of Aspergillus parasiticus and subsequent aflatoxin contamination. Knowledge from molecular genetic studies could help to devise strategies in alleviating this problem; however, few peanut DNA sequences are available in the public database. In order to understand the molecular basis of host resistance to aflatoxin contamination, a large-scale project was conducted to generate expressed sequence tags (ESTs) from developing seeds to identify resistance-related genes involved in defense response against Aspergillus infection and subsequent aflatoxin contamination. Results We constructed six different cDNA libraries derived from developing peanut seeds at three reproduction stages (R5, R6 and R7) from a resistant and a susceptible cultivated peanut genotypes, 'Tifrunner' (susceptible to Aspergillus infection with higher aflatoxin contamination and resistant to TSWV) and 'GT-C20' (resistant to Aspergillus with reduced aflatoxin contamination and susceptible to TSWV). The developing peanut seed tissues were challenged by A. parasiticus and drought stress in the field. A total of 24,192 randomly selected cDNA clones from six libraries were sequenced. After removing vector sequences and quality trimming, 21,777 high-quality EST sequences were generated. Sequence clustering and assembling resulted in 8,689 unique EST sequences with 1,741 tentative consensus EST sequences (TCs) and 6,948 singleton ESTs. Functional classification was performed according to MIPS functional catalogue criteria. The unique EST sequences were divided into twenty-two categories. A similarity search against the non-redundant protein database available from NCBI indicated that 84.78% of total ESTs showed significant similarity to known proteins, of which 165 genes had been previously reported in peanuts. 
There were differences in overall expression patterns in different libraries and genotypes. A number of sequences were expressed throughout all of the libraries, representing constitutive expressed sequences. In order to identify resistance-related genes with significantly differential expression, a statistical analysis to estimate the relative abundance (R) was used to compare the relative abundance of each gene transcripts in each cDNA library. Thirty six and forty seven unique EST sequences with threshold of R > 4 from libraries of 'GT-C20' and 'Tifrunner', respectively, were selected for examination of temporal gene expression patterns according to EST frequencies. Nine and eight resistance-related genes with significant up-regulation were obtained in 'GT-C20' and 'Tifrunner' libraries, respectively. Among them, three genes were common in both genotypes. Furthermore, a comparison of our EST sequences with other plant sequences in the TIGR Gene Indices libraries showed that the percentage of peanut EST matched to Arabidopsis thaliana, maize (Zea mays), Medicago truncatula, rapeseed (Brassica napus), rice (Oryza sativa), soybean (Glycine max) and wheat (Triticum aestivum) ESTs ranged from 33.84% to 79.46% with the sequence identity ≥ 80%. These results revealed that peanut ESTs are more closely related to legume species than to cereal crops, and more homologous to dicot than to monocot plant species. Conclusion The developed ESTs can be used to discover novel sequences or genes, to identify resistance-related genes and to detect the differences among alleles or markers between these resistant and susceptible peanut genotypes. Additionally, this large collection of cultivated peanut EST sequences will make it possible to construct microarrays for gene expression studies and for further characterization of host resistance mechanisms. It will be a valuable genomic resource for the peanut community. 
The 21,777 ESTs have been deposited to the NCBI GenBank database with accession numbers ES702769 to ES724546. PMID:18248674
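The abstract above ranks genes by a relative abundance statistic R across cDNA libraries and applies a threshold of R > 4, but it does not give the formula. The sketch below assumes a log-likelihood ratio statistic of the form described by Stekel and colleagues for comparing EST counts across libraries, R = sum over libraries of x_i * ln(x_i / (N_i * f)), with f the pooled frequency of the gene. Whether this is the exact statistic used in the study is an assumption, and the function name and example counts are hypothetical.

```python
import math


def r_statistic(counts, library_sizes):
    """Log-likelihood ratio statistic for one gene across several EST libraries.

    counts[i]        = ESTs of this gene in library i
    library_sizes[i] = total ESTs sequenced in library i
    Assumed form: R = sum_i x_i * ln(x_i / (N_i * f)), f = sum(x) / sum(N).
    """
    total_count = sum(counts)
    if total_count == 0:
        return 0.0
    pooled_freq = total_count / sum(library_sizes)
    r = 0.0
    for x, n in zip(counts, library_sizes):
        if x > 0:  # a zero count contributes nothing (x * ln x -> 0)
            r += x * math.log(x / (n * pooled_freq))
    return r


# Made-up counts in three equally sized libraries, for illustration only.
sizes = [1000, 1000, 1000]
uniform = r_statistic([10, 10, 10], sizes)  # evenly expressed gene
skewed = r_statistic([30, 0, 0], sizes)     # gene seen in one library only
print(uniform, skewed)
```

Under this form, a gene sampled evenly across libraries scores close to zero, while a gene concentrated in one library scores well above the R > 4 threshold mentioned in the abstract.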
Kaposi's Sarcoma-Associated Herpesvirus MicroRNA Single-Nucleotide Polymorphisms Identified in Clinical Samples Can Affect MicroRNA Processing, Level of Expression, and Silencing Activity

PubMed Central

Han, Soo-Jin; Marshall, Vickie; Barsov, Eugene; Quiñones, Octavio; Ray, Alex; Labo, Nazzarena; Trivett, Matthew; Ott, David; Renne, Rolf

2013-01-01

Kaposi's sarcoma-associated herpesvirus (KSHV) encodes 12 pre-microRNAs that can produce 25 KSHV mature microRNAs. We previously reported single-nucleotide polymorphisms (SNPs) in KSHV-encoded pre-microRNA and mature microRNA sequences from clinical samples (V. Marshall et al., J. Infect. Dis., 195:645–659, 2007). To determine whether microRNA SNPs affect pre-microRNA processing and, ultimately, mature microRNA expression levels, we performed a detailed comparative analysis of (i) mature microRNA expression levels, (ii) in vitro Drosha/Dicer processing, and (iii) RNA-induced silencing complex-dependent targeting of wild-type (wt) and variant microRNA genes. Expression of pairs of wt and variant pre-microRNAs from retroviral vectors and measurement of KSHV mature microRNA expression by real-time reverse transcription-PCR (RT-PCR) revealed differential expression levels that correlated with the presence of specific sequence polymorphisms. Measurement of KSHV mature microRNA expression in a panel of primary effusion lymphoma cell lines by real-time RT-PCR recapitulated some observed expression differences but suggested a more complex relationship between sequence differences and expression of mature microRNA. Furthermore, in vitro maturation assays demonstrated significant SNP-associated changes in Drosha/DGCR8 and/or Dicer processing. These data demonstrate that SNPs within KSHV-encoded pre-microRNAs are associated with differential microRNA expression levels. Given the multiple reports on the involvement of microRNAs in cancer, the biological significance of these phenotypic and genotypic variants merits further studies in patients with KSHV-associated malignancies. PMID:24006441
Identification of aberrantly expressed long non-coding RNAs in stomach adenocarcinoma.

PubMed

Gu, Jianbin; Li, Yong; Fan, Liqiao; Zhao, Qun; Tan, Bibo; Hua, Kelei; Wu, Guobin

2017-07-25

Stomach adenocarcinoma (STAD) is a common malignancy worldwide. This study aimed to identify the aberrantly expressed long non-coding RNAs (lncRNAs) in STAD. A total of 74 DElncRNAs and 449 DEmRNAs were identified in STAD compared with paired non-tumor tissues. The DElncRNA/DEmRNA co-expression network was constructed, which covered 519 nodes and 2993 edges. The qRT-PCR validation results of DElncRNAs were consistent with our bioinformatics analysis based on RNA-sequencing. The DEmRNAs co-expressed with DElncRNAs were significantly enriched in gastric acid secretion, complement and coagulation cascades, pancreatic secretion, cytokine-cytokine receptor interaction and Jak-STAT signaling pathway. The expression levels of the nine candidate DElncRNAs in the TCGA database were consistent with our RNA-sequencing data. FEZF1-AS1, HOTAIR and LINC01234 had potential diagnostic value for STAD. The lncRNA and mRNA expression profile of 3 STAD tissues and 3 matched adjacent non-tumor tissues was obtained through high-throughput RNA-sequencing. Differentially expressed lncRNAs/mRNAs (DElncRNAs/DEmRNAs) were identified in STAD. DElncRNA/DEmRNA co-expression network construction, Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) enrichment analyses were conducted to predict the biological functions of DElncRNAs. Quantitative real-time polymerase chain reaction (qRT-PCR) was used to validate the expression levels of DEmRNAs and DElncRNAs. Moreover, the expression of DElncRNAs was validated through The Cancer Genome Atlas (TCGA) database. The diagnostic value of candidate DElncRNAs was assessed by receiver operating characteristic (ROC) analysis. Our work might provide useful information for exploring the tumorigenesis mechanism of STAD and pave the road for identification of diagnostic biomarkers in STAD.
The Silkworm (Bombyx mori) microRNAs and Their Expressions in Multiple Developmental Stages

PubMed Central

Luo, Qibin; Cai, Yimei; Lin, Wen-chang; Chen, Huan; Yang, Yue; Hu, Songnian; Yu, Jun

2008-01-01

Background MicroRNAs (miRNAs) play crucial roles in various physiological processes through post-transcriptional regulation of gene expressions and are involved in development, metabolism, and many other important molecular mechanisms and cellular processes. The Bombyx mori genome sequence provides opportunities for a thorough survey for miRNAs as well as comparative analyses with other sequenced insect species. Methodology/Principal Findings We identified 114 non-redundant conserved miRNAs and 148 novel putative miRNAs from the B. mori genome with an elaborate computational protocol. We also sequenced 6,720 clones from 14 developmental stage-specific small RNA libraries in which we identified 35 unique miRNAs containing 21 conserved miRNAs (including 17 predicted miRNAs) and 14 novel miRNAs (including 11 predicted novel miRNAs). Among the 114 conserved miRNAs, we found six pairs of clusters evolutionarily conserved cross insect lineages. Our observations on length heterogeneity at 5′ and/or 3′ ends of nine miRNAs between cloned and predicted sequences, and three mature forms deriving from the same arm of putative pre-miRNAs suggest a mechanism by which miRNAs gain new functions. Analyzing development-related miRNAs expression at 14 developmental stages based on clone-sampling and stem-loop RT PCR, we discovered an unusual abundance of 33 sequences representing 12 different miRNAs and sharply fluctuated expression of miRNAs at larva-molting stage. The potential functions of several stage-biased miRNAs were also analyzed in combination with predicted target genes and silkworm's phenotypic traits; our results indicated that miRNAs may play key regulatory roles in specific developmental stages in the silkworm, such as ecdysis. Conclusions/Significance Taking a combined approach, we identified 118 conserved miRNAs and 151 novel miRNA candidates from the B. mori genome sequence. 
Our expression analyses by sampling miRNAs and real-time PCR over multiple developmental stages allowed us to pinpoint molting stages as hotspots of miRNA expression both in sorts and quantities. Based on the analysis of target genes, we hypothesized that miRNAs regulate development through a particular emphasis on complex stages rather than general regulatory mechanisms. PMID:18714353

Single-Cell Sequencing of the Healthy and Diseased Heart Reveals Ckap4 as a New Modulator of Fibroblasts Activation.

PubMed

Gladka, Monika M; Molenaar, Bas; de Ruiter, Hesther; van der Elst, Stefan; Tsui, Hoyee; Versteeg, Danielle; Lacraz, Grègory P A; Huibers, Manon M H; van Oudenaarden, Alexander; van Rooij, Eva

2018-01-31

Background Genome-wide transcriptome analysis has greatly advanced our understanding of the regulatory networks underlying basic cardiac biology and mechanisms driving disease. However, so far, the resolution of studying gene expression patterns in the adult heart has been limited to the level of extracts from whole tissues. The use of tissue homogenates inherently causes the loss of any information on cellular origin or cell type-specific changes in gene expression. Recent developments in RNA amplification strategies provide a unique opportunity to use small amounts of input RNA for genome-wide sequencing of single cells. Methods Here, we present a method to obtain high quality RNA from digested cardiac tissue from adult mice for automated single-cell sequencing of both the healthy and diseased heart. Results After optimization, we were able to perform single-cell sequencing on adult cardiac tissue under both homeostatic conditions and after ischemic injury. Clustering analysis based on differential gene expression unveiled known and novel markers of all main cardiac cell types. Based on differential gene expression we were also able to identify multiple subpopulations within a certain cell type. Furthermore, applying single-cell sequencing on both the healthy and the injured heart indicated the presence of disease-specific cell subpopulations. As such, we identified cytoskeleton associated protein 4 (Ckap4) as a novel marker for activated fibroblasts that positively correlates with known myofibroblast markers in both mouse and human cardiac tissue. Ckap4 inhibition in activated fibroblasts treated with TGFβ triggered a greater increase in the expression of genes related to activated fibroblasts compared to control, suggesting a role of Ckap4 in modulating fibroblast activation in the injured heart. 
Conclusions Single-cell sequencing on both the healthy and diseased adult heart allows us to study transcriptomic differences between cardiac cells, as well as cell type-specific changes in gene expression during cardiac disease. This new approach provides a wealth of novel insights into molecular changes that underlie the cellular processes relevant for cardiac biology and pathophysiology. Applying this technology could lead to the discovery of new therapeutic targets relevant for heart disease.
Cloning and expression of L-asparaginase gene in Escherichia coli.

PubMed

Wang, Y; Qian, S; Meng, G; Zhang, S

2001-08-01

The L-asparaginase (ASN) gene from Escherichia coli AS1.357 was cloned as a DNA fragment generated using polymerase chain reaction technology and primers derived from conserved regions of published ASN gene sequences. Recombinant plasmid pASN, containing the ASN gene and the expression vector pBV220, was transformed into different E. coli host strains. The activity and expression level of ASN in the engineered strains could reach 228 IU/mL of culture fluid and about 50% of the total soluble cell protein, respectively, more than 40-fold the enzyme activity of the wild strain. The recombinant plasmid in E. coli AS1.357 remained stable after 72 h of cultivation and 5 h of heat induction without selective pressure. The ASN gene of E. coli AS1.357 was sequenced and showed high homology with the reported sequence data.
Molecular characterization, mRNA expression of prolactin receptor (PRLR) gene during pregnancy, nonpregnancy in the yak (Bos grunniens).

PubMed

Zi, Xiang-Dong; Chen, Da-Wen; Wang, Hong-Mei

2012-02-01

Prolactin (PRL) plays central roles in a wide range of body functions in mammals, and its actions are mediated by a specific cell surface receptor, the prolactin receptor (PRLR). To better understand the role of PRL in the yak (Bos grunniens), in the present study we first cloned yak PRLR cDNA and compared its mRNA expression in several tissues with that in cattle (Bos taurus). Using a reverse transcriptase-polymerase chain reaction (RT-PCR) strategy, we obtained the full-length yak PRLR cDNA sequence, which comprised an open reading frame of 1746 bp encoding a 581 amino acid protein and contained a signal sequence and a transmembrane region. The intracellular domain had two pairs of cysteine residues and a WSXWS motif. The cytoplasmic domain comprised 323 residues and contained a box 1 sequence. The yak PRLR shared 66.0-98.5% protein sequence identity with mammalian homologs. Real-time PCR analysis revealed that PRLR mRNA expression was higher in mammary tissue than in ovary and endometrium (P<0.01). During pregnancy, the ovary and mammary PRLR mRNA expression increased by 33- and 2.9-fold in yak, respectively, and increased by 46- and 3.8-fold in cattle, respectively. PRLR mRNA expression was higher (P<0.05) in mammary tissue and ovary of pregnant cow than that of pregnant yak. It is proposed that the increased ovarian and mammary PRLR mRNA expression during pregnancy may be associated with corpus luteum function for maintenance of pregnancy and mammary development for subsequent lactation. Copyright © 2011 Elsevier Inc. All rights reserved.
Streptomyces griseus streptomycin phosphotransferase: expression of its gene in Escherichia coli and sequence homology with other antibiotic phosphotransferases and with eukaryotic protein kinases.

PubMed

Lim, C K; Smith, M C; Petty, J; Baumberg, S; Wootton, J C

1989-12-01

The aphD gene of Streptomyces griseus, encoding a streptomycin 6-phosphotransferase (SPH), was sub-cloned in the pBR322-based expression vector pRK9 (which contains the Serratia marcescens trp promoter) with selection for expression of streptomycin resistance in Escherichia coli. Two hybrid plasmids, pCKL631 and pCKL711, were isolated which conferred resistance. Both contained an approximately 2 kbp fragment already suspected to include aphD. The properties of in vitro deletion derivatives of these plasmids were consistent with the presumed location of aphD. In vitro deletion of a sequence including most of the trp promoter largely, but not quite completely, abolished the ability of the plasmid to confer streptomycin resistance, confirming that expression was indeed principally from the trp promoter. A polypeptide of approximately 34.5 kDa was present in minicells containing plasmids that conferred streptomycin resistance, but was absent when the plasmids contained in vitro deletions removing streptomycin resistance. Part of the fragment was sequenced and an open reading frame corresponding to aphD identified. A computer-assisted comparison of the deduced SPH sequence with those of other antibiotic phosphotransferases suggested a common structure A-B-C-D-E, where B and D were conserved between all sequences compared while A, C and E divided between the streptomycin and hygromycin B phosphotransferases on one hand and kanamycin/neomycin ones on the other. A composite sequence data base was searched for homologues to consensus matrices constructed from five approximately 12-residue subsequences within blocks B and D. For one subsequence, corresponding to the N-terminal portion of block D, those sequences from the database that yielded the highest homology scores comprised almost entirely either antibiotic phosphotransferases or eukaryotic protein kinases. Possible evolutionary implications of this homology, previously described by other groups, are discussed.
PARRoT- a homology-based strategy to quantify and compare RNA-sequencing from non-model organisms.

PubMed

Gan, Ruei-Chi; Chen, Ting-Wen; Wu, Timothy H; Huang, Po-Jung; Lee, Chi-Ching; Yeh, Yuan-Ming; Chiu, Cheng-Hsun; Huang, Hsien-Da; Tang, Petrus

2016-12-22

Next-generation sequencing promises the de novo genomic and transcriptomic analysis of samples of interest. However, there are only a few organisms having reference genomic sequences and even fewer having well-defined or curated annotations. For transcriptome studies focusing on organisms lacking proper reference genomes, the common strategy is de novo assembly followed by functional annotation. However, things become even more complicated when multiple transcriptomes are compared. Here, we propose a new analysis strategy and quantification methods for quantifying expression level which not only generate a virtual reference from sequencing data, but also provide comparisons between transcriptomes. First, all reads from the transcriptome datasets are pooled together for de novo assembly. The assembled contigs are searched against NCBI NR databases to find potential homolog sequences. Based on the search results, a set of virtual transcripts is generated and serves as a reference transcriptome. By using the same reference, normalized quantification values including RC (read counts), eRPKM (estimated RPKM) and eTPM (estimated TPM) can be obtained that are comparable across transcriptome datasets. In order to demonstrate the feasibility of our strategy, we implement it in the web service PARRoT. PARRoT stands for Pipeline for Analyzing RNA Reads of Transcriptomes. It analyzes gene expression profiles for two transcriptome sequencing datasets. For better understanding of the biological meaning from the comparison among transcriptomes, PARRoT further provides linkage between these virtual transcripts and their potential function through showing best hits in SwissProt, NR database, assigning GO terms. Our demo datasets showed that PARRoT can analyze two paired-end transcriptomic datasets of approximately 100 million reads within just three hours. 
In this study, we proposed and implemented a strategy to analyze transcriptomes from non-reference organisms which offers the opportunity to quantify and compare transcriptome profiles through a homolog-based virtual transcriptome reference. By using the homolog-based reference, our strategy effectively avoids the problems that may arise from inconsistencies among transcriptomes. This strategy will shed light on the field of comparative genomics for non-model organisms. We have implemented PARRoT as a web service which is freely available at http://parrot.cgu.edu.tw.
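The normalized values named in the PARRoT abstract above (RC, eRPKM, eTPM) build on the standard RPKM and TPM definitions. The sketch below shows only those textbook formulas applied to read counts against a shared reference; it is not PARRoT's own estimation procedure, which the abstract does not spell out. The function names and example numbers are hypothetical, and the sketch assumes positive transcript lengths and at least one mapped read.

```python
def rpkm(counts, lengths):
    """Reads per kilobase of transcript per million mapped reads (standard definition)."""
    total_reads = sum(counts)
    return [c * 1e9 / (length * total_reads) for c, length in zip(counts, lengths)]


def tpm(counts, lengths):
    """Transcripts per million: length-normalised rates rescaled to sum to 1e6."""
    rates = [c / length for c, length in zip(counts, lengths)]
    scale = sum(rates)
    return [r / scale * 1e6 for r in rates]


# Made-up read counts and transcript lengths (bp) for three virtual transcripts.
counts = [100, 200, 700]
lengths = [1000, 2000, 1000]
rpkm_values = rpkm(counts, lengths)
tpm_values = tpm(counts, lengths)
print(rpkm_values)
print(tpm_values)
```

Because TPM values always sum to one million within a dataset, they are often the more convenient of the two when profiles from different datasets are compared against the same reference, which is the use case the abstract describes.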
Comparative Analysis of V-Akt Murine Thymoma Viral Oncogene Homolog 3 (AKT3) Gene between Cow and Buffalo Reveals Substantial Differences for Mastitis.

PubMed

Ullah, Farman; Bhattarai, Dinesh; Cheng, Zhangrui; Liang, Xianwei; Deng, Tingxian; Rehman, Zia Ur; Talpur, Hira Sajjad; Worku, Tesfaye; Brohi, Rahim Dad; Safdar, Muhammad; Ahmad, Muhammad Jamil; Salim, Mohammad; Khan, Momen; Ahmad, Hafiz Ishfaq; Zhang, Shujun

2018-01-01

The AKT3 gene is a member of the serine/threonine protein kinase family and plays a crucial role in synthesis of milk fats and cholesterol by regulating activity of the sterol regulatory element binding protein (SREBP). AKT3 is highly conserved in mammals and its expression levels during the lactation periods of cattle are markedly increased. AKT3 is highly expressed in the intestine followed by mammary gland and it is also expressed in immune cells. It is involved in the TLR pathways as effectively as proinflammatory cytokines. The aim of this study was to investigate the sequence differences between buffalo and cow. Our results showed that there were substantial differences between buffalo and cow in some exons and noteworthy differences of the gene size in different regions. We also identified the important consensus sequence motifs, variation in 2000 upstream of ATG, substantial difference in the "3'UTR" region, and miRNA association in the buffalo sequences compared with the cow. In addition, genetic analyses, such as gene structure, phylogenetic tree, position of different motifs, and functional domains, were performed to establish their correlation with other species. This may indicate that a buffalo breed has potential resistance to disease, environment changes, and airborne microorganisms and some good production and reproductive traits.
Gene Structures, Evolution and Transcriptional Profiling of the WRKY Gene Family in Castor Bean (Ricinus communis L.).

PubMed

Zou, Zhi; Yang, Lifu; Wang, Danhua; Huang, Qixing; Mo, Yeyong; Xie, Guishui

2016-01-01

WRKY proteins comprise one of the largest transcription factor families in plants and form key regulators of many plant processes. This study presents the characterization of 58 WRKY genes from the castor bean (Ricinus communis L., Euphorbiaceae) genome. Compared with the automatic genome annotation, one more WRKY-encoding locus was identified and 20 out of the 57 predicted gene models were manually corrected. All RcWRKY genes were shown to contain at least one intron in their coding sequences. According to the structural features of the present WRKY domains, the identified RcWRKY genes were assigned to three previously defined groups (I-III). Although castor bean underwent no recent whole-genome duplication event like physic nut (Jatropha curcas L., Euphorbiaceae), comparative genomics analysis indicated that one gene loss, one intron loss and one recent proximal duplication occurred in the RcWRKY gene family. The expression of all 58 RcWRKY genes was supported by ESTs and/or RNA sequencing reads derived from roots, leaves, flowers, seeds and endosperms. Further global expression profiles with RNA sequencing data revealed diverse expression patterns among various tissues. Results obtained from this study not only provide valuable information for future functional analysis and utilization of the castor bean WRKY genes, but also provide a useful reference to investigate the gene family expansion and evolution in Euphorbiaceus plants.
Comparative Analysis of V-Akt Murine Thymoma Viral Oncogene Homolog 3 (AKT3) Gene between Cow and Buffalo Reveals Substantial Differences for Mastitis

PubMed Central

Bhattarai, Dinesh; Cheng, Zhangrui; Liang, Xianwei; Deng, Tingxian; Rehman, Zia Ur; Talpur, Hira Sajjad; Worku, Tesfaye; Brohi, Rahim Dad; Safdar, Muhammad; Ahmad, Muhammad Jamil; Salim, Mohammad; Khan, Momen; Ahmad, Hafiz Ishfaq

2018-01-01

The AKT3 gene is a member of the serine/threonine protein kinase family and plays a crucial role in synthesis of milk fats and cholesterol by regulating activity of the sterol regulatory element binding protein (SREBP). AKT3 is highly conserved in mammals and its expression levels during the lactation periods of cattle are markedly increased. AKT3 is highly expressed in the intestine followed by mammary gland and it is also expressed in immune cells. It is involved in the TLR pathways as effectively as proinflammatory cytokines. The aim of this study was to investigate the sequence differences between buffalo and cow. Our results showed that there were substantial differences between buffalo and cow in some exons and noteworthy differences of the gene size in different regions. We also identified the important consensus sequence motifs, variation in 2000 upstream of ATG, substantial difference in the “3′UTR” region, and miRNA association in the buffalo sequences compared with the cow. In addition, genetic analyses, such as gene structure, phylogenetic tree, position of different motifs, and functional domains, were performed to establish their correlation with other species. This may indicate that a buffalo breed has potential resistance to disease, environment changes, and airborne microorganisms and some good production and reproductive traits. PMID:29862252
Functional characterization of recombinant bromelain of Ananas comosus expressed in a prokaryotic system.

PubMed

George, Susan; Bhasker, Salini; Madhav, Harish; Nair, Archana; Chinnamma, Mohankumar

2014-02-01

Bromelain (BRM) is a defense protein present in the fruit and stem of pineapple (Ananas comosus) and is classified as a cysteine protease with diverse medicinal uses. Because of its therapeutic applications, bromelain has received considerable attention in the pharmaceutical industry. In the present study, the full coding gene of bromelain in pineapple stem (1,093 bp) was amplified by RT-PCR. The PCR product was cloned, sequenced, and characterized. The sequence analysis of the gene revealed the single nucleotide polymorphism and its phylogenetic relatedness. The peptide sequence deduced from the gene showed the amino acid variations, physicochemical properties and secondary and tertiary structural features of the protein. The full BRM gene was cloned into the prokaryotic vector pET32b and successfully expressed in Escherichia coli BL21 DE3pLysS host cells. The identity of the recombinant bromelain (rBRM) protein was confirmed by Western blot analysis using anti-BRM-rabbit IgG antibody. The activity of recombinant bromelain compared with purified native bromelain was determined by protease assay. The inhibitory effect of rBRM compared with native BRM in the growth of Gram-positive and Gram-negative strains of Streptococcus agalactiae and Escherichia coli O111 was evident from the antibacterial sensitivity test. To the best of our knowledge, this is the first report showing the bactericidal property of rBRM expressed in a prokaryotic system.
Computational identification of developmental enhancers: conservation and function of transcription factor binding-site clusters in Drosophila melanogaster and Drosophila pseudoobscura

DOE Office of Scientific and Technical Information (OSTI.GOV)

Berman, Benjamin P.; Pfeiffer, Barret D.; Laverty, Todd R.

2004-08-06

Background The identification of sequences that control transcription in metazoans is a major goal of genome analysis. In a previous study, we demonstrated that searching for clusters of predicted transcription factor binding sites could discover active regulatory sequences, and identified 37 regions of the Drosophila melanogaster genome with high densities of predicted binding sites for five transcription factors involved in anterior-posterior embryonic patterning. Nine of these clusters overlapped known enhancers. Here, we report the results of in vivo functional analysis of 27 remaining clusters. Results We generated transgenic flies carrying each cluster attached to a basal promoter and reporter gene, and assayed embryos for reporter gene expression. Six clusters are enhancers of adjacent genes: giant, fushi tarazu, odd-skipped, nubbin, squeeze and pdm2; three drive expression in patterns unrelated to those of neighboring genes; the remaining 18 do not appear to have enhancer activity. We used the Drosophila pseudoobscura genome to compare patterns of evolution in and around the 15 positive and 18 false-positive predictions. Although conservation of primary sequence cannot distinguish true from false positives, conservation of binding-site clustering accurately discriminates functional binding-site clusters from those with no function. We incorporated conservation of binding-site clustering into a new genome-wide enhancer screen, and predict several hundred new regulatory sequences, including 85 adjacent to genes with embryonic patterns. Conclusions Measuring conservation of sequence features closely linked to function - such as binding-site clustering - makes better use of comparative sequence data than commonly used methods that examine only sequence identity.
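The idea of searching for regions with a high density of predicted binding sites can be sketched as a sliding-window scan. This is only an illustration of the clustering step, not the authors' pipeline: the window size, the site threshold, and the site coordinates are hypothetical, and the conservation comparison with D. pseudoobscura is not modelled.

```python
def find_clusters(site_positions, window, min_sites):
    """Return merged (start, end) intervals in which at least `min_sites`
    predicted binding sites fall within a span shorter than `window` bp."""
    positions = sorted(site_positions)
    merged = []
    left = 0
    for right, pos in enumerate(positions):
        # Shrink the window from the left until it spans less than `window` bp.
        while pos - positions[left] >= window:
            left += 1
        if right - left + 1 >= min_sites:
            start, end = positions[left], pos
            if merged and start <= merged[-1][1]:
                # Overlaps the previous dense interval: extend it.
                merged[-1] = (merged[-1][0], max(merged[-1][1], end))
            else:
                merged.append((start, end))
    return merged


# Hypothetical coordinates of predicted sites on one chromosome arm.
sites = [10, 50, 90, 130, 5000, 9000, 9040, 9080]
print(find_clusters(sites, window=200, min_sites=3))
```

A conservation test in the spirit of the abstract would then ask whether the orthologous region in the second genome also passes the same density threshold, rather than comparing raw sequence identity.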
Highly sensitive luciferase reporter assay using a potent destabilization sequence of calpain 3.

PubMed

Yasunaga, Mayu; Murotomi, Kazutoshi; Abe, Hiroko; Yamazaki, Tomomi; Nishii, Shigeaki; Ohbayashi, Tetsuya; Oshimura, Mitsuo; Noguchi, Takako; Niwa, Kazuki; Ohmiya, Yoshihiro; Nakajima, Yoshihiro

2015-01-20

Reporter assays that use luciferases are widely employed for monitoring cellular events associated with gene expression in vitro and in vivo. To improve the response of the luciferase reporter to acute changes of gene expression, a destabilization sequence is frequently used to reduce the stability of luciferase protein in the cells, which results in an increase of sensitivity of the luciferase reporter assay. In this study, we identified a potent destabilization sequence (referred to as the C9 fragment) consisting of 42 amino acid residues from human calpain 3 (CAPN3). Whereas the half-life of Emerald Luc (ELuc) from the Brazilian click beetle Pyrearinus termitilluminans was reduced by fusing PEST (t1/2=9.8 to 2.8h), the half-life of C9-fused ELuc was significantly shorter (t1/2=1.0h) than that of PEST-fused ELuc when measurements were conducted at 37°C. In addition, firefly luciferase (luc2) was also markedly destabilized by the C9 fragment compared with the humanized PEST sequence. These results indicate that the C9 fragment from CAPN3 is a much more potent destabilization sequence than the PEST sequence. Furthermore, real-time bioluminescence recording of the activation kinetics of nuclear factor-κB after transient treatment with tumor necrosis factor α revealed that the response of C9-fused ELuc is significantly greater than that of PEST-fused ELuc, demonstrating that the use of the C9 fragment realizes a luciferase reporter assay that has faster response speed compared with that provided by the PEST sequence. Copyright © 2014 Elsevier B.V. All rights reserved.
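To see why a shorter half-life gives a faster-responding reporter, the reported half-lives can be plugged into a simple decay calculation. This sketch assumes single-exponential (first-order) decay after synthesis stops, which the abstract does not state; the 3 h time point is arbitrary.

```python
def fraction_remaining(hours, half_life_hours):
    """Fraction of protein left after `hours`, assuming first-order decay."""
    return 0.5 ** (hours / half_life_hours)


# Half-lives at 37 °C as reported in the abstract (hours).
half_lives = {"ELuc": 9.8, "PEST-ELuc": 2.8, "C9-ELuc": 1.0}

for name, t_half in half_lives.items():
    # Signal carried over 3 h after expression is switched off.
    print(name, round(fraction_remaining(3.0, t_half), 3))
```

Under this assumption the C9 fusion retains far less residual signal than the PEST fusion after the same interval, which is consistent with the sharper response described above.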
Xuhuai goat H-FABP gene clone, subcellular localization of expression products and the preparation of transgenic mice.

PubMed

Yin, Yan-hui; Li, Bi-chun; Wei, Guang-hui; Zhu, Cai-ye; Li, Wei; Zhang, Ya-ni; Du, Li-xin; Cao, Wen-guang

2012-05-01

The aim of this study was to clone the heart-type fatty acid binding protein (H-FABP) gene of Xuhuai goat, to explore it bioinformatically, and to analyze its subcellular localization using enhanced green fluorescent protein (EGFP). The results showed that the coding sequence (CDS) length of Xuhuai goat H-FABP gene was 402 bp, encoding 133 amino acids (GenBank accession number AY466498.1). The H-FABP cDNA coding sequence was compared with the corresponding region of human, chicken, brown rat, cow, wild boar, donkey, and zebrafish. The similarities were 89%, 76%, 85%, 84%, 93%, 91%, 70%, respectively. For the corresponding amino acid sequences, the similarities were 90%, 79%, 88%, 97%, 95%, 94%, 72%, respectively. This study did not find a signal peptide region in the H-FABP protein, which suggests that H-FABP might be a nonsecreted protein. H-FABP expression was detected in vitro by reverse transcription-polymerase chain reaction (RT-PCR), and the EGFP-H-FABP fusion protein was localized to the cytoplasm. The gene could also be transiently and permanently expressed in mice.
Informatic selection of a neural crest-melanocyte cDNA set for microarray analysis

PubMed Central

Loftus, S. K.; Chen, Y.; Gooden, G.; Ryan, J. F.; Birznieks, G.; Hilliard, M.; Baxevanis, A. D.; Bittner, M.; Meltzer, P.; Trent, J.; Pavan, W.

1999-01-01

With cDNA microarrays, it is now possible to compare the expression of many genes simultaneously. To maximize the likelihood of finding genes whose expression is altered under the experimental conditions, it would be advantageous to be able to select clones for tissue-appropriate cDNA sets. We have taken advantage of the extensive sequence information in the dbEST expressed sequence tag (EST) database to identify a neural crest-derived melanocyte cDNA set for microarray analysis. Analysis of characterized genes with dbEST identified one library that contained ESTs representing 21 neural crest-expressed genes (library 198). The distribution of the ESTs corresponding to these genes was biased toward being derived from library 198. This is in contrast to the EST distribution profile for a set of control genes, characterized to be more ubiquitously expressed in multiple tissues (P < 1 × 10⁻⁹). From library 198, a subset of 852 clustered ESTs was selected that has a library distribution profile similar to that of the 21 neural crest-expressed genes. Microarray analysis demonstrated the majority of the neural crest-selected 852 ESTs (Mel1 array) were differentially expressed in melanoma cell lines compared with a non-neural crest kidney epithelial cell line (P < 1 × 10⁻⁸). This was not observed with an array of 1,238 ESTs that was selected without library origin bias (P = 0.204). This study presents an approach for selecting tissue-appropriate cDNAs that can be used to examine the expression profiles of developmental processes and diseases. PMID:10430933
Optimal Scaling of Digital Transcriptomes

PubMed Central

Glusman, Gustavo; Caballero, Juan; Robinson, Max; Kutlu, Burak; Hood, Leroy

2013-01-01

Deep sequencing of transcriptomes has become an indispensable tool for biology, enabling expression levels for thousands of genes to be compared across multiple samples. Since transcript counts scale with sequencing depth, counts from different samples must be normalized to a common scale prior to comparison. We analyzed fifteen existing and novel algorithms for normalizing transcript counts, and evaluated the effectiveness of the resulting normalizations. For this purpose we defined two novel and mutually independent metrics: (1) the number of “uniform” genes (genes whose normalized expression levels have a sufficiently low coefficient of variation), and (2) low Spearman correlation between normalized expression profiles of gene pairs. We also define four novel algorithms, one of which explicitly maximizes the number of uniform genes, and compared the performance of all fifteen algorithms. The two most commonly used methods (scaling to a fixed total value, or equalizing the expression of certain ‘housekeeping’ genes) yielded particularly poor results, surpassed even by normalization based on randomly selected gene sets. Conversely, seven of the algorithms approached what appears to be optimal normalization. Three of these algorithms rely on the identification of “ubiquitous” genes: genes expressed in all the samples studied, but never at very high or very low levels. We demonstrate that these include a “core” of genes expressed in many tissues in a mutually consistent pattern, which is suitable for use as an internal normalization guide. The new methods yield robustly normalized expression values, which is a prerequisite for the identification of differentially expressed and tissue-specific genes as potential biomarkers. PMID:24223126
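The first evaluation metric described above, the number of "uniform" genes with a low coefficient of variation after normalization, can be sketched as follows. The example applies the simplest normalization mentioned in the abstract (scaling to a fixed total); the CV threshold of 0.25, the gene names, and the counts are hypothetical, and the paper's own algorithms are not reproduced here.

```python
from statistics import mean, pstdev


def scale_to_total(sample, target=1_000_000):
    """Normalise a {gene: count} sample so its counts sum to `target`."""
    total = sum(sample.values())
    return {g: c * target / total for g, c in sample.items()}


def uniform_genes(samples, max_cv=0.25):
    """Genes whose normalised levels have a coefficient of variation <= max_cv."""
    shared = set.intersection(*(set(s) for s in samples))
    uniform = []
    for gene in sorted(shared):
        values = [s[gene] for s in samples]
        m = mean(values)
        if m > 0 and pstdev(values) / m <= max_cv:
            uniform.append(gene)
    return uniform


# Hypothetical raw counts from three samples sequenced at different depths.
raw = [
    {"g1": 10, "g2": 30, "g3": 60},
    {"g1": 20, "g2": 60, "g3": 120},
    {"g1": 30, "g2": 30, "g3": 240},
]
normalised = [scale_to_total(s) for s in raw]
print(uniform_genes(normalised))
```

Comparing this count across normalization methods gives one way to rank them: a method that leaves more genes uniform is, by this metric, the better normalization.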
Clonal Relatedness of Enterotoxigenic Escherichia coli (ETEC) Strains Expressing LT and CS17 Isolated from Children with Diarrhoea in La Paz, Bolivia

PubMed Central

Rodas, Claudia; Klena, John D.; Nicklasson, Matilda; Iniguez, Volga; Sjöling, Åsa

2011-01-01

Background Enterotoxigenic Escherichia coli (ETEC) is a major cause of traveller's and infantile diarrhoea in the developing world. ETEC produces two toxins, a heat-stable toxin (known as ST) and a heat-labile toxin (LT) and colonization factors that help the bacteria to attach to epithelial cells. Methodology/Principal Findings In this study, we characterized a subset of ETEC clinical isolates recovered from Bolivian children under 5 years of age using a combination of multilocus sequence typing (MLST) analysis, virulence typing, serotyping and antimicrobial resistance test patterns in order to determine the genetic background of ETEC strains circulating in Bolivia. We found that strains expressing the heat-labile (LT) enterotoxin and colonization factor CS17 were common and belonged to several MLST sequence types but mainly to sequence type-423 and sequence type-443 (Achtman scheme). To further study the LT/CS17 strains we analysed the nucleotide sequence of the CS17 operon and compared the structure to LT/CS17 ETEC isolates from Bangladesh. Sequence analysis confirmed that all sequence type-423 strains from Bolivia had a single nucleotide polymorphism; SNPbol in the CS17 operon that was also found in some other MLST sequence types from Bolivia but not in strains recovered from Bangladeshi children. The dominant ETEC clone in Bolivia (sequence type-423/SNPbol) was found to persist over multiple years and was associated with severe diarrhoea but these strains were variable with respect to antimicrobial resistance patterns. Conclusion/Significance The results showed that although the LT/CS17 phenotype is common among ETEC strains in Bolivia, multiple clones, as determined by unique MLST sequence types, populate this phenotype. Our data also appear to suggest that acquisition and loss of antimicrobial resistance in LT-expressing CS17 ETEC clones is more dynamic than acquisition or loss of virulence factors. PMID:22140423
Clonal relatedness of enterotoxigenic Escherichia coli (ETEC) strains expressing LT and CS17 isolated from children with diarrhoea in La Paz, Bolivia.

PubMed

Rodas, Claudia; Klena, John D; Nicklasson, Matilda; Iniguez, Volga; Sjöling, Åsa

2011-01-01

Enterotoxigenic Escherichia coli (ETEC) is a major cause of traveller's and infantile diarrhoea in the developing world. ETEC produces two toxins, a heat-stable toxin (known as ST) and a heat-labile toxin (LT) and colonization factors that help the bacteria to attach to epithelial cells. In this study, we characterized a subset of ETEC clinical isolates recovered from Bolivian children under 5 years of age using a combination of multilocus sequence typing (MLST) analysis, virulence typing, serotyping and antimicrobial resistance test patterns in order to determine the genetic background of ETEC strains circulating in Bolivia. We found that strains expressing the heat-labile (LT) enterotoxin and colonization factor CS17 were common and belonged to several MLST sequence types but mainly to sequence type-423 and sequence type-443 (Achtman scheme). To further study the LT/CS17 strains we analysed the nucleotide sequence of the CS17 operon and compared the structure to LT/CS17 ETEC isolates from Bangladesh. Sequence analysis confirmed that all sequence type-423 strains from Bolivia had a single nucleotide polymorphism; SNP(bol) in the CS17 operon that was also found in some other MLST sequence types from Bolivia but not in strains recovered from Bangladeshi children. The dominant ETEC clone in Bolivia (sequence type-423/SNP(bol)) was found to persist over multiple years and was associated with severe diarrhoea but these strains were variable with respect to antimicrobial resistance patterns. The results showed that although the LT/CS17 phenotype is common among ETEC strains in Bolivia, multiple clones, as determined by unique MLST sequence types, populate this phenotype. Our data also appear to suggest that acquisition and loss of antimicrobial resistance in LT-expressing CS17 ETEC clones is more dynamic than acquisition or loss of virulence factors.
Cloning, expression and phylogenetic analysis of Hemolin, from the Chinese oak silkmoth, Antheraea pernyi.

PubMed

Li, Wenli; Terenius, Olle; Hirai, Makoto; Nilsson, Anders S; Faye, Ingrid

2005-01-01

The Chinese oak silk moth Antheraea pernyi is an important silk producer. To understand microbial resistance of this moth, we cloned Hemolin, encoding a multifunctional immune protein belonging to the immunoglobulin superfamily, and examined the expression in gonads and fat body. The ApHemolin amino acid sequence was compared to other Hemolin sequences in order to predict functional sites. Several sites were conserved; among them a phosphate binding site, which according to 3D structure modelling does not appear in neuroglian, the phylogenetically closest related protein. In addition, two conserved KDG sequences in the C-C' loop of immunoglobulin domains 1 and 3, give rise to gamma-turns, which is a common motif in the C'-C'' loop of the hypervariable region L2 in vertebrate immunoglobulins. The comparisons also show variable regions of specific interest for future studies of hemolin and its interaction with microbial entities.
Transcriptome sequencing and annotation of the halophytic microalga Dunaliella salina

PubMed Central

Hong, Ling; Liu, Jun-li; Midoun, Samira Z.; Miller, Philip C.

2017-01-01

The unicellular green alga Dunaliella salina is well adapted to salt stress and contains compounds (including β-carotene and vitamins) with potential commercial value. A large transcriptome database of D. salina during the adjustment, exponential and stationary growth phases was generated using a high throughput sequencing platform. We characterized the metabolic processes in D. salina with a focus on valuable metabolites, with the aim of manipulating D. salina to achieve greater economic value in large-scale production through a bioengineering strategy. Gene expression profiles under salt stress verified using quantitative polymerase chain reaction (qPCR) implied that salt can regulate the expression of key genes. This study generated a substantial fraction of D. salina transcriptional sequences for the entire growth cycle, providing a basis for the discovery of novel genes. This first full-scale transcriptome study of D. salina establishes a foundation for further comparative genomic studies. PMID:28990374
Finding consistent patterns: A nonparametric approach for identifying differential expression in RNA-Seq data

PubMed Central

Li, Jun; Tibshirani, Robert

2015-01-01

We discuss the identification of features that are associated with an outcome in RNA-Sequencing (RNA-Seq) and other sequencing-based comparative genomic experiments. RNA-Seq data takes the form of counts, so models based on the normal distribution are generally unsuitable. The problem is especially challenging because different sequencing experiments may generate quite different total numbers of reads, or ‘sequencing depths’. Existing methods for this problem are based on Poisson or negative binomial models: they are useful but can be heavily influenced by ‘outliers’ in the data. We introduce a simple, nonparametric method with resampling to account for the different sequencing depths. The new method is more robust than parametric methods. It can be applied to data with quantitative, survival, two-class or multiple-class outcomes. We compare our proposed method to Poisson and negative binomial-based methods in simulated and real data sets, and find that our method discovers more consistent patterns than competing methods. PMID:22127579
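The general idea of a rank-based test combined with resampling to equalize sequencing depth can be sketched as below. This is an illustration only: the abstract does not specify the resampling scheme or the statistic, so subsampling reads without replacement and the Wilcoxon rank-sum statistic are assumptions made here, and all counts are hypothetical.

```python
import random
from collections import Counter


def downsample(sample, depth, rng):
    """Draw `depth` reads without replacement from a {gene: count} sample."""
    reads = [gene for gene, count in sample.items() for _ in range(count)]
    return Counter(rng.sample(reads, depth))


def rank_sum(values_a, values_b):
    """Wilcoxon rank-sum statistic for group A, with average ranks for ties."""
    pooled = sorted(values_a + values_b)

    def avg_rank(v):
        first = pooled.index(v) + 1
        last = first + pooled.count(v) - 1
        return (first + last) / 2

    return sum(avg_rank(v) for v in values_a)


def resampled_rank_sum(group_a, group_b, gene, n_resamples=20, seed=0):
    """Average the rank-sum statistic over repeated equal-depth resamplings."""
    rng = random.Random(seed)
    depth = min(sum(s.values()) for s in group_a + group_b)
    stats = []
    for _ in range(n_resamples):
        a = [downsample(s, depth, rng)[gene] for s in group_a]
        b = [downsample(s, depth, rng)[gene] for s in group_b]
        stats.append(rank_sum(a, b))
    return sum(stats) / len(stats)


# Hypothetical two-class experiment with unequal sequencing depths.
group_a = [{"x": 0, "y": 100}, {"x": 0, "y": 200}]
group_b = [{"x": 80, "y": 20}, {"x": 150, "y": 50}]
print(resampled_rank_sum(group_a, group_b, "x"))
```

Because ranks rather than raw counts enter the statistic, a single extreme count has limited influence, which is the robustness property the abstract emphasizes.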
Comparative Transcriptome Analysis of the Accessory Sex Gland and Testis from the Chinese Mitten Crab (Eriocheir sinensis)

PubMed Central

He, Lin; Jiang, Hui; Cao, Dandan; Liu, Lihua; Hu, Songnian; Wang, Qun

2013-01-01

The accessory sex gland (ASG) is an important component of the male reproductive system, which functions to enhance the fertility of spermatozoa during male reproduction. Certain proteins secreted by the ASG are known to bind to the spermatozoa membrane and affect its function. The ASG gene expression profile in Chinese mitten crab (Eriocheir sinensis) has not been extensively studied, and limited genetic research has been conducted on this species. The advent of high-throughput sequencing technologies enables the generation of genomic resources within a short period of time and at minimal cost. In the present study, we performed de novo transcriptome sequencing to produce a comprehensive transcript dataset for the ASG of E. sinensis using Illumina sequencing technology. This analysis yielded a total of 33,221,284 sequencing reads, including 2.6 Gb of total nucleotides. Reads were assembled into 85,913 contigs (average 218 bp), or 58,567 scaffold sequences (average 292 bp), that identified 37,955 unigenes (average 385 bp). We assembled all unigenes and compared them with the published testis transcriptome from E. sinensis. In order to identify which genes may be involved in ASG function, as it pertains to modification of spermatozoa, we compared the ASG and testis transcriptome of E. sinensis. Our analysis identified specific genes with both higher and lower tissue expression levels in the two tissues, and the functions of these genes were analyzed to elucidate their potential roles during maturation of spermatozoa. Availability of detailed transcriptome data from ASG and testis in E. sinensis can assist our understanding of the molecular mechanisms involved with spermatozoa conservation, transport, maturation and capacitation and potentially acrosome activation. PMID:23342039

Gustatory Receptor Expression in the Labella and Tarsi of Aedes aegypti

DTIC Science & Technology

2013-01-01

Gibbs, R., Chen, R., 2011. The Drosophila melanogaster transcriptome by paired-end RNA sequencing. Genome Res. 21, 315-324. Debboun, M., Strickman, D...from genomic sequences and compared to previously identified insect GRs (Kent et al., 2008). In general, GRs of the two main mosquito subfamilies...almost always demonstrated conservation in Drosophila melanogaster as well. Twelve out of the total 40 AaegGRs with likely orthologs in An. gambiae had
Genomic analysis of expressed sequence tags in American black bear Ursus americanus

PubMed Central

2010-01-01

Background Species of the bear family (Ursidae) are important organisms for research in molecular evolution, comparative physiology and conservation biology, but relatively little genetic sequence information is available for this group. Here we report the development and analyses of the first large scale Expressed Sequence Tag (EST) resource for the American black bear (Ursus americanus). Results Comprehensive analyses of molecular functions, alternative splicing, and tissue-specific expression of 38,757 black bear EST sequences were conducted using the dog genome as a reference. We identified 18 genes, involved in functions such as lipid catabolism, cell cycle, and vesicle-mediated transport, that show rapid evolution in the bear lineage. Three genes, Phospholamban (PLN), cysteine glycine-rich protein 3 (CSRP3) and Troponin I type 3 (TNNI3), are related to heart contraction, and defects in these genes in humans lead to heart disease. Two genes, biphenyl hydrolase-like (BPHL) and CSRP3, contain positively selected sites in bear. Global analysis of evolution rates of hibernation-related genes in bear showed that they are largely conserved and slowly evolving genes, rather than novel and fast-evolving genes. Conclusion We provide a genomic resource for an important mammalian organism and our study sheds new light on the possible functions and evolution of bear genes. PMID:20338065
Genomic analysis of expressed sequence tags in American black bear Ursus americanus.

PubMed

Zhao, Sen; Shao, Chunxuan; Goropashnaya, Anna V; Stewart, Nathan C; Xu, Yichi; Tøien, Øivind; Barnes, Brian M; Fedorov, Vadim B; Yan, Jun

2010-03-26

Species of the bear family (Ursidae) are important organisms for research in molecular evolution, comparative physiology and conservation biology, but relatively little genetic sequence information is available for this group. Here we report the development and analyses of the first large scale Expressed Sequence Tag (EST) resource for the American black bear (Ursus americanus). Comprehensive analyses of molecular functions, alternative splicing, and tissue-specific expression of 38,757 black bear EST sequences were conducted using the dog genome as a reference. We identified 18 genes, involved in functions such as lipid catabolism, cell cycle, and vesicle-mediated transport, that show rapid evolution in the bear lineage. Three genes, Phospholamban (PLN), cysteine glycine-rich protein 3 (CSRP3) and Troponin I type 3 (TNNI3), are related to heart contraction, and defects in these genes in humans lead to heart disease. Two genes, biphenyl hydrolase-like (BPHL) and CSRP3, contain positively selected sites in bear. Global analysis of evolution rates of hibernation-related genes in bear showed that they are largely conserved and slowly evolving genes, rather than novel and fast-evolving genes. We provide a genomic resource for an important mammalian organism and our study sheds new light on the possible functions and evolution of bear genes.
Exploring the roles of DNA methylation in the metal-reducing bacterium Shewanella oneidensis MR-1

DOE Office of Scientific and Technical Information (OSTI.GOV)

Bendall, Matthew L.; Luong, Khai; Wetmore, Kelly M.

2013-08-30

We performed whole genome analyses of DNA methylation in Shewanella oneidensis MR-1 to examine its possible role in regulating gene expression and other cellular processes. Single-Molecule Real Time (SMRT) sequencing revealed extensive methylation of adenine (N6mA) throughout the genome. These methylated bases were located in five sequence motifs, including three novel targets for Type I restriction/modification enzymes. The sequence motifs targeted by putative methyltransferases were determined via SMRT sequencing of gene knockout mutants. In addition, we found S. oneidensis MR-1 cultures grown under various culture conditions displayed different DNA methylation patterns. However, the small number of differentially methylated sites could not be directly linked to the much larger number of differentially expressed genes in these conditions, suggesting DNA methylation is not a major regulator of gene expression in S. oneidensis MR-1. The enrichment of methylated GATC motifs in the origin of replication indicates DNA methylation may regulate genome replication in a manner similar to that seen in Escherichia coli. Furthermore, comparative analyses suggest that many Gammaproteobacteria, including all members of the Shewanellaceae family, may also utilize DNA methylation to regulate genome replication.
Antisense Transcription Is Pervasive but Rarely Conserved in Enteric Bacteria

PubMed Central

Raghavan, Rahul; Sloan, Daniel B.; Ochman, Howard

2012-01-01

ABSTRACT Noncoding RNAs, including antisense RNAs (asRNAs) that originate from the complementary strand of protein-coding genes, are involved in the regulation of gene expression in all domains of life. Recent application of deep-sequencing technologies has revealed that the transcription of asRNAs occurs genome-wide in bacteria. Although the role of the vast majority of asRNAs remains unknown, it is often assumed that their presence implies important regulatory functions, similar to those of other noncoding RNAs. Alternatively, many antisense transcripts may be produced by chance transcription events from promoter-like sequences that result from the degenerate nature of bacterial transcription factor binding sites. To investigate the biological relevance of antisense transcripts, we compared genome-wide patterns of asRNA expression in closely related enteric bacteria, Escherichia coli and Salmonella enterica serovar Typhimurium, by performing strand-specific transcriptome sequencing. Although antisense transcripts are abundant in both species, less than 3% of asRNAs are expressed at high levels in both species, and only about 14% appear to be conserved among species. And unlike the promoters of protein-coding genes, asRNA promoters show no evidence of sequence conservation between, or even within, species. Our findings suggest that many or even most bacterial asRNAs are nonadaptive by-products of the cell’s transcription machinery. PMID:22872780
Genomic Sequence around Butterfly Wing Development Genes: Annotation and Comparative Analysis

PubMed Central

Conceição, Inês C.; Long, Anthony D.; Gruber, Jonathan D.; Beldade, Patrícia

2011-01-01

Background Analysis of genomic sequence allows characterization of genome content and organization, and access beyond gene-coding regions for identification of functional elements. BAC libraries, where relatively large genomic regions are made readily available, are especially useful for species without a fully sequenced genome and can increase genomic coverage of phylogenetic and biological diversity. For example, no butterfly genome is yet available despite the unique genetic and biological properties of this group, such as diversified wing color patterns. The evolution and development of these patterns is being studied in a few target species, including Bicyclus anynana, where a whole-genome BAC library allows targeted access to large genomic regions. Methodology/Principal Findings We characterize ∼1.3 Mb of genomic sequence around 11 selected genes expressed in B. anynana developing wings. Extensive manual curation of in silico predictions, also making use of a large dataset of expressed genes for this species, identified repetitive elements and protein coding sequence, and highlighted an expansion of Alcohol dehydrogenase genes. Comparative analysis with orthologous regions of the lepidopteran reference genome allowed assessment of conservation of fine-scale synteny (with detection of new inversions and translocations) and of DNA sequence (with detection of high levels of conservation of non-coding regions around some, but not all, developmental genes). Conclusions The general properties and organization of the available B. anynana genomic sequence are similar to the lepidopteran reference, despite the more than 140 MY divergence. Our results lay the groundwork for further studies of new interesting findings in relation to both coding and non-coding sequence: 1) the Alcohol dehydrogenase expansion with higher similarity between the five tandemly-repeated B. anynana paralogs than with the corresponding B. mori orthologs, and 2) the high conservation of non-coding sequence around the genes wingless and Ecdysone receptor, both involved in multiple developmental processes including wing pattern formation. PMID:21909358
Identification and characterization of pyrokinin and CAPA peptides, and corresponding GPCRs from spotted wing drosophila, Drosophila suzukii.

PubMed

Choi, Man-Yeon; Ahn, Seung-Joon; Kim, A Young; Koh, Youngho

2017-05-15

The family of FXPRLamide peptides serves as a major insect hormone. It is characterized by a core active amino acid sequence conserved at the C-terminal ends, and provides various physiological roles across the Insecta. In this study we identified and characterized pyrokinin (PK) and CAPA cDNAs encoding two FXPRLamide peptides, pyrokinin and CAPA-DH (diapause hormone), and two corresponding G protein-coupled receptors (GPCRs) from spotted wing drosophila (SWD), Drosophila suzukii. Expressions of PK and CAPA mRNAs were differentially observed during all life stages except the embryo, and the detection of CAPA transcription was relatively strong compared with the PK gene in SWD. Both D. suzukii pyrokinin receptor (DrosuPKr) and CAPA-DH receptor (DrosuCAPA-DHr) were functionally expressed and confirmed through binding to PK and DH peptides. Differential expression of two GPCRs occurred during all life stages; a strong transcription of DrosuPKr was observed in the 3rd instar. DrosuCAPA-DHr was clearly expressed from the embryo to the larva, but not detected in the adult. Gene regulation during the life stages was not synchronized between ligand and receptor. For example, SWD CAPA mRNA has been up-regulated in the adult while CAPA-DHr was down-regulated. The difference could be from the CAPA mRNA translating multiple peptides including CAPA-DH and two CAPA-PVK (periviscerokinin) peptides to act on different receptors. Comparing the genes of SWD PK, CAPA, PKr and CAPA-DHr to four corresponding genes of D. melanogaster, SWD CAPA and the receptor are more similar to D. melanogaster than PK and the receptor. These data suggest that the CAPA gene could be evolutionally more conserved to have a common biological role in insects. In addition, the effect of Kozak sequences was investigated by the expression of the GPCRs with or without Kozak sequences in Sf9 insect cells. 
The PK receptor with a Kozak sequence was significantly less active than the original receptor (without a Kozak sequence). Our results provide knowledge of the potential biological function(s) of PK and CAPA-DH peptides in SWD, and possibly offer a novel control method for this pest insect in the future. Published by Elsevier Inc.
Next-generation sequencing facilitates quantitative analysis of wild-type and Nrl−/− retinal transcriptomes

PubMed Central

Brooks, Matthew J.; Rajasimha, Harsha K.; Roger, Jerome E.

2011-01-01

Purpose Next-generation sequencing (NGS) has revolutionized systems-based analysis of cellular pathways. The goals of this study are to compare NGS-derived retinal transcriptome profiling (RNA-seq) to microarray and quantitative reverse transcription polymerase chain reaction (qRT–PCR) methods and to evaluate protocols for optimal high-throughput data analysis. Methods Retinal mRNA profiles of 21-day-old wild-type (WT) and neural retina leucine zipper knockout (Nrl−/−) mice were generated by deep sequencing, in triplicate, using Illumina GAIIx. The sequence reads that passed quality filters were analyzed at the transcript isoform level with two methods: Burrows–Wheeler Aligner (BWA) followed by ANOVA (ANOVA) and TopHat followed by Cufflinks. qRT–PCR validation was performed using TaqMan and SYBR Green assays. Results Using an optimized data analysis workflow, we mapped about 30 million sequence reads per sample to the mouse genome (build mm9) and identified 16,014 transcripts in the retinas of WT and Nrl−/− mice with BWA workflow and 34,115 transcripts with TopHat workflow. RNA-seq data confirmed stable expression of 25 known housekeeping genes, and 12 of these were validated with qRT–PCR. RNA-seq data had a linear relationship with qRT–PCR for more than four orders of magnitude and a goodness of fit (R2) of 0.8798. Approximately 10% of the transcripts showed differential expression between the WT and Nrl−/− retina, with a fold change ≥1.5 and p value <0.05. Altered expression of 25 genes was confirmed with qRT–PCR, demonstrating the high degree of sensitivity of the RNA-seq method. Hierarchical clustering of differentially expressed genes uncovered several as yet uncharacterized genes that may contribute to retinal function. Data analysis with BWA and TopHat workflows revealed a significant overlap yet provided complementary insights in transcriptome profiling. 
Conclusions Our study represents the first detailed analysis of retinal transcriptomes, with biologic replicates, generated by RNA-seq technology. The optimized data analysis workflows reported here should provide a framework for comparative investigations of expression profiles. Our results show that NGS offers a comprehensive and more accurate quantitative and qualitative evaluation of mRNA content within a cell or tissue. We conclude that RNA-seq based transcriptome characterization would expedite genetic network analyses and permit the dissection of complex biologic functions. PMID:22162623
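The differential-expression criterion stated in the abstract above (fold change ≥1.5 and p value <0.05) can be illustrated with a minimal Python sketch. The function name, the transcript identifiers, and all expression and p values below are hypothetical and are not taken from the study; the sketch also assumes the fold change is taken in either direction, which the abstract does not specify.

```python
def is_differentially_expressed(mean_wt, mean_ko, p_value,
                                min_fold=1.5, max_p=0.05):
    """Return True when a transcript passes both thresholds.

    Assumption: the fold change is symmetric (up or down), so the
    larger of the two ratios is compared against min_fold.
    """
    if mean_wt <= 0 or mean_ko <= 0:
        # Fold change is undefined without expression in both samples;
        # treating this as "not called" is an illustrative choice.
        return False
    fold = max(mean_wt / mean_ko, mean_ko / mean_wt)
    return fold >= min_fold and p_value < max_p


# Hypothetical transcripts: (mean WT expression, mean Nrl-/- expression, p value)
transcripts = {
    "tx_a": (100.0, 30.0, 0.001),  # large change, significant
    "tx_b": (50.0, 45.0, 0.20),    # small change, not significant
    "tx_c": (10.0, 16.0, 0.04),    # 1.6-fold change, significant
    "tx_d": (80.0, 20.0, 0.08),    # large change, fails the p-value cut
}

selected = sorted(
    name for name, (wt, ko, p) in transcripts.items()
    if is_differentially_expressed(wt, ko, p)
)
print(selected)
```

Both conditions must hold at once, so a transcript with a large fold change but a p value above the cutoff (tx_d here) is not reported.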
Searching for resistance genes to Bursaphelenchus xylophilus using high throughput screening.

PubMed

Santos, Carla S; Pinheiro, Miguel; Silva, Ana I; Egas, Conceição; Vasconcelos, Marta W

2012-11-07

Pine wilt disease (PWD), caused by the pinewood nematode (PWN; Bursaphelenchus xylophilus), damages and kills pine trees and is causing serious economic damage worldwide. Although the ecological mechanism of infestation is well described, the plant's molecular response to the pathogen is not well known. This is due mainly to the lack of genomic information and the complexity of the disease. High throughput sequencing is now an efficient approach for detecting the expression of genes in non-model organisms, thus providing valuable information in spite of the lack of the genome sequence. In an attempt to unravel genes potentially involved in the pine defense against the pathogen, we hereby report the high throughput comparative sequence analysis of infested and non-infested stems of Pinus pinaster (very susceptible to PWN) and Pinus pinea (less susceptible to PWN). Four cDNA libraries from infested and non-infested stems of P. pinaster and P. pinea were sequenced in a full 454 GS FLX run, producing a total of 2,083,698 reads. The putative amino acid sequences encoded by the assembled transcripts were annotated according to Gene Ontology, to assign Pinus contigs into Biological Processes, Cellular Components and Molecular Functions categories. Most of the annotated transcripts corresponded to Picea genes (25.4-39.7%), whereas a smaller percentage matched Pinus genes (1.8-12.8%), probably a consequence of more public genomic information available for Picea than for Pinus. The comparative transcriptome analysis showed that when P. pinaster was infested with PWN, the genes malate dehydrogenase, ABA, water deficit stress related genes and PAR1 were highly expressed, while in PWN-infested P. pinea, the highly expressed genes were ricin B-related lectin, and genes belonging to the SNARE and high mobility group families. Quantitative PCR experiments confirmed the differential gene expression between the two pine species. Defense-related genes triggered by nematode infestation were detected in both P. pinaster and P. pinea transcriptomes utilizing 454 pyrosequencing technology. P. pinaster showed higher abundance of genes related to transcriptional regulation, terpenoid secondary metabolism (including some with nematicidal activity) and pathogen attack. P. pinea showed higher abundance of genes related to oxidative stress and higher levels of expression in general of stress responsive genes. This study provides essential information about the molecular defense mechanisms utilized by P. pinaster and P. pinea against PWN infestation and contributes to a better understanding of PWD.
Cloning of a neonatal calcium atpase isoform (SERCA 1B) from extraocular muscle of adult blue marlin (Makaira nigricans).

PubMed

Londraville, R L; Cramer, T D; Franck, J P; Tullis, A; Block, B A

2000-10-01

Complete cDNAs for the fast-twitch Ca2+ -ATPase isoform (SERCA 1) were cloned and sequenced from blue marlin (Makaira nigricans) extraocular muscle (EOM). Complete cDNAs for SERCA 1 were also cloned from fast-twitch skeletal muscle of the same species. The two sequences are identical over the coding region except for the last five codons on the carboxyl end; EOM SERCA 1 cDNA codes for 996 amino acids and the fast-twitch cDNAs code for 991 aa. Phylogenetic analysis revealed that EOM SERCA 1 clusters with an isoform of Ca2+ -ATPase normally expressed in early development of mammals (SERCA 1B). This is the first report of SERCA 1B in an adult vertebrate. RNA hybridization assays indicate that 1B expression is limited to extraocular muscles. Because EOM gives rise to the thermogenic heater organ in marlin, we investigated whether SERCA 1B may play a role in heat generation, or if 1B expression is common in EOM among vertebrates. Chicken also expresses SERCA 1B in EOM, but rat expresses SERCA 1A; because SERCA 1B is not specific to heater tissue we conclude it is unlikely that it plays a specific role in intracellular heat production. Comparative sequence analysis does reveal, however, several sites that may be the source of functional differences between fish and mammalian SERCAs.
Revealing impaired pathways in the an11 mutant by high-throughput characterization of Petunia axillaris and Petunia inflata transcriptomes.

PubMed

Zenoni, Sara; D'Agostino, Nunzio; Tornielli, Giovanni B; Quattrocchio, Francesca; Chiusano, Maria L; Koes, Ronald; Zethof, Jan; Guzzo, Flavia; Delledonne, Massimo; Frusciante, Luigi; Gerats, Tom; Pezzotti, Mario

2011-10-01

Petunia is an excellent model system, especially for genetic, physiological and molecular studies. Thus far, however, genome-wide expression analysis has been applied rarely because of the lack of sequence information. We applied next-generation sequencing to generate, through de novo read assembly, a large catalogue of transcripts for Petunia axillaris and Petunia inflata. On the basis of both transcriptomes, comprehensive microarray chips for gene expression analysis were established and used for the analysis of global- and organ-specific gene expression in Petunia axillaris and Petunia inflata and to explore the molecular basis of the seed coat defects in a Petunia hybrida mutant, anthocyanin 11 (an11), lacking a WD40-repeat (WDR) transcription regulator. Among the transcripts differentially expressed in an11 seeds compared with wild type, many expected targets of AN11 were found but also several interesting new candidates that might play a role in morphogenesis of the seed coat. Our results validate the combination of next-generation sequencing with microarray analyses strategies to identify the transcriptome of two petunia species without previous knowledge of their genome, and to develop comprehensive chips as useful tools for the analysis of gene expression in P. axillaris, P. inflata and P. hybrida. © 2011 The Authors. The Plant Journal © 2011 Blackwell Publishing Ltd.
Identification of Differentially Expressed miRNAs between White and Black Hair Follicles by RNA-Sequencing in the Goat (Capra hircus)

PubMed Central

Wu, Zhenyang; Fu, Yuhua; Cao, Jianhua; Yu, Mei; Tang, Xiaohui; Zhao, Shuhong

2014-01-01

MicroRNAs (miRNAs) play a key role in many biological processes by regulating gene expression at the post-transcriptional level. A number of miRNAs have been identified from livestock species. However, compared with other animals, such as pigs and cows, the number of miRNAs identified in goats is quite low, particularly in hair follicles. In this study, to investigate the functional roles of miRNAs in the hair follicles of goats with different coat colors, we sequenced miRNAs from two hair follicle samples (white and black) using Solexa sequencing. A total of 35,604,016 reads were obtained, which included 30,878,637 clean reads (86.73%). MiRDeep2 software identified 214 miRNAs. Among them, 205 were conserved among species and nine were novel miRNAs. Furthermore, DESeq software identified six differentially expressed miRNAs. Quantitative PCR confirmed differential expression of two miRNAs, miR-10b and miR-211. KEGG pathways were analyzed using the DAVID website for the predicted target genes of the differentially expressed miRNAs. Several signaling pathways including Notch and MAPK pathways may affect the process of coat color formation. Our study showed that the identified miRNAs might play an essential role in black and white follicle formation in goats. PMID:24879525
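The read and miRNA counts reported in the abstract above can be checked with simple arithmetic. The short Python sketch below uses only the figures quoted in the abstract; the variable names are illustrative.

```python
# Counts quoted in the abstract above.
total_reads = 35_604_016
clean_reads = 30_878_637

# Share of reads retained after cleaning, as a percentage.
clean_pct = round(100 * clean_reads / total_reads, 2)

# Conserved plus novel miRNAs should match the reported total of 214.
conserved_mirnas = 205
novel_mirnas = 9
total_mirnas = conserved_mirnas + novel_mirnas

print(f"clean reads: {clean_pct}%")
print(f"miRNAs identified: {total_mirnas}")
```

Both results agree with the values stated in the abstract (86.73% clean reads and 214 miRNAs).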
Suppression of prolactin gene expression in GH cells correlates with site-specific DNA methylation.

PubMed

Zhang, Z X; Kumar, V; Rivera, R T; Pasion, S G; Chisholm, J; Biswas, D K

1989-10-01

Prolactin- (PRL) producing and nonproducing subclones of the GH line of (rat) pituitary tumor cells have been compared to elucidate the regulatory mechanisms of PRL gene expression. Particular emphasis was placed on delineating the molecular basis of the suppressed state of the PRL gene in the prolactin-nonproducing (PRL-) GH subclone (GH(1)2C1). We examined six methylatable cytosine residues (5, -CCGG- and 1, -GCGC-) within the 30-kb region of the PRL gene in these subclones. This analysis revealed that -CCGG-sequences of the transcribed region, and specifically, one in the fourth exon of the PRL gene, were heavily methylated in the PRL-, GH(1)2C1 cells. Furthermore, the inhibition of PRL gene expression in GH(1)2C1 was reversed by short-term treatment of the cells with a sublethal concentration of azacytidine (AzaC), an inhibitor of DNA methylation. The reversion of PRL gene expression by AzaC was correlated with the concurrent demethylation of the same -CCGG- sequences in the transcribed region of PRL gene. An inverse correlation between PRL gene expression and the level of methylation of the internal -C- residues in the specific -CCGG-sequence of the transcribed region of the PRL gene was demonstrated. The DNase I sensitivity of these regions of the PRL gene in PRL+, PRL-, and AzaC-treated cells was also consistent with an inverse relationship between methylation state, a higher order of structural modification, and gene expression.(ABSTRACT TRUNCATED AT 250 WORDS)
Murine mesenchymal and embryonic stem cells express a similar Hox gene profile.

PubMed

Phinney, Donald G; Gray, Andrew J; Hill, Katy; Pandey, Amitabh

2005-12-30

Using degenerate oligonucleotide primers targeting the homeobox domain, we amplified by PCR and sequenced 723 clones from five murine cell populations and lines derived from embryonic mesoderm and adult bone marrow. Transcripts from all four vertebrate Hox clusters were expressed by the different populations. Hierarchical clustering of the data revealed that mesenchymal stem cells (MSCs) and the embryonic stem (ES) cell line D3 shared a similar Hox expression profile. These populations exclusively expressed Hoxb2, Hoxb5, Hoxb7, and Hoxc4, transcripts regulating self-renewal and differentiation of other stem cells. Additionally, Hoxa7 transcript quantified by real-time PCR strongly correlated (r2=0.89) with the number of Hoxa7 clones identified by sequencing, validating that data from the PCR screen reflects differences in Hox mRNA abundance between populations. This is the first study to catalogue Hox transcripts in murine MSCs and by comparative analyses identify specific Hox genes that may contribute to their stem cell character.
Molecular Cloning and Characterization of a New C-type Lysozyme Gene from Yak Mammary Tissue

PubMed Central

Jiang, Ming Feng; Hu, Ming Jun; Ren, Hong Hui; Wang, Li

2015-01-01

Milk lysozyme is a ubiquitous enzyme in the milk of mammals. In this study, the cDNA sequence of a new chicken-type (c-type) milk lysozyme gene (YML) was cloned from yak mammary gland tissue. A 444 bp open reading frame, which encodes 148 amino acids (16.54 kDa) with a signal peptide of 18 amino acids, was sequenced. Further analysis indicated that the nucleic acid and amino acid sequence identities between yak and cow milk lysozyme were 89.04% and 80.41%, respectively. Recombinant yak milk lysozyme (rYML) was produced by Escherichia coli BL21 and Pichia pastoris X33. The highest lysozyme activity was detected for heterologous protein rYML5 (M = 1,864.24 U/mg, SD = 25.75) which was expressed in P. pastoris with expression vector pPICZαA and it clearly inhibited growth of Staphylococcus aureus. Analysis of YML gene expression using quantitative polymerase chain reaction showed that the YML gene was up-regulated to a maximum at 30 days postpartum, that is, comparatively high YML can be found in initial milk production. The phylogenetic tree indicated that the amino acid sequence was similar to cow kidney lysozyme, which implied that the YML may have diverged from a different ancestor gene such as cow mammary glands. We suggest that YML is a new c-type lysozyme expressed in yak mammary glands that plays a role in host immunity. PMID:26580446
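The sequence lengths quoted in the abstract above are related by the triplet genetic code: three nucleotides per codon. The Python sketch below works through that arithmetic. It assumes the 444 bp figure covers exactly the 148 coding codons (if a stop codon were included in the 444 bp, the encoded protein would be one residue shorter), and the derived mature length and average residue mass are illustrative calculations, not values reported by the authors.

```python
def codons_in_orf(orf_length_bp):
    """Number of codons in an open reading frame of the given length."""
    if orf_length_bp % 3 != 0:
        raise ValueError("ORF length must be a multiple of 3")
    return orf_length_bp // 3


orf_length_bp = 444        # from the abstract
signal_peptide_aa = 18     # from the abstract
reported_mass_kda = 16.54  # from the abstract

residues = codons_in_orf(orf_length_bp)

# Derived here for illustration: residues left after signal peptide removal.
mature_residues = residues - signal_peptide_aa

# Derived here for illustration: average mass per residue in daltons.
avg_residue_mass_da = reported_mass_kda * 1000 / residues

print(residues, mature_residues, round(avg_residue_mass_da, 1))
```

The average of roughly 112 Da per residue is in the range commonly seen for proteins, so the reported length and mass are mutually consistent.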
Integration of Next Generation Sequencing and EPR Analysis to Uncover Molecular Mechanism Underlying Shell Color Variation in Scallops

PubMed Central

Sun, Xiujun; Liu, Zhihong; Zhou, Liqing; Wu, Biao; Dong, Yinghui; Yang, Aiguo

2016-01-01

The Yesso scallop Patinopecten yessoensis displays polymorphism in shell colors, which is of great interest for the scallop industry. To identify genes involved in the shell coloration, in the present study, we investigate the transcriptome differences by Illumina digital gene expression (DGE) analysis in two extreme color phenotypes, Red and White. Illumina sequencing yields a total of 62,715,364 clean sequence reads, and more than 85% of reads are mapped to our previously sequenced transcriptome. There are 25 significantly differentially expressed genes between Red and White scallops. EPR (electron paramagnetic resonance) analysis has identified EPR spectra of pheomelanin and eumelanin in the red shells, but not in the white shells. Compared to the Red scallops, the White scallops have relatively higher mRNA expression in tyrosinase genes, but lower expression in other melanogenesis-associated genes. Meanwhile, the relatively lower tyrosinase protein and decreased tyrosinase activity in White scallops are suggested to be associated with the lack of melanin in the white shells. Our findings highlight the functional roles of melanogenesis-associated genes in the melanization process of scallop shells, and shed new light on the transcriptional and post-transcriptional mechanisms in the regulation of tyrosinase activity during the process of melanin synthesis. The present results will assist our molecular understanding of melanin synthesis underlying shell color polymorphism in scallops, as well as other bivalves, and also help the color-based breeding in shellfish aquaculture. PMID:27563719
Differential protein expression in alligator leukocytes in response to bacterial lipopolysaccharide injection.

PubMed

Merchant, Mark; Kinney, Clint; Sanders, Paige

2009-12-01

Blood was collected from three juvenile alligators (Alligator mississippiensis) before, and again 24h after, injection with bacterial lipopolysaccharide (LPS). The leukocytes were collected from both samples, and the proteins were extracted. Each group of proteins was labeled with a different fluorescent dye and the differences in protein expression were analyzed by two dimensional differential in-gel expressions (2D-DIGE). The proteins which appeared to be increased or decreased by treatment with LPS were selected and analyzed by MALDI-TOF to determine mass and LC-MS/MS to acquire the partial protein sequences. The peptide sequences were compared to the NCBI protein sequence database to determine homology with other sequences from other species. Several proteins of interest appeared to be increased upon LPS stimulation. Proteins with homology to human transgelin-2, fish glucose-6-phosphate dehydrogenase, amphibian α-enolase, alligator lactate dehydrogenase, fish ubiquitin-activating enzyme, and fungal β-tubulin were also increased after LPS injection. Proteins with homology to fish vimentin 4, murine heterogeneous nuclear ribonucleoprotein A3, and avian calreticulin were found to be decreased in response to LPS. In addition, five proteins, four of which were up-regulated (827, 560, 512, and 650%) and one that exhibited repressed expression (307%), did not show homology to any protein in the database, and thus may represent newly discovered proteins. We are using this biochemical approach to isolate and characterize alligator proteins with potential relevant immune function.
Cutaneous squamous and neuroendocrine carcinoma: genetically and immunohistochemically different from Merkel cell carcinoma

PubMed Central

Pulitzer, Melissa P; Brannon, A Rose; Berger, Michael F; Louis, Peter; Scott, Sasinya N; Jungbluth, Achim A; Coit, Daniel G; Brownell, Isaac; Busam, Klaus J

2016-01-01

Cutaneous neuroendocrine (Merkel cell) carcinoma most often arises de novo in the background of a clonally integrated virus, the Merkel cell polyomavirus, and is notable for positive expression of retinoblastoma 1 (RB1) protein and low expression of p53 compared with the rare Merkel cell polyomavirus-negative Merkel cell carcinomas. Combined squamous and Merkel cell tumors are consistently negative for Merkel cell polyomavirus. Little is known about their immunophenotypic or molecular profile. Herein, we studied 10 combined cutaneous squamous cell and neuroendocrine carcinomas for immunohistochemical expression of p53, retinoblastoma 1 protein, neurofilament, p63, and cytokeratin 20 (CK20). We compared mutation profiles of five combined Merkel cell carcinomas and seven ‘pure’ Merkel cell carcinomas using targeted next-generation sequencing. Combined tumors were from the head, trunk, and leg of Caucasian males and one female aged 52–89. All cases were highly p53- and p63-positive and neurofilament-negative in the squamous component, whereas RB1-negative in both components. Eight out of 10 were p53-positive, 3/10 p63-positive, and 3/10 focally neurofilament-positive in the neuroendocrine component. Six out of 10 were CK20-positive in any part. By next-generation sequencing, combined tumors were highly mutated, with an average of 48 mutations per megabase compared with pure tumors, which showed 1.25 mutations per megabase. RB1 and p53 mutations were identified in all five combined tumors. Combined tumors represent an immunophenotypically and genetically distinct variant of primary cutaneous neuroendocrine carcinomas, notable for a highly mutated genetic profile, significant p53 expression and/or mutation, absent RB1 expression in the context of increased RB1 mutation, and minimal neurofilament expression. PMID:26022453
Cutaneous squamous and neuroendocrine carcinoma: genetically and immunohistochemically different from Merkel cell carcinoma.

PubMed

Pulitzer, Melissa P; Brannon, A Rose; Berger, Michael F; Louis, Peter; Scott, Sasinya N; Jungbluth, Achim A; Coit, Daniel G; Brownell, Isaac; Busam, Klaus J

2015-08-01

Cutaneous neuroendocrine (Merkel cell) carcinoma most often arises de novo in the background of a clonally integrated virus, the Merkel cell polyomavirus, and is notable for positive expression of retinoblastoma 1 (RB1) protein and low expression of p53 compared with the rare Merkel cell polyomavirus-negative Merkel cell carcinomas. Combined squamous and Merkel cell tumors are consistently negative for Merkel cell polyomavirus. Little is known about their immunophenotypic or molecular profile. Herein, we studied 10 combined cutaneous squamous cell and neuroendocrine carcinomas for immunohistochemical expression of p53, retinoblastoma 1 protein, neurofilament, p63, and cytokeratin 20 (CK20). We compared mutation profiles of five combined Merkel cell carcinomas and seven 'pure' Merkel cell carcinomas using targeted next-generation sequencing. Combined tumors were from the head, trunk, and leg of Caucasian males and one female aged 52-89. All cases were highly p53- and p63-positive and neurofilament-negative in the squamous component, whereas RB1-negative in both components. Eight out of 10 were p53-positive, 3/10 p63-positive, and 3/10 focally neurofilament-positive in the neuroendocrine component. Six out of 10 were CK20-positive in any part. By next-generation sequencing, combined tumors were highly mutated, with an average of 48 mutations per megabase compared with pure tumors, which showed 1.25 mutations per megabase. RB1 and p53 mutations were identified in all five combined tumors. Combined tumors represent an immunophenotypically and genetically distinct variant of primary cutaneous neuroendocrine carcinomas, notable for a highly mutated genetic profile, significant p53 expression and/or mutation, absent RB1 expression in the context of increased RB1 mutation, and minimal neurofilament expression.
Sequencing of the needle transcriptome from Norway spruce (Picea abies Karst L.) reveals lower substitution rates, but similar selective constraints in gymnosperms and angiosperms

PubMed Central

2012-01-01

Background A detailed knowledge about spatial and temporal gene expression is important for understanding both the function of genes and their evolution. For the vast majority of species, transcriptomes are still largely uncharacterized and even in those where substantial information is available it is often in the form of partially sequenced transcriptomes. With the development of next generation sequencing, a single experiment can now simultaneously identify the transcribed part of a species genome and estimate levels of gene expression. Results mRNA from actively growing needles of Norway spruce (Picea abies) was sequenced using next generation sequencing technology. In total, close to 70 million fragments with a length of 76 bp were sequenced resulting in 5 Gbp of raw data. A de novo assembly of these reads, together with publicly available expressed sequence tag (EST) data from Norway spruce, was used to create a reference transcriptome. Of the 38,419 PUTs (putative unique transcripts) longer than 150 bp in this reference assembly, 83.5% show similarity to ESTs from other spruce species and of the remaining PUTs, 3,704 show similarity to protein sequences from other plant species, leaving 4,167 PUTs with limited similarity to currently available plant proteins. By predicting coding frames and comparing not only the Norway spruce PUTs, but also PUTs from the close relatives Picea glauca and Picea sitchensis to both Pinus taeda and Taxus mairei, we obtained estimates of synonymous and non-synonymous divergence among conifer species. In addition, we detected close to 15,000 SNPs of high quality and estimated gene expression differences between samples collected under dark and light conditions. Conclusions Our study yielded a large number of single nucleotide polymorphisms as well as estimates of gene expression on transcriptome scale. 
In agreement with a recent study we find that the synonymous substitution rate per year (0.6 × 10⁻⁹ and 1.1 × 10⁻⁹) is an order of magnitude smaller than values reported for angiosperm herbs. However, if one takes generation time into account, most of this difference disappears. The estimates of the dN/dS ratio (non-synonymous over synonymous divergence) reported here are in general much lower than 1 and only a few genes showed a ratio larger than 1. PMID:23122049
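Two calculations implied by the abstract above can be sketched in Python: the raw sequencing yield (number of fragments times read length) and the conversion of a per-year substitution rate into a per-generation rate (rate per year times generation time). The fragment count of 70 million is an approximation of "close to 70 million", and the 25-year generation time is a purely hypothetical value chosen for illustration; the abstract does not state which generation times the authors used.

```python
def raw_yield_gbp(n_fragments, read_length_bp):
    """Total sequenced bases, in gigabase pairs."""
    return n_fragments * read_length_bp / 1e9


def per_generation_rate(rate_per_year, generation_time_years):
    """Convert a per-site, per-year rate into a per-site, per-generation rate."""
    return rate_per_year * generation_time_years


# "Close to 70 million" fragments of 76 bp; 70 million is an upper-end approximation.
approx_yield = raw_yield_gbp(70_000_000, 76)

# HYPOTHETICAL generation time, for illustration only (not from the abstract).
ASSUMED_GENERATION_YEARS = 25

low = per_generation_rate(0.6e-9, ASSUMED_GENERATION_YEARS)
high = per_generation_rate(1.1e-9, ASSUMED_GENERATION_YEARS)

print(f"approximate raw yield: {approx_yield:.2f} Gbp")
print(f"per-generation rate range: {low:.2e} to {high:.2e}")
```

The yield of about 5.3 Gbp is in line with the roughly 5 Gbp reported. The second calculation shows why a long generation time can offset a low per-year rate: under the assumed 25 years, the per-generation rates are 25 times the per-year values.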

Transcriptome analysis of stem development in the tumourous stem mustard Brassica juncea var. tumida Tsen et Lee by RNA sequencing.

PubMed

Sun, Quan; Zhou, Guanfan; Cai, Yingfan; Fan, Yonghong; Zhu, Xiaoyan; Liu, Yihua; He, Xiaohong; Shen, Jinjuan; Jiang, Huaizhong; Hu, Daiwen; Pan, Zheng; Xiang, Liuxin; He, Guanghua; Dong, Daiwen; Yang, Jianping

2012-04-21

Tumourous stem mustard (Brassica juncea var. tumida Tsen et Lee) is an economically and nutritionally important vegetable crop of the Cruciferae family that also provides the raw material for Fuling mustard. The genetics breeding, physiology, biochemistry and classification of mustards have been extensively studied, but little information is available on tumourous stem mustard at the molecular level. To gain greater insight into the molecular mechanisms underlying stem swelling in this vegetable and to provide additional information for molecular research and breeding, we sequenced the transcriptome of tumourous stem mustard at various stem developmental stages and compared it with that of a mutant variety lacking swollen stems. Using Illumina short-read technology with a tag-based digital gene expression (DGE) system, we performed de novo transcriptome assembly and gene expression analysis. In our analysis, we assembled genetic information for tumourous stem mustard at various stem developmental stages. In addition, we constructed five DGE libraries, which covered the strains Yong'an and Dayejie at various development stages. Illumina sequencing identified 146,265 unigenes, including 11,245 clusters and 135,020 singletons. The unigenes were subjected to a BLAST search and annotated using the GO and KO databases. We also compared the gene expression profiles of three swollen stem samples with those of two non-swollen stem samples. A total of 1,042 genes with significantly different expression levels occurring simultaneously in the six comparison groups were screened out. Finally, the altered expression levels of a number of randomly selected genes were confirmed by quantitative real-time PCR. 
Our data provide comprehensive gene expression information at the transcriptional level and the first insight into the understanding of the molecular mechanisms and regulatory pathways of stem swelling and development in this plant, and will help define new mechanisms of stem development in non-model plant organisms.
CCAAT/enhancer-binding protein β is involved in the breed-dependent transcriptional regulation of 3β-hydroxysteroid dehydrogenase/Δ(5)-Δ(4)-isomerase in adrenal gland of preweaning piglets.

PubMed

Li, Xian; Li, Runsheng; Jia, Yimin; Sun, Zhiyuan; Yang, Xiaojing; Sun, Qinwei; Zhao, Ruqian

2013-11-01

The enzyme 3β-hydroxysteroid dehydrogenase/Δ(5)-Δ(4)-isomerase (3β-HSD) catalyzes the biosynthesis of all steroid hormones. The molecular mechanisms regulating porcine adrenal 3β-HSD expression in different breeds are still poorly understood. In this study, we aimed to compare the expression of 3β-HSD between preweaning purebred Large White (LW) and Erhualian (EHL) piglets and to explore the potential factors regulating 3β-HSD transcription. EHL had significantly higher serum levels of cortisol (P<0.01) and testosterone (P<0.01), which were associated with significantly higher expression of 3β-HSD mRNA (P<0.01) and protein (P<0.05) in the adrenal gland, compared with LW piglets. The 5' flanking region of the porcine 3β-HSD gene showed significant sequence variations between breeds, and the sequence of EHL demonstrated an elevated promoter activity (P<0.05) in luciferase reporter gene assay. Higher adrenal expression of 3β-HSD in EHL was accompanied with higher CCAAT/enhancer binding protein β (C/EBPβ) expression (P<0.05), enriched histone H3 acetylation (P<0.05) and C/EBPβ binding to 3β-HSD promoter (P<0.05). In addition, higher androgen receptor (AR) (P=0.06) and lower glucocorticoid receptor (GR) (P<0.05) were detected in EHL. Co-immunoprecipitation analysis revealed interactions of C/EBPβ with both AR and GR. These results indicate that the C/EBPβ binding to 3β-HSD promoter is responsible, at least in part, for the breed-dependent 3β-HSD expression in adrenal gland of piglets. The sequence variations of 3β-HSD promoter and the interactions of AR and/or GR with C/EBPβ may also participate in the regulation. Copyright © 2013 Elsevier Ltd. All rights reserved.
Human endogenous retrovirus expression is inversely related with the up-regulation of interferon-inducible genes in the skin of patients with lichen planus.

PubMed

Nogueira, Marcelle Almeida de Sousa; Gavioli, Camila Fátima Biancardi; Pereira, Nátalli Zanete; de Carvalho, Gabriel Costa; Domingues, Rosana; Aoki, Valéria; Sato, Maria Notomi

2015-04-01

Lichen planus (LP) is a common inflammatory skin disease of unknown etiology. Reports of a common transactivation of quiescent human endogenous retroviruses (HERVs) support the connection of viruses to the disease. HERVs are ancient retroviral sequences in the human genome and their transcription is often deregulated in cancer and autoimmune diseases. We explored the transcriptional activity of HERV sequences as well as the antiviral restriction factor and interferon-inducible genes in the skin from LP patients and healthy control (HC) donors. The study included 13 skin biopsies from patients with LP and 12 controls. Real-time PCR assay identified a significant decrease in the HERV-K gag and env mRNA expression levels in LP subjects compared with the control group. The expressions of HERV-K18 and HERV-W env were also inhibited in the skin of LP patients. We observed a strong correlation between HERV-K gag and other HERV sequences, regardless of the down-modulation of transcript levels in the LP group. In contrast, a significant up-regulation of the cytidine deaminase APOBEC 3G (apolipoprotein B mRNA-editing), and the GTPase MxA (Myxovirus resistance A) mRNA expression level was identified in the LP skin specimens. Other transcript expressions, such as the master regulator of type I interferon-dependent immune responses, STING (stimulator of interferon genes) and IRF-7 (interferon regulatory factor 7), IFN-β and the inflammasome NALP3, had increased levels in LP compared with the HC group. Our study suggests that interferon-inducible factors, in addition to their role in innate immunity against exogenous pathogens, contribute to the immune control of HERVs. Evaluation of the balance between HERV and interferon-inducible factor expression could possibly contribute to surveillance of inflammatory/malignant status of skin diseases.
The role of heterologous chloroplast sequence elements in transgene integration and expression.

PubMed

Ruhlman, Tracey; Verma, Dheeraj; Samson, Nalapalli; Daniell, Henry

2010-04-01

Heterologous regulatory elements and flanking sequences have been used in chloroplast transformation of several crop species, but their roles and mechanisms have not yet been investigated. Nucleotide sequence identity in the photosystem II protein D1 (psbA) upstream region is 59% across all taxa; similar variation was consistent across all genes and taxa examined. Secondary structure and predicted Gibbs free energy values of the psbA 5' untranslated region (UTR) among different families reflected this variation. Therefore, chloroplast transformation vectors were made for tobacco (Nicotiana tabacum) and lettuce (Lactuca sativa), with endogenous (Nt-Nt, Ls-Ls) or heterologous (Nt-Ls, Ls-Nt) psbA promoter, 5' UTR and 3' UTR, regulating expression of the anthrax protective antigen (PA) or human proinsulin (Pins) fused with the cholera toxin B-subunit (CTB). Unique lettuce flanking sequences were completely eliminated during homologous recombination in the transplastomic tobacco genomes but not unique tobacco sequences. Nt-Ls or Ls-Nt transplastomic lines showed reduction of 80% PA and 97% CTB-Pins expression when compared with endogenous psbA regulatory elements, which accumulated up to 29.6% total soluble protein PA and 72.0% total leaf protein CTB-Pins, 2-fold higher than Rubisco. Transgene transcripts were reduced by 84% in Ls-Nt-CTB-Pins and by 72% in Nt-Ls-PA lines. Transcripts containing endogenous 5' UTR were stabilized in nonpolysomal fractions. Stromal RNA-binding proteins were preferentially associated with endogenous psbA 5' UTR. A rapid and reproducible regeneration system was developed for lettuce commercial cultivars by optimizing plant growth regulators. These findings underscore the need for sequencing complete crop chloroplast genomes, utilization of endogenous regulatory elements and flanking sequences, as well as optimization of plant growth regulators for efficient chloroplast transformation.
The Role of Heterologous Chloroplast Sequence Elements in Transgene Integration and Expression

PubMed Central

Ruhlman, Tracey; Verma, Dheeraj; Samson, Nalapalli; Daniell, Henry

2010-01-01

Heterologous regulatory elements and flanking sequences have been used in chloroplast transformation of several crop species, but their roles and mechanisms have not yet been investigated. Nucleotide sequence identity in the photosystem II protein D1 (psbA) upstream region is 59% across all taxa; similar variation was consistent across all genes and taxa examined. Secondary structure and predicted Gibbs free energy values of the psbA 5′ untranslated region (UTR) among different families reflected this variation. Therefore, chloroplast transformation vectors were made for tobacco (Nicotiana tabacum) and lettuce (Lactuca sativa), with endogenous (Nt-Nt, Ls-Ls) or heterologous (Nt-Ls, Ls-Nt) psbA promoter, 5′ UTR and 3′ UTR, regulating expression of the anthrax protective antigen (PA) or human proinsulin (Pins) fused with the cholera toxin B-subunit (CTB). Unique lettuce flanking sequences were completely eliminated during homologous recombination in the transplastomic tobacco genomes but not unique tobacco sequences. Nt-Ls or Ls-Nt transplastomic lines showed reduction of 80% PA and 97% CTB-Pins expression when compared with endogenous psbA regulatory elements, which accumulated up to 29.6% total soluble protein PA and 72.0% total leaf protein CTB-Pins, 2-fold higher than Rubisco. Transgene transcripts were reduced by 84% in Ls-Nt-CTB-Pins and by 72% in Nt-Ls-PA lines. Transcripts containing endogenous 5′ UTR were stabilized in nonpolysomal fractions. Stromal RNA-binding proteins were preferentially associated with endogenous psbA 5′ UTR. A rapid and reproducible regeneration system was developed for lettuce commercial cultivars by optimizing plant growth regulators. These findings underscore the need for sequencing complete crop chloroplast genomes, utilization of endogenous regulatory elements and flanking sequences, as well as optimization of plant growth regulators for efficient chloroplast transformation. PMID:20130101
DMRT gene cluster analysis in the platypus: new insights into genomic organization and regulatory regions.

PubMed

El-Mogharbel, Nisrine; Wakefield, Matthew; Deakin, Janine E; Tsend-Ayush, Enkhjargal; Grützner, Frank; Alsop, Amber; Ezaz, Tariq; Marshall Graves, Jennifer A

2007-01-01

We isolated and characterized a cluster of platypus DMRT genes and compared their arrangement, location, and sequence across vertebrates. The DMRT gene cluster on human 9p24.3 harbors, in order, DMRT1, DMRT3, and DMRT2, which share a DM domain. DMRT1 is highly conserved and involved in sexual development in vertebrates, and deletions in this region cause sex reversal in humans. Sequence comparisons of DMRT genes between species have been valuable in identifying exons, control regions, and conserved nongenic regions (CNGs). The addition of platypus sequences is expected to be particularly valuable, since monotremes fill a gap in the vertebrate genome coverage. We therefore isolated and fully sequenced platypus BAC clones containing DMRT3 and DMRT2 as well as DMRT1 and then generated multispecies alignments and ran prediction programs followed by experimental verification to annotate this gene cluster. We found that the three genes have 58-66% identity to their human orthologues, lie in the same order as in other vertebrates, and colocate on 1 of the 10 platypus sex chromosomes, X5. We also predict that optimal annotation of the newly sequenced platypus genome will be challenging. The analysis of platypus sequence revealed differences in structure and sequence of the DMRT gene cluster. Multispecies comparison was particularly effective for detecting CNGs, revealing several novel potential regulatory regions within DMRT3 and DMRT2 as well as DMRT1. RT-PCR indicated that platypus DMRT1 and DMRT3 are expressed specifically in the adult testis (and not ovary), but DMRT2 has a wider expression profile, as it does for other mammals. The platypus DMRT1 expression pattern, and its location on an X chromosome, suggests an involvement in monotreme sexual development.
Identification of an evolutionarily conserved regulatory element of the zebrafish col2a1a gene.

PubMed

Dale, Rodney M; Topczewski, Jacek

2011-09-15

Zebrafish (Danio rerio) is an excellent model organism for the study of vertebrate development including skeletogenesis. Studies of mammalian cartilage formation were greatly advanced through the use of a cartilage specific regulatory element of the Collagen type II alpha 1 (Col2a1) gene. In an effort to isolate such an element in zebrafish, we compared the expression of two col2a1 homologues and found that expression of col2a1b, a previously uncharacterized zebrafish homologue, only partially overlaps with col2a1a. We focused our analysis on col2a1a, as it is expressed in both the stacked chondrocytes and the perichondrium. By comparing the genomic sequence surrounding the predicted transcriptional start site of col2a1a among several species of teleosts we identified a small highly conserved sequence (R2) located 1.7 kb upstream of the presumptive transcriptional initiation site. Interestingly, neither the sequence nor location of this element is conserved between teleost and mammalian Col2a1. We generated transient and stable transgenic lines with just the R2 element or the entire 1.7 kb fragment 5' of the transcriptional initiation site. The identified regulatory elements enable the tracking of cellular development in various tissues by driving robust reporter expression in craniofacial cartilage, ear, notochord, floor plate, hypochord and fins in a pattern similar to the expression of endogenous col2a1a. Using a reporter gene driven by the R2 regulatory element, we analyzed the morphogenesis of the notochord sheath cells as they withdraw from the stack of initially uniform cells and encase the inflating vacuolated notochord cells. Finally, we show that like endogenous col2a1a, craniofacial expression of these reporter constructs depends on Sox9a transcription factor activity. At the same time, notochord expression is maintained after Sox9a knockdown, suggesting that other factors can activate expression through the identified regulatory element in this tissue. 
Copyright © 2011 Elsevier Inc. All rights reserved.
Identification of an evolutionarily conserved regulatory element of the zebrafish col2a1a gene

PubMed Central

Dale, Rodney M.; Topczewski, Jacek

2011-01-01

Zebrafish (Danio rerio) is an excellent model organism for the study of vertebrate development including skeletogenesis. Studies of mammalian cartilage formation were greatly advanced through the use of a cartilage specific regulatory element of the Collagen type II alpha 1 (Col2a1) gene. In an effort to isolate such an element in zebrafish, we compared the expression of two col2a1 homologues and found that expression of col2a1b, a previously uncharacterized zebrafish homologue, only partially overlaps with col2a1a. We focused our analysis on col2a1a, as it is expressed in both the stacked chondrocytes and the perichondrium. By comparing the genomic sequence surrounding the predicted transcriptional start site of col2a1a among several species of teleosts we identified a small highly conserved sequence (R2) located 1.7 kb upstream of the presumptive transcriptional initiation site. Interestingly, neither the sequence nor location of this element is conserved between teleost and mammalian Col2a1. We generated transient and stable transgenic lines with just the R2 element or the entire 1.7 kb fragment 5’ of the transcriptional initiation site. The identified regulatory elements enable the tracking of cellular development in various tissues by driving robust reporter expression in craniofacial cartilage, ear, notochord, floor plate, hypochord and fins in a pattern similar to the expression of endogenous col2a1a. Using a reporter gene driven by the R2 regulatory element, we analyzed the morphogenesis of the notochord sheath cells as they withdraw from the stack of initially uniform cells and encase the inflating vacuolated notochord cells. Finally, we show that like endogenous col2a1a, craniofacial expression of these reporter constructs depends on Sox9a transcription factor activity. At the same time, notochord expression is maintained after Sox9a knockdown, suggesting that other factors can activate expression through the identified regulatory element in this tissue. 
PMID:21723274
E74-like factor 2 regulates valosin-containing protein expression

DOE Office of Scientific and Technical Information (OSTI.GOV)

Zhang, Binglin; Tomita, Yasuhiko; Qiu, Ying

2007-05-11

Enhanced expression of valosin-containing protein (VCP) correlates with invasion and metastasis of cancers. To clarify the transcription mechanism of VCP, human and mouse genomic sequences were compared, revealing a 260 bp DNA sequence in the 5'-flanking region of the VCP gene to be highly conserved between the two, in which a binding motif of E74-like factor 2/new Ets-related factor (ELF2/NERF) was identified. Chromatin immunoprecipitation assay showed binding of ELF2/NERF to the 5'-flanking region of the VCP gene. Knock-down of ELF2/NERF by siRNA decreased the expression level of VCP. Viability of cells under tumor necrosis factor-alpha treatment was significantly reduced in an ELF2/NERF-knock-down breast cancer cell line. Immunohistochemical analysis on clinical breast cancer specimens showed a correlation of nuclear ELF2/NERF expression with VCP expression and proliferative activity of cells shown by Ki-67 immunohistochemistry. These findings indicate that ELF2/NERF promotes VCP transcription and that the ELF2/NERF-VCP pathway might be important for cell survival and proliferation under cytokine stress.
Molecular cloning and characterization of a gene regulating flowering time from Alfalfa (Medicago sativa L.).

PubMed

Zhang, Tiejun; Chao, Yuehui; Kang, Junmei; Ding, Wang; Yang, Qingchuan

2013-07-01

Genes that regulate flowering time play crucial roles in plant development and biomass formation. Based on the cDNA sequence of Medicago truncatula (accession no. AY690425), the LFY gene of alfalfa was cloned. Sequence similarity analysis revealed high homology with FLO/LFY family genes of other plants. When fused to the green fluorescent protein, MsLFY protein was localized in the nucleus of onion (Allium cepa L.) epidermal cells. RT-qPCR analysis of MsLFY expression patterns showed that MsLFY was expressed at a low level in roots, stems, leaves and pods, and at the highest level in floral buds. The expression of MsLFY was induced by GA3 and long photoperiod. A plant expression vector was constructed and transformed into Arabidopsis by the Agrobacterium-mediated method. PCR amplification of genomic DNA from the transgenic Arabidopsis indicated that the MsLFY gene had integrated into the Arabidopsis genome. Overexpression of MsLFY specifically caused early flowering under long day conditions compared with non-transgenic plants. These results indicate that MsLFY plays a role in promoting flowering.
Comparing the normalization methods for the differential analysis of Illumina high-throughput RNA-Seq data.

PubMed

Li, Peipei; Piao, Yongjun; Shon, Ho Sun; Ryu, Keun Ho

2015-10-28

Recently, rapid improvements in technology and decrease in sequencing costs have made RNA-Seq a widely used technique to quantify gene expression levels. Various normalization approaches have been proposed, owing to the importance of normalization in the analysis of RNA-Seq data. A comparison of recently proposed normalization methods is required to generate suitable guidelines for the selection of the most appropriate approach for future experiments. In this paper, we compared eight non-abundance (RC, UQ, Med, TMM, DESeq, Q, RPKM, and ERPKM) and two abundance estimation normalization methods (RSEM and Sailfish). The experiments were based on real Illumina high-throughput RNA-Seq of 35- and 76-nucleotide sequences produced in the MAQC project and on simulated reads. Reads were mapped to the human genome obtained from the UCSC Genome Browser Database. For precise evaluation, we investigated Spearman correlation between the normalization results from RNA-Seq and MAQC qRT-PCR values for 996 genes. Based on this work, we showed that out of the eight non-abundance estimation normalization methods, RC, UQ, Med, TMM, DESeq, and Q gave similar normalization results for all data sets. For RNA-Seq of a 35-nucleotide sequence, RPKM showed the highest correlation results, but for RNA-Seq of a 76-nucleotide sequence, it showed the lowest correlation of all the methods. ERPKM did not improve on the results of RPKM. Between the two abundance estimation normalization methods, for RNA-Seq of a 35-nucleotide sequence, higher correlation was obtained with Sailfish than with RSEM, which was better than without using abundance estimation methods. However, for RNA-Seq of a 76-nucleotide sequence, the results achieved by RSEM were similar to those obtained without applying abundance estimation methods, and were much better than with Sailfish. Furthermore, we found that adding a poly-A tail increased alignment numbers, but did not improve normalization results. 
Spearman correlation analysis revealed that RC, UQ, Med, TMM, DESeq, and Q did not noticeably improve gene expression normalization, regardless of read length. Other normalization methods were more efficient when alignment accuracy was low; Sailfish with RPKM gave the best normalization results. When alignment accuracy was high, RC was sufficient for gene expression calculation. We also suggest ignoring the poly-A tail during differential gene expression analysis.
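As a rough illustration of quantities named in the abstract above, the following Python sketch computes RPKM, an upper-quartile (UQ) scaling, and a Spearman correlation against reference values. The toy counts, gene lengths, and function names are assumptions made for illustration and are not data or code from the study; the UQ variant shown (dividing by the 75th percentile of nonzero counts) is one common formulation and may differ from the one the authors used.

```python
from statistics import quantiles


def rpkm(counts, lengths_bp):
    """Reads per kilobase of transcript per million mapped reads."""
    total = sum(counts)
    return [c * 1e9 / (total * length) for c, length in zip(counts, lengths_bp)]


def upper_quartile(counts):
    """Scale counts by the 75th percentile of the nonzero counts (assumed variant)."""
    nonzero = [c for c in counts if c > 0]
    q3 = quantiles(nonzero, n=4, method="inclusive")[2]
    return [c / q3 for c in counts]


def ranks(values):
    """1-based ranks; tied values share the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            result[order[k]] = mean_rank
        i = j + 1
    return result


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sum((a - mx) ** 2 for a in x) ** 0.5
    sy = sum((b - my) ** 2 for b in y) ** 0.5
    return cov / (sx * sy)


def spearman(x, y):
    """Spearman correlation: Pearson correlation of the rank vectors."""
    return pearson(ranks(x), ranks(y))


# Hypothetical toy data: read counts, gene lengths (bp), and reference qRT-PCR values.
counts = [100, 200, 300, 400]
lengths = [1000, 2000, 1000, 4000]
qpcr = [1.0, 1.2, 3.5, 0.9]

print("RPKM:", rpkm(counts, lengths))
print("UQ-scaled:", upper_quartile(counts))
print("Spearman (raw counts vs qPCR):", spearman(counts, qpcr))
print("Spearman (RPKM vs qPCR):", spearman(rpkm(counts, lengths), qpcr))
```

Because Spearman correlation depends only on ranks, any normalization that rescales all genes in a sample by one constant (as RC or UQ does) leaves the within-sample correlation unchanged, whereas length-aware measures such as RPKM can reorder genes.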
Characterization of fusion genes and the significantly expressed fusion isoforms in breast cancer by hybrid sequencing

PubMed Central

Weirather, Jason L.; Afshar, Pegah Tootoonchi; Clark, Tyson A.; Tseng, Elizabeth; Powers, Linda S.; Underwood, Jason G.; Zabner, Joseph; Korlach, Jonas; Wong, Wing Hung; Au, Kin Fai

2015-01-01

We developed an innovative hybrid sequencing approach, IDP-fusion, to detect fusion genes, determine fusion sites and identify and quantify fusion isoforms. IDP-fusion is the first method to study gene fusion events by integrating Third Generation Sequencing long reads and Second Generation Sequencing short reads. We applied IDP-fusion to PacBio data and Illumina data from the MCF-7 breast cancer cells. Compared with the existing tools, IDP-fusion detects fusion genes at higher precision and a very low false positive rate. The results show that IDP-fusion will be useful for unraveling the complexity of multiple fusion splices and fusion isoforms within tumorigenesis-relevant fusion genes. PMID:26040699
Expression of caveolin in trabecular meshwork cells and its possible implication in pathogenesis of primary open angle glaucoma

PubMed Central

Surgucheva, Irina

2011-01-01

Purpose: Primary open-angle glaucoma (POAG), which is the most common form of glaucoma, has been associated with a heterogeneous genetic component. A genome-wide association study has identified a common sequence variant at 7q31 (rs4236601 [A]) near the caveolin genes in patients with POAG. Caveolins are a family of integral membrane proteins which participate in many cellular processes, including vesicular transport, cholesterol homeostasis, signal transduction, cell adhesion and migration. The goal of this study was to investigate the expression and regulation of caveolin 1 (CAV-1) and caveolin 2 (CAV-2) in normal and glaucoma trabecular meshwork (TM) cells. Methods: CAV-1 and CAV-2 protein expression was quantified by immunoblot analysis using lysates isolated from primary and immortalized TM cells or TM tissue dissected from normal and POAG eyes. The localization of caveolins in TM cells was assessed by immunofluorescent microscopy. CAV-1 and CAV-2 protein expression was also investigated in TM cells at various time points after subjecting the cells to known glaucomatous insults like dexamethasone (DEX) and transforming growth factor beta2 (TGF-β2) treatment. Phosphorylation of CAV-1 at tyrosine 14 in normal and glaucoma TM cell lines was evaluated using a specific monoclonal antibody (Ab). The 5′ upstream region of the CAV-1 gene was amplified and the sequence variant rs4236601 (A/G polymorphic site) and several putative transcription factor-binding sites were modified by in vitro mutagenesis. The effect of nucleotide sequence modifications in the CAV-1 upstream region on gene expression was assayed in a luciferase-based system in TM and non-TM cells. Results: CAV-1 and CAV-2 are expressed in TM cells, with localization to the cytoplasm and perinuclear region. DEX increased CAV-1 expression in immortalized glaucoma TM cells by 2.8±0.1 (n=3) fold at 24 h and 2.5±0.1 (n=3) fold at 48 h, compared to 1.3±0.06 (n=3) fold at 24 and 48 h in immortalized normal TM cells. 
Phosphorylation of CAV-1 at Tyr14 was reduced by 3.2±0.15 (n=3) fold in glaucomatous TM cells when compared to normal TM cells. In POAG and normal TM tissue, CAV-1 expression was found to be uniform. CAV-2, on the other hand, was variable in independent normal and glaucoma TM tissue. Substitution of a G for an A at base pair −2,388 upstream of the start codon of CAV-1, corresponding to the minor allele rs4236601 [A], increased transcriptional activity in TM and non-TM cells when compared to the native sequence. Deletion analysis of putative transcription factor binding sites in the CAV-1 promoter region caused cell-specific effects on gene expression. Conclusions: CAV-1 and CAV-2 are expressed in normal and glaucoma tissue and TM cell lines. Phosphorylation of Tyr14 in CAV-1 and transcriptional regulation of CAV-1 expression may have a role in glaucomatous alterations in TM cells. PMID:22128235
Coordinate cytokine regulatory sequences

DOEpatents

Frazer, Kelly A.; Rubin, Edward M.; Loots, Gabriela G.

2005-05-10

The present invention provides CNS sequences that regulate the cytokine gene expression, expression cassettes and vectors comprising or lacking the CNS sequences, host cells and non-human transgenic animals comprising the CNS sequences or lacking the CNS sequences. The present invention also provides methods for identifying compounds that modulate the functions of CNS sequences as well as methods for diagnosing defects in the CNS sequences of patients.
Dynamic visual attention: motion direction versus motion magnitude

NASA Astrophysics Data System (ADS)

Bur, A.; Wurtz, P.; Müri, R. M.; Hügli, H.

2008-02-01

Defined as an attentive process in the context of visual sequences, dynamic visual attention refers to the selection of the most informative parts of a video sequence. This paper investigates the contribution of motion in dynamic visual attention, and specifically compares computer models designed with the motion component expressed either as the speed magnitude or as the speed vector. Several computer models, including static features (color, intensity and orientation) and motion features (magnitude and vector), are considered. Qualitative and quantitative evaluations are performed by comparing the computer model output with human saliency maps obtained experimentally from eye movement recordings. The model suitability is evaluated in various situations (synthetic and real sequences, acquired with fixed and moving camera perspective), showing the advantages and drawbacks of each method as well as its preferred domain of application.
Determination of multidrug resistance mechanisms in Clostridium perfringens type A isolates using RNA sequencing and 2D-electrophoresis.

PubMed

Ma, Yu-Hua; Ye, Gui-Sheng

2018-06-11

In this study, we screened differentially expressed genes in a multidrug-resistant isolate strain of Clostridium perfringens by RNA sequencing. We also separated and identified differentially expressed proteins (DEPs) in the isolate strain by two-dimensional electrophoresis (2-DE) and mass spectrometry (MS). The RNA sequencing results showed that, compared with the control strain, 1128 genes were differentially expressed in the isolate strain, and these included 227 up-regulated genes and 901 down-regulated genes. Bioinformatics analysis identified the following genes and gene categories that are potentially involved in multidrug resistance (MDR) in the isolate strain: drug transport, drug response, hydrolase activity, transmembrane transporter, transferase activity, amidase transmembrane transporter, efflux transmembrane transporter, bacterial chemotaxis, ABC transporter, and others. The results of the 2-DE showed that 70 proteins were differentially expressed in the isolate strain, 45 of which were up-regulated and 25 down-regulated. Twenty-seven DEPs were identified by MS and these included the following protein categories: ribosome, antimicrobial peptide resistance, and ABC transporter, all of which may be involved in MDR in the isolate strain of C. perfringens. The results provide reference data for further investigations on the drug resistant molecular mechanisms of C. perfringens.
De novo sequencing and analysis of the cranberry fruit transcriptome to identify putative genes involved in flavonoid biosynthesis, transport and regulation.

PubMed

Sun, Haiyue; Liu, Yushan; Gai, Yuzhuo; Geng, Jinman; Chen, Li; Liu, Hongdi; Kang, Limin; Tian, Youwen; Li, Yadong

2015-09-02

Cranberries (Vaccinium macrocarpon Ait.), renowned for their excellent health benefits, are an important berry crop. Here, we performed transcriptome sequencing of one cranberry cultivar, from fruits at two different developmental stages, on the Illumina HiSeq 2000 platform. Our main goals were to identify putative genes for major metabolic pathways of bioactive compounds and compare the expression patterns between white fruit (W) and red fruit (R) in cranberry. In this study, two cDNA libraries of W and R were constructed. Approximately 119 million raw sequencing reads were generated and assembled de novo, yielding 57,331 high quality unigenes with an average length of 739 bp. Using BLASTx, 38,460 unigenes were identified as putative homologs of annotated sequences in public protein databases, including NCBI NR, NT, Swiss-Prot, KEGG, COG and GO. Of these, 21,898 unigenes mapped to 128 KEGG pathways, with the metabolic pathways, secondary metabolites, glycerophospholipid metabolism, ether lipid metabolism, starch and sucrose metabolism, purine metabolism, and pyrimidine metabolism being well represented. Among them, many candidate genes were involved in flavonoid biosynthesis, transport and regulation. Furthermore, digital gene expression (DEG) analysis identified 3,257 unigenes that were differentially expressed between the two fruit developmental stages. In addition, 14,473 simple sequence repeats (SSRs) were detected. Our results present comprehensive gene expression information about the cranberry fruit transcriptome that could facilitate our understanding of the molecular mechanisms of fruit development in cranberries. Although it will be necessary to validate the functions carried out by these genes, these results could be used to improve the quality of breeding programs for the cranberry and related species.
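The abstract above reports SSR counts but does not name the detection tool or its thresholds. As a minimal sketch of how perfect SSRs can be located in a unigene sequence, the following Python example uses regular expressions; the `MIN_REPEATS` thresholds, the function name `find_ssrs`, and the example sequence are assumptions chosen for illustration only.

```python
import re

# Minimum number of tandem repeats required per motif length.
# These thresholds are illustrative assumptions, not the study's settings.
MIN_REPEATS = {1: 10, 2: 6, 3: 5, 4: 5, 5: 5, 6: 5}


def find_ssrs(seq):
    """Return sorted (start, motif, repeat_count) tuples for perfect SSRs."""
    seq = seq.upper()
    hits = []
    for unit_len, min_rep in MIN_REPEATS.items():
        # A motif of unit_len bases followed by at least (min_rep - 1) copies of itself.
        pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (unit_len, min_rep - 1))
        for match in pattern.finditer(seq):
            motif = match.group(1)
            # Skip motifs that are repeats of a shorter unit (e.g. "ATAT" is just "AT").
            if any(
                motif == motif[:k] * (unit_len // k)
                for k in range(1, unit_len)
                if unit_len % k == 0
            ):
                continue
            hits.append((match.start(), motif, len(match.group(0)) // unit_len))
    return sorted(hits)


# Hypothetical sequence containing one dinucleotide and one trinucleotide repeat.
example = "GG" + "AT" * 7 + "CC" + "GAA" * 5 + "T"
for start, motif, count in find_ssrs(example):
    print(f"position {start}: ({motif}){count}")
```

Only perfect repeats are reported; compound or interrupted microsatellites, which dedicated SSR tools usually handle, are outside the scope of this sketch.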
CoLIde: a bioinformatics tool for CO-expression-based small RNA Loci Identification using high-throughput sequencing data.

PubMed

Mohorianu, Irina; Stocks, Matthew Benedict; Wood, John; Dalmay, Tamas; Moulton, Vincent

2013-07-01

Small RNAs (sRNAs) are 20-25 nt non-coding RNAs that act as guides for the highly sequence-specific regulatory mechanism known as RNA silencing. Due to the recent increase in sequencing depth, a highly complex and diverse population of sRNAs in both plants and animals has been revealed. However, the exponential increase in sequencing data has also made the identification of individual sRNA transcripts corresponding to biological units (sRNA loci) more challenging when based exclusively on the genomic location of the constituent sRNAs, hindering existing approaches to identify sRNA loci. To infer the location of significant biological units, we propose an approach for sRNA loci detection called CoLIde (Co-expression based sRNA Loci Identification) that combines genomic location with the analysis of other information such as variation in expression levels (expression pattern) and size class distribution. For CoLIde, we define a locus as a union of regions sharing the same pattern and located in close proximity on the genome. Biological relevance, detected through the analysis of size class distribution, is also calculated for each locus. CoLIde can be applied on ordered (e.g., time-dependent) or un-ordered (e.g., organ, mutant) series of samples both with or without biological/technical replicates. The method reliably identifies known types of loci and shows improved performance on sequencing data from both plants (e.g., A. thaliana, S. lycopersicum) and animals (e.g., D. melanogaster) when compared with existing locus detection techniques. CoLIde is available for use within the UEA Small RNA Workbench which can be downloaded from: http://srna-workbench.cmp.uea.ac.uk.
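The locus definition in the abstract above (a union of regions that share the same expression pattern and lie close together on the genome) can be pictured with a short Python sketch. This is not the CoLIde implementation: the `Region` type, the pattern strings, and the `max_gap` parameter are hypothetical stand-ins, and the size-class significance step is omitted.

```python
from dataclasses import dataclass


@dataclass
class Region:
    chrom: str
    start: int
    end: int
    pattern: str  # hypothetical encoding, e.g. "UD" = up then down across samples


def build_loci(regions, max_gap=50):
    """Merge neighbouring regions on one chromosome that share a pattern.

    Two consecutive regions are joined when the gap between them is at most
    max_gap bases; a region with a different pattern ends the current locus.
    """
    loci = []
    for region in sorted(regions, key=lambda r: (r.chrom, r.start)):
        last = loci[-1] if loci else None
        if (
            last is not None
            and last.chrom == region.chrom
            and last.pattern == region.pattern
            and region.start - last.end <= max_gap
        ):
            # Extend the current locus to cover the new region.
            last.end = max(last.end, region.end)
        else:
            # Start a new locus (copy, so the input regions are not modified).
            loci.append(Region(region.chrom, region.start, region.end, region.pattern))
    return loci


# Hypothetical regions: the first two merge, the third is too far away,
# and the fourth has a different pattern.
regions = [
    Region("chr1", 100, 120, "UD"),
    Region("chr1", 130, 150, "UD"),
    Region("chr1", 400, 420, "UD"),
    Region("chr1", 425, 440, "DU"),
]
for locus in build_loci(regions):
    print(locus)
```

Grouping by pattern as well as position is what distinguishes this kind of approach from purely location-based merging, where adjacent regions with opposite expression behaviour would be fused into one locus.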
The processing of images of biological threats in visual short-term memory.

PubMed

Quinlan, Philip T; Yue, Yue; Cohen, Dale J

2017-08-30

The idea that there is enhanced memory for negatively, emotionally charged pictures was examined. Performance was measured under rapid, serial visual presentation (RSVP) conditions in which, on every trial, a sequence of six photo-images was presented. Briefly after the offset of the sequence, two alternative images (a target and a foil) were presented and participants attempted to choose which image had occurred in the sequence. Images were of threatening and non-threatening cats and dogs. The target depicted either an animal expressing an emotion distinct from the other images, or the sequences contained only images depicting the same emotional valence. Enhanced memory was found for targets that differed in emotional valence from the other sequence images, compared to targets that expressed the same emotional valence. Further controls in stimulus selection were then introduced and the same emotional distinctiveness effect obtained. In ruling out possible visual and attentional accounts of the data, an informal dual route topic model is discussed. This places emphasis on how visual short-term memory reveals a sensitivity to the emotional content of the input as it unfolds over time. Items that present with a distinctive emotional content stand out in memory. © 2017 The Author(s).
Identification and temporal expression of putative circadian clock transcripts in the amphipod crustacean Talitrus saltator

PubMed Central

O’Grady, Joseph F.; Hoelters, Laura S.; Swain, Martin T.

2016-01-01

Background Talitrus saltator is an amphipod crustacean that inhabits the supralittoral zone on sandy beaches in the Northeast Atlantic and Mediterranean. T. saltator exhibits endogenous locomotor activity rhythms and time-compensated sun and moon orientation, both of which necessitate at least one chronometric mechanism. Whilst their behaviour is well studied, currently there are no descriptions of the underlying molecular components of a biological clock in this animal, and very few in other crustacean species. Methods We harvested brain tissue from animals expressing robust circadian activity rhythms and used homology cloning and Illumina RNAseq approaches to sequence and identify the core circadian clock and clock-related genes in these samples. We assessed the temporal expression of these genes in time-course samples from rhythmic animals using RNAseq. Results We identified a comprehensive suite of circadian clock gene homologues in T. saltator including the ‘core’ clock genes period (Talper), cryptochrome 2 (Talcry2), timeless (Taltim), clock (Talclk), and bmal1 (Talbmal1). In addition we describe the sequence and putative structures of 23 clock-associated genes including two unusual, extended isoforms of pigment dispersing hormone (Talpdh). We examined time-course RNAseq expression data, derived from tissues harvested from behaviourally rhythmic animals, to reveal rhythmic expression of these genes with approximately circadian period in Talper and Talbmal1. Of the clock-related genes, casein kinase IIβ (TalckIIβ), ebony (Talebony), jetlag (Taljetlag), pigment dispersing hormone (Talpdh), protein phosphatase 1 (Talpp1), shaggy (Talshaggy), sirt1 (Talsirt1), sirt7 (Talsirt7) and supernumerary limbs (Talslimb) show temporal changes in expression. Discussion We report the sequences of principal genes that comprise the circadian clock of T. saltator and highlight the conserved structural and functional domains of their deduced cognate proteins.
Our sequencing data contribute to the growing inventory of described comparative clocks. Expression profiling of the identified clock genes illuminates tantalising targets for experimental manipulation to elucidate the molecular and cellular control of clock-driven phenotypes in this crustacean. PMID:27761341

Analysis of expressed sequence tags generated from full-length enriched cDNA libraries of melon

PubMed Central

2011-01-01

Background Melon (Cucumis melo), an economically important vegetable crop, belongs to the Cucurbitaceae family which includes several other important crops such as watermelon, cucumber, and pumpkin. It has served as a model system for sex determination and vascular biology studies. However, genomic resources currently available for melon are limited. Result We constructed eleven full-length enriched and four standard cDNA libraries from fruits, flowers, leaves, roots, cotyledons, and calluses of four different melon genotypes, and generated 71,577 and 22,179 ESTs from full-length enriched and standard cDNA libraries, respectively. These ESTs, together with ~35,000 ESTs available in public domains, were assembled into 24,444 unigenes, which were extensively annotated by comparing their sequences to different protein and functional domain databases, assigning them Gene Ontology (GO) terms, and mapping them onto metabolic pathways. Comparative analysis of melon unigenes and other plant genomes revealed that 75% to 85% of melon unigenes had homologs in other dicot plants, while approximately 70% had homologs in monocot plants. The analysis also identified 6,972 gene families that were conserved across dicot and monocot plants, and 181, 1,192, and 220 gene families specific to fleshy fruit-bearing plants, the Cucurbitaceae family, and melon, respectively. Digital expression analysis identified a total of 175 tissue-specific genes, which provides a valuable gene sequence resource for future genomics and functional studies. Furthermore, we identified 4,068 simple sequence repeats (SSRs) and 3,073 single nucleotide polymorphisms (SNPs) in the melon EST collection. Finally, we obtained a total of 1,382 melon full-length transcripts through the analysis of full-length enriched cDNA clones that were sequenced from both ends. 
Analysis of these full-length transcripts indicated that sizes of melon 5' and 3' UTRs were similar to those of tomato, but longer than many other dicot plants. Codon usages of melon full-length transcripts were largely similar to those of Arabidopsis coding sequences. Conclusion The collection of melon ESTs generated from full-length enriched and standard cDNA libraries is expected to play significant roles in annotating the melon genome. The ESTs and associated analysis results will be useful resources for gene discovery, functional analysis, marker-assisted breeding of melon and closely related species, comparative genomic studies and for gaining insights into gene expression patterns. PMID:21599934
An Integrated Analysis of MicroRNA and mRNA Expression Profiles to Identify RNA Expression Signatures in Lambskin Hair Follicles in Hu Sheep

PubMed Central

Lv, Xiaoyang; Sun, Wei; Yin, Jinfeng; Ni, Rong; Su, Rui; Wang, Qingzeng; Gao, Wen; Bao, Jianjun; Yu, Jiarui; Wang, Lihong; Chen, Ling

2016-01-01

Wave patterns in lambskin hair follicles are an important factor determining the quality of sheep’s wool. Hair follicles in lambskin from Hu sheep, a breed unique to China, have 3 types of waves, designated as large, medium, and small. The quality of wool from small wave follicles is excellent, while the quality of large waves is considered poor. Because no molecular and biological studies on hair follicles of these sheep have been conducted to date, the molecular mechanisms underlying the formation of different wave patterns are currently unknown. The aim of this article was to screen the candidate microRNAs (miRNA) and genes for the development of hair follicles in Hu sheep. Two-day-old Hu lambs were selected from full-sib individuals that showed large, medium, and small waves. Integrated analysis of microRNA and mRNA expression profiles employed high-throughput sequencing technology. Approximately 13, 24, and 18 differentially expressed miRNAs were found between small and large waves, small and medium waves, and medium and large waves, respectively. A total of 54, 190, and 81 differentially expressed genes were found between small and large waves, small and medium waves, and medium and large waves, respectively, by RNA sequencing (RNA-seq) analysis. Differentially expressed genes were classified using gene ontology and pathway analyses. They were found to be mainly involved in cell differentiation, proliferation, apoptosis, growth, immune response, and ion transport, and were associated with MAPK and the Notch signaling pathway. Reverse transcription-polymerase chain reaction (RT-PCR) analyses of differentially expressed miRNA and genes were consistent with sequencing results. Integrated analysis of miRNA and mRNA expression indicated that, compared to small waves, large waves included 4 downregulated miRNAs that had regulatory effects on 8 upregulated genes and 3 upregulated miRNAs, which in turn influenced 13 downregulated genes.
Compared to small waves, medium waves included 13 downregulated miRNAs that had regulatory effects on 64 upregulated genes and 4 upregulated miRNAs, which in turn had regulatory effects on 22 downregulated genes. Compared to medium waves, large waves consisted of 13 upregulated miRNAs that had regulatory effects on 48 downregulated genes. These differentially expressed miRNAs and genes may play a significant role in forming different patterns, and provide evidence for the molecular mechanisms underlying the formation of hair follicles of varying patterns. PMID:27404636
Molecular cloning, sequence characterization and recombinant expression of Nanog gene in goat fibroblast cells using lentiviral based expression system.

PubMed

Singhal, Dinesh K; Singhal, Raxita; Malik, Hruda N; Kumar, Surender; Kumar, Sudarshan; Mohanty, Ashok K; Kaushik, Jai K; Malakar, Dhruba

2014-01-01

Nanog is a homeodomain-containing protein which plays important roles in regulation of signaling pathways for maintenance and induction of pluripotency in stem cells. Because of its unique expression in stem cells it is also regarded as a pluripotency marker. In this study the goat Nanog (gNanog) gene has been amplified, cloned and characterized at sequence level with successful over-expression in the CHO-K1 cell line using a lentiviral based system. The gNanog ORF is 903 bp long and codes for a Nanog protein of 300 amino acids (aas). The complete nucleotide sequence shows some evolutionary mutation in goat in comparison to other species. The protein sequence of goat is highly similar to other species. Overall, gNanog nucleotide sequence and predicted protein sequence showed high similarity and minimum divergence with cattle (96 % identity/4 % divergence) and buffalo (94/5 %) while low similarity and high divergence with pig (84/15 %), human (81/23 %) and mouse (69/40 %) indicating evolutionary closeness of gNanog to cattle and buffalo. A gNanog lentiviral expression construct was prepared for over-expression of the Nanog gene in adult goat fibroblast cells. The lentiviral expression construct of Nanog enabled continuous protein expression for induction and maintenance of pluripotency. Western blotting revealed the expression of the Nanog gene at protein level, which supported that the lentiviral expression system is highly promising for Nanog protein expression in differentiated goat cells.
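The ORF and protein lengths quoted above are consistent with each other: 903 bp is 301 codons, and 300 amino acids remain once the stop codon is set aside. A short Python sketch of that arithmetic follows, together with a simple percent-identity function over a pre-aligned pair of sequences. The abstract does not state how identity and divergence were computed, so `percent_identity` is only an assumed, uncorrected measure, and both function names are hypothetical.

```python
def protein_length_from_orf(orf_bp, includes_stop=True):
    """Number of amino acids encoded by an ORF of orf_bp nucleotides."""
    if orf_bp % 3 != 0:
        raise ValueError("ORF length must be a multiple of 3")
    codons = orf_bp // 3
    # The terminal stop codon does not encode an amino acid.
    return codons - 1 if includes_stop else codons


def percent_identity(aligned_a, aligned_b):
    """Percentage of identical positions between two aligned sequences,
    ignoring columns where either sequence has a gap ('-')."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    columns = [(x, y) for x, y in zip(aligned_a, aligned_b) if x != "-" and y != "-"]
    if not columns:
        raise ValueError("no comparable positions")
    matches = sum(1 for x, y in columns if x == y)
    return 100.0 * matches / len(columns)


print(protein_length_from_orf(903))  # 300
```

Note that the identity and divergence figures reported for human and mouse do not sum to 100 %, so the divergence values were presumably obtained with a different measure than one minus the identity computed here.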
Efficient Coproduction of Mannanase and Cellulase by the Transformation of a Codon-Optimized Endomannanase Gene from Aspergillus niger into Trichoderma reesei.

PubMed

Sun, Xianhua; Xue, Xianli; Li, Mengzhu; Gao, Fei; Hao, Zhenzhen; Huang, Huoqing; Luo, Huiying; Qin, Lina; Yao, Bin; Su, Xiaoyun

2017-12-20

Cellulase and mannanase are both important enzyme additives in animal feeds. Expressing the two enzymes simultaneously within one microbial host could potentially lead to cost reductions in the feeding of animals. For this purpose, we codon-optimized the Aspergillus niger Man5A gene to the codon-usage bias of Trichoderma reesei. By comparing the free energies and the local structures of the nucleotide sequences, one optimized sequence was finally selected and transformed into the T. reesei pyridine-auxotrophic strain TU-6. The codon-optimized gene was expressed to a higher level than the original one. Further expressing the codon-optimized gene in a mutated T. reesei strain through fed-batch cultivation resulted in coproduction of cellulase and mannanase up to 1376 U·mL⁻¹ and 1204 U·mL⁻¹, respectively.
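The general idea of codon optimization, replacing each codon with a synonymous codon preferred by the expression host, can be sketched in Python as below. This is a toy illustration only: the `PREFERRED` table is hypothetical and is not real Trichoderma reesei codon-usage data, the codon table covers only three amino acids plus stop, and the free-energy and local-structure comparison mentioned in the abstract is not modeled.

```python
# Partial standard genetic code (only the codons used in this toy example).
CODON_TO_AA = {
    "ATG": "M",
    "AAA": "K", "AAG": "K",
    "GGT": "G", "GGC": "G", "GGA": "G", "GGG": "G",
    "TAA": "*", "TAG": "*", "TGA": "*",
}

# Hypothetical host-preferred codon per amino acid (assumption, not measured data).
PREFERRED = {"M": "ATG", "K": "AAG", "G": "GGC", "*": "TAA"}


def codons(seq):
    """Split a coding sequence into consecutive triplets."""
    if len(seq) % 3 != 0:
        raise ValueError("coding sequence length must be a multiple of 3")
    return [seq[i:i + 3] for i in range(0, len(seq), 3)]


def translate(seq):
    """Translate a coding sequence using the partial table above."""
    return "".join(CODON_TO_AA[c] for c in codons(seq))


def optimize(seq):
    """Replace every codon with the preferred synonymous codon."""
    return "".join(PREFERRED[CODON_TO_AA[c]] for c in codons(seq))


def gc_content(seq):
    """Fraction of G and C nucleotides in the sequence."""
    return sum(1 for base in seq if base in "GC") / len(seq)


native = "ATGAAAGGTTGA"
optimized = optimize(native)
print(optimized)  # ATGAAGGGCTAA
```

Because only synonymous substitutions are made, `translate(native)` and `translate(optimized)` give the same protein; in practice a designer would generate several such candidates and choose among them using additional criteria.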
Cloning, annotation and expression analysis of mycoparasitism-related genes in Trichoderma harzianum 88.

PubMed

Yao, Lin; Yang, Qian; Song, Jinzhu; Tan, Chong; Guo, Changhong; Wang, Li; Qu, Lianhai; Wang, Yun

2013-04-01

Trichoderma harzianum 88, a filamentous soil fungus, is an effective biocontrol agent against several plant pathogens. High-throughput sequencing was used here to study the mycoparasitism mechanisms of T. harzianum 88. Plate confrontation tests of T. harzianum 88 against plant pathogens were conducted, and a cDNA library was constructed from T. harzianum 88 mycelia in the presence of plant pathogen cell walls. Randomly selected transcripts from the cDNA library were compared with eukaryotic plant and fungal genomes. Of the 1,386 transcripts sequenced, the most abundant Gene Ontology (GO) classification group was "physiological process". Differential expression of 19 genes was confirmed by real-time RT-PCR at different mycoparasitism stages against plant pathogens. Gene expression analysis revealed the transcription of various genes involved in mycoparasitism of T. harzianum 88. Our study provides helpful insights into the mechanisms of T. harzianum 88-plant pathogen interactions.
Influence of silencing soluble epoxide hydrolase with RNA interference on cardiomyocytes apoptosis induced by doxorubicin.

PubMed

Du, Guangsheng; Lv, Jiagao; He, Li; Ma, Yexin

2011-06-01

In order to investigate the influence of silencing soluble epoxide hydrolase (sEH) with double-stranded small interfering RNA (siRNA) on cardiomyocytes apoptosis induced by doxorubicin (DOX), two plasmids containing siRNA sequences specific to sEH were constructed and transfected into the primary cultured cardiomyocytes by using FuGENE HD transfection agents. The mRNA and protein expression levels of sEH were detected by semiquantitative RT-PCR and Western blotting respectively, and the plasmids that silenced sEH most significantly were selected, and renamed EH-R. The plasmids carrying a nonspecific siRNA coding sequence (PCN) served as the negative control. Cardiomyocytes were divided into four groups: control group, DOX group, PCN+DOX group, and EH-R+DOX group. Apoptosis of cardiomyocytes was induced by DOX at a concentration of 1 μmol/L. Apoptosis rate of cardiomyocytes was determined by flow cytometry. The protein expression levels of Bcl-2 and Bax were detected by Western blotting. The results showed that the expression of sEH was down-regulated by EH-R plasmid. The expression levels of sEH mRNA and protein in the EH-R+DOX group were significantly decreased as compared with other groups (P<0.01). As compared with the control group, the apoptosis rate of cardiomyocytes in three DOX-treated groups was obviously increased, the expression levels of Bax increased, and those of Bcl-2 decreased (P<0.01). However, the expression levels of Bax were decreased, those of Bcl-2 increased and the apoptosis rate of cardiomyocytes obviously decreased in EH-R+DOX group when compared with those in the DOX group and the PCN+DOX group (P<0.01 for each). It was concluded that the recombinant plasmids could be successfully constructed, and transfected into the primary cultured cardiomyocytes. They could ameliorate the DOX-induced cardiomyocytes apoptosis by selectively inhibiting the expression of sEH with RNAi and increasing the expression of Bcl-2.
Many human accelerated regions are developmental enhancers

PubMed Central

Capra, John A.; Erwin, Genevieve D.; McKinsey, Gabriel; Rubenstein, John L. R.; Pollard, Katherine S.

2013-01-01

The genetic changes underlying the dramatic differences in form and function between humans and other primates are largely unknown, although it is clear that gene regulatory changes play an important role. To identify regulatory sequences with potentially human-specific functions, we and others used comparative genomics to find non-coding regions conserved across mammals that have acquired many sequence changes in humans since divergence from chimpanzees. These regions are good candidates for performing human-specific regulatory functions. Here, we analysed the DNA sequence, evolutionary history, histone modifications, chromatin state and transcription factor (TF) binding sites of a combined set of 2649 non-coding human accelerated regions (ncHARs) and predicted that at least 30% of them function as developmental enhancers. We prioritized the predicted ncHAR enhancers using analysis of TF binding site gain and loss, along with the functional annotations and expression patterns of nearby genes. We then tested both the human and chimpanzee sequence for 29 ncHARs in transgenic mice, and found 24 novel developmental enhancers active in both species, 17 of which had very consistent patterns of activity in specific embryonic tissues. Of these ncHAR enhancers, five drove expression patterns suggestive of different activity for the human and chimpanzee sequence at embryonic day 11.5. The changes to human non-coding DNA in these ncHAR enhancers may modify the complex patterns of gene expression necessary for proper development in a human-specific manner and are thus promising candidates for understanding the genetic basis of human-specific biology. PMID:24218637
Analysis of expressed sequence tags from Uromyces appendiculatus hyphae and haustoria and their comparison to sequences from other rust fungi.

PubMed

Puthoff, D P; Neelam, A; Ehrenfried, M L; Scheffler, B E; Ballard, L; Song, Q; Campbell, K B; Cooper, B; Tucker, M L

2008-10-01

Hyphae, 2 to 8 days postinoculation (dpi), and haustoria, 5 dpi, were isolated from Uromyces appendiculatus infected bean leaves (Phaseolus vulgaris cv. Pinto 111) and a separate cDNA library prepared for each fungal preparation. Approximately 10,000 hyphae and 2,700 haustoria clones were sequenced from both the 5' and 3' ends. Assembly of all of the fungal sequences yielded 3,359 contigs and 927 singletons. The U. appendiculatus sequences were compared with sequence data for other rust fungi, Phakopsora pachyrhizi, Uromyces fabae, and Puccinia graminis. The U. appendiculatus haustoria library included a large number of genes with unknown cellular function; however, summation of sequences of known cellular function suggested that haustoria at 5 dpi had fewer transcripts linked to protein synthesis in favor of energy metabolism and nutrient uptake. In addition, open reading frames in the U. appendiculatus data set with an N-terminal signal peptide were identified and compared with other proteins putatively secreted from rust fungi. In this regard, a small family of putatively secreted RTP1-like proteins was identified in U. appendiculatus and P. graminis.
QNB: differential RNA methylation analysis for count-based small-sample sequencing data with a quad-negative binomial model.

PubMed

Liu, Lian; Zhang, Shao-Wu; Huang, Yufei; Meng, Jia

2017-08-31

As a newly emerged research area, RNA epigenetics has drawn increasing attention recently for the participation of RNA methylation and other modifications in a number of crucial biological processes. Thanks to high-throughput sequencing techniques such as MeRIP-Seq, transcriptome-wide RNA methylation profiles are now available in the form of count-based data, with which it is often of interest to study the dynamics at the epitranscriptomic layer. However, the sample size of an RNA methylation experiment is usually very small due to its cost; additionally, there usually exist a large number of genes whose methylation level cannot be accurately estimated due to their low expression level, making differential RNA methylation analysis a difficult task. We present QNB, a statistical approach for differential RNA methylation analysis with count-based small-sample sequencing data. Compared with previous approaches such as the DRME model, which is based on a statistical test covering the IP samples only with 2 negative binomial distributions, QNB is based on 4 independent negative binomial distributions with their variances and means linked by local regressions, and in this way the input control samples are also properly taken care of. In addition, unlike the DRME approach, which relies only on the input control sample for estimating the background, QNB uses a more robust estimator for gene expression by combining information from both input and IP samples, which could largely improve the testing performance for very lowly expressed genes. QNB showed improved performance on both simulated and real MeRIP-Seq datasets when compared with competing algorithms. The QNB model is also applicable to other datasets related to RNA modifications, including but not limited to RNA bisulfite sequencing, m1A-Seq, Par-CLIP, RIP-Seq, etc.
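To make the "variance linked to mean" idea concrete, the following Python sketch estimates a negative binomial dispersion for each of four count groups by the method of moments. It is not the QNB estimator: the parameterization var = mu + phi * mu^2 is a common convention assumed here, the local regression step is omitted, the grouping into IP and input samples under two conditions is an assumed reading of the "4 independent negative binomial distributions", and the counts are made-up illustrative numbers.

```python
from statistics import mean, variance


def nb_dispersion_mom(counts):
    """Method-of-moments dispersion phi under var = mu + phi * mu**2.

    Returns 0.0 when the sample variance does not exceed the mean
    (no overdispersion detectable) or when the mean is zero.
    """
    mu = mean(counts)
    if mu == 0:
        return 0.0
    var = variance(counts)  # sample variance, needs at least two replicates
    return max((var - mu) / mu ** 2, 0.0)


# Hypothetical read counts for one gene (illustrative values only).
groups = {
    "IP, condition 1": [2, 4, 6, 8],
    "input, condition 1": [20, 22, 19, 21],
    "IP, condition 2": [10, 30, 15, 25],
    "input, condition 2": [18, 25, 20, 23],
}

for name, counts in groups.items():
    print(name, round(nb_dispersion_mom(counts), 4))
```

With so few replicates per group, per-gene moment estimates like these are very noisy, which is the motivation the abstract gives for sharing information across genes and across input and IP samples.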
Comparative transgenic analysis of enhancers from the human SHOX and mouse Shox2 genomic regions.

PubMed

Rosin, Jessica M; Abassah-Oppong, Samuel; Cobb, John

2013-08-01

Disruption of presumptive enhancers downstream of the human SHOX gene (hSHOX) is a frequent cause of the zeugopodal limb defects characteristic of Léri-Weill dyschondrosteosis (LWD). The closely related mouse Shox2 gene (mShox2) is also required for limb development, but in the more proximal stylopodium. In this study, we used transgenic mice in a comparative approach to characterize enhancer sequences in the hSHOX and mShox2 genomic regions. Among conserved noncoding elements (CNEs) that function as enhancers in vertebrate genomes, those that are maintained near paralogous genes are of particular interest given their ancient origins. Therefore, we first analyzed the regulatory potential of a genomic region containing one such duplicated CNE (dCNE) downstream of mShox2 and hSHOX. We identified a strong limb enhancer directly adjacent to the mShox2 dCNE that recapitulates the expression pattern of the endogenous gene. Interestingly, this enhancer requires sequences only conserved in the mammalian lineage in order to drive strong limb expression, whereas the more deeply conserved sequences of the dCNE function as a neural enhancer. Similarly, we found that a conserved element downstream of hSHOX (CNE9) also functions as a neural enhancer in transgenic mice. However, when the CNE9 transgenic construct was enlarged to include adjacent, non-conserved sequences frequently deleted in LWD patients, the transgene drove expression in the zeugopodium of the limbs. Therefore, both hSHOX and mShox2 limb enhancers are coupled to distinct neural enhancers. This is the first report demonstrating the activity of cis-regulatory elements from the hSHOX and mShox2 genomic regions in mammalian embryos.
Methods and compositions for regulating gene expression in plant cells

NASA Technical Reports Server (NTRS)

Dai, Shunhong (Inventor); Beachy, Roger N. (Inventor); Luis, Maria Isabel Ordiz (Inventor)

2010-01-01

Novel chimeric plant promoter sequences are provided, together with plant gene expression cassettes comprising such sequences. In certain preferred embodiments, the chimeric plant promoters comprise the BoxII cis element and/or derivatives thereof. In addition, novel transcription factors are provided, together with nucleic acid sequences encoding such transcription factors and plant gene expression cassettes comprising such nucleic acid sequences. In certain preferred embodiments, the novel transcription factors comprise the acidic domain, or fragments thereof, of the RF2a transcription factor. Methods for using the chimeric plant promoter sequences and novel transcription factors in regulating the expression of at least one gene of interest are provided, together with transgenic plants comprising such chimeric plant promoter sequences and novel transcription factors.
CoNekT: an open-source framework for comparative genomic and transcriptomic network analyses.

PubMed

Proost, Sebastian; Mutwil, Marek

2018-05-01

The recent accumulation of gene expression data in the form of RNA sequencing creates unprecedented opportunities to study gene regulation and function. Furthermore, comparative analysis of the expression data from multiple species can elucidate which functional gene modules are conserved across species, allowing the study of the evolution of these modules. However, performing such comparative analyses on raw data is not feasible for many biologists. Here, we present CoNekT (Co-expression Network Toolkit), an open source web server, that contains user-friendly tools and interactive visualizations for comparative analyses of gene expression data and co-expression networks. These tools allow analysis and cross-species comparison of (i) gene expression profiles; (ii) co-expression networks; (iii) co-expressed clusters involved in specific biological processes; (iv) tissue-specific gene expression; and (v) expression profiles of gene families. To demonstrate these features, we constructed CoNekT-Plants for green alga, seed plants and flowering plants (Picea abies, Chlamydomonas reinhardtii, Vitis vinifera, Arabidopsis thaliana, Oryza sativa, Zea mays and Solanum lycopersicum) and thus provide a web-tool with the broadest available collection of plant phyla. CoNekT-Plants is freely available from http://conekt.plant.tools, while the CoNekT source code and documentation can be found at https://github.molgen.mpg.de/proost/CoNekT/.
Radiosensitivity in HeLa cervical cancer cells overexpressing glutathione S-transferase π 1

PubMed Central

YANG, LIANG; LIU, REN; MA, HONG-BIN; YING, MING-ZHEN; WANG, YA-JIE

2015-01-01

The aims of the present study were to investigate the effect of overexpressed exogenous glutathione S-transferase π 1 (GSTP1) gene on the radiosensitivity of the HeLa human cervical cancer cell line and conduct a preliminary investigation into the underlying mechanisms of the effect. The full-length sequence of human GSTP1 was obtained by performing a polymerase chain reaction (PCR) using primers based on the GenBank sequence of GSTP1. Subsequently, the gene was cloned into a recombinant eukaryotic expression plasmid, and the resulting construct was confirmed by restriction analysis and DNA sequencing. A HeLa cell line that was stably expressing high levels of GSTP1 was obtained through stable transfection of the constructed plasmids using lipofectamine and screening for G418 resistance, as demonstrated by reverse transcription-PCR. Using the transfected HeLa cells, a colony formation assay was conducted to detect the influence of GSTP1 overexpression on the cell radiosensitivity. Furthermore, flow cytometry was used to investigate the effect of GSTP1 overexpression on cell cycle progression, with the protein expression levels of the cell cycle regulating factor cyclin B1 detected using western blot analysis. Colony formation and G2/M phase arrest in the GSTP1-expressing cells were significantly increased compared with the control group (P<0.01). In addition, the expression of cyclin B1 was significantly reduced in the GSTP1-expressing cells. These results demonstrated that increased expression of GSTP1 inhibits radiosensitivity in HeLa cells. The mechanism underlying this effect may be associated with the ability of the GSTP1 protein to reduce cyclin B1 expression, resulting in significant G2/M phase arrest. PMID:26622693
Radiosensitivity in HeLa cervical cancer cells overexpressing glutathione S-transferase π 1.

PubMed

Yang, Liang; Liu, Ren; Ma, Hong-Bin; Ying, Ming-Zhen; Wang, Ya-Jie

2015-09-01

The aims of the present study were to investigate the effect of overexpressed exogenous glutathione S-transferase π 1 (GSTP1) gene on the radiosensitivity of the HeLa human cervical cancer cell line and conduct a preliminary investigation into the underlying mechanisms of the effect. The full-length sequence of human GSTP1 was obtained by performing a polymerase chain reaction (PCR) using primers based on the GenBank sequence of GSTP1. Subsequently, the gene was cloned into a recombinant eukaryotic expression plasmid, and the resulting construct was confirmed by restriction analysis and DNA sequencing. A HeLa cell line that was stably expressing high levels of GSTP1 was obtained through stable transfection of the constructed plasmids using lipofectamine and screening for G418 resistance, as demonstrated by reverse transcription-PCR. Using the transfected HeLa cells, a colony formation assay was conducted to detect the influence of GSTP1 overexpression on the cell radiosensitivity. Furthermore, flow cytometry was used to investigate the effect of GSTP1 overexpression on cell cycle progression, with the protein expression levels of the cell cycle regulating factor cyclin B1 detected using western blot analysis. Colony formation and G2/M phase arrest in the GSTP1-expressing cells were significantly increased compared with the control group (P<0.01). In addition, the expression of cyclin B1 was significantly reduced in the GSTP1-expressing cells. These results demonstrated that increased expression of GSTP1 inhibits radiosensitivity in HeLa cells. The mechanism underlying this effect may be associated with the ability of the GSTP1 protein to reduce cyclin B1 expression, resulting in significant G2/M phase arrest.
The membrane skeleton in Paramecium: Molecular characterization of a novel epiplasmin family and preliminary GFP expression results.

PubMed

Pomel, Sébastien; Diogon, Marie; Bouchard, Philippe; Pradel, Lydie; Ravet, Viviane; Coffe, Gérard; Viguès, Bernard

2006-02-01

Previous attempts to identify the membrane skeleton of Paramecium cells have revealed a protein pattern that is both complex and specific. The most prominent structural elements, epiplasmic scales, are centered around ciliary units and are closely apposed to the cytoplasmic side of the inner alveolar membrane. We sought to characterize epiplasmic scale proteins (epiplasmins) at the molecular level. PCR approaches enabled the cloning and sequencing of two closely related genes by amplifications of sequences from a macronuclear genomic library. Using these two genes (EPI-1 and EPI-2), we have contributed to the annotation of the Paramecium tetraurelia macronuclear genome and identified 39 additional (paralogous) sequences. Two orthologous sequences were found in the Tetrahymena thermophila genome. Structural analysis of the 43 sequences indicates that the hallmark of this new multigenic family is a 79 aa domain flanked by two Q-, P- and V-rich stretches of sequence that are much more variable in amino-acid composition. Such features clearly distinguish members of the multigenic family from epiplasmic proteins previously sequenced in other ciliates. The expression of Green Fluorescent Protein (GFP)-tagged epiplasmin showed significant labeling of epiplasmic scales as well as oral structures. We expect that the GFP construct described herein will prove to be a useful tool for comparative subcellular localization of different putative epiplasmins in Paramecium.
Quantitative Antisense Screening and Optimization for Exon 51 Skipping in Duchenne Muscular Dystrophy.

PubMed

Echigoya, Yusuke; Lim, Kenji Rowel Q; Trieu, Nhu; Bao, Bo; Miskew Nichols, Bailey; Vila, Maria Candida; Novak, James S; Hara, Yuko; Lee, Joshua; Touznik, Aleksander; Mamchaoui, Kamel; Aoki, Yoshitsugu; Takeda, Shin'ichi; Nagaraju, Kanneboyina; Mouly, Vincent; Maruyama, Rika; Duddy, William; Yokota, Toshifumi

2017-11-01

Duchenne muscular dystrophy (DMD), the most common lethal genetic disorder, is caused by mutations in the dystrophin (DMD) gene. Exon skipping is a therapeutic approach that uses antisense oligonucleotides (AOs) to modulate splicing and restore the reading frame, leading to truncated, yet functional protein expression. In 2016, the US Food and Drug Administration (FDA) conditionally approved the first phosphorodiamidate morpholino oligomer (morpholino)-based AO drug, eteplirsen, developed for DMD exon 51 skipping. Eteplirsen remains controversial with insufficient evidence of its therapeutic effect in patients. We recently developed an in silico tool to design antisense morpholino sequences for exon skipping. Here, we designed morpholino AOs targeting DMD exon 51 using the in silico tool and quantitatively evaluated the effects in immortalized DMD muscle cells in vitro. To our surprise, most of the newly designed morpholinos induced exon 51 skipping more efficiently compared with the eteplirsen sequence. The efficacy of exon 51 skipping and rescue of dystrophin protein expression were increased by up to more than 12-fold and 7-fold, respectively, compared with the eteplirsen sequence. Significant in vivo efficacy of the most effective morpholino, determined in vitro, was confirmed in mice carrying the human DMD gene. These findings underscore the importance of AO sequence optimization for exon skipping. Copyright © 2017 The American Society of Gene and Cell Therapy. Published by Elsevier Inc. All rights reserved.
Synthetic versions of firefly luciferase and Renilla luciferase reporter genes that resist transgene silencing in sugarcane

PubMed Central

2014-01-01

Background Down-regulation or silencing of transgene expression can be a major hurdle to both molecular studies and biotechnology applications in many plant species. Sugarcane is particularly effective at silencing introduced transgenes, including reporter genes such as the firefly luciferase gene. Synthesizing transgene coding sequences optimized for usage in the host plant is one method of enhancing transgene expression and stability. Using specified design rules we have synthesised new coding sequences for both the firefly luciferase and Renilla luciferase reporter genes. We have tested these optimized versions for enhanced levels of luciferase activity and for increased steady state luciferase mRNA levels in sugarcane. Results The synthetic firefly luciferase (luc*) and Renilla luciferase (Renluc*) coding sequences have elevated G + C contents in line with sugarcane codon usage, but maintain 75% identity to the native firefly or Renilla luciferase nucleotide sequences and 100% identity to the protein coding sequences. Under the control of the maize pUbi promoter, the synthetic luc* and Renluc* genes yielded 60x and 15x higher luciferase activity respectively, over the native firefly and Renilla luciferase genes in transient assays on sugarcane suspension cell cultures. Using a novel transient assay in sugarcane suspension cells combining co-bombardment and qRT-PCR, we showed that synthetic luc* and Renluc* genes generate increased transcript levels compared to the native firefly and Renilla luciferase genes. In stable transgenic lines, the luc* transgene generated significantly higher levels of expression than the native firefly luciferase transgene. The fold difference in expression was highest in the youngest tissues. Conclusions We developed synthetic versions of both the firefly and Renilla luciferase reporter genes that resist transgene silencing in sugarcane. 
These transgenes will be particularly useful for evaluating the expression patterns conferred by existing and newly isolated promoters in sugarcane tissues. The strategies used to design the synthetic luciferase transgenes could be applied to other transgenes that are aggressively silenced in sugarcane. PMID:24708613
Synthetic versions of firefly luciferase and Renilla luciferase reporter genes that resist transgene silencing in sugarcane.

PubMed

Chou, Ting-Chun; Moyle, Richard L

2014-04-08

Down-regulation or silencing of transgene expression can be a major hurdle to both molecular studies and biotechnology applications in many plant species. Sugarcane is particularly effective at silencing introduced transgenes, including reporter genes such as the firefly luciferase gene. Synthesizing transgene coding sequences optimized for usage in the host plant is one method of enhancing transgene expression and stability. Using specified design rules we have synthesised new coding sequences for both the firefly luciferase and Renilla luciferase reporter genes. We have tested these optimized versions for enhanced levels of luciferase activity and for increased steady state luciferase mRNA levels in sugarcane. The synthetic firefly luciferase (luc*) and Renilla luciferase (Renluc*) coding sequences have elevated G + C contents in line with sugarcane codon usage, but maintain 75% identity to the native firefly or Renilla luciferase nucleotide sequences and 100% identity to the protein coding sequences. Under the control of the maize pUbi promoter, the synthetic luc* and Renluc* genes yielded 60x and 15x higher luciferase activity respectively, over the native firefly and Renilla luciferase genes in transient assays on sugarcane suspension cell cultures. Using a novel transient assay in sugarcane suspension cells combining co-bombardment and qRT-PCR, we showed that synthetic luc* and Renluc* genes generate increased transcript levels compared to the native firefly and Renilla luciferase genes. In stable transgenic lines, the luc* transgene generated significantly higher levels of expression than the native firefly luciferase transgene. The fold difference in expression was highest in the youngest tissues. We developed synthetic versions of both the firefly and Renilla luciferase reporter genes that resist transgene silencing in sugarcane.
These transgenes will be particularly useful for evaluating the expression patterns conferred by existing and newly isolated promoters in sugarcane tissues. The strategies used to design the synthetic luciferase transgenes could be applied to other transgenes that are aggressively silenced in sugarcane.
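The idea of raising G + C content while keeping 100% protein identity can be sketched with synonymous codon substitution. The code below is a toy illustration only: it greedily picks the most GC-rich synonymous codon under the standard genetic code and is not the design rule set used by the authors. The input sequence and all function names are hypothetical.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def gc_fraction(seq):
    """Fraction of G or C bases in a nucleotide sequence."""
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate a coding sequence whose length is a multiple of three."""
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


def gc_enrich(seq):
    """Replace each codon with its most GC-rich synonym (toy rule)."""
    out = []
    for i in range(0, len(seq), 3):
        codon = seq[i:i + 3]
        synonyms = [c for c, aa in CODON_TABLE.items() if aa == CODON_TABLE[codon]]
        # Prefer higher GC count; keep the original codon when it ties for best.
        out.append(max(synonyms, key=lambda c: (gc_fraction(c), c == codon)))
    return "".join(out)


def nucleotide_identity(a, b):
    """Fraction of identical positions between two equal-length sequences."""
    return sum(x == y for x, y in zip(a, b)) / len(a)


native = "ATGGAAGATGCTAAA"  # hypothetical fragment, not a real luciferase sequence
synthetic = gc_enrich(native)
print(translate(native) == translate(synthetic))  # protein sequence is unchanged
print(gc_fraction(native), gc_fraction(synthetic))
print(nucleotide_identity(native, synthetic))
```

A real design would weight codons by host usage frequencies and screen for unwanted motifs rather than maximizing GC content alone.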
Isolation, characterization, and evaluation of three Citrus sinensis-derived constitutive gene promoters.

PubMed

Erpen, L; Tavano, E C R; Harakava, R; Dutt, M; Grosser, J W; Piedade, S M S; Mendes, B M J; Mourão Filho, F A A

2018-05-23

Regulatory sequences from the citrus constitutive genes cyclophilin (CsCYP), glyceraldehyde-3-phosphate dehydrogenase C2 (CsGAPC2), and elongation factor 1-alpha (CsEF1) were isolated, fused to the uidA gene, and qualitatively and quantitatively evaluated in transgenic sweet orange plants. The 5' upstream region of a gene (the promoter) is the most important component for the initiation and regulation of gene transcription of both native genes and transgenes in plants. The isolation and characterization of gene regulatory sequences are essential to the development of intragenic or cisgenic genetic manipulation strategies, which imply the use of genetic material from the same species or from closely related species. We describe herein the isolation and evaluation of the promoter sequence from three constitutively expressed citrus genes: cyclophilin (CsCYP), glyceraldehyde-3-phosphate dehydrogenase C2 (CsGAPC2), and elongation factor 1-alpha (CsEF1). The functionality of the promoters was confirmed by a histochemical GUS assay in leaves, stems, and roots of stably transformed citrus plants expressing the promoter-uidA construct. Lower uidA mRNA levels were detected when the transgene was under the control of citrus promoters as compared to the expression under the control of the CaMV35S promoter. The association of the uidA gene with the citrus-derived promoters resulted in mRNA levels of up to 60-41.8% of the value obtained with the construct containing CaMV35S driving the uidA gene. Moreover, a lower inter-individual variability in transgene expression was observed amongst the different transgenic lines, where gene constructs containing citrus-derived promoters were used. In silico analysis of the citrus-derived promoter sequences revealed that their activity may be controlled by several putative cis-regulatory elements. These citrus promoters will expand the availability of regulatory sequences for driving gene expression in citrus gene-modification programs.
Sequences 5' to translation start regulate expression of petunia rbcS genes.

PubMed Central

Dean, C; Favreau, M; Bedbrook, J; Dunsmuir, P

1989-01-01

The promoter sequences that contribute to quantitative differences in expression of the petunia genes (rbcS) encoding the small subunit of ribulose bisphosphate carboxylase have been characterized. The promoter regions of the two most abundantly expressed petunia rbcS genes, SSU301 and SSU611, show sequence similarity not present in other rbcS genes. We investigated the significance of these and other sequences by adding specific regions from the SSU301 promoter (the most strongly expressed gene) to equivalent regions in the SSU911 promoter (the least strongly expressed gene) and assaying the expression of the fusions in transgenic tobacco plants. In this way, we characterized an SSU301 promoter region (either from -285 to -178 or -291 to -204) which, when added to SSU911, in either orientation, increased SSU911 expression 25-fold. This increase was equivalent to that caused by addition of the entire SSU301 5'-flanking region. Replacement of SSU911 promoter sequences between -198 and the start codon with sequences from the equivalent region of SSU301 did not increase SSU911 expression significantly. The -291 to -204 SSU301 promoter fragment contributes significantly to quantitative differences in expression between the petunia rbcS genes. PMID:2535543

A comprehensive assessment of RNA-seq accuracy, reproducibility and information content by the Sequencing Quality Control consortium

PubMed Central

2014-01-01

We present primary results from the Sequencing Quality Control (SEQC) project, coordinated by the United States Food and Drug Administration. Examining Illumina HiSeq, Life Technologies SOLiD and Roche 454 platforms at multiple laboratory sites using reference RNA samples with built-in controls, we assess RNA sequencing (RNA-seq) performance for junction discovery and differential expression profiling and compare it to microarray and quantitative PCR (qPCR) data using complementary metrics. At all sequencing depths, we discover unannotated exon-exon junctions, with >80% validated by qPCR. We find that measurements of relative expression are accurate and reproducible across sites and platforms if specific filters are used. In contrast, RNA-seq and microarrays do not provide accurate absolute measurements, and gene-specific biases are observed, for these and qPCR. Measurement performance depends on the platform and data analysis pipeline, and variation is large for transcript-level profiling. The complete SEQC data sets, comprising >100 billion reads (10Tb), provide unique resources for evaluating RNA-seq analyses for clinical and regulatory settings. PMID:25150838
Characterization of genic microsatellite markers derived from expressed sequence tags in Pacific abalone (Haliotis discus hannai)

NASA Astrophysics Data System (ADS)

Li, Qi; Shu, Jing; Zhao, Cui; Liu, Shikai; Kong, Lingfeng; Zheng, Xiaodong

2010-01-01

Simple sequence repeat (SSR) markers were developed from the expressed sequence tags (ESTs) of Pacific abalone (Haliotis discus hannai). Repeat motifs were found in 4.95% of the ESTs at a frequency of one repeat every 10.04 kb of EST sequences, after redundancy elimination. Seventeen polymorphic EST-SSRs were developed. The number of alleles per locus varied from 2 to 17, with an average of 6.8 alleles per locus. The expected and observed heterozygosities ranged from 0.159 to 0.928 and from 0.132 to 0.922, respectively. Twelve of the 17 loci (70.6%) were successfully amplified in H. diversicolor. Seventeen loci segregated in three families, with three showing the presence of null alleles (17.6%). The adequate level of variability and low frequency of null alleles observed in H. discus hannai, together with the high rate of transportability across Haliotis species, make this set of EST-SSR markers an important tool for comparative mapping, marker-assisted selection, and evolutionary studies, not only in the Pacific abalone, but also in related species.
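The per-locus statistics reported for microsatellite markers can be computed as in the following sketch. It assumes the common definitions: expected heterozygosity as one minus the sum of squared allele frequencies, and observed heterozygosity as the fraction of heterozygous individuals. The abstract does not state which estimator was used, and the function names and example genotypes are hypothetical.

```python
def expected_heterozygosity(allele_freqs):
    """Expected heterozygosity, He = 1 - sum(p_i^2), for one locus."""
    if abs(sum(allele_freqs) - 1.0) > 1e-6:
        raise ValueError("allele frequencies must sum to 1")
    return 1.0 - sum(p * p for p in allele_freqs)


def observed_heterozygosity(genotypes):
    """Fraction of individuals carrying two different alleles at one locus."""
    heterozygous = sum(1 for a, b in genotypes if a != b)
    return heterozygous / len(genotypes)


# Hypothetical two-allele locus scored in four individuals.
genotypes = [("A", "B"), ("A", "A"), ("B", "B"), ("A", "B")]
print(expected_heterozygosity([0.5, 0.5]))
print(observed_heterozygosity(genotypes))

# The percentages quoted in the abstract follow from simple ratios.
print(round(12 / 17 * 100, 1))  # cross-species amplification rate
print(round(3 / 17 * 100, 1))   # share of loci with null alleles
```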
Altered miR-193a-5p expression in children with cow's milk allergy.

PubMed

D'Argenio, V; Del Monaco, V; Paparo, L; De Palma, F D E; Nocerino, R; D'Alessio, F; Visconte, F; Discepolo, V; Del Vecchio, L; Salvatore, F; Berni Canani, R

2018-02-01

Cow's milk allergy (CMA) is one of the most common food allergies in children. Epigenetic mechanisms have been suggested to play a role in CMA pathogenesis. We have shown that DNA methylation of Th1/Th2 cytokine genes and FoxP3 affects CMA disease course. Preliminary evidence suggests that the miRNome could also be implicated in the pathogenesis of allergy. The main study outcome was a comparative evaluation of the miRNome in children with CMA and in healthy controls. Peripheral blood mononuclear cells were obtained from children aged 4-18 months: 10 CMA patients, 9 CMA patients who outgrew CMA, and 11 healthy controls. Small RNA libraries were sequenced using a next-generation sequencing-based approach. Functional assessment of IL-4 expression was also performed. Among the differentially expressed miRNAs, 2 were upregulated and 14 were downregulated in children with active CMA compared to healthy controls. miR-193a-5p was the most downregulated miRNA in children with active CMA compared to healthy controls. The predicted targets of miR-193a-5p were upregulated in CMA patients compared to healthy controls. Peripheral blood CD4+ T cells transfected with a miR-193a-5p inhibitor showed a significant upregulation of IL-4 mRNA and its protein expression. Children who outgrew CMA showed miR-193a-5p levels, and expression of its related targets, similar to those observed in healthy controls. Our results suggest that miR-193a-5p is a post-transcriptional regulator of IL-4 expression and could have a role in IgE-mediated CMA. This miRNA could be a novel diagnostic and therapeutic target for this common form of food allergy in childhood. © 2017 EAACI and John Wiley and Sons A/S. Published by John Wiley and Sons Ltd.
Transcriptome Analysis of the Differentially Expressed Genes in the Male and Female Shrub Willows (Salix suchowensis)

PubMed Central

Liu, Jingjing; Yin, Tongming; Ye, Ning; Chen, Yingnan; Yin, Tingting; Liu, Min; Hassani, Danial

2013-01-01

Background The dioecious system is relatively rare in plants. Shrub willow is an annual flowering dioecious woody plant, and possesses many characteristics that make it a great model for tracking the missing pieces of sex determination evolution. To gain a global view of the genes differentially expressed in the male and female shrub willows and to develop a database for further studies, we performed large-scale transcriptome sequencing of flower buds collected separately from the two sexes. Results In total, 1,201,931 high quality reads were obtained, with an average length of 389 bp and a total length of 467.96 Mb. The ESTs were assembled into 29,048 contigs and 132,709 singletons. These unigenes were further functionally annotated by comparing their sequences to different protein and functional domain databases and were assigned Gene Ontology (GO) terms. A biochemical pathway database containing 291 predicted pathways was also created based on the annotations of the unigenes. Digital expression analysis identified 806 differentially expressed genes between the male and female flower buds. Of these, 33 are located on the incipient sex chromosome of Salicaceae, among which 12 genes might, empirically, be involved in plant sex determination. These genes merit special attention in future studies. Conclusions In this study, a large number of EST sequences were generated from the flower buds of a male and a female shrub willow. We also reported the differentially expressed genes between the two sex-type flowers. This work provides valuable information and sequence resources for uncovering the sex determining genes and for future functional genomics analysis of Salicaceae spp. PMID:23560075
"De-novo" amino acid sequence elucidation of protein G'e by combined "top-down" and "bottom-up" mass spectrometry.

PubMed

Yefremova, Yelena; Al-Majdoub, Mahmoud; Opuni, Kwabena F M; Koy, Cornelia; Cui, Weidong; Yan, Yuetian; Gross, Michael L; Glocker, Michael O

2015-03-01

Mass spectrometric de-novo sequencing was applied to review the amino acid sequence of a commercially available recombinant protein G' with great scientific and economic importance. Substantial deviations to the published amino acid sequence (Uniprot Q54181) were found by the presence of 46 additional amino acids at the N-terminus, including a so-called "His-tag" as well as an N-terminal partial α-N-gluconoylation and α-N-phosphogluconoylation, respectively. The unexpected amino acid sequence of the commercial protein G' comprised 241 amino acids and resulted in a molecular mass of 25,998.9 ± 0.2 Da for the unmodified protein. Due to the higher mass that is caused by its extended amino acid sequence compared with the original protein G' (185 amino acids), we named this protein "protein G'e." By means of mass spectrometric peptide mapping, the suggested amino acid sequence, as well as the N-terminal partial α-N-gluconoylations, was confirmed with 100% sequence coverage. After the protein G'e sequence was determined, we were able to determine the expression vector pET-28b from Novagen with the Xho I restriction enzyme cleavage site as the best option that was used for cloning and expressing the recombinant protein G'e in E. coli. A dissociation constant (K(d)) value of 9.4 nM for protein G'e was determined thermophoretically, showing that the N-terminal flanking sequence extension did not cause significant changes in the binding affinity to immunoglobulins.
Comparative Transcriptomics to Identify Novel Genes and Pathways in Dinoflagellates

NASA Astrophysics Data System (ADS)

Ryan, D.

2016-02-01

The unarmored dinoflagellate Karenia brevis is among the most prominent harmful, bloom-forming phytoplankton species in the Gulf of Mexico. During blooms, the polyketides PbTx-1 and PbTx-2 (brevetoxins) are produced by K. brevis. Brevetoxins negatively impact human health and the Gulf shellfish harvest. However, the genes underlying brevetoxin synthesis are currently unknown. Because the K. brevis genome is extremely large (1 × 10^11 base pairs long), and with a high proportion of repetitive, non-coding DNA, it has not been sequenced. In fact, large, repetitive genomes are common among the dinoflagellate group. High-throughput RNA sequencing technology enabled us to assemble Karenia transcriptomes de novo and investigate potential genes in the brevetoxin pathway through comparative transcriptomics. The brevetoxin profile varies among K. brevis clonal cultures. For example, well-documented Wilson-CCFWC268 typically produces 8-10 pg PbTx per cell, whereas SP1 produces < 2 pg PbTx/cell, and the mutant low-toxin Wilson clone produces undetectable to low (<0.05 pg/cell) amounts. Further, PbTx-2 has been measured in Karenia papilionacea but not Karenia mikimotoi. We compared the transcriptomes of four K. brevis clones (Wilson-CCFWC268, SP3, SP1, and mutant low-toxin Wilson) with K. papilionacea and K. mikimotoi to investigate nucleotide-level genetic variations and differences in gene expression. Of the 85,000 transcripts in the K. brevis transcriptome, 4,600 transcripts, including novel unannotated orthologs and putative polyketide synthases (PKSs), were only expressed by brevetoxin-producing K. brevis and K. papilionacea, not K. mikimotoi. Examination of gene expression between the typical- and low-toxin Wilson clones identified about 3,500 genes with significantly different expression levels, including 2 putative PKSs. One of the 2 PKSs was only found in the brevetoxin-producing Karenia species.
These transcriptomes could not have been characterized without high-throughput RNA sequencing.
Suppressive subtractive hybridization approach revealed differential expression of hypersensitive response and reactive oxygen species production genes in tea (Camellia sinensis (L.) O. Kuntze) leaves during Pestalotiopsis thea infection.

PubMed

Senthilkumar, Palanisamy; Thirugnanasambantham, Krishnaraj; Mandal, Abul Kalam Azad

2012-12-01

Tea (Camellia sinensis (L.) O. Kuntze) is an economically important plant cultivated for its leaves. Infection of Pestalotiopsis theae in leaves causes gray blight disease and enormous loss to the tea industry. We used suppressive subtractive hybridization (SSH) technique to unravel the differential gene expression pattern during gray blight disease development in tea. Complementary DNA from P. theae-infected and uninfected leaves of disease tolerant cultivar UPASI-10 was used as tester and driver populations respectively. Subtraction efficiency was confirmed by comparing abundance of β-actin gene. A total of 377 and 720 clones with insert size >250 bp from forward and reverse library respectively were sequenced and analyzed. Basic Local Alignment Search Tool analysis revealed 17 sequences in forward SSH library have high degree of similarity with disease and hypersensitive response related genes and 20 sequences with hypothetical proteins while in reverse SSH library, 23 sequences have high degree of similarity with disease and stress response-related genes and 15 sequences with hypothetical proteins. Functional analysis indicated unknown (61 and 59 %) or hypothetical functions (23 and 18 %) for most of the differentially regulated genes in forward and reverse SSH library, respectively, while others have important role in different cellular activities. Majority of the upregulated genes are related to hypersensitive response and reactive oxygen species production. Based on these expressed sequence tag data, putative role of differentially expressed genes were discussed in relation to disease. We also demonstrated the efficiency of SSH as a tool in enriching gray blight disease related up- and downregulated genes in tea. The present study revealed that many genes related to disease resistance were suppressed during P. theae infection and enhancing these genes by the application of inducers may impart better disease tolerance to the plants.
Molecular analysis of two phytohemagglutinin genes and their expression in Phaseolus vulgaris cv. Pinto, a lectin-deficient cultivar of the bean

PubMed Central

Voelker, Toni A.; Staswick, Paul; Chrispeels, Maarten J.

1986-01-01

Phytohemagglutinin (PHA), the seed lectin of the common bean, Phaseolus vulgaris, is encoded by two highly homologous, tandemly linked genes, dlec1 and dlec2, which are coordinately expressed at high levels in developing cotyledons. Their respective transcripts translate into closely related polypeptides, PHA-E and PHA-L, constituents of the tetrameric lectin which accumulates at high levels in developing seeds. In the bean cultivar Pinto UI111, PHA-E is not detectable, and PHA-L accumulates at very reduced levels. To investigate the cause of the Pinto phenotype, we cloned and sequenced the two PHA genes of Pinto, called Pdlec1 and Pdlec2, and determined the abundance of their respective mRNAs in developing cotyledons. Both genes are more than 90% homologous to the normal PHA genes found in other cultivars. Pdlec1 carries a 1-bp frameshift mutation close to the 5' end of its coding sequence. Only very truncated polypeptides could be made from its mRNA. The gene Pdlec2 encodes a polypeptide, which resembles PHA-L and its predicted amino acid sequence agrees with the available Pinto PHA amino acid sequence data. Analysis of the mRNA of developing cotyledons revealed that the Pdlec1 message is reduced 600-fold, and Pdlec2 mRNA is reduced 20-fold with respect to mRNA levels in normal cultivars. A comparison of the sequences which are upstream from the coding sequence shows that Pdlec2 has a 100-bp deletion compared to the other genes (dlec1, dlec2 and Pdlec1). This deletion which contains a large tandem repeat may be responsible for the low level of expression of Pdlec2. The very low expression of Pdlec1 is as yet unexplained. PMID:16453730
Comparable contributions of structural-functional constraints and expression level to the rate of protein sequence evolution

PubMed Central

Wolf, Maxim Y; Wolf, Yuri I; Koonin, Eugene V

2008-01-01

Background Proteins show a broad range of evolutionary rates. Understanding the factors that are responsible for the characteristic rate of evolution of a given protein arguably is one of the major goals of evolutionary biology. A long-standing general assumption used to be that the evolution rate is, primarily, determined by the specific functional constraints that affect the given protein. These constraints were traditionally thought to depend both on the specific features of the protein's structure and its biological role. The advent of systems biology brought about new types of data, such as expression level and protein-protein interactions, and unexpectedly, a variety of correlations between protein evolution rate and these variables have been observed. The strongest connections by far were repeatedly seen between protein sequence evolution rate and the expression level of the respective gene. It has been hypothesized that this link is due to the selection for the robustness of the protein structure to mistranslation-induced misfolding that is particularly important for highly expressed proteins and is the dominant determinant of the sequence evolution rate. Results This work is an attempt to assess the relative contributions of protein domain structure and function, on the one hand, and expression level on the other hand, to the rate of sequence evolution. To this end, we performed a genome-wide analysis of the effect of the fusion of a pair of domains in multidomain proteins on the difference in the domain-specific evolutionary rates. The mistranslation-induced misfolding hypothesis would predict that, within multidomain proteins, fused domains, on average, should evolve at substantially closer rates than the same domains in different proteins because, within a multidomain protein, all domains are translated at the same rate.
We performed a comprehensive comparison of the evolutionary rates of mammalian and plant protein domains that are either joined in multidomain proteins or contained in distinct proteins. Substantial homogenization of evolutionary rates in multidomain proteins was, indeed, observed in both animals and plants, although highly significant differences between domain-specific rates remained. The contributions of the translation rate, as determined by the effect of the fusion of a pair of domains within a multidomain protein, and intrinsic, domain-specific structural-functional constraints appear to be comparable in magnitude. Conclusion Fusion of domains in a multidomain protein results in substantial homogenization of the domain-specific evolutionary rates but significant differences between domain-specific evolution rates remain. Thus, the rate of translation and intrinsic structural-functional constraints both exert sizable and comparable effects on sequence evolution. Reviewers This article was reviewed by Sergei Maslov, Dennis Vitkup, Claus Wilke (nominated by Orly Alter), and Allan Drummond (nominated by Joel Bader). For the full reviews, please go to the Reviewers' Reports section. PMID:18840284
Differential substrate behaviours of ethylene oxide and propylene oxide towards human glutathione transferase theta hGSTT1-1.

PubMed

Thier, R; Wiebel, F A; Bolt, H M

1999-11-01

The transformation of ethylene oxide (EO), propylene oxide (PO) and 1-butylene oxide (1-BuO) by human glutathione transferase theta (hGSTT1-1) was studied comparatively using 'conjugator' (GSTT1+ individuals) erythrocyte lysates. The relative sequence of velocity of enzymic transformation was PO > EO > 1-BuO. The faster transformation of PO compared to EO was corroborated in studies with human and rat GSTT1-1 (hGSTT1-1 and rGSTT1-1, respectively) expressed by Salmonella typhimurium TA1535. This sequence of reactivities of homologous epoxides towards GSTT1-1 contrasts to the sequence observed in homologous alkyl halides (methyl bromide, MeBr; ethyl bromide, EtBr; n-propyl bromide, PrBr) where the relative sequence MeBr > EtBr > PrBr is observed. The higher reactivity towards GSTT1-1 of propylene oxide compared to ethylene oxide is consistent with a higher chemical reactivity. This is corroborated by experimental data of acid-catalysed hydrolysis of a number of aliphatic epoxides, including ethylene oxide and propylene oxide and consistent with semi-empirical molecular orbital modelings.
Integration of Bioinformatics and Synthetic Promoters Leads to the Discovery of Novel Elicitor-Responsive cis-Regulatory Sequences in Arabidopsis

PubMed Central

Koschmann, Jeannette; Machens, Fabian; Becker, Marlies; Niemeyer, Julia; Schulze, Jutta; Bülow, Lorenz; Stahl, Dietmar J.; Hehl, Reinhard

2012-01-01

A combination of bioinformatic tools, high-throughput gene expression profiles, and the use of synthetic promoters is a powerful approach to discover and evaluate novel cis-sequences in response to specific stimuli. With Arabidopsis (Arabidopsis thaliana) microarray data annotated to the PathoPlant database, 732 different queries with a focus on fungal and oomycete pathogens were performed, leading to 510 up-regulated gene groups. Using the binding site estimation suite of tools, BEST, 407 conserved sequence motifs were identified in promoter regions of these coregulated gene sets. Motif similarities were determined with STAMP, classifying the 407 sequence motifs into 37 families. A comparative analysis of these 37 families with the AthaMap, PLACE, and AGRIS databases revealed similarities to known cis-elements but also led to the discovery of cis-sequences not yet implicated in pathogen response. Using a parsley (Petroselinum crispum) protoplast system and a modified reporter gene vector with an internal transformation control, 25 elicitor-responsive cis-sequences from 10 different motif families were identified. Many of the elicitor-responsive cis-sequences also drive reporter gene expression in an Agrobacterium tumefaciens infection assay in Nicotiana benthamiana. This work significantly increases the number of known elicitor-responsive cis-sequences and demonstrates the successful integration of a diverse set of bioinformatic resources combined with synthetic promoter analysis for data mining and functional screening in plant-pathogen interaction. PMID:22744985
Evaluation of anonymous and expressed sequence tag derived polymorphic microsatellite markers in the tobacco budworm Heliothis virescens (Lepidoptera: noctuidae)

USDA-ARS's Scientific Manuscript database

Polymorphic genetic markers were identified and characterized using a partial genomic library of Heliothis virescens enriched for simple sequence repeats (SSR) and nucleotide sequences of expressed sequence tags (EST). Nucleotide sequences of 192 clones from the partial genomic library yielded 147 u...
An Automated Pipeline for Engineering Many-Enzyme Pathways: Computational Sequence Design, Pathway Expression-Flux Mapping, and Scalable Pathway Optimization.

PubMed

Halper, Sean M; Cetnar, Daniel P; Salis, Howard M

2018-01-01

Engineering many-enzyme metabolic pathways suffers from the design curse of dimensionality. There are an astronomical number of synonymous DNA sequence choices, though relatively few will express an evolutionary robust, maximally productive pathway without metabolic bottlenecks. To solve this challenge, we have developed an integrated, automated computational-experimental pipeline that identifies a pathway's optimal DNA sequence without high-throughput screening or many cycles of design-build-test. The first step applies our Operon Calculator algorithm to design a host-specific evolutionary robust bacterial operon sequence with maximally tunable enzyme expression levels. The second step applies our RBS Library Calculator algorithm to systematically vary enzyme expression levels with the smallest-sized library. After characterizing a small number of constructed pathway variants, measurements are supplied to our Pathway Map Calculator algorithm, which then parameterizes a kinetic metabolic model that ultimately predicts the pathway's optimal enzyme expression levels and DNA sequences. Altogether, our algorithms provide the ability to efficiently map the pathway's sequence-expression-activity space and predict DNA sequences with desired metabolic fluxes. Here, we provide a step-by-step guide to applying the Pathway Optimization Pipeline on a desired multi-enzyme pathway in a bacterial host.
Control of total GFP expression by alterations to the 3′ region nucleotide sequence

PubMed Central

2013-01-01

Background Previously, we distinguished the Escherichia coli type II cytoplasmic membrane translocation pathways of Tat, Yid, and Sec for unfolded and folded soluble target proteins. The translocation of folded protein to the periplasm for soluble expression via the Tat pathway was controlled by an N-terminal hydrophilic leader sequence. In this study, we investigated the effect of the hydrophilic C-terminal end and its nucleotide sequence on total and soluble protein expression. Results The native hydrophilic C-terminal end of GFP was obtained by deleting the C-terminal peptide LeuGlu-6×His, derived from pET22b(+). The corresponding clones induced total and soluble GFP expression that was either slightly increased or dramatically reduced, apparently through reconstruction of the nucleotide sequence around the stop codon in the 3′ region. In the expression-induced clones, the hydrophilic C-terminus showed increased Tat pathway specificity for soluble expression. However, in the expression-reduced clone, after analyzing the role of the 5′ poly(A) coding sequence with a substituted synonymous codon, we proved that the longer 5′ poly(A) coding sequence interacted with the reconstructed 3′ region nucleotide sequence to create a new mRNA tertiary structure between the 5′ and 3′ regions, which resulted in reduced total GFP expression. Further, to recover the reduced expression by changing the 3′ nucleotide sequence, after replacing selected C-terminal 5′ codons and the stop codon in the ORF with synonymous codons, total GFP expression in most of the clones was recovered to the undeleted control level. The insertion of trinucleotides after the stop codon in the 3′-UTR recovered or reduced total GFP expression. RT-PCR revealed that the level of total protein expression was controlled by changes in translational or transcriptional regulation, which were induced or reduced by the substitution or insertion of 3′ region nucleotides. 
Conclusions We found that the hydrophilic C-terminal end of GFP increased Tat pathway specificity and that the 3′ nucleotide sequence played an important role in total protein expression through translational and transcriptional regulation. These findings may be useful for efficiently producing recombinant proteins as well as for potentially controlling the expression level of specific genes in the body for therapeutic purposes. PMID:23834827
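The synonymous-codon replacements described in this abstract can be illustrated with a minimal sketch. It assumes the standard genetic code and uses a short toy ORF; the sequence, the function names and the chosen codons are hypothetical and are not the constructs used in the study. The sketch only checks that a substitution leaves the encoded protein unchanged; it says nothing about the resulting expression level.

```python
from itertools import product

# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(orf):
    """Translate an in-frame DNA sequence; '*' marks a stop codon."""
    usable = len(orf) - len(orf) % 3
    return "".join(CODON_TABLE[orf[i:i + 3]] for i in range(0, usable, 3))


def substitute_codon(orf, codon_index, new_codon):
    """Replace one codon, refusing any change that alters the amino acid."""
    start = codon_index * 3
    old_codon = orf[start:start + 3]
    if CODON_TABLE[old_codon] != CODON_TABLE[new_codon]:
        raise ValueError(f"{old_codon} -> {new_codon} is not synonymous")
    return orf[:start] + new_codon + orf[start + 3:]


# Hypothetical toy ORF: Met-Leu-Glu-stop.
toy_orf = "ATGCTGGAATAA"
variant = substitute_codon(toy_orf, 2, "GAG")   # Glu codon GAA -> GAG
variant = substitute_codon(variant, 3, "TGA")   # stop codon TAA -> TGA
print(translate(toy_orf), translate(variant))
```

Both sequences translate to the same peptide, so any difference in expression between such variants would have to come from the nucleotide sequence itself, which is the kind of effect the abstract reports.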
Comparison of secretory signal peptides for heterologous protein expression in microalgae: Expanding the secretion portfolio for Chlamydomonas reinhardtii

PubMed Central

de Carvalho, João Carlos Monteiro; Mayfield, Stephen Patrick

2018-01-01

Efficient protein secretion is a desirable trait for any recombinant protein expression system, together with simple, low-cost, and defined media, such as the typical media used for photosynthetic cultures of microalgae. However, low titers of secreted heterologous proteins are usually obtained, even with the most extensively studied microalga Chlamydomonas reinhardtii, preventing their industrial application. In this study, we aimed to expand and evaluate secretory signal peptides (SP) for heterologous protein secretion in C. reinhardtii by comparing previously described SP with untested sequences. We compared the SPs from arylsulfatase 1 and carbonic anhydrase 1, with those of untried SPs from binding protein 1, an ice-binding protein, and six sequences identified in silico. We identified over 2000 unique SPs using the SignalP 4.0 software. mCherry fluorescence was used to compare the protein secretion of up to 96 colonies for each construct, non-secretion construct, and parental wild-type cc1690 cells. Supernatant fluorescence varied according to the SP used, with a 10-fold difference observed between the highest and lowest secretors. Moreover, two SPs identified in silico secreted the highest amount of mCherry. Our results demonstrate that the SP should be carefully selected and that efficient sequences can be coded in the C. reinhardtii genome. The SPs described here expand the portfolio available for research on heterologous protein secretion and for biomanufacturing applications. PMID:29408937
Transcriptome Analysis of Beta macrocarpa and Identification of Differentially Expressed Transcripts in Response to Beet Necrotic Yellow Vein Virus Infection.

PubMed

Fan, Huiyan; Zhang, Yongliang; Sun, Haiwen; Liu, Junying; Wang, Ying; Wang, Xianbing; Li, Dawei; Yu, Jialin; Han, Chenggui

2015-01-01

Rhizomania is one of the most devastating diseases of sugar beet. It is caused by Beet necrotic yellow vein virus (BNYVV) transmitted by the obligate root-infecting parasite Polymyxa betae. Beta macrocarpa, a wild beet species widely used as a systemic host in the laboratory, can be rub-inoculated with BNYVV to avoid variation associated with the presence of the vector P. betae. To better understand disease and resistance between beets and BNYVV, we characterized the transcriptome of B. macrocarpa and analyzed global gene expression of B. macrocarpa in response to BNYVV infection using the Illumina sequencing platform. The overall de novo assembly of cDNA sequence data generated 75,917 unigenes, with an average length of 1054 bp. Based on a BLASTX search (E-value ≤ 10⁻⁵) against the non-redundant (NR, NCBI) protein, Swiss-Prot, the Gene Ontology (GO), Clusters of Orthologous Groups of proteins (COG) and Kyoto Encyclopedia of Genes and Genomes (KEGG) databases, there were 39,372 unigenes annotated. In addition, 4,834 simple sequence repeats (SSRs) were also predicted, which could serve as a foundation for various applications in beet breeding. Furthermore, comparative analysis of the two transcriptomes revealed that 261 genes were differentially expressed in infected compared to control plants, including 128 up- and 133 down-regulated genes. GO analysis showed that the differentially expressed genes were mainly enriched in response to biotic stimulus and primary metabolic process. Our results not only provide a rich genomic resource for beets, but also benefit research into the molecular mechanisms of the beet-BNYVV interaction.
CHD8 regulates neurodevelopmental pathways associated with autism spectrum disorder in neural progenitors

PubMed Central

Sugathan, Aarathi; Biagioli, Marta; Golzio, Christelle; Erdin, Serkan; Blumenthal, Ian; Manavalan, Poornima; Ragavendran, Ashok; Brand, Harrison; Lucente, Diane; Miles, Judith; Sheridan, Steven D.; Stortchevoi, Alexei; Kellis, Manolis; Haggarty, Stephen J.; Katsanis, Nicholas; Gusella, James F.; Talkowski, Michael E.

2014-01-01

Truncating mutations of chromodomain helicase DNA-binding protein 8 (CHD8), and of many other genes with diverse functions, are strong-effect risk factors for autism spectrum disorder (ASD), suggesting multiple mechanisms of pathogenesis. We explored the transcriptional networks that CHD8 regulates in neural progenitor cells (NPCs) by reducing its expression and then integrating transcriptome sequencing (RNA sequencing) with genome-wide CHD8 binding (ChIP sequencing). Suppressing CHD8 to levels comparable with the loss of a single allele caused altered expression of 1,756 genes, 64.9% of which were up-regulated. CHD8 showed widespread binding to chromatin, with 7,324 replicated sites that marked 5,658 genes. Integration of these data suggests that a limited array of direct regulatory effects of CHD8 produced a much larger network of secondary expression changes. Genes indirectly down-regulated (i.e., without CHD8-binding sites) reflect pathways involved in brain development, including synapse formation, neuron differentiation, cell adhesion, and axon guidance, whereas CHD8-bound genes are strongly associated with chromatin modification and transcriptional regulation. Genes associated with ASD were strongly enriched among indirectly down-regulated loci (P < 10⁻⁸) and CHD8-bound genes (P = 0.0043), which align with previously identified coexpression modules during fetal development. We also find an intriguing enrichment of cancer-related gene sets among CHD8-bound genes (P < 10⁻¹⁰). In vivo suppression of chd8 in zebrafish produced macrocephaly comparable to that of humans with inactivating mutations. These data indicate that heterozygous disruption of CHD8 precipitates a network of gene-expression changes involved in neurodevelopmental pathways in which many ASD-associated genes may converge on shared mechanisms of pathogenesis. PMID:25294932
Comparative and Evolutionary Analysis of Grass Pollen Allergens Using Brachypodium distachyon as a Model System.

PubMed

Sharma, Akanksha; Sharma, Niharika; Bhalla, Prem; Singh, Mohan

2017-01-01

Comparative genomics has facilitated the mining of biological information from a genome sequence, through the detection of similarities and differences with genomes of closely or more distantly related species. By using such comparative approaches, knowledge can be transferred from the model to non-model organisms and insights can be gained into the structural and evolutionary patterns of specific genes. In the absence of sequenced genomes for allergenic grasses, this study aimed to understand the structure, organisation and expression profiles of grass pollen allergens using the genomic data from Brachypodium distachyon, as it is phylogenetically related to the allergenic grasses. Combining genomic data with the anther RNA-Seq dataset revealed 24 pollen allergen genes belonging to eight allergen groups mapping on the five chromosomes in B. distachyon. High levels of anther-specific expression were observed for the 24 identified putative allergen-encoding genes in Brachypodium. The genomic evidence suggests that the gene encoding the group 5 allergen, the most potent trigger of hay fever and allergic asthma, originated as a pollen-specific orphan gene in a common grass ancestor of the Brachypodium and Triticeae clades. Gene structure analysis showed that the putative allergen-encoding genes in Brachypodium either lack introns or contain a reduced number of them. Promoter analysis of the identified Brachypodium genes revealed the presence of specific cis-regulatory sequences likely responsible for high anther/pollen-specific expression. With the identification of putative allergen-encoding genes in Brachypodium, this study has also described some important plant gene families (e.g. expansin superfamily, EF-Hand family, profilins, etc.) for the first time in the model plant Brachypodium.
Altogether, the present study provides new insights into structural characterization and evolution of pollen allergens and will further serve as a base for their functional characterization in related grass species.
Complete genome sequence and the expression pattern of plasmids of the model ethanologen Zymomonas mobilis ZM4 and its xylose-utilizing derivatives 8b and 2032.

PubMed

Yang, Shihui; Vera, Jessica M; Grass, Jeff; Savvakis, Giannis; Moskvin, Oleg V; Yang, Yongfu; McIlwain, Sean J; Lyu, Yucai; Zinonos, Irene; Hebert, Alexander S; Coon, Joshua J; Bates, Donna M; Sato, Trey K; Brown, Steven D; Himmel, Michael E; Zhang, Min; Landick, Robert; Pappas, Katherine M; Zhang, Yaoping

2018-01-01

Zymomonas mobilis is a natural ethanologen being developed and deployed as an industrial biofuel producer. To date, eight Z. mobilis strains have been completely sequenced and found to contain 2-8 native plasmids. However, systematic verification of predicted Z. mobilis plasmid genes and their contribution to cell fitness has not been hitherto addressed. Moreover, the precise number and identities of plasmids in Z. mobilis model strain ZM4 have been unclear. The lack of functional information about plasmid genes in ZM4 impedes ongoing studies for this model biofuel-producing strain. In this study, we determined the complete chromosome and plasmid sequences of ZM4 and its engineered xylose-utilizing derivatives 2032 and 8b. Compared to previously published and revised ZM4 chromosome sequences, the ZM4 chromosome sequence reported here contains 65 nucleotide sequence variations as well as a 2400-bp insertion. Four plasmids were identified in all three strains, with 150 plasmid genes predicted in strain ZM4 and 2032, and 153 plasmid genes predicted in strain 8b due to the insertion of heterologous DNA for expanded substrate utilization. Plasmid genes were then annotated using Blast2GO, InterProScan, and systems biology data analyses, and most genes were found to have apparent orthologs in other organisms or identifiable conserved domains. To verify plasmid gene prediction, RNA-Seq was used to map transcripts and also compare relative gene expression under various growth conditions, including anaerobic and aerobic conditions, or growth in different concentrations of biomass hydrolysates. Overall, plasmid genes were more responsive to varying hydrolysate concentrations than to oxygen availability. Additionally, our results indicated that although all plasmids were present in low copy number (about 1-2 per cell), the copy number of some plasmids varied under specific growth conditions or due to heterologous gene insertion. 
The complete genome of ZM4 and two xylose-utilizing derivatives is reported in this study, with an emphasis on identifying and characterizing plasmid genes. Plasmid gene annotation, validation, expression levels at growth conditions of interest, and contribution to host fitness are reported for the first time.
Expressed sequence tag analysis of adult human optic nerve for NEIBank: Identification of cell type and tissue markers

PubMed Central

Bernstein, Steven L; Guo, Yan; Peterson, Katherine; Wistow, Graeme

2009-01-01

Background The optic nerve is a pure white matter central nervous system (CNS) tract with an isolated blood supply, and is widely used in physiological studies of white matter response to various insults. We examined the gene expression profile of human optic nerve (ON) and, through the NEIBank online resource, provide a resource of sequence-verified cDNA clones. An un-normalized cDNA library was constructed from pooled human ON tissues and was used in expressed sequence tag (EST) analysis. Location of an abundant oligodendrocyte marker was examined by immunofluorescence. Quantitative real time polymerase chain reaction (qRT-PCR) and Western analysis were used to compare levels of expression for key calcium channel protein genes and protein product in primate and rodent ON. Results Our analyses revealed a profile similar in many respects to other white matter related tissues, but significantly different from previously available ON cDNA libraries. The previous libraries were found to include specific markers for other eye tissues, suggesting contamination. Immune/inflammatory markers were abundant in the new ON library. The oligodendrocyte marker QKI was abundant at the EST level. Immunofluorescence revealed that this protein is a useful oligodendrocyte cell-type marker in rodent and primate ONs. L-type calcium channel EST abundance was found to be particularly low. A qRT-PCR-based comparative mammalian species analysis revealed that L-type calcium channel expression levels are significantly lower in primate than in rodent ON, which may help account for the class-specific difference in responsiveness to calcium channel blocking agents. Several known eye disease genes are abundantly expressed in ON. Many genes associated with normal axonal function, as well as mRNAs associated with axonal transport, inflammation and neuroprotection, are observed.
Conclusion We conclude that the new cDNA library is a faithful representation of human ON and EST data provide an initial overview of gene expression patterns in this tissue. The data provide clues for tissue-specific and species-specific properties of human ON that will help in design of therapeutic models. PMID:19778450

WHOLE-GENOME SEQUENCING OF SALIVARY GLAND ADENOID CYSTIC CARCINOMA

PubMed Central

Rettig, Eleni M; Talbot, C Conover; Sausen, Mark; Jones, Sian; Bishop, Justin A; Wood, Laura D; Tokheim, Collin; Niknafs, Noushin; Karchin, Rachel; Fertig, Elana J; Wheelan, Sarah J; Marchionni, Luigi; Considine, Michael; Ling, Shizhang; Fakhry, Carole; Papadopoulos, Nickolas; Kinzler, Kenneth W; Vogelstein, Bert; Ha, Patrick K; Agrawal, Nishant

2016-01-01

Adenoid cystic carcinomas (ACCs) of the salivary glands are challenging to understand, treat, and cure. To better understand the genetic alterations underlying the pathogenesis of these tumors, we performed comprehensive genome analyses of 25 fresh-frozen tumors, including whole genome sequencing, expression and pathway analyses. In addition to the well-described MYB-NFIB fusion which was found in 11 tumors (44%), we observed five different rearrangements involving the NFIB transcription factor gene in seven tumors (28%). Taken together, NFIB translocations occurred in 15 of 25 samples (60%, 95%CI=41–77%). In addition, mRNA expression analysis of 17 tumors revealed overexpression of NFIB in ACC tumors compared with normal tissues (p=0.002). There was no difference in NFIB mRNA expression in tumors with NFIB fusions compared to those without. We also report somatic mutations of genes involved in the axonal guidance and Rho family signaling pathways. Finally, we confirm previously described alterations in genes related to chromatin regulation and Notch signaling. Our findings suggest a separate role for NFIB in ACC oncogenesis and highlight important signaling pathways for future functional characterization and potential therapeutic targeting. PMID:26862087
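The confidence interval quoted for the NFIB translocation frequency (15 of 25 samples) can be reproduced with a short calculation. The abstract does not say which interval method the authors used; the sketch below assumes a Wilson score interval with z = 1.96, which happens to give bounds that round to the reported 41–77%. The function name is illustrative.

```python
import math


def wilson_interval(successes, n, z=1.96):
    """Wilson score interval for a binomial proportion (assumed method)."""
    p = successes / n
    denom = 1 + z ** 2 / n
    center = (p + z ** 2 / (2 * n)) / denom
    half_width = z * math.sqrt(p * (1 - p) / n + z ** 2 / (4 * n ** 2)) / denom
    return center - half_width, center + half_width


# 15 of 25 tumors carried an NFIB translocation (counts from the abstract).
lower, upper = wilson_interval(15, 25)
print(f"{15 / 25:.0%} (95% CI {lower:.0%} to {upper:.0%})")
```

An exact (Clopper-Pearson) interval would be slightly wider, so the agreement with the Wilson bounds is suggestive of the method rather than proof of it.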
Macroarray expression analysis of barley susceptibility and nonhost resistance to Blumeria graminis.

PubMed

Eichmann, Ruth; Biemelt, Sophia; Schäfer, Patrick; Scholz, Uwe; Jansen, Carin; Felk, Angelika; Schäfer, Wilhelm; Langen, Gregor; Sonnewald, Uwe; Kogel, Karl-Heinz; Hückelhoven, Ralph

2006-04-01

Different formae speciales of the grass powdery mildew fungus Blumeria graminis undergo basic-compatible or basic-incompatible (nonhost) interactions with barley. Background resistance in compatible interactions and nonhost resistance require common genetic and mechanistic elements of plant defense. To build resources for differential screening for genes that potentially distinguish a compatible from an incompatible interaction on the level of differential gene expression of the plant, we constructed eight dedicated cDNA libraries, established 13,000 expressed sequence tag (EST) sequences and designed DNA macroarrays. Using macroarrays based on cDNAs derived from epidermal peels of plants pretreated with the chemical resistance-activating compound acibenzolar-S-methyl, we compared the expression of barley gene transcripts in the early host interaction with B. graminis f.sp. hordei or the nonhost pathogen B. graminis f.sp. tritici, respectively. We identified 102 spots corresponding to 94 genes on the macroarray that gave significant B. graminis-responsive signals at 12 and/or 24 h after inoculation. In independent expression analyses, we confirmed the macroarray results for 11 selected genes. Although the majority of genes showed a similar expression profile in compatible versus incompatible interactions, about 30 of the 94 genes were expressed at slightly different levels in compatible versus incompatible interactions.
Molecular cloning of the Coch gene of guinea pig inner ear and its expression analysis in cultured fibrocytes of the spiral ligament.

PubMed

Li, Lishu; Ikezono, Tetsuo; Sekine, Kuwon; Shindo, Susumu; Matsumura, Tomohiro; Pawankar, Ruby; Ichimiya, Issei; Yagi, Toshiaki

2010-08-01

We have cloned guinea pig Coch cDNA and the sequence information will be useful for future molecular study combined with physiological experiments. Proper Coch gene expression appears to be dependent on the unique extracellular micro-environment of the inner ear in vivo. These results provide insight into Coch gene expression and its regulation. To characterize the guinea pig Coch gene, we performed molecular cloning and expression analysis in the inner ear and cultured fibrocytes of the spiral ligament. The Coch cDNA was isolated using RACE. Cochlin isoforms were studied by Western blot using three different types of mammalian inner ear. The cochlear fibrocytes were cultured and characterized by immunostaining. Coch gene expression in the fibrocytes was investigated and the influence of cytokine stimulation was evaluated. The full-length 1991 bp Coch cDNA that encodes a 553 amino acid protein was isolated. The sequence had significant homology with other mammals, and the sizes of the Cochlin isoforms were identical. In the cultured fibrocytes, Coch mRNA was expressed in a very small amount and the isoform production was different, compared with the results in vivo. Cytokine stimulation did not alter the level of mRNA expression or isoform formation.
Transcriptome Sequencing and Characterization of Japanese Scallop Patinopecten yessoensis from Different Shell Color Lines

PubMed Central

Chang, Yaqing; Zhao, Wenming; Du, Zhenlin; Hao, Zhenlin

2015-01-01

Shell color is an important trait that is used in breeding the Japanese scallop Patinopecten yessoensis, the most economically important scallop species in China. We constructed four transcriptome libraries from different shell color lines of P. yessoensis: the left and right shell mantles of ordinary strains of P. yessoensis and the left shell mantles of the ‘Ivory’ and ‘Maple’ strains. These four libraries were paired-end sequenced using the Illumina HiSeq 2000 platform and contained 54,802,692 sequences, 40,798,962 sequences, 74,019,262 sequences, and 44,466,166 sequences, respectively. A total of 214,087,082 expressed sequence tags were assembled into 73,522 unigenes with an average size of 1,163 bp. When the data were compared against the public Nr and Swiss-Prot databases using BlastX, nearly 30.55% (22,458) of the unigenes were significantly matched to known unique proteins. Gene Ontology annotation and pathway mapping analysis using the Kyoto Encyclopedia of Genes and Genomes categorized unigenes according to their diverse biological functions and processes and identified candidate genes that were potentially involved in growth, pigmentation, metal transcription, and immunity. Expression profile analysis was performed on all four libraries and many differentially expressed genes were identified. In addition, 5,772 simple sequence repeats were obtained from the P. yessoensis transcriptomes, and 464,197, 395,646, and 310,649 single nucleotide polymorphisms were revealed in the ordinary strains, the ‘Ivory’ strain, and the ‘Maple’ strain, respectively. These results provide valuable information for future genomic studies on P. yessoensis and improve our understanding of the molecular mechanisms involved in the growth, immunity, shell coloring, and shell biomineralization of this species. 
These resources also may be used in a variety of applications, such as trait mapping, marker-assisted breeding, studies of population genetics and genomics, and work on functional genomics. PMID:25680107
Genome-wide identification of conserved intronic non-coding sequences using a Bayesian segmentation approach.

PubMed

Algama, Manjula; Tasker, Edward; Williams, Caitlin; Parslow, Adam C; Bryson-Richardson, Robert J; Keith, Jonathan M

2017-03-27

Computational identification of non-coding RNAs (ncRNAs) is a challenging problem. We describe a genome-wide analysis using Bayesian segmentation to identify intronic elements highly conserved between three evolutionarily distant vertebrate species: human, mouse and zebrafish. We investigate the extent to which these elements include ncRNAs (or conserved domains of ncRNAs) and regulatory sequences. We identified 655 deeply conserved intronic sequences in a genome-wide analysis. We also performed a pathway-focussed analysis on genes involved in muscle development, detecting 27 intronic elements, of which 22 were not detected in the genome-wide analysis. At least 87% of the genome-wide and 70% of the pathway-focussed elements have existing annotations indicative of conserved RNA secondary structure. The expression of 26 of the pathway-focused elements was examined using RT-PCR, providing confirmation that they include expressed ncRNAs. Consistent with previous studies, these elements are significantly over-represented in the introns of transcription factors. This study demonstrates a novel, highly effective, Bayesian approach to identifying conserved non-coding sequences. Our results complement previous findings that these sequences are enriched in transcription factors. However, in contrast to previous studies which suggest the majority of conserved sequences are regulatory factor binding sites, the majority of conserved sequences identified using our approach contain evidence of conserved RNA secondary structures, and our laboratory results suggest most are expressed. Functional roles at DNA and RNA levels are not mutually exclusive, and many of our elements possess evidence of both. Moreover, ncRNAs play roles in transcriptional and post-transcriptional regulation, and this may contribute to the over-representation of these elements in introns of transcription factors. 
We attribute the higher sensitivity of the pathway-focussed analysis compared to the genome-wide analysis to improved alignment quality, suggesting that enhanced genomic alignments may reveal many more conserved intronic sequences.
Characterization of an estrogen-responsive element implicated in regulation of the rainbow trout estrogen receptor gene.

PubMed

Le Dréan, Y; Lazennec, G; Kern, L; Saligaut, D; Pakdel, F; Valotaire, Y

1995-08-01

We previously reported that the expression of the rainbow trout estrogen receptor (rtER) gene is markedly increased by estradiol (E2). In this paper, we have used transient transfection assays with reporter plasmids expressing chloramphenicol acetyl transferase (CAT), linked to 5' flanking regions of the rtER gene promoter, to identify cis-elements responsible for E2 inducibility. Deletion analysis localized an estrogen-responsive element (ERE), at position +242, with one mutation on the first base compared with the consensus sequence. This element confers estrogen responsiveness to CAT reporter linked to both the herpes simplex virus thymidine kinase promoter and the homologous rtER promoter. Moreover, using a 0.2 kb fragment of the rtER promoter encompassing the ERE and the rtER DNA binding domain obtained from a bacterial expression system, DNase I footprinting experiments demonstrated a specific protection covering 20 bp (+240/+260) containing the ERE sequence. Based on these studies, we believe that this ERE sequence, identified in the rtER gene promoter, may be a major cis-acting element involved in the regulation of the gene by estrogen.
Developing expressed sequence tag libraries and the discovery of simple sequence repeat markers for two species of raspberry (Rubus L.).

PubMed

Bushakra, Jill M; Lewers, Kim S; Staton, Margaret E; Zhebentyayeva, Tetyana; Saski, Christopher A

2015-10-26

Due to a relatively high level of codominant inheritance and transferability within and among taxonomic groups, simple sequence repeat (SSR) markers are important elements in comparative mapping and delineation of genomic regions associated with traits of economic importance. Expressed sequence tags (ESTs) are a source of SSRs that can be used to develop markers to facilitate plant breeding and for more basic research across genera and higher plant orders. Leaf and meristem tissue from 'Heritage' red raspberry (Rubus idaeus) and 'Bristol' black raspberry (R. occidentalis) were utilized for RNA extraction. After conversion to cDNA and library construction, ESTs were sequenced, quality verified, assembled and scanned for SSRs. Primers flanking the SSRs were designed and a subset tested for amplification, polymorphism and transferability across species. ESTs containing SSRs were functionally annotated using the GenBank non-redundant (nr) database and further classified using the gene ontology database. To accelerate development of EST-SSRs in the genus Rubus (Rosaceae), 1149 and 2358 cDNA sequences were generated from red raspberry and black raspberry, respectively. The cDNA sequences were screened using rigorous filtering criteria which resulted in the identification of 121 and 257 SSR loci for red and black raspberry, respectively. Primers were designed from the surrounding sequences resulting in 131 and 288 primer pairs, respectively, as some sequences contained more than one SSR locus. Sequence analysis revealed that the SSR-containing genes span a diversity of functions and share more sequence identity with strawberry genes than with other Rosaceous species. This resource of Rubus-specific, gene-derived markers will facilitate the construction of linkage maps composed of transferable markers for studying and manipulating important traits in this economically important genus.
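The step in which assembled ESTs are scanned for SSRs can be sketched as a simple pattern search. The abstract does not give the filtering criteria or name the software used, so the minimum repeat counts below are hypothetical placeholders, and the regular-expression approach is a generic illustration rather than the authors' pipeline.

```python
import re

# Hypothetical thresholds: motif length -> minimum number of tandem repeats.
MIN_REPEATS = {2: 6, 3: 5, 4: 4}


def is_compound_of_shorter_unit(motif):
    """True if the motif is itself a repeat of a shorter unit (e.g. AAA, AGAG)."""
    n = len(motif)
    return any(n % k == 0 and motif == motif[:k] * (n // k) for k in range(1, n))


def find_ssrs(seq, min_repeats=MIN_REPEATS):
    """Return (start, motif, repeat_count) for each perfect SSR found."""
    seq = seq.upper()
    hits = []
    for motif_len, min_rep in sorted(min_repeats.items()):
        # A motif followed by at least (min_rep - 1) further copies of itself.
        pattern = re.compile(r"(([ACGT]{%d})\2{%d,})" % (motif_len, min_rep - 1))
        for match in pattern.finditer(seq):
            motif = match.group(2)
            if is_compound_of_shorter_unit(motif):
                continue  # already reported (or a homopolymer run)
            hits.append((match.start(), motif, len(match.group(1)) // motif_len))
    return hits


# Toy sequence containing one dinucleotide and one trinucleotide repeat.
toy_est = "TTGC" + "AG" * 8 + "CCGA" + "CTT" * 5 + "GA"
print(find_ssrs(toy_est))
```

Because a single sequence can yield more than one hit, a scan of this kind is consistent with the abstract's remark that some sequences contained more than one SSR locus; primer design around each hit would be a separate step.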
Development of a Stable Cell Line, Overexpressing Human T-cell Immunoglobulin Mucin 1

PubMed Central

Ebrahimi, Mina; Kazemi, Tohid; Ganjalikhani-hakemi, Mazdak; Majidi, Jafar; khanahmad, Hossein; Rahimmanesh, Ilnaz; Homayouni, Vida; Kohpayeh, Shirin

2015-01-01

Background Recent research has demonstrated that the human T-cell immunoglobulin mucin 1 (TIM-1) glycoprotein plays important roles in the regulation of autoimmune and allergic diseases, as well as in tumor immunity and the response to viral infections. Therefore, targeting TIM-1 could be a potential therapeutic approach against such diseases. Objectives In this study, we aimed to express TIM-1 protein on the human embryonic kidney (HEK) 293T cell line in order to have an available source of the TIM-1 antigen. Materials and Methods The cDNA was synthesized after RNA extraction from peripheral blood mononuclear cells (PBMC) and TIM-1 cDNA was amplified by PCR with specific primers. The PCR product was cloned into pcDNA™3.1/Hygro (+) and transformed into Escherichia coli TOP 10 F’. After cloning, the authenticity of the DNA sequence was checked and the construct was expressed in HEK 293T cells. Finally, expression of TIM-1 was analyzed by flow cytometry and real-time PCR. Results DNA sequencing confirmed that the TIM-1 DNA sequence was correct. The flow cytometry results indicated that TIM-1 was expressed in about 90% of transfected HEK 293T cells. The real-time PCR analysis showed that TIM-1 mRNA expression increased 195-fold in transfected cells compared with un-transfected cells. Conclusions The findings of the present study demonstrate the successful cloning and expression of TIM-1 on HEK 293T cells. These cells could be used as an immunogenic source for production of specific monoclonal antibodies, nanobodies and aptamers against human TIM-1. PMID:28959306
Measles virus minigenomes encoding two autofluorescent proteins reveal cell-to-cell variation in reporter expression dependent on viral sequences between the transcription units.

PubMed

Rennick, Linda J; Duprex, W Paul; Rima, Bert K

2007-10-01

Transcription from morbillivirus genomes commences at a single promoter in the 3' non-coding terminus, with the six genes being transcribed sequentially. The 3' and 5' untranslated regions (UTRs) of the genes (mRNA sense), together with the intergenic trinucleotide spacer, comprise the non-coding sequences (NCS) of the virus and contain the conserved gene end and gene start signals, respectively. Bicistronic minigenomes containing transcription units (TUs) encoding autofluorescent reporter proteins separated by measles virus (MV) NCS were used to give a direct estimation of gene expression in single, living cells by assessing the relative amounts of each fluorescent protein in each cell. Initially, five minigenomes containing each of the MV NCS were generated. Assays were developed to determine the amount of each fluorescent protein in cells at both cell population and single-cell levels. This revealed significant variations in gene expression between cells expressing the same NCS-containing minigenome. The minigenome containing the M/F NCS produced significantly lower amounts of fluorescent protein from the second TU (TU2), compared with the other minigenomes. A minigenome with a truncated F 5' UTR had increased expression from TU2. This UTR is 524 nt longer than the other MV 5' UTRs. Insertions into the 5' UTR of the enhanced green fluorescent protein gene in the minigenome containing the N/P NCS showed that specific sequences, rather than just the additional length of F 5' UTR, govern this decreased expression from TU2.
THE INVOLVEMENT OF HUMAN MONOGENIC CARDIOMYOPATHY GENES IN EXPERIMENTAL POLYGENIC CARDIAC HYPERTROPHY.

PubMed

Prestes, Priscilla R; Marques, Francine Z; Lopez-Campos, Guillermo; Lewandowski, Paul; Delbridge, Lea M D; Charchar, Fadi J; Harrap, Stephen B

2018-05-18

Hypertrophic cardiomyopathy thickens heart muscles reducing functionality and increasing risk of cardiac disease and morbidity. Genetic factors are involved, but their contribution is poorly understood. We used the hypertrophic heart rat (HHR), a unique normotensive polygenic model of cardiac hypertrophy and heart failure to investigate the role of genes associated with monogenic human cardiomyopathy. We selected 42 genes involved in monogenic human cardiomyopathies to study: 1) DNA variants, by sequencing the whole-genome of 13-week old HHR and age-matched normal heart rat (NHR), its genetic control strain; 2) mRNA expression, by targeted RNA-sequencing in left ventricles of HHR and NHR at five ages (2-days old, 4-, 13-, 33- and 50-weeks old) compared to human idiopathic dilated data; and 3) microRNA expression, with rat microRNA microarrays in left ventricles of 2-days old HHR and age-matched NHR. We also investigated experimentally validated microRNA-mRNA interactions. Whole-genome sequencing revealed unique variants mostly located in non-coding regions of HHR and NHR. We found 29 genes differentially expressed in at least one age. Genes encoding desmoglein 2 (Dsg2) and transthyretin (Ttr) were significantly differentially expressed at all ages in the HHR, but only Ttr was also differentially expressed in human idiopathic cardiomyopathy. Lastly, only two microRNAs differentially expressed in the HHR were present in our comparison of validated microRNA-mRNA interactions. These two microRNAs interact with five of the genes studied. Our study shows that genes involved in monogenic forms of human cardiomyopathies may also influence polygenic forms of the disease.
Molecular characterization of a 40 kDa OmpC-like porin from Serratia marcescens.

PubMed

Hutsul, J A; Worobec, E

1994-02-01

An oligonucleotide that encodes the N-terminal portion of a 41 kDa porin of Serratia marcescens was used to probe S. marcescens UOC-51 genomic DNA. An 11 kb EcoRI fragment which hybridized with the oligonucleotide was subcloned into Escherichia coli, examined for expression, and sequenced. The product expressed by the cloned gene was 40 kDa. The nucleotide sequence has an ORF of 1.13 kb. When the deduced amino acid sequence was aligned and compared to other enterobacterial porins the cloned S. marcescens porin most closely resembled E. coli OmpC. Although we did not detect osmoregulation or thermoregulation of any porins in S. marcescens UOC-51, sequences analogous to the E. coli osmoregulator OmpR-binding regions are seen upstream to the cloned gene. We examined the regulation of the S. marcescens porin in E. coli and found that its expression increased in a high salt environment. A micF gene, whose transcriptional product functions to inhibit synthesis of OmpF by hybridizing with the ompF transcript, was also seen upstream of the S. marcescens ompC. An alignment with the E. coli micF gene revealed that the functional region of the S. marcescens micF gene is conserved. Based on the results obtained we have determined that S. marcescens UOC-51 produces a 40 kDa porin similar to the E. coli OmpC porin.
Gene Expression Profiling Reveals Functional Specialization along the Intestinal Tract of a Carnivorous Teleostean Fish (Dicentrarchus labrax)

PubMed Central

Calduch-Giner, Josep A.; Sitjà-Bobadilla, Ariadna; Pérez-Sánchez, Jaume

2016-01-01

High-quality sequencing reads from the intestine of European sea bass were assembled, annotated by similarity against protein reference databases and combined with nucleotide sequences from public and private databases. After redundancy filtering, 24,906 non-redundant annotated sequences encoding 15,367 different gene descriptions were obtained. These annotated sequences were used to design a custom, high-density oligo-microarray (8 × 15 K) for the transcriptomic profiling of anterior (AI), middle (MI), and posterior (PI) intestinal segments. Similar molecular signatures were found for AI and MI segments, which were combined in a single group (AI-MI), whereas the PI stood out separately, with more than 1900 differentially expressed genes with a fold-change cutoff of 2. Functional analysis revealed that molecular and cellular functions related to feed digestion and nutrient absorption and transport were over-represented in AI-MI segments. By contrast, the initiation and establishment of immune defense mechanisms became especially relevant in PI, although the microarray expression profiling validated by qPCR indicated that these functional changes are gradual from anterior to posterior intestinal segments. This functional divergence occurred in association with spatial transcriptional changes in nutrient transporters and the mucosal chemosensing system via G protein-coupled receptors. These findings help to identify key indicators of gut functions and to compare different fish feeding strategies and immune defense mechanisms acquired along the evolution of teleosts. PMID:27610085
Gene Expression Profiling Reveals Functional Specialization along the Intestinal Tract of a Carnivorous Teleostean Fish (Dicentrarchus labrax).

PubMed

Calduch-Giner, Josep A; Sitjà-Bobadilla, Ariadna; Pérez-Sánchez, Jaume

2016-01-01

High-quality sequencing reads from the intestine of European sea bass were assembled, annotated by similarity against protein reference databases and combined with nucleotide sequences from public and private databases. After redundancy filtering, 24,906 non-redundant annotated sequences encoding 15,367 different gene descriptions were obtained. These annotated sequences were used to design a custom, high-density oligo-microarray (8 × 15 K) for the transcriptomic profiling of anterior (AI), middle (MI), and posterior (PI) intestinal segments. Similar molecular signatures were found for AI and MI segments, which were combined in a single group (AI-MI), whereas the PI stood out separately, with more than 1900 differentially expressed genes with a fold-change cutoff of 2. Functional analysis revealed that molecular and cellular functions related to feed digestion and nutrient absorption and transport were over-represented in AI-MI segments. By contrast, the initiation and establishment of immune defense mechanisms became especially relevant in PI, although the microarray expression profiling validated by qPCR indicated that these functional changes are gradual from anterior to posterior intestinal segments. This functional divergence occurred in association with spatial transcriptional changes in nutrient transporters and the mucosal chemosensing system via G protein-coupled receptors. These findings help to identify key indicators of gut functions and to compare different fish feeding strategies and immune defense mechanisms acquired along the evolution of teleosts.
SigEMD: A powerful method for differential gene expression analysis in single-cell RNA sequencing data.

PubMed

Wang, Tianyu; Nabavi, Sheida

2018-04-24

Differential gene expression analysis is one of the significant efforts in single cell RNA sequencing (scRNAseq) analysis to discover the specific changes in expression levels of individual cell types. Since scRNAseq exhibits multimodality, large amounts of zero counts, and sparsity, it is different from the traditional bulk RNA sequencing (RNAseq) data. The new challenges of scRNAseq data promote the development of new methods for identifying differentially expressed (DE) genes. In this study, we proposed a new method, SigEMD, that combines a data imputation approach, a logistic regression model and a nonparametric method based on the Earth Mover's Distance, to precisely and efficiently identify DE genes in scRNAseq data. The regression model and data imputation are used to reduce the impact of large amounts of zero counts, and the nonparametric method is used to improve the sensitivity of detecting DE genes from multimodal scRNAseq data. By additionally employing gene interaction network information to adjust the final states of DE genes, we further reduce the false positives of calling DE genes. We used simulated datasets and real datasets to evaluate the detection accuracy of the proposed method and to compare its performance with those of other differential expression analysis methods. Results indicate that the proposed method has an overall powerful performance in terms of precision in detection, sensitivity, and specificity. Copyright © 2018 Elsevier Inc. All rights reserved.
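As a rough illustration of the distance the abstract above builds on, the sketch below computes a one-dimensional Earth Mover's Distance between two sets of expression values for a single gene. It is not the SigEMD implementation: the imputation, logistic regression, and gene-network adjustment steps described in the abstract are omitted, and the function name `emd_1d` and the example values are hypothetical. It relies on the standard fact that, in one dimension, the distance equals the area between the two empirical cumulative distribution functions.

```python
from bisect import bisect_right


def emd_1d(a, b):
    """Earth Mover's Distance between two 1-D empirical samples.

    Illustrative sketch only (hypothetical helper, not part of SigEMD).
    Each sample is treated as a uniform distribution over its values.
    """
    if not a or not b:
        raise ValueError("both samples must be non-empty")
    a, b = sorted(a), sorted(b)
    # Every distinct value is a point where one of the CDFs can step.
    points = sorted(set(a) | set(b))
    total = 0.0
    for left, right in zip(points, points[1:]):
        # Empirical CDFs are constant on [left, right).
        cdf_a = bisect_right(a, left) / len(a)
        cdf_b = bisect_right(b, left) / len(b)
        total += abs(cdf_a - cdf_b) * (right - left)
    return total


# Hypothetical expression values for one gene in two cell groups.
group_1 = [0, 0, 0, 4]
group_2 = [0, 2, 2, 4]
distance = emd_1d(group_1, group_2)
```

A larger distance means the two groups' expression distributions differ more in shape or location, which is why a distribution-level measure of this kind can respond to multimodal differences that a comparison of means would miss.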
Sequence and expression variations suggest an adaptive role for the DA1-like gene family in the evolution of soybeans.

PubMed

Zhao, Man; Gu, Yongzhe; He, Lingli; Chen, Qingshan; He, Chaoying

2015-05-15

The DA1 gene family is plant-specific and Arabidopsis DA1 regulates seed and organ size, but the functions in soybeans are unknown. The cultivated soybean (Glycine max) is believed to be domesticated from the annual wild soybeans (Glycine soja). To evaluate whether DA1-like genes were involved in the evolution of soybeans, we compared variation at both sequence and expression levels of DA1-like genes from G. max (GmaDA1) and G. soja (GsoDA1). Sequence identities were extremely high between the orthologous pairs between soybeans, while the paralogous copies in a soybean species showed a relatively high divergence. Moreover, the expression variation of DA1-like paralogous genes in soybean was much greater than the orthologous gene pairs between the wild and cultivated soybeans during development and challenging abiotic stresses such as salinity. We further found that overexpressing GsoDA1 genes did not affect seed size. Nevertheless, overexpressing them reduced transgenic Arabidopsis seed germination sensitivity to salt stress. Moreover, most of these genes could improve salt tolerance of the transgenic Arabidopsis plants, corroborated by a detection of expression variation of several key genes in the salt-tolerance pathways. Our work suggested that expression diversification of DA1-like genes is functionally associated with adaptive radiation of soybeans, reinforcing that the plant-specific DA1 gene family might have contributed to the successful adaption to complex environments and radiation of the plants.
Conservation and divergence of plant LHP1 protein sequences and expression patterns in angiosperms and gymnosperms.

PubMed

Guan, Hexin; Zheng, Zhengui; Grey, Paris H; Li, Yuhua; Oppenheimer, David G

2011-05-01

Floral transition is a critical and strictly regulated developmental process in plants. Mutations in Arabidopsis LIKE HETEROCHROMATIN PROTEIN 1 (AtLHP1)/TERMINAL FLOWER 2 (TFL2) result in early and terminal flowers. Little is known about the gene expression, function and evolution of plant LHP1 homologs, except for Arabidopsis LHP1. In this study, the conservation and divergence of plant LHP1 protein sequences was analyzed by sequence alignments and phylogeny. LHP1 expression patterns were compared among taxa that occupy pivotal phylogenetic positions. Several relatively conserved new motifs/regions were identified among LHP1 homologs. Phylogeny of plant LHP1 proteins agreed with established angiosperm relationships. In situ hybridization unveiled conserved expression of plant LHP1 in the axillary bud/tiller, vascular bundles, developing stamens, and carpels. Unlike AtLHP1, cucumber CsLHP1-2, sugarcane SoLHP1 and maize ZmLHP1, rice OsLHP1 is not expressed in the shoot apical meristem (SAM) and the OsLHP1 transcript level is consistently low in shoots. "Unequal crossover" might have contributed to the divergence in the N-terminal and hinge region lengths of LHP1 homologs. We propose an "insertion-deletion" model for soybean (Glycine max L.) GmLHP1s evolution. Plant LHP1 homologs are more conserved than previously expected, and may favor vegetative meristem identity and primordia formation. OsLHP1 may not function in rice SAM during floral induction.
High-throughput sequencing of the chloroplast and mitochondrion of Chlamydomonas reinhardtii to generate improved de novo assemblies, analyze expression patterns and transcript speciation, and evaluate diversity among laboratory strains and wild isolates

DOE Office of Scientific and Technical Information (OSTI.GOV)

Gallaher, Sean D.; Fitz-Gibbon, Sorel T.; Strenkert, Daniela

Chlamydomonas reinhardtii is a unicellular chlorophyte alga that is widely studied as a reference organism for understanding photosynthesis, sensory and motile cilia, and for development of an algal-based platform for producing biofuels and bio-products. Its highly repetitive, ~205-kbp circular chloroplast genome and ~15.8-kbp linear mitochondrial genome were sequenced prior to the advent of high-throughput sequencing technologies. Here, high coverage shotgun sequencing was used to assemble both organellar genomes de novo. These new genomes correct dozens of errors in the prior genome sequences and annotations. Genome sequencing coverage indicates that each cell contains on average 83 copies of the chloroplast genome and 130 copies of the mitochondrial genome. Using protocols and analyses optimized for organellar transcripts, RNA-Seq was used to quantify their relative abundances across 12 different growth conditions. Forty-six percent of total cellular mRNA is attributable to high expression from a few dozen chloroplast genes. RNA-Seq data were used to guide gene annotation, to demonstrate polycistronic gene expression, and to quantify splicing of psaA and psbA introns. In contrast to a conclusion from a recent study, we found that chloroplast transcripts are not edited. Unexpectedly, cytosine-rich polynucleotide tails were observed at the 3’-end of all mitochondrial transcripts. A comparative genomics analysis of eight laboratory strains and 11 wild isolates of C. reinhardtii identified 2658 variants in the organellar genomes, which is 1/10th as much genetic diversity as is found in the nucleus.
Gene discovery in an invasive tephritid model pest species, the Mediterranean fruit fly, Ceratitis capitata

PubMed Central

Gomulski, Ludvik M; Dimopoulos, George; Xi, Zhiyong; Soares, Marcelo B; Bonaldo, Maria F; Malacrida, Anna R; Gasperi, Giuliano

2008-01-01

Background: The medfly, Ceratitis capitata, is a highly invasive agricultural pest that has become a model insect for the development of biological control programs. Despite research into the behavior and classical and population genetics of this organism, the quantity of sequence data available is limited. We have utilized an expressed sequence tag (EST) approach to obtain detailed information on transcriptome signatures that relate to a variety of physiological systems in the medfly; this information emphasizes reproduction, sex determination, and chemosensory perception, since the study was based on normalized cDNA libraries from embryos and adult heads. Results: A total of 21,253 high-quality ESTs were obtained from the embryo and head libraries. Clustering analyses performed separately for each library resulted in 5201 embryo and 6684 head transcripts. Considering an estimated 19% overlap in the transcriptomes of the two libraries, they represent about 9614 unique transcripts involved in a wide range of biological processes and molecular functions. Of particular interest are the sequences that share homology with Drosophila genes involved in sex determination, olfaction, and reproductive behavior. The medfly transformer2 (tra2) homolog was identified among the embryonic sequences, and its genomic organization and expression were characterized. Conclusion: The sequences obtained in this study represent the first major dataset of expressed genes in a tephritid species of agricultural importance. This resource provides essential information to support the investigation of numerous questions regarding the biology of the medfly and other related species and also constitutes an invaluable tool for the annotation of complete genome sequences. Our study has revealed intriguing findings regarding the transcript regulation of tra2 and other sex determination genes, as well as insights into the comparative genomics of genes implicated in chemosensory reception and reproduction. PMID:18500975
Cardiac Magnetic Resonance Imaging Using an Open 1.0T MR Platform: A Comparative Study with a 1.5T Tunnel System.

PubMed

Fischbach, Katharina; Kosiek, Otrud; Friebe, Björn; Wybranski, Christian; Schnackenburg, Bernhard; Schmeisser, Alexander; Smid, Jan; Ricke, Jens; Pech, Maciej

2017-01-01

Cardiac magnetic resonance imaging (cMRI) has become the non-invasive reference standard for the evaluation of cardiac function and viability. The introduction of open, high-field, 1.0T (HFO) MR scanners offers advantages for examinations of obese, claustrophobic and paediatric patients. The aim of our study was to compare standard cMRI sequences from an HFO scanner and those from a cylindrical, 1.5T MR system. Fifteen volunteers underwent cMRI both in an open HFO and in a cylindrical MR system. The protocol consisted of cine and unenhanced tissue sequences. The signal-to-noise ratio (SNR) for each sequence and blood-myocardium contrast for the cine sequences were assessed. Image quality and artefacts were rated. The location and number of non-diagnostic segments were determined. Volunteers' tolerance to examinations in both scanners was investigated. SNR was significantly lower in the HFO scanner (all p<0.001). However, the contrast of the cine sequence was significantly higher in the HFO platform compared to the 1.5T MR scanner (0.685±0.41 vs. 0.611±0.54; p<0.001). Image quality was comparable for all sequences (all p>0.05). Overall, only a few non-diagnostic myocardial segments were recorded: 6/960 (0.6%) by the HFO and 17/960 (1.8%) segments by the cylindrical system. The volunteers expressed a preference for the open MR system (p<0.01). Standard cardiac MRI sequences in an HFO platform offer a high image quality that is comparable to the quality of images acquired in a cylindrical 1.5T MR scanner. An open scanner design may potentially improve tolerance of cardiac MRI and therefore allow examination of an even broader patient spectrum.
Recombinant protein secretion in Pseudozyma flocculosa and Pseudozyma antarctica with a novel signal peptide.

PubMed

Cheng, Yali; Avis, Tyler J; Bolduc, Sébastien; Zhao, Yingyi; Anguenot, Raphaël; Neveu, Bertrand; Labbé, Caroline; Belzile, François; Bélanger, Richard R

2008-12-01

Secretion of recombinant proteins aims to reproduce the correct posttranslational modifications of the expressed protein while simplifying its recovery. In this study, secretion signal sequences from an abundantly secreted 34-kDa protein (P34) from Pseudozyma flocculosa were cloned. The efficiency of these sequences in the secretion of recombinant green fluorescent protein (GFP) was investigated in two Pseudozyma species and compared with other secretion signal sequences, from S. cerevisiae and Pseudozyma spp. The results indicate that various secretion signal sequences were functional and that the P34 signal peptide was the most effective secretion signal sequence in both P. flocculosa and P. antarctica. The cells correctly processed the secretion signal sequences, including P34 signal peptide, and mature GFP was recovered from the culture medium. This is the first report of functional secretion signal sequences in P. flocculosa. These sequences can be used to test the secretion of other recombinant proteins and for studying the secretion pathway in P. flocculosa and P. antarctica.

Transcriptome analysis and gene expression profiling of abortive and developing ovules during fruit development in hazelnut.

PubMed

Cheng, Yunqing; Liu, Jianfeng; Zhang, Huidi; Wang, Ju; Zhao, Yixin; Geng, Wanting

2015-01-01

A high ratio of blank fruit in hazelnut (Corylus heterophylla Fisch) is a very common phenomenon that causes serious yield losses in northeast China. The development of blank fruit in the Corylus genus is known to be associated with embryo abortion. However, little is known about the molecular mechanisms responsible for embryo abortion during the nut development stage. Genomic information for C. heterophylla Fisch is not available; therefore, data related to transcriptome and gene expression profiling of developing and abortive ovules are needed. In this study, de novo transcriptome sequencing and RNA-seq analysis were conducted using short-read sequencing technology (Illumina HiSeq 2000). The results of the transcriptome assembly analysis revealed genetic information that was associated with the fruit development stage. Two digital gene expression libraries were constructed, one for a full (normally developing) ovule and one for an empty (abortive) ovule. Transcriptome sequencing and assembly results revealed 55,353 unigenes, including 18,751 clusters and 36,602 singletons. These results were annotated using the public databases NR, NT, Swiss-Prot, KEGG, COG, and GO. Using digital gene expression profiling, gene expression differences in developing and abortive ovules were identified. A total of 1,637 and 715 unigenes were significantly upregulated and downregulated, respectively, in abortive ovules, compared with developing ovules. Quantitative real-time polymerase chain reaction analysis was used in order to verify the differential expression of some genes. The transcriptome and digital gene expression profiling data of normally developing and abortive ovules in hazelnut provide exhaustive information that will improve our understanding of the molecular mechanisms of abortive ovule formation in hazelnut.
OptSSeq: High-throughput sequencing readout of growth enrichment defines optimal gene expression elements for homoethanologenesis

DOE Office of Scientific and Technical Information (OSTI.GOV)

Ghosh, Indro Neil; Landick, Robert

The optimization of synthetic pathways is a central challenge in metabolic engineering. OptSSeq (Optimization by Selection and Sequencing) is one approach to this challenge. OptSSeq couples selection of optimal enzyme expression levels linked to cell growth rate with high-throughput sequencing to track enrichment of gene expression elements (promoters and ribosome-binding sites) from a combinatorial library. OptSSeq yields information on both optimal and suboptimal enzyme levels, and helps identify constraints that limit maximal product formation. Here we report a proof-of-concept implementation of OptSSeq using homoethanologenesis, a two-step pathway consisting of pyruvate decarboxylase (Pdc) and alcohol dehydrogenase (Adh) that converts pyruvate to ethanol and is naturally optimized in the bacterium Zymomonas mobilis. We used OptSSeq to determine optimal gene expression elements and enzyme levels for Z. mobilis Pdc, AdhA, and AdhB expressed in Escherichia coli. By varying both expression signals and gene order, we identified an optimal solution using only Pdc and AdhB. We resolved current uncertainty about the functions of the Fe2+-dependent AdhB and Zn2+-dependent AdhA by showing that AdhB is preferred over AdhA for rapid growth in both E. coli and Z. mobilis. Finally, by comparing predictions of growth-linked metabolic flux to enzyme synthesis costs, we established that optimal E. coli homoethanologenesis was achieved by our best pdc-adhB expression cassette and that the remaining constraints lie in the E. coli metabolic network or inefficient Pdc or AdhB function in E. coli. Furthermore, OptSSeq is a general tool for synthetic biology to tune enzyme levels in any pathway whose optimal function can be linked to cell growth or survival.
OptSSeq: High-throughput sequencing readout of growth enrichment defines optimal gene expression elements for homoethanologenesis

DOE PAGES

Ghosh, Indro Neil; Landick, Robert

2016-07-16

The optimization of synthetic pathways is a central challenge in metabolic engineering. OptSSeq (Optimization by Selection and Sequencing) is one approach to this challenge. OptSSeq couples selection of optimal enzyme expression levels linked to cell growth rate with high-throughput sequencing to track enrichment of gene expression elements (promoters and ribosome-binding sites) from a combinatorial library. OptSSeq yields information on both optimal and suboptimal enzyme levels, and helps identify constraints that limit maximal product formation. Here we report a proof-of-concept implementation of OptSSeq using homoethanologenesis, a two-step pathway consisting of pyruvate decarboxylase (Pdc) and alcohol dehydrogenase (Adh) that converts pyruvate to ethanol and is naturally optimized in the bacterium Zymomonas mobilis. We used OptSSeq to determine optimal gene expression elements and enzyme levels for Z. mobilis Pdc, AdhA, and AdhB expressed in Escherichia coli. By varying both expression signals and gene order, we identified an optimal solution using only Pdc and AdhB. We resolved current uncertainty about the functions of the Fe2+-dependent AdhB and Zn2+-dependent AdhA by showing that AdhB is preferred over AdhA for rapid growth in both E. coli and Z. mobilis. Finally, by comparing predictions of growth-linked metabolic flux to enzyme synthesis costs, we established that optimal E. coli homoethanologenesis was achieved by our best pdc-adhB expression cassette and that the remaining constraints lie in the E. coli metabolic network or inefficient Pdc or AdhB function in E. coli. Furthermore, OptSSeq is a general tool for synthetic biology to tune enzyme levels in any pathway whose optimal function can be linked to cell growth or survival.
Microarray analysis of differential gene expression elicited in Trametes versicolor during interspecific mycelial interactions.

PubMed

Eyre, Catherine; Muftah, Wafa; Hiscox, Jennifer; Hunt, Julie; Kille, Peter; Boddy, Lynne; Rogers, Hilary J

2010-08-01

Trametes versicolor is an important white rot fungus of both industrial and ecological interest. Saprotrophic basidiomycetes are the major decomposition agents in woodland ecosystems, and rarely form monospecific populations, therefore interspecific mycelial interactions continually occur. Interactions have different outcomes including replacement of one species by the other or deadlock. We have made subtractive cDNA libraries to enrich for genes that are expressed when T. versicolor interacts with another saprotrophic basidiomycete, Stereum gausapatum, an interaction that results in the replacement of the latter. Expressed sequence tags (ESTs) (1920) were used for microarray analysis, and their expression compared during interaction with three different fungi: S. gausapatum (replaced by T. versicolor), Bjerkandera adusta (deadlock) and Hypholoma fasciculare (replaced T. versicolor). Expression of significantly more probes changed in the interaction between T. versicolor and S. gausapatum or B. adusta compared to H. fasciculare, suggesting a relationship between interaction outcome and changes in gene expression. Copyright © 2010 The British Mycological Society. Published by Elsevier Ltd. All rights reserved.
Generation and analysis of blueberry transcriptome sequences from leaves, developing fruit, and flower buds from cold acclimation through deacclimation.

PubMed

Rowland, Lisa J; Alkharouf, Nadim; Darwish, Omar; Ogden, Elizabeth L; Polashock, James J; Bassil, Nahla V; Main, Dorrie

2012-04-02

There has been increased consumption of blueberries in recent years fueled in part because of their many recognized health benefits. Blueberry fruit is very high in anthocyanins, which have been linked to improved night vision, prevention of macular degeneration, anti-cancer activity, and reduced risk of heart disease. Very few genomic resources have been available for blueberry, however. Further development of genomic resources like expressed sequence tags (ESTs), molecular markers, and genetic linkage maps could lead to more rapid genetic improvement. Marker-assisted selection could be used to combine traits for climatic adaptation with fruit and nutritional quality traits. Efforts to sequence the transcriptome of the commercial highbush blueberry (Vaccinium corymbosum) cultivar Bluecrop and use the sequences to identify genes associated with cold acclimation and fruit development and develop SSR markers for mapping studies are presented here. Transcriptome sequences were generated from blueberry fruit at different stages of development, flower buds at different stages of cold acclimation, and leaves by next-generation Roche 454 sequencing. Over 600,000 reads were assembled into approximately 15,000 contigs and 124,000 singletons. The assembled sequences were annotated and functionally mapped to Gene Ontology (GO) terms. Frequency of the most abundant sequences in each of the libraries was compared across all libraries to identify genes that are potentially differentially expressed during cold acclimation and fruit development. Real-time PCR was performed to confirm their differential expression patterns. Overall, 14 out of 17 of the genes examined had differential expression patterns similar to what was predicted from their reads alone. The assembled sequences were also mined for SSRs. From these sequences, 15,886 blueberry EST-SSR loci were identified. Primers were designed from 7,705 of the SSR-containing sequences with adequate flanking sequence. One hundred primer pairs were tested for amplification and polymorphism among parents of two blueberry populations currently being used for genetic linkage map construction. The tetraploid mapping population was based on a cross between the highbush cultivars Draper and Jewel (V. darrowii is also in the background of 'Jewel'). The diploid mapping population was based on a cross between an F1 hybrid of V. darrowii and diploid V. corymbosum and another diploid V. corymbosum. The overall amplification rate of the SSR primers was 68% and the polymorphism rate was 43%. These results indicate that this large collection of 454 ESTs will be a valuable resource for identifying genes that are potentially differentially expressed and play important roles in flower bud development, cold acclimation, chilling unit accumulation, and fruit development in blueberry and related species. In addition, the ESTs have already proved useful for the development of SSR and EST-PCR markers, and are currently being used for construction of genetic linkage maps in blueberry.
Generation and analysis of blueberry transcriptome sequences from leaves, developing fruit, and flower buds from cold acclimation through deacclimation

PubMed Central

2012-01-01

Background: There has been increased consumption of blueberries in recent years fueled in part because of their many recognized health benefits. Blueberry fruit is very high in anthocyanins, which have been linked to improved night vision, prevention of macular degeneration, anti-cancer activity, and reduced risk of heart disease. Very few genomic resources have been available for blueberry, however. Further development of genomic resources like expressed sequence tags (ESTs), molecular markers, and genetic linkage maps could lead to more rapid genetic improvement. Marker-assisted selection could be used to combine traits for climatic adaptation with fruit and nutritional quality traits. Results: Efforts to sequence the transcriptome of the commercial highbush blueberry (Vaccinium corymbosum) cultivar Bluecrop and use the sequences to identify genes associated with cold acclimation and fruit development and develop SSR markers for mapping studies are presented here. Transcriptome sequences were generated from blueberry fruit at different stages of development, flower buds at different stages of cold acclimation, and leaves by next-generation Roche 454 sequencing. Over 600,000 reads were assembled into approximately 15,000 contigs and 124,000 singletons. The assembled sequences were annotated and functionally mapped to Gene Ontology (GO) terms. Frequency of the most abundant sequences in each of the libraries was compared across all libraries to identify genes that are potentially differentially expressed during cold acclimation and fruit development. Real-time PCR was performed to confirm their differential expression patterns. Overall, 14 out of 17 of the genes examined had differential expression patterns similar to what was predicted from their reads alone. The assembled sequences were also mined for SSRs. From these sequences, 15,886 blueberry EST-SSR loci were identified. Primers were designed from 7,705 of the SSR-containing sequences with adequate flanking sequence. One hundred primer pairs were tested for amplification and polymorphism among parents of two blueberry populations currently being used for genetic linkage map construction. The tetraploid mapping population was based on a cross between the highbush cultivars Draper and Jewel (V. darrowii is also in the background of 'Jewel'). The diploid mapping population was based on a cross between an F1 hybrid of V. darrowii and diploid V. corymbosum and another diploid V. corymbosum. The overall amplification rate of the SSR primers was 68% and the polymorphism rate was 43%. Conclusions: These results indicate that this large collection of 454 ESTs will be a valuable resource for identifying genes that are potentially differentially expressed and play important roles in flower bud development, cold acclimation, chilling unit accumulation, and fruit development in blueberry and related species. In addition, the ESTs have already proved useful for the development of SSR and EST-PCR markers, and are currently being used for construction of genetic linkage maps in blueberry. PMID:22471859
Comparative Genomics of Interreplichore Translocations in Bacteria: A Measure of Chromosome Topology?

PubMed

Khedkar, Supriya; Seshasayee, Aswin Sai Narain

2016-06-01

Genomes evolve not only in base sequence but also in terms of their architecture, defined by gene organization and chromosome topology. Whereas genome sequence data inform us about the changes in base sequences for a large variety of organisms, the study of chromosome topology is restricted to a few model organisms studied using microscopy and chromosome conformation capture techniques. Here, we exploit whole genome sequence data to study the link between gene organization and chromosome topology in bacteria. Using comparative genomics across ∼250 pairs of closely related bacteria we show that: (a) many organisms show a high degree of interreplichore translocations throughout the chromosome and not limited to the inversion-prone terminus (ter) or the origin of replication (oriC); (b) translocation maps may reflect chromosome topologies; and (c) symmetric interreplichore translocations do not disrupt the distance of a gene from oriC or affect gene expression states or strand biases in gene densities. In summary, we suggest that translocation maps might be a first line in defining a gross chromosome topology given a pair of closely related genome sequences. Copyright © 2016 Khedkar and Seshasayee.
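The notion of a symmetric interreplichore translocation can be sketched as follows. This is an illustrative reading of the abstract, not the authors' code: the coordinates, the "right"/"left" replichore labels, and the distance tolerance `tol` are all assumptions.

```python
def ori_distance_and_replichore(pos, ori, ter, length):
    """Return (distance from oriC, replichore label) on a circular chromosome."""
    clockwise = (pos - ori) % length   # distance travelling clockwise from oriC
    ter_cw = (ter - ori) % length      # clockwise distance from oriC to ter
    if clockwise <= ter_cw:
        return clockwise, "right"
    return length - clockwise, "left"  # shorter path runs counter-clockwise

def classify_pair(gene_a, gene_b, tol):
    """Compare an orthologous gene pair given as (distance, replichore) tuples."""
    (dist_a, rep_a), (dist_b, rep_b) = gene_a, gene_b
    if rep_a == rep_b:
        return "same replichore"
    # Assumed criterion: 'symmetric' means the oriC distance is preserved within tol.
    if abs(dist_a - dist_b) <= tol:
        return "symmetric interreplichore"
    return "asymmetric interreplichore"

# Toy chromosome of 1000 units with oriC at 0 and ter at 500.
a = ori_distance_and_replichore(100, 0, 500, 1000)
b = ori_distance_and_replichore(900, 0, 500, 1000)
print(a, b, classify_pair(a, b, tol=10))
```

Under this reading, a symmetric move changes the replichore but leaves the gene's distance from oriC essentially unchanged, which matches point (c) of the abstract.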
Comparative Genomics of Interreplichore Translocations in Bacteria: A Measure of Chromosome Topology?

PubMed Central

Khedkar, Supriya; Seshasayee, Aswin Sai Narain

2016-01-01

Genomes evolve not only in base sequence but also in terms of their architecture, defined by gene organization and chromosome topology. Whereas genome sequence data inform us about the changes in base sequences for a large variety of organisms, the study of chromosome topology is restricted to a few model organisms studied using microscopy and chromosome conformation capture techniques. Here, we exploit whole genome sequence data to study the link between gene organization and chromosome topology in bacteria. Using comparative genomics across ∼250 pairs of closely related bacteria we show that: (a) many organisms show a high degree of interreplichore translocations throughout the chromosome and not limited to the inversion-prone terminus (ter) or the origin of replication (oriC); (b) translocation maps may reflect chromosome topologies; and (c) symmetric interreplichore translocations do not disrupt the distance of a gene from oriC or affect gene expression states or strand biases in gene densities. In summary, we suggest that translocation maps might be a first line in defining a gross chromosome topology given a pair of closely related genome sequences. PMID:27172194
Advanced colorectal adenoma related gene expression signature may predict prognostic for colorectal cancer patients with adenoma-carcinoma sequence.

PubMed

Li, Bing; Shi, Xiao-Yu; Liao, Dai-Xiang; Cao, Bang-Rong; Luo, Cheng-Hua; Cheng, Shu-Jun

2015-01-01

There are still no absolute parameters predicting progression of adenoma into cancer. The present study aimed to characterize functional differences in the multistep carcinogenic process of the adenoma-carcinoma sequence. All samples were collected and mRNA expression profiling was performed using Agilent Microarray high-throughput gene-chip technology. The mRNA expression profiles of the adenoma-carcinoma sequence were then described with bioinformatics software, and we analyzed the relationship between gene expression profiles of the adenoma-adenocarcinoma sequence and the clinical prognosis of colorectal cancer. The mRNA expression of the adenoma-carcinoma sequence differed significantly between the high-grade intraepithelial neoplasia group and the adenocarcinoma group. Gene ontology biological process enrichment analysis of the genes differentially expressed between the high-grade intraepithelial neoplasia group and the adenocarcinoma group showed that these genes were enriched in extracellular structure organization, skeletal system development, biological adhesion and growth regulation, with P values after FDR correction of less than 0.05. In addition, IPR-related proteins were mainly insulin-like growth factor binding proteins. The changes in gene expression profiles along the adenoma-carcinoma sequence were mainly concentrated in high-grade intraepithelial neoplasia and adenocarcinoma. The differentially expressed genes are significantly correlated between the high-grade intraepithelial neoplasia group and the adenocarcinoma group. Bioinformatics analysis is an effective way to study gene expression profiles in the adenoma-carcinoma sequence, and may provide an effective tool for extending colorectal cancer research strategies to colorectal adenoma or advanced adenoma.
Both positive and negative regulatory elements mediate expression of a photoregulated CAB gene from Nicotiana plumbaginifolia.

PubMed Central

Castresana, C; Garcia-Luque, I; Alonso, E; Malik, V S; Cashmore, A R

1988-01-01

We have analyzed promoter regulatory elements from a photoregulated CAB gene (Cab-E) isolated from Nicotiana plumbaginifolia. These studies have been performed by introducing chimeric gene constructs into tobacco cells via Agrobacterium tumefaciens-mediated transformation. Expression studies on the regenerated transgenic plants have allowed us to characterize three positive and one negative cis-acting elements that influence photoregulated expression of the Cab-E gene. Within the upstream sequences we have identified two positive regulatory elements (PRE1 and PRE2) which confer maximum levels of photoregulated expression. These sequences contain multiple repeated elements related to the sequence-ACCGGCCCACTT-. We have also identified within the upstream region a negative regulatory element (NRE) extremely rich in AT sequences, which reduces the level of gene expression in the light. We have defined a light regulatory element (LRE) within the promoter region extending from -396 to -186 bp which confers photoregulated expression when fused to a constitutive nopaline synthase ('nos') promoter. Within this region there is a 132-bp element, extending from -368 to -234 bp, which on deletion from the Cab-E promoter reduces gene expression from high levels to undetectable levels. Finally, we have demonstrated for a full length Cab-E promoter conferring high levels of photoregulated expression, that sequences proximal to the Cab-E TATA box are not replaceable by corresponding sequences from a 'nos' promoter. This contrasts with the apparent equivalence of these Cab-E and 'nos' TATA box-proximal sequences in truncated promoters conferring low levels of photoregulated expression. PMID:2901343
Channel-Opening Kinetic Mechanism of Wild-Type GluK1 Kainate Receptors and a C-Terminal Mutant

PubMed Central

Han, Yan; Wang, Congzhou; Park, Jae Seon; Niu, Li

2012-01-01

GluK1 is a kainate receptor subunit in the ionotropic glutamate receptor family and can form functional channels when expressed, for instance, in HEK-293 cells. However, the channel-opening mechanism of GluK1 is poorly understood. One major challenge to studying the GluK1 channel is its apparent low surface expression, which results in a low whole-cell current response even to a saturating concentration of agonist. The low surface expression is thought to be contributed by an endoplasmic reticulum (ER) retention signal sequence. When this sequence motif is present as in the wild-type GluK1-2b C-terminus, the receptor is significantly retained in the ER. Conversely, when this sequence is lacking, as in wild-type GluK1-2a (i.e., a different alternatively spliced isoform at the C-terminus) and in a GluK1-2b mutant (i.e., R896A, R897A, R900A and K901A) that disrupts the ER retention signal, there is higher surface expression and greater whole-cell current response. Here we characterize the channel-opening kinetic mechanism for these three GluK1 receptors expressed in HEK-293 cells by using a laser-pulse photolysis technique. Our results show that the wild-type GluK1-2a, wild-type GluK1-2b and the mutant GluK1-2b have identical channel-opening and channel-closing rate constants. These results indicate that the C-terminal ER retention signal sequence, which affects receptor trafficking/expression, does not affect channel-gating properties. Furthermore, as compared with the GluK2 kainate receptor, the GluK1 channel is faster to open, close, and desensitize by at least two-fold, yet the EC50 value of GluK1 is similar to that of GluK2. PMID:22191429
Tissue- and agonist-specific regulation of human and murine plasminogen activator inhibitor-1 promoters in transgenic mice.

PubMed

Eren, M; Painter, C A; Gleaves, L A; Schoenhard, J A; Atkinson, J B; Brown, N J; Vaughan, D E

2003-11-01

Numerous studies have described regulatory factors and sequences that control transcriptional responses in vitro. However, there is a paucity of information on the qualitative and quantitative regulation of heterologous promoters using transgenic strategies. In order to investigate the physiological regulation of human plasminogen activator inhibitor type-1 (hPAI-1) expression in vivo compared to murine PAI-1 (mPAI-1) and to test the physiological relevance of regulatory mechanisms described in vitro, we generated transgenic mice expressing enhanced green fluorescent protein (EGFP) driven by the proximal -2.9 kb of the hPAI-1 promoter. Transgenic animals were treated with Ang II, TGF-beta1 and lipopolysaccharide (LPS) to compare the relative activation of the human and murine PAI-1 promoters. Ang II increased EGFP expression most effectively in brain, kidney and spleen, while mPAI-1 expression was quantitatively enhanced most prominently in heart and spleen. TGF-beta1 failed to induce activation of the hPAI-1 promoter but potently stimulated mPAI-1 in kidney and spleen. LPS administration triggered robust expression of mPAI-1 in liver, kidney, pancreas, spleen and lung, while EGFP was induced only modestly in heart and kidney. These results indicate that the transcriptional response of the endogenous mPAI-1 promoter varies widely in terms of location and magnitude of response to specific stimuli. Moreover, the physiological regulation of PAI-1 expression likely involves a complex interaction of transcription factors and DNA sequences that are not adequately replicated by in vitro functional studies focused on the proximal -2.9 kb promoter.
Molecular characterization and expression profiles of four transformer-2 isoforms in the Chinese mitten crab Eriocheir sinensis

NASA Astrophysics Data System (ADS)

Luo, Danli; Liu, Yuan; Hui, Min; Song, Chengwen; Liu, Hourong; Cui, Zhaoxia

2017-07-01

The transformer-2 (tra-2) gene plays a key role in the regulatory hierarchy of sexual differentiation in somatic tissues and in the germline of Drosophila melanogaster. In this study, sequences and expression profiles of tra-2 in the Chinese mitten crab Eriocheir sinensis were characterized. Four tra-2 isoforms, designated as Estra-2a, Estra-2b, Estra-2c, and Estra-2d, were isolated. They all contained an RNA-recognition motif (RRM) and a linker region, which shared high similarity with other reported tra-2s. Sequence analysis revealed that Estra-2a, Estra-2b and Estra-2c are encoded by the same genomic locus and are generated by alternative splicing of the pre-mRNA. Compared with the other three isoforms, Estra-2d lacks the RS2 domain. Quantitative real-time PCR showed that all four isoforms were highly expressed in the fertilized egg, and in the 2-4 cell and blastula stages compared with larval stages (P≤0.01), suggesting their maternal origin in early embryonic developmental stages. Notably, Estra-2a was highly expressed in male somatic tissues, while Estra-2c was significantly highly expressed in the ovary. These results suggest that Estra-2c is involved in sexual differentiation of the Chinese mitten crab. Our findings provide basic information for further functional studies of the tra-2 gene/protein in this species.
FluG affects secretion in colonies of Aspergillus niger.

PubMed

Wang, Fengfeng; Krijgsheld, Pauline; Hulsman, Marc; de Bekker, Charissa; Müller, Wally H; Reinders, Marcel; de Vries, Ronald P; Wösten, Han A B

2015-01-01

Colonies of Aspergillus niger are characterized by zonal heterogeneity in growth, sporulation, gene expression and secretion. For instance, the glucoamylase gene glaA is more highly expressed at the periphery of colonies when compared to the center. As a consequence, its encoded protein GlaA is mainly secreted at the outer part of the colony. Here, multiple copies of amyR were introduced in A. niger. Most transformants over-expressing this regulatory gene of amylolytic genes still displayed heterogeneous glaA expression and GlaA secretion. However, heterogeneity was abolished in transformant UU-A001.13 by expressing glaA and secreting GlaA throughout the mycelium. Sequencing the genome of UU-A001.13 revealed that transformation had been accompanied by deletion of part of the fluG gene and disrupting its 3' end by integration of a transformation vector. Inactivation of fluG in the wild-type background of A. niger also resulted in breakdown of starch under the whole colony. Asexual development of the ∆fluG strain was not affected, unlike what was previously shown in Aspergillus nidulans. Genes encoding proteins with a signal sequence for secretion, including part of the amylolytic genes, were more often downregulated in the central zone of maltose-grown ∆fluG colonies and upregulated in the intermediate part and periphery when compared to the wild-type. Together, these data indicate that FluG of A. niger is a repressor of secretion.
Molecular characterization and expression study of a histidine auxotrophic mutant (his1-) of Nicotiana plumbaginifolia.

PubMed

El Malki, F; Jacobs, M

2001-01-01

The histidine auxotroph mutant his 1(-) isolated from Nicotiana plumbaginifolia haploid protoplasts was first characterized to be deficient for the enzyme histidinol phosphate aminotransferase that is responsible for one of the last steps of histidine biosynthesis. Expression of the mutated gene at the RNA level was assessed by northern analysis of various tissues. Transcriptional activity was unimpaired by the mutation and, in contrast, a higher level of expression was obtained when compared to the wild-type. The cDNA sequence encoding the mutated gene was isolated by RT-PCR and compared to the wild-type gene. A single point mutation corresponding to the substitution of a G nucleotide by A was identified at position 1212 starting from the translation site. The alignment of the deduced amino acid sequences from the mutated and wild-type gene showed that this mutation resulted in the substitution of an Arg by a His residue at position 381. This Arg residue is a conserved amino acid for histidinol phosphate aminotransferase of many species. These results indicate that the identified mutation results in an altered histidinol phosphate aminotransferase enzyme that is unable to convert the substrate imidazole acetol phosphate to histidinol phosphate and thereby leads to the blockage of histidine biosynthesis. Possible consequences of this blockage on the expression of other amino acid biosynthesis genes were evaluated by analysing the expression of the dhdps gene encoding dihydrodipicolinate synthase, the first key enzyme of the lysine pathway.
Expressed Sequence Reference Standards for Evaluating Stage-specific Gene Expression in Southern Green Lacewings, Chrysoperla rufilabris

USDA-ARS's Scientific Manuscript database

Five developmental stages of Chrysoperla rufilabris were tested using nine primer pairs. Three sequences were highly expressed at all life stages and six were differentially expressed. These primer pairs may be used as standards to quantitate functional gene expression associated with physiological ...
Genome-Wide Identification of Differentially Expressed Genes Associated with the High Yielding of Oleoresin in Secondary Xylem of Masson Pine (Pinus massoniana Lamb) by Transcriptomic Analysis

PubMed Central

Liu, Qinghua; Zhou, Zhichun; Wei, Yongcheng; Shen, Danyu; Feng, Zhongping; Hong, Shanping

2015-01-01

Masson pine is an important timber and resource for oleoresin in South China. Increasing yield of oleoresin in stems can raise economic benefits and enhance the resistance to bark beetles. However, the genetic mechanisms for regulating the yield of oleoresin were still unknown. Here, high-throughput sequencing technology was used to investigate the transcriptome and compare the gene expression profiles of high and low oleoresin-yielding genotypes. A total of 40,690,540 reads were obtained and assembled into 137,499 transcripts from the secondary xylem tissues. We identified 84,842 candidate unigenes based on sequence annotation using various databases and 96 unigenes were candidates for terpenoid backbone biosynthesis in pine. By comparing the expression profiles of high and low oleoresin-yielding genotypes, 649 differentially expressed genes (DEGs) were identified. GO enrichment analysis of DEGs revealed that multiple pathways were related to high yield of oleoresin. Nine candidate genes were validated by QPCR analysis. Among them, the candidate genes encoding geranylgeranyl diphosphate synthase (GGPS) and (-)-alpha/beta-pinene synthase were up-regulated in the high oleoresin-yielding genotype, while tricyclene synthase revealed lower expression level, which was in good agreement with the GC/MS result. In addition, DEG encoding ABC transporters, pathogenesis-related proteins (PR5 and PR9), phosphomethylpyrimidine synthase, non-specific lipid-transfer protein-like protein and ethylene responsive transcription factors (ERFs) were also confirmed to be critical for the biosynthesis of oleoresin. The next-generation sequencing strategy used in this study has proven to be a powerful means for analyzing transcriptome variation related to the yield of oleoresin in masson pine. 
The candidate genes encoding GGPS, (-)-alpha/beta-pinene, tricyclene synthase, ABC transporters, non-specific lipid-transfer protein-like protein, phosphomethylpyrimidine synthase, ERFs and pathogen responses may play important roles in regulating the yield of oleoresin. These DEGs are worthy of special attention in future studies. PMID:26167875
Comparative genome sequencing of Drosophila pseudoobscura: Chromosomal, gene, and cis-element evolution

PubMed Central

Richards, Stephen; Liu, Yue; Bettencourt, Brian R.; Hradecky, Pavel; Letovsky, Stan; Nielsen, Rasmus; Thornton, Kevin; Hubisz, Melissa J.; Chen, Rui; Meisel, Richard P.; Couronne, Olivier; Hua, Sujun; Smith, Mark A.; Zhang, Peili; Liu, Jing; Bussemaker, Harmen J.; van Batenburg, Marinus F.; Howells, Sally L.; Scherer, Steven E.; Sodergren, Erica; Matthews, Beverly B.; Crosby, Madeline A.; Schroeder, Andrew J.; Ortiz-Barrientos, Daniel; Rives, Catharine M.; Metzker, Michael L.; Muzny, Donna M.; Scott, Graham; Steffen, David; Wheeler, David A.; Worley, Kim C.; Havlak, Paul; Durbin, K. James; Egan, Amy; Gill, Rachel; Hume, Jennifer; Morgan, Margaret B.; Miner, George; Hamilton, Cerissa; Huang, Yanmei; Waldron, Lenée; Verduzco, Daniel; Clerc-Blankenburg, Kerstin P.; Dubchak, Inna; Noor, Mohamed A.F.; Anderson, Wyatt; White, Kevin P.; Clark, Andrew G.; Schaeffer, Stephen W.; Gelbart, William; Weinstock, George M.; Gibbs, Richard A.

2005-01-01

We have sequenced the genome of a second Drosophila species, Drosophila pseudoobscura, and compared this to the genome sequence of Drosophila melanogaster, a primary model organism. Throughout evolution the vast majority of Drosophila genes have remained on the same chromosome arm, but within each arm gene order has been extensively reshuffled, leading to a minimum of 921 syntenic blocks shared between the species. A repetitive sequence is found in the D. pseudoobscura genome at many junctions between adjacent syntenic blocks. Analysis of this novel repetitive element family suggests that recombination between offset elements may have given rise to many paracentric inversions, thereby contributing to the shuffling of gene order in the D. pseudoobscura lineage. Based on sequence similarity and synteny, 10,516 putative orthologs have been identified as a core gene set conserved over 25–55 million years (Myr) since the pseudoobscura/melanogaster divergence. Genes expressed in the testes had higher amino acid sequence divergence than the genome-wide average, consistent with the rapid evolution of sex-specific proteins. Cis-regulatory sequences are more conserved than random and nearby sequences between the species—but the difference is slight, suggesting that the evolution of cis-regulatory elements is flexible. Overall, a pattern of repeat-mediated chromosomal rearrangement, and high coadaptation of both male genes and cis-regulatory sequences emerges as important themes of genome divergence between these species of Drosophila. PMID:15632085
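The idea of counting syntenic blocks from reshuffled gene order can be sketched in a few lines. This is a simplified illustration, not the method used in the paper: it assumes a single chromosome arm, one-to-one orthologs with unique identifiers, and it ignores gene orientation; the function name is hypothetical.

```python
def count_syntenic_blocks(order_a, order_b):
    """Count maximal runs of genes that are adjacent in both gene orders."""
    pos_b = {gene: i for i, gene in enumerate(order_b)}
    shared = [gene for gene in order_a if gene in pos_b]
    if not shared:
        return 0
    # Rank shared genes by their position in species B, ignoring unshared genes.
    rank_b = {gene: r for r, gene in enumerate(sorted(shared, key=pos_b.get))}
    blocks = 1
    for prev, cur in zip(shared, shared[1:]):
        # Neighbours in A that are not neighbours in B mark a block boundary.
        if abs(rank_b[cur] - rank_b[prev]) != 1:
            blocks += 1
    return blocks

# Toy example: a paracentric inversion of the segment C-D-E-F.
species_a = list("ABCDEFGH")
species_b = list("ABFEDCGH")
print(count_syntenic_blocks(species_a, species_b))
```

A single inversion splits the arm into three blocks here, which is why repeated inversions within an arm can produce many blocks while genes remain on the same arm.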
RNA Sequencing Reveals Differences between the Global Transcriptomes of Salmonella enterica Serovar Enteritidis Strains with High and Low Pathogenicities

PubMed Central

2014-01-01

Salmonella enterica serovar Enteritidis is one of the important causes of bacterial food-borne gastroenteritis worldwide. Field strains of S. Enteritidis are relatively genetically homogeneous; however, they show extensive phenotypic diversity and differences in virulence potential. RNA sequencing (RNA-Seq) was used to characterize differences in the global transcriptome between several genetically similar but phenotypically diverse poultry-associated field strains of S. Enteritidis grown in laboratory medium at avian body temperature (42°C). These S. Enteritidis strains were previously characterized as high-pathogenicity (HP; n = 3) and low-pathogenicity (LP; n = 3) strains based on both in vitro and in vivo virulence assays. Using the negative binomial distribution-based statistical tools edgeR and DESeq, 252 genes were identified as differentially expressed in LP strains compared with their expression in the HP strains (P < 0.05). A majority of genes (235, or 93.2%) showed significantly reduced expression, whereas a few genes (17, or 6.8%) showed increased expression in all LP strains compared with HP strains. LP strains showed a unique transcriptional profile that is characterized by significantly reduced expression of several transcriptional regulators and reduced expression of genes involved in virulence (e.g., Salmonella pathogenicity island 1 [SPI-1], SPI-5, and fimbrial and motility genes) and protection against osmotic, oxidative, and other stresses, such as iron-limiting conditions commonly encountered within the host. Several functionally uncharacterized genes also showed reduced expression. This study provides a first concise view of the global transcriptional differences between field strains of S. Enteritidis with various levels of pathogenicity, providing the basis for future functional characterization of several genes with potential roles in virulence or stress regulation of S. Enteritidis. PMID:24271167
Recovering complete mitochondrial genome sequences from RNA-Seq: A case study of Polytomella non-photosynthetic green algae.

PubMed

Tian, Yao; Smith, David Roy

2016-05-01

Thousands of mitochondrial genomes have been sequenced, but there are comparatively few available mitochondrial transcriptomes. This might soon be changing. High-throughput RNA sequencing (RNA-Seq) techniques have made it fast and cheap to generate massive amounts of mitochondrial transcriptomic data. Here, we explore the utility of RNA-Seq for assembling mitochondrial genomes and studying their expression patterns. Specifically, we investigate the mitochondrial transcriptomes from Polytomella non-photosynthetic green algae, which have among the smallest, most reduced mitochondrial genomes from the Archaeplastida as well as fragmented rRNA-coding regions, palindromic genes, and linear chromosomes with telomeres. Isolation of whole genomic RNA from the four known Polytomella species followed by Illumina paired-end sequencing generated enough mitochondrial-derived reads to easily recover almost-entire mitochondrial genome sequences. Read-mapping and coverage statistics also gave insights into Polytomella mitochondrial transcriptional architecture, revealing polycistronic transcripts and the expression of telomeres and palindromic genes. Ultimately, RNA-Seq is a promising, cost-effective technique for studying mitochondrial genetics, but it does have drawbacks, which are discussed. One of its greatest potentials, as shown here, is that it can be used to generate near-complete mitochondrial genome sequences, which could be particularly useful in situations where there is a lack of available mtDNA data. Copyright © 2016 The Authors. Published by Elsevier Inc. All rights reserved.
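The read-mapping and coverage statistics mentioned above can be illustrated with a small sketch. It assumes reads have already been aligned and reduced to 0-based, half-open (start, end) intervals on a linear reference; the function names and the toy intervals are hypothetical and are not data from the study.

```python
def per_base_depth(ref_length, reads):
    """Per-base read depth from (start, end) intervals, 0-based and half-open."""
    diff = [0] * (ref_length + 1)
    for start, end in reads:
        diff[start] += 1   # a read begins covering at start
        diff[end] -= 1     # and stops covering at end
    depth, running = [], 0
    for delta in diff[:ref_length]:
        running += delta
        depth.append(running)
    return depth

def covered_fraction(depth, min_depth=1):
    """Fraction of reference positions with at least min_depth reads."""
    return sum(d >= min_depth for d in depth) / len(depth)

# Toy reference of 10 bases with three mapped reads.
depth = per_base_depth(10, [(0, 4), (2, 6), (8, 10)])
print(depth, covered_fraction(depth))
```

Positions with zero depth correspond to the gaps that keep an RNA-Seq-derived assembly "near-complete" rather than complete, for example untranscribed regions.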

mESAdb: microRNA Expression and Sequence Analysis Database

PubMed Central

Kaya, Koray D.; Karakülah, Gökhan; Yakıcıer, Cengiz M.; Acar, Aybar C.; Konu, Özlen

2011-01-01

microRNA expression and sequence analysis database (http://konulab.fen.bilkent.edu.tr/mirna/) (mESAdb) is a regularly updated database for the multivariate analysis of sequences and expression of microRNAs from multiple taxa. mESAdb is modular and has a user interface implemented in PHP and JavaScript and coupled with statistical analysis and visualization packages written for the R language. The database primarily comprises mature microRNA sequences and their target data, along with selected human, mouse and zebrafish expression data sets. mESAdb analysis modules allow (i) mining of microRNA expression data sets for subsets of microRNAs selected manually or by motif; (ii) pair-wise multivariate analysis of expression data sets within and between taxa; and (iii) association of microRNA subsets with annotation databases, HUGE Navigator, KEGG and GO. The use of existing and customized R packages facilitates future addition of data sets and analysis tools. Furthermore, the ability to upload and analyze user-specified data sets makes mESAdb an interactive and expandable analysis tool for microRNA sequence and expression data. PMID:21177657
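Selecting a subset of microRNAs by sequence motif, as in module (i), might look like the sketch below. It is not mESAdb's implementation (which the abstract describes as PHP, JavaScript and R); the function name, the small IUPAC table and the toy sequences are assumptions made for illustration.

```python
import re

# Subset of IUPAC RNA codes, mapped to regular-expression character classes.
IUPAC = {"A": "A", "C": "C", "G": "G", "U": "U",
         "N": "[ACGU]", "R": "[AG]", "Y": "[CU]"}

def select_by_motif(mirnas, motif):
    """Return sorted names of microRNAs whose sequence contains the motif."""
    pattern = re.compile("".join(IUPAC[code] for code in motif.upper()))
    return sorted(name for name, seq in mirnas.items()
                  if pattern.search(seq.upper()))

# Toy sequences with made-up names; these are not database records.
toy = {
    "toy-mir-1": "UGAGGUAGUAGG",
    "toy-mir-2": "UAGCUUAUCAGA",
    "toy-mir-3": "CGAGGUAGCCAU",
}
print(select_by_motif(toy, "GAGGUAG"))
print(select_by_motif(toy, "YAGCUU"))
```

The selected names could then be used to subset an expression matrix before the pair-wise multivariate analysis described in module (ii).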
mESAdb: microRNA expression and sequence analysis database.

PubMed

Kaya, Koray D; Karakülah, Gökhan; Yakicier, Cengiz M; Acar, Aybar C; Konu, Ozlen

2011-01-01

microRNA expression and sequence analysis database (http://konulab.fen.bilkent.edu.tr/mirna/) (mESAdb) is a regularly updated database for the multivariate analysis of sequences and expression of microRNAs from multiple taxa. mESAdb is modular and has a user interface implemented in PHP and JavaScript and coupled with statistical analysis and visualization packages written for the R language. The database primarily comprises mature microRNA sequences and their target data, along with selected human, mouse and zebrafish expression data sets. mESAdb analysis modules allow (i) mining of microRNA expression data sets for subsets of microRNAs selected manually or by motif; (ii) pair-wise multivariate analysis of expression data sets within and between taxa; and (iii) association of microRNA subsets with annotation databases, HUGE Navigator, KEGG and GO. The use of existing and customized R packages facilitates future addition of data sets and analysis tools. Furthermore, the ability to upload and analyze user-specified data sets makes mESAdb an interactive and expandable analysis tool for microRNA sequence and expression data.
RNA sequencing uncovers antisense RNAs and novel small RNAs in Streptococcus pyogenes.

PubMed

Le Rhun, Anaïs; Beer, Yan Yan; Reimegård, Johan; Chylinski, Krzysztof; Charpentier, Emmanuelle

2016-01-01

Streptococcus pyogenes is a human pathogen responsible for a wide spectrum of diseases ranging from mild to life-threatening infections. During the infectious process, the temporal and spatial expression of pathogenicity factors is tightly controlled by a complex network of protein and RNA regulators acting in response to various environmental signals. Here, we focus on the class of small RNA regulators (sRNAs) and present the first complete analysis of sRNA sequencing data in S. pyogenes. In the SF370 clinical isolate (M1 serotype), we identified 197 and 428 putative regulatory RNAs by visual inspection and bioinformatics screening of the sequencing data, respectively. Only 35 from the 197 candidates identified by visual screening were assigned a predicted function (T-boxes, ribosomal protein leaders, characterized riboswitches or sRNAs), indicating how little is known about sRNA regulation in S. pyogenes. By comparing our list of predicted sRNAs with previous S. pyogenes sRNA screens using bioinformatics or microarrays, 92 novel sRNAs were revealed, including antisense RNAs that are for the first time shown to be expressed in this pathogen. We experimentally validated the expression of 30 novel sRNAs and antisense RNAs. We show that the expression profile of 9 sRNAs including 2 predicted regulatory elements is affected by the endoribonucleases RNase III and/or RNase Y, highlighting the critical role of these enzymes in sRNA regulation.
A regulatory sequence from the retinoid X receptor γ gene directs expression to horizontal cells and photoreceptors in the embryonic chicken retina.

PubMed

Blixt, Maria K E; Hallböök, Finn

2016-01-01

Combining techniques of episomal vector gene-specific Cre expression and genomic integration using the piggyBac transposon system enables studies of gene expression-specific cell lineage tracing in the chicken retina. In this work, we aimed to target the retinal horizontal cell progenitors. A 208 bp gene regulatory sequence from the chicken retinoid X receptor γ gene (RXRγ208) was used to drive Cre expression. RXRγ is expressed in progenitors and photoreceptors during development. The vector was combined with a piggyBac "donor" vector containing a floxed STOP sequence followed by enhanced green fluorescent protein (EGFP), as well as a piggyBac helper vector for efficient integration into the host cell genome. The vectors were introduced into the embryonic chicken retina with in ovo electroporation. Tissue electroporation targets specific developmental time points and in specific structures. Cells that drove Cre expression from the regulatory RXRγ208 sequence excised the floxed STOP-sequence and expressed GFP. The approach generated a stable lineage with robust expression of GFP in retinal cells that have activated transcription from the RXRγ208 sequence. Furthermore, GFP was expressed in cells that express horizontal or photoreceptor markers when electroporation was performed between developmental stages 22 and 28. Electroporation of a stage 12 optic cup gave multiple cell types in accordance with RXRγ gene expression in the early retina. In this study, we describe an easy, cost-effective, and time-efficient method for testing regulatory sequences in general. More specifically, our results open up the possibility for further studies of the RXRγ-gene regulatory network governing the formation of photoreceptor and horizontal cells. In addition, the method presents approaches to target the expression of effector genes, such as regulators of cell fate or cell cycle progression, to these cells and their progenitor.
RNA sequencing reveals pronounced changes in the noncoding transcriptome of aging synaptosomes.

PubMed

Chen, Bei Jun; Ueberham, Uwe; Mills, James D; Kirazov, Ludmil; Kirazov, Evgeni; Knobloch, Mara; Bochmann, Jana; Jendrek, Renate; Takenaka, Konii; Bliim, Nicola; Arendt, Thomas; Janitz, Michael

2017-08-01

Normal aging is associated with impairments in cognitive functions. These alterations are caused by diminutive changes in the biology of synapses, and ineffective neurotransmission, rather than loss of neurons. Hitherto, only a few studies, exploring molecular mechanisms of healthy brain aging in higher vertebrates, utilized synaptosomal fractions to survey local changes in aging-related transcriptome dynamics. Here we present, for the first time, a comparative analysis of the synaptosomes transcriptome in the aging mouse brain using RNA sequencing. Our results show changes in the expression of genes contributing to biological pathways related to neurite guidance, synaptosomal physiology, and RNA splicing. More intriguingly, we also discovered alterations in the expression of thousands of novel, unannotated lincRNAs during aging. Further, detailed characterization of the cleavage and polyadenylation factor I subunit 1 (Clp1) mRNA and protein expression indicates its increased expression in neuronal processes of hippocampal stratum radiatum in aging mice. Together, our study uncovers a new layer of transcriptional regulation which is targeted by aging within the local environment of interconnecting neuronal cells. Copyright © 2017 Elsevier Inc. All rights reserved.
The semantics of verbs in the dissolution and development of language.

PubMed

Lahey, M; Feier, C D

1982-03-01

Evidence of the dissolution (DL) of verbs was examined in the written logs kept daily for 4 1/2 years by a woman (Mrs. W) who suffered from cerebral atrophy of unknown origin. Results were compared with similar analyses of written samples obtained from elementary school children (CWL), from normal adults (AWL) and from the literature on early oral language development (COL). The major finding of this study was that the sequence of the dissolution of verbs, in terms of the meanings expressed, mirrored the sequence of early acquisition. In the DL data reported here, Mrs. W continued to write about dynamic events after she ceased writing about stative events; in COL, children talk about dynamic events before stative events. Based on the AWL and CWL data, frequency of use is rejected as an explanation for the dominance and stability of dynamic relations in DL. Rather, it is suggested that the expression of dynamic relations may be less complex than the expression of stative relations due to possible differences in imagery and implication, but particularly due to the linguistic contexts in which each can be expressed.
[Effects of bushen yinao tablet on physiology and cerebral gene expression in senescence-accelerated mice].

PubMed

Zhang, Chong; Wang, Jin-gang; Yang, Ting

2006-06-01

To study the effects of Bushen Yinao Tablet (BSYNT) on physiology and cerebral gene expression in senescence-accelerated mice (SAM). The change in cerebral tissue mRNA expression in SAM was analyzed and compared between the medicated group and the control group by messenger RNA differential display reverse transcription polymerase chain reaction (mRNA DDRT-PCR). BSYNT increased the hemoglobin (Hb) level and erythrocyte (RBC) count of blood-deficiency mice, and improved spatial learning and memory function and the escape response to conditioned stimulus. In this study, 14 differential display bands were discerned, and three of them were sequenced. The sequences of the three fragments were similar to fatty acid binding protein 7, ubiquinol-cytochrome C reductase complex (7.2 kD) and 60S ribosomal protein L21, with sequence similarity of 97%, 100%, and 99%, respectively. BSYNT affects the physiology of mice, and its effect on cerebral tissue mRNA expression may play an important role in anti-aging at the molecular level.
Calcium-Dependent Protein Kinase Genes in Corn Roots

NASA Technical Reports Server (NTRS)

Takezawa, D.; Patil, S.; Bhatia, A.; Poovaiah, B. W.

1996-01-01

Two cDNAs encoding Ca2+-dependent protein kinases (CDPKs), Corn Root Protein Kinase 1 and 2 (CRPK1, CRPK2), were isolated from the root tip library of corn (Zea mays L., cv. Merit) and their nucleotide sequences were determined. Deduced amino acid sequences of both clones have features characteristic of plant CDPKs, including all 11 conserved serine/threonine kinase subdomains, a junction domain and a calmodulin-like domain with four Ca2+-binding sites. Northern analysis revealed that CRPK1 mRNA is preferentially expressed in roots, especially in the root tip, whereas the expression of CRPK2 mRNA was very low in all the tissues tested. In situ hybridization experiments revealed that CRPK1 mRNA is highly expressed in the root apex, as compared to other parts of the root. Partially purified CDPK from the root tip phosphorylates syntide-2, a common peptide substrate for plant CDPKs, and the phosphorylation was stimulated 7-fold by the addition of Ca2+. Our results show that two CDPK isoforms are expressed in corn roots and they may be involved in the Ca2+-dependent signal transduction process.
Signatures from Tissue-specific MPSS Libraries Identify Transcripts Preferentially Expressed in the Mouse Inner Ear

PubMed Central

Peters, Linda M.; Belyantseva, Inna A.; Lagziel, Ayala; Battey, James F.; Friedman, Thomas B.; Morell, Robert J.

2007-01-01

Specialization in cell function and morphology is influenced by the differential expression of mRNAs, many of which are expressed at low abundance and restricted to certain cell types. Detecting such transcripts in cDNA libraries may require sequencing millions of clones. Massively parallel signature sequencing (MPSS) is well-suited for identifying transcripts that are expressed in discrete cell types and in low abundance. We have made MPSS libraries from microdissections of three inner ear tissues. By comparing these MPSS libraries to those of 87 other tissues included in the Mouse Reference Transcriptome (MRT) online resource, we have identified genes that are highly enriched in, or specific to, the inner ear. We show by RT-PCR and in situ hybridization that signatures unique to the inner ear libraries identify transcripts with highly specific cell-type localizations. These transcripts serve to illustrate the utility of a resource that is available to the research community. Utilization of these resources will increase the number of known transcription units and expand our knowledge of the tissue-specific regulation of the transcriptome. PMID:17049805
Structure, Expression, Chromosomal Location and Product of the Gene Encoding Adh2 in Petunia

PubMed Central

Gregerson, R. G.; Cameron, L.; McLean, M.; Dennis, P.; Strommer, J.

1993-01-01

In most higher plants the genes encoding alcohol dehydrogenase comprise a small gene family, usually with two members. The Adh1 gene of Petunia has been cloned and analyzed, but a second identifiable gene was not recovered from any of three genomic libraries. We have therefore employed the polymerase chain reaction to obtain the major portion of a second Adh gene. From sequence, mapping and northern data we conclude this gene encodes ADH2, the major anaerobically inducible Adh gene of Petunia. The availability of both Adh1 and Adh2 from Petunia has permitted us to compare their structures and patterns of expression to those of the well-studied Adh genes of maize, of which one is highly expressed developmentally, while both are induced in response to hypoxia. Despite their evolutionary distance, evidenced by deduced amino acid sequence as well as taxonomic classification, the pairs of genes are regulated in strikingly similar ways in maize and Petunia. Our findings suggest a significant biological basis for the regulatory strategy employed by these distant species for differential expression of multiple Adh genes. PMID:8096485
Older persons' expressions of emotional cues and concerns during home care visits. Application of the Verona coding definitions of emotional sequences (VR-CoDES) in home care.

PubMed

Sundler, Annelie J; Höglander, Jessica; Eklund, Jakob Håkansson; Eide, Hilde; Holmström, Inger K

2017-02-01

This study aims to a) explore to what extent older persons express emotional cues and concerns during home care visits; b) describe what cues and concerns these older persons expressed; and c) explore who initiated these cues and concerns. A descriptive and cross-sectional study was conducted. Data consisted of 188 audio recorded home care visits with older persons and registered nurses or nurse assistants, coded with the Verona coding definitions on emotional sequences (VR-CoDES). Emotional expressions of cues and concerns occurred in 95 (51%) of the 188 recorded home care visits. Most frequent were implicit expressions of cues (n=292) rather than explicit concerns (n=24). Utterances with hints to hidden concerns (63.9%, n=202) were most prevalent, followed by vague or unspecific expressions of emotional worries (15.8%, n=50). Most of these were elicited by the nursing staff (63%, n=200). Emotional needs expressed by the older persons receiving home care were mainly communicated implicitly. Being attentive to such vaguely expressed emotions may require nursing staff to be sensitive and open. The VR-CoDES can be applied to audio recorded home care visits to analyse verbal and emotional communication, and may allow comparative research. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.
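The percentages in the abstract above can be re-derived from the reported counts. The short Python sketch below does this under one assumption that the abstract does not state explicitly: that the shares of 63.9%, 15.8% and 63% are taken over all coded expressions, that is, cues plus concerns (292 + 24 = 316). The helper name `pct` is illustrative only.

```python
def pct(part, whole, digits=1):
    """Return part/whole as a percentage rounded to `digits` decimals."""
    return round(100.0 * part / whole, digits)

# Counts reported in the abstract
visits_total = 188
visits_with_expressions = 95
cues, concerns = 292, 24

# Assumption: shares are computed over all coded expressions (cues + concerns)
expressions = cues + concerns

share_visits = pct(visits_with_expressions, visits_total, 0)
share_hints = pct(202, expressions)
share_vague = pct(50, expressions)
share_staff_elicited = pct(200, expressions, 0)

print(expressions, share_visits, share_hints, share_vague, share_staff_elicited)
```

Under that assumption the recomputed values agree with the reported 51%, 63.9%, 15.8% and 63%.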
Metatranscriptome sequence analysis reveals diel periodicity of microbial community gene expression in the ocean's interior

NASA Astrophysics Data System (ADS)

Vislova, A.; Aylward, F.; Sosa, O.; DeLong, E.

2016-02-01

Previous work has revealed diel periodicity of gene expression in key metabolic pathways in both autotrophic and heterotrophic microbes in the surface ocean. In this study, we investigated patterns of diel periodicity of gene expression in depth profiles (25, 75, 125 and 250 meters). We postulated that microbial diel transcriptional signals would be increasingly dampened with depth, and that the timing of peak expression of specific transcripts would be shifted in time between depths, in accordance with depth-dependent diel light variability. Bacterioplankton were sampled from four depths every four hours at station ALOHA (22° 45' N 158° W) over 2 days. RNA was extracted from cells preserved on filters, converted to cDNA, and sequenced on the Illumina platform. Surprisingly, harmonic regression analysis revealed an increasing proportion of genes with diel periodic expression patterns with increasing depth between 25 and 125 meters. At 250 meters, the proportion of genes exhibiting diel expression patterns decreased by an order of magnitude compared to the photic zone. Community composition, functional gene categories, and diel patterns of gene expression were significantly different between the photic zone and 250 meter samples. The signals driving diel periodic gene expression in microbes at 250 meters are under further investigation. These data are now beginning to provide a better understanding of the tempo and mode of microbial dynamics among specific taxa throughout the ocean's interior.
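The abstract above names harmonic regression but gives no details of the model. As a minimal sketch, assuming a single 24-hour harmonic fitted by ordinary least squares (the authors' exact model, normalization and significance testing are not described here), the idea can be illustrated in plain Python. The function names `solve3` and `harmonic_fit` and the synthetic data are hypothetical.

```python
import math

def solve3(A, b):
    """Solve a 3x3 linear system by Gaussian elimination with partial pivoting."""
    M = [row[:] + [rhs] for row, rhs in zip(A, b)]
    n = 3
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(M[r][col]))
        M[col], M[pivot] = M[pivot], M[col]
        for r in range(col + 1, n):
            f = M[r][col] / M[col][col]
            for c in range(col, n + 1):
                M[r][c] -= f * M[col][c]
    x = [0.0] * n
    for i in reversed(range(n)):
        s = M[i][n] - sum(M[i][j] * x[j] for j in range(i + 1, n))
        x[i] = s / M[i][i]
    return x

def harmonic_fit(t_hours, y, period=24.0):
    """Fit y ~ m + a*cos(wt) + b*sin(wt); return (amplitude, peak_hour, r_squared)."""
    w = 2.0 * math.pi / period
    rows = [(1.0, math.cos(w * t), math.sin(w * t)) for t in t_hours]
    # Normal equations: (X^T X) beta = X^T y
    XtX = [[sum(r[i] * r[j] for r in rows) for j in range(3)] for i in range(3)]
    Xty = [sum(r[i] * yi for r, yi in zip(rows, y)) for i in range(3)]
    m, a, b = solve3(XtX, Xty)
    fitted = [m + a * r[1] + b * r[2] for r in rows]
    mean_y = sum(y) / len(y)
    ss_res = sum((yi - fi) ** 2 for yi, fi in zip(y, fitted))
    ss_tot = sum((yi - mean_y) ** 2 for yi in y)
    r2 = 1.0 - ss_res / ss_tot if ss_tot > 0 else 0.0
    amplitude = math.hypot(a, b)
    peak_hour = (math.atan2(b, a) / w) % period
    return amplitude, peak_hour, r2

# Synthetic transcript sampled every 4 h over 2 days, constructed to peak at hour 14
t = list(range(0, 48, 4))
y = [10.0 + 3.0 * math.cos(2.0 * math.pi * (ti - 14) / 24.0) for ti in t]
amp, peak, r2 = harmonic_fit(t, y)
print(round(amp, 3), round(peak, 3), round(r2, 3))
```

In a real analysis each gene's fit would also need a significance test and multiple-testing correction before it is called diel periodic; the sketch only recovers amplitude and peak time, which is the quantity the abstract expected to shift between depths.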
Identification of human chromosome 22 transcribed sequences with ORF expressed sequence tags

PubMed Central

de Souza, Sandro J.; Camargo, Anamaria A.; Briones, Marcelo R. S.; Costa, Fernando F.; Nagai, Maria Aparecida; Verjovski-Almeida, Sergio; Zago, Marco A.; Andrade, Luis Eduardo C.; Carrer, Helaine; El-Dorry, Hamza F. A.; Espreafico, Enilza M.; Habr-Gama, Angelita; Giannella-Neto, Daniel; Goldman, Gustavo H.; Gruber, Arthur; Hackel, Christine; Kimura, Edna T.; Maciel, Rui M. B.; Marie, Suely K. N.; Martins, Elizabeth A. L.; Nóbrega, Marina P.; Paçó-Larson, Maria Luisa; Pardini, Maria Inês M. C.; Pereira, Gonçalo G.; Pesquero, João Bosco; Rodrigues, Vanderlei; Rogatto, Silvia R.; da Silva, Ismael D. C. G.; Sogayar, Mari C.; de Fátima Sonati, Maria; Tajara, Eloiza H.; Valentini, Sandro R.; Acencio, Marcio; Alberto, Fernando L.; Amaral, Maria Elisabete J.; Aneas, Ivy; Bengtson, Mário Henrique; Carraro, Dirce M.; Carvalho, Alex F.; Carvalho, Lúcia Helena; Cerutti, Janete M.; Corrêa, Maria Lucia C.; Costa, Maria Cristina R.; Curcio, Cyntia; Gushiken, Tsieko; Ho, Paulo L.; Kimura, Elza; Leite, Luciana C. C.; Maia, Gustavo; Majumder, Paromita; Marins, Mozart; Matsukuma, Adriana; Melo, Analy S. A.; Mestriner, Carlos Alberto; Miracca, Elisabete C.; Miranda, Daniela C.; Nascimento, Ana Lucia T. O.; Nóbrega, Francisco G.; Ojopi, Élida P. B.; Pandolfi, José Rodrigo C.; Pessoa, Luciana Gilbert; Rahal, Paula; Rainho, Claudia A.; da Ro's, Nancy; de Sá, Renata G.; Sales, Magaly M.; da Silva, Neusa P.; Silva, Tereza C.; da Silva, Wilson; Simão, Daniel F.; Sousa, Josane F.; Stecconi, Daniella; Tsukumo, Fernando; Valente, Valéria; Zalcberg, Heloisa; Brentani, Ricardo R.; Reis, Luis F. L.; Dias-Neto, Emmanuel; Simpson, Andrew J. G.

2000-01-01

Transcribed sequences in the human genome can be identified with confidence only by alignment with sequences derived from cDNAs synthesized from naturally occurring mRNAs. We constructed a set of 250,000 cDNAs that represent partial expressed gene sequences and that are biased toward the central coding regions of the resulting transcripts. They are termed ORF expressed sequence tags (ORESTES). The 250,000 ORESTES were assembled into 81,429 contigs. Of these, 1,181 (1.45%) were found to match sequences in chromosome 22 with at least one ORESTES contig for 162 (65.6%) of the 247 known genes, for 67 (44.6%) of the 150 related genes, and for 45 of the 148 (30.4%) EST-predicted genes on this chromosome. Using a set of stringent criteria to validate our sequences, we identified a further 219 previously unannotated transcribed sequences on chromosome 22. Of these, 171 were in fact also defined by EST or full length cDNA sequences available in GenBank but not utilized in the initial annotation of the first human chromosome sequence. Thus despite representing less than 15% of all expressed human sequences in the public databases at the time of the present analysis, ORESTES sequences defined 48 transcribed sequences on chromosome 22 not defined by other sequences. All of the transcribed sequences defined by ORESTES coincided with DNA regions predicted as encoding exons by GENSCAN (http://genes.mit.edu/GENSCAN.html). PMID:11070084
The FDA's Experience with Emerging Genomics Technologies-Past, Present, and Future.

PubMed

Xu, Joshua; Thakkar, Shraddha; Gong, Binsheng; Tong, Weida

2016-07-01

The rapid advancement of emerging genomics technologies and their application for assessing safety and efficacy of FDA-regulated products require a high standard of reliability and robustness supporting regulatory decision-making in the FDA. To facilitate regulatory application and to engage stakeholders, the FDA implemented a novel data submission program, Voluntary Genomics Data Submission (VGDS). As part of the endeavor, for the past 10 years, the FDA has led an international consortium of regulatory agencies, academia, pharmaceutical companies, and genomics platform providers, which was named MicroArray Quality Control Consortium (MAQC), to address issues such as reproducibility, precision, specificity/sensitivity, and data interpretation. Three projects have been completed so far assessing these genomics technologies: gene expression microarrays, whole genome genotyping arrays, and whole transcriptome sequencing (i.e., RNA-seq). The resultant studies provide the basic parameters for fit-for-purpose application of these new data streams in regulatory environments, and the solutions have been made available to the public through peer-reviewed publications. The latest MAQC project, also called the SEquencing Quality Control (SEQC) project, focused on next-generation sequencing. Using reference samples with built-in controls, SEQC studies have demonstrated that relative gene expression can be measured accurately and reliably across laboratories and RNA-seq platforms. Besides prediction performance comparable to microarrays in clinical settings and safety assessments, RNA-seq is shown to have better sensitivity for low expression and to reveal novel transcriptomic features. Future effort of MAQC will be focused on quality control of whole genome sequencing and targeted sequencing.
Vesicular monoamine transporter-1 (VMAT-1) mRNA and immunoreactive proteins in mouse brain.

PubMed

Ashe, Karen M; Chiu, Wan-Ling; Khalifa, Ahmed M; Nicolas, Antoine N; Brown, Bonnie L; De Martino, Randall R; Alexander, Clayton P; Waggener, Christopher T; Fischer-Stenger, Krista; Stewart, Jennifer K

2011-01-01

Vesicular monoamine transporter 1 (VMAT-1) mRNA and protein were examined (1) to determine whether adult mouse brain expresses full-length VMAT-1 mRNA that can be translated to functional transporter protein and (2) to compare immunoreactive VMAT-1 proteins in brain and adrenal. VMAT-1 mRNA was detected in mouse brain with RT-PCR. The cDNA was sequenced, cloned into an expression vector, transfected into COS-1 cells, and cell protein was assayed for VMAT-1 activity. Immunoreactive proteins were examined on western blots probed with four different antibodies to VMAT-1. Sequencing confirmed identity of the entire coding sequences of VMAT-1 cDNA from mouse medulla oblongata/pons and adrenal to a GenBank reference sequence. Transfection of the brain cDNA into COS-1 cells resulted in transporter activity that was blocked by the VMAT inhibitor reserpine and a proton ionophore, but not by tetrabenazine, which has a high affinity for VMAT-2. Antibodies to either the C- or N-terminus of VMAT-1 detected two proteins (73 and 55 kD) in transfected COS-1 cells. The C-terminal antibodies detected both proteins in extracts of mouse medulla/pons, cortex, hypothalamus, and cerebellum but only the 73 kD protein and higher molecular weight immunoreactive proteins in mouse adrenal and rat PC12 cells, which are positive controls for rodent VMAT-1. These findings demonstrate that a functional VMAT-1 mRNA coding sequence is expressed in mouse brain and suggest processing of VMAT-1 protein differs in mouse adrenal and brain.
The FDA’s Experience with Emerging Genomics Technologies—Past, Present, and Future

PubMed Central

Xu, Joshua; Thakkar, Shraddha; Gong, Binsheng; Tong, Weida

2016-01-01

The rapid advancement of emerging genomics technologies and their application for assessing safety and efficacy of FDA-regulated products require a high standard of reliability and robustness supporting regulatory decision-making in the FDA. To facilitate regulatory application and to engage stakeholders, the FDA implemented a novel data submission program, Voluntary Genomics Data Submission (VGDS). As part of the endeavor, for the past 10 years, the FDA has led an international consortium of regulatory agencies, academia, pharmaceutical companies, and genomics platform providers, which was named MicroArray Quality Control Consortium (MAQC), to address issues such as reproducibility, precision, specificity/sensitivity, and data interpretation. Three projects have been completed so far assessing these genomics technologies: gene expression microarrays, whole genome genotyping arrays, and whole transcriptome sequencing (i.e., RNA-seq). The resultant studies provide the basic parameters for fit-for-purpose application of these new data streams in regulatory environments, and the solutions have been made available to the public through peer-reviewed publications. The latest MAQC project, also called the SEquencing Quality Control (SEQC) project, focused on next-generation sequencing. Using reference samples with built-in controls, SEQC studies have demonstrated that relative gene expression can be measured accurately and reliably across laboratories and RNA-seq platforms. Besides prediction performance comparable to microarrays in clinical settings and safety assessments, RNA-seq is shown to have better sensitivity for low expression and to reveal novel transcriptomic features. Future effort of MAQC will be focused on quality control of whole genome sequencing and targeted sequencing. PMID:27116022
Creation and characterization of an airway epithelial cell line for stable expression of CFTR variants

PubMed Central

Gottschalk, Laura B.; Vecchio-Pagan, Briana; Sharma, Neeraj; Han, Sangwoo T.; Franca, Arianna; Wohler, Elizabeth S.; Batista, Denise A.S.; Goff, Loyal A.; Cutting, Garry R.

2016-01-01

Background Analysis of the functional consequences and treatment response of rare CFTR variants is challenging due to the limited availability of primary airway cells. Methods A Flp recombination target (FRT) site for stable expression of CFTR was incorporated into an immortalized CF bronchial epithelial cell line (CFBE41o−). CFTR cDNA was integrated into the FRT site. Expression was evaluated by western blotting and confocal microscopy, and function was measured by short circuit current. RNA sequencing was used to compare the transcriptional profile of the resulting CF8Flp cell line to primary cells and tissues. Results Functional CFTR was expressed from integrated cDNA at the FRT site of the CF8Flp cell line at levels comparable to those seen in native airway cells. CF8Flp cells expressing WT-CFTR have a stable transcriptome comparable to that of primary cultured airway epithelial cells, including genes that play key roles in CFTR pathways. Conclusion CF8Flp cells provide a viable substitute for primary CF airway cells for the analysis of CFTR variants in a native context. PMID:26694805
A genetically adjuvanted influenza B virus vector increases immunogenicity and protective efficacy in mice.

PubMed

Kittel, Christian; Wressnigg, Nina; Shurygina, Anna Polina; Wolschek, Markus; Stukova, Marina; Romanovskaya-Romanko, Ekatherina; Romanova, Julia; Kiselev, Oleg; Muster, Thomas; Egorov, Andrej

2015-10-01

The existence of multiple antigenically distinct types and subtypes of influenza viruses allows the construction of a multivalent vector system for the mucosal delivery of foreign sequences. Influenza A viruses have been exploited successfully for the expression of extraneous antigens as well as immunostimulatory molecules. In this study, we describe the development of an influenza B virus vector whose functional part of the interferon antagonist NS1 was replaced by human interleukin 2 (IL2) as a genetic adjuvant. We demonstrate that IL2 expressed by this viral vector displays immune adjuvant activity in immunized mice. Animals vaccinated with the IL2 viral vector showed an increased hemagglutination inhibition antibody response and higher protective efficacy after challenge with a wild-type influenza B virus when compared to mice vaccinated with a control virus. Our results demonstrate that it is feasible to construct influenza B vaccine strains expressing immune-potentiating foreign sequences from the NS genomic segment. Based on these data, it is now hypothetically possible to create a trivalent (or quadrivalent) live attenuated influenza vaccine in which each component expresses a selected genetic adjuvant with tailored expression levels.
A synthetic promoter library for constitutive gene expression in Lactobacillus plantarum.

PubMed

Rud, Ida; Jensen, Peter Ruhdal; Naterstad, Kristine; Axelsson, Lars

2006-04-01

A synthetic promoter library (SPL) for Lactobacillus plantarum has been developed, which generalizes the approach for obtaining synthetic promoters. The consensus sequence, derived from rRNA promoters extracted from the L. plantarum WCFS1 genome, was kept constant, and the non-consensus sequences were randomized. Construction of the SPL was performed in a vector (pSIP409) previously developed for high-level, inducible gene expression in L. plantarum and Lactobacillus sakei. A wide range of promoter strengths was obtained with the approach, covering 3-4 logs of expression levels in small increments of activity. The SPL was evaluated for the ability to drive beta-glucuronidase (GusA) and aminopeptidase N (PepN) expression. Protein production from the synthetic promoters was constitutive, and the most potent promoters gave high protein production with levels comparable to those of native rRNA promoters, and production of PepN protein corresponding to approximately 10-15 % of the total cellular protein. High correlation was obtained between the activities of promoters when tested in L. sakei and L. plantarum, which indicates the potential of the SPL for other Lactobacillus species. The SPL enables fine-tuning of stable gene expression for various applications in L. plantarum.
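The design principle described above (consensus positions held constant, non-consensus positions randomized) can be sketched in a few lines of Python. This is an illustration only: the template below is hypothetical and uses the textbook bacterial -35 (TTGACA) and -10 (TATAAT) hexamers with a 17-base spacer as stand-ins, not the L. plantarum rRNA-derived consensus used by the authors, which the abstract does not give. The names `TEMPLATE` and `make_library` are likewise assumptions.

```python
import random

# Hypothetical template: uppercase bases are fixed "consensus" positions,
# N marks a position to be randomized. Not the consensus from the paper.
TEMPLATE = "N" * 5 + "TTGACA" + "N" * 17 + "TATAAT" + "N" * 6

def make_library(template, size, seed=0):
    """Return `size` distinct sequences with every N replaced by a random base."""
    rng = random.Random(seed)  # seeded so the sketch is reproducible
    library = set()
    while len(library) < size:
        seq = "".join(rng.choice("ACGT") if ch == "N" else ch for ch in template)
        library.add(seq)
    return sorted(library)

library = make_library(TEMPLATE, size=20)
for seq in library[:3]:
    print(seq)
```

Each generated variant keeps the fixed boxes in place while the surrounding context differs, which is the property the abstract credits for producing a graded range of promoter strengths; the actual strengths would of course have to be measured with a reporter such as GusA.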
ASR5 is involved in the regulation of miRNA expression in rice.

PubMed

Neto, Lauro Bücker; Arenhart, Rafael Augusto; de Oliveira, Luiz Felipe Valter; de Lima, Júlio Cesar; Bodanese-Zanettini, Maria Helena; Margis, Rogerio; Margis-Pinheiro, Márcia

2015-11-01

The work describes an ASR knockdown transcriptomic analysis by deep sequencing of rice root seedlings and the transactivation of ASR cis-acting elements in the upstream region of a MIR gene. MicroRNAs are key regulators of gene expression that guide post-transcriptional control of plant development and responses to environmental stresses. ASR (ABA, Stress and Ripening) proteins are plant-specific transcription factors with key roles in different biological processes. In rice, ASR proteins have been suggested to participate in the regulation of stress response genes. This work describes the transcriptomic analysis by deep sequencing of two libraries, comparing miRNA abundance from the roots of transgenic ASR5 knockdown rice seedlings with that of the roots of wild-type non-transformed rice seedlings. Members of 59 miRNA families were detected, and 276 mature miRNAs were identified. Our analysis detected 112 miRNAs that were differentially expressed between the two libraries. A predicted inverse correlation between miR167abc and its target gene (LOC_Os07g29820) was confirmed using RT-qPCR. Protoplast transactivation assays showed that ASR5 is able to recognize binding sites upstream of the MIR167a gene and drive its expression in vivo. Together, our data establish a comparative study of miRNAome profiles, and this is the first study to suggest the involvement of ASR proteins in miRNA gene regulation.

Expressed sequence tags from the flower pathogen Claviceps purpurea.

PubMed

Oeser, Birgitt; Beaussart, François; Haarmann, Thomas; Lorenz, Nicole; Nathues, Eva; Rolke, Yvonne; Scheffer, Jan; Weiner, January; Tudzynski, Paul

2009-09-01

SUMMARY The ascomycete Claviceps purpurea (ergot) is a biotrophic flower pathogen of rye and other grasses. The deleterious toxic effects of infected rye seeds on humans and grazing animals have been known since the Middle Ages. To gain further insight into the molecular basis of this disease, we generated about 10 000 expressed sequence tags (ESTs), about 25% originating from axenic fungal culture and about 75% from tissues collected 6-20 days after infection of rye spikes. The pattern of axenic vs. in planta gene expression was compared. About 200 putative plant genes were identified within the in planta library. A high percentage of these were predicted to function in plant defence against the ergot fungus and other pathogens, for example pathogenesis-related proteins. Potential fungal pathogenicity and virulence genes were found via comparison with the pathogen-host interaction database (PHI-base; http://www.phi-base.org) and with genes known to be highly expressed in the haustoria of the bean rust fungus. Comparative analysis of Claviceps and two other fungal flower pathogens (necrotrophic Fusarium graminearum and biotrophic Ustilago maydis) highlighted similarities and differences in their lifestyles; for example, all three fungi have signalling components and cell wall-degrading enzymes in their arsenal. In summary, the analysis of axenic and in planta ESTs yielded a collection of candidate genes to be evaluated for functional roles in this plant-microbe interaction.
Searching for resistance genes to Bursaphelenchus xylophilus using high throughput screening

PubMed Central

2012-01-01

Background Pine wilt disease (PWD), caused by the pinewood nematode (PWN; Bursaphelenchus xylophilus), damages and kills pine trees and is causing serious economic damage worldwide. Although the ecological mechanism of infestation is well described, the plant’s molecular response to the pathogen is not well known. This is due mainly to the lack of genomic information and the complexity of the disease. High throughput sequencing is now an efficient approach for detecting the expression of genes in non-model organisms, thus providing valuable information in spite of the lack of the genome sequence. In an attempt to unravel genes potentially involved in the pine defense against the pathogen, we hereby report the high throughput comparative sequence analysis of infested and non-infested stems of Pinus pinaster (very susceptible to PWN) and Pinus pinea (less susceptible to PWN). Results Four cDNA libraries from infested and non-infested stems of P. pinaster and P. pinea were sequenced in a full 454 GS FLX run, producing a total of 2,083,698 reads. The putative amino acid sequences encoded by the assembled transcripts were annotated according to Gene Ontology, to assign Pinus contigs into Biological Processes, Cellular Components and Molecular Functions categories. Most of the annotated transcripts corresponded to Picea genes (25.4-39.7%), whereas a smaller percentage matched Pinus genes (1.8-12.8%), probably a consequence of more public genomic information available for Picea than for Pinus. The comparative transcriptome analysis showed that when P. pinaster was infested with PWN, the genes malate dehydrogenase, ABA, water deficit stress related genes and PAR1 were highly expressed, while in PWN-infested P. pinea, the highly expressed genes were ricin B-related lectin, and genes belonging to the SNARE and high mobility group families. Quantitative PCR experiments confirmed the differential gene expression between the two pine species.
Conclusions Defense-related genes triggered by nematode infestation were detected in both P. pinaster and P. pinea transcriptomes utilizing 454 pyrosequencing technology. P. pinaster showed higher abundance of genes related to transcriptional regulation, terpenoid secondary metabolism (including some with nematicidal activity) and pathogen attack. P. pinea showed higher abundance of genes related to oxidative stress and higher levels of expression in general of stress responsive genes. This study provides essential information about the molecular defense mechanisms utilized by P. pinaster and P. pinea against PWN infestation and contributes to a better understanding of PWD. PMID:23134679
Dcode.org anthology of comparative genomic tools.

PubMed

Loots, Gabriela G; Ovcharenko, Ivan

2005-07-01

Comparative genomics provides the means to demarcate functional regions in anonymous DNA sequences. The successful application of this method to identifying novel genes is currently shifting to deciphering the non-coding encryption of gene regulation across genomes. To facilitate the practical application of comparative sequence analysis to genetics and genomics, we have developed several analytical and visualization tools for the analysis of arbitrary sequences and whole genomes. These tools include two alignment tools, zPicture and Mulan; a phylogenetic shadowing tool, eShadow for identifying lineage- and species-specific functional elements; two evolutionary conserved transcription factor analysis tools, rVista and multiTF; a tool for extracting cis-regulatory modules governing the expression of co-regulated genes, Creme 2.0; and a dynamic portal to multiple vertebrate and invertebrate genome alignments, the ECR Browser. Here, we briefly describe each one of these tools and provide specific examples on their practical applications. All the tools are publicly available at the http://www.dcode.org/ website.
Differences in Cell Morphometry, Cell Wall Topography and Gp70 Expression Correlate with the Virulence of Sporothrix brasiliensis Clinical Isolates

PubMed Central

Castro, Rafaela A.; Kubitschek-Barreira, Paula H.; Teixeira, Pedro A. C.; Sanches, Glenda F.; Teixeira, Marcus M.; Quintella, Leonardo P.; Almeida, Sandro R.; Costa, Rosane O.; Camargo, Zoilo P.; Felipe, Maria S. S.; de Souza, Wanderley; Lopes-Bezerra, Leila M.

2013-01-01

Sporotrichosis is a chronic infectious disease affecting both humans and animals. For many years, this subcutaneous mycosis had been attributed to a single etiological agent; however, it is now known that this taxon consists of a complex of at least four pathogenic species, including Sporothrix schenckii and Sporothrix brasiliensis. Gp70 was previously shown to be an important antigen and adhesin expressed on the fungal cell surface and may have a key role in immunomodulation and host response. The aim of this work was to study the virulence, morphometry, cell surface topology and gp70 expression of clinical isolates of S. brasiliensis compared with two reference strains of S. schenckii. Several clinical isolates related to severe human cases or associated with the Brazilian zoonotic outbreak of sporotrichosis were genotyped and clustered as S. brasiliensis. Interestingly, in a murine subcutaneous model of sporotrichosis, these isolates showed a higher virulence profile compared with S. schenckii. A single S. brasiliensis isolate from an HIV-positive patient not only showed lower virulence but also presented differences in cell morphometry, cell wall topography and abundant gp70 expression compared with the virulent isolates. In contrast, the highly virulent S. brasiliensis isolates showed reduced levels of cell wall gp70. These observations were confirmed by the topographical location of the gp70 antigen using immunoelectromicroscopy in both species. In addition, the gp70 molecule was sequenced and identified using mass spectrometry, and the sequenced peptides were aligned into predicted proteins using Blastp with the S. schenckii and S. brasiliensis genomes. PMID:24116065
Genome-Wide Classification and Evolutionary and Expression Analyses of Citrus MYB Transcription Factor Families in Sweet Orange

PubMed Central

Hou, Xiao-Jin; Li, Si-Bei; Liu, Sheng-Rui; Hu, Chun-Gen; Zhang, Jin-Zhi

2014-01-01

MYB family genes are widely distributed in plants and comprise one of the largest transcription factor families involved in various developmental processes and defense responses of plants. To date, few MYB genes and little expression profiling have been reported for citrus. Here, we describe and classify 177 members of the sweet orange MYB gene (CsMYB) family in terms of their genomic gene structures and similarity to their putative Arabidopsis orthologs. According to these analyses, these CsMYBs were categorized into four groups (4R-MYB, 3R-MYB, 2R-MYB and 1R-MYB). Gene structure analysis revealed that 1R-MYB genes possess relatively more introns as compared with 2R-MYB genes. Investigation of their chromosomal localizations revealed that these CsMYBs are distributed across nine chromosomes. Sweet orange includes a relatively small number of MYB genes compared with the 198 members in Arabidopsis, presumably due to a paralog reduction related to repetitive sequence insertion into promoter and non-coding transcribed regions of the genes. Comparative studies of CsMYBs and Arabidopsis showed that CsMYBs had fewer gene duplication events. Expression analysis revealed that the MYB gene family has a wide expression profile in sweet orange development and plays important roles in development and stress responses. In addition, 337 new putative microsatellites with flanking sequences sufficient for primer design were also identified from the 177 CsMYBs. These results provide a useful reference for the selection of candidate MYB genes for cloning and further functional analysis for citrus. PMID:25375352
Next-generation sequencing identifies major DNA methylation changes during progression of Ph+ chronic myeloid leukemia

PubMed Central

Heller, G; Topakian, T; Altenberger, C; Cerny-Reiterer, S; Herndlhofer, S; Ziegler, B; Datlinger, P; Byrgazov, K; Bock, C; Mannhalter, C; Hörmann, G; Sperr, W R; Lion, T; Zielinski, C C; Valent, P; Zöchbauer-Müller, S

2016-01-01

Little is known about the impact of DNA methylation on the evolution/progression of Ph+ chronic myeloid leukemia (CML). We investigated the methylome of CML patients in chronic phase (CP-CML), accelerated phase (AP-CML) and blast crisis (BC-CML) as well as in controls by reduced representation bisulfite sequencing. Although only ~600 differentially methylated CpG sites were identified in samples obtained from CP-CML patients compared with controls, ~6500 differentially methylated CpG sites were found in samples from BC-CML patients. In the majority of affected CpG sites, methylation was increased. In CP-CML patients who progressed to AP-CML/BC-CML, we identified up to 897 genes that were methylated at the time of progression but not at the time of diagnosis. Using RNA-sequencing, we observed downregulated expression of many of these genes in BC-CML compared with CP-CML samples. Several of them are well-known tumor-suppressor genes or regulators of cell proliferation, and gene re-expression was observed by the use of epigenetically active drugs. Together, our results demonstrate that CpG site methylation clearly increases during CML progression and that it may provide a useful basis for revealing new targets of therapy in advanced CML. PMID:27211271
Toward reliable biomarker signatures in the age of liquid biopsies - how to standardize the small RNA-Seq workflow

PubMed Central

Buschmann, Dominik; Haberberger, Anna; Kirchner, Benedikt; Spornraft, Melanie; Riedmaier, Irmgard; Schelling, Gustav; Pfaffl, Michael W.

2016-01-01

Small RNA-Seq has emerged as a powerful tool in transcriptomics, gene expression profiling and biomarker discovery. Sequencing cell-free nucleic acids, particularly microRNA (miRNA), from liquid biopsies additionally provides exciting possibilities for molecular diagnostics, and might help establish disease-specific biomarker signatures. The complexity of the small RNA-Seq workflow, however, bears challenges and biases that researchers need to be aware of in order to generate high-quality data. Rigorous standardization and extensive validation are required to guarantee reliability, reproducibility and comparability of research findings. Hypotheses based on flawed experimental conditions can be inconsistent and even misleading. Comparable to the well-established MIQE guidelines for qPCR experiments, this work aims at establishing guidelines for experimental design and pre-analytical sample processing, standardization of library preparation and sequencing reactions, as well as facilitating data analysis. We highlight bottlenecks in small RNA-Seq experiments, point out the importance of stringent quality control and validation, and provide a primer for differential expression analysis and biomarker discovery. Following our recommendations will encourage better sequencing practice, increase experimental transparency and lead to more reproducible small RNA-Seq results. This will ultimately enhance the validity of biomarker signatures, and allow reliable and robust clinical predictions. PMID:27317696
Genome Sequence, Assembly and Characterization of Two Metschnikowia fructicola Strains Used as Biocontrol Agents of Postharvest Diseases

PubMed Central

Piombo, Edoardo; Sela, Noa; Wisniewski, Michael; Hoffmann, Maria; Gullino, Maria L.; Allard, Marc W.; Levin, Elena; Spadaro, Davide; Droby, Samir

2018-01-01

The yeast Metschnikowia fructicola was reported as an efficient biological control agent of postharvest diseases of fruits and vegetables, and it is the basis of the commercial formulated product “Shemer.” Several mechanisms of action by which M. fructicola inhibits postharvest pathogens were suggested including iron-binding compounds, induction of defense signaling genes, production of fungal cell wall degrading enzymes and relatively high amounts of superoxide anions. We assembled the whole genome sequence of two strains of M. fructicola using PacBio and Illumina shotgun sequencing technologies. Using the PacBio, a high-quality draft genome consisting of 93 contigs, with an estimated genome size of approximately 26 Mb, was obtained. Comparative analysis of M. fructicola proteins with the other three available closely related genomes revealed a shared core of homologous proteins coded by 5,776 genes. Comparing the genomes of the two M. fructicola strains using a SNP calling approach resulted in the identification of 564,302 homologous SNPs with 2,004 predicted high impact mutations. The size of the genome is exceptionally high when compared with those of available closely related organisms, and the high rate of homology among M. fructicola genes points toward a recent whole-genome duplication event as the cause of this large genome. Based on the assembled genome, sequences were annotated with a gene description and gene ontology (GO term) and clustered in functional groups. Analysis of CAZymes family genes revealed 1,145 putative genes, and transcriptomic analysis of CAZyme expression levels in M. fructicola during its interaction with either grapefruit peel tissue or Penicillium digitatum revealed a high level of CAZyme gene expression when the yeast was placed in wounded fruit tissue. PMID:29666611
Application of Cydia pomonella expressed sequence tags: identification and expression of three general odorant binding proteins in codling moth

USDA-ARS's Scientific Manuscript database

The codling moth, Cydia pomonella, is one of the most important pests of pome fruits in the world, yet the molecular genetics and physiology of this insect remains poorly understood. A combined assembly of 8340 expressed sequence tags (ESTs) was generated from Roche 454 GS-FLX sequencing of 8 tissu...
Diversity, expression and mRNA targeting abilities of Argonaute-targeting miRNAs among selected vascular plants.

PubMed

Jagtap, Soham; Shivaprasad, Padubidri V

2014-12-02

Micro (mi)RNAs are important regulators of plant development. Across plant lineages, Dicer-like 1 (DCL1) proteins process long ds-like structures to produce micro (mi)RNA duplexes in a stepwise manner. These miRNAs are incorporated into Argonaute (AGO) proteins and influence expression of RNAs that have sequence complementarity with miRNAs. Expression levels of AGOs are greatly regulated by plants in order to minimize unwarranted perturbations using miRNAs to target mRNAs coding for AGOs. AGOs may also have high promoter specificity; sometimes expression of AGO can be limited to just a few cells in a plant. Viral pathogens utilize various means to counter antiviral roles of AGOs including hijacking the host encoded miRNAs to target AGOs. Two host encoded miRNAs, namely miR168 and miR403, that target AGOs have been described in the model plant Arabidopsis and such a mechanism is thought to be well conserved across plants because AGO sequences are well conserved. We show that the interaction between AGO mRNAs and miRNAs is species-specific due to the diversity in sequences of two miRNAs that target AGOs, sequence diversity among corresponding target regions in AGO mRNAs and variable expression levels of these miRNAs among vascular plants. We used miRNA sequences from 68 plant species representing 31 plant families for this analysis. Sequences of miR168 and miR403 are not conserved among plant lineages, but surprisingly they differ drastically in their sequence diversity and expression levels even among closely related plants. Variation in miR168 expression among plants correlates well with secondary structures/length of loop sequences of their precursors. Our data indicates a complex AGO targeting interaction among plant lineages due to miRNA sequence diversity and sequences of miRNA targeting regions among AGO mRNAs, thus leading to the assumption that the perturbations by viruses that use host miRNAs to target antiviral AGOs can only be species-specific. We also show that rapid evolution and likely loss of expression of miR168 isoforms in tobacco is related to the insertion of MITE-like transposons between miRNA and miRNA* sequences, a possible mechanism showing how miRNAs are lost in a few plant lineages even though other close relatives have abundantly expressed miRNAs.
In silico analysis of expressed sequence tags from Trichostrongylus vitrinus (Nematoda): comparison of the automated ESTExplorer workflow platform with conventional database searches.

PubMed

Nagaraj, Shivashankar H; Gasser, Robin B; Nisbet, Alasdair J; Ranganathan, Shoba

2008-01-01

The analysis of expressed sequence tags (EST) offers a rapid and cost effective approach to elucidate the transcriptome of an organism, but requires several computational methods for assembly and annotation. Researchers frequently analyse each step manually, which is laborious and time consuming. We have recently developed ESTExplorer, a semi-automated computational workflow system, in order to achieve the rapid analysis of EST datasets. In this study, we evaluated EST data analysis for the parasitic nematode Trichostrongylus vitrinus (order Strongylida) using ESTExplorer, compared with database matching alone. We functionally annotated 1776 ESTs obtained via suppressive-subtractive hybridisation from T. vitrinus, an important parasitic trichostrongylid of small ruminants. Cluster and comparative genomic analyses of the transcripts using ESTExplorer indicated that 290 (41%) sequences had homologues in Caenorhabditis elegans, 329 (42%) in parasitic nematodes, 202 (28%) in organisms other than nematodes, and 218 (31%) had no significant match to any sequence in the current databases. Of the C. elegans homologues, 90 were associated with 'non-wildtype' double-stranded RNA interference (RNAi) phenotypes, including embryonic lethality, maternal sterility, sterile progeny, larval arrest and slow growth. We could functionally classify 267 (38%) sequences using the Gene Ontologies (GO) and establish pathway associations for 230 (33%) sequences using the Kyoto Encyclopedia of Genes and Genomes (KEGG). Further examination of this EST dataset revealed a number of signalling molecules, proteases, protease inhibitors, enzymes, ion channels and immune-related genes. In addition, we identified 40 putative secreted proteins that could represent potential candidates for developing novel anthelmintics or vaccines. We further compared the automated EST sequence annotations, using ESTExplorer, with database search results for individual T. vitrinus ESTs. 
ESTExplorer reliably and rapidly annotated 301 ESTs, with pathway and GO information, eliminating 60 low quality hits from database searches. We evaluated the efficacy of ESTExplorer in analysing EST data, and demonstrate that computational tools can be used to accelerate the process of gene discovery in EST sequencing projects. The present study has elucidated sets of relatively conserved and potentially novel genes for biological investigation, and the annotated EST set provides further insight into the molecular biology of T. vitrinus, towards the identification of novel drug targets.
Distinct profiles of expressed sequence tags during intestinal regeneration in the sea cucumber Holothuria glaberrima

PubMed Central

Rojas-Cartagena, Carmencita; Ortíz-Pineda, Pablo; Ramírez-Gómez, Francisco; Suárez-Castillo, Edna C.; Matos-Cruz, Vanessa; Rodríguez, Carlos; Ortíz-Zuazaga, Humberto; García-Arrarás, José E.

2010-01-01

Repair and regeneration are key processes for tissue maintenance, and their disruption may lead to disease states. Little is known about the molecular mechanisms that underlie the repair and regeneration of the digestive tract. The sea cucumber Holothuria glaberrima represents an excellent model to dissect and characterize the molecular events during intestinal regeneration. To study the gene expression profile, cDNA libraries were constructed from normal, 3-day, and 7-day regenerating intestines of H. glaberrima. Clones were randomly sequenced and queried against the nonredundant protein database at the National Center for Biotechnology Information. RT-PCR analyses were made of several genes to determine their expression profile during intestinal regeneration. A total of 5,173 sequences from three cDNA libraries were obtained. About 46.2, 35.6, and 26.2% of the sequences for the normal, 3-day, and 7-day cDNA libraries, respectively, shared significant similarity with known sequences in the protein database of GenBank, but the libraries showed only 10% similarity among themselves. Analysis of the libraries in terms of functional processes, protein domains, and most common sequences suggests that a differential expression profile is taking place during the regeneration process. Further examination of the expressed sequence tag dataset revealed that 12 putative genes are differentially expressed at a significant level (R > 6). Experimental validation by RT-PCR analysis reveals that at least three genes (unknown C-4677-1, melanotransferrin, and centaurin) show differential expression during regeneration. These findings strongly suggest that the gene expression profile varies among regeneration stages and provide evidence for the existence of differential gene expression. PMID:17579180
Going single but not solo with podocytes: potentials, limitations, and pitfalls of single-cell analysis.

PubMed

Schiffer, Mario

2017-11-01

Single-cell RNA sequencing (RNA-seq) is a widely used tool to study biological questions in single cells. The discussed study identified 92 genes as being predominantly expressed in podocytes based on a 5-fold higher expression compared with endothelial and mesangial cells. In addition to technical pitfalls, the question that is discussed in this commentary is whether results of a single-cell RNA-seq study are able to deliver expression data that truly characterize a podocyte. Copyright © 2017 International Society of Nephrology. Published by Elsevier Inc. All rights reserved.
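The selection criterion mentioned in this commentary (at least 5-fold higher expression in podocytes than in endothelial and mesangial cells) can be illustrated with a minimal sketch. The gene names, expression values, and function name below are hypothetical, and requiring the threshold against both comparison cell types at once is an assumption, not the pipeline of the discussed study.

```python
FOLD_THRESHOLD = 5.0  # fold-change cutoff quoted in the abstract


def podocyte_enriched(expression, threshold=FOLD_THRESHOLD):
    """Return sorted gene names whose podocyte expression is at least
    `threshold` times the higher of the endothelial and mesangial values.

    `expression` maps gene name -> dict with the keys
    'podocyte', 'endothelial' and 'mesangial' (hypothetical layout).
    """
    enriched = []
    for gene, levels in expression.items():
        reference = max(levels["endothelial"], levels["mesangial"])
        if reference == 0:
            # No signal in either comparison cell type: count the gene as
            # enriched only if it is detected in podocytes at all.
            if levels["podocyte"] > 0:
                enriched.append(gene)
            continue
        if levels["podocyte"] / reference >= threshold:
            enriched.append(gene)
    return sorted(enriched)


# Made-up values purely for illustration.
example = {
    "geneA": {"podocyte": 120.0, "endothelial": 10.0, "mesangial": 20.0},
    "geneB": {"podocyte": 50.0, "endothelial": 30.0, "mesangial": 5.0},
    "geneC": {"podocyte": 8.0, "endothelial": 0.0, "mesangial": 0.0},
}
selected = podocyte_enriched(example)
print(selected)
```

A fixed fold-change cutoff like this says nothing about measurement noise or dropout, which is one reason the commentary questions whether such a list truly characterizes a podocyte.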
Identification of genes differentially expressed during ripening of banana.

PubMed

Manrique-Trujillo, Sandra Mabel; Ramírez-López, Ana Cecilia; Ibarra-Laclette, Enrique; Gómez-Lim, Miguel Angel

2007-08-01

The banana (Musa acuminata, subgroup Cavendish 'Grand Nain') is a climacteric fruit of economic importance. A better understanding of the banana ripening process is needed to improve fruit quality and to extend shelf life. Eighty-four up-regulated unigenes were identified by differential screening of a banana fruit cDNA subtraction library at a late ripening stage. The ripening stages in this study were defined according to the peel color index (PCI). Unigene sequences were analyzed with different databases to assign a putative identification. The expression patterns of 36 transcripts confirmed as positive by differential screening were analyzed comparing the PCI 1, PCI 5 and PCI 7 ripening stages. Expression profiles were obtained for unigenes annotated as orcinol O-methyltransferase, putative alcohol dehydrogenase, ubiquitin-protein ligase, chorismate mutase and two unigenes with non-significant matches with any reported sequence. Similar expression profiles were observed in banana pulp and peel. Our results show differential expression of a group of genes involved in processes associated with fruit ripening, such as stress, detoxification, cytoskeleton and biosynthesis of volatile compounds. Some of the identified genes had not been characterized in banana fruit. Besides providing an overview of gene expression programs and metabolic pathways at late stages of banana fruit ripening, this study contributes to increasing the information available on banana fruit ESTs.
Oncogene GAEC1 regulates CAPN10 expression which predicts survival in esophageal squamous cell carcinoma

PubMed Central

Chan, Dessy; Tsoi, Miriam Yuen-Tung; Liu, Christina Di; Chan, Sau-Hing; Law, Simon Ying-Kit; Chan, Kwok-Wah; Chan, Yuen-Piu; Gopalan, Vinod; Lam, Alfred King-Yin; Tang, Johnny Cheuk-On

2013-01-01

AIM: To identify the downstream regulated genes of GAEC1 oncogene in esophageal squamous cell carcinoma and their clinicopathological significance. METHODS: The anti-proliferative effect of knocking down the expression of GAEC1 oncogene was studied by using the RNA interference (RNAi) approach through transfecting the GAEC1-overexpressed esophageal carcinoma cell line KYSE150 with the pSilencer vector cloned with a GAEC1-targeted sequence, followed by MTS cell proliferation assay and cell cycle analysis using flow cytometry. RNA was then extracted from the parental, pSilencer-GAEC1-targeted sequence transfected and pSilencer negative control vector transfected KYSE150 cells for further analysis of different patterns in gene expression. Genes differentially expressed with suppressed GAEC1 expression were then determined using Human Genome U133 Plus 2.0 cDNA microarray analysis by comparing with the parental cells and normalized with the pSilencer negative control vector transfected cells. The most prominently regulated genes were then studied by immunohistochemical staining using tissue microarrays to determine their clinicopathological correlations in esophageal squamous cell carcinoma by statistical analyses. RESULTS: The RNAi approach of knocking down gene expression showed the effective suppression of GAEC1 expression in esophageal squamous cell carcinoma cell line KYSE150 that resulted in the inhibition of cell proliferation and increase of apoptotic population. cDNA microarray analysis for identifying differentially expressed genes detected the greatest levels of downregulation of calpain 10 (CAPN10) and upregulation of trinucleotide repeat containing 6C (TNRC6C) transcripts when GAEC1 expression was suppressed. 
At the tissue level, high-level expression of calpain 10 protein was significantly associated with longer survival (in months) of patients with esophageal squamous cell carcinoma compared with patients with low-level calpain 10 expression (37.73 ± 16.33 vs 12.62 ± 12.44, P = 0.032). No significant correlation was observed between the TNRC6C protein expression level and the clinicopathological features of esophageal squamous cell carcinoma. CONCLUSION: GAEC1 regulates the expression of CAPN10 and TNRC6C downstream. Calpain 10 expression is a potential prognostic marker in patients with esophageal squamous cell carcinoma. PMID:23687414
Cloning and expression analysis of carboxyltransferase of acetyl-coA carboxylase from Jatropha curcas.

PubMed

Xie, Wu-Wei; Gao, Shun; Wang, Sheng-Hua; Zhu, Jin-Qiu; Xu, Ying; Tang, Lin; Chen, Fang

2010-01-01

A full-length cDNA of the carboxyltransferase (accA) gene of acetyl-coenzyme A (acetyl-CoA) carboxylase from Jatropha curcas was cloned and sequenced. The gene with an open reading frame (ORF) of 1149 bp encodes a polypeptide of 383 amino acids, with a molecular mass of 41.9 kDa. Utilizing fluorogenic real-time polymerase chain reaction (RT-PCR), the expression levels of the accA gene in leaves and fruits at early, middle and late stages under pH 7.0/8.0 and light/darkness stress were investigated. The expression levels of the accA gene in leaves at early, middle and late stages increased significantly under pH 8.0 stress compared to pH 7.0. Similarly, the expression levels in fruits showed a significant increase under dark conditions compared to the control. Under light stress, the expression levels in the fruits at early, middle and late stages showed the largest fluctuations compared to those of the control. These findings suggested that the expression levels of the accA gene are closely related to the growth conditions and developmental stages in the leaves and fruits of Jatropha curcas.
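The figures quoted in this abstract (a 1149 bp ORF, 383 amino acids, 41.9 kDa) can be cross-checked with simple arithmetic. The sketch below is only a consistency check on the reported numbers; the function name is hypothetical. That 1149 / 3 equals 383 would be consistent with the ORF length being reported without a stop codon, but that reading is an inference rather than something the abstract states.

```python
# Values quoted in the abstract for the J. curcas accA gene.
ORF_LENGTH_BP = 1149
PROTEIN_LENGTH_AA = 383
MOLECULAR_MASS_KDA = 41.9


def codon_count(orf_length_bp):
    """Number of complete codons in an ORF; the length must be a multiple of 3."""
    if orf_length_bp % 3 != 0:
        raise ValueError("ORF length is not a multiple of three")
    return orf_length_bp // 3


codons = codon_count(ORF_LENGTH_BP)

# Mean mass per residue implied by the reported mass and length (in daltons).
mean_residue_mass_da = MOLECULAR_MASS_KDA * 1000 / PROTEIN_LENGTH_AA

print(codons)
print(round(mean_residue_mass_da, 1))
```

The implied mean residue mass comes out near the roughly 110 Da per residue often used as a rule of thumb for proteins, so the three reported values are mutually plausible.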
Generation of 2A-linked multicistronic cassettes by recombinant PCR.

PubMed

Szymczak-Workman, Andrea L; Vignali, Kate M; Vignali, Dario A A

2012-02-01

The need for reliable, multicistronic vectors for multigene delivery is at the forefront of biomedical technology. It is now possible to express multiple proteins from a single open reading frame (ORF) using 2A peptide-linked multicistronic vectors. These small sequences, when cloned between genes, allow for efficient, stoichiometric production of discrete protein products within a single vector through a novel "cleavage" event within the 2A peptide sequence. Expression of more than two genes using conventional approaches has several limitations, most notably imbalanced protein expression and large size. The use of 2A peptide sequences alleviates these concerns. They are small (18-22 amino acids) and have divergent amino-terminal sequences, which minimizes the chance for homologous recombination and allows for multiple, different 2A peptide sequences to be used within a single vector. Importantly, separation of genes placed between 2A peptide sequences is nearly 100%, which allows for stoichiometric and concordant expression of the genes, regardless of the order of placement within the vector. This protocol describes the use of recombinant polymerase chain reaction (PCR) to connect multiple 2A-linked protein sequences. The final construct is subcloned into an expression vector.
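The idea of linking several genes through 2A peptides within one ORF can be sketched at the protein level. The code below is a minimal illustration, not the protocol itself: the P2A and T2A sequences are the ones commonly reported in the general literature (an assumption, since the abstract lists none), the placement of the "cleavage" between the final glycine and proline of the 2A peptide is likewise taken from the general literature, and the short protein sequences and function names are hypothetical.

```python
# Commonly reported 2A peptide sequences (assumption: not from this protocol).
TWO_A_PEPTIDES = {
    "P2A": "ATNFSLLKQAGDVEENPGP",
    "T2A": "EGRGSLLTCGDVEENPGP",
}


def build_polyprotein(proteins, linkers):
    """Join proteins into one ORF product with a 2A peptide between neighbours."""
    if len(linkers) != len(proteins) - 1:
        raise ValueError("need exactly one 2A linker between each pair of proteins")
    parts = []
    for index, protein in enumerate(proteins):
        parts.append(protein)
        if index < len(linkers):
            parts.append(TWO_A_PEPTIDES[linkers[index]])
    return "".join(parts)


def expected_products(proteins, linkers):
    """Predict the discrete products, assuming separation between the last
    glycine and proline of each 2A peptide: the upstream protein keeps the
    2A residues up to the glycine, the downstream one gains a leading proline.
    """
    if len(linkers) != len(proteins) - 1:
        raise ValueError("need exactly one 2A linker between each pair of proteins")
    products = []
    carried_over = ""  # residue passed on from the preceding 2A peptide
    for index, protein in enumerate(proteins):
        product = carried_over + protein
        carried_over = ""
        if index < len(linkers):
            peptide = TWO_A_PEPTIDES[linkers[index]]
            product += peptide[:-1]    # 2A tail stays on the upstream protein
            carried_over = peptide[-1]  # proline starts the next product
        products.append(product)
    return products


# Hypothetical toy proteins, for illustration only.
toy_proteins = ["MAAA", "MCCC", "MDDD"]
toy_linkers = ["P2A", "T2A"]
polyprotein = build_polyprotein(toy_proteins, toy_linkers)
products = expected_products(toy_proteins, toy_linkers)
print(polyprotein)
print(products)
```

Using a different 2A peptide at each junction, as in this toy example, mirrors the point made above that divergent 2A sequences reduce the chance of homologous recombination within one vector.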
Global versus Local Regulatory Roles for Lrp-Related Proteins: Haemophilus influenzae as a Case Study

PubMed Central

Friedberg, Devorah; Midkiff, Michael; Calvo, Joseph M.

2001-01-01

Lrp (leucine-responsive regulatory protein) plays a global regulatory role in Escherichia coli, affecting expression of dozens of operons. Numerous lrp-related genes have been identified in different bacteria and archaea, including asnC, an E. coli gene that was the first reported member of this family. Pairwise comparisons of amino acid sequences of the corresponding proteins show an average sequence identity of only 29% for the vast majority of comparisons. By contrast, Lrp-related proteins from enteric bacteria show more than 97% amino acid identity. Is the global regulatory role associated with E. coli Lrp limited to enteric bacteria? To probe this question we investigated LrfB, an Lrp-related protein from Haemophilus influenzae that shares 75% sequence identity with E. coli Lrp (highest sequence identity among 42 sequences compared). A strain of H. influenzae having an lrfB null allele grew at the wild-type growth rate but with a filamentous morphology. A comparison of two-dimensional (2D) electrophoretic patterns of proteins from parent and mutant strains showed only two differences (comparable studies with lrp+ and lrp E. coli strains by others showed 20 differences). The abundance of LrfB in H. influenzae, estimated by Western blotting experiments, was about 130 dimers per cell (compared to 3,000 dimers per E. coli cell). LrfB expressed in E. coli replaced Lrp as a repressor of the lrp gene but acted only to a limited extent as an activator of the ilvIH operon. Thus, although LrfB resembles Lrp sufficiently to perform some of its functions, its low abundance is consonant with a more local role in regulating but a few genes, a view consistent with the results of the 2D electrophoretic analysis. We speculate that an Lrp having a global regulatory role evolved to help enteric bacteria adapt to their ecological niches and that it is unlikely that Lrp-related proteins in other organisms have a broad regulatory function. PMID:11395465
Composite transcriptome assembly of RNA-seq data in a sheep model for delayed bone healing.

PubMed

Jäger, Marten; Ott, Claus-Eric; Grünhagen, Johannes; Hecht, Jochen; Schell, Hanna; Mundlos, Stefan; Duda, Georg N; Robinson, Peter N; Lienau, Jasmin

2011-03-24

The sheep is an important model organism for many types of medically relevant research, but molecular genetic experiments in the sheep have been limited by the lack of knowledge about ovine gene sequences. Prior to our study, mRNA sequences for only 1,556 partial or complete ovine genes were publicly available. Therefore, we developed a composite de novo transcriptome assembly method for next-generation sequence data to combine known ovine mRNA and EST sequences, mRNA sequences from mouse and cow, and sequences assembled de novo from short read RNA-Seq data into a composite reference transcriptome, and identified transcripts from over 12 thousand previously undescribed ovine genes. Gene expression analysis based on these data revealed substantially different expression profiles in standard versus delayed bone healing in an ovine tibial osteotomy model. Hundreds of transcripts were differentially expressed between standard and delayed healing and between the time points of the standard and delayed healing groups. We used the sheep sequences to design quantitative RT-PCR assays with which we validated the differential expression of 26 genes that had been identified by RNA-seq analysis. A number of clusters of characteristic expression profiles could be identified, some of which showed striking differences between the standard and delayed healing groups. Gene Ontology (GO) analysis showed that the differentially expressed genes were enriched in terms including extracellular matrix, cartilage development, contractile fiber, and chemokine activity. Our results provide a first atlas of gene expression profiles and differentially expressed genes in standard and delayed bone healing in a large-animal model and provide a number of clues as to the shifts in gene expression that underlie delayed bone healing. In the course of our study, we identified transcripts of 13,987 ovine genes, including 12,431 genes for which no sequence information was previously available. 
This information will provide a basis for future molecular research involving the sheep as a model organism.
Composite transcriptome assembly of RNA-seq data in a sheep model for delayed bone healing

PubMed Central

2011-01-01

Background: The sheep is an important model organism for many types of medically relevant research, but molecular genetic experiments in the sheep have been limited by the lack of knowledge about ovine gene sequences. Results: Prior to our study, mRNA sequences for only 1,556 partial or complete ovine genes were publicly available. Therefore, we developed a composite de novo transcriptome assembly method for next-generation sequence data to combine known ovine mRNA and EST sequences, mRNA sequences from mouse and cow, and sequences assembled de novo from short read RNA-Seq data into a composite reference transcriptome, and identified transcripts from over 12 thousand previously undescribed ovine genes. Gene expression analysis based on these data revealed substantially different expression profiles in standard versus delayed bone healing in an ovine tibial osteotomy model. Hundreds of transcripts were differentially expressed between standard and delayed healing and between the time points of the standard and delayed healing groups. We used the sheep sequences to design quantitative RT-PCR assays with which we validated the differential expression of 26 genes that had been identified by RNA-seq analysis. A number of clusters of characteristic expression profiles could be identified, some of which showed striking differences between the standard and delayed healing groups. Gene Ontology (GO) analysis showed that the differentially expressed genes were enriched in terms including extracellular matrix, cartilage development, contractile fiber, and chemokine activity. Conclusions: Our results provide a first atlas of gene expression profiles and differentially expressed genes in standard and delayed bone healing in a large-animal model and provide a number of clues as to the shifts in gene expression that underlie delayed bone healing. 
In the course of our study, we identified transcripts of 13,987 ovine genes, including 12,431 genes for which no sequence information was previously available. This information will provide a basis for future molecular research involving the sheep as a model organism. PMID:21435219

Comparative studies of gene expression and the evolution of gene regulation

PubMed Central

Romero, Irene Gallego; Ruvinsky, Ilya; Gilad, Yoav

2014-01-01

The hypothesis that differences in gene regulation play an important role in speciation and adaptation is more than 40 years old. With the advent of new sequencing technologies, we are able to characterize and study gene expression levels and associated regulatory mechanisms in a large number of individuals and species at unprecedented resolution and scale. We have thus gained new insights into the evolutionary pressures that shape gene expression levels, as well as developed an appreciation for the relative importance of evolutionary changes in different regulatory genetic and epigenetic mechanisms. The current challenge is to link gene regulatory changes to adaptive evolution of complex phenotypes. Here we mainly focus on comparative studies in primates, and how they are complemented by studies in model organisms. PMID:22705669
Structure, organization and expression of common carp (Cyprinus carpio L.) SLP-76 gene.

PubMed

Huang, Rong; Sun, Xiao-Feng; Hu, Wei; Wang, Ya-Ping; Guo, Qiong-Lin

2008-05-01

SLP-76 is an important member of the SLP-76 family of adapters, and it plays a key role in TCR signaling and T cell function. Partial cDNA sequence of SLP-76 of common carp (Cyprinus carpio L.) was isolated from thymus cDNA library by the method of suppression subtractive hybridization (SSH). Subsequently, the full length cDNA of carp SLP-76 was obtained by means of 3' RACE and 5' RACE, respectively. The full length cDNA of carp SLP-76 was 2007 bp, consisting of a 5'-terminal untranslated region (UTR) of 285 bp, a 3'-terminal UTR of 240 bp, and an open reading frame of 1482 bp. Sequence comparison showed that the deduced amino acid sequence of carp SLP-76 had an overall similarity of 34-73% to that of other species homologues, and it was composed of an NH2-terminal domain, a central proline-rich domain, and a C-terminal SH2 domain. Amino acid sequence analysis indicated the existence of a Gads binding site R-X-X-K, a 10-aa-long sequence which binds to the SH3 domain of LCK in vitro, and three conserved tyrosine-containing sequences in the NH2-terminal domain. Then we used PCR to obtain a genomic DNA which covers the entire coding region of carp SLP-76. In the 9.2-kb-long genomic sequence, twenty-one exons and twenty introns were identified. RT-PCR results showed that carp SLP-76 was expressed predominantly in hematopoietic tissues, and was upregulated in thymus tissue of four-month-old carp compared to one-year-old carp. RT-PCR and virtual northern hybridization results showed that carp SLP-76 was also upregulated in thymus tissue of GH transgenic carp at the age of four months. These results suggest that the expression level of SLP-76 gene may be related to thymocyte development in teleosts.
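The transcript dimensions reported above lend themselves to a quick consistency check: the two UTRs and the ORF should add up to the full-length cDNA, the ORF should divide into whole codons, and a gene with n exons is expected to have n - 1 introns. The sketch below uses only the numbers quoted in the abstract; the function names are hypothetical.

```python
# Values quoted in the abstract for carp SLP-76.
FULL_LENGTH_CDNA_BP = 2007
FIVE_PRIME_UTR_BP = 285
THREE_PRIME_UTR_BP = 240
ORF_BP = 1482
EXON_COUNT = 21
INTRON_COUNT = 20


def segments_match_total(total_bp, utr5_bp, orf_bp, utr3_bp):
    """True if 5' UTR + ORF + 3' UTR equals the full-length cDNA."""
    return utr5_bp + orf_bp + utr3_bp == total_bp


def codons_in_orf(orf_bp):
    """Number of complete codons; raises if the ORF is not a multiple of 3."""
    if orf_bp % 3 != 0:
        raise ValueError("ORF length is not a multiple of three")
    return orf_bp // 3


layout_ok = segments_match_total(
    FULL_LENGTH_CDNA_BP, FIVE_PRIME_UTR_BP, ORF_BP, THREE_PRIME_UTR_BP
)
codons = codons_in_orf(ORF_BP)
introns_ok = INTRON_COUNT == EXON_COUNT - 1

print(layout_ok, codons, introns_ok)
```

Whether the 1482 bp figure includes the stop codon is not stated in the abstract, so the codon count here should not be read as the length of the deduced protein.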
Bioinformatics and reanalysis of subtracted expressed sequence tags from the human ciliary body: Identification of novel biological functions.

PubMed

Escribano, Julio; Coca-Prados, Miguel

2002-08-28

The ciliary body is largely known for its major roles in the regulation of aqueous humor secretion, intraocular pressure, and accommodation of the lens. In this review article we applied bioinformatics to re-examine hundreds of expressed sequence tags (ESTs) previously isolated by subtractive hybridization from a human ciliary body library [1]. The DNA sequences of these clones have recently been added to the web site of NEIBank. DNA sequence comparisons of subtracted ESTs were performed against all entries in the latest available release of the non-redundant database containing GenBank, EMBL, DDBJ and PDB sequences using the BlastN program accessed through NCBI's BLAST services on the internet (NCBI). Sequences were also compared and mapped using the Blast search program provided through the Internet by the Human Genome Project (UCSC). A total of 284 independent ESTs were classified into 17 functional groups. Analysis of their relationships allowed us to define the expression of five major groups of known genes: (i) protein synthesis, folding, secretion and degradation (20%); (ii) energy supply and biosynthesis (12%); (iii) contractility and cytoskeleton structure (6%); (iv) cellular signaling and cell cycle regulation (7%); and (v) nerve cell related tasks (2%), including neuropeptide processing and putative non-visual phototransduction and circadian rhythm control. The largest group contains unidentified sequences, a total of 105 sequences, accounting for 37% of ESTs. The unidentified sequences show similarity to genomic non-coding regions, or genes of unknown function. The most highly represented EST corresponds to myocilin, a gene involved in glaucoma. The data also confirm the secretory functions of the ciliary epithelium and its high metabolism, as well as the presence of a neuroendocrine peptidergic system presumably involved in the regulation of the intraocular pressure and/or aqueous humor secretion. 
Additional genes may be related to a non-visual phototransduction cascade and/or to circadian rhythms. Overall, this initial group of subtracted ESTs can help uncover novel physiological functions of the ciliary body in health and disease, as well as novel candidate genes for ocular diseases.
Anomalous Diffusion Measured by a Twice-Refocused Spin Echo Pulse Sequence: Analysis Using Fractional Order Calculus

PubMed Central

2011-01-01

Purpose To theoretically develop and experimentally validate a formulism based on a fractional order calculus (FC) diffusion model to characterize anomalous diffusion in brain tissues measured with a twice-refocused spin-echo (TRSE) pulse sequence. Materials and Methods The FC diffusion model is the fractional order generalization of the Bloch-Torrey equation. Using this model, an analytical expression was derived to describe the diffusion-induced signal attenuation in a TRSE pulse sequence. To experimentally validate this expression, a set of diffusion-weighted (DW) images was acquired at 3 Tesla from healthy human brains using a TRSE sequence with twelve b-values ranging from 0 to 2,600 s/mm². For comparison, DW images were also acquired using a Stejskal-Tanner diffusion gradient in a single-shot spin-echo echo planar sequence. For both datasets, a Levenberg-Marquardt fitting algorithm was used to extract three parameters: diffusion coefficient D, fractional order derivative in space β, and a spatial parameter μ (in units of μm). Using adjusted R-squared values and standard deviations, D, β and μ values and the goodness-of-fit in three specific regions of interest (ROI) in white matter, gray matter, and cerebrospinal fluid were evaluated for each of the two datasets. In addition, spatially resolved parametric maps were assessed qualitatively. Results The analytical expression for the TRSE sequence, derived from the FC diffusion model, accurately characterized the diffusion-induced signal loss in brain tissues at high b-values. In the selected ROIs, the goodness-of-fit and standard deviations for the TRSE dataset were comparable with the results obtained from the Stejskal-Tanner dataset, demonstrating the robustness of the FC model across multiple data acquisition strategies. Qualitatively, the D, β, and μ maps from the TRSE dataset exhibited fewer artifacts, reflecting the improved immunity to eddy currents. 
Conclusion The diffusion-induced signal attenuation in a TRSE pulse sequence can be described by an FC diffusion model at high b-values. This model performs equally well for data acquired from the human brain tissues with a TRSE pulse sequence or a conventional Stejskal-Tanner sequence. PMID:21509877
Anomalous diffusion measured by a twice-refocused spin echo pulse sequence: analysis using fractional order calculus.

PubMed

Gao, Qing; Srinivasan, Girish; Magin, Richard L; Zhou, Xiaohong Joe

2011-05-01

To theoretically develop and experimentally validate a formulism based on a fractional order calculus (FC) diffusion model to characterize anomalous diffusion in brain tissues measured with a twice-refocused spin-echo (TRSE) pulse sequence. The FC diffusion model is the fractional order generalization of the Bloch-Torrey equation. Using this model, an analytical expression was derived to describe the diffusion-induced signal attenuation in a TRSE pulse sequence. To experimentally validate this expression, a set of diffusion-weighted (DW) images was acquired at 3 Tesla from healthy human brains using a TRSE sequence with twelve b-values ranging from 0 to 2600 s/mm². For comparison, DW images were also acquired using a Stejskal-Tanner diffusion gradient in a single-shot spin-echo echo planar sequence. For both datasets, a Levenberg-Marquardt fitting algorithm was used to extract three parameters: diffusion coefficient D, fractional order derivative in space β, and a spatial parameter μ (in units of μm). Using adjusted R-squared values and standard deviations, D, β, and μ values and the goodness-of-fit in three specific regions of interest (ROIs) in white matter, gray matter, and cerebrospinal fluid, respectively, were evaluated for each of the two datasets. In addition, spatially resolved parametric maps were assessed qualitatively. The analytical expression for the TRSE sequence, derived from the FC diffusion model, accurately characterized the diffusion-induced signal loss in brain tissues at high b-values. In the selected ROIs, the goodness-of-fit and standard deviations for the TRSE dataset were comparable with the results obtained from the Stejskal-Tanner dataset, demonstrating the robustness of the FC model across multiple data acquisition strategies. Qualitatively, the D, β, and μ maps from the TRSE dataset exhibited fewer artifacts, reflecting the improved immunity to eddy currents. 
The diffusion-induced signal attenuation in a TRSE pulse sequence can be described by an FC diffusion model at high b-values. This model performs equally well for data acquired from the human brain tissues with a TRSE pulse sequence or a conventional Stejskal-Tanner sequence. Copyright © 2011 Wiley-Liss, Inc.
Principles of gene microarray data analysis.

PubMed

Mocellin, Simone; Rossi, Carlo Riccardo

2007-01-01

The development of several gene expression profiling methods, such as comparative genomic hybridization (CGH), differential display, serial analysis of gene expression (SAGE), and gene microarray, together with the sequencing of the human genome, has provided an opportunity to monitor and investigate the complex cascade of molecular events leading to tumor development and progression. The availability of such large amounts of information has shifted the attention of scientists towards a nonreductionist approach to biological phenomena. High-throughput technologies can be used to follow changing patterns of gene expression over time. Among them, gene microarray has become prominent because it is easier to use, does not require large-scale DNA sequencing, and allows for the parallel quantification of thousands of genes from multiple samples. Gene microarray technology is rapidly spreading worldwide and has the potential to drastically change the therapeutic approach to patients with tumors. Therefore, it is of paramount importance for both researchers and clinicians to know the principles underlying the analysis of the huge amount of data generated with microarray technology.
RISC RNA sequencing for context-specific identification of in vivo miR targets

PubMed Central

Matkovich, Scot J; Van Booven, Derek J; Eschenbacher, William H; Dorn, Gerald W

2010-01-01

Rationale MicroRNAs (miRs) are expanding our understanding of cardiac disease and have the potential to transform cardiovascular therapeutics. One miR can target hundreds of individual mRNAs, but existing methodologies are not sufficient to accurately and comprehensively identify these mRNA targets in vivo. Objective To develop methods permitting identification of in vivo miR targets in an unbiased manner, using massively parallel sequencing of mouse cardiac transcriptomes in combination with sequencing of mRNA associated with mouse cardiac RNA-induced silencing complexes (RISCs). Methods and Results We optimized techniques for expression profiling small amounts of RNA without introducing amplification bias, and applied this to anti-Argonaute 2 immunoprecipitated RISCs (RISC-Seq) from mouse hearts. By comparing RNA-sequencing results of cardiac RISC and transcriptome from the same individual hearts, we defined 1,645 mRNAs consistently targeted to mouse cardiac RISCs. We employed this approach in hearts overexpressing miRs from Myh6 promoter-driven precursors (programmed RISC-Seq) to identify 209 in vivo targets of miR-133a and 81 in vivo targets of miR-499. Consistent with the fact that miR-133a and miR-499 have widely differing ‘seed’ sequences and belong to different miR families, only 6 targets were common to miR-133a- and miR-499-programmed hearts. Conclusions RISC-sequencing is a highly sensitive method for general RISC profiling and individual miR target identification in biological context, and is applicable to any tissue and any disease state. Summary MicroRNAs (miRs) are key regulators of mRNA translation in health and disease. While bioinformatic predictions suggest that a single miR may target hundreds of mRNAs, the number of experimentally verified targets of miRs is low. 
To enable comprehensive, unbiased examination of miR targets, we have performed deep RNA sequencing of cardiac transcriptomes in parallel with cardiac RNA-induced silencing complex (RISC)-associated RNAs (the RISCome), called RISC sequencing. We developed methods that did not require cross-linking of RNAs to RISCs or amplification of mRNA prior to sequencing, making it possible to rapidly perform RISC sequencing from intact tissue while avoiding amplification bias. Comparison of RISCome with transcriptome expression defined the degree of RISC enrichment for each mRNA. The majority of the mRNAs enriched in wild-type cardiac RISComes compared to transcriptomes were bioinformatically predicted to be targets of at least 1 of 139 cardiac-expressed miRs. Programming cardiomyocyte RISCs via transgenic overexpression in adult hearts of miR-133a or miR-499, two miRs that contain entirely different ‘seed’ sequences, elicited differing profiles of RISC-targeted mRNAs. Thus, RISC sequencing represents a highly sensitive method for general RISC profiling and individual miR target identification in biological context. PMID:21030712
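The abstract states that comparing RISCome with transcriptome expression defined a degree of RISC enrichment for each mRNA, without giving the formula. One common way to express such an enrichment is a log2 ratio; the sketch below assumes that form. The function names, the pseudocount, the threshold and the toy expression values are all hypothetical and are not taken from the paper.

```python
import math


def risc_enrichment(risc_expr, transcriptome_expr, pseudocount=1.0):
    """Log2 ratio of RISC-associated to total expression for one mRNA.

    ASSUMPTION: a pseudocount is added to both values so that mRNAs
    with zero reads do not cause division by zero.
    """
    return math.log2((risc_expr + pseudocount) / (transcriptome_expr + pseudocount))


def enriched_mrnas(risc, transcriptome, min_log2=1.0):
    """Return {gene: score} for genes whose enrichment meets the threshold."""
    scores = {
        gene: risc_enrichment(value, transcriptome.get(gene, 0.0))
        for gene, value in risc.items()
    }
    return {gene: score for gene, score in scores.items() if score >= min_log2}


# Toy, made-up expression values for two placeholder genes.
risc_counts = {"geneA": 31.0, "geneB": 7.0}
transcriptome_counts = {"geneA": 7.0, "geneB": 7.0}
hits = enriched_mrnas(risc_counts, transcriptome_counts)
```

Here geneA is four-fold more abundant in the RISC fraction than in the transcriptome (log2 ratio of 2), while geneB shows no enrichment and is filtered out.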
Comparative analysis of homoeoallele expression in the tocol biosynthetic pathway during oat seed development

USDA-ARS's Scientific Manuscript database

Oats are a rich source of compounds that collectively constitute vitamin E, the tocols. Significant attention has been given to the health benefits of tocols in oats, but little is known about the molecular control of their accumulation during grain development. Next generation sequencing provides an...
Thermodynamics-based models of transcriptional regulation with gene sequence.

PubMed

Wang, Shuqiang; Shen, Yanyan; Hu, Jinxing

2015-12-01

Quantitative models of gene regulatory activity have the potential to improve our mechanistic understanding of transcriptional regulation. However, the few models available today have been based on simplistic assumptions about the sequences being modeled or heuristic approximations of the underlying regulatory mechanisms. In this work, we have developed a thermodynamics-based model to predict gene expression driven by any DNA sequence. The proposed model relies on a continuous-time, differential equation description of transcriptional dynamics. The sequence features of the promoter are used to derive the binding affinity on the basis of statistical molecular thermodynamics. Experimental results show that the proposed model can effectively identify the activity levels of transcription factors and the regulatory parameters. Compared with previous models, the proposed model yields more biologically meaningful results.
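The abstract does not give the model equations. As a rough illustration of the general idea (a Boltzmann-weighted binding affinity feeding a differential equation for transcript level), the sketch below uses a single transcription factor binding site and simple forward Euler integration. This is a generic textbook construction and not the authors' model; every name, parameter and value is an assumption chosen for illustration.

```python
import math

# ASSUMPTION: thermal energy in kcal/mol at about 298 K.
K_B_T = 0.593


def binding_constant(delta_g, kT=K_B_T):
    """Boltzmann weight for a binding free energy delta_g (kcal/mol)."""
    return math.exp(-delta_g / kT)


def occupancy(tf_conc, k_assoc):
    """Equilibrium probability that a single site is bound."""
    x = k_assoc * tf_conc
    return x / (1.0 + x)


def simulate_mrna(tf_conc, k_assoc, alpha, delta, m0=0.0, dt=0.01, steps=5000):
    """Integrate dm/dt = alpha * occupancy - delta * m with forward Euler."""
    occ = occupancy(tf_conc, k_assoc)
    m = m0
    for _ in range(steps):
        m += dt * (alpha * occ - delta * m)
    return m


# Toy parameters: delta_g = 0 gives a binding constant of 1, so a TF
# concentration of 1 (arbitrary units) yields half-maximal occupancy.
k = binding_constant(0.0)
final_mrna = simulate_mrna(tf_conc=1.0, k_assoc=k, alpha=2.0, delta=0.5)
steady_state = 2.0 * occupancy(1.0, k) / 0.5
```

With these toy values the simulated transcript level converges to the analytical steady state alpha * occupancy / delta.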
Characterization of fusion genes and the significantly expressed fusion isoforms in breast cancer by hybrid sequencing.

PubMed

Weirather, Jason L; Afshar, Pegah Tootoonchi; Clark, Tyson A; Tseng, Elizabeth; Powers, Linda S; Underwood, Jason G; Zabner, Joseph; Korlach, Jonas; Wong, Wing Hung; Au, Kin Fai

2015-10-15

We developed an innovative hybrid sequencing approach, IDP-fusion, to detect fusion genes, determine fusion sites and identify and quantify fusion isoforms. IDP-fusion is the first method to study gene fusion events by integrating Third Generation Sequencing long reads and Second Generation Sequencing short reads. We applied IDP-fusion to PacBio data and Illumina data from the MCF-7 breast cancer cells. Compared with the existing tools, IDP-fusion detects fusion genes at higher precision and a very low false positive rate. The results show that IDP-fusion will be useful for unraveling the complexity of multiple fusion splices and fusion isoforms within tumorigenesis-relevant fusion genes. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.
Rhythm sensitivity in macaque monkeys

PubMed Central

Selezneva, Elena; Deike, Susann; Knyazeva, Stanislava; Scheich, Henning; Brechmann, André; Brosch, Michael

2013-01-01

This study provides evidence that monkeys are rhythm sensitive. We composed isochronous tone sequences consisting of repeating triplets of two short tones and one long tone which humans perceive as repeating triplets of two weak and one strong beat. This regular sequence was compared to an irregular sequence with the same number of randomly arranged short and long tones with no such beat structure. To search for indication of rhythm sensitivity we employed an oddball paradigm in which occasional duration deviants were introduced in the sequences. In a pilot study on humans we showed that subjects more easily detected these deviants when they occurred in a regular sequence. In the monkeys we searched for spontaneous behaviors the animals executed concomitant with the deviants. We found that monkeys more frequently exhibited changes of gaze and facial expressions to the deviants when they occurred in the regular sequence compared to the irregular sequence. In addition we recorded neuronal firing and local field potentials from 175 sites of the primary auditory cortex during sequence presentation. We found that both types of neuronal signals differentiated regular from irregular sequences. Both signals were stronger in regular sequences and occurred after the onset of the long tones, i.e., at the position of the strong beat. Local field potential responses were also significantly larger for the durational deviants in regular sequences, yet in a later time window. We speculate that these temporal pattern-selective mechanisms with a focus on strong beats and their deviants underlie the perception of rhythm in the chosen sequences. PMID:24046732
Dynamics of Delayed p53 Mutations in Mice Given Whole-Body Irradiation at 8 Weeks

DOE Office of Scientific and Technical Information (OSTI.GOV)

Okazaki, Ryuji; Ootsuyama, Akira; Kakihara, Hiroyo

2011-01-01

Purpose: Ionizing irradiation might induce delayed genotoxic effects in a p53-dependent manner. However, a few reports have shown a p53 mutation as a delayed effect of radiation. In this study, we investigated the p53 gene mutation by the translocation frequency in chromosome 11, loss of p53 alleles, p53 gene methylation, p53 nucleotide sequence, and p53 protein expression/phosphorylation in p53+/+ and p53+/- mice after irradiation at a young age. Methods and Materials: p53+/+ and p53+/- mice were exposed to 3 Gy of whole-body irradiation at 8 weeks of age. Chromosome instability was evaluated by fluorescence in situ hybridization analysis. p53 allele loss was evaluated by polymerase chain reaction, and p53 methylation was evaluated by methylation-specific polymerase chain reaction. p53 sequence analysis was performed. p53 protein expression was evaluated by Western blotting. Results: The translocation frequency in chromosome 11 showed a delayed increase after irradiation. In old irradiated mice, the number of mice that showed p53 allele loss and p53 methylation increased compared to these numbers in old non-irradiated mice. In two old irradiated p53+/- mice, the p53 sequence showed heteromutation. In old irradiated mice, the p53 and phospho-p53 protein expressions decreased compared to old non-irradiated mice. Conclusion: We concluded that irradiation at a young age induced delayed p53 mutations and p53 protein suppression.
Gene expression profile analysis of Ligon lintless-1 (Li1) mutant reveals important genes and pathways in cotton leaf and fiber development.

PubMed

Ding, Mingquan; Jiang, Yurong; Cao, Yuefen; Lin, Lifeng; He, Shae; Zhou, Wei; Rong, Junkang

2014-02-10

Ligon lintless-1 (Li1) is a monogenic dominant mutant of Gossypium hirsutum (upland cotton) with a phenotype of impaired vegetative growth and short lint fibers. Despite years of research involving genetic mapping and gene expression profile analysis of Li1 mutant ovule tissues, the gene remains uncloned and the underlying pathway of cotton fiber elongation is still unclear. In this study, we report the whole genome-level deep-sequencing analysis of leaf tissues of the Li1 mutant. Differentially expressed genes in leaf tissues of mutant versus wild-type (WT) plants are identified, and the underlying pathways and potential genes that control leaf and fiber development are inferred. The results show that transcription factors AS2, YABBY5, and KANDI-like are significantly differentially expressed in mutant tissues compared with WT ones. Interestingly, several fiber development-related genes are found in the downregulated gene list of the mutant leaf transcriptome. These genes include heat shock protein family, cytoskeleton arrangement, cell wall synthesis, energy, H2O2 metabolism-related genes, and WRKY transcription factors. This finding suggests that the genes are involved in leaf morphology determination and fiber elongation. The expression data are also compared with the previously published microarray data of Li1 ovule tissues. Comparative analysis of the ovule transcriptomes of Li1 and WT reveals that a number of pathways important for fiber elongation are enriched in the downregulated gene list at different fiber development stages (0, 6, 9, 12, 15, 18dpa). Differentially expressed genes identified in both leaf and fiber samples are aligned with cotton whole genome sequences and combined with the genetic fine mapping results to identify a list of candidate genes for Li1. Copyright © 2013 Elsevier B.V. All rights reserved.
Molecular genetic basis for fluoroquinolone-induced retinal degeneration in cats.

PubMed

Ramirez, Christina J; Minch, Jonathan D; Gay, John M; Lahmers, Sunshine M; Guerra, Dan J; Haldorson, Gary J; Schneider, Terri; Mealey, Katrina L

2011-02-01

Distribution of fluoroquinolones to the retina is normally restricted by ABCG2 at the blood-retinal barrier. As the cat develops a species-specific adverse reaction to photoreactive fluoroquinolones, our goal was to investigate ABCG2 as a candidate gene for fluoroquinolone-induced retinal degeneration and blindness in cats. Feline ABCG2 was sequenced and the consensus amino acid sequence was compared with that of 10 other mammalian species. Expression of ABCG2 in feline retina was assessed by immunoblot. cDNA constructs for feline and human ABCG2 were constructed in a pcDNA3 expression vector and expressed in HEK-293 cells, and ABCG2 expression was analyzed by western blot and immunofluorescence. Mitoxantrone and BODIPY-prazosin efflux measured by flow cytometry and a phototoxicity assay were used to assess feline and human ABCG2 function. Four feline-specific (compared with 10 other mammalian species) amino acid changes in conserved regions of ABCG2 were identified. Expression of ABCG2 on plasma membranes was confirmed in feline retina and in cells transfected with human and feline ABCG2, although some intracellular expression of feline ABCG2 was detected by immunofluorescence. Function of feline ABCG2, compared with human ABCG2, was found to be deficient as determined by flow cytometric measurement of mitoxantrone and BODIPY-prazosin efflux and enrofloxacin-induced phototoxicity assays. Feline-specific amino acid changes in ABCG2 cause a functional defect of the transport protein in cats. This functional defect may be owing, in part, to defective cellular localization of feline ABCG2. Regardless, dysfunction of ABCG2 at the blood-retinal barrier likely results in accumulation of photoreactive fluoroquinolones in feline retina. Exposure of the retina to light would then generate reactive oxygen species that would cause the characteristic retinal degeneration and blindness documented in some cats receiving high doses of some fluoroquinolones. 
Pharmacological inhibition of ABCG2 in other species might result in retinal damage if fluoroquinolones are concurrently administered.
First somatic mutation of E2F1 in a critical DNA binding residue discovered in well-differentiated papillary mesothelioma of the peritoneum

PubMed Central

2011-01-01

Background Well differentiated papillary mesothelioma of the peritoneum (WDPMP) is a rare variant of epithelial mesothelioma of low malignancy potential, usually found in women with no history of asbestos exposure. In this study, we perform the first exome sequencing of WDPMP. Results WDPMP exome sequencing reveals the first somatic mutation of E2F1, R166H, to be identified in human cancer. The location is in the evolutionarily conserved DNA binding domain and computationally predicted to be mutated in the critical contact point between E2F1 and its DNA target. We show that the R166H mutation abrogates E2F1's DNA binding ability and is associated with reduced activation of E2F1 downstream target genes. Mutant E2F1 proteins are also observed in higher quantities when compared with wild-type E2F1 protein levels and the mutant protein's resistance to degradation was found to be the cause of its accumulation within mutant over-expressing cells. Cells over-expressing wild-type E2F1 show decreased proliferation compared to mutant over-expressing cells, but cell proliferation rates of mutant over-expressing cells were comparable to cells over-expressing the empty vector. Conclusions The R166H mutation in E2F1 is shown to have a deleterious effect on its DNA binding ability as well as increasing its stability and subsequent accumulation in R166H mutant cells. Based on the results, two compatible theories can be formed: R166H mutation appears to allow for protein over-expression while minimizing the apoptotic consequence and the R166H mutation may behave similarly to SV40 large T antigen, inhibiting tumor suppressive functions of retinoblastoma protein 1. PMID:21955916
Large-Scale Concatenation cDNA Sequencing

PubMed Central

Yu, Wei; Andersson, Björn; Worley, Kim C.; Muzny, Donna M.; Ding, Yan; Liu, Wen; Ricafrente, Jennifer Y.; Wentland, Meredith A.; Lennon, Greg; Gibbs, Richard A.

1997-01-01

A total of 100 kb of DNA derived from 69 individual human brain cDNA clones of 0.7–2.0 kb were sequenced by concatenated cDNA sequencing (CCS), whereby multiple individual DNA fragments are sequenced simultaneously in a single shotgun library. The method yielded accurate sequences and a similar efficiency compared with other shotgun libraries constructed from single DNA fragments (>20 kb). Computer analyses were carried out on 65 cDNA clone sequences and their corresponding end sequences to examine both nucleic acid and amino acid sequence similarities in the databases. Thirty-seven clones revealed no DNA database matches, 12 clones generated exact matches (≥98% identity), and 16 clones generated nonexact matches (57%–97% identity) to either known human or other species genes. Of those 28 matched clones, 8 had corresponding end sequences that failed to identify similarities. In a protein similarity search, 27 clone sequences displayed significant matches, whereas only 20 of the end sequences had matches to known protein sequences. Our data indicate that full-length cDNA insert sequences provide significantly more nucleic acid and protein sequence similarity matches than expressed sequence tags (ESTs) for database searching. [All 65 cDNA clone sequences described in this paper have been submitted to the GenBank data library under accession nos. U79240–U79304.] PMID:9110174
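The clone classification described above (no match, exact match at ≥98% identity, nonexact match at 57%–97% identity) can be written as a small rule. The sketch below is illustrative: the function name, the use of None for "no database hit", and the treatment of identities between 97% and 98% as nonexact are assumptions, since the abstract does not specify them.

```python
from collections import Counter


def classify_match(identity):
    """Classify a clone by its best database hit (percent identity).

    ASSUMPTIONS: None means no hit was found; values below 57% are
    treated as no match; values from 57% up to (not including) 98%
    are treated as nonexact.
    """
    if identity is None or identity < 57.0:
        return "no_match"
    if identity >= 98.0:
        return "exact"
    return "nonexact"


# Toy, made-up identities for five hypothetical clones.
toy_identities = [99.2, 80.0, None, 98.0, 57.0]
toy_counts = Counter(classify_match(value) for value in toy_identities)

# Consistency check on the counts reported in the abstract.
reported = {"no_match": 37, "exact": 12, "nonexact": 16}
analysed_clones = sum(reported.values())
matched_clones = reported["exact"] + reported["nonexact"]
```

The reported category counts add up to the 65 analysed clones, and the exact plus nonexact categories give the 28 matched clones mentioned in the abstract.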
Identification of an inducible regulator of c-myb expression during T-cell activation.

PubMed Central

Phan, S C; Feeley, B; Withers, D; Boxer, L M

1996-01-01

Resting T cells express very low levels of c-Myb protein. During T-cell activation, c-myb expression is induced and much of the increase in expression occurs at the transcriptional level. We identified a region of the c-myb 5' flanking sequence that increased c-myb expression during T-cell activation. In vivo footprinting by ligation-mediated PCR was performed to correlate in vivo protein binding with functional activity. A protein footprint was visible over this region of the c-myb 5' flanking sequence in activated T cells but not in unactivated T cells. An electrophoretic mobility shift assay (EMSA) with nuclear extract from activated T cells and an oligonucleotide of this binding site demonstrated a new protein-DNA complex, referred to as CMAT for c-myb in activated T cells; this complex was not present in unactivated T cells. Because the binding site showed some sequence similarity with the nuclear factor of activated T cells (NFAT) binding site, we compared the kinetics of induction of the two binding complexes and the molecular masses of the two proteins. Studies of the kinetics of induction showed that the NFAT EMSA binding complex appeared earlier than the CMAT complex. The NFAT protein migrated more slowly in a sodium dodecyl sulfate-polyacrylamide gel than the CMAT protein did. In addition, an antibody against NFAT did not cross-react with the CMAT protein. The appearance of the CMAT binding complex was inhibited by both cyclosporin A and rapamycin. The CMAT protein appears to be a novel inducible protein involved in the regulation of c-myb expression during T-cell activation. PMID:8628306
Characterisation of microRNAs from apple (Malus domestica 'Royal Gala') vascular tissue and phloem sap.

PubMed

Varkonyi-Gasic, Erika; Gould, Nick; Sandanayaka, Manoharie; Sutherland, Paul; MacDiarmid, Robin M

2010-08-04

Plant microRNAs (miRNAs) are a class of small, non-coding RNAs that play an important role in development and environmental responses. Hundreds of plant miRNAs have been identified to date, mainly from the model species for which there are available genome sequences. The current challenge is to characterise miRNAs from plant species with agricultural and horticultural importance, to aid our understanding of important regulatory mechanisms in crop species and enable improvement of crops and rootstocks. Based on the knowledge that many miRNAs occur in large gene families and are highly conserved among distantly related species, we analysed expression of twenty-one miRNA sequences in different tissues of apple (Malus x domestica 'Royal Gala'). We identified eighteen sequences that are expressed in at least one of the tissues tested. Some, but not all, miRNAs expressed in apple tissues including the phloem tissue were also detected in the phloem sap sample derived from the stylets of woolly apple aphids. Most of the miRNAs detected in apple phloem sap were also abundant in the phloem sap of herbaceous species. Potential targets for apple miRNAs were identified that encode putative proteins shown to be targets of corresponding miRNAs in a number of plant species. Expression patterns of potential targets were analysed and correlated with expression of corresponding miRNAs. This study validated tissue-specific expression of apple miRNAs that target genes responsible for plant growth, development, and stress response. A subset of characterised miRNAs was also present in the apple phloem translocation stream. A comparative analysis of phloem miRNAs in herbaceous species and woody perennials will aid our understanding of non-cell autonomous roles of miRNAs in plants.
Epigenomics and bolting tolerance in sugar beet genotypes.

PubMed

Hébrard, Claire; Peterson, Daniel G; Willems, Glenda; Delaunay, Alain; Jesson, Béline; Lefèbvre, Marc; Barnes, Steve; Maury, Stéphane

2016-01-01

In sugar beet (Beta vulgaris altissima), bolting tolerance is an essential agronomic trait reflecting the bolting response of genotypes after vernalization. Genes involved in induction of sugar beet bolting have now been identified, and evidence suggests that epigenetic factors are involved in their control. Indeed, the time course and amplitude of DNA methylation variations in the shoot apical meristem have been shown to be critical in inducing sugar beet bolting, and a few functional targets of DNA methylation during vernalization have been identified. However, molecular mechanisms controlling bolting tolerance levels among genotypes are still poorly understood. Here, gene expression and DNA methylation profiles were compared in shoot apical meristems of three bolting-resistant and three bolting-sensitive genotypes after vernalization. Using Cot fractionation followed by 454 sequencing of the isolated low-copy DNA, 6231 contigs were obtained that were used along with public sugar beet DNA sequences to design custom Agilent microarrays for expression (56k) and methylation (244k) analyses. A total of 169 differentially expressed genes and 111 differentially methylated regions were identified between resistant and sensitive vernalized genotypes. Fourteen sequences were both differentially expressed and differentially methylated, with a negative correlation between their methylation and expression levels. Genes involved in cold perception, phytohormone signalling, and flowering induction were over-represented and collectively represent an integrative gene network from environmental perception to bolting induction. Altogether, the data suggest that the genotype-dependent control of DNA methylation and expression of an integrative gene network participate in bolting tolerance in sugar beet, opening up perspectives for crop improvement. © The Author 2015. Published by Oxford University Press on behalf of the Society for Experimental Biology.
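The abstract reports a negative correlation between methylation and expression levels for the fourteen sequences that were both differentially expressed and differentially methylated, without naming the statistic used. A Pearson correlation coefficient is one plausible choice; the sketch below computes it in plain Python. The function name and the toy values are hypothetical and are not data from the study.

```python
import math


def pearson_r(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for constant input")
    return cov / math.sqrt(var_x * var_y)


# Toy, made-up values: expression falls as methylation rises.
methylation = [0.1, 0.2, 0.3, 0.4]
expression = [4.0, 3.0, 2.0, 1.0]
r = pearson_r(methylation, expression)
```

A coefficient close to -1, as in this toy example, indicates that higher methylation goes with lower expression.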
Characterisation of microRNAs from apple (Malus domestica 'Royal Gala') vascular tissue and phloem sap

PubMed Central

2010-01-01

Background Plant microRNAs (miRNAs) are a class of small, non-coding RNAs that play an important role in development and environmental responses. Hundreds of plant miRNAs have been identified to date, mainly from the model species for which there are available genome sequences. The current challenge is to characterise miRNAs from plant species with agricultural and horticultural importance, to aid our understanding of important regulatory mechanisms in crop species and enable improvement of crops and rootstocks. Results Based on the knowledge that many miRNAs occur in large gene families and are highly conserved among distantly related species, we analysed expression of twenty-one miRNA sequences in different tissues of apple (Malus x domestica 'Royal Gala'). We identified eighteen sequences that are expressed in at least one of the tissues tested. Some, but not all, miRNAs expressed in apple tissues including the phloem tissue were also detected in the phloem sap sample derived from the stylets of woolly apple aphids. Most of the miRNAs detected in apple phloem sap were also abundant in the phloem sap of herbaceous species. Potential targets for apple miRNAs were identified that encode putative proteins shown to be targets of corresponding miRNAs in a number of plant species. Expression patterns of potential targets were analysed and correlated with expression of corresponding miRNAs. Conclusions This study validated tissue-specific expression of apple miRNAs that target genes responsible for plant growth, development, and stress response. A subset of characterised miRNAs was also present in the apple phloem translocation stream. A comparative analysis of phloem miRNAs in herbaceous species and woody perennials will aid our understanding of non-cell autonomous roles of miRNAs in plants. PMID:20682080

Conserved and divergent rhythms of crassulacean acid metabolism-related and core clock gene expression in the cactus Opuntia ficus-indica.

PubMed

Mallona, Izaskun; Egea-Cortines, Marcos; Weiss, Julia

2011-08-01

The cactus Opuntia ficus-indica is a constitutive Crassulacean acid metabolism (CAM) species. Current knowledge of CAM metabolism suggests that the enzyme phosphoenolpyruvate carboxylase kinase (PPCK) is circadian regulated at the transcriptional level, whereas phosphoenolpyruvate carboxylase (PEPC), malate dehydrogenase (MDH), NADP-malic enzyme (NADP-ME), and pyruvate phosphate dikinase (PPDK) are posttranslationally controlled. As little transcriptomic data are available from obligate CAM plants, we created an expressed sequence tag database derived from different organs and developmental stages. Sequences were assembled, compared with sequences in the National Center for Biotechnology Information nonredundant database for identification of putative orthologs, and mapped using Kyoto Encyclopedia of Genes and Genomes Orthology and Gene Ontology. We identified genes involved in circadian regulation and CAM metabolism for transcriptomic analysis in plants grown in long days. We identified stable reference genes for quantitative polymerase chain reaction and found that OfiSAND, like its counterpart in Arabidopsis (Arabidopsis thaliana), and OfiTUB are generally appropriate standards for use in the quantification of gene expression in O. ficus-indica. Three kinds of expression profiles were found: transcripts of OfiPPCK oscillated with a 24-h periodicity; transcripts of the light-active OfiNADP-ME and OfiPPDK genes adapted to 12-h cycles, while transcript accumulation patterns of OfiPEPC and OfiMDH were arrhythmic. Expression of the circadian clock gene OfiTOC1, similar to Arabidopsis, oscillated with a 24-h periodicity, peaking at night. Expression of OfiCCA1 and OfiPRR9, unlike in Arabidopsis, adapted best to a 12-h rhythm, suggesting that circadian clock gene interactions differ from those of Arabidopsis. Our results indicate that the evolution of CAM metabolism could be the result of modified circadian regulation at both the transcriptional and posttranscriptional levels.
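The distinction drawn above between transcripts that follow a 24-h periodicity and those that fit a 12-h cycle can be illustrated with a minimal cosinor-style sketch. The abstract does not say which rhythm-fitting method was used, so the approach, the function names `cosinor_r2` and `best_period`, and the synthetic time series are all assumptions made for illustration.

```python
import numpy as np

def cosinor_r2(times_h, values, period_h):
    """R^2 of a least-squares fit of y = M + A*cos(wt) + B*sin(wt) for one fixed period."""
    t = np.asarray(times_h, dtype=float)
    y = np.asarray(values, dtype=float)
    w = 2.0 * np.pi * t / period_h
    design = np.column_stack([np.ones_like(t), np.cos(w), np.sin(w)])
    coef, *_ = np.linalg.lstsq(design, y, rcond=None)
    resid = y - design @ coef
    total = y - y.mean()
    return 1.0 - (resid @ resid) / (total @ total)

def best_period(times_h, values, periods=(24.0, 12.0)):
    """Return the candidate period (hours) whose cosinor fit explains the most variance."""
    return max(periods, key=lambda p: cosinor_r2(times_h, values, p))

# Hypothetical expression values sampled every 3 h over one day.
sample_times = np.arange(0, 24, 3)
twelve_hour_series = 5.0 + 2.0 * np.cos(2.0 * np.pi * sample_times / 12.0)
daily_series = 5.0 + 2.0 * np.sin(2.0 * np.pi * sample_times / 24.0)

print(best_period(sample_times, twelve_hour_series))
print(best_period(sample_times, daily_series))
```

A transcript with a flat profile would give a low R^2 for both periods, which is one simple way to flag it as arrhythmic; real data would also need replicate handling and a significance test.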
The expression of the clock gene cycle has rhythmic pattern and is affected by photoperiod in the moth Sesamia nonagrioides.

PubMed

Kontogiannatos, Dimitrios; Gkouvitsas, Theodoros; Kourti, Anna

2017-06-01

To obtain clues to the link between the molecular mechanisms of the circadian and photoperiodic clocks, we cloned the circadian clock gene cycle (Sncyc) in the corn stalk borer, Sesamia nonagrioides, which undergoes facultative diapause controlled by photoperiod. Sequence analysis revealed a high degree of conservation among insects for this gene. SnCYC consists of 667 amino acids, and structural analysis showed that it contains a BCTR domain at its C-terminus in addition to the common domains found in Drosophila CYC, i.e. the bHLH, PAS-A and PAS-B domains. The sequence of Sncyc was similar to that of its mammalian orthologue, Bmal1. We also investigated the expression patterns of Sncyc in the brain of larvae growing under long-day 16L: 8D (LD), constant darkness (DD) and short-day 10L: 14D (SD) conditions using qRT-PCR assays. Sncyc mRNA expression was rhythmic under LD, DD and SD conditions. Notably, the photoperiodic conditions affected the expression patterns and/or amplitudes of Sncyc. This gene is associated with diapause in S. nonagrioides, because under SD (diapause conditions) the photoperiodic signal altered mRNA accumulation. Sequence and expression analysis of cyc in S. nonagrioides shows interesting differences compared to Drosophila, where this gene does not oscillate or change in expression pattern in response to photoperiod, suggesting that this species is an interesting new model to study the molecular control of insect circadian and photoperiodic clocks. Copyright © 2017 Elsevier Inc. All rights reserved.
Four out of eight genes in a mouse chromosome 7 congenic donor region are candidate obesity genes.

PubMed

Sarahan, Kari A; Fisler, Janis S; Warden, Craig H

2011-09-22

We previously identified a region of mouse chromosome 7 that influences body fat mass in F2 littermates of congenic × background intercrosses. Current analyses revealed that alleles in the donor region of the subcongenic B6.C-D7Mit318 (318) promoted a twofold increase in adiposity in homozygous lines of 318 compared with background C57BL/6ByJ (B6By) mice. Parent-of-origin effects were discounted through cross-fostering studies and an F1 reciprocal cross. Mapping of the donor region revealed that it has a maximal size of 2.8 Mb (minimum 1.8 Mb) and contains a maximum of eight protein coding genes. Quantitative PCR in whole brain, liver, and gonadal white adipose tissue (GWAT) revealed differential expression between genotypes for three genes in females and two genes in males. Alpha-2,8-sialyltransferase 8B (St8sia2) showed reduced 318 mRNA levels in brain for females and males and in GWAT for females only. Both sexes of 318 mice had reduced Repulsive guidance molecule-a (Rgma) expression in GWAT. In brain, Family with sequence similarity 174 member b (Fam174b) had increased expression in 318 females, whereas Chromodomain helicase DNA binding protein 2 (Chd2-2) had reduced expression in 318 males. No donor region genes were differentially expressed in liver. Sequence analysis of coding exons for all genes in the 318 donor region revealed only one single nucleotide polymorphism that produced a nonsynonymous missense mutation, Gln7Pro, in Fam174b. Our findings highlight the difficulty of using expression and sequence to identify quantitative trait genes underlying obesity even in small genomic regions.
Molecular Characterization and Expression Analysis of Chloroplast Protein Import Components in Tomato (Solanum lycopersicum)

PubMed Central

Yan, Jianmin; Campbell, James H.; Glick, Bernard R.; Smith, Matthew D.; Liang, Yan

2014-01-01

The translocon at the outer envelope membrane of chloroplasts (Toc) mediates the recognition and initial import into the organelle of thousands of nucleus-encoded proteins. These proteins are translated in the cytosol as precursor proteins with cleavable amino-terminal targeting sequences called transit peptides. The majority of the known Toc components that mediate chloroplast protein import were originally identified in pea, and more recently have been studied most extensively in Arabidopsis. With the completion of the tomato genome sequencing project, it is now possible to identify putative homologues of the chloroplast import components in tomato. In the work reported here, the Toc GTPase cDNAs from tomato were identified, cloned and analyzed. The analysis revealed that there are four Toc159 homologues (slToc159-1, -2, -3 and -4) and two Toc34 homologues (slToc34-1 and -2) in tomato, and it was shown that tomato Toc159 and Toc34 homologues share high sequence similarity with the comparable import apparatus components from Arabidopsis and pea. Thus, tomato is a valid model for further study of this system. The expression level of Toc complex components was also investigated in different tissues during tomato development. The two tomato Toc34 homologues were expressed at higher levels in non-photosynthetic tissues, whereas expression of two tomato Toc159 homologues, slToc159-1 and slToc159-4, was higher in photosynthetic tissues. The expression of slToc159-2 did not differ significantly between photosynthetic and non-photosynthetic tissues, and slToc159-3 expression was limited to a few select tissues. PMID:24751891
De novo sequencing and characterization of floral transcriptome in two species of buckwheat (Fagopyrum)

PubMed Central

2011-01-01

Background Transcriptome sequencing data has become an integral component of modern genetics, genomics and evolutionary biology. However, despite advances in the technologies of DNA sequencing, such data are lacking for many groups of living organisms, in particular, many plant taxa. We present here the results of transcriptome sequencing for two closely related plant species. These species, Fagopyrum esculentum and F. tataricum, belong to the order Caryophyllales - a large group of flowering plants with uncertain evolutionary relationships. F. esculentum (common buckwheat) is also an important food crop. Despite these practical and evolutionary considerations, Fagopyrum species have not been the subject of large-scale sequencing projects. Results Normalized cDNA corresponding to genes expressed in flowers and inflorescences of F. esculentum and F. tataricum was sequenced using the 454 pyrosequencing technology. This resulted in 267 thousand (F. esculentum) and 229 thousand (F. tataricum) reads with an average length of 341-349 nucleotides. De novo assembly of the reads produced about 25 thousand contigs for each species, with 7.5-8.2× coverage. Comparative analysis of the two transcriptomes demonstrated their overall similarity but also revealed genes that are presumably differentially expressed. Among them are retrotransposon genes and genes involved in sugar biosynthesis and metabolism. Thirteen single-copy genes were used for phylogenetic analysis; the resulting trees are largely consistent with those inferred from multigenic plastid datasets. The sister relationship of the Caryophyllales and asterids has now gained high support from nuclear gene sequences. Conclusions 454 transcriptome sequencing and de novo assembly were performed for two congeneric flowering plant species, F. esculentum and F. tataricum. As a result, a large set of cDNA sequences that represent orthologs of known plant genes as well as potential new genes was generated. PMID:21232141
Agave: a biofuel feedstock for arid and semi-arid environments

DOE Office of Scientific and Technical Information (OSTI.GOV)

Gross, Stephen; Martin, Jeffrey; Simpson, June

2011-05-31

Efficient production of plant-based, lignocellulosic biofuels relies upon continued improvement of existing biofuel feedstock species, as well as the introduction of new feedstocks capable of growing on marginal lands to avoid conflicts with existing food production and minimize use of water and nitrogen resources. To this end, species within the plant genus Agave have recently been proposed as new biofuel feedstocks. Many Agave species are adapted to hot and arid environments generally unsuitable for food production, yet have biomass productivity rates comparable to other second-generation biofuel feedstocks such as switchgrass and Miscanthus. Agaves achieve remarkable heat tolerance and water use efficiency in part through a Crassulacean Acid Metabolism (CAM) mode of photosynthesis, but the genes and regulatory pathways enabling CAM and thermotolerance in agaves remain poorly understood. We seek to accelerate the development of agave as a new biofuel feedstock through genomic approaches using massively-parallel sequencing technologies. First, we plan to sequence the transcriptome of A. tequilana to provide a database of protein-coding genes to the agave research community. Second, we will compare transcriptome-wide gene expression of agaves under different environmental conditions in order to understand genetic pathways controlling CAM, water use efficiency, and thermotolerance. Finally, we aim to compare the transcriptome of A. tequilana with that of other Agave species to gain further insight into molecular mechanisms underlying traits desirable for biofuel feedstocks. These genomic approaches will provide sequence and gene expression information critical to the breeding and domestication of Agave species suitable for biofuel production.
A normalization strategy for comparing tag count data

PubMed Central

2012-01-01

Background High-throughput sequencing, such as ribonucleic acid sequencing (RNA-seq) and chromatin immunoprecipitation sequencing (ChIP-seq) analyses, enables various features of organisms to be compared through tag counts. Recent studies have demonstrated that the normalization step for RNA-seq data is critical for a more accurate subsequent analysis of differential gene expression. Development of a more robust normalization method is desirable for identifying the true difference in tag count data. Results We describe a strategy for normalizing tag count data, focusing on RNA-seq. The key concept is to remove data assigned as potential differentially expressed genes (DEGs) before calculating the normalization factor. Several R packages for identifying DEGs are currently available, and each package uses its own normalization method and gene ranking algorithm. We compared a total of eight package combinations: four R packages (edgeR, DESeq, baySeq, and NBPSeq) with their default normalization settings and with our normalization strategy. Many synthetic datasets under various scenarios were evaluated on the basis of the area under the curve (AUC) as a measure for both sensitivity and specificity. We found that packages using our strategy in the data normalization step overall performed well. This result was also observed for a real experimental dataset. Conclusion Our results showed that the elimination of potential DEGs is essential for more accurate normalization of RNA-seq data. The concept of this normalization strategy can widely be applied to other types of tag count data and to microarray data. PMID:22475125
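The key concept in this abstract, removing potential DEGs before calculating the normalization factor, can be illustrated with a minimal Python sketch. Everything named here is an assumption made for illustration and is not taken from the paper: the function names, the simple library-size scaling, the crude fold-change ranking that stands in for a DEG-calling package, and the `trim_fraction` parameter.

```python
import numpy as np

def library_size_factors(counts):
    """Scaling factor per sample: column total divided by the mean column total."""
    totals = np.asarray(counts, dtype=float).sum(axis=0)
    return totals / totals.mean()

def deg_elimination_factors(counts, group, trim_fraction=0.25):
    """Recompute scaling factors after dropping the genes that look most differential.

    counts: genes x samples matrix of tag counts.
    group:  0/1 label per sample (two-group comparison only).
    """
    counts = np.asarray(counts, dtype=float)
    group = np.asarray(group)
    # Step 1: provisional normalization using every gene.
    scaled = counts / library_size_factors(counts)
    # Step 2: rank genes by absolute log2 fold change (stand-in for a real DEG test).
    mean_a = scaled[:, group == 0].mean(axis=1)
    mean_b = scaled[:, group == 1].mean(axis=1)
    abs_lfc = np.abs(np.log2((mean_a + 1.0) / (mean_b + 1.0)))
    # Step 3: drop the top-ranked fraction and recompute factors on the rest.
    n_drop = int(trim_fraction * counts.shape[0])
    keep = np.argsort(abs_lfc)[: counts.shape[0] - n_drop]
    return library_size_factors(counts[keep])

# Toy data: group 1 is sequenced twice as deeply, and the first two genes
# are additionally four-fold up-regulated in group 1.
base = np.array([10, 20, 30, 40, 50, 60, 70, 80], dtype=float)
deep = 2.0 * base
deep[:2] *= 4.0
toy_counts = np.column_stack([base, base, deep, deep])
toy_group = np.array([0, 0, 1, 1])

naive = library_size_factors(toy_counts)
trimmed = deg_elimination_factors(toy_counts, toy_group)
print(naive[2] / naive[0], trimmed[2] / trimmed[0])
```

On this toy matrix the all-gene factors imply a 2.5-fold depth difference between groups, whereas the factors recomputed without the two top-ranked genes recover the twofold difference that was built in.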
Diversity analysis in Cannabis sativa based on large-scale development of expressed sequence tag-derived simple sequence repeat markers.

PubMed

Gao, Chunsheng; Xin, Pengfei; Cheng, Chaohua; Tang, Qing; Chen, Ping; Wang, Changbiao; Zang, Gonggu; Zhao, Lining

2014-01-01

Cannabis sativa L. is an important economic plant for the production of food, fiber, oils, and intoxicants. However, lack of sufficient simple sequence repeat (SSR) markers has limited the development of cannabis genetic research. Here, large-scale development of expressed sequence tag simple sequence repeat (EST-SSR) markers was performed to obtain more informative genetic markers, and to assess genetic diversity in cannabis (Cannabis sativa L.). Based on the cannabis transcriptome, 4,577 SSRs were identified from 3,624 ESTs. From there, a total of 3,442 complementary primer pairs were designed as SSR markers. Among these markers, trinucleotide repeat motifs (50.99%) were the most abundant, followed by hexanucleotide (25.13%), dinucleotide (16.34%), tetranucleotide (3.8%), and pentanucleotide (3.74%) repeat motifs, respectively. The AAG/CTT trinucleotide repeat (17.96%) was the most abundant motif detected in the SSRs. One hundred and seventeen EST-SSR markers were randomly selected to evaluate primer quality in 24 cannabis varieties. Among these 117 markers, 108 (92.31%) were successfully amplified and 87 (74.36%) were polymorphic. Forty-five polymorphic primer pairs were selected to evaluate genetic diversity and relatedness among the 115 cannabis genotypes. The results showed that the 115 varieties could be divided into 4 groups primarily based on geography: Northern China, Europe, Central China, and Southern China. Moreover, the coefficient of similarity when comparing cannabis from Northern China with the European group cannabis was higher than that when comparing with cannabis from the other two groups, owing to a similar climate. This study outlines the first large-scale development of SSR markers for cannabis. These data may serve as a foundation for the development of genetic linkage, quantitative trait loci mapping, and marker-assisted breeding of cannabis.
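The SSR identification step described above (scanning EST sequences for di- to hexanucleotide repeats) can be sketched in Python with regular expressions. This is a hedged illustration only: the abstract does not name the search tool or its thresholds, so the `MIN_REPEATS` values and the function name `find_ssrs` are hypothetical, and reverse-complement motifs such as AAG/CTT are not merged here.

```python
import re

# Hypothetical minimum repeat counts per motif length (not taken from the paper).
MIN_REPEATS = {2: 6, 3: 5, 4: 4, 5: 4, 6: 4}

def find_ssrs(seq):
    """Return sorted (start, motif, repeat_count) tuples for perfect SSRs in a DNA string."""
    seq = seq.upper()
    hits = []
    for k, min_rep in MIN_REPEATS.items():
        # A k-mer followed by at least (min_rep - 1) further copies of itself.
        pattern = re.compile(r"(([ACGT]{%d})\2{%d,})" % (k, min_rep - 1))
        for m in pattern.finditer(seq):
            motif = m.group(2)
            # Skip motifs that are repeats of a shorter unit (e.g. AAAA, ATAT),
            # so each repeat is reported once under its shortest motif.
            if any(motif == motif[:d] * (k // d) for d in range(1, k) if k % d == 0):
                continue
            hits.append((m.start(), motif, len(m.group(1)) // k))
    return sorted(hits)

# Made-up EST fragment containing one trinucleotide and one dinucleotide repeat.
example_est = "GGC" + "AAG" * 6 + "TTCA" + "CT" * 7 + "GA"
print(find_ssrs(example_est))
```

Tallying the motif lengths of the returned tuples over a whole EST set would give class frequencies of the kind reported in the abstract.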
Diversity Analysis in Cannabis sativa Based on Large-Scale Development of Expressed Sequence Tag-Derived Simple Sequence Repeat Markers

PubMed Central

Cheng, Chaohua; Tang, Qing; Chen, Ping; Wang, Changbiao; Zang, Gonggu; Zhao, Lining

2014-01-01

Cannabis sativa L. is an important economic plant for the production of food, fiber, oils, and intoxicants. However, lack of sufficient simple sequence repeat (SSR) markers has limited the development of cannabis genetic research. Here, large-scale development of expressed sequence tag simple sequence repeat (EST-SSR) markers was performed to obtain more informative genetic markers, and to assess genetic diversity in cannabis (Cannabis sativa L.). Based on the cannabis transcriptome, 4,577 SSRs were identified from 3,624 ESTs. From there, a total of 3,442 complementary primer pairs were designed as SSR markers. Among these markers, trinucleotide repeat motifs (50.99%) were the most abundant, followed by hexanucleotide (25.13%), dinucleotide (16.34%), tetranucleotide (3.8%), and pentanucleotide (3.74%) repeat motifs, respectively. The AAG/CTT trinucleotide repeat (17.96%) was the most abundant motif detected in the SSRs. One hundred and seventeen EST-SSR markers were randomly selected to evaluate primer quality in 24 cannabis varieties. Among these 117 markers, 108 (92.31%) were successfully amplified and 87 (74.36%) were polymorphic. Forty-five polymorphic primer pairs were selected to evaluate genetic diversity and relatedness among the 115 cannabis genotypes. The results showed that the 115 varieties could be divided into 4 groups primarily based on geography: Northern China, Europe, Central China, and Southern China. Moreover, the coefficient of similarity when comparing cannabis from Northern China with the European group cannabis was higher than that when comparing with cannabis from the other two groups, owing to a similar climate. This study outlines the first large-scale development of SSR markers for cannabis. These data may serve as a foundation for the development of genetic linkage, quantitative trait loci mapping, and marker-assisted breeding of cannabis. PMID:25329551
Whole-transcriptome, high-throughput RNA sequence analysis of the bovine macrophage response to Mycobacterium bovis infection in vitro.

PubMed

Nalpas, Nicolas C; Park, Stephen D E; Magee, David A; Taraktsoglou, Maria; Browne, John A; Conlon, Kevin M; Rue-Albrecht, Kévin; Killick, Kate E; Hokamp, Karsten; Lohan, Amanda J; Loftus, Brendan J; Gormley, Eamonn; Gordon, Stephen V; MacHugh, David E

2013-04-08

Mycobacterium bovis, the causative agent of bovine tuberculosis, is an intracellular pathogen that can persist inside host macrophages during infection via a diverse range of mechanisms that subvert the host immune response. In the current study, we have analysed and compared the transcriptomes of M. bovis-infected monocyte-derived macrophages (MDM) purified from six Holstein-Friesian females with the transcriptomes of non-infected control MDM from the same animals over a 24 h period using strand-specific RNA sequencing (RNA-seq). In addition, we compare gene expression profiles generated using RNA-seq with those previously generated by us using the high-density Affymetrix® GeneChip® Bovine Genome Array platform from the same MDM-extracted RNA. A mean of 7.2 million reads from each MDM sample mapped uniquely and unambiguously to single Bos taurus reference genome locations. Analysis of these mapped reads showed 2,584 genes (1,392 upregulated; 1,192 downregulated) and 757 putative natural antisense transcripts (558 upregulated; 199 downregulated) that were differentially expressed based on sense and antisense strand data, respectively (adjusted P-value ≤ 0.05). Of the differentially expressed genes, 694 were common to both the sense and antisense data sets, with the direction of expression (i.e. up- or downregulation) positively correlated for 693 genes and negatively correlated for the remaining gene. Gene ontology analysis of the differentially expressed genes revealed an enrichment of immune, apoptotic and cell signalling genes. Notably, the number of differentially expressed genes identified from RNA-seq sense strand analysis was greater than the number of differentially expressed genes detected from microarray analysis (2,584 genes versus 2,015 genes). Furthermore, our data reveal a greater dynamic range in the detection and quantification of gene transcripts for RNA-seq compared to microarray technology. This study highlights the value of RNA-seq in identifying novel immunomodulatory mechanisms that underlie host-mycobacterial pathogen interactions during infection, including possible complex post-transcriptional regulation of host gene expression involving antisense RNA.
Diversity, abundance, and consistency of microbial oxygenase expression and biodegradation in a shallow contaminated aquifer

DOE Office of Scientific and Technical Information (OSTI.GOV)

Yagi, J.M.; Madsen, E.L.

The diversity of Rieske dioxygenase genes and short-term temporal variability in the abundance of two selected dioxygenase gene sequences were examined in a naphthalene-rich, coal tar waste-contaminated subsurface study site. Using a previously published PCR-based approach (S. M. Ni Chadhain, R. S. Norman, K. V. Pesce, J. J. Kukor, and G. J. Zylstra, Appl. Environ. Microbiol. 72: 4078-4087, 2006) a broad suite of genes was detected, ranging from dioxygenase sequences associated with Rhodococcus and Sphingomonas to 32 previously uncharacterized Rieske gene sequence clone groups. The nag genes appeared frequently (20% of the total) in two groundwater monitoring wells characterized by low (~10² ppb; ~1 µM) ambient concentrations of naphthalene. A quantitative competitive PCR assay was used to show that abundances of nag genes (and archetypal nah genes) fluctuated substantially over a 9-month period. To contrast short-term variation with long-term community stability, in situ community gene expression (dioxygenase mRNA) and biodegradation potential (community metabolism of naphthalene in microcosms) were compared to measurements from 6 years earlier. cDNA sequences amplified from total RNA extracts revealed that nah- and nag-type genes were expressed in situ, corresponding well with structural gene abundances. Despite evidence for short-term (9-month) shifts in dioxygenase gene copy number, agreement in field gene expression (dioxygenase mRNA) and biodegradation potential was observed in comparisons to equivalent assays performed 6 years earlier. Thus, stability in community biodegradation characteristics at the hemidecadal time frame has been documented for these subsurface microbial communities.
cDNA cloning, functional expression and cellular localization of rat liver mitochondrial electron-transfer flavoprotein-ubiquinone oxidoreductase protein.

PubMed

Huang, Shengbing; Song, Wei; Lin, Qishui

2005-08-01

A membrane-bound protein was purified from rat liver mitochondria. After digestion with V8 protease, two peptides containing identical 14-amino-acid sequences were obtained. Using a DNA sequence derived from the 14-amino-acid peptide as a gene-specific primer, the 5'- and 3'-terminal cDNA of the corresponding gene was obtained by RACE. The full-length cDNA, which encoded a protein of 616 amino acids and included the above-mentioned peptide sequence, was thus cloned. The full-length cDNA was highly homologous to that of human ETF-QO, indicating that it may be the cDNA of rat ETF-QO. ETF-QO is an iron-sulfur protein located in the mitochondrial inner membrane and containing two kinds of redox center: FAD and a [4Fe-4S] center. Comparison of the 616-amino-acid sequence deduced from the cDNA with that of the mature protein from rat liver mitochondria showed that the N-terminal 32 amino acid residues were absent from the mature protein, indicating that the cDNA was that of ETF-QOp. When the cDNA was expressed in Saccharomyces cerevisiae with inducible vectors, the protein product was enriched in the mitochondrial fraction and exhibited the electron transfer activity (NBT reductase activity) of ETF-QO. The results demonstrated that the 32-amino-acid peptide was a mitochondrial targeting peptide, and that both FAD and the iron-sulfur cluster were inserted properly into the expressed ETF-QO. ETF-QO was expressed at high levels in rat heart, liver and kidney. The GFP-ETF-QO fusion protein co-localized with mitochondria in COS-7 cells.
Cross-species inference of long non-coding RNAs greatly expands the ruminant transcriptome.

PubMed

Bush, Stephen J; Muriuki, Charity; McCulloch, Mary E B; Farquhar, Iseabail L; Clark, Emily L; Hume, David A

2018-04-24

mRNA-like long non-coding RNAs (lncRNAs) are a significant component of mammalian transcriptomes, although most are expressed only at low levels, with high tissue-specificity and/or at specific developmental stages. Thus, in many cases lncRNA detection by RNA-sequencing (RNA-seq) is compromised by stochastic sampling. To account for this and create a catalogue of ruminant lncRNAs, we compared de novo assembled lncRNAs derived from large RNA-seq datasets in transcriptional atlas projects for sheep and goats with previous lncRNAs assembled in cattle and human. We then combined the novel lncRNAs with the sheep transcriptional atlas to identify co-regulated sets of protein-coding and non-coding loci. Few lncRNAs could be reproducibly assembled from a single dataset, even with deep sequencing of the same tissues from multiple animals. Furthermore, there was little sequence overlap between lncRNAs that were assembled from pooled RNA-seq data. We combined positional conservation (synteny) with cross-species mapping of candidate lncRNAs to identify a consensus set of ruminant lncRNAs and then used the RNA-seq data to demonstrate detectable and reproducible expression in each species. In sheep, 20 to 30% of lncRNAs were located close to protein-coding genes with which they are strongly co-expressed, which is consistent with the evolutionary origin of some ncRNAs in enhancer sequences. Nevertheless, most of the lncRNAs are not co-expressed with neighbouring protein-coding genes. Alongside substantially expanding the ruminant lncRNA repertoire, the outcomes of our analysis demonstrate that stochastic sampling can be partly overcome by combining RNA-seq datasets from related species. This has practical implications for the future discovery of lncRNAs in other species.
SNP discovery in the bovine milk transcriptome using RNA-Seq technology.

PubMed

Cánovas, Angela; Rincon, Gonzalo; Islas-Trejo, Alma; Wickramasinghe, Saumya; Medrano, Juan F

2010-12-01

High-throughput sequencing of RNA (RNA-Seq) was developed primarily to analyze global gene expression in different tissues. However, it also is an efficient way to discover coding SNPs. The objective of this study was to perform a SNP discovery analysis in the milk transcriptome using RNA-Seq. Seven milk samples from Holstein cows were analyzed by sequencing cDNAs using the Illumina Genome Analyzer system. We detected 19,175 genes expressed in milk samples corresponding to approximately 70% of the total number of genes analyzed. The SNP detection analysis revealed 100,734 SNPs in Holstein samples, and a large number of those corresponded to differences between the Holstein breed and the Hereford bovine genome assembly Btau4.0. The number of polymorphic SNPs within Holstein cows was 33,045. The accuracy of RNA-Seq SNP discovery was tested by comparing SNPs detected in a set of 42 candidate genes expressed in milk that had been resequenced earlier using Sanger sequencing technology. Seventy of 86 SNPs were detected using both RNA-Seq and Sanger sequencing technologies. The KASPar Genotyping System was used to validate unique SNPs found by RNA-Seq but not observed by Sanger technology. Our results confirm that analyzing the transcriptome using RNA-Seq technology is an efficient and cost-effective method to identify SNPs in transcribed regions. This study creates guidelines to maximize the accuracy of SNP discovery and prevention of false-positive SNP detection, and provides more than 33,000 SNPs located in coding regions of genes expressed during lactation that can be used to develop genotyping platforms to perform marker-trait association studies in Holstein cattle.
Human population-specific gene expression and transcriptional network modification with polymorphic transposable elements

PubMed Central

Wang, Lu; Mariño-Ramírez, Leonardo

2017-01-01

Transposable element (TE) derived sequences are known to contribute to the regulation of the human genome. The majority of known TE-derived regulatory sequences correspond to relatively ancient insertions, which are fixed across human populations. The extent to which human genetic variation caused by recent TE activity leads to regulatory polymorphisms among populations has yet to be thoroughly explored. In this study, we searched for associations between polymorphic TE (polyTE) loci and human gene expression levels using an expression quantitative trait loci (eQTL) approach. We compared locus-specific polyTE insertion genotypes to B cell gene expression levels among 445 individuals from 5 human populations. Numerous human polyTE loci correspond to both cis and trans eQTL, and their regulatory effects are directly related to cell type-specific function in the immune system. PolyTE loci are associated with differences in expression between European and African population groups, and a single polyTE locus is indirectly associated with the expression of numerous genes via the regulation of the B cell-specific transcription factor PAX5. The polyTE-gene expression associations we found indicate that human TE genetic variation can have important phenotypic consequences. Our results reveal that TE-eQTL are involved in population-specific gene regulation as well as transcriptional network modification. PMID:27998931
Intra and Interspecific Variations of Gene Expression Levels in Yeast Are Largely Neutral: (Nei Lecture, SMBE 2016, Gold Coast).

PubMed

Yang, Jian-Rong; Maclean, Calum J; Park, Chungoo; Zhao, Huabin; Zhang, Jianzhi

2017-09-01

It is commonly, although not universally, accepted that most intra and interspecific genome sequence variations are more or less neutral, whereas a large fraction of organism-level phenotypic variations are adaptive. Gene expression levels are molecular phenotypes that bridge the gap between genotypes and corresponding organism-level phenotypes. Yet, it is unknown whether natural variations in gene expression levels are mostly neutral or adaptive. Here we address this fundamental question by genome-wide profiling and comparison of gene expression levels in nine yeast strains belonging to three closely related Saccharomyces species and originating from five different ecological environments. We find that the transcriptome-based clustering of the nine strains approximates the genome sequence-based phylogeny irrespective of their ecological environments. Remarkably, only ∼0.5% of genes exhibit similar expression levels among strains from a common ecological environment, no greater than that among strains with comparable phylogenetic relationships but different environments. These and other observations strongly suggest that most intra and interspecific variations in yeast gene expression levels result from the accumulation of random mutations rather than environmental adaptations. This finding has profound implications for understanding the driving force of gene expression evolution, genetic basis of phenotypic adaptation, and general role of stochasticity in evolution. © The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Subgenome-specific assembly of vitamin E biosynthesis genes and expression patterns during seed development provide insight into the evolution of oat genome.

PubMed

Gutierrez-Gonzalez, Juan J; Garvin, David F

2016-11-01

Vitamin E is essential for humans and thus must be a component of a healthy diet. Among the cereal grains, hexaploid oats (Avena sativa L.) have high vitamin E content. To date, no gene sequences in the vitamin E biosynthesis pathway have been reported for oats. Using deep sequencing and orthology-guided assembly, coding sequences of genes for each step in vitamin E synthesis in oats were reconstructed, including resolution of the sequences of homeologs. Three homeologs, presumably representing each of the three oat subgenomes, were identified for the main steps of the pathway. Partial sequences, likely representing pseudogenes, were recovered in some instances as well. Pairwise comparisons among homeologs revealed that two of the three putative subgenome-specific homeologs are almost identical for each gene. Synonymous substitution rates indicate the time of divergence of the two more similar subgenomes from the distinct one at 7.9-8.7 MYA, and a divergence between the similar subgenomes from a common ancestor 1.1 MYA. A new proposed evolutionary model for hexaploid oat formation is discussed. Homeolog-specific gene expression was quantified during oat seed development and compared with vitamin E accumulation. Homeolog expression appears to be largely similar for most genes; however, for some genes, homeolog-specific transcriptional bias was observed. The expression of HPPD, as well as certain homeologs of VTE2 and VTE4, is highly correlated with seed vitamin E accumulation. Our findings expand our understanding of oat genome evolution and will assist efforts to modify vitamin E content and composition in oats. Published 2016. This article is a U.S. Government work and is in the public domain in the USA. Plant Biotechnology Journal published by Society for Experimental Biology and The Association of Applied Biologists and John Wiley & Sons Ltd.
De novo Sequencing and Comparative Transcriptomics of Floral Development of the Distylous Species Lithospermum multiflorum

PubMed Central

Cohen, James I.

2016-01-01

Genes controlling the morphological, micromorphological, and physiological components of the breeding system distyly have been hypothesized, but many of the genes have not been investigated throughout development of the two floral morphs. To this end, the present study is an examination of comparative transcriptomes from three stages of development for the floral organs of the morphs of Lithospermum multiflorum. Transcriptomes of flowers of the two morphs, from various stages of development, were sequenced using an Illumina HiSeq 2000. The floral transcriptome of L. multiflorum was assembled, and differential gene expression (DE) was identified between morphs, throughout development. Additionally, Gene Ontology (GO) terms for DE genes were determined. Fewer genes were DE early in development compared to later in development, with more genes highly expressed in the gynoecium of the SS morph and the corolla and androecium of the LS morph. A reciprocal pattern was observed later in development, and many more genes were DE during this latter stage. During early development, DE genes appear to be involved in growth and floral development, and during later development, DE genes seem to affect physiological functions. Interestingly, many genes involved in response to stress were identified as DE between morphs. PMID:28066486
Covariance Matrix Estimation for Massive MIMO

NASA Astrophysics Data System (ADS)

Upadhya, Karthik; Vorobyov, Sergiy A.

2018-04-01

We propose a novel pilot structure for covariance matrix estimation in massive multiple-input multiple-output (MIMO) systems in which each user transmits two pilot sequences, with the second pilot sequence multiplied by a random phase-shift. The covariance matrix of a particular user is obtained by computing the sample cross-correlation of the channel estimates obtained from the two pilot sequences. This approach relaxes the requirement that all the users transmit their uplink pilots over the same set of symbols. We derive expressions for the achievable rate and the mean-squared error of the covariance matrix estimate when the proposed method is used with staggered pilots. The performance of the proposed method is compared with existing methods through simulations.
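The central computation named in this abstract, a sample cross-correlation of two sets of channel estimates, can be sketched as follows. This is a minimal illustration in plain Python: the function name `sample_cross_correlation` and the input layout (one list of complex antenna coefficients per coherence block) are assumptions, and the random phase-shift and staggered-pilot details of the proposed scheme are not modeled.

```python
def sample_cross_correlation(est_a, est_b):
    """Average the outer products est_a[n] * conj(est_b[n])^T over n.

    est_a, est_b: lists of equal length; each element is a list of M complex
    channel-estimate coefficients (one per antenna), obtained from the first
    and second pilot sequence respectively (assumed layout).
    Returns an M x M matrix as a list of lists of complex numbers.
    """
    if not est_a or len(est_a) != len(est_b):
        raise ValueError("need two non-empty estimate lists of equal length")
    m = len(est_a[0])
    acc = [[0j] * m for _ in range(m)]
    for a, b in zip(est_a, est_b):
        if len(a) != m or len(b) != m:
            raise ValueError("all estimates must have the same dimension")
        for i in range(m):
            for j in range(m):
                # Entry (i, j) of the outer product a * b^H.
                acc[i][j] += complex(a[i]) * complex(b[j]).conjugate()
    n = len(est_a)
    return [[value / n for value in row] for row in acc]


# Hypothetical toy input: two coherence blocks, two antennas.
toy = [[1 + 0j, 1j], [2 + 0j, 0j]]
for row in sample_cross_correlation(toy, toy):
    print(row)
```

Using two separate estimates rather than one is what distinguishes a cross-correlation from the ordinary sample covariance; when the errors in the two estimates are uncorrelated, their contribution tends to average out rather than add a bias term.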

Molecular cloning, mRNA expression and tissue distribution analysis of Slc7a11 gene in alpaca (Lama paco) skins associated with different coat colors.

PubMed

Tian, Xue; Meng, Xiaolin; Wang, Liangyan; Song, Yunfei; Zhang, Danli; Ji, Yuankai; Li, Xuejun; Dong, Changsheng

2015-01-25

Slc7a11, encoding solute carrier family 7 member 11 (anionic amino acid transporter light chain, xCT), has been identified as a critical genetic regulator of pheomelanin synthesis in hair and melanocytes. To better understand the molecular characterization of Slc7a11 and the expression patterns in skin of white versus brown alpaca (Lama paco), we cloned the full length coding sequence (CDS) of alpaca Slc7a11 gene and analyzed the expression patterns using Real Time PCR, Western blotting and immunohistochemistry. The full length CDS of 1512bp encodes a 503 amino acid polypeptide. Sequence analysis showed that alpaca xCT contains 12 transmembrane regions consistent with the highly conserved amino acid permease (AA_permease_2) domain similar to other vertebrates. Sequence alignment and phylogenetic analysis revealed that alpaca xCT had the highest identity and shared the same branch with Camelus ferus. Real Time PCR and Western blotting suggested that xCT was expressed at significantly high levels in brown alpaca skin, and transcripts and protein possessed the same expression pattern in white and brown alpaca skins. Additionally, immunohistochemical analysis further demonstrated that xCT staining was robustly increased in the matrix and root sheath of brown alpaca skin compared with that of white. These results suggest that Slc7a11 functions in alpaca coat color regulation and offer essential information for further exploration on the role of Slc7a11 in melanogenesis. Copyright © 2014 Elsevier B.V. All rights reserved.
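The relationship between the reported CDS length and polypeptide length can be checked with a short calculation. The sketch below assumes that the 1512 bp figure includes the terminal stop codon, which is consistent with the reported 503 residues; the function name `protein_length_from_cds` is hypothetical.

```python
def protein_length_from_cds(cds_bp, includes_stop=True):
    """Number of amino acids encoded by a coding sequence of cds_bp bases."""
    if cds_bp <= 0 or cds_bp % 3 != 0:
        raise ValueError("a complete CDS length must be a positive multiple of 3")
    codons = cds_bp // 3
    # The stop codon occupies one codon but adds no amino acid.
    return codons - 1 if includes_stop else codons


print(protein_length_from_cds(1512))  # 1512 / 3 = 504 codons, minus the stop codon
```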
Transcriptome Profile Reveals that Pu-Erh Tea Represses the Expression of Vitellogenin Family to Reduce Fat Accumulation in Caenorhabditis elegans.

PubMed

Xiao, Ru-Yue; Hao, Junjun; Ding, Yi-Hong; Che, Yan-Yun; Zou, Xiao-Ju; Liang, Bin

2016-10-17

Due to an imbalance between energy intake and expenditure, obesity has become a common chronic disorder that is highly associated with many metabolic diseases. Pu-erh tea, a traditional Chinese beverage, is believed to have numerous health benefits, including anti-obesity effects. However, the mechanisms underlying its anti-obesity effect are yet to be understood. Here, we use transcriptional profiling by RNA sequencing (RNA-Seq) to survey global gene expression in response to Pu-erh tea. The model organism Caenorhabditis elegans was treated with different concentrations of Pu-erh tea water extract (PTE, 0 g/mL, 0.025 g/mL, and 0.05 g/mL). Compared with the control, PTE indeed decreases lipid droplet size and fat accumulation. RNA-Seq detected 18073 and 18105 genes expressed in the 0.025 g/mL and 0.05 g/mL PTE-treated groups, respectively. Interestingly, the expression of the vitellogenin family (vit-1, vit-2, vit-3, vit-4 and vit-5) was significantly decreased by PTE, which was validated by qPCR analysis. Furthermore, vit-1(ok2616), vit-3(ok2348) and vit-5(ok3239) mutants are insensitive to PTE-triggered fat reduction. In conclusion, our RNA-Seq transcriptional profile suggests that Pu-erh tea lowers fat accumulation primarily through repression of the expression of the vit (vitellogenin) family, in addition to our previously reported sterol regulatory element binding protein (SREBP)-stearoyl-CoA desaturase (SCD) axis.
Transfection and heat-inducible expression of molluscan promoter-luciferase reporter gene constructs in the Biomphalaria glabrata embryonic snail cell line.

PubMed

Yoshino, T P; Wu, X J; Liu, H D

1998-09-01

Studies were initiated to begin developing a genetic transformation system for cells derived from the freshwater gastropod, Biomphalaria glabrata, an intermediate host of the human blood fluke Schistosoma mansoni. Using a 70-kD heat-shock protein (HSP70) cDNA probe obtained from the B. glabrata embryonic (Bge) cell line, we cloned from Bge cells a complete HSP70 gene including a 1-kb genomic DNA fragment in its 5'-flanking region containing sequences indicative of a HSP promoter. Identified in the 5'-half (416 nucleotides) of this genomic fragment were TATA and CAAT boxes, two putative transcription initiation sites, and a series of palindromic DNA repeats with shared homology to the heat-shock element consensus sequence (Bge HSP70(0.5k) promoter). The 3'-half of this upstream flanking region was comprised of a 508-base intron located immediately 5' of the ATG start codon. To determine the functionality of the putative snail promoter sequence, Bge HSP promoter/luciferase (Luc) reporter gene constructs were introduced into Bge cells by N-(1-(2,3-dioleoyloxy) propyl)-N,N,N-trimethylammonium methylsulfate (DOTAP)-mediated transfection methods, and assayed for Luc activity 48 hr following a 1.5-hr heat-shock treatment (40 degrees C). Compared with control vectors or the Bge HSP70(0.5k/1.0k) promoter constructs at 26 degrees C, a 10- to 300-fold increase in Luc expression was obtained only in the Bge HSP70 promoter/Luc-transfected cells following heat-shock. Results of transfection experiments demonstrate that the Bge HSP70(0.5k) DNA segment contains appropriate promoter sequences for driving temperature-inducible gene expression in the Bge snail cell line. This report represents the first isolation and functional characterization of an inducible promoter from a freshwater gastropod mollusc. 
Successful transient expression of a foreign reporter gene in Bge cells using a homologous, inducible promoter sequence now paves the way for development of methods for stable integration and expression of snail genes of interest into the Bge cell line.
First Transcriptome and Digital Gene Expression Analysis in Neuroptera with an Emphasis on Chemoreception Genes in Chrysopa pallens (Rambur).

PubMed

Li, Zhao-Qun; Zhang, Shuai; Ma, Yan; Luo, Jun-Yu; Wang, Chun-Yi; Lv, Li-Min; Dong, Shuang-Lin; Cui, Jin-Jie

2013-01-01

Chrysopa pallens (Rambur) are the most important natural enemies and predators of various agricultural pests. Understanding the sophisticated olfactory system in insect antennae is crucial for studying the physiological bases of olfaction and also could lead to effective applications of C. pallens in integrated pest management. However no transcriptome information is available for Neuroptera, and sequence data for C. pallens are scarce, so obtaining more sequence data is a priority for researchers on this species. To facilitate identifying sets of genes involved in olfaction, a normalized transcriptome of C. pallens was sequenced. A total of 104,603 contigs were obtained and assembled into 10,662 clusters and 39,734 singletons; 20,524 were annotated based on BLASTX analyses. A large number of candidate chemosensory genes were identified, including 14 odorant-binding proteins (OBPs), 22 chemosensory proteins (CSPs), 16 ionotropic receptors, 14 odorant receptors, and genes potentially involved in olfactory modulation. To better understand the OBPs, CSPs and cytochrome P450s, phylogenetic trees were constructed. In addition, 10 digital gene expression libraries of different tissues were constructed and gene expression profiles were compared among different tissues in males and females. Our results provide a basis for exploring the mechanisms of chemoreception in C. pallens, as well as other insects. The evolutionary analyses in our study provide new insights into the differentiation and evolution of insect OBPs and CSPs. Our study provided large-scale sequence information for further studies in C. pallens.
Integrating transcriptome and genome re-sequencing data to identify key genes and mutations affecting chicken eggshell qualities.

PubMed

Zhang, Quan; Zhu, Feng; Liu, Long; Zheng, Chuan Wei; Wang, De He; Hou, Zhuo Cheng; Ning, Zhong Hua

2015-01-01

Eggshell damage leads to economic losses in the egg production industry and is a threat to human health. We examined 49-wk-old Rhode Island White hens (Gallus gallus) that laid eggs having shells with significantly different strengths and thicknesses. We used HiSeq 2000 (Illumina) sequencing to characterize the chicken transcriptome and whole genome to identify the key genes and genetic mutations associated with eggshell calcification. We identified a total of 14,234 genes expressed in the chicken uterus, representing 89% of all annotated chicken genes. A total of 889 differentially expressed genes (DEGs) were identified by comparing low eggshell strength (LES) and normal eggshell strength (NES) genomes. The DEGs are enriched in calcification-related processes, including calcium ion transport and calcium signaling pathways, as revealed by Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway analysis. Some important matrix proteins, such as OC-116, LTF and SPP1, were also expressed differentially between the two groups. A total of 3,671,919 single-nucleotide polymorphisms (SNPs) and 508,035 Indels were detected in protein coding genes by whole-genome re-sequencing, including 1775 non-synonymous variations and 19 frame-shift Indels in DEGs. SNPs and Indels found in this study could be further investigated for eggshell traits. This is the first report to integrate transcriptome and genome re-sequencing to target the genetic variations that decrease eggshell quality. These findings further advance our understanding of eggshell calcification in the chicken uterus.
Purifying Selection Maintains Dosage-Sensitive Genes during Degeneration of the Threespine Stickleback Y Chromosome

PubMed Central

White, Michael A.; Kitano, Jun; Peichel, Catherine L.

2015-01-01

Sex chromosomes are subject to unique evolutionary forces that cause suppression of recombination, leading to sequence degeneration and the formation of heteromorphic chromosome pairs (i.e., XY or ZW). Although progress has been made in characterizing the outcomes of these evolutionary processes on vertebrate sex chromosomes, it is still unclear how recombination suppression and sequence divergence typically occur and how gene dosage imbalances are resolved in the heterogametic sex. The threespine stickleback fish (Gasterosteus aculeatus) is a powerful model system to explore vertebrate sex chromosome evolution, as it possesses an XY sex chromosome pair at relatively early stages of differentiation. Using a combination of whole-genome and transcriptome sequencing, we characterized sequence evolution and gene expression across the sex chromosomes. We uncovered two distinct evolutionary strata that correspond with known structural rearrangements on the Y chromosome. In the oldest stratum, only a handful of genes remain, and these genes are under strong purifying selection. By comparing sex-linked gene expression with expression of autosomal orthologs in an outgroup, we show that dosage compensation has not evolved in threespine sticklebacks through upregulation of the X chromosome in males. Instead, in the oldest stratum, the genes that still possess a Y chromosome allele are enriched for genes predicted to be dosage sensitive in mammals and yeast. Our results suggest that dosage imbalances may have been avoided at haploinsufficient genes by retaining function of the Y chromosome allele through strong purifying selection. PMID:25818858
Discovery of common sequences absent in the human reference genome using pooled samples from next generation sequencing.

PubMed

Liu, Yu; Koyutürk, Mehmet; Maxwell, Sean; Xiang, Min; Veigl, Martina; Cooper, Richard S; Tayo, Bamidele O; Li, Li; LaFramboise, Thomas; Wang, Zhenghe; Zhu, Xiaofeng; Chance, Mark R

2014-08-16

Sequences up to several megabases in length have been found to be present in individual genomes but absent in the human reference genome. These sequences may be common in populations, and their absence in the reference genome may indicate rare variants in the genomes of individuals who served as donors for the human genome project. As the reference genome is used in probe design for microarray technology and mapping short reads in next generation sequencing (NGS), this missing sequence could be a source of bias in functional genomic studies and variant analysis. One End Anchor (OEA) and/or orphan reads from paired-end sequencing have been used to identify novel sequences that are absent in the reference genome. However, no study has investigated the distribution, evolution and functionality of those sequences in human populations. To systematically identify and study the missing common sequences (micSeqs), we extended the previous method by pooling OEA reads from a large number of individuals and applying strict filtering methods to remove false sequences. The pipeline was applied to data from phase 1 of the 1000 Genomes Project. We identified 309 micSeqs that are present in at least 1% of the human population, but absent in the reference genome. We confirmed 76% of these 309 micSeqs by comparison to other primate genomes, individual human genomes, and gene expression data. Furthermore, we randomly selected fifteen micSeqs and confirmed their presence using PCR validation in 38 additional individuals. Functional analysis using published RNA-seq and ChIP-seq data showed that eleven micSeqs are highly expressed in human brain and three micSeqs contain transcription factor (TF) binding regions, suggesting they are functional elements. 
In addition, the identified micSeqs are absent in non-primates and show dynamic acquisition during primate evolution culminating with most micSeqs being present in Africans, suggesting some micSeqs may be important sources of human diversity. 76% of micSeqs were confirmed by a comparative genomics approach. Fourteen micSeqs are expressed in human brain or contain TF binding regions. Some micSeqs are primate-specific, conserved and may play a role in the evolution of primates.
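The "present in at least 1% of the population" criterion in this record amounts to a carrier-frequency filter. The sketch below shows one simple way such a filter could look; the presence/absence input format, the function name `common_sequences`, and the sequence IDs are hypothetical, and the study's actual pipeline (pooled OEA reads with strict filtering) is considerably more involved.

```python
def common_sequences(presence, min_fraction=0.01):
    """Return sorted IDs of sequences carried by >= min_fraction of individuals.

    presence: dict mapping a sequence ID to a list of booleans, one per
    individual, True when the sequence was detected in that individual
    (assumed input format).
    """
    kept = []
    for seq_id, calls in presence.items():
        if not calls:
            continue  # no individuals genotyped for this sequence
        fraction = sum(1 for call in calls if call) / len(calls)
        if fraction >= min_fraction:
            kept.append(seq_id)
    return sorted(kept)


# Hypothetical toy data.
toy_presence = {
    "seqA": [True] * 2 + [False] * 98,   # 2% of individuals
    "seqB": [False] * 200,               # never observed
    "seqC": [True] + [False] * 199,      # 0.5% of individuals
}
print(common_sequences(toy_presence))
```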
Alu sequence involvement in transcriptional insulation of the keratin 18 gene in transgenic mice.

PubMed Central

Thorey, I S; Ceceña, G; Reynolds, W; Oshima, R G

1993-01-01

The human keratin 18 (K18) gene is expressed in a variety of adult simple epithelial tissues, including liver, intestine, lung, and kidney, but is not normally found in skin, muscle, heart, spleen, or most of the brain. Transgenic animals derived from the cloned K18 gene express the transgene in appropriate tissues at levels directly proportional to the copy number and independently of the sites of integration. We have investigated in transgenic mice the dependence of K18 gene expression on the distal 5' and 3' flanking sequences and upon the RNA polymerase III promoter of an Alu repetitive DNA transcription unit immediately upstream of the K18 promoter. Integration site-independent expression of tandemly duplicated K18 transgenes requires the presence of either an 825-bp fragment of the 5' flanking sequence or the 3.5-kb 3' flanking sequence. Mutation of the RNA polymerase III promoter of the Alu element within the 825-bp fragment abolishes copy number-dependent expression in kidney but does not abolish integration site-independent expression when assayed in the absence of the 3' flanking sequence of the K18 gene. The characteristics of integration site-independent expression and copy number-dependent expression are separable. In addition, the formation of the chromatin state of the K18 gene, which likely restricts the tissue-specific expression of this gene, is not dependent upon the distal flanking sequences of the 10-kb K18 gene but rather may depend on internal regulatory regions of the gene. PMID:7692231
Comparative Genomics Analyses Reveal Extensive Chromosome Colinearity and Novel Quantitative Trait Loci in Eucalyptus.

PubMed

Li, Fagen; Zhou, Changpin; Weng, Qijie; Li, Mei; Yu, Xiaoli; Guo, Yong; Wang, Yu; Zhang, Xiaohong; Gan, Siming

2015-01-01

Dense genetic maps, along with quantitative trait loci (QTLs) detected on such maps, are powerful tools for genomics and molecular breeding studies. In the important woody genus Eucalyptus, the recent release of the E. grandis genome sequence allows for sequence-based genomic comparison and searching for positional candidate genes within QTL regions. Here, dense genetic maps were constructed for E. urophylla and E. tereticornis using genomic simple sequence repeats (SSR), expressed sequence tag (EST) derived SSR, EST-derived cleaved amplified polymorphic sequence (EST-CAPS), and diversity arrays technology (DArT) markers. The E. urophylla and E. tereticornis maps comprised 700 and 585 markers across 11 linkage groups, totaling 1,208.2 and 1,241.4 cM in length, respectively. Extensive synteny and colinearity were observed as compared to three earlier DArT-based eucalypt maps (two maps with E. grandis × E. urophylla and one map of E. globulus) and with the E. grandis genome sequence. Fifty-three QTLs for growth (10-56 months of age) and wood density (56 months) were identified in 22 discrete regions on both maps, in which only one colocalization was found between growth and wood density. Novel QTLs were revealed as compared with those previously detected on DArT-based maps for similar ages in Eucalyptus. Eleven to 585 positional candidate genes were obtained for a 56-month-old QTL through aligning QTL confidence interval with the E. grandis genome. These results will assist in comparative genomics studies, targeted gene characterization, and marker-assisted selection in Eucalyptus and the related taxa.
Comparative Genomics Analyses Reveal Extensive Chromosome Colinearity and Novel Quantitative Trait Loci in Eucalyptus

PubMed Central

Weng, Qijie; Li, Mei; Yu, Xiaoli; Guo, Yong; Wang, Yu; Zhang, Xiaohong; Gan, Siming

2015-01-01

Dense genetic maps, along with quantitative trait loci (QTLs) detected on such maps, are powerful tools for genomics and molecular breeding studies. In the important woody genus Eucalyptus, the recent release of the E. grandis genome sequence allows for sequence-based genomic comparison and searching for positional candidate genes within QTL regions. Here, dense genetic maps were constructed for E. urophylla and E. tereticornis using genomic simple sequence repeats (SSR), expressed sequence tag (EST) derived SSR, EST-derived cleaved amplified polymorphic sequence (EST-CAPS), and diversity arrays technology (DArT) markers. The E. urophylla and E. tereticornis maps comprised 700 and 585 markers across 11 linkage groups, totaling 1,208.2 and 1,241.4 cM in length, respectively. Extensive synteny and colinearity were observed as compared to three earlier DArT-based eucalypt maps (two maps with E. grandis × E. urophylla and one map of E. globulus) and with the E. grandis genome sequence. Fifty-three QTLs for growth (10–56 months of age) and wood density (56 months) were identified in 22 discrete regions on both maps, in which only one colocalization was found between growth and wood density. Novel QTLs were revealed as compared with those previously detected on DArT-based maps for similar ages in Eucalyptus. Eleven to 585 positional candidate genes were obtained for a 56-month-old QTL through aligning QTL confidence interval with the E. grandis genome. These results will assist in comparative genomics studies, targeted gene characterization, and marker-assisted selection in Eucalyptus and the related taxa. PMID:26695430
Skipping Strategy (SS) for Initial Population of Job-Shop Scheduling Problem

NASA Astrophysics Data System (ADS)

Abdolrazzagh-Nezhad, M.; Nababan, E. B.; Sarim, H. M.

2018-03-01

Generating the initial population for the job-shop scheduling problem (JSSP) is an essential step toward obtaining a near-optimal solution. Techniques used to solve the JSSP are computationally demanding. A skipping strategy (SS) is employed to acquire the initial population after the sequence of jobs on each machine and the sequence of operations (expressed in Plates-jobs and mPlates-jobs) are determined. The proposed technique is applied to benchmark datasets, and the results are compared with those of other initialization techniques. It is shown that the initial population obtained from the SS approach can generate an optimal solution.
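The abstract does not define the skipping strategy itself, so the sketch below does not attempt it. It only shows how one member of a JSSP initial population is commonly evaluated, using the standard operation-based encoding in which a job index appearing for the k-th time denotes that job's k-th operation. The function name `makespan`, the data layout, and the toy instance are assumptions for illustration.

```python
def makespan(jobs, sequence):
    """Decode an operation-based sequence and return the schedule makespan.

    jobs: list of jobs; each job is an ordered list of (machine, duration).
    sequence: list of job indices; job j must appear len(jobs[j]) times.
    """
    next_op = [0] * len(jobs)      # next unscheduled operation of each job
    job_ready = [0] * len(jobs)    # time at which each job becomes free
    machine_ready = {}             # time at which each machine becomes free
    for j in sequence:
        if next_op[j] >= len(jobs[j]):
            raise ValueError(f"job {j} appears too many times in the sequence")
        machine, duration = jobs[j][next_op[j]]
        # An operation starts once both its job and its machine are free.
        start = max(job_ready[j], machine_ready.get(machine, 0))
        end = start + duration
        job_ready[j] = end
        machine_ready[machine] = end
        next_op[j] += 1
    if any(next_op[j] != len(jobs[j]) for j in range(len(jobs))):
        raise ValueError("sequence does not schedule every operation")
    return max(job_ready)


# Hypothetical 2-job, 2-machine instance.
toy_jobs = [[(0, 3), (1, 2)], [(1, 2), (0, 4)]]
print(makespan(toy_jobs, [0, 1, 0, 1]))
```

An initialization technique can then be compared with another by generating a population with each and comparing the distribution of makespans.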
Comparative analysis of miRNA expression during the development of insects of different metamorphosis modes and germ-band types.

PubMed

Ylla, Guillem; Piulachs, Maria-Dolors; Belles, Xavier

2017-10-11

Do miRNAs contribute to specify the germ-band type and the body structure in the insect embryo? Our goal was to address that issue by studying the changes in miRNA expression along the ontogeny of the German cockroach Blattella germanica, which is a short germ-band and hemimetabolan species. We sequenced small RNA libraries representing 11 developmental stages of B. germanica ontogeny (with especial emphasis on embryogenesis) and the changes in miRNA expression were examined. Data were compared with equivalent data for two long germ-band holometabolan species Drosophila melanogaster and Drosophila virilis, and the short germ-band holometabolan species Tribolium castaneum. The identification of B. germanica embryo small RNA sequences unveiled miRNAs not detected in previous studies, such as those of the MIR-309 family and 54 novel miRNAs. Four main waves of miRNA expression were recognized (with most miRNA changes occurring during the embryonic stages): the first from day 0 to day 1 of embryogenesis, the second during mid-embryogenesis (days 0-6), the third (with an acute expression peak) on day 2 of embryonic development, and the fourth during post-embryonic development. The second wave defined the boundaries of maternal-to-zygotic transition, with maternal mRNAs being cleared, presumably by Mir-309 and associated scavenger miRNAs. miRNAs follow well-defined patterns of expression over hemimetabolan ontogeny, patterns that are more diverse during embryonic development than during the nymphal stages. The results suggest that miRNAs play important roles in the developmental transitions between the embryonic stages of development (starting with maternal loading), during which they might influence the germ-band type and metamorphosis mode.
The influence of methylated septin 9 gene on RNA and protein level in colorectal cancer.

PubMed

Tóth, Kinga; Galamb, Orsolya; Spisák, Sándor; Wichmann, Barnabás; Sipos, Ferenc; Valcz, Gábor; Leiszter, Katalin; Molnár, Béla; Tulassay, Zsolt

2011-09-01

Colorectal cancer is one of the leading causes of death in the world. The specificity and sensitivity of present screening methods are inadequate, and patient compliance is too low. Currently the most effective method is colonoscopy, because it offers not only macroscopic diagnosis but also a therapeutic option; however, patient compliance is very low. Hence, the development of new diagnostic methods is needed. Altered expression of septin 9 has been found in several tumor types, including colorectal cancer. The aim of this study was to detect the methylation-related mRNA and protein expression changes of septin 9 in the colorectal adenoma-dysplasia-carcinoma sequence and to analyze their reversibility by demethylation treatment. Septin 9 protein expression showed a significant difference between normal and colorectal cancer (CRC) samples (p < 0.001). According to biopsy microarray results, septin 9 mRNA expression decreased with the progression of colon neoplastic disease (p < 0.001). In laser microdissected epithelial cells, septin 9 was significantly underexpressed in CRC compared to healthy controls (p < 0.001). The expression of the septin9_v1 region was higher in the healthy samples, while overexpression of septin9_v2, v4, v4* and v5 was detected in cancer epithelial cells compared to normal. The septin 9 mRNA and protein levels of HT29 cells increased after demethylation treatment. The increasing methylation of the septin 9 gene during colorectal adenoma-dysplasia-carcinoma sequence progression is reflected in decreasing mRNA and protein expression, especially in the epithelium. These changes can be reversed by demethylation agents, converting this screening marker gene into a therapeutic target.
Viral Infection Induces Expression of Novel Phased MicroRNAs from Conserved Cellular MicroRNA Precursors

PubMed Central

Zhang, Jiayao; Zhao, Shuqi; Zheng, Hong; Gao, Ge; Wei, Liping; Li, Yi

2011-01-01

RNA silencing, mediated by small RNAs including microRNAs (miRNAs) and small interfering RNAs (siRNAs), is a potent antiviral or antibacterial mechanism, besides regulating normal cellular gene expression critical for development and physiology. To gain insights into host small RNA metabolism under infections by different viruses, we used Solexa/Illumina deep sequencing to characterize the small RNA profiles of rice plants infected by two distinct viruses, Rice dwarf virus (RDV, dsRNA virus) and Rice stripe virus (RSV, a negative sense and ambisense RNA virus), respectively, as compared with those from non-infected plants. Our analyses showed that RSV infection enhanced the accumulation of some rice miRNA*s, but not their corresponding miRNAs, as well as accumulation of phased siRNAs from a particular precursor. Furthermore, RSV infection also induced the expression of novel miRNAs in a phased pattern from several conserved miRNA precursors. In comparison, no such changes in host small RNA expression were observed in RDV-infected rice plants. Significantly, RSV infection elevated the expression levels of selective OsDCLs and OsAGOs, whereas RDV infection only affected the expression of certain OsRDRs. Our results provide a comparative analysis, via deep sequencing, of changes in the small RNA profiles and in the genes of RNA silencing machinery induced by different viruses in a natural and economically important crop host plant. They uncover new mechanisms and complexity of virus-host interactions that may have important implications for further studies on the evolution of cellular small RNA biogenesis that impact pathogen infection, pathogenesis, as well as organismal development. PMID:21901091
Faster-X Evolution of Gene Expression in Drosophila

PubMed Central

Meisel, Richard P.; Malone, John H.; Clark, Andrew G.

2012-01-01

DNA sequences on X chromosomes often have a faster rate of evolution when compared to similar loci on the autosomes, and well-articulated models provide reasons why the X-linked mode of inheritance may be responsible for the faster evolution of X-linked genes. We analyzed microarray and RNA–seq data collected from females and males of six Drosophila species and found that the expression levels of X-linked genes also diverge faster than autosomal gene expression, similar to the “faster-X” effect often observed in DNA sequence evolution. Faster-X evolution of gene expression was recently described in mammals, but it was limited to the evolutionary lineages shortly following the creation of the therian X chromosome. In contrast, we detect a faster-X effect along both deep lineages and those on the tips of the Drosophila phylogeny. In Drosophila males, the dosage compensation complex (DCC) binds the X chromosome, creating a unique chromatin environment that promotes the hyper-expression of X-linked genes. We find that DCC binding, chromatin environment, and breadth of expression are all predictive of the rate of gene expression evolution. In addition, estimates of the intraspecific genetic polymorphism underlying gene expression variation suggest that X-linked expression levels are not under relaxed selective constraints. We therefore hypothesize that the faster-X evolution of gene expression is the result of the adaptive fixation of beneficial mutations at X-linked loci that change expression level in cis. This adaptive faster-X evolution of gene expression is limited to genes that are narrowly expressed in a single tissue, suggesting that relaxed pleiotropic constraints permit a faster response to selection. Finally, we present a conceptual framework to explain faster-X expression evolution, and we use this framework to examine differences in the faster-X effect between Drosophila and mammals. PMID:23071459
Analysis of Epstein-Barr Virus Genomes and Expression Profiles in Gastric Adenocarcinoma.

PubMed

Borozan, Ivan; Zapatka, Marc; Frappier, Lori; Ferretti, Vincent

2018-01-15

Epstein-Barr virus (EBV) is a causative agent of a variety of lymphomas, nasopharyngeal carcinoma (NPC), and ∼9% of gastric carcinomas (GCs). An important question is whether particular EBV variants are more oncogenic than others, but conclusions are currently hampered by the lack of sequenced EBV genomes. Here, we contribute to this question by mining whole-genome sequences of 201 GCs to identify 13 EBV-positive GCs and by assembling 13 new EBV genome sequences, almost doubling the number of available GC-derived EBV genome sequences and providing the first non-Asian EBV genome sequences from GC. Whole-genome sequence comparisons of all EBV isolates sequenced to date (85 from tumors and 57 from healthy individuals) showed that most GC and NPC EBV isolates were closely related although American Caucasian GC samples were more distant, suggesting a geographical component. However, EBV GC isolates were found to contain some consistent changes in protein sequences regardless of geographical origin. In addition, transcriptome data available for eight of the EBV-positive GCs were analyzed to determine which EBV genes are expressed in GC. In addition to the expected latency proteins (EBNA1, LMP1, and LMP2A), specific subsets of lytic genes were consistently expressed that did not reflect a typical lytic or abortive lytic infection, suggesting a novel mechanism of EBV gene regulation in the context of GC. These results are consistent with a model in which a combination of specific latent and lytic EBV proteins promotes tumorigenesis. IMPORTANCE Epstein-Barr virus (EBV) is a widespread virus that causes cancer, including gastric carcinoma (GC), in a small subset of individuals. An important question is whether particular EBV variants are more cancer associated than others, but more EBV sequences are required to address this question. 
Here, we have generated 13 new EBV genome sequences from GC, almost doubling the number of EBV sequences from GC isolates and providing the first EBV sequences from non-Asian GC. We further identify sequence changes in some EBV proteins common to GC isolates. In addition, gene expression analysis of eight of the EBV-positive GCs showed consistent expression of both the expected latency proteins and a subset of lytic proteins that was not consistent with typical lytic or abortive lytic expression. These results suggest that novel mechanisms activate expression of some EBV lytic proteins and that their expression may contribute to oncogenesis. Copyright © 2018 American Society for Microbiology.
Transcriptome and Gene Expression Analysis of the Rice Leaf Folder, Cnaphalocrocis medinalis

PubMed Central

Li, Shang-Wei; Yang, Hong; Liu, Yue-Feng; Liao, Qi-Rong; Du, Juan; Jin, Dao-Chao

2012-01-01

Background The rice leaf folder (RLF), Cnaphalocrocis medinalis (Guenee) (Lepidoptera: Pyralidae), is one of the most destructive pests affecting rice in Asia. Although several studies have been performed on the ecological and physiological aspects of this species, the molecular mechanisms underlying its developmental regulation, behavior, and insecticide resistance remain largely unknown. Presently, there is a lack of genomic information for RLF; therefore, studies aimed at profiling the RLF transcriptome expression would provide a better understanding of its biological function at the molecular level. Principal Findings De novo assembly of the RLF transcriptome was performed via the short read sequencing technology (Illumina). In a single run, we produced more than 23 million sequencing reads that were assembled into 44,941 unigenes (mean size = 474 bp) by Trinity. Through a similarity search, 25,281 (56.82%) unigenes matched known proteins in the NCBI Nr protein database. The transcriptome sequences were annotated with gene ontology (GO), cluster of orthologous groups of proteins (COG), and KEGG orthology (KO). Additionally, we profiled gene expression during RLF development using a tag-based digital gene expression (DGE) system. Five DGE libraries were constructed, and variations in gene expression were compared between collected samples: eggs vs. 3rd instar larvae, 3rd instar larvae vs. pupae, pupae vs. adults. The results demonstrated that thousands of genes were significantly differentially expressed during various developmental stages. A number of the differentially expressed genes were confirmed by quantitative real-time PCR (qRT-PCR). 
Conclusions The RLF transcriptome and DGE data provide a comprehensive and global gene expression profile that would further promote our understanding of the molecular mechanisms underlying various biological characteristics, including development, elevated fecundity, flight, sex differentiation, olfactory behavior, and insecticide resistance in RLF. Therefore, these findings could help elucidate the intrinsic factors involved in the RLF-mediated destruction of rice and offer sustainable insect pest management. PMID:23185238
In vitro resistance to 5-nitroimidazoles and benzimidazoles in Giardia duodenalis: variability and variation in gene expression.

PubMed

Argüello-García, Raúl; Cruz-Soto, Maricela; Romero-Montoya, Lydia; Ortega-Pierres, Guadalupe

2009-12-01

Giardia duodenalis trophozoites exposed in vitro to sublethal concentrations of metronidazole (MTZ) and albendazole (ABZ) may exhibit inter-culture (variability) and intra-culture (variation) differences in drug susceptibility. It was previously reported that MTZ-resistant trophozoites may display changes in pyruvate:ferredoxin oxidoreductase (PFOR) expression while changes at the beta-tubulin molecule are apparently absent in ABZ-resistant cultures. To assess the levels of gene expression of these molecules, we obtained cloned cultures growing at concentrations up to 23 µM MTZ (WBRM23) and up to 8 µM ABZ (WBRA8), and the gene sequence and expression of the pfor and beta-tubulin loci were compared with those of the drug-susceptible clone WB1. Neither the pfor nor the beta-tubulin genes showed changes at the sequence level, but the MTZ-resistant clones WBRM21 and WBRM23 showed up-regulation of the pfor RNA using the gdh gene as reference. By using the WB1 and WBRA8 clones in representational difference analyses of gene expression (RDA), an insert referred to as ARR-VSP was selected and sequenced. It showed the highest homology to one VSP molecule in the Giardia Genome Database (orf GL50803_101765). This isogene was up-regulated in five ABZ-resistant clones, and the clone WBRA8 exhibited the highest RNA expression level. When successive progenies of clones WB1, WBRM23 and WBRA8 were analyzed in Northern blot assays to detect pfor and ARR-VSP RNAs respectively, the expression patterns showed variation for both genes, but it was much lower in the clone WBRA8. These results suggest that G. duodenalis cultures either susceptible or resistant to MTZ and ABZ may display variability and variation at RNA expression levels, albeit these were more marked in the MTZ-resistant parasites. These data might have further implications for defining major mechanisms involved in drug resistance of Giardia.
dictyExpress: a web-based platform for sequence data management and analytics in Dictyostelium and beyond.

PubMed

Stajdohar, Miha; Rosengarten, Rafael D; Kokosar, Janez; Jeran, Luka; Blenkus, Domen; Shaulsky, Gad; Zupan, Blaz

2017-06-02

Dictyostelium discoideum, a soil-dwelling social amoeba, is a model for the study of numerous biological processes. Research in the field has benefited mightily from the adoption of next-generation sequencing for genomics and transcriptomics. Dictyostelium biologists now face the widespread challenges of analyzing and exploring high dimensional data sets to generate hypotheses and discover novel insights. We present dictyExpress (2.0), a web application designed for exploratory analysis of gene expression data, as well as data from related experiments such as Chromatin Immunoprecipitation sequencing (ChIP-Seq). The application features visualization modules that include time course expression profiles, clustering, gene ontology enrichment analysis, differential expression analysis and comparison of experiments. All visualizations are interactive and interconnected, such that the selection of genes in one module propagates instantly to visualizations in other modules. dictyExpress currently stores the data from over 800 Dictyostelium experiments and is embedded within a general-purpose software framework for management of next-generation sequencing data. dictyExpress allows users to explore their data in a broader context by reciprocal linking with dictyBase, a repository of Dictyostelium genomic data. In addition, we introduce a companion application called GenBoard, an intuitive graphic user interface for data management and bioinformatics analysis. dictyExpress and GenBoard enable broad adoption of next generation sequencing based inquiries by the Dictyostelium research community. Labs without the means to undertake deep sequencing projects can mine the data available to the public. The entire information flow, from raw sequence data to hypothesis testing, can be accomplished in an efficient workspace. The software framework is generalizable and represents a useful approach for any research community. 
To encourage wider usage, the backend is open-source, available for extension and further development by bioinformaticians and data scientists.
Efficient experimental design and analysis strategies for the detection of differential expression using RNA-Sequencing

PubMed Central

2012-01-01

Background RNA sequencing (RNA-Seq) has emerged as a powerful approach for the detection of differential gene expression with both high-throughput and high resolution capabilities possible depending upon the experimental design chosen. Multiplex experimental designs are now readily available; these can be utilised to increase the numbers of samples or replicates profiled at the cost of decreased sequencing depth generated per sample. These strategies impact on the power of the approach to accurately identify differential expression. This study presents a detailed analysis of the power to detect differential expression in a range of scenarios including simulated null and differential expression distributions with varying numbers of biological or technical replicates, sequencing depths and analysis methods. Results Differential and non-differential expression datasets were simulated using a combination of negative binomial and exponential distributions derived from real RNA-Seq data. These datasets were used to evaluate the performance of three commonly used differential expression analysis algorithms and to quantify the changes in power with respect to true and false positive rates when simulating variations in sequencing depth, biological replication and multiplex experimental design choices. Conclusions This work quantitatively explores comparisons between contemporary analysis tools and experimental design choices for the detection of differential expression using RNA-Seq. We found that the DESeq algorithm performs more conservatively than edgeR and NBPSeq. With regard to testing of various experimental designs, this work strongly suggests that greater power is gained through the use of biological replicates relative to library (technical) replicates and sequencing depth. Strikingly, sequencing depth could be reduced to as low as 15% without substantial impacts on false positive or true positive rates. PMID:22985019

Efficient experimental design and analysis strategies for the detection of differential expression using RNA-Sequencing.

PubMed

Robles, José A; Qureshi, Sumaira E; Stephen, Stuart J; Wilson, Susan R; Burden, Conrad J; Taylor, Jennifer M

2012-09-17

RNA sequencing (RNA-Seq) has emerged as a powerful approach for the detection of differential gene expression with both high-throughput and high resolution capabilities possible depending upon the experimental design chosen. Multiplex experimental designs are now readily available; these can be utilised to increase the numbers of samples or replicates profiled at the cost of decreased sequencing depth generated per sample. These strategies impact on the power of the approach to accurately identify differential expression. This study presents a detailed analysis of the power to detect differential expression in a range of scenarios including simulated null and differential expression distributions with varying numbers of biological or technical replicates, sequencing depths and analysis methods. Differential and non-differential expression datasets were simulated using a combination of negative binomial and exponential distributions derived from real RNA-Seq data. These datasets were used to evaluate the performance of three commonly used differential expression analysis algorithms and to quantify the changes in power with respect to true and false positive rates when simulating variations in sequencing depth, biological replication and multiplex experimental design choices. This work quantitatively explores comparisons between contemporary analysis tools and experimental design choices for the detection of differential expression using RNA-Seq. We found that the DESeq algorithm performs more conservatively than edgeR and NBPSeq. With regard to testing of various experimental designs, this work strongly suggests that greater power is gained through the use of biological replicates relative to library (technical) replicates and sequencing depth. Strikingly, sequencing depth could be reduced to as low as 15% without substantial impacts on false positive or true positive rates.
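The trade-off between replicates and sequencing depth described in this record can be illustrated with a toy simulation. The sketch below is not the authors' pipeline and does not run DESeq, edgeR or NBPSeq; it only draws negative binomial counts for one gene in two conditions and compares how noisy the estimated log2 fold change is under two designs. The gene means (100 and 200), the dispersion (0.1), the depth fractions and the function names are all illustrative assumptions.

```python
import numpy as np

def simulate_counts(rng, mean, dispersion, n_reps, depth=1.0):
    """Draw negative binomial counts with variance mu + dispersion * mu**2."""
    mu = mean * depth              # lower depth scales the expected count down
    size = 1.0 / dispersion        # NB "size" parameter
    p = size / (size + mu)         # success probability in numpy's parameterisation
    return rng.negative_binomial(size, p, size=n_reps)

def lfc_spread(n_reps, depth, n_sim=2000, seed=0):
    """Standard deviation of the estimated log2 fold change over repeated simulations."""
    rng = np.random.default_rng(seed)
    lfcs = np.empty(n_sim)
    for k in range(n_sim):
        a = simulate_counts(rng, 100.0, 0.1, n_reps, depth)  # hypothetical control gene
        b = simulate_counts(rng, 200.0, 0.1, n_reps, depth)  # hypothetical 2-fold change
        # A 0.5 pseudocount keeps the ratio finite if a group mean is zero.
        lfcs[k] = np.log2((b.mean() + 0.5) / (a.mean() + 0.5))
    return lfcs.std()

# Two designs with the same total sequencing effort (assumed values):
spread_more_depth = lfc_spread(n_reps=3, depth=1.0)  # 3 replicates at full depth
spread_more_reps = lfc_spread(n_reps=6, depth=0.5)   # 6 replicates at half depth
print(spread_more_depth, spread_more_reps)
```

Under these assumed parameters the biological dispersion term dominates the sampling noise, so doubling the replicates at half depth gives the tighter fold-change estimate, which is the same direction as the conclusion reported in the abstract.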
B-Bolivia, an Allele of the Maize b1 Gene with Variable Expression, Contains a High Copy Retrotransposon-Related Sequence Immediately Upstream

PubMed Central

Selinger, David A.; Chandler, Vicki L.

2001-01-01

The maize (Zea mays) b1 gene encodes a transcription factor that regulates the anthocyanin pigment pathway. Of the b1 alleles with distinct tissue-specific expression, B-Peru and B-Bolivia are the only alleles that confer seed pigmentation. B-Bolivia produces variable and weaker seed expression but darker, more regular plant expression relative to B-Peru. Our experiments demonstrated that B-Bolivia is not expressed in the seed when transmitted through the male. When transmitted through the female the proportion of kernels pigmented and the intensity of pigment varied. Molecular characterization of B-Bolivia demonstrated that it shares the first 530 bp of the upstream region with B-Peru, a region sufficient for seed expression. Immediately upstream of 530 bp, B-Bolivia is completely divergent from B-Peru. These sequences share sequence similarity to retrotransposons. Transient expression assays of various promoter constructs identified a 33-bp region in B-Bolivia that can account for the reduced aleurone pigment amounts (40%) observed with B-Bolivia relative to B-Peru. Transgenic plants carrying the B-Bolivia promoter proximal region produced pigmented seeds. Similar to native B-Bolivia, some transgene loci are variably expressed in seeds. In contrast to native B-Bolivia, the transgene loci are expressed in seeds when transmitted through both the male and female. Some transgenic lines produced pigment in vegetative tissues, but the tissue-specificity was different from B-Bolivia, suggesting the introduced sequences do not contain the B-Bolivia plant-specific regulatory sequences. We hypothesize that the chromatin context of the B-Bolivia allele controls its epigenetic seed expression properties, which could be influenced by the adjacent highly repeated retrotransposon sequence. PMID:11244116
Missing value imputation for gene expression data by tailored nearest neighbors.

PubMed

Faisal, Shahla; Tutz, Gerhard

2017-04-25

High dimensional data such as gene expression and RNA-sequence data often contain missing values. The subsequent analysis and results based on these incomplete data can suffer strongly from the presence of these missing values. Several approaches to imputation of missing values in gene expression data have been developed, but the task is difficult due to the high dimensionality (number of genes) of the data. Here an imputation procedure is proposed that uses weighted nearest neighbors. Instead of using nearest neighbors defined by a distance that includes all genes, the distance is computed for genes that are apt to contribute to the accuracy of imputed values. The method aims at avoiding the curse of dimensionality, which typically occurs if local methods such as nearest neighbors are applied in high dimensional settings. The proposed weighted nearest neighbors algorithm is compared to existing missing value imputation techniques like mean imputation, KNNimpute and the recently proposed imputation by random forests. We use RNA-sequence and microarray data from studies on human cancer to compare the performance of the methods. The results from simulations as well as real studies show that the weighted distance procedure can successfully handle missing values for high dimensional data structures where the number of predictors is larger than the number of samples. The method typically outperforms the considered competitors.
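The general idea in this record (compute neighbor distances only over genes likely to be informative, then average neighbors with weights) can be sketched roughly as follows. This is a simplified illustration and not the authors' algorithm: the gene selection by absolute correlation, the Gaussian-style kernel, the `n_genes` and `bandwidth` parameters and the function name `weighted_nn_impute` are all assumptions made for the example. It also assumes every gene has at least one observed value.

```python
import numpy as np

def weighted_nn_impute(X, n_genes=5, bandwidth=1.0):
    """Fill NaNs in X (samples x genes) with kernel-weighted averages of other samples."""
    X = np.asarray(X, dtype=float)
    missing = np.isnan(X)
    col_means = np.nanmean(X, axis=0)
    M = np.where(missing, col_means, X)      # mean-filled copy, used only for distances
    centered = M - col_means                 # missing entries contribute 0 here
    norms = np.sqrt((centered ** 2).sum(axis=0))
    norms[norms == 0] = 1.0
    # Approximate absolute gene-gene correlations.
    corr = np.abs(centered.T @ centered) / np.outer(norms, norms)

    filled = X.copy()
    for i, j in zip(*np.where(missing)):
        # Genes observed in sample i, ranked by correlation with the target gene j.
        genes = [g for g in np.argsort(-corr[j]) if g != j and not missing[i, g]][:n_genes]
        # Donor samples must have an observed value for gene j.
        donors = [s for s in range(X.shape[0]) if s != i and not missing[s, j]]
        if not genes or not donors:
            filled[i, j] = col_means[j]      # fall back to mean imputation
            continue
        diff = M[np.ix_(donors, genes)] - X[i, genes]
        d2 = (diff ** 2).mean(axis=1)        # squared distance over selected genes only
        w = np.exp(-d2 / bandwidth)          # closer donors receive larger weights
        if w.sum() == 0:
            w = np.ones_like(w)
        filled[i, j] = np.dot(w, X[donors, j]) / w.sum()
    return filled

# Toy data: gene 1 is exactly twice gene 0, gene 2 is unrelated noise.
demo = np.array([[1, 2, 5],
                 [2, 4, 1],
                 [3, 6, 4],
                 [4, 8, 2],
                 [2, np.nan, 3]], dtype=float)
result = weighted_nn_impute(demo, n_genes=1)
print(result[4, 1])
```

In the toy matrix the missing entry would be 4 if the pattern held, while plain mean imputation would give 5; restricting the distance to the correlated gene pulls the estimate toward the former.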
A polyvalent hybrid protein elicits antibodies against the diverse allelic types of block 2 in Plasmodium falciparum merozoite surface protein 1.

PubMed

Tetteh, Kevin K A; Conway, David J

2011-10-13

Merozoite surface protein 1 (MSP1) of Plasmodium falciparum has been implicated as an important target of acquired immunity, and candidate components for a vaccine include polymorphic epitopes in the N-terminal polymorphic block 2 region. We designed a polyvalent hybrid recombinant protein incorporating sequences of the three major allelic types of block 2 together with a composite repeat sequence of one of the types and N-terminal flanking T cell epitopes, and compared this with a series of recombinant proteins containing modular sub-components and similarly expressed in Escherichia coli. Immunogenicity of the full polyvalent hybrid protein was tested in both mice and rabbits, and comparative immunogenicity studies of the sub-component modules were performed in mice. The full hybrid protein induced high titre antibodies against each of the major block 2 allelic types expressed as separate recombinant proteins and against a wide range of allelic types naturally expressed by a panel of diverse P. falciparum isolates, while the sub-component modules had partial antigenic coverage as expected. This encourages further development and evaluation of the full MSP1 block 2 polyvalent hybrid protein as a candidate blood-stage component of a malaria vaccine. Copyright © 2011 Elsevier Ltd. All rights reserved.
Streptococcal Adhesin P (SadP) contributes to Streptococcus suis adhesion to the human intestinal epithelium.

PubMed

Ferrando, Maria Laura; Willemse, Niels; Zaccaria, Edoardo; Pannekoek, Yvonne; van der Ende, Arie; Schultsz, Constance

2017-01-01

Streptococcus suis is a zoonotic pathogen, causing meningitis and septicemia. We previously demonstrated that the gastrointestinal tract (GIT) is an entry site for zoonotic S. suis infection. Here we studied the contribution of Streptococcal adhesin Protein (SadP) to host-pathogen interaction at GIT level. SadP expression in presence of Intestinal Epithelial Cells (IEC) was compared with expression of other virulence factors by measuring transcript levels using quantitative Real Time PCR (qRT-PCR). SadP variants were identified by phylogenetic analysis of complete DNA sequences. The interaction of SadP knockout and complementation mutants with IEC was tested in vitro. Expression of sadP was significantly increased in presence of IEC. Sequence analysis of 116 invasive strains revealed five SadP sequence variants, correlating with genotype. SadP1, present in zoonotic isolates of clonal complex 1, contributed to binding to both human and porcine IEC and translocation across human IEC. Antibodies against the globotriaosylceramide Gb3/CD77 receptor significantly inhibited adhesion to human IEC. SadP is involved in the host-pathogen interaction in the GIT. Differences between SadP variants may determine different affinities to the Gb3/CD77 host-receptor, contributing to variation in adhesion capacity to host IEC and thus to S. suis zoonotic potential.
Increasing transcriptome response of serpins during the ontogenetic stages in the salmon louse Caligus rogercresseyi (Copepoda: Caligidae).

PubMed

Maldonado-Aguayo, W; Gallardo-Escárate, C

2014-06-01

Serine protease inhibitors, or serpins, target serine proteases, and are important regulators of intra- and extracellular proteolysis. For parasite survival, parasite-derived protease inhibitors have been suggested to play essential roles in evading the host's immune system and protecting against exogenous host proteases. The aim of this work was to identify serpins via high throughput transcriptome sequencing and elucidate their potential functions during the lifecycle of the salmon louse Caligus rogercresseyi. Eleven putative, partial serpin sequences in the C. rogercresseyi transcriptome were identified and denoted as Cr-serpins 1 to 11. Comparative analysis of the deduced serpin-like amino acid sequences revealed a highly conserved reactive center loop region. Interestingly, P1 residues suggest putative functions involved with the trypsin/subtilisin, elastase, or subtilisin inhibitors, which evidenced increasing gene expression profiles from the copepodid to adult stage in C. rogercresseyi. Concerning this, Cr-serpin 10 was mainly expressed in the copepodid stage, while Cr-serpins 3, 4, 5, and 11 were mostly expressed in chalimus and adult stages. These results suggest that serpins could be involved in evading the immune response of the host fish. The identification of these serpins furthers the understanding of the immune system in this important ectoparasite species. Copyright © 2014 Elsevier B.V. All rights reserved.
cDNA nucleotide sequence coding for stearoyl-CoA desaturase and its expression in the zebrafish (Danio rerio) embryo.

PubMed

Hsieh, S L; Liu, R W; Wu, C H; Cheng, W T; Kuo, Ching-Ming

2003-12-01

A cDNA sequence of stearoyl-CoA desaturase (SCD) was determined from zebrafish (Danio rerio) and compared to the corresponding genes in several teleosts. Zebrafish SCD cDNA has a size of 1,061 bp, encodes a polypeptide of 325 amino acids, and shares 88, 85, 84, and 83% similarities with tilapia (Oreochromis mossambicus), grass carp (Ctenopharyngodon idella), common carp (Cyprinus carpio), and milkfish (Chanos chanos), respectively. This 1,061 bp sequence specifies a protein that, in common with other fatty acid desaturases, contains three histidine boxes, believed to be involved in catalysis. These observations suggested that SCD genes are highly conserved. In addition, an oligonucleotide probe complementary to zebrafish SCD mRNA was hybridized to mRNA of approximately 396 bases with Northern blot analysis. The Northern blot and RT-PCR analyses showed that the SCD mRNA was expressed predominantly in the liver, intestine, gill, and muscle, while a lower level was found in the brain. Furthermore, we utilized whole-mount in situ hybridization and real-time quantitative RT-PCR to identify expression of the zebrafish SCD gene at five different stages of development. This revealed that very high levels of transcripts were found in zebrafish at all stages during embryogenesis and early development. Copyright 2003 Wiley-Liss, Inc.
Generation of a total of 6483 expressed sequence tags from 60 day-old bovine whole fetus and fetal placenta.

PubMed

Oishi, M; Gohma, H; Lejukole, H Y; Taniguchi, Y; Yamada, T; Suzuki, K; Shinkai, H; Uenishi, H; Yasue, H; Sasaki, Y

2004-05-01

Expressed sequence tags (ESTs) generated based on characterization of clones isolated randomly from cDNA libraries are used to study gene expression profiles in specific tissues and to provide useful information for characterizing tissue physiology. In this study, two directionally cloned cDNA libraries were constructed from 60 day-old bovine whole fetus and fetal placenta. We have characterized 5357 and 1126 clones, and then identified 3464 and 795 unique sequences for the fetus and placenta cDNA libraries: 1851 and 504 showed homology to already identified genes, and 1613 and 291 showed no significant matches to any of the sequences in DNA databases, respectively. Further, we found 94 unique sequences overlapping in both the fetus and the placenta, leading to a catalog of 4165 genes expressed in 60 day-old fetus and placenta. The catalog is used to examine expression profile of genes in 60 day-old bovine fetus and placenta.
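The counts reported in this record fit together arithmetically, which a short check makes explicit. The snippet below only restates numbers from the abstract; the variable names are illustrative.

```python
# Counts taken from the abstract above.
fetus = {"clones": 5357, "unique": 3464, "known": 1851, "novel": 1613}
placenta = {"clones": 1126, "unique": 795, "known": 504, "novel": 291}
shared_unique = 94  # unique sequences found in both libraries

# Total ESTs named in the title.
total_clones = fetus["clones"] + placenta["clones"]

# Within each library, unique sequences split into known and novel.
fetus_split_ok = fetus["known"] + fetus["novel"] == fetus["unique"]
placenta_split_ok = placenta["known"] + placenta["novel"] == placenta["unique"]

# Sequences shared between libraries are counted once in the combined catalog.
catalog_size = fetus["unique"] + placenta["unique"] - shared_unique

print(total_clones, fetus_split_ok, placenta_split_ok, catalog_size)
```

The combined catalog size follows from inclusion-exclusion: the two sets of unique sequences are added and the overlap is subtracted once.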
Generation of Mast Cells from Mouse Fetus: Analysis of Differentiation and Functionality, and Transcriptome Profiling Using Next Generation Sequencer

PubMed Central

Fukuishi, Nobuyuki; Igawa, Yuusuke; Kunimi, Tomoyo; Hamano, Hirofumi; Toyota, Masao; Takahashi, Hironobu; Kenmoku, Hiromichi; Yagi, Yasuyuki; Matsui, Nobuaki; Akagi, Masaaki

2013-01-01

While gene knockout technology can reveal the roles of proteins in cellular functions, including in mast cells, fetal death due to gene manipulation frequently interrupts experimental analysis. We generated mast cells from mouse fetal liver (FLMC), and compared the fundamental functions of FLMC with those of bone marrow-derived mouse mast cells (BMMC). Under electron microscopy, numerous small and electron-dense granules were observed in FLMC. In FLMC, the expression levels of a subunit of the FcεRI receptor and degranulation by IgE cross-linking were comparable with BMMC. By flow cytometry we observed surface expression of c-Kit prior to that of FcεRI on FLMC, although on BMMC the expression of c-Kit came after FcεRI. The surface expression levels of Sca-1 and c-Kit, a marker of putative mast cell precursors, were slightly different between bone marrow cells and fetal liver cells, suggesting that differentiation stage or cell type are not necessarily equivalent between both lineages. Moreover, this indicates that phenotypically similar mast cells may not have undergone an identical process of differentiation. By comprehensive analysis using the next generation sequencer, the same frequency of gene expression was observed for 98.6% of all transcripts in both cell types. These results indicate that FLMC could represent a new and useful tool for exploring mast cell differentiation, and may help to elucidate the roles of individual proteins in the function of mast cells where gene manipulation can induce embryonic lethality in the mid to late stages of pregnancy. PMID:23573287
Ram locus is a key regulator to trigger multidrug resistance in Enterobacter aerogenes.

PubMed

Molitor, Alexander; James, Chloë E; Fanning, Séamus; Pagès, Jean-Marie; Davin-Regli, Anne

2018-02-01

Several genetic regulators belonging to the AraC family are involved in the emergence of MDR isolates of E. aerogenes due to alterations in membrane permeability. Compared with the genetic regulator Mar, RamA may be more relevant towards the emergence of antibiotic resistance. Focusing on the global regulators, Mar and Ram, we compared the amino acid sequences of the Ram repressor in 59 clinical isolates and laboratory strains of E. aerogenes. Sequence types were associated with their corresponding multi-drug resistance phenotypes and membrane protein expression profiles using MIC and immunoblot assays. Quantitative gene expression analysis of the different regulators and their targets (porins and efflux pump components) was performed. In the majority of the MDR isolates tested, ramR and a region upstream of ramA were mutated but marR or marA were unchanged. Expression and cloning experiments highlighted the involvement of the ram locus in the modification of membrane permeability. Overexpression of RamA led to decreased porin production and increased expression of efflux pump components, whereas overexpression of RamR had the opposite effects. Mutations or deletions in ramR, leading to the overexpression of RamA, predominated in clinical MDR E. aerogenes isolates and were associated with a higher level of expression of efflux pump components. It was hypothesised that mutations in ramR, and the self-regulating region proximal to ramA, probably altered the binding properties of the RamR repressor, thereby producing the MDR phenotype. Consequently, mutability of RamR may play a key role in predisposing E. aerogenes towards the emergence of an MDR phenotype.
A baculovirus (Bombyx mori nuclear polyhedrosis virus) repeat element functions as a powerful constitutive enhancer in transfected insect cells.

PubMed

Lu, M; Farrell, P J; Johnson, R; Iatrou, K

1997-12-05

It has been previously reported that baculovirus homologous regions, the regions of baculovirus genomes that contain the origins of DNA replication, can augment the expression of a small number of baculovirus genes in vitro. We are now reporting that a region of the genome of Bombyx mori nuclear polyhedrosis virus (BmNPV) containing the homologous region 3 (HR3) acts as an enhancer for the promoter of a nonviral gene, the cytoplasmic actin gene of the silkmoth B. mori. Incorporation of the HR3 sequences of BmNPV into an actin promoter-based expression cassette results in an augmentation of transgene expression in transfected cells by two orders of magnitude relative to the control recombinant expression cassette. This increase is due to a corresponding increase in the rate of transcription from the actin promoter and not to replication of the expression cassette and occurs only when the HR3 element is linked to the expression cassette in cis. A comparable degree of enhancement in the activity of the silkworm actin promoter occurs also in heterologous lepidopteran cells. Concomitant supplementation of transfected cells with the BmIE1 trans-activator, which was previously shown to be capable of functioning in vitro as a transcriptional co-activator of the cytoplasmic actin gene promoter, results in more than a 1,000-fold increase in the level of expression of recombinant proteins placed under the control of the actin gene promoter. These findings provide the foundation for the development of a nonlytic insect cell expression system for continuous high-level expression of recombinant proteins. 
Such a system should provide levels of expression of recombinant proteins comparable to those obtained from baculovirus expression systems. It should also have the additional advantages of continuous production in a cellular environment that, in contrast to that generated by a baculovirus infection, continuously supports proper posttranslational modification of recombinant proteins, and of the capability to express proteins from genomic as well as cDNA sequences.
[Molecular cloning and characterization of a novel Clonorchis sinensis antigenic protein containing tandem repeat sequences].

PubMed

Liu, Qian; Xu, Xue-Nian; Zhou, Yan; Cheng, Na; Dong, Yu-Ting; Zheng, Hua-Jun; Zhu, Yong-Qiang; Zhu, Yong-Qiang

2013-08-01

To find and clone new antigen genes from the lambda-ZAP cDNA expression library of adult Clonorchis sinensis, and to determine the immunological characteristics of the recombinant proteins. The cDNA expression library of adult C. sinensis was screened with pooled sera of clonorchiasis patients. The sequences of the positive phage clones were compared with the sequences in the EST database, and the full-length sequence of the gene (Cs22 gene) was obtained by RT-PCR. cDNA fragments containing two and three copies of the tandem repeat sequence were generated by jumping PCR. The sequences encoding the mature peptide and the tandem repeats were each cloned into the prokaryotic expression vector pET28a (+), and then transformed into E. coli Rosetta DE3 cells for expression. The recombinant proteins (rCs22-2r, rCs22-3r, rCs22M-2r, and rCs22M-3r) were purified by His-bind-resin (Ni-NTA) affinity chromatography. The immunogenicity of rCs22-2r and rCs22-3r was assessed by ELISA. To evaluate the immunological diagnostic value of rCs22-2r and rCs22-3r, serum samples from 35 clonorchiasis patients, 31 healthy individuals, 15 schistosomiasis patients, 15 paragonimiasis westermani patients and 13 cysticercosis patients were examined by ELISA. To locate antigenic determinants, the pooled sera of clonorchiasis patients and healthy persons were analyzed for specific antibodies by ELISA with the recombinant proteins rCs22M-2r and rCs22M-3r containing the tandem repeat sequences. The full-length sequence of the Cs22 antigen gene of C. sinensis was obtained. It contained 13 tandem repeats of the sequence EQQDGDEEGMGGDGGRGKEKGKVEGEDGAGEQKEQA. Bioinformatics analysis indicated that the protein (Cs22) belongs to the GPI-anchored protein family. The recombinant proteins rCs22-2r and rCs22-3r showed a certain level of immunogenicity. 
The positive rate by ELISA with plates coated with purified rCs22-2r and rCs22-3r was 45.7% (16/35) in both cases for sera of clonorchiasis patients, and 3.2% (1/31) for sera of healthy persons. There was no cross reaction with sera of schistosomiasis and cysticercosis patients. The cross reaction with sera of paragonimiasis westermani patients was 1/15. The recombinant proteins rCs22M-2r and rCs22M-3r, which contained only tandem repeats, were specifically recognized by pooled sera of clonorchiasis patients. The Cs22 antigen gene of Clonorchis sinensis was obtained, and the recombinant proteins have some diagnostic value. The antigenic determinant is located in the tandem repeat sequences.
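The percentages quoted in this abstract follow directly from the reported counts. As a minimal sketch for checking the arithmetic (the helper name `positive_rate` is an assumption introduced here for illustration and is not part of the study):

```python
def positive_rate(positives: int, total: int) -> float:
    """Return the positive rate as a percentage, rounded to one decimal place."""
    if total <= 0:
        raise ValueError("total must be a positive count")
    return round(100 * positives / total, 1)


# Counts taken from the abstract; the variable names are illustrative only.
clonorchiasis_rate = positive_rate(16, 35)   # sera of clonorchiasis patients
healthy_rate = positive_rate(1, 31)          # sera of healthy persons
paragonimiasis_rate = positive_rate(1, 15)   # cross reaction, paragonimiasis westermani

print(clonorchiasis_rate, healthy_rate, paragonimiasis_rate)
```

The third value is not given as a percentage in the abstract; it is derived here only from the reported 1/15.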
Comparisons between Arabidopsis thaliana and Drosophila melanogaster in relation to Coding and Noncoding Sequence Length and Gene Expression

PubMed Central

Caldwell, Rachel; Lin, Yan-Xia; Zhang, Ren

2015-01-01

There is continuing interest in the analysis of gene architecture and gene expression to determine what relationship may exist between them. Advances in high-quality sequencing technologies and large-scale resource datasets have improved the understanding of these relationships and the cross-referencing of expression data to large genome datasets. Although a negative correlation between expression level and gene (especially transcript) length has been generally accepted, there have been some conflicting results in the literature concerning the impacts of different regions of genes, and the underlying reason is not well understood. This research applies quantile regression techniques to the statistical analysis of coding and noncoding sequence length and gene expression data in the plant Arabidopsis thaliana and the fruit fly Drosophila melanogaster, to determine whether a relationship exists and whether there are any variations or similarities between these species. The quantile regression analysis found that the correlations between coding sequence length and gene expression varied, and that similarities emerged for the noncoding sequence length (5′ and 3′ UTRs) between the animal and plant species. In conclusion, the information described in this study provides the basis for further exploration into gene regulation with regard to coding and noncoding sequence length. PMID:26114098
Using microarrays to identify positional candidate genes for QTL: the case study of ACTH response in pigs.

PubMed

Jouffe, Vincent; Rowe, Suzanne; Liaubet, Laurence; Buitenhuis, Bart; Hornshøj, Henrik; SanCristobal, Magali; Mormède, Pierre; de Koning, D J

2009-07-16

Microarray studies can supplement QTL studies by suggesting potential candidate genes in the QTL regions, which by themselves are too large to provide a limited selection of candidate genes. Here we provide a case study where we explore ways to integrate QTL data and microarray data for the pig, which has only a partial genome sequence. We outline various procedures to localize differentially expressed genes on the pig genome and link this with information on published QTL. The starting point is a set of 237 differentially expressed cDNA clones in adrenal tissue from two pig breeds, before and after treatment with adrenocorticotropic hormone (ACTH). Different approaches to localize the differentially expressed (DE) genes to the pig genome showed different levels of success and a clear lack of concordance for some genes between the various approaches. For a focused analysis on 12 genes, overlapping QTL from the public domain were presented. Also, differentially expressed genes underlying QTL for ACTH response were described. Using the latest version of the draft sequence, the differentially expressed genes were mapped to the pig genome. This enabled co-location of DE genes and previously studied QTL regions, but the draft genome sequence is still incomplete and will contain many errors. A further step to explore links between DE genes and QTL at the pathway level was largely unsuccessful due to the lack of annotation of the pig genome. This could be improved by further comparative mapping analyses but this would be time consuming. This paper provides a case study for the integration of QTL data and microarray data for a species with limited genome sequence information and annotation. The results illustrate the challenges that must be addressed but also provide a roadmap for future work that is applicable to other non-model species.
First comparative characterization of three distinct ferritin subunits from a teleost: Evidence for immune-responsive mRNA expression and iron depriving activity of seahorse (Hippocampus abdominalis) ferritins.

PubMed

Oh, Minyoung; Umasuthan, Navaneethaiyer; Elvitigala, Don Anushka Sandaruwan; Wan, Qiang; Jo, Eunyoung; Ko, Jiyeon; Noh, Gyeong Eon; Shin, Sangok; Rho, Sum; Lee, Jehee

2016-02-01

Ferritins play an indispensable role in iron homeostasis through their iron-withholding function in living beings. In the current study, cDNA sequences of three distinct ferritin subunits, including a ferritin H, a ferritin M, and a ferritin L, were identified from the big belly seahorse, Hippocampus abdominalis, and molecularly characterized. Complete coding sequences (CDS) of seahorse ferritin H (HaFerH), ferritin M (HaFerM), and ferritin L (HaFerL) subunits comprised 531, 528, and 522 base pairs (bp), respectively, which encode polypeptides of 177, 176, and 174 amino acids, respectively, with molecular masses of ∼20-21 kDa. Our in silico analyses demonstrate that these three ferritin subunits exhibit the typical characteristics of ferritin superfamily members including iron regulatory elements, domain signatures, and reactive centers. The coding sequences of HaFerH, M, and L were cloned and the corresponding proteins were overexpressed in a bacterial system. Recombinantly expressed HaFer proteins demonstrated detectable in vivo iron sequestrating (ferroxidase) activity, consistent with their putative iron binding capability. Quantification of the basal expression of these three HaFer sequences in selected tissues demonstrated a gene-specific ubiquitous spatial distribution pattern, with abundant HaFerM mRNA in the liver and predominant expression of HaFerH and HaFerL in blood. Interestingly, the basal expression of all three ferritin genes was found to be significantly modulated in response to pathogenic stress induced by lipopolysaccharides (LPS), poly I:C, Streptococcus iniae, and Edwardsiella tarda. Collectively, our findings suggest that the three HaFer subunits may be involved in iron (II) homeostasis in the big belly seahorse and that they are important in its host defense mechanisms. Copyright © 2016 Elsevier Ltd. All rights reserved.
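The CDS and polypeptide lengths quoted in this abstract are related by the three-nucleotide codon. The sketch below checks that relationship for the reported figures. It assumes that the quoted lengths map one codon to one residue; the abstract does not state whether a stop codon is counted, and the function name `codon_count` is introduced here purely for illustration.

```python
# CDS length (bp) and polypeptide length (amino acids) as quoted in the abstract.
REPORTED = {
    "HaFerH": (531, 177),
    "HaFerM": (528, 176),
    "HaFerL": (522, 174),
}


def codon_count(cds_bp: int) -> int:
    """Return the number of complete codons in a coding sequence of cds_bp bases."""
    if cds_bp % 3 != 0:
        raise ValueError("a complete CDS should be a multiple of 3 bases long")
    return cds_bp // 3


for name, (cds_bp, residues) in REPORTED.items():
    # Assumption: one codon per reported residue (stop-codon handling is not stated).
    consistent = codon_count(cds_bp) == residues
    print(name, cds_bp, codon_count(cds_bp), residues, consistent)
```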
ISOL@: an Italian SOLAnaceae genomics resource.

PubMed

Chiusano, Maria Luisa; D'Agostino, Nunzio; Traini, Alessandra; Licciardello, Concetta; Raimondo, Enrico; Aversano, Mario; Frusciante, Luigi; Monti, Luigi

2008-03-26

Present-day '-omics' technologies produce overwhelming amounts of data which include genome sequences, information on gene expression (transcripts and proteins) and on cell metabolic status. These data represent multiple aspects of a biological system and need to be investigated as a whole to shed light on the mechanisms which underpin the system functionality. The gathering and convergence of data generated by high-throughput technologies, the effective integration of different data-sources and the analysis of the information content based on comparative approaches are key methods for meaningful biological interpretations. In the frame of the International Solanaceae Genome Project, we propose here ISOL@, an Italian SOLAnaceae genomics resource. ISOL@ (available at http://biosrv.cab.unina.it/isola) represents a trial platform and it is conceived as a multi-level computational environment. ISOL@ currently consists of two main levels: the genome and the expression level. The cornerstone of the genome level is represented by the Solanum lycopersicum genome draft sequences generated by the International Tomato Genome Sequencing Consortium. Instead, the basic element of the expression level is the transcriptome information from different Solanaceae species, mainly in the form of species-specific comprehensive collections of Expressed Sequence Tags (ESTs). The cross-talk between the genome and the expression levels is based on data source sharing and on tools that enhance data quality, that extract information content from the levels' under parts and produce value-added biological knowledge. ISOL@ is the result of a bioinformatics effort that addresses the challenges of the post-genomics era. It is designed to exploit '-omics' data based on effective integration to acquire biological knowledge and to approach a systems biology view. 
Beyond providing experimental biologists with a preliminary annotation of the tomato genome, this effort aims to produce a trial computational environment where different aspects and details are maintained as they are relevant for the analysis of the organization, the functionality and the evolution of the Solanaceae family.
The Human EST Ontology Explorer: a tissue-oriented visualization system for ontologies distribution in human EST collections.

PubMed

Merelli, Ivan; Caprera, Andrea; Stella, Alessandra; Del Corvo, Marcello; Milanesi, Luciano; Lazzari, Barbara

2009-10-15

The NCBI dbEST currently contains more than eight million human Expressed Sequence Tags (ESTs). This wide collection represents an important source of information for gene expression studies, provided it can be inspected according to biologically relevant criteria. EST data can be browsed using different dedicated web resources, which allow users to investigate library-specific gene expression levels and to make comparisons among libraries, highlighting significant differences in gene expression. Nonetheless, no tool is available to examine distributions of quantitative EST collections in Gene Ontology (GO) categories, nor to retrieve information concerning library-dependent EST involvement in metabolic pathways. In this work we present the Human EST Ontology Explorer (HEOE) http://www.itb.cnr.it/ptp/human_est_explorer, a web facility for comparison of expression levels among libraries from several healthy and diseased tissues. The HEOE provides library-dependent statistics on the distribution of sequences in the GO Directed Acyclic Graph (DAG) that can be browsed at each GO hierarchical level. The tool is based on large-scale BLAST annotation of EST sequences. Due to the huge number of input sequences, this BLAST analysis was performed with the aid of grid computing technology, which is particularly suitable for data-parallel tasks. Relying on the achieved annotation, library-specific distributions of ESTs in the GO Graph were inferred. A pathway-based search interface was also implemented, for a quick evaluation of the representation of libraries in metabolic pathways. EST processing steps were integrated in a semi-automatic procedure that relies on Perl scripts and stores results in a MySQL database. A PHP-based web interface offers the possibility to simultaneously visualize, retrieve and compare data from the different libraries. Statistically significant differences in GO categories among user-selected libraries can also be computed. 
The HEOE provides an alternative and complementary way to inspect EST expression levels with respect to approaches currently offered by other resources. Furthermore, BLAST computation on the whole human EST dataset was a suitable test of grid scalability in the context of large-scale bioinformatics analysis. The HEOE currently comprises sequence analysis from 70 non-normalized libraries, representing a comprehensive overview of healthy and unhealthy tissues. As the analysis procedure can be easily applied to other libraries, the number of represented tissues is intended to increase.
Comparative Transcriptome Analysis of Latex Reveals Molecular Mechanisms Underlying Increased Rubber Yield in Hevea brasiliensis Self-Rooting Juvenile Clones

PubMed Central

Li, Hui-Liang; Guo, Dong; Zhu, Jia-Hong; Wang, Ying; Chen, Xiong-Ting; Peng, Shi-Qing

2016-01-01

Rubber tree (Hevea brasiliensis) self-rooting juvenile clones (JCs) are promising planting materials for rubber production. In a comparative trial between self-rooting JCs and donor clones (DCs), self-rooting JCs exhibited better rubber yield. To study the molecular mechanism associated with the higher rubber yield of self-rooting JCs, we sequenced and comparatively analyzed the latex of rubber tree self-rooting JCs and DCs at the transcriptome level. Total raw reads of 34,632,012 and 35,913,020 bp were obtained from the libraries of self-rooting JCs and DCs, respectively, using Illumina HiSeq 2000 sequencing technology. De novo assembly yielded 54,689 unigenes from the libraries of self-rooting JCs and DCs. Among these 54,689 unigenes, 1,716 were identified as differentially expressed between self-rooting JCs and DCs via comparative transcript profiling. Functional analysis showed that genes in a large number of categories were differentially enriched between the two clones. Several genes involved in carbohydrate metabolism, hormone metabolism and reactive oxygen species scavenging were up-regulated in self-rooting JCs, suggesting that the self-rooting JCs provide a sufficient molecular basis for the increased rubber yield, especially with respect to improved latex metabolism and latex flow. Some genes encoding epigenetic modification enzymes were also differentially expressed between self-rooting JCs and DCs. Epigenetic modifications may lead to differential gene expression between self-rooting JCs and DCs. These data provide new clues for understanding the molecular mechanism underlying the improved rubber yield of H. brasiliensis self-rooting clones. PMID:27555864
Transcriptome Analysis of the Emerald Ash Borer (EAB), Agrilus planipennis: De Novo Assembly, Functional Annotation and Comparative Analysis.

PubMed

Duan, Jun; Ladd, Tim; Doucet, Daniel; Cusson, Michel; vanFrankenhuyzen, Kees; Mittapalli, Omprakash; Krell, Peter J; Quan, Guoxing

2015-01-01

The Emerald ash borer (EAB), Agrilus planipennis, is an invasive phloem-feeding insect pest of ash trees. Since its initial discovery in the Detroit, US-Windsor, Canada area in 2002, the spread of EAB has had strong negative economic, social and environmental impacts in both countries. Several transcriptomes from specific tissues including midgut, fat body and antenna have recently been generated. However, the relatively low sequence depth, gene coverage and completeness have limited the usefulness of these EAB databases. High-throughput deep RNA-Sequencing (RNA-Seq) was used to obtain 473.9 million pairs of 100-bp paired-end reads from various life stages and tissues. These reads were assembled into 88,907 contigs using the Trinity strategy and integrated into 38,160 unigenes after redundant sequences were removed. We annotated 11,229 unigenes by searching against the public nr, Swiss-Prot and COG databases. The EAB transcriptome assembly was compared with 13 other sequenced insect species, resulting in the prediction of 536 unigenes that are Coleoptera-specific. Differential gene expression analysis revealed that 290 unigenes are differentially expressed during larval molting and 3,911 unigenes during metamorphosis from larvae to pupae (FDR < 0.01 and log2 FC > 2). In addition, 1,167 differentially expressed unigenes were identified from larval and adult midguts: 435 unigenes were up-regulated in larval midgut and 732 in adult midgut. Most of the genes involved in RNA interference (RNAi) pathways were identified, which implies the existence of systemic RNAi in EAB. This study provides one of the most fundamental and comprehensive transcriptome resources available for EAB to date. Identification of the tissue-, stage- or species-specific unigenes will benefit the further study of gene functions during growth and metamorphosis processes in EAB and other pest insects.
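The differential expression thresholds quoted in this abstract (FDR below 0.01 and a log2 fold change above 2) amount to a simple two-condition filter. The sketch below illustrates that filter on invented records; the `Unigene` type, the unigene names and all values are hypothetical and do not come from the study. Applying the cutoff to the absolute fold change, so that both up- and down-regulated unigenes are kept, is also an assumption, since the abstract does not specify this.

```python
from typing import NamedTuple


class Unigene(NamedTuple):
    """Hypothetical record of one unigene's differential expression statistics."""
    name: str
    log2_fc: float  # log2 fold change between two conditions
    fdr: float      # false discovery rate (adjusted p-value)


def is_differential(u: Unigene, fdr_cutoff: float = 0.01,
                    log2_fc_cutoff: float = 2.0) -> bool:
    """Apply the thresholds from the abstract; abs() is an assumption made here."""
    return u.fdr < fdr_cutoff and abs(u.log2_fc) > log2_fc_cutoff


# Invented example data, for illustration only.
toy_records = [
    Unigene("unigene_a", 3.1, 0.001),    # passes both thresholds
    Unigene("unigene_b", -2.6, 0.004),   # passes (down-regulated)
    Unigene("unigene_c", 4.0, 0.2),      # fails the FDR threshold
    Unigene("unigene_d", 1.2, 0.0001),   # fails the fold-change threshold
]

selected = [u.name for u in toy_records if is_differential(u)]
print(selected)
```

Dropping the `abs()` call restricts the filter to up-regulated unigenes only, which is the other possible reading of the stated threshold.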
