de Jong, Simone; Boks, Marco P. M.; Fuller, Tova F.; Strengman, Eric; Janson, Esther; de Kovel, Carolien G. F.; Ori, Anil P. S.; Vi, Nancy; Mulder, Flip; Blom, Jan Dirk; Glenthøj, Birte; Schubart, Chris D.; Cahn, Wiepke; Kahn, René S.; Horvath, Steve; Ophoff, Roel A.
2012-01-01
Despite large-scale genome-wide association studies (GWAS), the underlying genes for schizophrenia are largely unknown. Additional approaches are therefore required to identify the genetic background of this disorder. Here we report findings from a large gene expression study in peripheral blood of schizophrenia patients and controls. We applied a systems biology approach to genome-wide expression data from whole blood of 92 medicated and 29 antipsychotic-free schizophrenia patients and 118 healthy controls. We show that gene expression profiling in whole blood can identify twelve large gene co-expression modules associated with schizophrenia. Several of these disease-related modules are likely to reflect expression changes due to antipsychotic medication. However, two of the disease modules could be replicated in an independent second data set involving antipsychotic-free patients and controls. One of these robustly defined disease modules is significantly enriched with brain-expressed genes and with genetic variants that were implicated in a GWAS, which could imply a causal role in schizophrenia etiology. The most highly connected intramodular hub gene in this module (ABCF1) is located in, and regulated by, the major histocompatibility complex (MHC), which is intriguing in light of the fact that common allelic variants from the MHC region have been implicated in schizophrenia. This suggests that the MHC increases schizophrenia susceptibility via altered gene expression of regulatory genes in this network. PMID:22761806
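The idea of a "most highly connected intramodular hub gene" can be illustrated with a small sketch. The abstract does not spell out its definition, so the sketch assumes a weighted co-expression one: adjacency between two genes is the absolute Pearson correlation raised to a soft-threshold power, and a gene's intramodular connectivity is the sum of its adjacencies to the other genes in the module. The power `beta=6`, the gene names, and the expression values are illustrative assumptions, not values from the study.

```python
from math import sqrt

def pearson(x, y):
    """Pearson correlation of two equal-length expression profiles."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    vx = sum((a - mx) ** 2 for a in x)
    vy = sum((b - my) ** 2 for b in y)
    return cov / sqrt(vx * vy)

def intramodular_connectivity(profiles, beta=6):
    """Sum of soft-thresholded |correlation| from each gene to the others.

    profiles: dict mapping gene -> list of expression values (one module).
    beta: soft-threshold power; 6 is an illustrative assumption.
    """
    genes = list(profiles)
    k = {g: 0.0 for g in genes}
    for i, g in enumerate(genes):
        for h in genes[i + 1:]:
            a = abs(pearson(profiles[g], profiles[h])) ** beta
            k[g] += a
            k[h] += a
    return k

# Hypothetical toy module: gene names and values are made up.
module = {
    "geneA": [1.0, 2.0, 3.0, 4.0, 5.0],
    "geneB": [1.1, 2.1, 2.9, 4.2, 5.0],
    "geneC": [2.0, 4.1, 6.0, 7.9, 10.0],
    "geneD": [5.0, 1.0, 4.0, 2.0, 3.0],
}
connectivity = intramodular_connectivity(module)
hub = max(connectivity, key=connectivity.get)
print(hub, round(connectivity[hub], 3))
```

In this toy module the three tightly correlated genes compete for the hub position, while `geneD`, which is only weakly correlated with the rest, ends up with near-zero connectivity.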
A Review of Feature Extraction Software for Microarray Gene Expression Data
Tan, Ching Siang; Ting, Wai Soon; Mohamad, Mohd Saberi; Chan, Weng Howe; Deris, Safaai; Ali Shah, Zuraini
2014-01-01
When gene expression data are too large to be processed, they are transformed into a reduced representation set of genes. Transforming large-scale gene expression data into a set of genes is called feature extraction. If the genes extracted are carefully chosen, this gene set can extract the relevant information from the large-scale gene expression data, allowing further analysis by using this reduced representation instead of the full size data. In this paper, we review numerous software applications that can be used for feature extraction. The software reviewed is mainly for Principal Component Analysis (PCA), Independent Component Analysis (ICA), Partial Least Squares (PLS), and Local Linear Embedding (LLE). A summary and sources of the software are provided in the last section for each feature extraction method. PMID:25250315
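As a rough illustration of what feature extraction by PCA does to an expression table, the sketch below finds the leading principal axis by power iteration and projects each sample onto it. It is a minimal, dependency-free sketch and not taken from any of the reviewed packages; the function name `first_principal_component`, the iteration count, and the toy data are assumptions made for the example.

```python
import random
from math import sqrt

def first_principal_component(rows, iterations=200, seed=0):
    """Leading principal axis of a samples x genes table via power iteration.

    rows: list of samples, each a list of gene expression values.
    Returns (axis, scores): a unit vector over genes and one score per sample.
    """
    n, p = len(rows), len(rows[0])
    means = [sum(r[j] for r in rows) / n for j in range(p)]
    centered = [[r[j] - means[j] for j in range(p)] for r in rows]

    rng = random.Random(seed)
    axis = [rng.gauss(0, 1) for _ in range(p)]
    for _ in range(iterations):
        # Multiply by X^T X without forming the genes x genes matrix.
        proj = [sum(c[j] * axis[j] for j in range(p)) for c in centered]
        axis = [sum(centered[i][j] * proj[i] for i in range(n))
                for j in range(p)]
        norm = sqrt(sum(a * a for a in axis))
        axis = [a / norm for a in axis]
    scores = [sum(c[j] * axis[j] for j in range(p)) for c in centered]
    return axis, scores

# Hypothetical toy data: 4 samples x 3 genes; genes 0 and 1 vary together.
toy = [
    [1.0, 2.0, 0.5],
    [2.0, 4.1, 0.4],
    [3.0, 6.0, 0.6],
    [4.0, 7.9, 0.5],
]
axis, scores = first_principal_component(toy)
print([round(a, 3) for a in axis], [round(s, 3) for s in scores])
```

The three gene columns are replaced by a single score per sample, which is the "reduced representation" the abstract refers to; real analyses would keep several components and use an established library.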
Sequential and combinatorial roles of maf family genes define proper lens development.
Reza, Hasan Mahmud; Urano, Atsuyo; Shimada, Naoko; Yasuda, Kunio
2007-01-16
Maf proteins have been shown to play pivotal roles in lens development in vertebrates. The developing chick lens expresses at least three large Maf proteins. However, the transcriptional relationship among the three large maf genes and their various roles in transactivating the downstream genes largely remain to be elucidated. Chick embryos were electroporated with wild-type L-maf, c-maf, and mafB by in ovo electroporation, and their effects on gene expression were determined by in situ hybridization using specific probes or by immunostaining. Endogenous gene expression was determined using nonelectroporated samples. A regulatory mechanism exists among the members of the maf gene family. An early-expressed member of this gene family typically stimulates the expression of later-expressed members. We also examined the regulation of various lens-expressing genes with a focus on the interaction between different Maf proteins. We found that the transcriptional ability of Maf proteins varies, even when the target is the same, in parallel with their discrete functions. L-Maf and c-Maf have no effect on E-cadherin expression, whereas MafB enhances its expression and thereby impedes lens vesicle formation. This study also revealed that Maf proteins can regulate the expression of gap junction genes, connexins, and their interacting partner, major intrinsic protein (MIP), during lens development. Misexpression of L-Maf and c-Maf induces ectopic expression of Cx43 and MIP; in contrast, MafB appears to have no effect on Cx43 but induces MIP significantly, as evidenced by our gain-of-function experiments. Our results indicate that large Maf function is indispensable for chick lens initiation and development. In addition, L-Maf positively regulates most of the essential genes in this program and directs a series of molecular events leading to proper formation of the lens.
Rethinking cell-cycle-dependent gene expression in Schizosaccharomyces pombe.
Cooper, Stephen
2017-11-01
Three studies of gene expression during the division cycle of Schizosaccharomyces pombe led to the proposal that a large number of genes are expressed at particular times during the S. pombe cell cycle. Yet only a small fraction of genes proposed to be expressed in a cell-cycle-dependent manner are reproducible in all three published studies. In addition to reproducibility problems, questions about expression amplitudes, cell-cycle timing of expression, synchronization artifacts, and the problem with methods for synchronizing cells must be considered. These problems and complications prompt the idea that caution should be used before accepting the conclusion that there are a large number of genes expressed in a cell-cycle-dependent manner in S. pombe.
Lv, Xiaoyang; Sun, Wei; Yin, Jinfeng; Ni, Rong; Su, Rui; Wang, Qingzeng; Gao, Wen; Bao, Jianjun; Yu, Jiarui; Wang, Lihong; Chen, Ling
2016-01-01
Wave patterns in lambskin hair follicles are an important factor determining the quality of sheep’s wool. Hair follicles in lambskin from Hu sheep, a breed unique to China, have 3 types of waves, designated as large, medium, and small. The quality of wool from small wave follicles is excellent, while the quality of large waves is considered poor. Because no molecular and biological studies on hair follicles of these sheep have been conducted to date, the molecular mechanisms underlying the formation of different wave patterns are currently unknown. The aim of this article was to screen the candidate microRNAs (miRNA) and genes for the development of hair follicles in Hu sheep. Two-day-old Hu lambs were selected from full-sib individuals that showed large, medium, and small waves. Integrated analysis of microRNA and mRNA expression profiles employed high-throughput sequencing technology. Approximately 13, 24, and 18 differentially expressed miRNAs were found between small and large waves, small and medium waves, and medium and large waves, respectively. A total of 54, 190, and 81 differentially expressed genes were found between small and large waves, small and medium waves, and medium and large waves, respectively, by RNA sequencing (RNA-seq) analysis. Differentially expressed genes were classified using gene ontology and pathway analyses. They were found to be mainly involved in cell differentiation, proliferation, apoptosis, growth, immune response, and ion transport, and were associated with MAPK and the Notch signaling pathway. Reverse transcription-polymerase chain reaction (RT-PCR) analyses of differentially expressed miRNA and genes were consistent with sequencing results. Integrated analysis of miRNA and mRNA expression indicated that, compared to small waves, large waves included 4 downregulated miRNAs that had regulatory effects on 8 upregulated genes and 3 upregulated miRNAs, which in turn influenced 13 downregulated genes. Compared to small waves, medium waves included 13 downregulated miRNAs that had regulatory effects on 64 upregulated genes and 4 upregulated miRNAs, which in turn had regulatory effects on 22 downregulated genes. Compared to medium waves, large waves consisted of 13 upregulated miRNAs that had regulatory effects on 48 downregulated genes. These differentially expressed miRNAs and genes may play a significant role in forming different patterns, and provide evidence for the molecular mechanisms underlying the formation of hair follicles of varying patterns. PMID:27404636
Tanaka, F; Wada, H; Fukui, Y; Fukushima, M
2011-08-01
Previous small studies showed lower thymidylate synthase (TS) expression in adenocarcinoma of the lung, which may explain the higher antitumor activity of TS-inhibiting agents such as pemetrexed. To quantitatively measure TS gene expression in a large-scale Japanese population (n = 2621) with primary lung cancer, laser-captured microdissected sections were cut from primary tumors, surrounding normal lung tissues and involved nodes. The TS gene expression level in primary tumor was significantly higher than that in normal lung tissue (mean TS/β-actin, 3.4 and 1.0, respectively; P < 0.01), and it was higher still in involved nodes (mean TS/β-actin, 7.7; P < 0.01). Analyses of TS gene expression levels in primary tumor according to histologic cell type revealed that small-cell carcinoma showed the highest TS expression (mean TS/β-actin, 13.8) and that squamous cell carcinoma showed higher TS expression than adenocarcinoma (mean TS/β-actin, 4.3 and 2.3, respectively; P < 0.01); TS gene expression increased significantly as the grade of tumor cell differentiation decreased. There was no significant difference in TS gene expression according to any other patient characteristics, including tumor progression. Lower TS expression in adenocarcinoma of the lung was confirmed in a large-scale study.
Integrative approaches for large-scale transcriptome-wide association studies
Gusev, Alexander; Ko, Arthur; Shi, Huwenbo; Bhatia, Gaurav; Chung, Wonil; Penninx, Brenda W J H; Jansen, Rick; de Geus, Eco JC; Boomsma, Dorret I; Wright, Fred A; Sullivan, Patrick F; Nikkola, Elina; Alvarez, Marcus; Civelek, Mete; Lusis, Aldons J.; Lehtimäki, Terho; Raitoharju, Emma; Kähönen, Mika; Seppälä, Ilkka; Raitakari, Olli T.; Kuusisto, Johanna; Laakso, Markku; Price, Alkes L.; Pajukanta, Päivi; Pasaniuc, Bogdan
2016-01-01
Many genetic variants influence complex traits by modulating gene expression, thus altering the abundance levels of one or multiple proteins. Here, we introduce a powerful strategy that integrates gene expression measurements with summary association statistics from large-scale genome-wide association studies (GWAS) to identify genes whose cis-regulated expression is associated to complex traits. We leverage expression imputation to perform a transcriptome wide association scan (TWAS) to identify significant expression-trait associations. We applied our approaches to expression data from blood and adipose tissue measured in ~3,000 individuals overall. We imputed gene expression into GWAS data from over 900,000 phenotype measurements to identify 69 novel genes significantly associated to obesity-related traits (BMI, lipids, and height). Many of the novel genes are associated with relevant phenotypes in the Hybrid Mouse Diversity Panel. Our results showcase the power of integrating genotype, gene expression and phenotype to gain insights into the genetic basis of complex traits. PMID:26854917
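The step of imputing expression "into" GWAS summary statistics can be sketched as a weighted combination of per-SNP z-scores, scaled by the variance implied by linkage disequilibrium (LD). The form used below, Z = w·z / sqrt(wᵀ Σ w), is the one commonly given for summary-statistic expression-trait tests; treating it as the exact statistic of this study is an assumption, and the weights, z-scores, and LD values are made up for illustration.

```python
from math import sqrt

def twas_z(weights, gwas_z, ld):
    """Z-score for association between imputed expression and a trait.

    weights: per-SNP expression prediction weights (from a reference panel).
    gwas_z:  per-SNP GWAS z-scores for the trait.
    ld:      SNP-by-SNP correlation (LD) matrix as nested lists.
    """
    m = len(weights)
    numerator = sum(w * z for w, z in zip(weights, gwas_z))
    variance = sum(weights[i] * ld[i][j] * weights[j]
                   for i in range(m) for j in range(m))
    return numerator / sqrt(variance)

# Hypothetical locus with three SNPs; every number here is made up.
weights = [0.5, 0.3, 0.0]
gwas_z = [3.0, 2.5, 0.2]
ld = [
    [1.0, 0.6, 0.1],
    [0.6, 1.0, 0.2],
    [0.1, 0.2, 1.0],
]
print(round(twas_z(weights, gwas_z, ld), 3))
```

A SNP with zero weight contributes nothing, and with a single SNP the statistic reduces to that SNP's own GWAS z-score, which is a useful sanity check on the scaling.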
Differential gene expression patterns between smokers and non‐smokers: cause or consequence?
Jansen, Rick; Brooks, Andy; Willemsen, Gonneke; van Grootheest, Gerard; de Geus, Eco; Smit, Jan H.; Penninx, Brenda W.; Boomsma, Dorret I.
2015-01-01
The molecular mechanisms causing smoking‐induced health decline are largely unknown. To elucidate the molecular pathways involved in cause and consequences of smoking behavior, we conducted a genome‐wide gene expression study in peripheral blood samples targeting 18 238 genes. Data of 743 smokers, 1686 never smokers and 890 ex‐smokers were available from two population‐based cohorts from the Netherlands. In addition, data of 56 monozygotic twin pairs discordant for ever smoking were used. One hundred thirty‐two genes were differentially expressed between current smokers and never smokers (P < 1.2 × 10^−6, Bonferroni correction). The most significant genes were G protein‐coupled receptor 15 (P < 1 × 10^−150) and leucine‐rich repeat neuronal 3 (P < 1 × 10^−44). The smoking‐related genes were enriched for immune system, blood coagulation, natural killer cell and cancer pathways. By taking the data of ex‐smokers into account, expression of these 132 genes was classified into reversible (94 genes), slowly reversible (31 genes), irreversible (6 genes) or inconclusive (1 gene). Expression of 6 of the 132 genes (three reversible and three slowly reversible) was confirmed to be reactive to smoking as they were differentially expressed in monozygotic pairs discordant for smoking. Cis‐expression quantitative trait loci for GPR56 and RARRES3 (downregulated in smokers) were associated with increased number of cigarettes smoked per day in a large genome‐wide association meta‐analysis, suggesting a causative effect of GPR56 and RARRES3 expression on smoking behavior. In conclusion, differential gene expression patterns in smokers are extensive and cluster in several underlying disease pathways. Gene expression differences seem mainly direct consequences of smoking, and largely reversible after smoking cessation. However, we also identified DNA variants that may influence smoking behavior with gene expression as a mediator. PMID:26594007
Dryselius, Rikard; Izutsu, Kaori; Honda, Takeshi; Iida, Tetsuya
2008-01-01
Background: Replication of bacterial chromosomes increases copy numbers of genes located near origins of replication relative to genes located near termini. Such differential gene dosage depends on replication rate, doubling time and chromosome size. Although little explored, differential gene dosage may influence both gene expression and location. For vibrios, a diverse family of fast growing gammaproteobacteria, gene dosage may be particularly important as they harbor two chromosomes of different size. Results: Here we examined replication dynamics and gene dosage effects for the separate chromosomes of three Vibrio species. We also investigated locations for specific gene types within the genome. The results showed consistently larger gene dosage differences for the large chromosome which also initiated replication long before the small. Accordingly, large chromosome gene expression levels were generally higher and showed an influence from gene dosage. This was reflected by a higher abundance of growth essential and growth contributing genes of which many locate near the origin of replication. In contrast, small chromosome gene expression levels were low and appeared independent of gene dosage. Also, species-specific genes are highly abundant and an over-representation of genes involved in transcription could explain its gene-dosage-independent expression. Conclusion: Here we establish a link between replication dynamics and differential gene dosage on one hand and gene expression levels and the location of specific gene types on the other. For vibrios, this relationship appears connected to a polarisation of genetic content between its chromosomes, which may both contribute to and be enhanced by an improved adaptive capacity. PMID:19032792
NASA Technical Reports Server (NTRS)
Mjolsness, Eric; Castano, Rebecca; Mann, Tobias; Wold, Barbara
2000-01-01
We provide preliminary evidence that existing algorithms for inferring small-scale gene regulation networks from gene expression data can be adapted to large-scale gene expression data coming from hybridization microarrays. The essential steps are (1) clustering many genes by their expression time-course data into a minimal set of clusters of co-expressed genes, (2) theoretically modeling the various conditions under which the time-courses are measured using a continuous-time analog recurrent neural network for the cluster mean time-courses, (3) fitting such a regulatory model to the cluster mean time courses by simulated annealing with weight decay, and (4) analysing several such fits for commonalities in the circuit parameter sets including the connection matrices. This procedure can be used to assess the adequacy of existing and future gene expression time-course data sets for determining transcriptional regulatory relationships such as coregulation.
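Steps (2) and (3) of the procedure above can be sketched on a toy problem. The sketch assumes a particular continuous-time recurrent network form, tau * dv/dt = -v + sigmoid(T v + h), integrated with Euler steps, and a basic simulated-annealing loop with a linear cooling schedule and an L2 weight-decay penalty on the connection matrix T. The model form, schedule, step sizes, and the two-cluster "true" circuit are all illustrative assumptions and are not taken from the report.

```python
import math
import random

def simulate(T, h, v0, steps, dt=0.1, tau=1.0):
    """Euler-integrate tau * dv/dt = -v + sigmoid(T v + h)."""
    v, path = list(v0), [list(v0)]
    for _ in range(steps):
        u = [sum(T[i][j] * v[j] for j in range(len(v))) + h[i]
             for i in range(len(v))]
        v = [v[i] + dt / tau * (-v[i] + 1 / (1 + math.exp(-u[i])))
             for i in range(len(v))]
        path.append(list(v))
    return path

def cost(params, target, n, decay):
    """Squared error to the target time courses plus a weight-decay penalty."""
    T = [params[i * n:(i + 1) * n] for i in range(n)]
    h = params[n * n:]
    path = simulate(T, h, target[0], len(target) - 1)
    err = sum((a - b) ** 2 for p, t in zip(path, target) for a, b in zip(p, t))
    return err + decay * sum(w * w for w in params[:n * n])

def anneal(target, n, iters=2000, decay=0.01, seed=0):
    """Simulated annealing over the connection matrix T and biases h."""
    rng = random.Random(seed)
    current = [0.0] * (n * n + n)
    cur_cost = cost(current, target, n, decay)
    best, best_cost = list(current), cur_cost
    for k in range(iters):
        temp = 1.0 * (1 - k / iters) + 1e-6      # linear cooling schedule
        cand = list(current)
        cand[rng.randrange(len(cand))] += rng.gauss(0, 0.3)
        c = cost(cand, target, n, decay)
        # Always accept improvements; accept worse moves with Boltzmann odds.
        if c < cur_cost or rng.random() < math.exp((cur_cost - c) / temp):
            current, cur_cost = cand, c
            if c < best_cost:
                best, best_cost = list(cand), c
    return best, best_cost

# Hypothetical target: cluster-mean time courses from a known 2-cluster circuit.
true_T, true_h = [[0.0, -2.0], [2.0, 0.0]], [0.5, -0.5]
target = simulate(true_T, true_h, [0.2, 0.8], steps=30)
start_cost = cost([0.0] * 6, target, 2, 0.01)
fit, fit_cost = anneal(target, 2)
print(round(start_cost, 4), round(fit_cost, 4))
```

Running `anneal` with several seeds and comparing the signs of the fitted connection entries would correspond loosely to step (4), looking for commonalities across fits.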
DNA-Demethylase Regulated Genes Show Methylation-Independent Spatiotemporal Expression Patterns
Schumann, Ulrike; Lee, Joanne; Kazan, Kemal; Ayliffe, Michael; Wang, Ming-Bo
2017-01-01
Recent research has indicated that a subset of defense-related genes is downregulated in the Arabidopsis DNA demethylase triple mutant rdd (ros1 dml2 dml3) resulting in increased susceptibility to the fungal pathogen Fusarium oxysporum. In rdd plants these downregulated genes contain hypermethylated transposable element sequences (TE) in their promoters, suggesting that this methylation represses gene expression in the mutant and that these sequences are actively demethylated in wild-type plants to maintain gene expression. In this study, the tissue-specific and pathogen-inducible expression patterns of rdd-downregulated genes were investigated and the individual role of ROS1, DML2, and DML3 demethylases in these spatiotemporal regulation patterns was determined. Large differences in defense gene expression were observed between pathogen-infected and uninfected tissues and between root and shoot tissues in both WT and rdd plants; however, only subtle changes in promoter TE methylation patterns occurred. Therefore, while TE hypermethylation caused decreased gene expression in rdd plants, it did not dramatically affect spatiotemporal gene regulation, suggesting that this latter regulation is largely methylation independent. Analysis of ros1-3, dml2-1, and dml3-1 single gene mutant lines showed that promoter TE hypermethylation and defense-related gene repression was predominantly, but not exclusively, due to loss of ROS1 activity. These data demonstrate that DNA demethylation of TE sequences, largely by ROS1, promotes defense-related gene expression but does not control spatiotemporal expression in Arabidopsis. Summary: Ros1-mediated DNA demethylation of promoter transposable elements is essential for activation of defense-related gene expression in response to fungal infection in Arabidopsis thaliana. PMID:28894455
Kahlau, Sabine; Bock, Ralph
2008-01-01
Plastid genes are expressed at high levels in photosynthetically active chloroplasts but are generally believed to be drastically downregulated in nongreen plastids. The genome-wide changes in the expression patterns of plastid genes during the development of nongreen plastid types as well as the contributions of transcriptional versus translational regulation are largely unknown. We report here a systematic transcriptomics and translatomics analysis of the tomato (Solanum lycopersicum) plastid genome during fruit development and chloroplast-to-chromoplast conversion. At the level of RNA accumulation, most but not all plastid genes are strongly downregulated in fruits compared with leaves. By contrast, chloroplast-to-chromoplast differentiation during fruit ripening is surprisingly not accompanied by large changes in plastid RNA accumulation. However, most plastid genes are translationally downregulated during chromoplast development. Both transcriptional and translational downregulation are more pronounced for photosynthesis-related genes than for genes involved in gene expression, indicating that some low-level plastid gene expression must be sustained in chromoplasts. High-level expression during chromoplast development identifies accD, the only plastid-encoded gene involved in fatty acid biosynthesis, as the target gene for which gene expression activity in chromoplasts is maintained. In addition, we have determined the developmental patterns of plastid RNA polymerase activities, intron splicing, and RNA editing and report specific developmental changes in the splicing and editing patterns of plastid transcripts. PMID:18441214
USDA-ARS's Scientific Manuscript database
The amount of microarray gene expression data in public repositories has been increasing exponentially for the last couple of decades. High-throughput microarray data integration and analysis has become a critical step in exploring the large amount of expression data for biological discovery. Howeve...
Altobelli, Gioia; Bogdarina, Irina G; Stupka, Elia; Clark, Adrian J L; Langley-Evans, Simon
2013-01-01
A large body of evidence from human and animal studies demonstrates that the maternal diet during pregnancy can programme physiological and metabolic functions in the developing fetus, effectively determining susceptibility to later disease. The mechanistic basis of such programming is unclear but may involve resetting of epigenetic marks and fetal gene expression. The aim of this study was to evaluate genome-wide DNA methylation and gene expression in the livers of newborn rats exposed to maternal protein restriction. On day one postnatally, there were 618 differentially expressed genes and 1183 differentially methylated regions (FDR 5%). The functional analysis of differentially expressed genes indicated a significant effect on DNA repair/cycle/maintenance functions and of lipid, amino acid metabolism and circadian functions. Enrichment for known biological functions was found to be associated with differentially methylated regions. Moreover, these epigenetically altered regions overlapped genetic loci associated with metabolic and cardiovascular diseases. Both expression changes and DNA methylation changes were largely reversed by supplementing the protein restricted diet with folic acid. Although the epigenetic and gene expression signatures appeared to underpin largely different biological processes, the gene expression profile of DNA methyl transferases was altered, providing a potential link between the two molecular signatures. The data showed that maternal protein restriction is associated with widespread differential gene expression and DNA methylation across the genome, and that folic acid is able to reset both molecular signatures.
NASA Technical Reports Server (NTRS)
Nebenfuhr, A.; Lomax, T. L.
1998-01-01
We have developed an improved method for determination of gene expression levels with RT-PCR. The procedure is rapid and does not require extensive optimization or densitometric analysis. Since the detection of individual transcripts is PCR-based, small amounts of tissue samples are sufficient for the analysis of expression patterns in large gene families. Using this method, we were able to rapidly screen nine members of the Aux/IAA family of auxin-responsive genes and identify those genes which vary in message abundance in a tissue- and light-specific manner. While not offering the accuracy of conventional semi-quantitative or competitive RT-PCR, our method allows quick screening of large numbers of genes in a wide range of RNA samples with just a thermal cycler and standard gel analysis equipment.
Gene Expression: Sizing it all up
USDA-ARS's Scientific Manuscript database
Genomic architecture appears to be a largely unexplored component of gene expression. Although surely not the end of the story, we are learning that when it comes to gene expression, size is important. We have been surprised to find that certain patterns of expression, tissue-specific versus constit...
Chao, Tianle; Wang, Guizhi; Ji, Zhibin; Liu, Zhaohua; Hou, Lei; Wang, Jin; Wang, Jianmin
2017-07-13
The large intestine, also known as the hindgut, is an important part of the animal digestive system. Recent studies on digestive system development in ruminants have focused on the rumen and the small intestine, but the molecular mechanisms underlying sheep large intestine metabolism remain poorly understood. To identify genes related to intestinal metabolism and to reveal molecular regulation mechanisms, we sequenced and compared the transcriptomes of mucosal epithelial tissues among the cecum, proximal colon and duodenum. A total of 4,221 transcripts from 3,254 genes were identified as differentially expressed transcripts. Between the large intestine and duodenum, differentially expressed transcripts were found to be significantly enriched in 6 metabolism-related pathways, among which PPAR signaling was identified as a key pathway. Three genes, CPT1A, LPL and PCK1, were identified as higher expression hub genes in the large intestine. Between the cecum and colon, differentially expressed transcripts were significantly enriched in 5 lipid metabolism related pathways, and CEPT1 and MBOAT1 were identified as hub genes. This study provides important information regarding the molecular mechanisms of intestinal metabolism in sheep and may provide a basis for further study.
Shahdoust, Maryam; Hajizadeh, Ebrahim; Mozdarani, Hossein; Chehrei, Ali
2013-01-01
Cigarette smoking is the major risk factor for development of lung cancer. Identification of effects of tobacco on airway gene expression may provide insight into the causes. This research aimed to compare gene expression of large airway epithelium cells in normal smokers (n=13) and non-smokers (n=9) in order to find genes which discriminate the two groups and assess cigarette smoking effects on large airway epithelium cells. Genes discriminating smokers from non-smokers were identified by applying a neural network clustering method, growing self-organizing maps (GSOM), to microarray data according to class discrimination scores. An index was computed based on differentiation between each mean of gene expression in the two groups. This clustering approach provided the possibility of comparing thousands of genes simultaneously. The applied approach compared the mean of 7,129 genes in smokers and non-smokers simultaneously and classified the genes of large airway epithelium cells that were differentially expressed in smokers compared with non-smokers. Seven genes were identified with the largest expression differences between smokers and non-smokers: NQO1, H19, ALDH3A1, AKR1C1, ABHD2, GPX2 and ADH7. Most (NQO1, ALDH3A1, AKR1C1, H19 and GPX2) are known to be clinically notable in lung cancer studies. Furthermore, statistical discriminant analysis showed that these genes could classify samples as smokers or non-smokers correctly with 100% accuracy. In the resulting GSOM map, other nodes with high average discrimination scores included genes with alterations strongly related to lung cancer, such as AKR1C3, CYP1B1, UCHL1 and AKR1B10. By comparing the expression of thousands of genes at once, this clustering revealed alterations in normal smokers. Most of the identified genes were strongly relevant to lung cancer in the existing literature. The genes may be utilized to identify smokers with increased risk for lung cancer. A large sample study is now recommended to determine relations between the genes ABHD2 and ADH7 and smoking.
Hiss, Manuel; Laule, Oliver; Meskauskiene, Rasa M; Arif, Muhammad A; Decker, Eva L; Erxleben, Anika; Frank, Wolfgang; Hanke, Sebastian T; Lang, Daniel; Martin, Anja; Neu, Christina; Reski, Ralf; Richardt, Sandra; Schallenberg-Rüdinger, Mareike; Szövényi, Peter; Tiko, Theodhor; Wiedemann, Gertrud; Wolf, Luise; Zimmermann, Philip; Rensing, Stefan A
2014-08-01
The moss Physcomitrella patens is an important model organism for studying plant evolution, development, physiology and biotechnology. Here we have generated microarray gene expression data covering the principal developmental stages, culture forms and some environmental/stress conditions. Example analyses of developmental stages and growth conditions as well as abiotic stress treatments demonstrate that (i) growth stage is dominant over culture conditions, (ii) liquid culture is not stressful for the plant, (iii) low pH might aid protoplastation by reduced expression of cell wall structure genes, (iv) largely the same gene pool mediates response to dehydration and rehydration, and (v) AP2/EREBP transcription factors play important roles in stress response reactions. With regard to the AP2 gene family, phylogenetic analysis and comparison with Arabidopsis thaliana shows commonalities as well as uniquely expressed family members under drought, light perturbations and protoplastation. Gene expression profiles for P. patens are available for the scientific community via the easy-to-use tool at https://www.genevestigator.com. By providing large-scale expression profiles, the usability of this model organism is further enhanced, for example by enabling selection of control genes for quantitative real-time PCR. Now, gene expression levels across a broad range of conditions can be accessed online for P. patens. © 2014 The Authors The Plant Journal © 2014 John Wiley & Sons Ltd.
Soybean kinome: functional classification and gene expression patterns
Liu, Jinyi; Chen, Nana; Grant, Joshua N.; Cheng, Zong-Ming (Max); Stewart, C. Neal; Hewezi, Tarek
2015-01-01
The protein kinase (PK) gene family is one of the largest and most highly conserved gene families in plants and plays a role in nearly all biological functions. While a large number of genes have been predicted to encode PKs in soybean, a comprehensive functional classification and global analysis of expression patterns of this large gene family is lacking. In this study, we identified the entire soybean PK repertoire or kinome, which comprised 2166 putative PK genes, representing 4.67% of all soybean protein-coding genes. The soybean kinome was classified into 19 groups, 81 families, and 122 subfamilies. The receptor-like kinase (RLK) group was remarkably large, containing 1418 genes. Collinearity analysis indicated that whole-genome segmental duplication events may have played a key role in the expansion of the soybean kinome, whereas tandem duplications might have contributed to the expansion of specific subfamilies. Gene structure, subcellular localization prediction, and gene expression patterns indicated extensive functional divergence of PK subfamilies. Global gene expression analysis of soybean PK subfamilies revealed tissue- and stress-specific expression patterns, implying regulatory functions over a wide range of developmental and physiological processes. In addition, tissue and stress co-expression network analysis uncovered specific subfamilies with narrow or wide interconnected relationships, indicative of their association with particular or broad signalling pathways, respectively. Taken together, our analyses provide a foundation for further functional studies to reveal the biological and molecular functions of PKs in soybean. PMID:25614662
Salinas, Yasmmyn D.; Shi, YiJun; Greenwood, Michael; Hoe, See Ziau; Murphy, David; Gainer, Harold
2015-01-01
Magnocellular neurons (MCNs) in the hypothalamo-neurohypophysial system (HNS) are highly specialized to release large amounts of arginine vasopressin (Avp) or oxytocin (Oxt) into the blood stream and play critical roles in the regulation of body fluid homeostasis. The MCNs are osmosensory neurons and are excited by exposure to hypertonic solutions and inhibited by hypotonic solutions. The MCNs respond to systemic hypertonic and hypotonic stimulation with large changes in the expression of their Avp and Oxt genes, and microarray studies have shown that these osmotic perturbations also cause large changes in global gene expression in the HNS. In this paper, we examine gene expression in the rat supraoptic nucleus (SON) under normosmotic and chronic salt-loading (SL) conditions, for the first time using “new-generation” RNA sequencing (RNA-Seq) methods. We reliably detect 9,709 genes as present in the SON by RNA-Seq, and 552 of these genes were changed in expression as a result of chronic SL. These genes reflect diverse functions, and 42 of these are involved in either transcriptional or translational processes. In addition, we compare the SON transcriptomes resolved by RNA-Seq methods with the SON transcriptomes determined by Affymetrix microarray methods in rats under the same osmotic conditions, and find that there are 6,466 genes present in the SON that are represented in both data sets, although 1,040 of the expressed genes were found only in the microarray data, and 2,762 of the expressed genes are selectively found in the RNA-Seq data and not the microarray data. These data provide the research community with a comprehensive view of the transcriptome in the SON under normosmotic conditions and the changes in specific gene expression evoked by salt loading. PMID:25897513
Sequeira, Ana Filipa; Brás, Joana L A; Guerreiro, Catarina I P D; Vincentelli, Renaud; Fontes, Carlos M G A
2016-12-01
Gene synthesis is becoming an important tool in many fields of recombinant DNA technology, including recombinant protein production. De novo gene synthesis is quickly replacing the classical cloning and mutagenesis procedures and allows generating nucleic acids for which no template is available. In addition, when coupled with efficient gene design algorithms that optimize codon usage, it leads to high levels of recombinant protein expression. Here, we describe the development of an optimized gene synthesis platform that was applied to the large scale production of small genes encoding venom peptides. This improved gene synthesis method uses a PCR-based protocol to assemble synthetic DNA from pools of overlapping oligonucleotides and was developed to synthesise multiple genes simultaneously. This technology incorporates an accurate, automated and cost-effective ligation-independent cloning step to directly integrate the synthetic genes into an effective Escherichia coli expression vector. The robustness of this technology to generate large libraries of dozens to thousands of synthetic nucleic acids was demonstrated through the parallel and simultaneous synthesis of 96 genes encoding animal toxins. An automated platform was developed for the large-scale synthesis of small genes encoding eukaryotic toxins. Large scale recombinant expression of synthetic genes encoding eukaryotic toxins will allow exploring the extraordinary potency and pharmacological diversity of animal venoms, an increasingly valuable but unexplored source of lead molecules for drug discovery.
Differential gene expression in human abdominal aortic aneurysm and aortic occlusive disease
Moran, Corey S.; Schreurs, Charlotte; Lindeman, Jan H. N.; Walker, Philip J.; Nataatmadja, Maria; West, Malcolm; Holdt, Lesca M.; Hinterseher, Irene; Pilarsky, Christian; Golledge, Jonathan
2015-01-01
Abdominal aortic aneurysm (AAA) and aortic occlusive disease (AOD) represent common causes of morbidity and mortality in elderly populations which were previously believed to have common aetiologies. The aim of this study was to assess the gene expression in human AAA and AOD. We performed microarrays using aortic specimens obtained from 20 patients with small AAAs (≤ 55mm), 29 patients with large AAAs (> 55mm), 9 AOD patients, and 10 control aortic specimens obtained from organ donors. Some differentially expressed genes were validated by quantitative-PCR (qRT-PCR)/immunohistochemistry. We identified 840 and 1,014 differentially expressed genes in small and large AAAs, respectively. Immune-related pathways including cytokine-cytokine receptor interaction and T-cell-receptor signalling were upregulated in both small and large AAAs. Examples of validated genes included CTLA4 (2.01-fold upregulated in small AAA, P = 0.002), NKTR (2.37- and 2.66-fold upregulated in small and large AAA with P = 0.041 and P = 0.015, respectively), and CD8A (2.57-fold upregulated in large AAA, P = 0.004). 1,765 differentially expressed genes were identified in AOD. Pathways upregulated in AOD included metabolic and oxidative phosphorylation categories. The UCP2 gene was downregulated in AOD (3.73-fold downregulated, validated P = 0.017). In conclusion, the AAA and AOD transcriptomes were very different, suggesting that AAA and AOD have distinct pathogenic mechanisms. PMID:25944698
2014-01-01
Background Growth in fishes is regulated via many environmental and physiological factors and is shaped by the genetic background of each individual. Previous microarray studies of salmonid growth have examined fish experiencing either muscle wastage or accelerated growth patterns following refeeding, or the influence of growth hormone and transgenesis. This study determines the gene expression profiles of genetically unmanipulated large and small fish from a domesticated salmonid strain reared on a typical feeding regime. Gene expression profiles of white muscle and liver from rainbow trout (Oncorhynchus mykiss) from two seasonal spawning groups (September and December lots) within a single strain were examined when the fish were 15 months of age to assess the influence of season (late fall vs. onset of spring) and body size (large vs. small). Results Although IGFBP1 gene expression was up-regulated in the livers of small fish in both seasonal lots, few expression differences were detected in the liver overall. Faster growing Dec. fish showed a greater number of differences in white muscle expression compared to Sept. fish. Significant differences in the GO Generic Level 3 categories ‘response to external stimulus’, ‘establishment of localization’, and ‘response to stress’ were detected in white muscle tissue between large and small fish. Larger fish showed up-regulation of cytoskeletal component genes while many genes related to myofibril components of muscle tissue were up-regulated in small fish. Most of the genes up-regulated in large fish within the ‘response to stress’ category are involved in immunity while in small fish most of these gene functions are related to apoptosis. Conclusions A higher proportion of genes in white muscle compared to liver showed similar patterns of up- or down-regulation within the same size class across seasons, supporting their utility as biomarkers for growth in rainbow trout. Differences between large and small Sept. fish in the ‘response to stress’ and ‘response to external stimulus’ categories for white muscle tissue suggest that smaller fish are less able to handle stress than large fish. Sampling season had a significant impact on the expression of genes related to the growth process in rainbow trout. PMID:24450799
Comparative modular analysis of gene expression in vertebrate organs.
Piasecka, Barbara; Kutalik, Zoltán; Roux, Julien; Bergmann, Sven; Robinson-Rechavi, Marc
2012-03-29
The degree of conservation of gene expression between homologous organs largely remains an open question. Several recent studies reported some evidence in favor of such conservation. Most studies compute organs' similarity across all orthologous genes, whereas the expression levels of many genes are not informative about organ specificity. Here, we use a modularization algorithm to overcome this limitation through the identification of inter-species co-modules of organs and genes. We identify such co-modules using mouse and human microarray expression data. They are functionally coherent both in terms of genes and of organs from both organisms. We show that a large proportion of genes belonging to the same co-module are orthologous between mouse and human. Moreover, their zebrafish orthologs also tend to be expressed in the corresponding homologous organs. Notable exceptions to the general pattern of conservation are the testis and the olfactory bulb. Interestingly, some co-modules consist of single organs, while others combine several functionally related organs. For instance, amygdala, cerebral cortex, hypothalamus and spinal cord form a clearly discernible unit of expression, both in mouse and human. Our study provides a new framework for comparative analysis which will be applicable also to other sets of large-scale phenotypic data collected across different species.
Computational approaches were developed to identify factors that regulate Nrf2 in a large gene expression compendium of microarray profiles including >2000 comparisons which queried the effects of chemicals, genes, diets, and infectious agents on gene expression in the mouse l...
Llopart, Ana
2012-12-01
The X chromosome has a large effect on hybrid dysfunction, particularly on hybrid male sterility. Although the evidence for this so-called large-X effect is clear, its molecular causes are not yet fully understood. One possibility is that, under certain conditions, evolution proceeds faster in X-linked than in autosomal loci (i.e., faster-X effect) due to both natural selection and their hemizygosity in males, an effect that is expected to be greatest in genes with male-biased expression. Here, I study genome-wide variation in transcript abundance between Drosophila yakuba and D. santomea, within these species and in their hybrid males to evaluate both the faster-X and large-X effects at the level of expression. I find that in X-linked male-biased genes (MBGs) expression evolves faster than in their autosomal counterparts, an effect that is accompanied by a unique reduction in expression polymorphism. This suggests that Darwinian selection is driving expression differences between species, likely enhanced by the hemizygosity of the X chromosome in males. Despite the recent split of the two sister species under study, abundant changes in both cis- and trans-regulatory elements underlie expression divergence in the majority of the genes analyzed, with significant differences in allelic ratios of transcript abundance between the two reciprocal F(1) hybrid males. Cis-trans coevolution at molecular level, evolved shortly after populations become isolated, may therefore contribute to explain the breakdown of the regulation of gene expression in hybrid males. Additionally, the X chromosome plays a large role in this hybrid male misexpression, which affects not only MBG but also, to a lesser degree, nonsex-biased genes. Interestingly, hybrid male misexpression is concentrated mostly in autosomal genes, likely facilitated by the rapid evolution of sex-linked trans-acting factors. 
I suggest that the faster evolution of X-linked MBGs, at both protein and expression levels, contributes to explain the large effect of the X chromosome on hybrid male sterility, likely mediating widespread autosomal misexpression through the preferential recognition of cis-regulatory elements by conspecific trans-acting factors (i.e., cis-trans conspecific recognition).
Extraordinary diversity of visual opsin genes in dragonflies
Futahashi, Ryo; Kawahara-Miki, Ryouka; Kinoshita, Michiyo; Yoshitake, Kazutoshi; Yajima, Shunsuke; Arikawa, Kentaro; Fukatsu, Takema
2015-01-01
Dragonflies are colorful and large-eyed animals strongly dependent on color vision. Here we report an extraordinarily large number of opsin genes in dragonflies and their characteristic spatiotemporal expression patterns. Exhaustive transcriptomic and genomic surveys of three dragonflies of the family Libellulidae consistently identified 20 opsin genes, consisting of 4 nonvisual opsin genes and 16 visual opsin genes of 1 UV, 5 short-wavelength (SW), and 10 long-wavelength (LW) type. Comprehensive transcriptomic survey of the other dragonflies representing an additional 10 families also identified as many as 15–33 opsin genes. Molecular phylogenetic analysis revealed dynamic multiplications and losses of the opsin genes in the course of evolution. In contrast to many SW and LW genes expressed in adults, only one SW gene and several LW genes were expressed in larvae, reflecting less visual dependence and LW-skewed light conditions for their lifestyle under water. In this context, notably, the sand-burrowing or pit-dwelling species tended to lack SW gene expression in larvae. In adult visual organs: (i) many SW genes and a few LW genes were expressed in the dorsal region of compound eyes, presumably for processing SW-skewed light from the sky; (ii) a few SW genes and many LW genes were expressed in the ventral region of compound eyes, probably for perceiving terrestrial objects; and (iii) expression of a specific LW gene was associated with ocelli. Our findings suggest that the stage- and region-specific expressions of the diverse opsin genes underlie the behavior, ecology, and adaptation of dragonflies. PMID:25713365
Cohen, David; Bogeat-Triboulot, Marie-Béatrice; Vialet-Chabrand, Silvère; Merret, Rémy; Courty, Pierre-Emmanuel; Moretti, Sébastien; Bizet, François; Guilliot, Agnès; Hummel, Irène
2013-01-01
Aquaporins (AQPs) are membrane channels belonging to the major intrinsic proteins family and are known for their ability to facilitate water movement. While in Populus trichocarpa, AQP proteins form a large family encompassing fifty-five genes, most of the experimental work focused on a few genes or subfamilies. The current work was undertaken to develop a comprehensive picture of the whole AQP gene family in Populus species by delineating gene expression domain and distinguishing responsiveness to developmental and environmental cues. Since duplication events amplified the poplar AQP family, we addressed the question of expression redundancy between gene duplicates. For these purposes, we carried out a meta-analysis of all publicly available Affymetrix experiments. Our in-silico strategy controlled for previously identified biases in cross-species transcriptomics, a necessary step for any comparative transcriptomics based on multispecies design chips. Three poplar AQPs were not supported by any expression data, even in a large collection of situations (abiotic and biotic constraints, temporal oscillations and mutants). The expression of 11 AQPs was never or poorly regulated, regardless of the breadth of their expression domain and their expression level. Our work highlighted that PtTIP1;4 was the most responsive gene of the AQP family. A high functional divergence between gene duplicates was detected across species and in response to tested cues, except for the root-expressed PtTIP2;3/PtTIP2;4 pair exhibiting 80% convergent responses. Our meta-analysis assessed key features of aquaporin expression which had remained hidden in single experiments, such as expression breadth, response specificity and genotype and environment interactions. By consolidating expression profiles using independent experimental series, we showed that the large expansion of the AQP family in poplar was accompanied by a strong divergence of gene expression, even if some cases of functional redundancy could be suspected. PMID:23393587
Kocmarek, Andrea L; Ferguson, Moira M; Danzmann, Roy G
2015-04-01
All-female lines of fish are created by crossing sex reversed (XX genotype) males with normal females. All-female lines avoid the deleterious phenotypic effects that are typical of precocious maturation in males. To determine whether all-female and mixed sex populations of rainbow trout (Oncorhynchus mykiss) differ in performance, we compared the growth and gene expression profiles in progeny groups produced by crossing an XX male and an XY male to the same five females. Body weight and length were measured in the resulting all-female (XX) and mixed sex (XX/XY) offspring groups. Microarray experiments with liver and white muscle were used to determine if the gene expression profiles of large and small XX offspring differ from those in large and small XX/XY offspring. We detected no significant differences in body length and weight between offspring groups, but XX offspring were significantly less variable in the value of these traits. A large number of upregulated genes were shared between the large XX and large XX/XY offspring; the small XX and small XX/XY offspring also shared similar expression profiles. No GO category differences were seen in the liver or between the large XX and large XX/XY offspring in the muscle. The greatest differences between the small XX and small XX/XY offspring were in the genes assigned to the "small molecule metabolic process" and "cellular metabolic process" GO level 3 categories. Similarly, genes within these categories as well as the category "macromolecule metabolic process" were more highly expressed in small compared to large XX fish.
Zhang, Ya-Nan; Jin, Jun-Yan; Jin, Rong; Xia, Yi-Han; Zhou, Jing-Jiang; Deng, Jian-Yu; Dong, Shuang-Lin
2013-01-01
Background A large number of insect chemosensory genes from different gene subfamilies have been identified and annotated, but their functional diversity and complexity are largely unknown. A systemic examination of expression patterns in chemosensory organs could provide important information. Methodology/Principal Findings We identified 92 putative chemosensory genes by analysing the transcriptome of the antennae and female sex pheromone gland of the purple stem borer Sesamia inferens, of which 87 are novel in this species, including 24 transcripts encoding odorant binding proteins (OBPs), 24 encoding chemosensory proteins (CSPs), 2 encoding sensory neuron membrane proteins (SNMPs), 39 encoding odorant receptors (ORs) and 3 encoding ionotropic receptors (IRs). The transcriptome analyses were validated and quantified with a detailed global expression profiling by Reverse Transcription-PCR for all 92 transcripts and by Quantitative Real Time RT-PCR for 16 selected transcripts. Among the chemosensory gene subfamilies, CSP transcripts are most widely and evenly expressed in different tissues and stages, OBP transcripts showed a clear antenna bias and most OR transcripts are only detected in adult antennae. Our results also revealed that some OR transcripts, such as the transcripts of SNMP2 and 2 IRs, were expressed in non-chemosensory tissues, and some CSP transcripts showed antenna-biased expression. Furthermore, no chemosensory transcript is specific to the female sex pheromone gland and very few are found in the heads. Conclusion Our study revealed that there are a large number of chemosensory genes expressed in S. inferens, and some of them displayed unusual expression profiles in non-chemosensory tissues. The identification of a large set of putative chemosensory genes of each subfamily from a single insect species, together with their different expression profiles, provides further information for understanding the functions of these chemosensory genes in S. inferens as well as other insects. PMID:23894529
2009-01-01
Background Sequence identification of ESTs from non-model species offers distinct challenges particularly when these species have duplicated genomes and when they are phylogenetically distant from sequenced model organisms. For the common carp, an environmental model of aquacultural interest, large numbers of ESTs remained unidentified using BLAST sequence alignment. We have used the expression profiles from large-scale microarray experiments to suggest gene identities. Results Expression profiles from ~700 cDNA microarrays describing responses of 7 major tissues to multiple environmental stressors were used to define a co-expression landscape. This was based on the Pearson correlation coefficient relating each gene with all other genes, from which a network description provided clusters of highly correlated genes as 'mountains'. We show that these contain genes with known identities and genes with unknown identities, and that the correlation constitutes evidence of identity in the latter. This procedure has suggested identities for 522 of 2701 unknown carp EST sequences. We also discriminate several common carp genes and gene isoforms that were not discriminated by BLAST sequence alignment alone. Precision in identification was substantially improved by use of data from multiple tissues and treatments. Conclusion The detailed analysis of co-expression landscapes is a sensitive technique for suggesting an identity for the large number of BLAST unidentified cDNAs generated in EST projects. It is capable of detecting even subtle changes in expression profiles, and thereby of distinguishing genes with a common BLAST identity into different identities. It benefits from the use of multiple treatments or contrasts, and from the large-scale microarray data. PMID:19939286
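The co-expression idea in the abstract above (correlate each unidentified sequence with every identified gene and treat a strong Pearson correlation as evidence of identity) can be sketched as follows. This is a minimal illustration, not the authors' pipeline: the function name suggest_identities, the 0.9 threshold, and the toy expression matrix are all assumptions made for the example, and the abstract's network clustering into 'mountains' is replaced here by a simple best-match lookup.

```python
import numpy as np

def suggest_identities(expr, names, threshold=0.9):
    """Suggest a name for each unidentified gene from its best-correlated known gene.

    expr      : 2-D array, one row per gene, one column per array/condition
    names     : list of gene names, with None marking unidentified sequences
    threshold : hypothetical minimum Pearson r to accept a suggestion
    """
    corr = np.corrcoef(expr)  # Pearson correlation between every pair of rows
    known = [j for j, n in enumerate(names) if n is not None]
    suggestions = {}
    if not known:
        return suggestions
    for i, name in enumerate(names):
        if name is not None:
            continue  # already identified
        best = max(known, key=lambda j: corr[i, j])
        if corr[i, best] >= threshold:
            suggestions[i] = names[best]
    return suggestions

# Toy data (invented): row 1 tracks row 0 closely, row 3 tracks nothing well.
expr = np.array([
    [1.0, 2.0, 3.0, 4.0],
    [2.0, 4.0, 6.0, 8.1],
    [4.0, 3.0, 2.0, 1.0],
    [1.0, 3.0, 2.0, 4.0],
])
names = ["geneA", None, "geneB", None]
result = suggest_identities(expr, names)
```

With many tissues and treatments as columns, chance correlations become less likely, which is consistent with the abstract's remark that precision improved when multiple tissues and treatments were used.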
Su, Yuhua; Nielsen, Dahlia; Zhu, Lei; Richards, Kristy; Suter, Steven; Breen, Matthew; Motsinger-Reif, Alison; Osborne, Jason
2013-01-05
A bivariate mixture model utilizing information across two species was proposed to solve the fundamental problem of identifying differentially expressed genes in microarray experiments. The model utility was illustrated using a dog and human lymphoma data set prepared by a group of scientists in the College of Veterinary Medicine at North Carolina State University. A small number of genes were identified as being differentially expressed in both species and the human genes in this cluster serve as a good predictor for classifying diffuse large-B-cell lymphoma (DLBCL) patients into two subgroups, the germinal center B-cell-like diffuse large B-cell lymphoma and the activated B-cell-like diffuse large B-cell lymphoma. The number of human genes that were observed to be significantly differentially expressed (21) from the two-species analysis was very small compared to the number of human genes (190) identified with only one-species analysis (human data). The genes may be clinically relevant/important, as this small set achieved low misclassification rates of DLBCL subtypes. Additionally, the two subgroups defined by this cluster of human genes had significantly different survival functions, indicating that the stratification based on gene-expression profiling using the proposed mixture model provided improved insight into the clinical differences between the two cancer subtypes.
Guo, Yuan; Qiu, Caisheng; Long, Songhua; Chen, Ping; Hao, Dongmei; Preisner, Marta; Wang, Hui; Wang, Yufu
2017-08-30
To better understand the molecular mechanisms and gene expression characteristics associated with development of bast fiber cells within flax stem phloem, the gene expression profiles of flax stem peels and leaves were screened using Illumina's Digital Gene Expression (DGE) analysis. Four DGE libraries (2 for stem peel and 2 for leaf), ranging from 6.7 to 9.2 million clean reads, were obtained, which produced 7.0 million and 6.8 million mapped reads for flax stem peel and leaf, respectively. By differential gene expression analysis, a total of 975 genes, of which 708 (73%) have protein-coding annotation, were identified as phloem-enriched genes putatively involved in the processes of polysaccharide and cell wall metabolism. Differentially expressed genes (DEGs) were validated using quantitative RT-PCR; the expression patterns of all nine genes determined by qRT-PCR agreed well with those obtained by sequencing analysis. Cluster and Gene Ontology (GO) analysis revealed that a large number of genes related to the metabolic process, catalytic activity and binding categories were expressed predominantly in the stem peels. The Kyoto Encyclopedia of Genes and Genomes (KEGG) analysis of the phloem-enriched genes suggested approximately 111 biological pathways. The large number of genes and pathways produced from DGE sequencing will expand our understanding of the complex molecular and cellular events in flax bast fiber development and provide a foundation for future studies on fiber development in other bast fiber crops. Copyright © 2017 Elsevier B.V. All rights reserved.
DEXTER: Disease-Expression Relation Extraction from Text.
Gupta, Samir; Dingerdissen, Hayley; Ross, Karen E; Hu, Yu; Wu, Cathy H; Mazumder, Raja; Vijay-Shanker, K
2018-01-01
Gene expression levels affect biological processes and play a key role in many diseases. Characterizing expression profiles is useful for clinical research, and diagnostics and prognostics of diseases. There are currently several high-quality databases that capture gene expression information, obtained mostly from large-scale studies, such as microarray and next-generation sequencing technologies, in the context of disease. The scientific literature is another rich source of information on gene expression-disease relationships that not only have been captured from large-scale studies but have also been observed in thousands of small-scale studies. Expression information obtained from literature through manual curation can extend expression databases. While many of the existing databases include information from literature, they are limited by the time-consuming nature of manual curation and have difficulty keeping up with the explosion of publications in the biomedical field. In this work, we describe an automated text-mining tool, Disease-Expression Relation Extraction from Text (DEXTER) to extract information from literature on gene and microRNA expression in the context of disease. One of the motivations in developing DEXTER was to extend the BioXpress database, a cancer-focused gene expression database that includes data derived from large-scale experiments and manual curation of publications. The literature-based portion of BioXpress lags behind significantly compared to expression information obtained from large-scale studies and can benefit from our text-mined results. We have conducted two different evaluations to measure the accuracy of our text-mining tool and achieved average F-scores of 88.51 and 81.81% for the two evaluations, respectively. 
Also, to demonstrate the ability to extract rich expression information in different disease-related scenarios, we used DEXTER to extract differential expression information for 2024 genes in lung cancer, 115 glycosyltransferases in 62 cancers and 826 microRNA in 171 cancers. All extractions using DEXTER are integrated in the literature-based portion of BioXpress. Database URL: http://biotm.cis.udel.edu/DEXTER.
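The DEXTER abstract reports its accuracy as F-scores. As a reminder of what that metric measures, here is a minimal sketch of the standard F-score computed from counts of correct, spurious, and missed extractions. The function name and the counts in the usage line are hypothetical and are not taken from the DEXTER evaluation; the abstract does not say which variant (beta value, averaging scheme) was used, so the common balanced F1 is assumed as the default.

```python
def f_score(tp, fp, fn, beta=1.0):
    """Standard F-beta score from true positives, false positives and false negatives.

    beta=1.0 gives the balanced F1 score, the harmonic mean of precision and recall.
    """
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    if precision == 0.0 and recall == 0.0:
        return 0.0  # nothing correct was extracted
    b2 = beta * beta
    return (1 + b2) * precision * recall / (b2 * precision + recall)

# Hypothetical counts: 80 correct extractions, 20 spurious, 20 missed.
example = f_score(tp=80, fp=20, fn=20)
```

Because it is a harmonic mean, the score is pulled toward the lower of precision and recall, so a tool cannot reach a high F-score by extracting everything or by extracting almost nothing.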
Gene expression profile of the plant pathogen Xylella fastidiosa during biofilm formation in vitro.
de Souza, Alessandra A; Takita, Marco A; Coletta-Filho, Helvécio D; Caldana, Camila; Yanai, Giane M; Muto, Nair H; de Oliveira, Regina C; Nunes, Luiz R; Machado, Marcos A
2004-08-15
A biofilm is a community of microorganisms attached to a solid surface. Cells within biofilms differ from planktonic cells, showing higher resistance to biocides, detergent, antibiotic treatments and host defense responses. Even though there are a number of gene expression studies in bacterial biofilm formation, limited information is available concerning plant pathogens. It was previously demonstrated that the plant pathogen Xylella fastidiosa could grow as a biofilm, a possibly important factor for its pathogenicity. In this study we utilized analysis of microarrays to specifically identify genes expressed in X. fastidiosa cells growing in a biofilm, when compared to planktonic cells. About half of the differentially expressed genes encode hypothetical proteins, reflecting the large number of ORFs with unknown functions in bacterial genomes. However, under the biofilm condition we observed an increase in the expression of some housekeeping genes responsible for metabolic functions. We also found a large number of genes from the pXF51 plasmid being differentially expressed. Some of the overexpressed genes in the biofilm condition encode proteins involved in attachment to surfaces. Other genes possibly confer advantages to the bacterium in the environment that it colonizes. This study demonstrates that the gene expression in the biofilm growth condition of the plant pathogen X. fastidiosa is quite similar to other characterized systems.
Higo, Noriyuki; Sato, Akira; Yamamoto, Tatsuya; Oishi, Takao; Nishimura, Yukio; Murata, Yumi; Onoe, Hirotaka; Isa, Tadashi; Kojima, Toshio
2018-05-01
The present study aimed to assess the molecular bases of cortical compensatory mechanisms following spinal cord injury in primates. To accomplish this, comprehensive changes in gene expression were investigated in the bilateral primary motor cortex (M1), dorsal premotor cortex (PMd), and ventral premotor cortex (PMv) after a unilateral lesion of the lateral corticospinal tract (l-CST). At 2 weeks after the lesion, a large number of genes exhibited altered expression levels in the contralesional M1, which is directly linked to the lesioned l-CST. Gene ontology and network analyses indicated that these changes in gene expression are involved in the atrophy and plasticity changes observed in neurons. Orchestrated gene expression changes were present when behavioral recovery was attained 3 months after the lesion, particularly among the bilateral premotor areas, and a large number of these genes are involved in plasticity. Moreover, several genes abundantly expressed in M1 of intact monkeys were upregulated in both the PMd and PMv after the l-CST lesion. These area-specific and time-dependent changes in gene expression may underlie the molecular mechanisms of functional recovery following a lesion of the l-CST. © 2018 Wiley Periodicals, Inc.
Gene length as a biological timer to establish temporal transcriptional regulation
Kirkconnell, Killeen S.; Magnuson, Brian; Paulsen, Michelle T.; Lu, Brian; Bedi, Karan; Ljungman, Mats
2017-01-01
ABSTRACT Transcriptional timing is inherently influenced by gene length, thus providing a mechanism for temporal regulation of gene expression. While gene size has been shown to be important for the expression timing of specific genes during early development, whether it plays a role in the timing of other global gene expression programs has not been extensively explored. Here, we investigate the role of gene length during the early transcriptional response of human fibroblasts to serum stimulation. Using the nascent sequencing techniques Bru-seq and BruUV-seq, we identified immediate genome-wide transcriptional changes following serum stimulation that were linked to rapid activation of enhancer elements. We identified 873 significantly induced and 209 significantly repressed genes. Variations in gene size allowed for a large group of genes to be simultaneously activated but produce full-length RNAs at different times. The median length of the group of serum-induced genes was significantly larger than the median length of all expressed genes, housekeeping genes, and serum-repressed genes. These gene length relationships were also observed in corresponding mouse orthologs, suggesting that relative gene size is evolutionarily conserved. The sizes of transcription factor and microRNA genes immediately induced after serum stimulation varied dramatically, setting up a cascade mechanism for temporal expression arising from a single activation event. The retention and expansion of large intronic sequences during evolution have likely played important roles in fine-tuning the temporal expression of target genes in various cellular response programs. PMID:28055303
Non-Small-Cell Lung Cancer Molecular Signatures Recapitulate Lung Developmental Pathways
Borczuk, Alain C.; Gorenstein, Lyall; Walter, Kristin L.; Assaad, Adel A.; Wang, Liqun; Powell, Charles A.
2003-01-01
Current paradigms hold that lung carcinomas arise from pluripotent stem cells capable of differentiation into one or several histological types. These paradigms suggest lung tumor cell ontogeny is determined by consequences of gene expression that recapitulate events important in embryonic lung development. Using oligonucleotide microarrays, we acquired gene profiles from 32 microdissected non-small-cell lung tumors. We determined the 100 top-ranked marker genes for adenocarcinoma, squamous cell, large cell, and carcinoid using nearest neighbor analysis. Results were validated by immunostaining for 11 selected proteins using a tissue microarray representing 80 tumors. Gene expression data of lung development were accessed from a publicly available dataset generated with the murine Mu11k genome microarray. Self-organized mapping identified two temporally distinct clusters of murine orthologues. Supervised clustering of lung development data showed large-cell carcinoma gene orthologues were in a cluster expressed in pseudoglandular and canalicular stages whereas adenocarcinoma homologues were predominantly in a cluster expressed later in the terminal sac and alveolar stages of murine lung development. Representative large-cell genes (E2F3, MYBL2, HDAC2, CDK4, PCNA) are expressed in the nucleus and are associated with cell cycle and proliferation. In contrast, adenocarcinoma genes are associated with lung-specific transcription pathways (SFTPB, TTF-1), cell adhesion, and signal transduction. In sum, non-small-cell lung tumor histology gene profiles suggest mechanisms relevant to ontogeny and clinical course. Adenocarcinoma genes are associated with differentiation and glandular formation whereas large-cell genes are associated with proliferation and differentiation arrest. The identification of developmentally regulated pathways active in tumorigenesis provides insights into lung carcinogenesis and suggests early steps may differ according to the eventual tumor morphology. PMID:14578194
Wyatt, Linda S; Xiao, Wei; Americo, Jeffrey L; Earl, Patricia L; Moss, Bernard
2017-06-06
Viruses are used as expression vectors for protein synthesis, immunology research, vaccines, and therapeutics. Advantages of poxvirus vectors include the accommodation of large amounts of heterologous DNA, the presence of a cytoplasmic site of transcription, and high expression levels. On the other hand, competition of approximately 200 viral genes with the target gene for expression and immune recognition may be disadvantageous. We describe a vaccinia virus (VACV) vector that uses an early promoter to express the bacteriophage T7 RNA polymerase; has the A23R intermediate transcription factor gene deleted, thereby restricting virus replication to complementing cells; and has a heterologous gene regulated by a T7 promoter. In noncomplementing cells, viral early gene expression and DNA replication occurred normally but synthesis of intermediate and late proteins was prevented. Nevertheless, the progeny viral DNA provided templates for abundant expression of heterologous genes regulated by a T7 promoter. Selective expression of the Escherichia coli lac repressor gene from an intermediate promoter reduced transcription of the heterologous gene specifically in complementing cells, where large amounts might adversely impact VACV replication. Expression of heterologous proteins mediated by the A23R deletion vector equaled that of a replicating VACV, was higher than that of a nonreplicating modified vaccinia virus Ankara (MVA) vector used for candidate vaccines in vitro and in vivo, and was similarly immunogenic in mice. Unlike the MVA vector, the A23R deletion vector still expresses numerous early genes that can restrict immunogenicity as demonstrated here by the failure of the prototype vector to induce interferon alpha. By deleting immunomodulatory genes, we anticipate further improvements in the system. IMPORTANCE Vaccines provide an efficient and effective way of preventing infectious diseases. Nevertheless, new and better vaccines are needed. Vaccinia virus, which was used successfully as a live vaccine to eradicate smallpox, has been further attenuated and adapted as a recombinant vector for immunization against other pathogens. However, since the initial description of this vector system, only incremental improvements largely related to safety have been implemented. Here we described novel modifications of the platform that increased expression of the heterologous target gene and decreased expression of endogenous vaccinia virus genes while providing safety by preventing replication of the candidate vaccine except in complementing cells used for vector propagation. Copyright © 2017 Wyatt et al.
Basse, Astrid L; Dixen, Karen; Yadav, Rachita; Tygesen, Malin P; Qvortrup, Klaus; Kristiansen, Karsten; Quistorff, Bjørn; Gupta, Ramneek; Wang, Jun; Hansen, Jacob B
2015-03-19
Large mammals are capable of thermoregulation shortly after birth due to the presence of brown adipose tissue (BAT). The majority of BAT disappears after birth and is replaced by white adipose tissue (WAT). We analyzed the postnatal transformation of adipose in sheep with a time course study of the perirenal adipose depot. We observed changes in tissue morphology, gene expression and metabolism within the first two weeks of postnatal life consistent with the expected transition from BAT to WAT. The transformation was characterized by massively decreased mitochondrial abundance and down-regulation of gene expression related to mitochondrial function and oxidative phosphorylation. Global gene expression profiling demonstrated that the time points grouped into three phases: a brown adipose phase, a transition phase and a white adipose phase. Between the brown adipose and the transition phase 170 genes were differentially expressed, and 717 genes were differentially expressed between the transition and the white adipose phase. Thirty-eight genes were shared among the two sets of differentially expressed genes. We identified a number of regulated transcription factors, including NR1H3, MYC, KLF4, ESR1, RELA and BCL6, which were linked to the overall changes in gene expression during the adipose tissue remodeling. Finally, the perirenal adipose tissue expressed both brown and brite/beige adipocyte marker genes at birth, the expression of which changed substantially over time. Using global gene expression profiling of the postnatal BAT to WAT transformation in sheep, we provide novel insight into adipose tissue plasticity in a large mammal, including identification of novel transcriptional components linked to adipose tissue remodeling. Moreover, our data set provides a useful resource for further studies in adipose tissue plasticity.
Expression of the human hepatitis B virus large surface antigen gene in transgenic tomato plants.
Lou, Xiao-Ming; Yao, Quan-Hong; Zhang, Zhen; Peng, Ri-He; Xiong, Ai-Sheng; Wang, Hua-Kun
2007-04-01
The original hepatitis B virus (HBV) large surface antigen gene was synthesized. In order to optimize the expression of this gene in tomato plants, the tobacco pathogenesis-related protein S signal peptide was fused to the 5' end of the modified gene and the sequence encoding amino acids S, E, K, D, E, and L was placed at the 3' end. The gene encoding the modified HBV large surface antigen under the control of a fruit-specific promoter was constructed and expressed in transgenic tomato plants. The expression of the antigen from transgenic plants was confirmed by PCR and reverse transcriptase PCR. Enzyme-linked immunoassays using a monoclonal antibody directed against human serum-derived HBsAg revealed that the maximal level of HBsAg was about 0.02% of the soluble protein in transgenic tomato fruit. The amount of HBsAg in mature fruits was found to be 65- to 171-fold larger than in small or medium fruits and leaf tissues. Examination of transgenic plant samples by transmission electron microscopy proved that HBsAg had been expressed and had accumulated. The HBsAg protein was capable of assembling into capsomers and virus-like particles. To our knowledge, this is the first time the HBV large surface antigen has been expressed in plants. This work suggests the possibility of producing a new alternative vaccine for human HBV.
Hepatic gene expression profiling of 5'-AMP-induced hypometabolism in mice.
Zhao, Zhaoyang; Miki, Takao; Van Oort-Jansen, Anita; Matsumoto, Tomoko; Loose, David S; Lee, Cheng Chi
2011-04-12
There is currently much interest in clinical applications of therapeutic hypothermia. Hypothermia can be a consequence of hypometabolism. We have recently established a procedure for the induction of a reversible deep hypometabolic state in mice using 5'-adenosine monophosphate (5'-AMP) in conjunction with moderate ambient temperature. The current study aims at investigating the impact of this technology at the gene expression level in a major metabolic organ, the liver. Our findings reveal that expression levels of the majority of genes in liver are not significantly altered by deep hypometabolism. However, among those affected by hypometabolism, more genes are differentially upregulated than downregulated both in a deep hypometabolic state and in the early arousal state. These altered gene expression levels during 5'-AMP induced hypometabolism are largely restored to normal levels within 2 days of the treatment. Our data also suggest that temporal control of circadian genes is largely stalled during deep hypometabolism.
A gene expression resource generated by genome-wide lacZ profiling in the mouse
Tuck, Elizabeth; Estabel, Jeanne; Oellrich, Anika; Maguire, Anna Karin; Adissu, Hibret A.; Souter, Luke; Siragher, Emma; Lillistone, Charlotte; Green, Angela L.; Wardle-Jones, Hannah; Carragher, Damian M.; Karp, Natasha A.; Smedley, Damian; Adams, Niels C.; Bussell, James N.; Adams, David J.; Ramírez-Solis, Ramiro; Steel, Karen P.; Galli, Antonella; White, Jacqueline K.
2015-01-01
ABSTRACT Knowledge of the expression profile of a gene is a critical piece of information required to build an understanding of the normal and essential functions of that gene and any role it may play in the development or progression of disease. High-throughput, large-scale efforts are on-going internationally to characterise reporter-tagged knockout mouse lines. As part of that effort, we report an open access adult mouse expression resource, in which the expression profile of 424 genes has been assessed in up to 47 different organs, tissues and sub-structures using a lacZ reporter gene. Many specific and informative expression patterns were noted. Expression was most commonly observed in the testis and brain and was most restricted in white adipose tissue and mammary gland. Over half of the assessed genes presented with an absent or localised expression pattern (categorised as 0-10 positive structures). A link between complexity of expression profile and viability of homozygous null animals was observed; inactivation of genes expressed in ≥21 structures was more likely to result in reduced viability by postnatal day 14 compared with more restricted expression profiles. For validation purposes, this mouse expression resource was compared with Bgee, a federated composite of RNA-based expression data sets. Strong agreement was observed, indicating a high degree of specificity in our data. Furthermore, there were 1207 observations of expression of a particular gene in an anatomical structure where Bgee had no data, indicating a large amount of novelty in our data set. Examples of expression data corroborating and extending genotype-phenotype associations and supporting disease gene candidacy are presented to demonstrate the potential of this powerful resource. PMID:26398943
The goal of this project is to use an eight-gene expression profile to define functional signatures for small molecules and natural products with heretofore undefined mechanism of action. Two genes in the eight-gene set are used as internal controls and do not vary across gene expression array data collected from the public domain. The remaining six genes are found to vary independently across a large collection of publicly available gene expression array datasets.
Martin, Elizabeth M; Clapp, Phillip W; Rebuli, Meghan E; Pawlak, Erica A; Glista-Baker, Ellen; Benowitz, Neal L; Fry, Rebecca C; Jaspers, Ilona
2016-07-01
Exposure to cigarette smoke is known to result in impaired host defense responses and immune suppressive effects. However, the effects of new and emerging tobacco products, such as e-cigarettes, on the immune status of the respiratory epithelium are largely unknown. We conducted a clinical study collecting superficial nasal scrape biopsies, nasal lavage, urine, and serum from nonsmokers, cigarette smokers, and e-cigarette users and assessed them for changes in immune gene expression profiles. Smoking status was determined based on a smoking history and a 3- to 4-wk smoking diary and confirmed using serum cotinine and urine 4-(methylnitrosamino)-1-(3-pyridyl)-1-butanol (NNAL) levels. Total RNA from nasal scrape biopsies was analyzed using the nCounter Human Immunology v2 Expression panel. Smoking cigarettes or vaping e-cigarettes resulted in decreased expression of immune-related genes. All genes with decreased expression in cigarette smokers (n = 53) were also decreased in e-cigarette smokers. Additionally, vaping e-cigarettes was associated with suppression of a large number of unique genes (n = 305). Furthermore, the e-cigarette users showed a greater suppression of genes common with those changed in cigarette smokers. This was particularly apparent for suppressed expression of transcription factors, such as EGR1, which was functionally associated with decreased expression of 5 target genes in cigarette smokers and 18 target genes in e-cigarette users. Taken together, these data indicate that vaping e-cigarettes is associated with decreased expression of a large number of immune-related genes, which are consistent with immune suppression at the level of the nasal mucosa. Copyright © 2016 the American Physiological Society.
Carey, Michelle; Ramírez, Juan Camilo; Wu, Shuang; Wu, Hulin
2018-07-01
A biological host response to an external stimulus or intervention such as a disease or infection is a dynamic process, which is regulated by an intricate network of many genes and their products. Understanding the dynamics of this gene regulatory network allows us to infer the mechanisms involved in a host response to an external stimulus, and hence aids the discovery of biomarkers of phenotype and biological function. In this article, we propose a modeling/analysis pipeline for dynamic gene expression data, called Pipeline4DGEData, which consists of a series of statistical modeling techniques to construct dynamic gene regulatory networks from the large volumes of high-dimensional time-course gene expression data that are freely available in the Gene Expression Omnibus repository. This pipeline has a consistent and scalable structure that allows it to simultaneously analyze a large number of time-course gene expression data sets, and then integrate the results across different studies. We apply the proposed pipeline to influenza infection data from nine studies and demonstrate that interesting biological findings can be discovered with its implementation.
Light-dependent expression of flg22-induced defense genes in Arabidopsis.
Sano, Satoshi; Aoyama, Mayu; Nakai, Kana; Shimotani, Koji; Yamasaki, Kanako; Sato, Masa H; Tojo, Daisuke; Suwastika, I Nengah; Nomura, Hironari; Shiina, Takashi
2014-01-01
Chloroplasts have been reported to generate retrograde immune signals that activate defense gene expression in the nucleus. However, the roles of light and photosynthesis in plant immunity remain largely elusive. In this study, we evaluated the effects of light on the expression of defense genes induced by flg22, a peptide derived from bacterial flagellins which acts as a potent elicitor in plants. Whole-transcriptome analysis of flg22-treated Arabidopsis thaliana seedlings under light and dark conditions for 30 min revealed that 30% of the genes strongly induced by flg22 (>4.0) require light for their rapid expression, whereas flg22-repressed genes include a significant number of genes that are down-regulated by light. Furthermore, light is responsible for the flg22-induced accumulation of salicylic acid (SA), indicating that light is indispensable for basal defense responses in plants. To elucidate the role of photosynthesis in defense, we further examined flg22-induced defense gene expression in the presence of specific inhibitors of photosynthetic electron transport: 3-(3,4-dichlorophenyl)-1,1-dimethylurea (DCMU) and 2,5-dibromo-3-methyl-6-isopropyl-benzoquinone (DBMIB). Light-dependent expression of defense genes was largely suppressed by DBMIB, but only partially suppressed by DCMU. These findings suggest that photosynthetic electron flow plays a role in controlling the light-dependent expression of flg22-inducible defense genes.
Sharon, Dror; Blackshaw, Seth; Cepko, Constance L.; Dryja, Thaddeus P.
2002-01-01
We used the serial analysis of gene expression (SAGE) technique to catalogue and measure the relative levels of expression of the genes expressed in the human peripheral retina, macula, and retinal pigment epithelium (RPE) from one or both of two humans, aged 88 and 44 years. The cone photoreceptor contribution to all transcription in the retina was found to be similar in the macula versus the retinal periphery, whereas the rod contribution was greater in the periphery versus the macula. Genes encoding structural proteins for axons were found to be expressed at higher levels in the macula versus the retinal periphery, probably reflecting the large proportion of ganglion cells in the central retina. In comparison with the younger eye, the peripheral retina of the older eye had a substantially higher proportion of mRNAs from genes encoding proteins involved in iron metabolism or protection against oxidative damage and a substantially lower proportion of mRNAs from genes encoding proteins involved in rod phototransduction. These differences may reflect the difference in age between the two donors or merely interindividual variation. The RPE library had numerous previously unencountered tags, suggesting that this cell type has a large, idiosyncratic repertoire of expressed genes. Comparison of these libraries with 100 reported nonocular SAGE libraries revealed 89 retina-specific or enriched genes expressed at substantial levels, of which 14 are known to cause a retinal disease and 53 are RPE-specific genes. We expect that these libraries will serve as a resource for understanding the relative expression levels of genes in the retina and the RPE and for identifying additional disease genes. PMID:11756676
paraGSEA: a scalable approach for large-scale gene expression profiling
Peng, Shaoliang; Yang, Shunyun
2017-01-01
Abstract More studies have been conducted using gene expression similarity to identify functional connections among genes, diseases and drugs. Gene Set Enrichment Analysis (GSEA) is a powerful analytical method for interpreting gene expression data. However, due to its enormous computational overhead in the significance-level estimation step and the multiple hypothesis testing step, its computational scalability and efficiency are poor on large-scale datasets. We proposed paraGSEA for efficient large-scale transcriptome data analysis. By optimization, the overall time complexity of paraGSEA is reduced from O(mn) to O(m+n), where m is the length of the gene sets and n is the length of the gene expression profiles, which contributes a more than 100-fold increase in performance compared with other popular GSEA implementations such as GSEA-P, SAM-GS and GSEA2. By further parallelization, a near-linear speed-up is gained on both workstations and clusters in an efficient manner with high scalability and performance on large-scale datasets. The analysis time of the whole LINCS phase I dataset (GSE92742) was reduced to about half an hour on a 1000-node cluster on Tianhe-2, or within 120 hours on a 96-core workstation. The source code of paraGSEA is licensed under the GPLv3 and available at http://github.com/ysycloud/paraGSEA. PMID:28973463
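For readers unfamiliar with the statistic that GSEA evaluates, the following is a minimal sketch of the standard running-sum enrichment score, assuming a gene list already sorted by its ranking metric. It is not the optimized or parallel paraGSEA code; the function name `enrichment_score` and its parameters are hypothetical and chosen only for illustration.

```python
def enrichment_score(ranked_genes, scores, gene_set, p=1.0):
    """Running-sum enrichment score for one gene set (illustrative sketch).

    ranked_genes: gene names sorted by the ranking metric (best first).
    scores:       ranking metric for each gene, in the same order.
    gene_set:     set of gene names to test.
    p:            weighting exponent applied to |score| for hits.
    """
    in_set = [g in gene_set for g in ranked_genes]
    hit_weight = sum(abs(s) ** p for s, h in zip(scores, in_set) if h)
    n_miss = len(ranked_genes) - sum(in_set)
    if hit_weight == 0 or n_miss == 0:
        return 0.0  # degenerate case: no usable hits or no misses

    running, best = 0.0, 0.0
    for s, h in zip(scores, in_set):
        if h:
            running += abs(s) ** p / hit_weight  # step up on a hit
        else:
            running -= 1.0 / n_miss              # step down on a miss
        if abs(running) > abs(best):
            best = running                       # keep the largest deviation from zero
    return best


# Toy example with hypothetical gene names.
genes = ["g1", "g2", "g3", "g4"]
metric = [4.0, 3.0, 2.0, 1.0]
es_top = enrichment_score(genes, metric, {"g1", "g2"})
es_bottom = enrichment_score(genes, metric, {"g3", "g4"})
```

A gene set concentrated at the top of the ranking gives a positive score and one concentrated at the bottom gives a negative score. Significance is normally assessed by recomputing the score many times under permutation, which is the step the abstract identifies as the computational bottleneck.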
Zha, Xianfeng; Yin, Qingsong; Tan, Huo; Wang, Chunyan; Chen, Shaohua; Yang, Lijian; Li, Bo; Wu, Xiuli; Li, Yangqiu
2013-05-01
Antigen-specific, T-cell receptor (TCR)-modified cytotoxic T lymphocytes (CTLs) that target tumors are an attractive strategy for specific adoptive immunotherapy. Little is known about whether there are any alterations in the gene expression profile after TCR gene transduction in T cells. We constructed TCR gene-redirected CTLs with specificity for diffuse large B-cell lymphoma (DLBCL)-associated antigens to elucidate the gene expression profiles of TCR gene-redirected T-cells, and we further analyzed the gene expression profile pattern of these redirected T-cells by Affymetrix microarrays. The resulting data were analyzed using Bioconductor software; a two-fold cut-off expression change was applied together with anti-correlation of the profile ratios to render the microarray analysis set. The fold change of all genes was calculated by comparing the three TCR gene-modified T-cells and a negative control counterpart. The gene pathways were analyzed using Bioconductor and Kyoto Encyclopedia of Genes and Genomes. Identical genes whose fold change was greater than or equal to 2.0 in all three TCR gene-redirected T-cell groups in comparison with the negative control were identified as the differentially expressed genes. The differentially expressed genes comprised 33 up-regulated genes and 1 down-regulated gene, including JUNB, FOS, TNF, IFN-γ, DUSP2, IL-1B, CXCL1, CXCL2, CXCL9, CCL2, CCL4, and CCL8. These genes are mainly involved in the TCR signaling, mitogen-activated protein kinase signaling, and cytokine-cytokine receptor interaction pathways. In conclusion, we characterized the gene expression profile of DLBCL-specific TCR gene-redirected T-cells. The changes corresponded to an up-regulation in the differentiation and proliferation of the T-cells. These data may help to explain some of the characteristics of the redirected T-cells.
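The selection rule described above (a fold change of at least 2.0 in all three groups relative to the negative control) can be sketched as follows. This is only an illustration under stated assumptions: expression values are on a linear, positive scale, the "down-regulated" direction is taken to mean a ratio of at most 1/2.0, and the function name, gene names, and numbers are hypothetical rather than taken from the study.

```python
def consistently_changed(treated, control, cutoff=2.0):
    """Split genes into up- and down-regulated lists.

    treated: {gene: [value in group 1, group 2, group 3]} (linear scale, > 0)
    control: {gene: value in the negative control}       (linear scale, > 0)
    A gene is kept only if every group passes the cutoff in the same direction.
    """
    up, down = [], []
    for gene, values in treated.items():
        ratios = [v / control[gene] for v in values]
        if all(r >= cutoff for r in ratios):
            up.append(gene)
        elif all(r <= 1.0 / cutoff for r in ratios):
            down.append(gene)
    return up, down


# Hypothetical toy data, not values from the paper.
treated = {
    "geneA": [4.0, 6.0, 8.0],   # up in all three groups
    "geneB": [1.0, 0.5, 0.8],   # down in all three groups
    "geneC": [4.0, 1.5, 5.0],   # inconsistent, so excluded
}
control = {"geneA": 2.0, "geneB": 2.0, "geneC": 2.0}
up_genes, down_genes = consistently_changed(treated, control)
```

Requiring agreement across all groups is a conservative choice: it trades sensitivity for a shorter list of genes that change reproducibly.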
Klingenberg, Jennifer M; McFarland, Kevin L; Friedman, Aaron J; Boyce, Steven T; Aronow, Bruce J; Supp, Dorothy M
2010-02-01
Bioengineered skin substitutes can facilitate wound closure in severely burned patients, but deficiencies limit their outcomes compared with native skin autografts. To identify gene programs associated with their in vivo capabilities and limitations, we extended previous gene expression profile analyses to now compare engineered skin after in vivo grafting with both in vitro maturation and normal human skin. Cultured skin substitutes were grafted on full-thickness wounds in athymic mice, and biopsy samples for microarray analyses were collected at multiple in vitro and in vivo time points. Over 10,000 transcripts exhibited large-scale expression pattern differences during in vitro and in vivo maturation. Using hierarchical clustering, 11 different expression profile clusters were partitioned on the basis of differential sample type and temporal stage-specific activation or repression. Analyses show that the wound environment exerts a massive influence on gene expression in skin substitutes. For example, in vivo-healed skin substitutes gained the expression of many native skin-expressed genes, including those associated with epidermal barrier and multiple categories of cell-cell and cell-basement membrane adhesion. In contrast, immunological, trichogenic, and endothelial gene programs were largely lacking. These analyses suggest important areas for guiding further improvement of engineered skin for both increased homology with native skin and enhanced wound healing.
USDA-ARS's Scientific Manuscript database
Large-scale, gene expression methods allow for high throughput analysis of physiological pathways at a fraction of the cost of individual gene expression analysis. Systems, such as the Fluidigm quantitative PCR array described here, can provide powerful assessments of the effects of diet, environme...
Caldwell, Rachel; Lin, Yan-Xia; Zhang, Ren
2015-01-01
There is a continuing interest in the analysis of gene architecture and gene expression to determine the relationship that may exist. Advances in high-quality sequencing technologies and large-scale resource datasets have increased the understanding of relationships and cross-referencing of expression data to the large genome data. Although a negative correlation between expression level and gene (especially transcript) length has been generally accepted, there have been some conflicting results arising from the literature concerning the impacts of different regions of genes, and the underlying reason is not well understood. The research aims to apply quantile regression techniques for statistical analysis of coding and noncoding sequence length and gene expression data in the plant, Arabidopsis thaliana, and fruit fly, Drosophila melanogaster, to determine if a relationship exists and if there is any variation or similarities between these species. The quantile regression analysis found that the coding sequence length and gene expression correlations varied, and similarities emerged for the noncoding sequence length (5′ and 3′ UTRs) between animal and plant species. In conclusion, the information described in this study provides the basis for further exploration into gene regulation with regard to coding and noncoding sequence length. PMID:26114098
Geographical Genomics of Human Leukocyte Gene Expression Variation in Southern Morocco
Idaghdour, Youssef; Czika, Wendy; Shianna, Kevin V.; Lee, S. Hong; Visscher, Peter M.; Martin, Hilary C.; Miclaus, Kelci; Jadallah, Sami J.; Goldstein, David B.; Wolfinger, Russell D.; Gibson, Greg
2009-01-01
Studies of the genetics of gene expression reveal expression SNPs that explain variation in transcript abundance. Here we address the robustness of eSNP associations to environmental geography and population structure in a comparison of 194 Arab and Amazigh individuals from a city and two villages in southern Morocco. Gene expression differed between pairs of locations for up to a third of all transcripts, with notable enrichment for ribosomal biosynthesis and oxidative phosphorylation. Robust associations were observed in the leukocyte samples with cis-eSNPs (P < 10^-8) for 346 genes, and trans-eSNPs (P < 10^-11) with 10 genes. All of these were consistent across the three sample locations and after controlling for ethnicity and relatedness. No evidence for large-effect trans-acting mediators of the pervasive environmental influence was found and instead genetic and environmental factors acted in a largely additive manner. PMID:19966804
The opportunities and challenges of large-scale molecular approaches to songbird neurobiology
Mello, C.V.; Clayton, D.F.
2014-01-01
High-throughput methods for analyzing genome structure and function are having a large impact in songbird neurobiology. Methods include genome sequencing and annotation, comparative genomics, DNA microarrays and transcriptomics, and the development of a brain atlas of gene expression. Key emerging findings include the identification of complex transcriptional programs active during singing, the robust brain expression of non-coding RNAs, evidence of profound variations in gene expression across brain regions, and the identification of molecular specializations within song production and learning circuits. Current challenges include the statistical analysis of large datasets, effective genome curation, the efficient localization of gene expression changes to specific neuronal circuits and cells, and the dissection of behavioral and environmental factors that influence brain gene expression. The field requires efficient methods for comparisons with organisms like chicken, which offer important anatomical, functional and behavioral contrasts. As sequencing costs plummet, opportunities emerge for comparative approaches that may help reveal evolutionary transitions contributing to vocal learning, social behavior and other properties that make songbirds such compelling research subjects. PMID:25280907
Evolution of Synonymous Codon Usage in Neurospora tetrasperma and Neurospora discreta
Whittle, C. A.; Sun, Y.; Johannesson, H.
2011-01-01
Neurospora comprises a primary model system for the study of fungal genetics and biology. In spite of this, little is known about genome evolution in Neurospora. For example, the evolution of synonymous codon usage is largely unknown in this genus. In the present investigation, we conducted a comprehensive analysis of synonymous codon usage and its relationship to gene expression and gene length (GL) in Neurospora tetrasperma and Neurospora discreta. For our analysis, we examined codon usage among 2,079 genes per organism and assessed gene expression using large-scale expressed sequence tag (EST) data sets (279,323 and 453,559 ESTs for N. tetrasperma and N. discreta, respectively). Data on relative synonymous codon usage revealed 24 codons (and two putative codons) that are more frequently used in genes with high than with low expression and thus were defined as optimal codons. Although codon-usage bias was highly correlated with gene expression, it was independent of selectively neutral base composition (introns), thus demonstrating that translational selection drives synonymous codon usage in these genomes. We also report that GL (coding sequences [CDS]) was inversely associated with optimal codon usage at each gene expression level, with highly expressed short genes having the greatest frequency of optimal codons. Optimal codon frequency was moderately higher in N. tetrasperma than in N. discreta, which might be due to variation in selective pressures and/or mating systems. PMID:21402862
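Relative synonymous codon usage (RSCU), the measure named above, is the observed count of a codon divided by the count expected if all synonymous codons for that amino acid were used equally. The sketch below illustrates the calculation for a single coding sequence. The function name `rscu` and the three-family codon table are hypothetical simplifications (a real analysis would cover the full genetic code and pool many genes per expression class), and this is not the authors' pipeline.

```python
from collections import Counter

# Small illustrative subset of the standard genetic code (an assumption made
# to keep the example short; a full table has 18 degenerate amino acids).
SYNONYMOUS = {
    "Lys": ["AAA", "AAG"],
    "Phe": ["TTT", "TTC"],
    "Ala": ["GCT", "GCC", "GCA", "GCG"],
}


def rscu(cds, families=SYNONYMOUS):
    """Return {codon: RSCU} for every codon family observed in the sequence."""
    usable = len(cds) - len(cds) % 3          # ignore a trailing partial codon
    counts = Counter(cds[i:i + 3] for i in range(0, usable, 3))
    result = {}
    for codons in families.values():
        total = sum(counts[c] for c in codons)
        if total == 0:
            continue                           # amino acid absent from this CDS
        expected = total / len(codons)         # equal-usage expectation
        for c in codons:
            result[c] = counts[c] / expected
    return result


# Toy sequence: three AAG codons and one AAA codon (both encode lysine).
values = rscu("AAGAAGAAGAAA")
```

An RSCU above 1 means the codon is used more often than expected under equal usage. Comparing RSCU between highly and lowly expressed genes is how candidate optimal codons are typically flagged.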
Msn2 Coordinates a Stoichiometric Gene Expression Program
Stewart-Ornstein, Jacob; Nelson, Christopher; DeRisi, Joe; Weissman, Jonathan S.; El-Samad, Hana
2014-01-01
Summary Background Many cellular processes operate in an “analog” regime in which the magnitude of the response is precisely tailored to the intensity of the stimulus. In order to maintain the coherence of such responses, the cell must provide for proportional expression of multiple target genes across a wide dynamic range of induction states. Our understanding of the strategies used to achieve graded gene regulation is limited. Results In this work, we document a relationship between stress responsive gene expression and the transcription factor Msn2 that is graded over a large range of Msn2 concentrations. We use computational modeling, in vivo, and in vitro analysis to dissect the roots of this relationship. Our studies reveal a simple and general strategy based on non-cooperative low-affinity interactions between Msn2 and its cognate binding sites, as well as competition over a large number of Msn2 binding sites in the genome relative to the number of Msn2 molecules. Conclusions In addition to enabling precise tuning of gene expression to the state of the environment, this strategy ensures co-linear activation of target genes, allowing for stoichiometric expression of large groups of genes without extensive promoter tuning. Furthermore, such a strategy enables precise modulation of the activity of any given promoter by addition of binding sites without altering the qualitative relationship between different genes in a regulon. This feature renders a given regulon highly ‘evolvable’. PMID:24210615
Zhang, Qingyang
2018-05-16
Differential co-expression analysis, as a complement to differential expression analysis, offers significant insights into the changes in molecular mechanisms across phenotypes. A prevailing approach to detecting differentially co-expressed genes is to compare Pearson's correlation coefficients in two phenotypes. However, due to the limitations of Pearson's correlation measure, this approach lacks the power to detect nonlinear changes in gene co-expression, which are common in gene regulatory networks. In this work, a new nonparametric procedure is proposed to search for differentially co-expressed gene pairs in different phenotypes from large-scale data. Our computational pipeline consists of two main steps, a screening step and a testing step. The screening step reduces the search space by filtering out all the independent gene pairs using the distance correlation measure. In the testing step, we compare the gene co-expression patterns in different phenotypes using a recently developed edge-count test. Both steps are distribution-free and target nonlinear relations. We illustrate the promise of the new approach by analyzing the Cancer Genome Atlas data and the METABRIC data for breast cancer subtypes. Compared with some existing methods, the new method is more powerful in detecting nonlinear types of differential co-expression. The distance correlation screening can greatly improve computational efficiency, facilitating its application to large data sets.
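To illustrate why a distance correlation screen can catch dependence that Pearson's correlation misses, here is a minimal sketch of the sample distance correlation for two one-dimensional variables, following the usual double-centering definition. It is not the paper's pipeline: the function names and the toy parabola data are assumptions, and the edge-count test from the testing step is not shown.

```python
import math


def _centered_distances(values):
    """Pairwise |vi - vj| matrix, double-centered (row, column, grand mean)."""
    n = len(values)
    dist = [[abs(a - b) for b in values] for a in values]
    row_means = [sum(row) / n for row in dist]   # symmetric: equals column means
    grand_mean = sum(row_means) / n
    return [[dist[i][j] - row_means[i] - row_means[j] + grand_mean
             for j in range(n)] for i in range(n)]


def _mean_product(a, b):
    n = len(a)
    return sum(a[i][j] * b[i][j] for i in range(n) for j in range(n)) / (n * n)


def distance_correlation(x, y):
    """Sample distance correlation of two equal-length numeric sequences."""
    a, b = _centered_distances(x), _centered_distances(y)
    dcov_sq = _mean_product(a, b)
    dvar = math.sqrt(_mean_product(a, a) * _mean_product(b, b))
    if dvar == 0:
        return 0.0                               # a constant variable
    return math.sqrt(max(dcov_sq, 0.0) / dvar)


def pearson(x, y):
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((p - mx) * (q - my) for p, q in zip(x, y))
    sxx = sum((p - mx) ** 2 for p in x)
    syy = sum((q - my) ** 2 for q in y)
    return sxy / math.sqrt(sxx * syy)


# Toy "gene pair" with a purely nonlinear relation (assumed data).
gene_a = [(i - 50) / 50 for i in range(101)]
gene_b = [v * v for v in gene_a]
linear_r = pearson(gene_a, gene_b)               # near zero by symmetry
nonlinear_r = distance_correlation(gene_a, gene_b)
```

On this toy pair the Pearson coefficient is essentially zero while the distance correlation is clearly positive, which is the property a screening step of this kind relies on. The pure-Python implementation is quadratic in the number of samples and is meant only for small illustrations.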
Goldstein, Darlene R
2006-10-01
Studies of gene expression using high-density short oligonucleotide arrays have become a standard in a variety of biological contexts. Of the expression measures that have been proposed to quantify expression in these arrays, multi-chip-based measures have been shown to perform well. As gene expression studies increase in size, however, utilizing multi-chip expression measures is more challenging in terms of computing memory requirements and time. A strategic alternative to exact multi-chip quantification on a full large chip set is to approximate expression values based on subsets of chips. This paper introduces an extrapolation method, Extrapolation Averaging (EA), and a resampling method, Partition Resampling (PR), to approximate expression in large studies. An examination of properties indicates that subset-based methods can perform well compared with exact expression quantification. The focus is on short oligonucleotide chips, but the same ideas apply equally well to any array type for which expression is quantified using an entire set of arrays, rather than for only a single array at a time. Software implementing Partition Resampling and Extrapolation Averaging is under development as an R package for the BioConductor project.
Rottiers, P; Verfaillie, T; Contreras, R; Revets, H; Desmedt, M; Dooms, H; Fiers, W; Grooten, J
1998-11-09
Progression to malignancy of transformed cells involves complex genetic alterations and aberrant gene expression patterns. While aberrant gene expression is often caused by alterations in individual genes, the contribution of the tumoral environment to the triggering of this gene expression is less well established. The stable but heterogeneous expression in cultured EL4/13 cells of a novel tumor-associated antigen, designated as HTgp-175, was chosen for the investigation of gene expression during tumor formation. Homogeneously HTgp-175-negative EL4/13 cells, isolated by cell sorting or obtained by subcloning, acquired HTgp-175 expression as a result of tumor formation. The tumorigenicity of HTgp-175-negative vs. HTgp-175-positive EL4 variants was identical, indicating that induction but not selection accounted for the phenotypic switch from HTgp-175-negative to HTgp-175-positive. Although mutagenesis experiments showed that the protein was not essential for tumor establishment, tumor-derived cells showed increased malignancy, linking HTgp-175 expression with genetic changes accompanying tumor progression. This novel gene expression was not an isolated event, since it was accompanied by ectopic expression of the large chondroitin sulfate proteoglycan PG-M and of normal differentiation antigens. We conclude that signals derived from the tumoral microenvironment contribute significantly to the aberrant gene expression pattern of malignant cells, apparently by fortuitous activation of differentiation processes and cause expression of novel differentiation antigens as well as of inappropriate tumor-associated and ectopic antigens.
Han, Fang; Wang, Xiaoqing; Yang, Qilian; Cai, Mingyi; Wang, Zhi Yong
2011-02-01
The Rac proteins are members of the Rho family of small G proteins and are implicated in the regulation of several pathways, including those leading to cytoskeleton reorganization, gene expression, cell proliferation, cell adhesion and cell migration and survival. In this investigation, a Rac gene (named the LycRac gene) was obtained from the large yellow croaker, expressed in Escherichia coli and purified. Subsequently, a specific antibody was raised using the purified fusion protein (GST-LycRac). Moreover, a GTP-binding assay showed that the LycRac protein had GTP-binding activity. The LycRac gene was ubiquitously transcribed and expressed in 9 tissues. Quantitative real-time RT-PCR and Western blot analysis revealed the highest expression in gill and the weakest expression in spleen. Time-course analysis revealed that LycRac expression was markedly up-regulated in blood, spleen and liver after immunization with polyinosinic-polycytidylic acid (poly I:C), the formalin-inactivated Gram-negative bacterium Vibrio parahaemolyticus and bacterial lipopolysaccharides (LPS). These results suggest that the LycRac protein might play an important role in the immune response against microorganisms in large yellow croaker. Crown Copyright © 2010. Published by Elsevier Ltd. All rights reserved.
Lehnert, Sigrid A; Reverter, Antonio; Byrne, Keren A; Wang, Yonghong; Nattrass, Greg S; Hudson, Nicholas J; Greenwood, Paul L
2007-01-01
Background The muscle fiber number and fiber composition of muscle is largely determined during prenatal development. In order to discover genes that are involved in determining adult muscle phenotypes, we studied the gene expression profile of developing fetal bovine longissimus muscle from animals with two different genetic backgrounds using a bovine cDNA microarray. Fetal longissimus muscle was sampled at 4 stages of myogenesis and muscle maturation: primary myogenesis (d 60), secondary myogenesis (d 135), as well as beginning (d 195) and final stages (birth) of functional differentiation of muscle fibers. All fetuses and newborns (total n = 24) were from Hereford dams and crossed with either Wagyu (high intramuscular fat) or Piedmontese (GDF8 mutant) sires, genotypes that vary markedly in muscle and compositional characteristics later in postnatal life. Results We obtained expression profiles of three individuals for each time point and genotype to allow comparisons across time and between sire breeds. Quantitative reverse transcription-PCR analysis of RNA from developing longissimus muscle was able to validate the differential expression patterns observed for a selection of differentially expressed genes, with one exception. We detected large-scale changes in temporal gene expression between the four developmental stages in genes coding for extracellular matrix and for muscle fiber structural and metabolic proteins. FSTL1 and IGFBP5 were two genes implicated in growth and differentiation that showed developmentally regulated expression levels in fetal muscle. An abundantly expressed gene with no functional annotation was found to be developmentally regulated in the same manner as muscle structural proteins. We also observed differences in gene expression profiles between the two different sire breeds. Wagyu-sired calves showed higher expression of fatty acid binding protein 5 (FABP5) RNA at birth. 
The developing longissimus muscle of fetuses carrying the Piedmontese mutation shows an emphasis on glycolytic muscle biochemistry and a large-scale up-regulation of the translational machinery at birth. We also document evidence for timing differences in differentiation events between the two breeds. Conclusion Taken together, these findings provide a detailed description of molecular events accompanying skeletal muscle differentiation in the bovine, as well as gene expression differences that may underpin the phenotype differences between the two breeds. In addition, this study has highlighted a non-coding RNA, which is abundantly expressed and developmentally regulated in bovine fetal muscle. PMID:17697390
Wu, Jun-Zheng; Liu, Qin; Geng, Xiao-Shan; Li, Kai-Mian; Luo, Li-Juan; Liu, Jin-Ping
2017-03-14
Cassava (Manihot esculenta Crantz) is a major crop extensively cultivated in the tropics as both an important source of calories and a promising source for biofuel production. Although stable gene expression has been used for transgenic breeding and gene function studies, a quick, easy and large-scale transformation platform is urgently needed for gene functional characterization, especially now that the full cassava genome has been sequenced. Fully expanded leaves from in vitro plantlets of Manihot esculenta were used to optimize the concentrations of cellulase R-10 and macerozyme R-10 for obtaining protoplasts with the highest yield and viability. Then, the optimum conditions (PEG4000 concentration and transfection time) were determined for cassava protoplast transient gene expression. In addition, the reliability of the established protocol was confirmed for subcellular protein localization. In this work we optimized the main influencing factors and developed an efficient mesophyll protoplast isolation and PEG-mediated transient gene expression system in cassava. A suitable enzyme digestion system was established with the combination of 1.6% cellulase R-10 and 0.8% macerozyme R-10 for 16 h of digestion in the dark at 25 °C, resulting in a high yield (4.4 × 10^7 protoplasts/g FW) and viability (92.6%) of mesophyll protoplasts. The maximum transfection efficiency (70.8%) was obtained by incubating the protoplast/vector DNA mixture with 25% PEG4000 for 10 min. We validated the applicability of the system for studying the subcellular localization of MeSTP7 (an H+/monosaccharide cotransporter) with our transient expression protocol and a heterologous Arabidopsis transient gene expression system. The resulting protocol will facilitate large-scale characterization of genes and pathways in cassava.
Hepatic gene expression profiling of 5′-AMP-induced hypometabolism in mice
Miki, Takao; Van Oort-Jansen, Anita; Matsumoto, Tomoko; Loose, David S.; Lee, Cheng Chi
2011-01-01
There is currently much interest in clinical applications of therapeutic hypothermia. Hypothermia can be a consequence of hypometabolism. We have recently established a procedure for the induction of a reversible deep hypometabolic state in mice using 5′-adenosine monophosphate (5′-AMP) in conjunction with moderate ambient temperature. The current study aims at investigating the impact of this technology at the gene expression level in a major metabolic organ, the liver. Our findings reveal that expression levels of the majority of genes in liver are not significantly altered by deep hypometabolism. However, among those affected by hypometabolism, more genes are differentially upregulated than downregulated both in a deep hypometabolic state and in the early arousal state. These altered gene expression levels during 5′-AMP induced hypometabolism are largely restored to normal levels within 2 days of the treatment. Our data also suggest that temporal control of circadian genes is largely stalled during deep hypometabolism. PMID:21224422
Expression profiling identifies novel Hh/Gli regulated genes in developing zebrafish embryos.
Bergeron, Sadie A.; Milla, Luis A.; Villegas, Rosario; Shen, Meng-Chieh; Burgess, Shawn M.; Allende, Miguel L.; Karlstrom, Rolf O.; Palma, Verónica
2008-01-01
The Hedgehog (Hh) signaling pathway plays critical instructional roles during embryonic development. Mis-regulation of Hh/Gli signaling is a major causative factor in human congenital disorders and in a variety of cancers. The zebrafish is a powerful genetic model for the study of Hh signaling during embryogenesis, as a large number of mutants have been identified affecting different components of the Hh/Gli signaling system. By performing global profiling of gene expression in different Hh/Gli gain- and loss-of-function scenarios we identified several known (e.g. ptc1 and nkx2.2a) as well as a large number of novel Hh regulated genes that are differentially expressed in embryos with altered Hh/Gli signaling function. By uncovering changes in tissue specific gene expression, we revealed new embryological processes that are influenced by Hh signaling. We thus provide a comprehensive survey of Hh/Gli regulated genes during embryogenesis and we identify new Hh-regulated genes that may be targets of mis-regulation during tumorogenesis. PMID:18055165
Tamazawa, Satoshi; Yamamoto, Kyosuke; Takasaki, Kazuto; Mitani, Yasuo; Hanada, Satoshi; Kamagata, Yoichi; Tamaki, Hideyuki
2016-01-01
We investigated the in situ gene expression profile of sulfur-turf microbial mats dominated by an uncultured large sausage-shaped Aquificae bacterium, a key metabolic player in sulfur-turfs in sulfidic hot springs. A reverse transcription-PCR analysis revealed that the genes responsible for sulfide, sulfite, and thiosulfate oxidation and carbon fixation via the reductive TCA cycle were continuously expressed in sulfur-turf mats taken at different sampling points, seasons, and years. These results suggest that the uncultured large sausage-shaped bacterium has the ability to grow chemolithoautotrophically and plays key roles as a primary producer in the sulfidic hot spring ecosystem in situ. PMID:27297893
Tamazawa, Satoshi; Yamamoto, Kyosuke; Takasaki, Kazuto; Mitani, Yasuo; Hanada, Satoshi; Kamagata, Yoichi; Tamaki, Hideyuki
2016-06-25
We investigated the in situ gene expression profile of sulfur-turf microbial mats dominated by an uncultured large sausage-shaped Aquificae bacterium, a key metabolic player in sulfur-turfs in sulfidic hot springs. A reverse transcription-PCR analysis revealed that the genes responsible for sulfide, sulfite, and thiosulfate oxidation and carbon fixation via the reductive TCA cycle were continuously expressed in sulfur-turf mats taken at different sampling points, seasons, and years. These results suggest that the uncultured large sausage-shaped bacterium has the ability to grow chemolithoautotrophically and plays key roles as a primary producer in the sulfidic hot spring ecosystem in situ.
Liu, Li-Zhi; Wu, Fang-Xiang; Zhang, Wen-Jun
2014-01-01
As an abstract mapping of the gene regulations in the cell, gene regulatory network is important to both biological research study and practical applications. The reverse engineering of gene regulatory networks from microarray gene expression data is a challenging research problem in systems biology. With the development of biological technologies, multiple time-course gene expression datasets might be collected for a specific gene network under different circumstances. The inference of a gene regulatory network can be improved by integrating these multiple datasets. It is also known that gene expression data may be contaminated with large errors or outliers, which may affect the inference results. A novel method, Huber group LASSO, is proposed to infer the same underlying network topology from multiple time-course gene expression datasets as well as to take the robustness to large error or outliers into account. To solve the optimization problem involved in the proposed method, an efficient algorithm which combines the ideas of auxiliary function minimization and block descent is developed. A stability selection method is adapted to our method to find a network topology consisting of edges with scores. The proposed method is applied to both simulation datasets and real experimental datasets. It shows that Huber group LASSO outperforms the group LASSO in terms of both areas under receiver operating characteristic curves and areas under the precision-recall curves. The convergence analysis of the algorithm theoretically shows that the sequence generated from the algorithm converges to the optimal solution of the problem. The simulation and real data examples demonstrate the effectiveness of the Huber group LASSO in integrating multiple time-course gene expression datasets and improving the resistance to large errors or outliers.
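The two ingredients named in this abstract can be illustrated with their textbook forms: the Huber loss, which is quadratic for small residuals and linear for large ones, and a group penalty that ties together the coefficients of the same edge across datasets. The sketch below assumes those standard definitions; the function names, the threshold `delta`, and the toy numbers are hypothetical, and the authors' block-descent solver and stability selection are not reproduced.

```python
import math


def squared_loss(residual):
    return 0.5 * residual * residual


def huber_loss(residual, delta=1.0):
    """Quadratic within |r| <= delta, linear beyond it (robust to outliers)."""
    r = abs(residual)
    if r <= delta:
        return 0.5 * r * r
    return delta * (r - 0.5 * delta)


def group_penalty(coefficients_by_dataset):
    """Sum over edges of the Euclidean norm of that edge's coefficients
    across datasets.

    coefficients_by_dataset[k][e] is the weight of edge e estimated from
    dataset k (an assumed layout for this illustration).
    """
    n_edges = len(coefficients_by_dataset[0])
    return sum(
        math.sqrt(sum(dataset[e] ** 2 for dataset in coefficients_by_dataset))
        for e in range(n_edges)
    )


# A small residual is treated alike by both losses; an outlier is not.
small = (squared_loss(0.5), huber_loss(0.5))
outlier = (squared_loss(10.0), huber_loss(10.0))

# Two datasets, two candidate edges; the second edge is zero in both.
penalty = group_penalty([[3.0, 0.0], [4.0, 0.0]])
```

Because the penalty acts on each edge's whole coefficient group, an edge tends to be kept or dropped in all datasets together, which matches the stated goal of recovering one shared topology, while the linear tail of the Huber loss limits the pull of any single large error.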
Yue, Runqing; Lu, Caixia; Sun, Tao; Peng, Tingting; Han, Xiaohua; Qi, Jianshuang; Yan, Shufeng; Tie, Shuanggui
2015-01-01
The calmodulin-binding transcription activators (CAMTA) play critical roles in plant growth and responses to environmental stimuli. However, how CAMTAs function in responses to abiotic and biotic stresses in maize (Zea mays L.) is largely unknown. In this study, we first identified all the CAMTA homologous genes in the whole genome of maize. The results showed that nine ZmCAMTA genes had highly diversified gene structures and tissue-specific expression patterns. Many ZmCAMTA genes displayed high expression levels in the roots. We then surveyed the distribution of stress-related cis-regulatory elements in the −1.5 kb promoter regions of ZmCAMTA genes. Notably, a large number of stress-related elements were present in the promoter regions of some ZmCAMTA genes, indicating a genetic basis for the stress-responsive expression of these genes. Quantitative real-time PCR was used to test the expression of ZmCAMTA genes under several abiotic stresses (drought, salt, and cold), various stress-related hormones [abscisic acid, auxin, salicylic acid (SA), and jasmonic acid] and biotic stress [rice black-streaked dwarf virus (RBSDV) infection]. Furthermore, the expression pattern of ZmCAMTA genes under RBSDV infection was analyzed to investigate their potential roles in the responses of different maize cultivars to RBSDV. The expression of most ZmCAMTA genes responded to both abiotic and biotic stresses. The data will help us to understand the roles of CAMTA-mediated Ca2+ signaling in maize tolerance to environmental stresses. PMID:26284092
Analysis of blood-based gene expression in idiopathic Parkinson disease.
Shamir, Ron; Klein, Christine; Amar, David; Vollstedt, Eva-Juliane; Bonin, Michael; Usenovic, Marija; Wong, Yvette C; Maver, Ales; Poths, Sven; Safer, Hershel; Corvol, Jean-Christophe; Lesage, Suzanne; Lavi, Ofer; Deuschl, Günther; Kuhlenbaeumer, Gregor; Pawlack, Heike; Ulitsky, Igor; Kasten, Meike; Riess, Olaf; Brice, Alexis; Peterlin, Borut; Krainc, Dimitri
2017-10-17
To examine whether gene expression analysis of a large-scale Parkinson disease (PD) patient cohort produces a robust blood-based PD gene signature compared to previous studies that have used relatively small cohorts (≤220 samples). Whole-blood gene expression profiles were collected from a total of 523 individuals. After preprocessing, the data contained 486 gene profiles (n = 205 PD, n = 233 controls, n = 48 other neurodegenerative diseases) that were partitioned into training, validation, and independent test cohorts to identify and validate a gene signature. Batch-effect reduction and cross-validation were performed to ensure signature reliability. Finally, functional and pathway enrichment analyses were applied to the signature to identify PD-associated gene networks. A gene signature of 100 probes that mapped to 87 genes, corresponding to 64 upregulated and 23 downregulated genes differentiating between patients with idiopathic PD and controls, was identified with the training cohort and successfully replicated in both an independent validation cohort (area under the curve [AUC] = 0.79, p = 7.13E-6) and a subsequent independent test cohort (AUC = 0.74, p = 4.2E-4). Network analysis of the signature revealed gene enrichment in pathways, including metabolism, oxidation, and ubiquitination/proteasomal activity, and misregulation of mitochondria-localized genes, including downregulation of COX4I1, ATP5A1, and VDAC3. We present a large-scale study of PD gene expression profiling. This work identifies a reliable blood-based PD signature and highlights the importance of large-scale patient cohorts in developing potential PD biomarkers. © 2017 American Academy of Neurology.
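The AUC figures reported in this abstract summarize how well a signature score separates patients from controls. One standard reading is the probability that a randomly chosen patient scores higher than a randomly chosen control, with ties counted as one half. The sketch below computes the AUC under that pairwise definition; the function name and the toy scores are assumptions and are unrelated to the study's data.

```python
def pairwise_auc(case_scores, control_scores):
    """Fraction of (case, control) pairs ranked correctly; ties count 0.5."""
    if not case_scores or not control_scores:
        raise ValueError("both groups need at least one score")
    wins = 0.0
    for case in case_scores:
        for control in control_scores:
            if case > control:
                wins += 1.0
            elif case == control:
                wins += 0.5
    return wins / (len(case_scores) * len(control_scores))


# Hypothetical signature scores for three patients and three controls.
toy_auc = pairwise_auc([0.9, 0.8, 0.4], [0.7, 0.3, 0.2])
```

An AUC of 0.5 corresponds to chance-level separation and 1.0 to perfect separation. The double loop is quadratic in cohort size, so a rank-based formula or an established library routine would be preferable for large cohorts.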
Gene expression profiling in the hippocampus of learned helpless and nonhelpless rats.
Kohen, R; Kirov, S; Navaja, G P; Happe, H Kevin; Hamblin, M W; Snoddy, J R; Neumaier, J F; Petty, F
2005-01-01
In the learned helplessness (LH) animal model of depression, failure to attempt escape from avoidable environmental stress, LH, indicates behavioral despair, whereas nonhelpless (NH) behavior reflects behavioral resilience to the effects of environmental stress. Comparing hippocampal gene expression with large-scale oligonucleotide microarrays, we found that stress-resilient (NH) rats, although behaviorally indistinguishable from controls, showed a distinct gene expression profile compared to LH, sham stressed, and naïve control animals. Genes that were confirmed as differentially expressed in the NH group by quantitative PCR strongly correlated in their levels of expression across all four animal groups. Differential expression could not be confirmed at the protein level. We identified several shared degenerate sequence motifs in the 3' untranslated region (3'UTR) of differentially expressed genes that could be a factor in this tight correlation of expression levels among differentially expressed genes.
Unstable genomes elevate transcriptome dynamics
Stevens, Joshua B.; Liu, Guo; Abdallah, Batoul Y.; Horne, Steven D.; Ye, Karen J.; Bremer, Steven W.; Ye, Christine J.; Krawetz, Stephen A.; Heng, Henry H.
2015-01-01
The challenge of identifying common expression signatures in cancer is well known; however, the reason behind it is largely unclear. Traditionally, variation in expression signatures has been attributed to technological problems, but recent evidence suggests that chromosome instability (CIN) and the resultant karyotypic heterogeneity may be a large contributing factor. Using a well-defined model of immortalization, we systematically compared the pattern of genome alteration and expression dynamics during somatic evolution. Co-measurement of global gene expression and karyotypic alteration throughout the immortalization process reveals that karyotype changes influence gene expression, as major structural and numerical karyotypic alterations result in large gene expression deviation. Replicate samples from stages with stable genomes are more similar to each other than are replicate samples with karyotypic heterogeneity. Karyotypic and gene expression change during immortalization is dynamic, as each stage of progression has a unique expression pattern. This was further verified by comparing global expression in two replicates grown in one flask with known karyotypes. Replicates with higher karyotypic instability were found to be less similar than replicates with stable karyotypes. These data illustrate that the karyotype, transcriptome, and transcriptome-determined pathways are in constant flux during somatic cellular evolution (particularly during the macroevolutionary phase) and that this flux is an inextricable feature of CIN and essential for cancer formation. The findings presented here underscore the importance of understanding the evolutionary process of cancer in order to design improved treatment modalities. PMID:24122714
Zhang, Zhang; Liu, Jingxing; Wu, Jiayan; Yu, Jun
2013-01-01
The regulation of gene expression is essential for eukaryotes, as it drives the processes of cellular differentiation and morphogenesis, leading to the creation of different cell types in multicellular organisms. RNA-Sequencing (RNA-Seq) provides researchers with a powerful toolbox for characterization and quantification of the transcriptome. Many different human tissue/cell transcriptome datasets from RNA-Seq technology are available in public data resources. The fundamental issue here is how to develop an effective analysis method to estimate expression pattern similarities between different tumor tissues and their corresponding normal tissues. We define the gene expression pattern from three directions: 1) expression breadth, which reflects gene expression on/off status, and mainly concerns ubiquitously expressed genes; 2) low/high or constant/variable expression genes, based on gene expression level and variation; and 3) the regulation of gene expression at the gene structure level. The cluster analysis indicates that gene expression pattern is more closely related to physiological condition than to tissue spatial distance. Two sets of human housekeeping (HK) genes are defined according to cell and tissue types, respectively. To characterize the gene expression pattern in terms of expression level and variation, we first apply an improved K-means algorithm and a gene expression variance model. We find that cancer-associated HK genes (HK genes specific to the cancer group but not the normal group) are expressed at higher levels and more variably in the cancer condition than in the normal condition. Cancer-associated HK genes tend to be AT-rich, are enriched in functions related to cell cycle regulation, and constitute some cancer signatures. The expression of large genes is also avoided in the cancer group. These studies will help us understand how patterns of gene expression differ among cell types, particularly in cancer. PMID:23382867
Identification and resolution of artifacts in the interpretation of imprinted gene expression.
Proudhon, Charlotte; Bourc'his, Déborah
2010-12-01
Genomic imprinting refers to genes that are epigenetically programmed in the germline to express exclusively or preferentially one allele in a parent-of-origin manner. Expression-based genome-wide screening for the identification of imprinted genes has failed to uncover a significant number of new imprinted genes, probably because of the high tissue- and developmental-stage specificity of imprinted gene expression. A very large number of technical and biological artifacts can also lead to erroneous evidence of imprinted gene expression. In this article, we focus on three common sources of potential confounding effects: (i) random monoallelic expression in monoclonal cell populations, (ii) genetically determined monoallelic expression and (iii) contamination or infiltration of embryonic tissues with maternal material. This last situation specifically applies to genes that appear to be maternally expressed in the placenta. Besides the use of reciprocal crosses, which are instrumental in confirming the parental specificity of expression, we provide additional methods for the detection and elimination of these situations that can be misinterpreted as cases of imprinted expression.
Pao, Sheng-Ying; Lin, Win-Li; Hwang, Ming-Jing
2006-01-01
Background Screening for differentially expressed genes on the genomic scale and comparative analysis of the expression profiles of orthologous genes between species to study gene function and regulation are becoming increasingly feasible. Expressed sequence tags (ESTs) are an excellent source of data for such studies using bioinformatic approaches because of the rich libraries and tremendous amount of data now available in the public domain. However, any large-scale EST-based bioinformatics analysis must deal with the heterogeneous, and often ambiguous, tissue and organ terms used to describe EST libraries. Results To deal with the issue of tissue source, in this work, we carefully screened and organized more than 8 million human and mouse ESTs into 157 human and 108 mouse tissue/organ categories, to which we applied an established statistical test using different p-value thresholds to identify genes differentially expressed in different tissues. Further analysis of the tissue distribution and level of expression of human and mouse orthologous genes showed that tissue-specific orthologs tended to have more similar expression patterns than those lacking significant tissue specificity. On the other hand, a number of orthologs were found to have significant disparity in their expression profiles, hinting at novel functions, divergent regulation, or new ortholog relationships. Conclusion Comprehensive statistics on the tissue-specific expression of human and mouse genes were obtained in this very large-scale, EST-based analysis. These statistical results have been organized into a database, freely accessible at our website, for easy searching of human and mouse tissue-specific genes and for investigating gene expression profiles in the context of comparative genomics. 
Comparative analysis showed that, although highly tissue-specific genes tend to exhibit similar expression profiles in human and mouse, there are significant exceptions, indicating that orthologous genes, while sharing basic genomic properties, could result in distinct phenotypes. PMID:16626500
Laffaire, Julien; Rivals, Isabelle; Dauphinot, Luce; Pasteau, Fabien; Wehrle, Rosine; Larrat, Benoit; Vitalis, Tania; Moldrich, Randal X; Rossier, Jean; Sinkus, Ralph; Herault, Yann; Dusart, Isabelle; Potier, Marie-Claude
2009-01-01
Background Down syndrome is a chromosomal disorder caused by the presence of three copies of chromosome 21. The mechanisms by which this aneuploidy produces the complex and variable phenotype observed in people with Down syndrome are still under discussion. Recent studies have demonstrated an increased transcript level of the three-copy genes with some dosage compensation or amplification for a subset of them. The impact of this gene dosage effect on the whole transcriptome is still debated and longitudinal studies assessing the variability among samples, tissues and developmental stages are needed. Results We thus designed a large scale gene expression study in mice (the Ts1Cje Down syndrome mouse model) in which we could measure the effects of trisomy 21 on a large number of samples (74 in total) in a tissue that is affected in Down syndrome (the cerebellum) and where we could quantify the defect during postnatal development in order to correlate gene expression changes to the phenotype observed. Statistical analysis of microarray data revealed a major gene dosage effect: for the three-copy genes as well as for a 2 Mb segment from mouse chromosome 12 that we show for the first time as being deleted in the Ts1Cje mice. This gene dosage effect impacts moderately on the expression of euploid genes (2.4 to 7.5% differentially expressed). Only 13 genes were significantly dysregulated in Ts1Cje mice at all four postnatal development stages studied from birth to 10 days after birth, and among them are 6 three-copy genes. The decrease in granule cell proliferation demonstrated in newborn Ts1Cje cerebellum was correlated with a major gene dosage effect on the transcriptome in dissected cerebellar external granule cell layer. Conclusion High throughput gene expression analysis in the cerebellum of a large number of samples of Ts1Cje and euploid mice has revealed a prevailing gene dosage effect on triplicated genes. 
Moreover, using an enriched cell population that is thought to be responsible for the cerebellar hypoplasia in Down syndrome, a global destabilization of gene expression was not detected. Altogether, these results strongly suggest that the three-copy genes are directly responsible for the phenotype present in the cerebellum. We provide here a short list of candidate genes. PMID:19331679
Govindaraj, Lekha; Gupta, Tania; Esvaran, Vijaya Gowri; Awasthi, Arvind Kumar; Ponnuvel, Kangayam M
2016-04-01
Sugar transporters play an essential role in controlling carbohydrate transport and are responsible for mediating the movement of sugars into cells. These genes exist as large multigene families within the insect genome. In insects, sugar transporters not only have a role in sugar transport, but may also act as receptors for virus entry. Genome-wide annotation of the silkworm Bombyx mori (B. mori) revealed 100 putative sugar transporter (BmST) genes, which exist as a large multigene family and were classified into 11 subfamilies through phylogenetic analysis. Chromosomes 27, 26 and 20 were found to possess the highest number of BmST paralogous genes, harboring 22, 7 and 6 genes, respectively. These genes occurred in clusters exhibiting the phenomenon of tandem gene duplication. The ovary, silk gland, hemocytes, midgut and Malpighian tubules were the tissues/cells enriched with BmST gene expression. The BmST gene BGIBMGA001498 had the maximum number of EST transcripts (134) and was expressed exclusively in the Malpighian tubule. The expression of EST transcripts of the BmST clustered genes on chromosome 27 was distributed in various tissues such as testis, ovary, silk gland, Malpighian tubule, maxillary galea, prothoracic gland, epidermis, fat body and midgut. Three sugar transporter genes (BmST) were constitutively expressed in the susceptible race and were downregulated upon BmNPV infection at 12 h post infection (hpi). The expression pattern of these three genes was validated through real-time PCR in the midgut tissues at different time intervals from 0 to 30 hpi. In the susceptible B. mori race, sugar transporter genes were constitutively expressed, making the host succumb to viral infection. Copyright © 2015 Elsevier B.V. All rights reserved.
An Oomycete CRN Effector Reprograms Expression of Plant HSP Genes by Targeting their Promoters
Song, Tianqiao; Ma, Zhenchuan; Shen, Danyu; Li, Qi; Li, Wanlin; Su, Liming; Ye, Tingyue; Zhang, Meixiang; Wang, Yuanchao; Dou, Daolong
2015-01-01
Oomycete pathogens produce a large number of CRN effectors to manipulate plant immune responses and promote infection. However, their functional mechanisms are largely unknown. Here, we identified a Phytophthora sojae CRN effector PsCRN108 which contains a putative DNA-binding helix-hairpin-helix (HhH) motif and acts in the plant cell nucleus. Silencing of the PsCRN108 gene reduced P. sojae virulence to soybean, while expression of the gene in Nicotiana benthamiana and Arabidopsis thaliana enhanced plant susceptibility to P. capsici. Moreover, PsCRN108 could inhibit expression of HSP genes in A. thaliana, N. benthamiana and soybean. Both the HhH motif and nuclear localization signal of this effector were required for its contribution to virulence and its suppression of HSP gene expression. Furthermore, we found that PsCRN108 targeted HSP promoters in an HSE- and HhH motif-dependent manner. PsCRN108 could inhibit the association of the HSE with the plant heat shock transcription factor AtHsfA1a, which initializes HSP gene expression in response to stress. Therefore, our data support a role for PsCRN108 as a nucleomodulin in down-regulating the expression of plant defense-related genes by directly targeting specific plant promoters. PMID:26714171
An Oomycete CRN Effector Reprograms Expression of Plant HSP Genes by Targeting their Promoters.
Song, Tianqiao; Ma, Zhenchuan; Shen, Danyu; Li, Qi; Li, Wanlin; Su, Liming; Ye, Tingyue; Zhang, Meixiang; Wang, Yuanchao; Dou, Daolong
2015-12-01
Oomycete pathogens produce a large number of CRN effectors to manipulate plant immune responses and promote infection. However, their functional mechanisms are largely unknown. Here, we identified a Phytophthora sojae CRN effector PsCRN108 which contains a putative DNA-binding helix-hairpin-helix (HhH) motif and acts in the plant cell nucleus. Silencing of the PsCRN108 gene reduced P. sojae virulence to soybean, while expression of the gene in Nicotiana benthamiana and Arabidopsis thaliana enhanced plant susceptibility to P. capsici. Moreover, PsCRN108 could inhibit expression of HSP genes in A. thaliana, N. benthamiana and soybean. Both the HhH motif and nuclear localization signal of this effector were required for its contribution to virulence and its suppression of HSP gene expression. Furthermore, we found that PsCRN108 targeted HSP promoters in an HSE- and HhH motif-dependent manner. PsCRN108 could inhibit the association of the HSE with the plant heat shock transcription factor AtHsfA1a, which initializes HSP gene expression in response to stress. Therefore, our data support a role for PsCRN108 as a nucleomodulin in down-regulating the expression of plant defense-related genes by directly targeting specific plant promoters.
Liscovitch, Noa; Bazak, Lily; Levanon, Erez Y; Chechik, Gal
2014-01-01
A-to-I RNA editing by adenosine deaminases acting on RNA is a post-transcriptional modification that is crucial for normal life and development in vertebrates. RNA editing has been shown to be very abundant in the human transcriptome, specifically at the primate-specific Alu elements. The functional role of this wide-spread effect is still not clear; it is believed that editing of transcripts is a mechanism for their down-regulation via processes such as nuclear retention or RNA degradation. Here we combine 2 neural gene expression datasets with genome-level editing information to examine the relation between the expression of ADAR genes with the expression of their target genes. Specifically, we computed the spatial correlation across structures of post-mortem human brains between ADAR and a large set of targets that were found to be edited in their Alu repeats. Surprisingly, we found that a large fraction of the edited genes are positively correlated with ADAR, opposing the assumption that editing would reduce expression. When considering the correlations between ADAR and its targets over development, 2 gene subsets emerge, positively correlated and negatively correlated with ADAR expression. Specifically, in embryonic time points, ADAR is positively correlated with many genes related to RNA processing and regulation of gene expression. These findings imply that the suggested mechanism of regulation of expression by editing is probably not a global one; ADAR expression does not have a genome wide effect reducing the expression of editing targets. It is possible, however, that RNA editing by ADAR in non-coding regions of the gene might be a part of a more complex expression regulation mechanism. PMID:25692240
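The correlation analysis described in this abstract can be illustrated with a small sketch. The abstract does not say which correlation coefficient was used, so Pearson's r is an assumption here, and the function names and the toy profiles are hypothetical. Each profile is a list of expression values measured across the same ordered set of brain structures.

```python
import math


def pearson(x, y):
    """Pearson correlation of two equal-length expression profiles."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        return float("nan")  # undefined for a constant profile
    return cov / math.sqrt(var_x * var_y)


def fraction_positive(adar_profile, target_profiles):
    """Fraction of targets whose profile correlates positively with ADAR.

    target_profiles: dict mapping a (hypothetical) gene name to its profile.
    Targets with an undefined correlation are ignored.
    """
    values = [pearson(adar_profile, p) for p in target_profiles.values()]
    valid = [r for r in values if not math.isnan(r)]
    if not valid:
        raise ValueError("no target has a defined correlation")
    return sum(r > 0 for r in valid) / len(valid)
```

If editing uniformly reduced target expression, one would expect this fraction to be small; the abstract reports the opposite tendency.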
Molecular Structure-Based Large-Scale Prediction of Chemical-Induced Gene Expression Changes.
Liu, Ruifeng; AbdulHameed, Mohamed Diwan M; Wallqvist, Anders
2017-09-25
The quantitative structure-activity relationship (QSAR) approach has been used to model a wide range of chemical-induced biological responses. However, it had not been utilized to model chemical-induced genomewide gene expression changes until very recently, owing to the complexity of training and evaluating a very large number of models. To address this issue, we examined the performance of a variable nearest neighbor (v-NN) method that uses information on near neighbors conforming to the principle that similar structures have similar activities. Using a data set of gene expression signatures of 13,150 compounds derived from cell-based measurements in the NIH Library of Integrated Network-based Cellular Signatures program, we were able to make predictions for 62% of the compounds in a 10-fold cross validation test, with a correlation coefficient of 0.61 between the predicted and experimentally derived signatures, a reproducibility rivaling that of high-throughput gene expression measurements. To evaluate the utility of the predicted gene expression signatures, we compared the predicted and experimentally derived signatures in their ability to identify drugs known to cause specific liver, kidney, and heart injuries. Overall, the predicted and experimentally derived signatures had similar receiver operating characteristics, whose areas under the curve ranged from 0.71 to 0.77 and 0.70 to 0.73, respectively, across the three organ injury models. However, detailed analyses of enrichment curves indicate that signatures predicted from multiple near neighbors outperformed those derived from experiments, suggesting that averaging information from near neighbors may help improve the signal from gene expression measurements. Our results demonstrate that the v-NN method can serve as a practical approach for modeling large-scale, genomewide, chemical-induced, gene expression changes.
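A nearest-neighbor prediction of this kind can be sketched as follows. The abstract does not give the v-NN formula, so the details here are assumptions: fingerprints are represented as sets of "on" bits, similarity is measured by Tanimoto distance, neighbors beyond a distance threshold `d0` are ignored, and the remaining neighbors are averaged with a Gaussian-like weight controlled by a smoothing factor `h`. The function names and default values are hypothetical. Returning `None` when no neighbor qualifies mirrors the fact that predictions were made for only part of the compounds.

```python
import math


def tanimoto_distance(a, b):
    """Tanimoto (Jaccard) distance between two fingerprints given as sets."""
    union = len(a | b)
    if union == 0:
        return 0.0  # two empty fingerprints are treated as identical
    return 1.0 - len(a & b) / union


def vnn_predict(query, training, d0=0.6, h=0.3):
    """Distance-weighted average of the signatures of qualified neighbors.

    training: list of (fingerprint_set, signature_list) pairs.
    d0, h: assumed distance threshold and smoothing factor.
    Returns None if no training compound lies within d0 of the query.
    """
    weighted_sum = None
    total_weight = 0.0
    for fingerprint, signature in training:
        d = tanimoto_distance(query, fingerprint)
        if d > d0:
            continue  # too dissimilar to count as a near neighbor
        w = math.exp(-((d / h) ** 2))
        if weighted_sum is None:
            weighted_sum = [0.0] * len(signature)
        for i, value in enumerate(signature):
            weighted_sum[i] += w * value
        total_weight += w
    if weighted_sum is None:
        return None
    return [s / total_weight for s in weighted_sum]
```

Because the number of contributing neighbors varies per query, a compound in a sparse region of chemical space gets no prediction rather than an unreliable one.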
DOE Office of Scientific and Technical Information (OSTI.GOV)
Walter, Pauline; Hoffmann, Xenia-Katharina; Ebeling, Britta
2013-05-24
Highlights: • We investigate reprogramming of gene expression in multinucleate single cells. • Cells of two differentiation control mutants are fused. • Fused cells proceed to alternative gene expression patterns. • The population of nuclei damps stochastic fluctuations in gene expression. • Dynamic processes of cellular reprogramming can be observed by repeated sampling of a cell. -- Abstract: Nonlinear dynamic processes involving the differential regulation of transcription factors are considered to impact the reprogramming of stem cells, germ cells, and somatic cells. Here, we fused two multinucleate plasmodial cells of Physarum polycephalum mutants defective in different sporulation control genes while being in different physiological states. The resulting heterokaryons established one of two significantly different expression patterns of marker genes while the plasmodial halves that were fused to each other synchronized spontaneously. Spontaneous synchronization suggests that switch-like control mechanisms spread over and finally control the entire plasmodium as a result of cytoplasmic mixing. Regulatory molecules due to the large volume of the vigorously streaming cytoplasm will define concentrations in acting on the population of nuclei and in the global setting of switches. Mixing of a large cytoplasmic volume is expected to damp stochasticity when individual nuclei deliver certain RNAs at low copy number into the cytoplasm. We conclude that spontaneous synchronization, the damping of molecular noise in gene expression by the large cytoplasmic volume, and the option to take multiple macroscopic samples from the same plasmodium provide unique options for studying the dynamics of cellular reprogramming at the single cell level.
Lan, Hui; Carson, Rachel; Provart, Nicholas J; Bonner, Anthony J
2007-09-21
Arabidopsis thaliana is the model species of current plant genomic research with a genome size of 125 Mb and approximately 28,000 genes. The function of half of these genes is currently unknown. The purpose of this study is to infer gene function in Arabidopsis using machine-learning algorithms applied to large-scale gene expression data sets, with the goal of identifying genes that are potentially involved in plant response to abiotic stress. Using in house and publicly available data, we assembled a large set of gene expression measurements for A. thaliana. Using those genes of known function, we first evaluated and compared the ability of basic machine-learning algorithms to predict which genes respond to stress. Predictive accuracy was measured using ROC50 and precision curves derived through cross validation. To improve accuracy, we developed a method for combining these classifiers using a weighted-voting scheme. The combined classifier was then trained on genes of known function and applied to genes of unknown function, identifying genes that potentially respond to stress. Visual evidence corroborating the predictions was obtained using electronic Northern analysis. Three of the predicted genes were chosen for biological validation. Gene knockout experiments confirmed that all three are involved in a variety of stress responses. The biological analysis of one of these genes (At1g16850) is presented here, where it is shown to be necessary for the normal response to temperature and NaCl. Supervised learning methods applied to large-scale gene expression measurements can be used to predict gene function. However, the ability of basic learning methods to predict stress response varies widely and depends heavily on how much dimensionality reduction is used. 
Our method of combining classifiers can improve the accuracy of such predictions - in this case, predictions of genes involved in stress response in plants - and it effectively chooses the appropriate amount of dimensionality reduction automatically. The method provides a useful means of identifying genes in A. thaliana that potentially respond to stress, and we expect it would be useful in other organisms and for other gene functions.
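The idea of combining base classifiers by weighted voting can be sketched as below. The abstract does not specify how the weights were chosen, so this is only an illustration under assumptions: each base classifier returns a score per gene, and each classifier's weight is taken to be some non-negative measure of its cross-validated performance. The function names and gene labels are hypothetical.

```python
def weighted_vote(scores, weights):
    """Combine per-classifier scores for one gene into a single score.

    scores: one score per base classifier for the same gene.
    weights: one non-negative weight per classifier (assumed here to
             reflect cross-validated performance).
    """
    if len(scores) != len(weights):
        raise ValueError("scores and weights must have the same length")
    total = sum(weights)
    if total <= 0:
        raise ValueError("at least one weight must be positive")
    return sum(s * w for s, w in zip(scores, weights)) / total


def rank_genes(gene_scores, weights):
    """Rank genes by combined score, highest first.

    gene_scores: dict mapping a gene name to its list of classifier scores.
    """
    combined = {g: weighted_vote(s, weights) for g, s in gene_scores.items()}
    return sorted(combined, key=combined.get, reverse=True)
```

A classifier with weight zero is effectively dropped, which is one way a voting scheme can discount base learners that perform poorly at a given level of dimensionality reduction.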
Rice Ribosomal Protein Large Subunit Genes and Their Spatio-temporal and Stress Regulation
Moin, Mazahar; Bakshi, Achala; Saha, Anusree; Dutta, Mouboni; Madhav, Sheshu M.; Kirti, P. B.
2016-01-01
Ribosomal proteins (RPs) are well-known for their role in mediating protein synthesis and maintaining the stability of the ribosomal complex, which includes small and large subunits. In the present investigation, in a genome-wide survey, we predicted that the large subunit of rice ribosomes is encoded by at least 123 genes including individual gene copies, distributed throughout the 12 chromosomes. We selected 34 candidate genes, each having 2–3 identical copies, for a detailed characterization of their gene structures, protein properties, cis-regulatory elements and comprehensive expression analysis. RPL proteins appear to be involved in interactions with other RP and non-RP proteins and their encoded RNAs have a higher content of alpha-helices in their predicted secondary structures. The majority of RPs have binding sites for metal and non-metal ligands. Native expression profiling of 34 ribosomal protein large (RPL) subunit genes in tissues covering the major stages of rice growth shows that they are predominantly expressed in vegetative tissues and seedlings followed by meiotically active tissues like flowers. The putative promoter regions of these genes also carry cis-elements that respond specifically to stress and signaling molecules. All the 34 genes responded differentially to the abiotic stress treatments. Phytohormone and cold treatments induced significant up-regulation of several RPL genes, while heat and H2O2 treatments down-regulated a majority of them. Furthermore, infection with a bacterial pathogen, Xanthomonas oryzae, which causes leaf blight also induced the expression of 80% of the RPL genes in leaves. Although the expression of RPL genes was detected in all the tissues studied, they are highly responsive to stress and signaling molecules indicating that their encoded proteins appear to have roles in stress amelioration besides house-keeping. 
This shows that the RPL gene family is a valuable resource for manipulation of stress tolerance in rice and other crops, which may be achieved by overexpressing and raising independent transgenic plants carrying the genes that became up-regulated significantly and instantaneously. PMID:27605933
Yang, Jian-Rong; Maclean, Calum J; Park, Chungoo; Zhao, Huabin; Zhang, Jianzhi
2017-09-01
It is commonly, although not universally, accepted that most intra and interspecific genome sequence variations are more or less neutral, whereas a large fraction of organism-level phenotypic variations are adaptive. Gene expression levels are molecular phenotypes that bridge the gap between genotypes and corresponding organism-level phenotypes. Yet, it is unknown whether natural variations in gene expression levels are mostly neutral or adaptive. Here we address this fundamental question by genome-wide profiling and comparison of gene expression levels in nine yeast strains belonging to three closely related Saccharomyces species and originating from five different ecological environments. We find that the transcriptome-based clustering of the nine strains approximates the genome sequence-based phylogeny irrespective of their ecological environments. Remarkably, only ∼0.5% of genes exhibit similar expression levels among strains from a common ecological environment, no greater than that among strains with comparable phylogenetic relationships but different environments. These and other observations strongly suggest that most intra and interspecific variations in yeast gene expression levels result from the accumulation of random mutations rather than environmental adaptations. This finding has profound implications for understanding the driving force of gene expression evolution, genetic basis of phenotypic adaptation, and general role of stochasticity in evolution. © The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Stekel, Dov J.; Sarti, Donatella; Trevino, Victor; Zhang, Lihong; Salmon, Mike; Buckley, Chris D.; Stevens, Mark; Pallen, Mark J.; Penn, Charles; Falciani, Francesco
2005-01-01
A key step in the analysis of microarray data is the selection of genes that are differentially expressed. Ideally, such experiments should be properly replicated in order to infer both technical and biological variability, and the data should be subjected to rigorous hypothesis tests to identify the differentially expressed genes. However, in microarray experiments involving the analysis of very large numbers of biological samples, replication is not always practical. Therefore, there is a need for a method to select differentially expressed genes in a rational way from insufficiently replicated data. In this paper, we describe a simple method that uses bootstrapping to generate an error model from a replicated pilot study that can be used to identify differentially expressed genes in subsequent large-scale studies on the same platform, but in which there may be no replicated arrays. The method builds a stratified error model that includes array-to-array variability, feature-to-feature variability and the dependence of error on signal intensity. We apply this model to the characterization of the host response in a model of bacterial infection of human intestinal epithelial cells. We demonstrate the effectiveness of error model based microarray experiments and propose this as a general strategy for a microarray-based screening of large collections of biological samples. PMID:15800204
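An intensity-stratified bootstrap error model of the general kind described here can be sketched as follows. This is a simplified illustration, not the authors' model: it assumes replicate pairs of log intensities from a pilot study, splits them into equal-sized intensity strata, and bootstraps a high quantile of the absolute replicate difference in each stratum to use as a threshold. The function names, the number of strata, and the quantile are all hypothetical choices.

```python
import random


def build_error_model(replicate_pairs, n_bins=3, n_boot=200, quantile=0.95, seed=0):
    """Estimate a per-stratum noise threshold from replicated pilot data.

    replicate_pairs: list of (log_intensity_rep1, log_intensity_rep2).
    Returns a list of (upper_mean_intensity, threshold), low to high intensity.
    """
    if not replicate_pairs:
        raise ValueError("need at least one replicate pair")
    rng = random.Random(seed)
    # Sort by mean intensity so each stratum covers a contiguous intensity range.
    records = sorted(((a + b) / 2.0, abs(a - b)) for a, b in replicate_pairs)
    n = len(records)
    model = []
    for b in range(n_bins):
        stratum = records[b * n // n_bins:(b + 1) * n // n_bins]
        if not stratum:
            continue
        diffs = [d for _, d in stratum]
        estimates = []
        for _ in range(n_boot):
            # Resample replicate differences with replacement.
            sample = sorted(rng.choice(diffs) for _ in diffs)
            idx = min(len(sample) - 1, int(quantile * len(sample)))
            estimates.append(sample[idx])
        model.append((stratum[-1][0], sum(estimates) / n_boot))
    return model


def is_differential(model, log_a, log_b):
    """Flag an unreplicated comparison whose difference exceeds the noise threshold."""
    mean = (log_a + log_b) / 2.0
    for upper, threshold in model:
        if mean <= upper:
            return abs(log_a - log_b) > threshold
    return abs(log_a - log_b) > model[-1][1]  # above the highest stratum
```

Stratifying by intensity matters because the same log difference can be within noise for weak signals yet clearly outside it for strong ones.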
Lavitrano, Marialuisa; Bacci, Maria Laura; Forni, Monica; Lazzereschi, Davide; Di Stefano, Carla; Fioretti, Daniela; Giancotti, Paola; Marfé, Gabriella; Pucci, Loredana; Renzi, Luigina; Wang, Hongjun; Stoppacciaro, Antonella; Stassi, Giorgio; Sargiacomo, Massimo; Sinibaldi, Paola; Turchi, Valeria; Giovannoni, Roberto; Della Casa, Giacinto; Seren, Eraldo; Rossi, Giancarlo
2002-01-01
A large number of hDAF transgenic pigs to be used for xenotransplantation research were generated by using sperm-mediated gene transfer (SMGT). The efficiency of transgenesis obtained with SMGT was much greater than with any other method. In the experiments reported, up to 80% of pigs had the transgene integrated into the genome. Most of the pigs carrying the hDAF gene transcribed it in a stable manner (64%). The great majority of pigs that transcribed the gene expressed the protein (83%). The hDAF gene was transmitted to progeny. Expression was stable and found in caveolae as it is in human cells. The expressed gene was functional based on in vitro experiments performed on peripheral blood mononuclear cells. These results show that our SMGT approach to transgenesis provides an efficient procedure for studies involving large animal models. PMID:12393815
Evolution and expression analysis of the grape (Vitis vinifera L.) WRKY gene family.
Guo, Chunlei; Guo, Rongrong; Xu, Xiaozhao; Gao, Min; Li, Xiaoqin; Song, Junyang; Zheng, Yi; Wang, Xiping
2014-04-01
WRKY proteins comprise a large family of transcription factors that play important roles in plant defence regulatory networks, including responses to various biotic and abiotic stresses. To date, no large-scale study of WRKY genes has been undertaken in grape (Vitis vinifera L.). In this study, a total of 59 putative grape WRKY genes (VvWRKY) were identified and renamed on the basis of their respective chromosome distribution. A multiple sequence alignment analysis using all predicted grape WRKY genes coding sequences, together with those from Arabidopsis thaliana and tomato (Solanum lycopersicum), indicated that the 59 VvWRKY genes can be classified into three main groups (I-III). An evaluation of the duplication events suggested that several WRKY genes arose before the divergence of the grape and Arabidopsis lineages. Moreover, expression profiles derived from semiquantitative PCR and real-time quantitative PCR analyses showed distinct expression patterns in various tissues and in response to different treatments. Four VvWRKY genes showed a significantly higher expression in roots or leaves, 55 responded to varying degrees to at least one abiotic stress treatment, and the expression of 38 were altered following powdery mildew (Erysiphe necator) infection. Most VvWRKY genes were downregulated in response to abscisic acid or salicylic acid treatments, while the expression of a subset was upregulated by methyl jasmonate or ethylene treatments.
Evolution and expression analysis of the grape (Vitis vinifera L.) WRKY gene family
Guo, Chunlei; Guo, Rongrong; Wang, Xiping
2014-01-01
WRKY proteins comprise a large family of transcription factors that play important roles in plant defence regulatory networks, including responses to various biotic and abiotic stresses. To date, no large-scale study of WRKY genes has been undertaken in grape (Vitis vinifera L.). In this study, a total of 59 putative grape WRKY genes (VvWRKY) were identified and renamed on the basis of their respective chromosome distribution. A multiple sequence alignment analysis using all predicted grape WRKY genes coding sequences, together with those from Arabidopsis thaliana and tomato (Solanum lycopersicum), indicated that the 59 VvWRKY genes can be classified into three main groups (I–III). An evaluation of the duplication events suggested that several WRKY genes arose before the divergence of the grape and Arabidopsis lineages. Moreover, expression profiles derived from semiquantitative PCR and real-time quantitative PCR analyses showed distinct expression patterns in various tissues and in response to different treatments. Four VvWRKY genes showed a significantly higher expression in roots or leaves, 55 responded to varying degrees to at least one abiotic stress treatment, and the expression of 38 were altered following powdery mildew (Erysiphe necator) infection. Most VvWRKY genes were downregulated in response to abscisic acid or salicylic acid treatments, while the expression of a subset was upregulated by methyl jasmonate or ethylene treatments. PMID:24510937
Principles of gene microarray data analysis.
Mocellin, Simone; Rossi, Carlo Riccardo
2007-01-01
The development of several gene expression profiling methods, such as comparative genomic hybridization (CGH), differential display, serial analysis of gene expression (SAGE), and gene microarray, together with the sequencing of the human genome, has provided an opportunity to monitor and investigate the complex cascade of molecular events leading to tumor development and progression. The availability of such large amounts of information has shifted the attention of scientists towards a nonreductionist approach to biological phenomena. High throughput technologies can be used to follow changing patterns of gene expression over time. Among them, gene microarray has become prominent because it is easier to use, does not require large-scale DNA sequencing, and allows for the parallel quantification of thousands of genes from multiple samples. Gene microarray technology is rapidly spreading worldwide and has the potential to drastically change the therapeutic approach to patients affected with tumor. Therefore, it is of paramount importance for both researchers and clinicians to know the principles underlying the analysis of the huge amount of data generated with microarray technology.
Galfalvy, Hanga C; Erraji-Benchekroun, Loubna; Smyrniotopoulos, Peggy; Pavlidis, Paul; Ellis, Steven P; Mann, J John; Sibille, Etienne; Arango, Victoria
2003-01-01
Background Genomic studies of complex tissues pose unique analytical challenges for assessment of data quality, performance of statistical methods used for data extraction, and detection of differentially expressed genes. Ideally, to assess the accuracy of gene expression analysis methods, one needs a set of genes which are known to be differentially expressed in the samples and which can be used as a "gold standard". We introduce the idea of using sex-chromosome genes as an alternative to spiked-in control genes or simulations for assessment of microarray data and analysis methods. Results Expression of sex-chromosome genes was used as a true internal biological control to compare alternate probe-level data extraction algorithms (Microarray Suite 5.0 [MAS5.0], Model Based Expression Index [MBEI] and Robust Multi-array Average [RMA]), to assess microarray data quality and to establish some statistical guidelines for analyzing large-scale gene expression. These approaches were implemented on a large new dataset of human brain samples. RMA-generated gene expression values were markedly less variable and more reliable than MAS5.0 and MBEI-derived values. A statistical technique controlling the false discovery rate was applied to adjust for multiple testing, as an alternative to the Bonferroni method, and showed no evidence of false negative results. Fourteen probesets, representing nine Y- and two X-chromosome linked genes, displayed significant sex differences in brain prefrontal cortex gene expression. Conclusion In this study, we have demonstrated the use of sex genes as true biological internal controls for genomic analysis of complex tissues, and suggested analytical guidelines for testing alternate oligonucleotide microarray data extraction protocols and for adjusting multiple statistical analysis of differentially expressed genes.
Our results also provided evidence for sex differences in gene expression in the brain prefrontal cortex, supporting the notion of a putative direct role of sex-chromosome genes in differentiation and maintenance of sexual dimorphism of the central nervous system. Importantly, these analytical approaches are applicable to all microarray studies that include male and female human or animal subjects. PMID:12962547
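The contrast between Bonferroni correction and false discovery rate control can be made concrete with a short sketch. The abstract does not name the FDR procedure, so the Benjamini-Hochberg step-up procedure is used here as an assumption; the function names are hypothetical.

```python
def bonferroni(p_values, alpha=0.05):
    """Reject a hypothesis only if its p-value is at most alpha / m."""
    m = len(p_values)
    return [p <= alpha / m for p in p_values]


def benjamini_hochberg(p_values, alpha=0.05):
    """Benjamini-Hochberg step-up procedure controlling the FDR at level alpha.

    Returns one boolean per input p-value, in the original order.
    """
    m = len(p_values)
    order = sorted(range(m), key=lambda i: p_values[i])
    # Find the largest rank k with p_(k) <= alpha * k / m.
    cutoff_rank = 0
    for rank, i in enumerate(order, start=1):
        if p_values[i] <= alpha * rank / m:
            cutoff_rank = rank
    rejected = [False] * m
    for rank, i in enumerate(order, start=1):
        if rank <= cutoff_rank:
            rejected[i] = True
    return rejected
```

With p-values 0.03, 0.001, 0.5, 0.014 and 0.012, Bonferroni rejects only the second hypothesis, while Benjamini-Hochberg rejects all but the third, which illustrates why an FDR procedure is less prone to false negatives when many genes are tested.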
Galfalvy, Hanga C; Erraji-Benchekroun, Loubna; Smyrniotopoulos, Peggy; Pavlidis, Paul; Ellis, Steven P; Mann, J John; Sibille, Etienne; Arango, Victoria
2003-09-08
Genomic studies of complex tissues pose unique analytical challenges for assessment of data quality, performance of statistical methods used for data extraction, and detection of differentially expressed genes. Ideally, to assess the accuracy of gene expression analysis methods, one needs a set of genes which are known to be differentially expressed in the samples and which can be used as a "gold standard". We introduce the idea of using sex-chromosome genes as an alternative to spiked-in control genes or simulations for assessment of microarray data and analysis methods. Expression of sex-chromosome genes was used as a true internal biological control to compare alternate probe-level data extraction algorithms (Microarray Suite 5.0 [MAS5.0], Model Based Expression Index [MBEI] and Robust Multi-array Average [RMA]), to assess microarray data quality and to establish some statistical guidelines for analyzing large-scale gene expression. These approaches were implemented on a large new dataset of human brain samples. RMA-generated gene expression values were markedly less variable and more reliable than MAS5.0 and MBEI-derived values. A statistical technique controlling the false discovery rate was applied to adjust for multiple testing, as an alternative to the Bonferroni method, and showed no evidence of false negative results. Fourteen probesets, representing nine Y- and two X-chromosome linked genes, displayed significant sex differences in brain prefrontal cortex gene expression. In this study, we have demonstrated the use of sex genes as true biological internal controls for genomic analysis of complex tissues, and suggested analytical guidelines for testing alternate oligonucleotide microarray data extraction protocols and for adjusting multiple statistical analysis of differentially expressed genes.
Our results also provided evidence for sex differences in gene expression in the brain prefrontal cortex, supporting the notion of a putative direct role of sex-chromosome genes in differentiation and maintenance of sexual dimorphism of the central nervous system. Importantly, these analytical approaches are applicable to all microarray studies that include male and female human or animal subjects.
Clustering cancer gene expression data by projective clustering ensemble
Yu, Xianxue; Yu, Guoxian
2017-01-01
Gene expression data analysis has paramount implications for gene treatments, cancer diagnosis and other domains. Clustering is an important and promising tool to analyze gene expression data. Gene expression data is often characterized by a large number of genes but limited samples, so various projective clustering and ensemble techniques have been suggested to address these challenges. However, it is rather challenging to combine these two kinds of techniques to avoid the curse of dimensionality and to boost the performance of gene expression data clustering. In this paper, we employ a projective clustering ensemble (PCE) to integrate the advantages of projective clustering and ensemble clustering, and to avoid the dilemma of combining multiple projective clusterings. Our experimental results on publicly available cancer gene expression data show that PCE can improve the quality of clustering gene expression data by at least 4.5% (on average) over other related techniques, including dimensionality reduction based single clustering and ensemble approaches. The empirical study demonstrates that, to further boost the performance of clustering cancer gene expression data, it is necessary and promising to combine projective clustering with ensemble clustering. PCE can serve as an effective alternative technique for clustering gene expression data. PMID:28234920
Honey Bee Aggression Supports a Link Between Gene Regulation and Behavioral Evolution
USDA-ARS's Scientific Manuscript database
A prominent theory holds that animal phenotypes arise by evolutionary changes in the regulation of gene expression. Emerging from studies of animal development, evidence for this theory consists largely of differences in temporal or spatial patterns of gene expression that are related to morphologi...
Long-Range Chromosome Interactions Mediated by Cohesin Shape Circadian Gene Expression
Xu, Yichi; Guo, Weimin; Li, Ping; Zhang, Yan; Zhao, Meng; Fan, Zenghua; Zhao, Zhihu; Yan, Jun
2016-01-01
Mammalian circadian rhythm is established by the negative feedback loops consisting of a set of clock genes, which lead to the circadian expression of thousands of downstream genes in vivo. As genome-wide transcription is organized under the high-order chromosome structure, it is largely uncharted how circadian gene expression is influenced by chromosome architecture. We focus on the function of chromatin structure proteins cohesin as well as CTCF (CCCTC-binding factor) in circadian rhythm. Using circular chromosome conformation capture sequencing, we systematically examined the interacting loci of a Bmal1-bound super-enhancer upstream of a clock gene Nr1d1 in mouse liver. These interactions are largely stable in the circadian cycle and cohesin binding sites are enriched in the interactome. Global analysis showed that cohesin-CTCF co-binding sites tend to insulate the phases of circadian oscillating genes while cohesin-non-CTCF sites are associated with high circadian rhythmicity of transcription. A model integrating the effects of cohesin and CTCF markedly improved the mechanistic understanding of circadian gene expression. Further experiments in cohesin knockout cells demonstrated that cohesin is required at least in part for driving the circadian gene expression by facilitating the enhancer-promoter looping. This study provided a novel insight into the relationship between circadian transcriptome and the high-order chromosome structure. PMID:27135601
General statistics of stochastic process of gene expression in eukaryotic cells.
Kuznetsov, V A; Knott, G D; Bonner, R F
2002-01-01
Thousands of genes are expressed at such very low levels (≤1 copy per cell) that global gene expression analysis of rarer transcripts remains problematic. Ambiguity in identification of rarer transcripts creates considerable uncertainty in fundamental questions such as the total number of genes expressed in an organism and the biological significance of rarer transcripts. Knowing the distribution of the true number of genes expressed at each level and the corresponding gene expression level probability function (GELPF) could help resolve these uncertainties. We found that all observed large-scale gene expression data sets in yeast, mouse, and human cells follow a Pareto-like distribution model skewed by many low-abundance transcripts. A novel stochastic model of the gene expression process predicts the universality of the GELPF both across different cell types within a multicellular organism and across different organisms. This model allows us to predict the frequency distribution of all gene expression levels within a single cell and to estimate the number of expressed genes in a single cell and in a population of cells. A random "basal" transcription mechanism for protein-coding genes in all or almost all eukaryotic cell types is predicted. This fundamental mechanism might enhance the expression of rarely expressed genes and, thus, provide a basic level of phenotypic diversity, adaptability, and random monoallelic expression in cell populations. PMID:12136033
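What a Pareto-like distribution skewed toward low-abundance transcripts looks like can be sketched with a discrete power-law probability function. The functional form and the parameters `k` and `b` below are assumptions made for illustration; the abstract does not give the GELPF formula or fitted values.

```python
def pareto_like_pmf(max_level, k=1.0, b=0.5):
    """Discrete Pareto-like probabilities over expression levels
    1..max_level, with weight (m + b) ** -(k + 1) at level m.

    k and b are hypothetical shape parameters, not fitted values.
    """
    weights = [(m + b) ** -(k + 1.0) for m in range(1, max_level + 1)]
    total = sum(weights)
    return [w / total for w in weights]


pmf = pareto_like_pmf(1000)
# Share of genes at the five lowest expression levels under this toy model.
low_abundance_share = sum(pmf[:5])
```

Under such a model most of the probability mass sits at the lowest expression levels, which matches the qualitative picture of many rare transcripts described in the abstract.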
Raaphorst, Frank M.; Vermeer, Maarten; Fieret, Elly; Blokzijl, Tjasso; Dukers, Danny; Sewalt, Richard G.A.B.; Otte, Arie P.; Willemze, Rein; Meijer, Chris J.L.M.
2004-01-01
Polycomb-group (PcG) genes preserve cell identity by gene silencing, and contribute to regulation of lymphopoiesis and malignant transformation. We show that primary nodal large B-cell lymphomas (LBCLs), and secondary cutaneous deposits from such lymphomas, abnormally express the BMI-1, RING1, and HPH1 PcG genes in cycling neoplastic cells. By contrast, tumor cells in primary cutaneous LBCLs lacked BMI-1 expression, whereas RING1 was variably detected. Lack of BMI-1 expression was characteristic for primary cutaneous LBCLs, because other primary extranodal LBCLs originating from brain, testes, and stomach were BMI-1-positive. Expression of HPH1 was rarely detected in primary cutaneous LBCLs of the head or trunk and abundant in primary cutaneous LBCLs of the legs, which fits well with its earlier recognition as a distinct clinical pathological entity with different clinical behavior. We conclude that clinically defined subclasses of primary LBCLs display site-specific abnormal expression patterns of PcG genes of the HPC-HPH/PRC1 PcG complex. Some of these patterns (such as the expression profile of BMI-1) may be diagnostically relevant. We propose that distinct expression profiles of PcG genes result in abnormal formation of HPC-HPH/PRC1 PcG complexes, and that this contributes to lymphomagenesis and different clinical behavior of clinically defined LBCLs. PMID:14742259
Gene-expression signatures of Atlantic salmon's plastic life cycle.
Aubin-Horth, Nadia; Letcher, Benjamin H; Hofmann, Hans A
2009-09-15
How genomic expression differs as a function of life history variation is largely unknown. Atlantic salmon exhibits extreme alternative life histories. We defined the gene-expression signatures of wild-caught salmon at two different life stages by comparing the brain expression profiles of mature sneaker males and immature males, and early migrants and late migrants. In addition to life-stage-specific signatures, we discovered a surprisingly large gene set that was differentially regulated, at similar magnitudes yet in opposite direction, in both life history transitions. We suggest that this co-variation is not a consequence of many independent cellular and molecular switches in the same direction but rather represents the molecular equivalent of a physiological shift orchestrated by one or very few master regulators.
Vicente, Juan J; Galardi-Castilla, María; Escalante, Ricardo; Sastre, Leandro
2008-01-03
The social amoeba Dictyostelium discoideum executes a multicellular development program upon starvation. This morphogenetic process requires the differential regulation of a large number of genes and is coordinated by extracellular signals. The MADS-box transcription factor SrfA is required for several stages of development, including slug migration and spore terminal differentiation. Subtractive hybridization allowed the isolation of a gene, sigN (SrfA-induced gene N), that was dependent on the transcription factor SrfA for expression at the slug stage of development. Homology searches detected the existence of a large family of sigN-related genes in the Dictyostelium discoideum genome. The 13 most similar genes are grouped in two regions of chromosome 2 and have been named Group1 and Group2 sigN genes. The putative encoded proteins are 87-89 amino acids long. All these genes have a similar structure, composed of a first exon containing a 13-nucleotide-long open reading frame and a second exon comprising the remainder of the putative coding region. The expression of these genes is induced at 10 hours of development. Analyses of their promoter regions indicate that these genes are expressed in the prestalk region of developing structures. The addition of antibodies raised against SigN Group 2 proteins induced disintegration of multi-cellular structures at the mound stage of development. A large family of genes coding for small proteins has been identified in D. discoideum. Two groups of very similar genes from this family have been shown to be specifically expressed in prestalk cells during development. Functional studies using antibodies raised against Group 2 SigN proteins indicate that these genes could play a role during multicellular development.
Cao, Huojun; Amendt, Brad A
2016-11-01
Developmental dental anomalies are common forms of congenital defects. The molecular mechanisms of dental anomalies are poorly understood. Systematic approaches such as clustering genes based on similar expression patterns could identify novel genes involved in dental anomalies and provide a framework for understanding molecular regulatory mechanisms of these genes during tooth development (odontogenesis). A Python package (pySAPC) implementing a sparse affinity propagation clustering algorithm for large datasets was developed. Whole-genome pair-wise similarity was calculated from expression patterns across 45 microarrays covering several stages of odontogenesis. pySAPC identified 743 gene clusters based on expression pattern similarity during mouse tooth development. Three clusters are significantly enriched for genes associated with dental anomalies (with FDR <0.1). The three clusters of genes have distinct expression patterns during odontogenesis. Clustering genes based on similar expression profiles recovered several known regulatory relationships for genes involved in odontogenesis, as well as many novel genes that may be involved with the same genetic pathways as genes that have already been shown to contribute to dental defects. By using a sparse similarity matrix, pySAPC uses much less memory and CPU time compared with the original affinity propagation program that uses a full similarity matrix. This Python package will be useful for many applications where dataset(s) are too large to use a full similarity matrix. This article is part of a Special Issue entitled "System Genetics" Guest Editor: Dr. Yudong Cai and Dr. Tao Huang. Copyright © 2016. Published by Elsevier B.V.
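The memory saving from a sparse similarity matrix, as described in the abstract above, can be sketched by storing only gene pairs whose expression similarity exceeds a cutoff. This does not use or reproduce the pySAPC API; the similarity measure (Pearson correlation), the threshold, the function names and the toy profiles are all assumptions for illustration.

```python
from math import sqrt


def pearson(x, y):
    """Pearson correlation of two equal-length, non-constant profiles."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / sqrt(var_x * var_y)


def sparse_similarity(profiles, threshold=0.9):
    """Keep only gene pairs with correlation >= threshold.

    profiles: dict mapping gene name -> list of expression values.
    Returns a dict keyed by (gene_a, gene_b) with gene_a < gene_b, so
    storage grows with the number of retained pairs, not with n squared.
    """
    genes = sorted(profiles)
    similarities = {}
    for i, gene_a in enumerate(genes):
        for gene_b in genes[i + 1:]:
            r = pearson(profiles[gene_a], profiles[gene_b])
            if r >= threshold:
                similarities[(gene_a, gene_b)] = r
    return similarities


# Hypothetical expression profiles over four developmental stages.
toy_profiles = {
    "geneA": [1.0, 2.0, 3.0, 4.0],
    "geneB": [2.0, 4.0, 6.0, 8.5],
    "geneC": [4.0, 3.0, 2.0, 1.0],
    "geneD": [1.0, 3.0, 2.0, 4.0],
}
sparse = sparse_similarity(toy_profiles)
```

Here only one of the six possible pairs is retained. A sparse affinity propagation implementation would pass messages only along such retained pairs.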
The low noise limit in gene expression
Dar, Roy D.; Weinberger, Leor S.; Cox, Chris D.; ...
2015-10-21
Protein noise measurements are increasingly used to elucidate biophysical parameters. Unfortunately noise analyses are often at odds with directly measured parameters. Here we show that these inconsistencies arise from two problematic analytical choices: (i) the assumption that protein translation rate is invariant for different proteins of different abundances, which has inadvertently led to (ii) the assumption that a large constitutive extrinsic noise sets the low noise limit in gene expression. While growing evidence suggests that transcriptional bursting may set the low noise limit, variability in translational bursting has been largely ignored. We show that genome-wide systematic variation in translational efficiency can, and in the case of E. coli does, control the low noise limit in gene expression. Therefore constitutive extrinsic noise is small and only plays a role in the absence of a systematic variation in translational efficiency. Lastly, these results show the existence of two distinct expression noise patterns: (1) a global noise floor uniformly imposed on all genes by expression bursting; and (2) high noise distributed to only a select group of genes.
Control of developmentally primed erythroid genes by combinatorial co-repressor actions
Stadhouders, Ralph; Cico, Alba; Stephen, Tharshana; Thongjuea, Supat; Kolovos, Petros; Baymaz, H. Irem; Yu, Xiao; Demmers, Jeroen; Bezstarosti, Karel; Maas, Alex; Barroca, Vilma; Kockx, Christel; Ozgur, Zeliha; van Ijcken, Wilfred; Arcangeli, Marie-Laure; Andrieu-Soler, Charlotte; Lenhard, Boris; Grosveld, Frank; Soler, Eric
2015-01-01
How transcription factors (TFs) cooperate within large protein complexes to allow rapid modulation of gene expression during development is still largely unknown. Here we show that the key haematopoietic LIM-domain-binding protein-1 (LDB1) TF complex contains several activator and repressor components that together maintain an erythroid-specific gene expression programme primed for rapid activation until differentiation is induced. A combination of proteomics, functional genomics and in vivo studies presented here identifies known and novel co-repressors, most notably the ETO2 and IRF2BP2 proteins, involved in maintaining this primed state. The ETO2–IRF2BP2 axis, interacting with the NCOR1/SMRT co-repressor complex, suppresses the expression of the vast majority of archetypical erythroid genes and pathways until its decommissioning at the onset of terminal erythroid differentiation. Our experiments demonstrate that multimeric regulatory complexes feature a dynamic interplay between activating and repressing components that determines lineage-specific gene expression and cellular differentiation. PMID:26593974
SEGEL: A Web Server for Visualization of Smoking Effects on Human Lung Gene Expression.
Xu, Yan; Hu, Brian; Alnajm, Sammy S; Lu, Yin; Huang, Yangxin; Allen-Gipson, Diane; Cheng, Feng
2015-01-01
Cigarette smoking is a major cause of death worldwide resulting in over six million deaths per year. Cigarette smoke contains complex mixtures of chemicals that are harmful to nearly all organs of the human body, especially the lungs. Cigarette smoking is considered the major risk factor for many lung diseases, particularly chronic obstructive pulmonary diseases (COPD) and lung cancer. However, the underlying molecular mechanisms of smoking-induced lung injury associated with these lung diseases still remain largely unknown. Expression microarray techniques have been widely applied to detect the effects of smoking on gene expression in different human cells in the lungs. These projects have provided a lot of useful information for researchers to understand the potential molecular mechanism(s) of smoke-induced pathogenesis. However, a user-friendly web server that would allow scientists to quickly query these data sets and compare the smoking effects on gene expression across different cells had not yet been established. For that reason, we have integrated eight public expression microarray data sets from trachea epithelial cells, large airway epithelial cells, small airway epithelial cells, and alveolar macrophages into an online web server called SEGEL (Smoking Effects on Gene Expression of Lung). Users can query gene expression patterns across these cells from smokers and nonsmokers by gene symbols, and find the effects of smoking on the gene expression of lungs from this web server. Sex difference in response to smoking is also shown. The relationship between gene expression and cigarette consumption was calculated and is shown in the server. The current version of SEGEL web server contains 42,400 annotated gene probe sets represented on the Affymetrix Human Genome U133 Plus 2.0 platform. SEGEL will be an invaluable resource for researchers interested in the effects of smoking on gene expression in the lungs. 
The server also provides useful information for drug development against smoking-related diseases. The SEGEL web server is available online at http://www.chengfeng.info/smoking_database.html.
Kleene, Kenneth C
2005-01-01
This review proposes that the peculiar patterns of gene expression in spermatogenic cells are the consequence of powerful evolutionary forces known as sexual selection. Sexual selection is generally characterized by intense competition of males for females, an enormous variety of the strategies to maximize male reproductive success, exaggerated male traits at all levels of biological organization, co-evolution of sexual traits in males and females, and conflict between the sexual advantage of the male trait and the reproductive fitness of females and the individual fitness of both sexes. In addition, spermatogenesis is afflicted by selfish genes that promote their transmission to progeny while causing deleterious effects. Sexual selection, selfish genes, and genetic conflict provide compelling explanations for many atypical features of gene expression in spermatogenic cells including the gross overexpression of certain mRNAs, transcripts encoding truncated proteins that cannot carry out basic functions of the proteins encoded by the same genes in somatic cells, the large number of gene families containing paralogous genes encoding spermatogenic cell-specific isoforms, the large number of testis-cancer-associated genes that are expressed only in spermatogenic cells and malignant cells, and the overbearing role of Sertoli cells in regulating the number and quality of spermatozoa.
Jiang, Zhenhong; He, Fei; Zhang, Ziding
2017-07-01
Through large-scale transcriptional data analyses, we highlighted the importance of plant metabolism in plant immunity and identified 26 metabolic pathways that were frequently influenced by the infection of 14 different pathogens. Reprogramming of plant metabolism is a common phenomenon in plant defense responses. Currently, a large number of transcriptional profiles of infected tissues in Arabidopsis (Arabidopsis thaliana) have been deposited in public databases, which provides a great opportunity to understand the expression patterns of metabolic pathways during plant defense responses at the systems level. Here, we performed a large-scale transcriptome analysis based on 135 previously published expression samples, including 14 different pathogens, to explore the expression pattern of Arabidopsis metabolic pathways. Overall, metabolic genes are significantly changed in expression during plant defense responses. Upregulated metabolic genes are enriched in defense responses, and downregulated genes are enriched in photosynthesis, fatty acid and lipid metabolic processes. Gene set enrichment analysis (GSEA) identifies 26 frequently differentially expressed metabolic pathways (FreDE_Paths) that are differentially expressed in more than 60% of infected samples. These pathways are involved in the generation of energy, fatty acid and lipid metabolism as well as secondary metabolite biosynthesis. Clustering analysis based on the expression levels of these 26 metabolic pathways clearly distinguishes infected and control samples, further suggesting the importance of these metabolic pathways in plant defense responses. By comparing with FreDE_Paths from abiotic stresses, we find that the expression patterns of 26 FreDE_Paths from biotic stresses are more consistent across different infected samples. By investigating the expression correlation between transcription factors (TFs) and FreDE_Paths, we identify several notable relationships. 
Collectively, the current study will deepen our understanding of plant metabolism in plant immunity and provide new insights into disease-resistant crop improvement.
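The selection rule for FreDE_Paths described above (pathways differentially expressed in more than 60% of infected samples) can be sketched as a simple frequency filter. The per-sample pathway calls, the sample and pathway names, and the function name are hypothetical; the enrichment step itself (GSEA) is assumed to have been run beforehand and is not reproduced here.

```python
def frequently_de_pathways(de_calls, min_fraction=0.6):
    """Return pathways called differentially expressed in more than
    min_fraction of samples.

    de_calls: dict mapping sample id -> iterable of pathway names that
    were called differentially expressed in that sample.
    """
    n_samples = len(de_calls)
    counts = {}
    for pathways in de_calls.values():
        for pathway in set(pathways):  # count each pathway once per sample
            counts[pathway] = counts.get(pathway, 0) + 1
    return sorted(p for p, c in counts.items() if c / n_samples > min_fraction)


# Hypothetical calls for five infected samples.
toy_calls = {
    "sample1": ["lipid metabolism", "photosynthesis"],
    "sample2": ["lipid metabolism", "photosynthesis"],
    "sample3": ["lipid metabolism", "photosynthesis", "glycolysis"],
    "sample4": ["lipid metabolism"],
    "sample5": [],
}
frequent = frequently_de_pathways(toy_calls)
```

In this toy input "lipid metabolism" is called in 4 of 5 samples and passes the filter, while "photosynthesis" is called in exactly 60% and is excluded because the rule is a strict "more than".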
Mating Changes Sexually Dimorphic Gene Expression in the Seed Beetle Callosobruchus maculatus.
Immonen, Elina; Sayadi, Ahmed; Bayram, Helen; Arnqvist, Göran
2017-03-01
Sexually dimorphic phenotypes arise largely from sex-specific gene expression, which has mainly been characterized in sexually naïve adults. However, we expect sexual dimorphism in transcription to be dynamic and dependent on factors such as reproductive status. Mating induces many behavioral and physiological changes distinct to each sex and is therefore expected to activate regulatory changes in many sex-biased genes. Here, we first characterized sexual dimorphism in gene expression in Callosobruchus maculatus seed beetles. We then examined how females and males respond to mating and how it affects sex-biased expression, both in sex-limited (abdomen) and sex-shared (head and thorax) tissues. Mating responses were largely sex-specific and, as expected, females showed more genes responding compared with males (∼2,000 vs. ∼300 genes in the abdomen, ∼500 vs. ∼400 in the head and thorax, respectively). Of the sex-biased genes present in virgins, 16% (1,041 genes) in the abdomen and 17% (243 genes) in the head and thorax altered their relative expression between the sexes as a result of mating. Sex-bias status changed in 2% of the genes in the abdomen and 4% in the head and thorax following mating. Mating responses involved de-feminization of females and, to a lesser extent, de-masculinization of males relative to their virgin state: mating decreased rather than increased dimorphic expression of sex-biased genes. The fact that regulatory changes of both types of sex-biased genes occurred in both sexes suggests that male- and female-specific selection is not restricted to male- and female-biased genes, respectively, as is sometimes assumed. © The Author(s) 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Mating Changes Sexually Dimorphic Gene Expression in the Seed Beetle Callosobruchus maculatus
Sayadi, Ahmed; Bayram, Helen; Arnqvist, Göran
2017-01-01
Sexually dimorphic phenotypes arise largely from sex-specific gene expression, which has mainly been characterized in sexually naïve adults. However, we expect sexual dimorphism in transcription to be dynamic and dependent on factors such as reproductive status. Mating induces many behavioral and physiological changes distinct to each sex and is therefore expected to activate regulatory changes in many sex-biased genes. Here, we first characterized sexual dimorphism in gene expression in Callosobruchus maculatus seed beetles. We then examined how females and males respond to mating and how it affects sex-biased expression, both in sex-limited (abdomen) and sex-shared (head and thorax) tissues. Mating responses were largely sex-specific and, as expected, females showed more genes responding compared with males (∼2,000 vs. ∼300 genes in the abdomen, ∼500 vs. ∼400 in the head and thorax, respectively). Of the sex-biased genes present in virgins, 16% (1,041 genes) in the abdomen and 17% (243 genes) in the head and thorax altered their relative expression between the sexes as a result of mating. Sex-bias status changed in 2% of the genes in the abdomen and 4% in the head and thorax following mating. Mating responses involved de-feminization of females and, to a lesser extent, de-masculinization of males relative to their virgin state: mating decreased rather than increased dimorphic expression of sex-biased genes. The fact that regulatory changes of both types of sex-biased genes occurred in both sexes suggests that male- and female-specific selection is not restricted to male- and female-biased genes, respectively, as is sometimes assumed. PMID:28391318
Rapid stress-induced transcriptomic changes in the brain depend on beta-adrenergic signaling.
Roszkowski, Martin; Manuella, Francesca; von Ziegler, Lukas; Durán-Pacheco, Gonzalo; Moreau, Jean-Luc; Mansuy, Isabelle M; Bohacek, Johannes
2016-08-01
Acute exposure to stressful experiences can rapidly increase anxiety and cause neuropsychiatric disorders. The effects of stress result in part from the release of neurotransmitters and hormones, which regulate gene expression in different brain regions. The fast neuroendocrine response to stress is largely mediated by norepinephrine (NE) and corticotropin releasing hormone (CRH), followed by a slower and more sustained release of corticosterone. While corticosterone is an important regulator of gene expression, it is not clear which stress-signals contribute to the rapid regulation of gene expression observed immediately after stress exposure. Here, we demonstrate in mice that 45 min after an acute swim stress challenge, large changes in gene expression occur across the transcriptome in the hippocampus, a region sensitive to the effects of stress. We identify multiple candidate genes that are rapidly and transiently altered in both males and females. Using a pharmacological approach, we show that most of these rapidly induced genes are regulated by NE through β-adrenergic receptor signaling. We find that CRH and corticosterone can also contribute to rapid changes in gene expression, although these effects appear to be restricted to fewer genes. These results newly reveal a widespread impact of NE on the transcriptome and identify novel genes associated with stress and adrenergic signaling. Copyright © 2016 Elsevier Ltd. All rights reserved.
Ingestion of bacterially expressed double-stranded RNA inhibits gene expression in planarians.
Newmark, Phillip A; Reddien, Peter W; Cebrià, Francesc; Sánchez Alvarado, Alejandro
2003-09-30
Freshwater planarian flatworms are capable of regenerating complete organisms from tiny fragments of their bodies; the basis for this regenerative prowess is an experimentally accessible stem cell population that is present in the adult planarian. The study of these organisms, classic experimental models for investigating metazoan regeneration, has been revitalized by the application of modern molecular biological approaches. The identification of thousands of unique planarian ESTs, coupled with large-scale whole-mount in situ hybridization screens, and the ability to inhibit planarian gene expression through double-stranded RNA-mediated genetic interference, provide a wealth of tools for studying the molecular mechanisms that regulate tissue regeneration and stem cell biology in these organisms. Here we show that, as in Caenorhabditis elegans, ingestion of bacterially expressed double-stranded RNA can inhibit gene expression in planarians. This inhibition persists throughout the process of regeneration, allowing phenotypes with disrupted regenerative patterning to be identified. These results pave the way for large-scale screens for genes involved in regenerative processes.
Maximova, Siela N; Florez, Sergio; Shen, Xiangling; Niemenak, Nicolas; Zhang, Yufan; Curtis, Wayne; Guiltinan, Mark J
2014-07-16
Theobroma cacao L. is a tropical fruit tree, the seeds of which are used to create chocolate. In vitro somatic embryogenesis (SE) of cacao is a propagation system useful for rapid mass-multiplication to accelerate breeding programs and to provide plants directly to farmers. Two major limitations of cacao SE remain: the efficiency of embryo production is highly genotype dependent and the lack of full cotyledon development results in low embryo to plant conversion rates. With the goal to better understand SE development and to improve the efficiency of SE conversion we examined gene expression differences between zygotic and somatic embryos using a whole genome microarray. The expression of 28,752 genes was determined at 4 developmental time points during zygotic embryogenesis (ZE) and 2 time points during cacao somatic embryogenesis (SE). Within the ZE time course, 10,288 differentially expressed genes were enriched for functions related to responses to abiotic and biotic stimulus, metabolic and cellular processes. A comparison of ZE and SE expression profiles identified 10,175 differentially expressed genes. Many TF genes, putatively involved in ethylene metabolism and response, were more strongly expressed in SEs as compared to ZEs. Genes involved in fatty acid metabolism and flavonoid biosynthesis, as well as seed storage protein genes, were also differentially expressed in the two types of embryos. Large numbers of genes were differentially regulated during various stages of both ZE and SE development in cacao. The relatively higher expression of ethylene and flavonoid related genes during SE suggests that the developing tissues may be experiencing high levels of stress during SE maturation caused by the in vitro environment. The expression of genes involved in the synthesis of auxin, polyunsaturated fatty acids and secondary metabolites was higher in SEs relative to ZEs despite lack of lipid and metabolite accumulation. 
These differences in gene transcript levels associated with critical processes during seed development are consistent with the fact that somatic embryos do not fully develop the large storage cotyledons found in zygotic embryos. These results provide insight towards design of improved protocols for cacao somatic embryogenesis.
2014-01-01
Background: Theobroma cacao L. is a tropical fruit tree, the seeds of which are used to create chocolate. In vitro somatic embryogenesis (SE) of cacao is a propagation system useful for rapid mass-multiplication to accelerate breeding programs and to provide plants directly to farmers. Two major limitations of cacao SE remain: the efficiency of embryo production is highly genotype dependent and the lack of full cotyledon development results in low embryo to plant conversion rates. With the goal to better understand SE development and to improve the efficiency of SE conversion we examined gene expression differences between zygotic and somatic embryos using a whole genome microarray. Results: The expression of 28,752 genes was determined at 4 developmental time points during zygotic embryogenesis (ZE) and 2 time points during cacao somatic embryogenesis (SE). Within the ZE time course, 10,288 differentially expressed genes were enriched for functions related to responses to abiotic and biotic stimulus, metabolic and cellular processes. A comparison of ZE and SE expression profiles identified 10,175 differentially expressed genes. Many TF genes, putatively involved in ethylene metabolism and response, were more strongly expressed in SEs as compared to ZEs. Genes involved in fatty acid metabolism and flavonoid biosynthesis, as well as seed storage protein genes, were also differentially expressed in the two types of embryos. Conclusions: Large numbers of genes were differentially regulated during various stages of both ZE and SE development in cacao. The relatively higher expression of ethylene and flavonoid related genes during SE suggests that the developing tissues may be experiencing high levels of stress during SE maturation caused by the in vitro environment. The expression of genes involved in the synthesis of auxin, polyunsaturated fatty acids and secondary metabolites was higher in SEs relative to ZEs despite lack of lipid and metabolite accumulation. 
These differences in gene transcript levels associated with critical processes during seed development are consistent with the fact that somatic embryos do not fully develop the large storage cotyledons found in zygotic embryos. These results provide insight towards design of improved protocols for cacao somatic embryogenesis. PMID:25030026
Pan- and core- network analysis of co-expression genes in a model plant
He, Fei; Maslov, Sergei
2016-12-16
Genome-wide gene expression experiments have been performed using the model plant Arabidopsis during the last decade. Some studies involved construction of coexpression networks, a popular technique used to identify groups of co-regulated genes, to infer unknown gene functions. One approach is to construct a single coexpression network by combining multiple expression datasets generated in different labs. We advocate a complementary approach in which we construct a large collection of 134 coexpression networks based on expression datasets reported in individual publications. To this end we reanalyzed public expression data. To describe this collection of networks we introduced concepts of ‘pan-network’ and ‘core-network’ representing union and intersection between a sizeable fraction of individual networks, respectively. Here, we showed that these two types of networks are different both in terms of their topology and biological function of interacting genes. For example, the modules of the pan-network are enriched in regulatory and signaling functions, while the modules of the core-network tend to include components of large macromolecular complexes such as ribosomes and photosynthetic machinery. Our analysis is aimed to help the plant research community to better explore the information contained within the existing vast collection of gene expression data in Arabidopsis.
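The pan-network and core-network concepts in the abstract above can be sketched as edge-support thresholds over a collection of networks: a low threshold approximates a union, a high one an intersection. The abstract does not state the fractions used, so the thresholds, function names and toy edge lists below are assumptions.

```python
from collections import Counter


def edge_support(networks):
    """Count in how many networks each undirected edge appears.

    networks: list of edge lists; each edge is a (gene_a, gene_b) pair.
    """
    counts = Counter()
    for edges in networks:
        # A set of frozensets counts an edge once per network and
        # ignores the order of its two endpoints.
        counts.update({frozenset(edge) for edge in edges})
    return counts


def threshold_network(networks, min_fraction):
    """Edges present in at least min_fraction of the networks."""
    counts = edge_support(networks)
    needed = min_fraction * len(networks)
    return {edge for edge, count in counts.items() if count >= needed}


# Four hypothetical coexpression networks over genes a-d.
toy_networks = [
    [("a", "b"), ("b", "c")],
    [("b", "a"), ("c", "d")],
    [("a", "b"), ("b", "c")],
    [("a", "b")],
]
pan_network = threshold_network(toy_networks, min_fraction=0.25)   # union-like
core_network = threshold_network(toy_networks, min_fraction=0.75)  # intersection-like
```

By construction the core-network is a subset of the pan-network, so any topological or functional difference between the two comes from the edges that are specific to only some datasets.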
Pan- and core- network analysis of co-expression genes in a model plant
DOE Office of Scientific and Technical Information (OSTI.GOV)
He, Fei; Maslov, Sergei
Genome-wide gene expression experiments have been performed using the model plant Arabidopsis during the last decade. Some studies involved construction of coexpression networks, a popular technique used to identify groups of co-regulated genes, to infer unknown gene functions. One approach is to construct a single coexpression network by combining multiple expression datasets generated in different labs. We advocate a complementary approach in which we construct a large collection of 134 coexpression networks based on expression datasets reported in individual publications. To this end we reanalyzed public expression data. To describe this collection of networks we introduced concepts of ‘pan-network’ and ‘core-network’ representing union and intersection between a sizeable fraction of individual networks, respectively. Here, we showed that these two types of networks are different both in terms of their topology and biological function of interacting genes. For example, the modules of the pan-network are enriched in regulatory and signaling functions, while the modules of the core-network tend to include components of large macromolecular complexes such as ribosomes and photosynthetic machinery. Our analysis is aimed to help the plant research community to better explore the information contained within the existing vast collection of gene expression data in Arabidopsis.
Evidence for a large expansion and subfunctionalisation of globin genes in sea anemones.
Smith, Hayden L; Pavasovic, Ana; Surm, Joachim M; Phillips, Matthew J; Prentis, Peter J
2018-06-27
The globin gene superfamily has been well-characterised in vertebrates; however, there has been limited research in early-diverging lineages, such as phylum Cnidaria. This study aimed to identify globin genes in multiple cnidarian lineages, and use bioinformatic approaches to characterise the evolution, structure and expression of these genes. Phylogenetic analyses and in silico protein predictions showed that all cnidarians have undergone an expansion of globin genes, which likely have a hexacoordinate protein structure. Our protein modelling has also revealed the possibility of a single pentacoordinate globin lineage in anthozoan species. Some cnidarian globin genes displayed tissue and development specific expression with very few orthologous genes similarly expressed across species. Our phylogenetic analyses also revealed that eumetazoan globin genes form a polyphyletic relationship with vertebrate globin genes. Overall, our analyses suggest that a Ngb-like and GbX-like gene were most likely present in the globin gene repertoire for the last common ancestor of eumetazoans. The identification of a large-scale expansion and subfunctionalisation of globin genes in actiniarians provides an excellent starting point to further our understanding of the evolution and function of the globin gene superfamily in early-diverging lineages.
Valera, Alexandra; López-Guillermo, Armando; Cardesa-Salzmann, Teresa; Climent, Fina; González-Barca, Eva; Mercadal, Santiago; Espinosa, Íñigo; Novelli, Silvana; Briones, Javier; Mate, José L.; Salamero, Olga; Sancho, Juan M.; Arenillas, Leonor; Serrano, Sergi; Erill, Nadina; Martínez, Daniel; Castillo, Paola; Rovira, Jordina; Martínez, Antonio; Campo, Elias; Colomo, Luis
2013-01-01
MYC alterations influence the survival of patients with diffuse large B-cell lymphoma. Most studies have focused on MYC translocations but there is little information regarding the impact of numerical alterations and protein expression. We analyzed the genetic alterations and protein expression of MYC, BCL2, BCL6, and MALT1 in 219 cases of diffuse large B-cell lymphoma. MYC rearrangement occurred as the sole abnormality (MYC single-hit) in 3% of cases, MYC and concurrent BCL2 and/or BCL6 rearrangements (MYC double/triple-hit) in 4%, MYC amplifications in 2% and MYC gains in 19%. MYC single-hit, MYC double/triple-hit and MYC amplifications, but not MYC gains or other gene rearrangements, were associated with unfavorable progression-free survival and overall survival. MYC protein expression, evaluated using computerized image analysis, captured the unfavorable prognosis of MYC translocations/amplifications and identified an additional subset of patients without gene alterations but with similar poor prognosis. Patients with tumors expressing both MYC/BCL2 had the worst prognosis, whereas those with double-negative tumors had the best outcome. High MYC expression was associated with shorter overall survival irrespectively of the International Prognostic Index and BCL2 expression. In conclusion, MYC protein expression identifies a subset of diffuse large B-cell lymphoma with very poor prognosis independently of gene alterations and other prognostic parameters. PMID:23716551
Valera, Alexandra; López-Guillermo, Armando; Cardesa-Salzmann, Teresa; Climent, Fina; González-Barca, Eva; Mercadal, Santiago; Espinosa, Iñigo; Novelli, Silvana; Briones, Javier; Mate, José L; Salamero, Olga; Sancho, Juan M; Arenillas, Leonor; Serrano, Sergi; Erill, Nadina; Martínez, Daniel; Castillo, Paola; Rovira, Jordina; Martínez, Antonio; Campo, Elias; Colomo, Luis
2013-10-01
MYC alterations influence the survival of patients with diffuse large B-cell lymphoma. Most studies have focused on MYC translocations but there is little information regarding the impact of numerical alterations and protein expression. We analyzed the genetic alterations and protein expression of MYC, BCL2, BCL6, and MALT1 in 219 cases of diffuse large B-cell lymphoma. MYC rearrangement occurred as the sole abnormality (MYC single-hit) in 3% of cases, MYC and concurrent BCL2 and/or BCL6 rearrangements (MYC double/triple-hit) in 4%, MYC amplifications in 2% and MYC gains in 19%. MYC single-hit, MYC double/triple-hit and MYC amplifications, but not MYC gains or other gene rearrangements, were associated with unfavorable progression-free survival and overall survival. MYC protein expression, evaluated using computerized image analysis, captured the unfavorable prognosis of MYC translocations/amplifications and identified an additional subset of patients without gene alterations but with similar poor prognosis. Patients with tumors expressing both MYC/BCL2 had the worst prognosis, whereas those with double-negative tumors had the best outcome. High MYC expression was associated with shorter overall survival irrespectively of the International Prognostic Index and BCL2 expression. In conclusion, MYC protein expression identifies a subset of diffuse large B-cell lymphoma with very poor prognosis independently of gene alterations and other prognostic parameters.
Identification and resolution of artifacts in the interpretation of imprinted gene expression
Proudhon, Charlotte
2010-01-01
Genomic imprinting refers to genes that are epigenetically programmed in the germline to express exclusively or preferentially one allele in a parent-of-origin manner. Expression-based genome-wide screening for the identification of imprinted genes has failed to uncover a significant number of new imprinted genes, probably because of the high tissue- and developmental-stage specificity of imprinted gene expression. A very large number of technical and biological artifacts can also lead to erroneous evidence of imprinted gene expression. In this article, we focus on three common sources of potential confounding effects: (i) random monoallelic expression in monoclonal cell populations, (ii) genetically determined monoallelic expression and (iii) contamination or infiltration of embryonic tissues with maternal material. This last situation specifically applies to genes that occur as maternally expressed in the placenta. Besides the use of reciprocal crosses that are instrumental to confirm the parental specificity of expression, we provide additional methods for the detection and elimination of these situations that can be misinterpreted as cases of imprinted expression. PMID:20829207
Zhang, Ai; Li, Ning; Gong, Lei; Gou, Xiaowan; Wang, Bin; Deng, Xin; Li, Changping; Dong, Qianli; Zhang, Huakun
2017-01-01
Aneuploidy, a condition of unbalanced chromosome content, represents a large-effect mutation that bears significant relevance to human health and microbe adaptation. As such, extensive studies of aneuploidy have been conducted in unicellular model organisms and cancer cells. Aneuploidy also frequently is associated with plant polyploidization, but its impact on gene expression and its relevance to polyploid genome evolution/functional innovation remain largely unknown. Here, we used a panel of diverse types of whole-chromosome aneuploidy of hexaploid wheat (Triticum aestivum), all under the common genetic background of cv Chinese Spring, to systemically investigate the impact of aneuploidy on genome-, subgenome-, and chromosome-wide gene expression. Compared with prior findings in haploid or diploid aneuploid systems, we unravel additional and novel features of alteration in global gene expression resulting from the two major impacts of aneuploidy, cis- and trans-regulation, as well as dosage compensation. We show that the expression-altered genes map evenly along each chromosome, with no evidence for coregulating aggregated expression domains. However, chromosomes and subgenomes in hexaploid wheat are unequal in their responses to aneuploidy with respect to the number of genes being dysregulated. Strikingly, homeologous chromosomes do not differ from nonhomologous chromosomes in terms of aneuploidy-induced trans-acting effects, suggesting that the three constituent subgenomes of hexaploid wheat are largely uncoupled at the transcriptional level of gene regulation. Together, our findings shed new insights into the functional interplay between homeologous chromosomes and interactions between subgenomes in hexaploid wheat, which bear implications to further our understanding of allopolyploid genome evolution and efforts in breeding new allopolyploid crops. PMID:28821592
Expression-based clustering of CAZyme-encoding genes of Aspergillus niger.
Gruben, Birgit S; Mäkelä, Miia R; Kowalczyk, Joanna E; Zhou, Miaomiao; Benoit-Gelber, Isabelle; De Vries, Ronald P
2017-11-23
The Aspergillus niger genome contains a large repertoire of genes encoding carbohydrate active enzymes (CAZymes) that are targeted to plant polysaccharide degradation enabling A. niger to grow on a wide range of plant biomass substrates. Which genes need to be activated in certain environmental conditions depends on the composition of the available substrate. Previous studies have demonstrated the involvement of a number of transcriptional regulators in plant biomass degradation and have identified sets of target genes for each regulator. In this study, a broad transcriptional analysis was performed of the A. niger genes encoding (putative) plant polysaccharide degrading enzymes. Microarray data focusing on the initial response of A. niger to the presence of plant biomass related carbon sources were analyzed of a wild-type strain N402 that was grown on a large range of carbon sources and of the regulatory mutant strains ΔxlnR, ΔaraR, ΔamyR, ΔrhaR and ΔgalX that were grown on their specific inducing compounds. The cluster analysis of the expression data revealed several groups of co-regulated genes, which goes beyond the traditionally described co-regulated gene sets. Additional putative target genes of the selected regulators were identified, based on their expression profile. Notably, in several cases the expression profile puts questions on the function assignment of uncharacterized genes that was based on homology searches, highlighting the need for more extensive biochemical studies into the substrate specificity of enzymes encoded by these non-characterized genes. The data also revealed sets of genes that were upregulated in the regulatory mutants, suggesting interaction between the regulatory systems and a therefore even more complex overall regulatory network than has been reported so far. Expression profiling on a large number of substrates provides better insight in the complex regulatory systems that drive the conversion of plant biomass by fungi. 
In addition, the data provides additional evidence in favor of and against the similarity-based functions assigned to uncharacterized genes.
Large clusters of co-expressed genes in the Drosophila genome.
Boutanaev, Alexander M; Kalmykova, Alla I; Shevelyov, Yuri Y; Nurminsky, Dmitry I
2002-12-12
Clustering of co-expressed, non-homologous genes on chromosomes implies their co-regulation. In lower eukaryotes, co-expressed genes are often found in pairs. Clustering of genes that share aspects of transcriptional regulation has also been reported in higher eukaryotes. To advance our understanding of the mode of coordinated gene regulation in multicellular organisms, we performed a genome-wide analysis of the chromosomal distribution of co-expressed genes in Drosophila. We identified a total of 1,661 testes-specific genes, one-third of which are clustered on chromosomes. The number of clusters of three or more genes is much higher than expected by chance. We observed a similar trend for genes upregulated in the embryo and in the adult head, although the expression pattern of individual genes cannot be predicted on the basis of chromosomal position alone. Our data suggest that the prevalent mechanism of transcriptional co-regulation in higher eukaryotes operates with extensive chromatin domains that comprise multiple genes.
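The comparison against chance described in this abstract can be sketched as a permutation test. This is an illustration under stated assumptions, not the authors' procedure: a "cluster" is taken here to be a run of at least three consecutive flagged genes along an ordered chromosome, the function names are hypothetical, and the null model simply shuffles the flags among gene positions.

```python
import random


def count_clusters(flags, min_size=3):
    """Count runs of at least min_size consecutive flagged genes."""
    clusters = 0
    run = 0
    for flagged in flags:
        if flagged:
            run += 1
        else:
            if run >= min_size:
                clusters += 1
            run = 0
    # Close a run that reaches the end of the chromosome.
    if run >= min_size:
        clusters += 1
    return clusters


def permutation_p_value(flags, min_size=3, n_perm=1000, seed=0):
    """Estimate how often shuffled gene orders give at least as many clusters."""
    rng = random.Random(seed)
    observed = count_clusters(flags, min_size)
    shuffled = list(flags)
    hits = 0
    for _ in range(n_perm):
        rng.shuffle(shuffled)
        if count_clusters(shuffled, min_size) >= observed:
            hits += 1
    # Add-one correction keeps the estimate strictly above zero.
    return observed, (hits + 1) / (n_perm + 1)


# Toy chromosome: 1 marks a tissue-specific gene, 0 any other gene.
toy_flags = [1, 1, 1, 0, 1, 1, 0, 1, 1, 1, 1, 0, 0, 0, 0, 0, 0, 0, 0, 0]
observed, p_value = permutation_p_value(toy_flags)
```

Shuffling preserves the number of flagged genes and changes only their positions, so a small p-value would point to positional clustering rather than to the sheer number of tissue-specific genes.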
TOXICOGENOMICS AND HUMAN DISEASE RISK ASSESSMENT
Complete sequencing of human and other genomes, availability of large-scale gene expression arrays with ever-increasing numbers of genes displayed, and steady improvements in protein expression technology can hav...
Qian, Baoying; Xue, Liangyi; Huang, Hongli
2016-01-01
The large yellow croaker (Larimichthys crocea) is an economically important fish species in the Chinese mariculture industry. To understand the molecular basis underlying the response to fasting, Illumina HiSeq™ 2000 was used to analyze the liver transcriptome of fasting large yellow croakers. A total of 54,933,550 clean reads were obtained and assembled into 110,364 contigs. Annotation to the NCBI database identified a total of 38,728 unigenes, of which 19,654 were classified into Gene Ontology and 22,683 were found in Kyoto Encyclopedia of Genes and Genomes (KEGG). Comparative analysis of the expression profiles between fasting fish and normal-feeding fish identified a total of 7,623 differentially expressed genes (P < 0.05), including 2,500 upregulated genes and 5,123 downregulated genes. Dramatic differences were observed in the genes involved in metabolic pathways such as fat digestion and absorption, citrate cycle, and glycolysis/gluconeogenesis, and similar results were also found in the transcriptome of skeletal muscle. Further qPCR analysis confirmed that the genes encoding the factors involved in those pathways significantly changed in terms of expression levels. The results of the present study provide insights into the molecular mechanisms underlying the metabolic response of the large yellow croaker to fasting and identify areas that require further investigation. PMID:26967898
Balta, Burhan; Gumus, Hakan; Bayramov, Ruslan; Korkmaz Bayramov, Keziban; Erdogan, Murat; Oztop, Didem Behice; Dogan, Muhammet Ensar; Taheri, Serpil; Dundar, Munis
2018-05-18
Although there are a large number of sequence variants of different genes and copy number variations at various loci identified in autistic disorder (AD) patients, the pathogenesis of AD has not been elucidated completely. Recently, in AD patients, a large number of expression array and transcriptome studies have shown an increase in the expression of genes especially related to innate immune response. Antimicrobial effects of vitamin D and VDR are exerted through Toll-Like-Receptors (TLR), which have an important role in the innate immune response, are expressed by antigen presenting cells and recognize foreign microorganisms. This study included 30 patients diagnosed with AD and 30 healthy controls, matched for age and gender. Whole blood VDR gene expression and the rs11568820 and rs4516035 SNP profile of the promoter region of the VDR gene were compared between the groups by real time PCR. Whole blood VDR gene expression was significantly higher in the AD group compared to control subjects (p < 0.0001). There were no significant differences in allele and genotype distribution of rs11568820 and rs4516035 polymorphisms between AD patients and controls. The increase of VDR gene expression in patients with AD may be in accordance with an increase in the innate immune response in patients with AD. Furthermore, this study will stimulate new studies in order to clarify the relationship among AD, vitamin D, VDR, and innate immunity.
Gene expression changes governing extreme dehydration tolerance in an Antarctic insect
Teets, Nicholas M.; Peyton, Justin T.; Colinet, Herve; Renault, David; Kelley, Joanna L.; Kawarasaki, Yuta; Lee, Richard E.; Denlinger, David L.
2012-01-01
Among terrestrial organisms, arthropods are especially susceptible to dehydration, given their small body size and high surface area to volume ratio. This challenge is particularly acute for polar arthropods that face near-constant desiccating conditions, as water is frozen and thus unavailable for much of the year. The molecular mechanisms that govern extreme dehydration tolerance in insects remain largely undefined. In this study, we used RNA sequencing to quantify transcriptional mechanisms of extreme dehydration tolerance in the Antarctic midge, Belgica antarctica, the world’s southernmost insect and only insect endemic to Antarctica. Larvae of B. antarctica are remarkably tolerant of dehydration, surviving losses up to 70% of their body water. Gene expression changes in response to dehydration indicated up-regulation of cellular recycling pathways including the ubiquitin-mediated proteasome and autophagy, with concurrent down-regulation of genes involved in general metabolism and ATP production. Metabolomics results revealed shifts in metabolite pools that correlated closely with changes in gene expression, indicating that coordinated changes in gene expression and metabolism are a critical component of the dehydration response. Finally, using comparative genomics, we compared our gene expression results with a transcriptomic dataset for the Arctic collembolan, Megaphorura arctica. Although B. antarctica and M. arctica are adapted to similar environments, our analysis indicated very little overlap in expression profiles between these two arthropods. Whereas several orthologous genes showed similar expression patterns, transcriptional changes were largely species specific, indicating these polar arthropods have developed distinct transcriptional mechanisms to cope with similar desiccating conditions. PMID:23197828
Gene expression changes governing extreme dehydration tolerance in an Antarctic insect.
Teets, Nicholas M; Peyton, Justin T; Colinet, Herve; Renault, David; Kelley, Joanna L; Kawarasaki, Yuta; Lee, Richard E; Denlinger, David L
2012-12-11
Among terrestrial organisms, arthropods are especially susceptible to dehydration, given their small body size and high surface area to volume ratio. This challenge is particularly acute for polar arthropods that face near-constant desiccating conditions, as water is frozen and thus unavailable for much of the year. The molecular mechanisms that govern extreme dehydration tolerance in insects remain largely undefined. In this study, we used RNA sequencing to quantify transcriptional mechanisms of extreme dehydration tolerance in the Antarctic midge, Belgica antarctica, the world's southernmost insect and only insect endemic to Antarctica. Larvae of B. antarctica are remarkably tolerant of dehydration, surviving losses up to 70% of their body water. Gene expression changes in response to dehydration indicated up-regulation of cellular recycling pathways including the ubiquitin-mediated proteasome and autophagy, with concurrent down-regulation of genes involved in general metabolism and ATP production. Metabolomics results revealed shifts in metabolite pools that correlated closely with changes in gene expression, indicating that coordinated changes in gene expression and metabolism are a critical component of the dehydration response. Finally, using comparative genomics, we compared our gene expression results with a transcriptomic dataset for the Arctic collembolan, Megaphorura arctica. Although B. antarctica and M. arctica are adapted to similar environments, our analysis indicated very little overlap in expression profiles between these two arthropods. Whereas several orthologous genes showed similar expression patterns, transcriptional changes were largely species specific, indicating these polar arthropods have developed distinct transcriptional mechanisms to cope with similar desiccating conditions.
The oncogenic potential of BK-polyomavirus is linked to viral integration into the human genome.
Kenan, Daniel J; Mieczkowski, Piotr A; Burger-Calderon, Raquel; Singh, Harsharan K; Nickeleit, Volker
2015-11-01
It has been suggested that BK-polyomavirus is linked to oncogenesis via high expression levels of large T-antigen in some urothelial neoplasms arising following kidney transplantation. However, a causal association between BK-polyomavirus, large T-antigen expression and oncogenesis has never been demonstrated in humans. Here we describe an investigation using high-throughput sequencing of tumour DNA obtained from an urothelial carcinoma arising in a renal allograft. We show that a novel BK-polyomavirus strain, named CH-1, is integrated into exon 26 of the myosin-binding protein C1 gene (MYBPC1) on chromosome 12 in tumour cells but not in normal renal cells. Integration of the BK-polyomavirus results in a number of discrete alterations in viral gene expression, including: (a) disruption of VP1 protein expression and robust expression of large T-antigen; (b) preclusion of viral replication; and (c) deletions in the non-coding control region (NCCR), with presumed alterations in promoter feedback loops. Viral integration disrupts one MYBPC1 gene copy and likely alters its expression. Circular episomal BK-polyomavirus gene sequences are not found, and the renal allograft shows no productive polyomavirus infection or polyomavirus nephropathy. These findings support the hypothesis that integration of polyomaviruses is essential to tumourigenesis. It is likely that dysregulation of large T-antigen, with persistent over-expression in non-lytic cells, promotes cell growth, genetic instability and neoplastic transformation. © 2015 The Authors. The Journal of Pathology published by John Wiley & Sons Ltd on behalf of Pathological Society of Great Britain and Ireland.
Tummala, Seshu B; Junne, Stefan G; Paredes, Carlos J; Papoutsakis, Eleftherios T
2003-12-30
Antisense RNA (asRNA) downregulation alters protein expression without changing the regulation of gene expression. Downregulation of primary metabolic enzymes possibly combined with overexpression of other metabolic enzymes may result in profound changes in product formation, and this may alter the large-scale transcriptional program of the cells. DNA-array based large-scale transcriptional analysis has the potential to elucidate factors that control cellular fluxes even in the absence of proteome data. These themes are explored in the study of large-scale transcriptional analysis programs and the in vivo primary-metabolism fluxes of several related recombinant C. acetobutylicum strains: C. acetobutylicum ATCC 824(pSOS95del) (plasmid control; produces high levels of butanol and acetone), 824(pCTFB1AS) (expresses antisense RNA against CoA transferase (ctfb1-asRNA); produces very low levels of butanol and acetone), and 824(pAADB1) (expresses ctfb1-asRNA and the alcohol-aldehyde dehydrogenase gene (aad); produces high alcohol and low acetone levels). DNA-array based transcriptional analysis revealed that the large changes in product concentrations (and notably butanol concentration) due to ctfb1-asRNA expression alone and in combination with aad overexpression resulted in dramatic changes of the cellular transcriptome. Cluster analysis and gene expression patterns of established and putative operons involved in stress response, motility, sporulation, and fatty-acid biosynthesis indicate that these simple genetic changes dramatically alter the cellular programs of C. acetobutylicum. Comparison of gene expression and flux analysis data may point to possible flux-controlling steps and suggest unknown regulatory mechanisms. Copyright 2003; Wiley Periodicals, Inc.
Gene-expression signatures of Atlantic salmon's plastic life cycle
Aubin-Horth, N.; Letcher, B.H.; Hofmann, H.A.
2009-01-01
How genomic expression differs as a function of life history variation is largely unknown. Atlantic salmon exhibits extreme alternative life histories. We defined the gene-expression signatures of wild-caught salmon at two different life stages by comparing the brain expression profiles of mature sneaker males and immature males, and early migrants and late migrants. In addition to life-stage-specific signatures, we discovered a surprisingly large gene set that was differentially regulated - at similar magnitudes, yet in opposite direction - in both life history transitions. We suggest that this co-variation is not a consequence of many independent cellular and molecular switches in the same direction but rather represents the molecular equivalent of a physiological shift orchestrated by one or very few master regulators. © 2009 Elsevier Inc. All rights reserved.
Gene-expression signatures of Atlantic salmon’s plastic life cycle
Aubin-Horth, Nadia; Letcher, Benjamin H.; Hofmann, Hans A.
2009-01-01
How genomic expression differs as a function of life history variation is largely unknown. Atlantic salmon exhibits extreme alternative life histories. We defined the gene-expression signatures of wild-caught salmon at two different life stages by comparing the brain expression profiles of mature sneaker males and immature males, and early migrants and late migrants. In addition to life-stage-specific signatures, we discovered a surprisingly large gene set that was differentially regulated - at similar magnitudes, yet in opposite direction - in both life history transitions. We suggest that this co-variation is not a consequence of many independent cellular and molecular switches in the same direction but rather represents the molecular equivalent of a physiological shift orchestrated by one or very few master regulators. PMID:19401203
Expression analysis of genes encoding double B-box zinc finger proteins in maize.
Li, Wenlan; Wang, Jingchao; Sun, Qi; Li, Wencai; Yu, Yanli; Zhao, Meng; Meng, Zhaodong
2017-11-01
The B-box proteins play key roles in plant development. The double B-box (DBB) family is one of the subfamilies of the B-box family, with two B-box domains and without a CCT domain. In this study, 12 maize double B-box genes (ZmDBBs) were identified through a genome-wide survey. Phylogenetic analysis of DBB proteins from maize, rice, Sorghum bicolor, Arabidopsis, and poplar classified them into five major clades. Gene duplication analysis indicated that segmental duplications made a large contribution to the expansion of ZmDBBs. Furthermore, a large number of cis-acting regulatory elements related to plant development, response to light and phytohormone were identified in the promoter regions of the ZmDBB genes. The expression patterns of the ZmDBB genes in various tissues and different developmental stages demonstrated that ZmDBBs might play essential roles in plant development, and some ZmDBB genes might have unique functions in specific developmental stages. In addition, several ZmDBB genes showed a diurnal expression pattern. The expression levels of some ZmDBB genes changed significantly under light/dark treatment conditions and phytohormone treatments, implying that they might participate in the light signaling pathway and hormone signaling. Our results will provide new information to better understand the complexity of the DBB gene family in maize.
Gene coexpression measures in large heterogeneous samples using count statistics.
Wang, Y X Rachel; Waterman, Michael S; Huang, Haiyan
2014-11-18
With the advent of high-throughput technologies making large-scale gene expression data readily available, developing appropriate computational tools to process these data and distill insights into systems biology has been an important part of the "big data" challenge. Gene coexpression is one of the earliest techniques developed that is still widely in use for functional annotation, pathway analysis, and, most importantly, the reconstruction of gene regulatory networks, based on gene expression data. However, most coexpression measures do not specifically account for local features in expression profiles. For example, it is very likely that the patterns of gene association may change or only exist in a subset of the samples, especially when the samples are pooled from a range of experiments. We propose two new gene coexpression statistics based on counting local patterns of gene expression ranks to take into account the potentially diverse nature of gene interactions. In particular, one of our statistics is designed for time-course data with local dependence structures, such as time series coupled over a subregion of the time domain. We provide asymptotic analysis of their distributions and power, and evaluate their performance against a wide range of existing coexpression measures on simulated and real data. Our new statistics are fast to compute, robust against outliers, and show comparable and often better general performance.
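The idea of counting local rank patterns can be made concrete with a deliberately simplified sketch. It does not reproduce the two statistics proposed in the paper: the function names, the use of contiguous windows, the window length `k`, and the tie-breaking rule are all assumptions made for illustration.

```python
def rank_pattern(values):
    """Order of positions when sorted by value; ties are broken by position."""
    return tuple(sorted(range(len(values)), key=lambda i: (values[i], i)))


def count_matching_windows(x, y, k=3):
    """Count length-k windows where two expression profiles share a rank pattern.

    x, y: expression values of two genes across the same ordered samples.
    k: window length, an illustrative choice.
    """
    if len(x) != len(y):
        raise ValueError("profiles must have the same length")
    return sum(
        rank_pattern(x[i:i + k]) == rank_pattern(y[i:i + k])
        for i in range(len(x) - k + 1)
    )


# Toy profiles that agree locally at both ends but not in the middle.
gene_x = [1, 2, 3, 9, 8, 7]
gene_y = [10, 20, 30, 5, 4, 3]
matches = count_matching_windows(gene_x, gene_y, k=3)
```

Because only the ordering inside each window matters, a single extreme value can change at most the `k` windows that contain it, which is one way to read the robustness to outliers mentioned in the abstract.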
Spatial expression of Hox cluster genes in the ontogeny of a sea urchin
NASA Technical Reports Server (NTRS)
Arenas-Mena, C.; Cameron, A. R.; Davidson, E. H.
2000-01-01
The Hox cluster of the sea urchin Strongylocentrotus purpuratus contains ten genes in a 500 kb span of the genome. Only two of these genes are expressed during embryogenesis, while all of eight genes tested are expressed during development of the adult body plan in the larval stage. We report the spatial expression during larval development of the five 'posterior' genes of the cluster: SpHox7, SpHox8, SpHox9/10, SpHox11/13a and SpHox11/13b. The five genes exhibit a dynamic, largely mesodermal program of expression. Only SpHox7 displays extensive expression within the pentameral rudiment itself. A spatially sequential and colinear arrangement of expression domains is found in the somatocoels, the paired posterior mesodermal structures that will become the adult perivisceral coeloms. No such sequential expression pattern is observed in endodermal, epidermal or neural tissues of either the larva or the presumptive juvenile sea urchin. The spatial expression patterns of the Hox genes illuminate the evolutionary process by which the pentameral echinoderm body plan emerged from a bilateral ancestor.
Wang, Yang; Chen, Zhi-Hao; Yin, Chun; Ma, Jian-Hua; Li, Di-Jie; Zhao, Fan; Sun, Yu-Long; Hu, Li-Fang; Shang, Peng; Qian, Ai-Rong
2015-01-01
Diamagnetic levitation, a novel ground-based model for simulating a reduced gravity environment, has recently been applied in life science research. In this study a specially designed superconducting magnet with a large gradient high magnetic field (LG-HMF), which can provide three apparent gravity levels (μ-g, 1-g, and 2-g), was used to simulate a space-like gravity environment. The osteocyte, as the most important mechanosensor in bone, takes a pivotal position in mediating mechano-induced bone remodeling. In this study, the effects of LG-HMF on gene expression profiling of the osteocyte-like cell line MLO-Y4 were investigated by Affymetrix DNA microarray. LG-HMF affected osteocyte gene expression profiling. Differentially expressed genes (DEGs) and data mining were further analyzed by using bioinformatic tools, such as DAVID, iReport. 12 energy metabolism related genes (PFKL, AK4, ALDOC, COX7A1, STC1, ADM, CA9, CA12, P4HA1, APLN, GPR35 and GPR84) were further confirmed by real-time PCR. An integrated gene interaction network of 12 DEGs was constructed. Bio-data mining showed that genes involved in glucose metabolic process and apoptosis changed notably. Our results demonstrated that LG-HMF affected the expression of energy metabolism related genes in osteocytes. The identification of genes sensitive to special environments may provide some potential targets for preventing and treating bone loss or osteoporosis. PMID:25635858
Co-Option and De Novo Gene Evolution Underlie Molluscan Shell Diversity
Aguilera, Felipe; McDougall, Carmel
2017-01-01
Molluscs fabricate shells of incredible diversity and complexity by localized secretions from the dorsal epithelium of the mantle. Although distantly related molluscs express remarkably different secreted gene products, it remains unclear if the evolution of shell structure and pattern is underpinned by the differential co-option of conserved genes or the integration of lineage-specific genes into the mantle regulatory program. To address this, we compare the mantle transcriptomes of 11 bivalves and gastropods of varying relatedness. We find that each species, including four Pinctada (pearl oyster) species that diverged within the last 20 Ma, expresses a unique mantle secretome. Lineage- or species-specific genes comprise a large proportion of each species’ mantle secretome. A majority of these secreted proteins have unique domain architectures that include repetitive, low complexity domains (RLCDs), which evolve rapidly, and have a proclivity to expand, contract and rearrange in the genome. There are also a large number of secretome genes expressed in the mantle that arose before the origin of gastropods and bivalves. Each species expresses a unique set of these more ancient genes consistent with their independent co-option into these mantle gene regulatory networks. From this analysis, we infer lineage-specific secretomes underlie shell diversity, and include both rapidly evolving RLCD-containing proteins, and the continual recruitment and loss of both ancient and recently evolved genes into the periphery of the regulatory network controlling gene expression in the mantle epithelium. PMID:28053006
Liu, Han; Yang, Qingyong; Fan, Chuchuan; Zhao, Xiaoqin; Wang, Xuemin; Zhou, Yongming
2015-04-01
The silique of oilseed rape (Brassica napus) is a composite organ including seeds and the silique wall (SW), which are physiologically, biochemically and functionally distinct. Yet, the molecular events controlling such differences between the SW and seeds, as well as their coordination during silique development at the transcriptional level, are largely unknown. Here, we identified large sets of differentially expressed genes in the SW and seeds of siliques at 21-22 days after flowering (DAF) with a Brassica 95K EST microarray. At this particular stage, there were 3278 SW preferentially expressed genes and 2425 seed preferentially expressed genes. Using the MapMan visualization software, genes differentially regulated in various metabolic pathways and sub-pathways between the SW and seeds were revealed. Photosynthesis and transport-related genes were more actively transcribed in the SW, while those involved in lipid metabolism were more active in seeds during the seed filling stage. On the other hand, genes involved in secondary metabolism were selectively regulated in the SW and seeds. Large numbers of transcription factors were identified as differentially expressed between the SW and seeds, suggesting a complex pattern of transcriptional control in these two organs. Furthermore, most genes discussed in categories or pathways showed a similar expression pattern from 21 DAF to 42 DAF. Our results thus provide insights into the coordination of seeds and the SW in the developing silique at the transcriptional level, which will facilitate the functional studies of important genes for improving B. napus seed productivity and quality. Copyright © 2015 Elsevier Ireland Ltd. All rights reserved.
Zhang, Bo; Peng, Yu; Zheng, Jincheng; Liang, Lina; Hoffmann, Ary A; Ma, Chun-Sen
2016-07-01
Heat shock protein gene (Hsp) families are thought to be important in thermal adaptation, but their expression patterns under various thermal stresses have still been poorly characterized outside of model systems. We have therefore characterized Hsp genes and their stress responses in the oriental fruit moth (OFM), Grapholita molesta, a widespread global orchard pest, and compared patterns of expression in this species to that of other insects. Genes from four Hsp families showed variable expression levels among tissues and developmental stages. Members of the Hsp40, 70, and 90 families were highly expressed under short exposures to heat and cold. Expression of Hsp40, 70, and Hsc70 family members increased in OFM undergoing diapause, while Hsp90 was downregulated. We found that there was strong sequence conservation of members of large Hsp families (Hsp40, Hsp60, Hsp70, Hsc70) across taxa, but this was not always matched by conservation of expression patterns. When the large Hsps as well as small Hsps from OFM were compared under acute and ramping heat stress, two groups of sHsps expression patterns were apparent, depending on whether expression increased or decreased immediately after stress exposure. These results highlight potential differences in conservation of function as opposed to sequence in this gene family and also point to Hsp genes potentially useful as bioindicators of diapause and thermal stress in OFM.
Lee, Siu Sylvia
2004-05-05
Aging is a complex process that involves the gradual functional decline of many different tissues and cells. Gene expression microarray analysis provides a comprehensive view of the gene expression signature associated with age and is particularly valuable for understanding the molecular mechanisms that contribute to the aging process. However, because of the stochastic nature of the aging process, animals of the same chronological age often manifest great physiological differences. Therefore, profiling the gene expression pattern of a large population of aging animals risks either exaggerating or masking the changes in gene expression that correspond to physiological aging. In a recent paper, Golden and Melov surveyed the gene expression profiles of individual aging Caenorhabditis elegans, hoping to circumvent the problem of variability among worms of the same chronological age. This initial analysis of age-dependent gene expression in individual aging worms is an important step toward deciphering the molecular basis of physiological aging.
Yao, Qiu-Yang; Xia, En-Hua; Liu, Fei-Hu; Gao, Li-Zhi
2015-02-15
WRKY transcription factors (TFs), one of the ten largest TF families in higher plants, play important roles in regulating plant development and resistance. To date, little is known about the WRKY TF family in Brassica oleracea. Recently, the completed genome sequence of cabbage (B. oleracea var. capitata) allows us to systematically analyze WRKY genes in this species. A total of 148 WRKY genes were characterized and classified into seven subgroups that belong to three major groups. Phylogenetic and synteny analyses revealed that the repertoire of cabbage WRKY genes was derived from a common ancestor shared with Arabidopsis thaliana. The B. oleracea WRKY genes were found to be preferentially retained after the whole-genome triplication (WGT) event in its recent ancestor, suggesting that the WGT event had largely contributed to a rapid expansion of the WRKY gene family in B. oleracea. The analysis of RNA-Seq data from various tissues (i.e., roots, stems, leaves, buds, flowers and siliques) revealed that most of the identified WRKY genes were positively expressed in cabbage, and a large portion of them exhibited patterns of differential and tissue-specific expression, demonstrating that these gene members might play essential roles in plant developmental processes. Comparative analysis of the expression level among duplicated genes showed that gene expression divergence was evidently presented among cabbage WRKY paralogs, indicating functional divergence of these duplicated WRKY genes. Copyright © 2014 Elsevier B.V. All rights reserved.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Tabuchi, Yoshiaki; Kondo, Takashi; Suzuki, Yoshihisa
2005-04-15
Sertoli TTE3 cells, derived from transgenic mice bearing temperature-sensitive simian virus 40 large T (tsSV40LT)-antigen, proliferated continuously at a permissive temperature (33 °C) whereas inactivation of the large T-antigen by a nonpermissive temperature (39 °C) led to differentiation as judged by elevation of transferrin. To clarify the detailed mechanisms of differentiation, we investigated the time course of changes in gene expression using cDNA microarrays. Of the 865 genes analyzed, 14 genes showed increased levels of expression. Real-time quantitative PCR revealed that the mRNA levels of p21waf1, milk fat globule membrane protein E8, heat-responsive protein 12, and selenoprotein P were markedly elevated. Moreover, the differentiated condition induced by the nonpermissive temperature significantly increased mRNA levels of these four genes in several cell lines from the transgenic mice bearing the oncogene. The present results regarding changes in gene expression will provide a basis for a further understanding of molecular mechanisms of differentiation in both Sertoli cells and cell lines transformed by tsSV40LT-antigen.
Stam, L. F.; Laurie, C. C.
1996-01-01
A molecular mapping experiment shows that a major gene effect on a quantitative trait, the level of alcohol dehydrogenase expression in Drosophila melanogaster, is due to multiple polymorphisms within the Adh gene. These polymorphisms are located in an intron, the coding sequence, and the 3' untranslated region. Because of nonrandom associations among polymorphisms at different sites, the individual effects combine (in some cases epistatically) to produce "superalleles" with large effect. These results have implications for the interpretation of major gene effects detected by quantitative trait locus mapping methods. They show that large effects due to a single locus may be due to multiple associated polymorphisms (or sequential fixations in isolated populations) rather than individual mutations of large effect. PMID:8978044
FastGCN: A GPU Accelerated Tool for Fast Gene Co-Expression Networks
Liang, Meimei; Zhang, Futao; Jin, Gulei; Zhu, Jun
2015-01-01
Gene co-expression networks are a valuable type of biological network. Many methods and tools have been published to construct gene co-expression networks; however, most of them are inconvenient and time consuming for large datasets. We have developed a user-friendly, accelerated and optimized tool for constructing gene co-expression networks that can fully harness the parallel nature of GPU (Graphics Processing Unit) architectures. Genetic entropies were exploited to filter out genes with no or small expression changes in the raw data preprocessing step. Pearson correlation coefficients were then calculated. After that, we normalized these coefficients and employed the False Discovery Rate to control for multiple testing. Finally, module identification was conducted to construct the co-expression networks. All of these calculations were implemented on a GPU. We also compressed the coefficient matrix to save space. We compared the performance of the GPU implementation with those of multi-core CPU implementations with 16 CPU threads, a single-thread C/C++ implementation and a single-thread R implementation. Our results show that the GPU implementation largely outperforms the single-thread C/C++ and single-thread R implementations, and outperforms the multi-core CPU implementation when the number of genes increases. With the test dataset containing 16,000 genes and 590 individuals, we can achieve greater than 63 times the speed using a GPU implementation compared with a single-thread R implementation when 50 percent of genes were filtered out and about 80 times the speed when no genes were filtered out. PMID:25602758
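The first stages of the pipeline described in this abstract (entropy filter, Pearson correlation, normalization, FDR control) can be illustrated with a small CPU-only sketch. This is not the FastGCN code: the function names, the histogram-based entropy, the Fisher z-transform used as the "normalization" step, and the Benjamini-Hochberg procedure are all assumptions chosen for illustration, and module identification and the GPU implementation are omitted.

```python
import math
import numpy as np

def entropy_filter(expr, n_bins=10, keep_fraction=0.5):
    """Return sorted indices of the genes (rows) with the highest Shannon
    entropy of their binned expression values (assumed filter criterion)."""
    entropies = []
    for row in expr:
        counts, _ = np.histogram(row, bins=n_bins)
        p = counts[counts > 0] / counts.sum()
        entropies.append(-(p * np.log2(p)).sum())
    n_keep = max(2, int(round(keep_fraction * expr.shape[0])))
    top = np.argsort(np.array(entropies))[::-1][:n_keep]
    return np.sort(top)

def benjamini_hochberg(pvals, alpha=0.05):
    """Boolean mask of hypotheses rejected at FDR level alpha."""
    p = np.asarray(pvals, dtype=float)
    m = p.size
    order = np.argsort(p)
    below = p[order] <= alpha * np.arange(1, m + 1) / m
    reject = np.zeros(m, dtype=bool)
    if below.any():
        k = np.max(np.nonzero(below)[0])  # largest rank passing the BH bound
        reject[order[:k + 1]] = True
    return reject

def coexpression_edges(expr, alpha=0.05):
    """Return (gene_i, gene_j, r) for gene pairs whose Pearson correlation
    is significant after FDR control. expr has shape (genes, samples)."""
    n_genes, n_samples = expr.shape
    r = np.corrcoef(expr)
    iu = np.triu_indices(n_genes, k=1)  # each unordered pair once
    r_pairs = np.clip(r[iu], -0.999999, 0.999999)
    # Assumed normalization: Fisher z-transform, approximately standard
    # normal under the null after scaling by sqrt(n_samples - 3).
    z = np.arctanh(r_pairs) * math.sqrt(n_samples - 3)
    pvals = np.array([math.erfc(abs(v) / math.sqrt(2)) for v in z])
    keep = benjamini_hochberg(pvals, alpha)
    return [(int(i), int(j), float(rv))
            for i, j, rv, k in zip(iu[0], iu[1], r_pairs, keep) if k]
```

For example, with three toy genes where the second is a linear function of the first and the third is uncorrelated with both, `coexpression_edges` would be expected to report only the pair (0, 1).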
Functional and topological characteristics of mammalian regulatory domains
Symmons, Orsolya; Uslu, Veli Vural; Tsujimura, Taro; Ruf, Sandra; Nassari, Sonya; Schwarzer, Wibke; Ettwiller, Laurence; Spitz, François
2014-01-01
Long-range regulatory interactions play an important role in shaping gene-expression programs. However, the genomic features that organize these activities are still poorly characterized. We conducted a large operational analysis to chart the distribution of gene regulatory activities along the mouse genome, using hundreds of insertions of a regulatory sensor. We found that enhancers distribute their activities along broad regions and not in a gene-centric manner, defining large regulatory domains. Remarkably, these domains correlate strongly with the recently described TADs, which partition the genome into distinct self-interacting blocks. Different features, including specific repeats and CTCF-binding sites, correlate with the transition zones separating regulatory domains, and may help to further organize promiscuously distributed regulatory influences within large domains. These findings support a model of genomic organization where TADs confine regulatory activities to specific but large regulatory domains, contributing to the establishment of specific gene expression profiles. PMID:24398455
Kawamura, Kiyoko; Wada, Akihiko; Wang, Ji-Yang; Li, Quanhai; Ishii, Akihiro; Tsujimura, Hideki; Takagi, Toshiyuki; Itami, Makiko; Tada, Yuji; Tatsumi, Koichiro; Shimada, Hideaki; Hiroshima, Kenzo; Tagawa, Masatoshi
2016-01-01
Activation-induced cytidine deaminase (AID) is involved in somatic hypermutation and class switch recombination processes in the antibody formation. The AID activity induces gene mutations and could be associated with transformation processes of B cells. Nevertheless, the relation between AID expression and the prognosis of B cell lymphoma patients remains uncharacterized. We examined expression levels of the AID gene in 89 lymph node specimens from lymphoma and non-lymphoma patients with Northern blot analysis and investigated an association with their survival. The AID gene was preferentially expressed in B cell lymphoma in particular in diffuse large B cell lymphoma and follicular lymphoma. We confirmed AID protein expression in the mRNA-positive but not in the negative specimens with Western blot analysis and immunohistochemical staining. Survival of the patients treated with cyclophosphamide-/doxorubicin-/vincristine-/prednisone-based chemotherapy demonstrated that the prognosis of diffuse large B cell patients was unfavorable in the mRNA-positive group compared with the negative group, and that AID expression levels were correlated with the poor prognosis. In contrast, AID expression was not linked with the prognosis of follicular lymphoma patients. AID expression is a predictive marker for an unfavorable outcome in DLBCL patients treated with the chemotherapy.
NASA Astrophysics Data System (ADS)
Moin, Mazahar; Bakshi, Achala; Madhav, M. S.; Kirti, P. B.
2017-11-01
Our previous findings on the screening of a large pool of activation tagged rice plants grown under limited water conditions revealed the activation of Ribosomal Protein Large (RPL) subunit genes, RPL6 and RPL23A in two mutants that exhibited high water-use efficiency (WUE) with the genes getting activated by the integrated 4x enhancers (Moin et al., 2016a). In continuation of these findings, we have comprehensively characterized the Ribosomal Protein (RP) gene family including both small (RPS) and large (RPL) subunits, which have been identified to be encoded by at least 70 representative genes; RP-genes exist as multiple expressed copies with high nucleotide and amino acid sequence similarity. The differential expression of all the representative genes in rice was performed under limited water and drought conditions at progressive time intervals in the present study. More than 50% of the RP genes were upregulated in both shoot and root tissues. Some of them exhibited an overlap in the upregulation under both the treatments indicating that they might have a common role in inducing tolerance under limited water and drought conditions. Among the genes that became significantly upregulated in both the tissues and under both the treatments are RPL6, 7, 23A, 24 and 31 and RPS4, 10 and 18a. To further validate the role of RP genes in WUE and inducing tolerance to other stresses, we have raised transgenic plants overexpressing RPL23A in rice. The high expression lines of RPL23A exhibited low Δ13C, increased quantum efficiency along with suitable growth and yield parameters with respect to negative control under the conditions of limited water availability. The constitutive expression of RPL23A was also associated with transcriptional upregulation of many other RPL and RPS genes. The seedlings of RPL23A high expression lines also showed a significant increase in fresh weight, root length, proline and chlorophyll contents under simulated drought and salt stresses. Taken together, our findings provide a secure basis for the RPL gene family expression as a potential resource for exploring abiotic stress tolerant properties in rice.
Methylomics of gene expression in human monocytes
Liu, Yongmei; Ding, Jingzhong; Reynolds, Lindsay M.; Lohman, Kurt; Register, Thomas C.; De La Fuente, Alberto; Howard, Timothy D.; Hawkins, Greg A.; Cui, Wei; Morris, Jessica; Smith, Shelly G.; Barr, R. Graham; Kaufman, Joel D.; Burke, Gregory L.; Post, Wendy; Shea, Steven; Mccall, Charles E.; Siscovick, David; Jacobs, David R.; Tracy, Russell P.; Herrington, David M.; Hoeschele, Ina
2013-01-01
DNA methylation is one of several epigenetic mechanisms that contribute to the regulation of gene expression; however, the extent to which methylation of CpG dinucleotides correlates with gene expression at the genome-wide level is still largely unknown. Using purified primary monocytes from subjects in a large community-based cohort (n = 1264), we characterized methylation (>485 000 CpG sites) and mRNA expression (>48K transcripts) and carried out genome-wide association analyses of 8370 expression phenotypes. We identified 11 203 potential cis-acting CpG loci whose degree of methylation was associated with gene expression (eMS) at a false discovery rate threshold of 0.001. Most of the associations were consistent in effect size and direction of effect across sex and three ethnicities. Contrary to expectation, these eMS were not predominantly enriched in promoter regions, or CpG islands, but rather in the 3′ UTR, gene bodies, CpG shores or ‘offshore’ sites, and both positive and negative correlations between methylation and expression were observed across all locations. eMS were enriched for regions predicted to be regulatory by ENCODE (Encyclopedia of DNA Elements) data in multiple cell types, particularly enhancers. One of the strongest association signals detected (P < 2.2 × 10^−308) was a methylation probe (cg17005068) in the promoter/enhancer region of the glutathione S-transferase theta 1 gene (GSTT1, encoding the detoxification enzyme) with GSTT1 mRNA expression. Our study provides a detailed description of the epigenetic architecture in human monocytes and its relationship to gene expression. These data may help prioritize interrogation of biologically relevant methylation loci and provide new insights into the epigenetic basis of human health and diseases. PMID:23900078
Expression of forkhead box transcription factor genes Foxp1 and Foxp2 during jaw development.
Cesario, Jeffry M; Almaidhan, Asma A; Jeong, Juhee
2016-03-01
Development of the face is regulated by a large number of genes that are expressed in temporally and spatially specific patterns. While significant progress has been made on characterizing the genes that operate in the oral region of the face, those regulating development of the aboral (lateral) region remain largely unknown. Recently, we discovered that transcription factors LIM homeobox (LHX) 6 and LHX8, which are key regulators of oral development, repressed the expression of the genes encoding forkhead box transcription factors, Foxp1 and Foxp2, in the oral region. To gain insights into the potential role of the Foxp genes in region-specific development of the face, we examined their expression patterns in the first pharyngeal arch (primordium for the jaw) of mouse embryos at a high spatial and temporal resolution. Foxp1 and Foxp2 were preferentially expressed in the aboral and posterior parts of the first pharyngeal arch, including the developing temporomandibular joint. Through double immunofluorescence and double fluorescent RNA in situ hybridization, we found that Foxp1 was expressed in the progenitor cells for the muscle, bone, and connective tissue. Foxp2 was expressed in subsets of bone and connective tissue progenitors but not in the myoblasts. Neither gene was expressed in the dental mesenchyme nor in the oral half of the palatal shelf undergoing extensive growth and morphogenesis. Together, we demonstrated for the first time that Foxp1 and Foxp2 are expressed during craniofacial development. Our data suggest that the Foxp genes may regulate development of the aboral and posterior regions of the jaw. Copyright © 2016 Elsevier B.V. All rights reserved.
Functional regression method for whole genome eQTL epistasis analysis with sequencing data.
Xu, Kelin; Jin, Li; Xiong, Momiao
2017-05-18
Epistasis plays an essential role in understanding the regulation mechanisms and is an essential component of the genetic architecture of gene expression. However, interaction analysis of gene expression remains fundamentally unexplored due to great computational challenges and limited data availability. Due to variation in splicing, transcription start sites, polyadenylation sites, post-transcriptional RNA editing across the entire gene, and transcription rates of the cells, RNA-seq measurements generate large expression variability and collectively create the observed position-level read count curves. A single number for measuring gene expression, which is widely used for microarray-based gene expression analysis, is highly unlikely to sufficiently account for large expression variation across the gene. Simultaneously analyzing epistatic architecture using RNA-seq and whole genome sequencing (WGS) data poses enormous challenges. We develop a nonlinear functional regression model (FRGM) with functional responses, where the position-level read counts within a gene are taken as a function of genomic position, and functional predictors, where genotype profiles are viewed as a function of genomic position, for epistasis analysis with RNA-seq data. Instead of testing the interaction of all possible pairwise SNPs, the FRGM takes a gene as the basic unit for epistasis analysis: it tests for the interaction of all possible pairs of genes and uses all the information that can be accessed to collectively test interaction between all possible pairs of SNPs within two genome regions. By large-scale simulations, we demonstrate that the proposed FRGM for epistasis analysis can achieve the correct type 1 error and has higher power to detect the interactions between genes than the existing methods. The proposed methods are applied to the RNA-seq and WGS data from the 1000 Genomes Project. The numbers of pairs of significantly interacting genes after Bonferroni correction identified using FRGM, RPKM and DESeq were 16,2361, 260 and 51, respectively, from the 350 European samples. The proposed FRGM for epistasis analysis of RNA-seq can capture isoform and position-level information and will have a broad application. Both simulations and real data analysis highlight the potential for the FRGM to be a good choice for epistasis analysis with sequencing data.
Hudson, Sandra; Wang, Dongliang; Middleton, Frank; Nevaldine, Barbara H; Naous, Rana; Hutchison, Robert E
2018-04-26
Anaplastic lymphoma kinase (ALK)-positive anaplastic large cell lymphoma (ALCL) shows 60-70% event free survival with standard treatments. Targeted therapies are being tested for increased benefit and/or reduced toxicity, but interactions with standard agents are not well known. We exposed four ALCL cell lines to two targeted agents, crizotinib and brentuximab vedotin, and to two standard agents, doxorubicin and vinblastine. For each agent and combination, we measured apoptosis and expression of approximately 300 previously annotated genes of interest using targeted RNA-sequencing. An aurora kinase inhibitor, alisertib, was similarly tested for gene expression effects. Only crizotinib, alone or in combination, showed significant effects (adjusted P < 0.05) on expression and apoptosis. One hundred and nine of 277 gene expressions showed crizotinib-associated differential expression, mostly downregulation, 62 associated with apoptosis, and 28 associated with both crizotinib and apoptosis. Doxorubicin was antagonistic with crizotinib on gene expression and apoptosis. Brentuximab was synergistic with crizotinib in apoptosis, and not antagonistic in gene expression. Vinblastine also appeared synergistic with crizotinib but did not achieve statistical significance. Alisertib did not show significant expression changes. Our data suggest that crizotinib induces apoptosis through orderly changes in cell signaling associated with ALK inhibition. Expression effects of crizotinib and associated apoptosis are antagonized by doxorubicin, but apoptosis is synergized by brentuximab vedotin and possibly vinblastine. These findings suggest that concurrent use of crizotinib and doxorubicin may be counterproductive, while the pairing of crizotinib with brentuximab (or vinblastine) may increase efficacy. Alisertib did not induce expression changes at cytotoxic dosage. © 2018 Wiley Periodicals, Inc.
Carlson, Kimberly A.; Gardner, Kylee; Pashaj, Anjeza; Carlson, Darby J.; Yu, Fang; Eudy, James D.; Zhang, Chi; Harshman, Lawrence G.
2015-01-01
Aging is a complex process characterized by a steady decline in an organism's ability to perform life-sustaining tasks. In the present study, two cages of approximately 12,000 mated Drosophila melanogaster females were used as a source of RNA from individuals sampled frequently as a function of age. A linear model for microarray data method was used for the microarray analysis to adjust for the box effect; it identified 1,581 candidate aging genes. Cluster analyses using a self-organizing map algorithm on the 1,581 significant genes identified gene expression patterns across different ages. Genes involved in immune system function and regulation, chorion assembly and function, and metabolism were all significantly differentially expressed as a function of age. The temporal pattern of data indicated that gene expression related to aging is affected relatively early in life span. In addition, the temporal variance in gene expression in immune function genes was compared to a random set of genes. There was an increase in the variance of gene expression within each cohort, which was not observed in the set of random genes. This observation is compatible with the hypothesis that D. melanogaster immune function genes lose control of gene expression as flies age. PMID:26090231
Czechowski, Tomasz; Stitt, Mark; Altmann, Thomas; Udvardi, Michael K.; Scheible, Wolf-Rüdiger
2005-01-01
Gene transcripts with invariant abundance during development and in the face of environmental stimuli are essential reference points for accurate gene expression analyses, such as RNA gel-blot analysis or quantitative reverse transcription-polymerase chain reaction (PCR). An exceptionally large set of data from Affymetrix ATH1 whole-genome GeneChip studies provided the means to identify a new generation of reference genes with very stable expression levels in the model plant species Arabidopsis (Arabidopsis thaliana). Hundreds of Arabidopsis genes were found that outperform traditional reference genes in terms of expression stability throughout development and under a range of environmental conditions. Most of these were expressed at much lower levels than traditional reference genes, making them very suitable for normalization of gene expression over a wide range of transcript levels. Specific and efficient primers were developed for 22 genes and tested on a diverse set of 20 cDNA samples. Quantitative reverse transcription-PCR confirmed superior expression stability and lower absolute expression levels for many of these genes, including genes encoding a protein phosphatase 2A subunit, a coatomer subunit, and an ubiquitin-conjugating enzyme. The developed PCR primers or hybridization probes for the novel reference genes will enable better normalization and quantification of transcript levels in Arabidopsis in the future. PMID:16166256
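A common way to rank candidate reference genes by expression stability is the coefficient of variation (standard deviation divided by mean) across samples. The sketch below assumes that criterion and positive, linear-scale expression values; the function name and the toy gene labels are hypothetical and are not taken from the study.

```python
import numpy as np

def rank_by_stability(expr, gene_ids):
    """Rank genes from most to least stable by coefficient of variation.
    expr has shape (genes, samples) with positive linear-scale values."""
    expr = np.asarray(expr, dtype=float)
    mean = expr.mean(axis=1)
    sd = expr.std(axis=1, ddof=1)  # sample standard deviation
    cv = sd / mean
    order = np.argsort(cv)  # smallest CV first, i.e. most stable first
    return [(gene_ids[i], float(cv[i])) for i in order]
```

A gene with identical values in every sample has a CV of zero and is ranked first; genes whose expression fluctuates widely relative to their mean fall to the end of the list.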
Kramer, Maxwell; Rao, Prashant; Ercan, Sevinc
2016-01-01
Dosage compensation mechanisms equalize the level of X chromosome expression between sexes. Yet the X chromosome is often enriched for genes exhibiting sex-biased, i.e., imbalanced expression. The relationship between X chromosome dosage compensation and sex-biased gene expression remains largely unexplored. Most studies determine sex-biased gene expression without distinguishing between contributions from X chromosome copy number (dose) and the animal’s sex. Here, we uncoupled X chromosome dose from sex-specific gene regulation in Caenorhabditis elegans to determine the effect of each on X expression. In early embryogenesis, when dosage compensation is not yet fully active, X chromosome dose drives the hermaphrodite-biased expression of many X-linked genes, including several genes that were shown to be responsible for hermaphrodite fate. A similar effect is seen in the C. elegans germline, where X chromosome dose contributes to higher hermaphrodite X expression, suggesting that lack of dosage compensation in the germline may have a role in supporting higher expression of X chromosomal genes with female-biased functions in the gonad. In the soma, dosage compensation effectively balances X expression between the sexes. As a result, somatic sex-biased expression is almost entirely due to sex-specific gene regulation. These results suggest that lack of dosage compensation in different tissues and developmental stages allows X chromosome copy number to contribute to sex-biased gene expression and function. PMID:27356611
Hershkovitz, Eli; Loewenthal, Neta; Peretz, Asaf; Parvari, Ruti
2008-01-01
X-linked Kallmann syndrome (KS) is caused mainly by point mutations in the KAL1 gene. Large deletions >1 Mb are rare events in the human population and commonly result in contiguous gene syndromes. A search for the mutation causing KS was carried out on two pairs of first-degree cousins of 2 sisters. Two different, apparently independent deletions were found. The deleted sequences encompass the KAL1 gene and four known additional genes exclusively expressed in testis. Two of these genes belong to the FAM9 gene family, which shares some homology with the SYCP3 gene, previously implicated in azoospermia. One of the events causing the deletion may have been mediated by an L1 transposition, the other by non-homologous end joining. Such non-homologous recombinations have not yet been reported in the KAL genomic region, and thus this area may be more prone to deletions than previously expected. This is the first report on genetic characterization of KS with a deletion of solely testis-expressed genes. The absence of these genes may have unfavorable implications for the patients regarding future fertility. (c) 2008 S. Karger AG, Basel
Hong, Y K; Kim, D H; Beletskii, A; Lee, C; Memili, E; Strauss, W M
2001-04-01
Most conditional expression vectors designed for mammalian cells have been valuable systems for studying genes of interest by regulating their expression. The available vectors, however, are reliable for short cDNA clones and not optimal for relatively long fragments of genomic DNA or long cDNAs. Here, we report the construction of two bacterial artificial chromosome (BAC) vectors capable of harboring large inserts and shuttling among Escherichia coli, yeast, and mammalian cells. These two vectors, pEYMT and pEYMI, contain conditional expression systems which are designed to be regulated by tetracycline and mouse interferons, respectively. To test the properties of the vectors, we cloned into both vectors the green fluorescent protein (GFP) gene through an in vitro ligation reaction and the 17.8-kb-long X-inactive-specific transcript (Xist) cDNA through homologous recombination in yeast. Subsequently, we characterized their regulated expression properties using real-time quantitative RT-PCR (TaqMan) and RNA-fluorescent in situ hybridization (FISH). We demonstrate that these two BAC vectors are good systems for recombination-based cloning and regulated expression of large genes in mammalian cells. Copyright 2001 Academic Press.
Computing and Applying Atomic Regulons to Understand Gene Expression and Regulation
Faria, José P.; Davis, James J.; Edirisinghe, Janaka N.; Taylor, Ronald C.; Weisenhorn, Pamela; Olson, Robert D.; Stevens, Rick L.; Rocha, Miguel; Rocha, Isabel; Best, Aaron A.; DeJongh, Matthew; Tintle, Nathan L.; Parrello, Bruce; Overbeek, Ross; Henry, Christopher S.
2016-01-01
Understanding gene function and regulation is essential for the interpretation, prediction, and ultimate design of cell responses to changes in the environment. An important step toward meeting the challenge of understanding gene function and regulation is the identification of sets of genes that are always co-expressed. These gene sets, Atomic Regulons (ARs), represent fundamental units of function within a cell and could be used to associate genes of unknown function with cellular processes and to enable rational genetic engineering of cellular systems. Here, we describe an approach for inferring ARs that leverages large-scale expression data sets, gene context, and functional relationships among genes. We computed ARs for Escherichia coli based on 907 gene expression experiments and compared our results with gene clusters produced by two prevalent data-driven methods: Hierarchical clustering and k-means clustering. We compared ARs and purely data-driven gene clusters to the curated set of regulatory interactions for E. coli found in RegulonDB, showing that ARs are more consistent with gold standard regulons than are data-driven gene clusters. We further examined the consistency of ARs and data-driven gene clusters in the context of gene interactions predicted by Context Likelihood of Relatedness (CLR) analysis, finding that the ARs show better agreement with CLR predicted interactions. We determined the impact of increasing amounts of expression data on AR construction and find that while more data improve ARs, it is not necessary to use the full set of gene expression experiments available for E. coli to produce high quality ARs. In order to explore the conservation of co-regulated gene sets across different organisms, we computed ARs for Shewanella oneidensis, Pseudomonas aeruginosa, Thermus thermophilus, and Staphylococcus aureus, each of which represents increasing degrees of phylogenetic distance from E. coli. 
Comparison of the organism-specific ARs showed that the consistency of AR gene membership correlates with phylogenetic distance, but there is clear variability in the regulatory networks of closely related organisms. As large scale expression data sets become increasingly common for model and non-model organisms, comparative analyses of atomic regulons will provide valuable insights into fundamental regulatory modules used across the bacterial domain. PMID:27933038
Co-expression network analysis of duplicate genes in maize (Zea mays L.) reveals no subgenome bias.
Li, Lin; Briskine, Roman; Schaefer, Robert; Schnable, Patrick S; Myers, Chad L; Flagel, Lex E; Springer, Nathan M; Muehlbauer, Gary J
2016-11-04
Gene duplication is prevalent in many species and can result in coding and regulatory divergence. Gene duplications can be classified as whole genome duplication (WGD), tandem and inserted (non-syntenic). In maize, WGD resulted in the subgenomes maize1 and maize2, of which maize1 is considered the dominant subgenome. However, the landscape of co-expression network divergence of duplicate genes in maize is still largely uncharacterized. To address the consequence of gene duplication on co-expression network divergence, we developed a gene co-expression network from RNA-seq data derived from 64 different tissues/stages of the maize reference inbred B73. WGD, tandem and inserted gene duplications exhibited distinct regulatory divergence. Inserted duplicate genes were more likely to be singletons in the co-expression networks, while WGD duplicate genes were likely to be co-expressed with other genes. Tandem duplicate genes were enriched in the co-expression pattern where co-expressed genes were nearly identical for the duplicates in the network. Older gene duplications exhibit more extensive co-expression variation than younger duplications. Overall, non-syntenic genes primarily from inserted duplications show more co-expression divergence. Also, such enlarged co-expression divergence is significantly related to duplication age. Moreover, subgenome dominance was not observed in the co-expression networks: maize1 and maize2 exhibit similar levels of intra-subgenome correlations. Intriguingly, the level of inter-subgenome co-expression was similar to the level of intra-subgenome correlations, genes from specific subgenomes were not likely to be enriched in co-expression network modules, and the hub genes were not predominantly from any specific subgenome in maize. 
Our work provides a comprehensive analysis of maize co-expression network divergence for three different types of gene duplications and identifies potential relationships between duplication types, duplication ages and co-expression consequences.
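As an illustration of how a co-expression network can be derived from expression profiles, the sketch below links two genes when the Pearson correlation of their profiles across tissues meets a cutoff. The cutoff of 0.9, the gene names, and the expression values are assumptions made for this example; the abstract does not state how the maize network was actually built.

```python
from itertools import combinations
from math import sqrt


def pearson(x, y):
    """Pearson correlation between two equal-length expression profiles."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sd_x = sqrt(sum((a - mean_x) ** 2 for a in x))
    sd_y = sqrt(sum((b - mean_y) ** 2 for b in y))
    if sd_x == 0 or sd_y == 0:
        raise ValueError("a constant profile has no defined correlation")
    return cov / (sd_x * sd_y)


def coexpression_edges(profiles, threshold=0.9):
    """Return gene pairs whose profiles correlate at or above the threshold."""
    edges = []
    for g1, g2 in combinations(sorted(profiles), 2):
        if pearson(profiles[g1], profiles[g2]) >= threshold:
            edges.append((g1, g2))
    return edges


# Hypothetical profiles over four tissues; names are placeholders.
profiles = {
    "dup_1": [1.0, 2.0, 3.0, 4.0],
    "dup_2": [2.0, 4.0, 6.0, 8.0],  # same pattern as dup_1, higher level
    "other": [4.0, 1.0, 3.0, 2.0],  # unrelated pattern
}
edges = coexpression_edges(profiles)
print(edges)
```

In a network built this way, a gene with no edges would be a singleton, and the divergence of a duplicate pair could be gauged by how many co-expressed partners the two copies share.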
Müller, Christian; Schillert, Arne; Röthemeier, Caroline; Trégouët, David-Alexandre; Proust, Carole; Binder, Harald; Pfeiffer, Norbert; Beutel, Manfred; Lackner, Karl J.; Schnabel, Renate B.; Tiret, Laurence; Wild, Philipp S.; Blankenberg, Stefan
2016-01-01
Technical variation plays an important role in microarray-based gene expression studies, and batch effects explain a large proportion of this noise. It is therefore mandatory to eliminate technical variation while maintaining biological variability. Several strategies have been proposed for the removal of batch effects, although they have not been evaluated in large-scale longitudinal gene expression data. In this study, we aimed at identifying a suitable method for batch effect removal in a large study of microarray-based longitudinal gene expression. Monocytic gene expression was measured in 1092 participants of the Gutenberg Health Study at baseline and 5-year follow up. Replicates of selected samples were measured at both time points to identify technical variability. Deming regression, Passing-Bablok regression, linear mixed models, non-linear models as well as ReplicateRUV and ComBat were applied to eliminate batch effects between replicates. In a second step, quantile normalization prior to batch effect correction was performed for each method. Technical variation between batches was evaluated by principal component analysis. Associations between body mass index and transcriptomes were calculated before and after batch removal. Results from association analyses were compared to evaluate maintenance of biological variability. Quantile normalization, separately performed in each batch, combined with ComBat successfully reduced batch effects and maintained biological variability. ReplicateRUV performed perfectly in the replicate data subset of the study, but failed when applied to all samples. All other methods did not substantially reduce batch effects in the replicate data subset. Quantile normalization plus ComBat appears to be a valuable approach for batch correction in longitudinal gene expression data. PMID:27272489
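The quantile normalization step mentioned above can be sketched in a few lines. This is a minimal illustration in plain Python, assuming each sample is a list of expression values for the same genes in the same order; ties are broken by position rather than averaged, and the ComBat step is not shown. The input values are hypothetical.

```python
def quantile_normalize(samples):
    """Give every sample the same distribution of values.

    samples: list of equal-length lists, one list per sample.
    Each value is replaced by the mean, across samples, of the values
    that share its rank. Ties are broken by position (a simplification).
    """
    n_genes = len(samples[0])
    if any(len(s) != n_genes for s in samples):
        raise ValueError("all samples must cover the same number of genes")

    # Mean of the k-th smallest value across samples, for each rank k.
    sorted_samples = [sorted(s) for s in samples]
    rank_means = [
        sum(s[k] for s in sorted_samples) / len(samples) for k in range(n_genes)
    ]

    normalized = []
    for sample in samples:
        # Gene indices ordered from lowest to highest expression.
        order = sorted(range(n_genes), key=lambda i: sample[i])
        out = [0.0] * n_genes
        for rank, gene_index in enumerate(order):
            out[gene_index] = rank_means[rank]
        normalized.append(out)
    return normalized


# Two hypothetical samples measured on three genes.
normalized = quantile_normalize([[5.0, 2.0, 3.0], [4.0, 1.0, 6.0]])
print(normalized)
```

Following the abstract, such a function would be applied separately to the samples of each batch before a dedicated batch-correction method is run on the combined data.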
GESearch: An Interactive GUI Tool for Identifying Gene Expression Signature.
Ye, Ning; Yin, Hengfu; Liu, Jingjing; Dai, Xiaogang; Yin, Tongming
2015-01-01
The huge amount of gene expression data generated by microarray and next-generation sequencing technologies presents challenges for extracting its biological meaning. When searching for coexpressed genes, the data mining process is strongly affected by the choice of algorithm. Thus, it is highly desirable to provide multiple algorithm options in a user-friendly analytical toolkit for exploring gene expression signatures. For this purpose, we developed GESearch, an interactive graphical user interface (GUI) toolkit, which is written in MATLAB and supports a variety of gene expression data files. This analytical toolkit provides four models, including the mean, the regression, the delegate, and the ensemble models, to identify coexpressed genes, and enables users to filter data and to select gene expression patterns by browsing the display window or by importing knowledge-based genes. Subsequently, the utility of this analytical toolkit is demonstrated by analyzing two real-life microarray datasets from cell-cycle experiments. Overall, we have developed an interactive GUI toolkit that offers a choice of multiple algorithms for analyzing gene expression signatures.
Groten, Karin; Pahari, Nabin T; Xu, Shuqing; Miloradovic van Doorn, Maja; Baldwin, Ian T
2015-01-01
Most land plants live in a symbiotic association with arbuscular mycorrhizal fungi (AMF) that belong to the phylum Glomeromycota. Although a number of plant genes involved in the plant-AMF interactions have been identified by analyzing mutants, the ability to rapidly manipulate gene expression to study the potential functions of new candidate genes remains unrealized. We analyzed changes in gene expression of wild tobacco roots (Nicotiana attenuata) after infection with mycorrhizal fungi (Rhizophagus irregularis) by serial analysis of gene expression (SuperSAGE) combined with next generation sequencing, and established a virus-induced gene-silencing protocol to study the function of candidate genes in the interaction. From 92,434 SuperSAGE Tag sequences, 32,808 (35%) matched with our in-house Nicotiana attenuata transcriptome database and 3,698 (4%) matched to Rhizophagus genes. In total, 11,194 Tags showed a significant change in expression (p<0.05, >2-fold change) after infection. When comparing the functions of highly up-regulated annotated Tags in this study with those of two previous large-scale gene expression studies, 18 gene functions were found to be up-regulated in all three studies mainly playing roles related to phytohormone metabolism, catabolism and defense. To validate the function of identified candidate genes, we used the technique of virus-induced gene silencing (VIGS) to silence the expression of three putative N. attenuata genes: germin-like protein, indole-3-acetic acid-amido synthetase GH3.9 and, as a proof-of-principle, calcium and calmodulin-dependent protein kinase (CCaMK). The silencing of the three plant genes in roots was successful, but only CCaMK silencing had a significant effect on the interaction with R. irregularis. Interestingly, when a highly activated inoculum was used for plant inoculation, the effect of CCaMK silencing on fungal colonization was masked, probably due to trans-complementation. 
This study demonstrates that, in large-scale gene expression studies across different species, a core set of genes of similar functions is induced. However, additional factors seem to influence the overall pattern of gene expression, resulting in high variability among independent studies with different hosts. We conclude that VIGS is a powerful tool with which to investigate the function of genes involved in plant-AMF interactions but that inoculum strength can strongly influence the outcome of the interaction.
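The significance criterion quoted in this abstract (p<0.05, >2-fold change) can be expressed as a simple filter. The sketch below assumes the fold change is the ratio of infected to control abundance and counts changes in either direction; the tag names, counts, and p-values are hypothetical, and the statistical test that produces the p-values is not shown.

```python
def significant_tags(tags, p_cutoff=0.05, fold_cutoff=2.0):
    """Keep tags with p < p_cutoff and a more than fold_cutoff-fold change.

    tags: iterable of (name, control_level, infected_level, p_value).
    A change counts in either direction (up- or down-regulated).
    """
    kept = []
    for name, control, infected, p_value in tags:
        if control <= 0 or infected <= 0:
            # Fold change is undefined without a pseudocount; skipped here.
            continue
        fold = infected / control
        changed = fold > fold_cutoff or fold < 1.0 / fold_cutoff
        if p_value < p_cutoff and changed:
            kept.append(name)
    return kept


# Hypothetical tag abundances and p-values.
tags = [
    ("tag_1", 10.0, 35.0, 0.01),   # 3.5-fold up, significant
    ("tag_2", 10.0, 15.0, 0.001),  # significant but only 1.5-fold
    ("tag_3", 40.0, 10.0, 0.03),   # 4-fold down, significant
    ("tag_4", 10.0, 50.0, 0.20),   # 5-fold up but not significant
]
kept = significant_tags(tags)
print(kept)
```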
Saha, Anusree; Das, Shubhajit; Moin, Mazahar; Dutta, Mouboni; Bakshi, Achala; Madhav, M. S.; Kirti, P. B.
2017-01-01
Ribosomal proteins (RPs) are indispensable in ribosome biogenesis and protein synthesis, and play a crucial role in diverse developmental processes. Our previous studies on Ribosomal Protein Large subunit (RPL) genes provided insights into their stress responsive roles in rice. In the present study, we have explored the developmental and stress regulated expression patterns of Ribosomal Protein Small (RPS) subunit genes for their differential expression in a spatiotemporal and stress dependent manner. We have also performed an in silico analysis of gene structure, cis-elements in upstream regulatory regions, protein properties and phylogeny. Expression studies of the 34 RPS genes in 13 different tissues of rice covering major growth and developmental stages revealed that their expression was substantially elevated, mostly in shoots and leaves, indicating their possible involvement in the development of vegetative organs. The majority of the RPS genes have manifested significant expression under all abiotic stress treatments with ABA, PEG, NaCl, and H2O2. Infection with important rice pathogens, Xanthomonas oryzae pv. oryzae (Xoo) and Rhizoctonia solani, also induced the up-regulation of several of the RPS genes. RPS4, 13a, 18a, and 4a have shown higher transcript levels under all the abiotic stresses, whereas RPS4 is up-regulated in both the biotic stress treatments. The information obtained from the present investigation would be useful in appreciating the possible stress-regulatory attributes of the genes coding for rice ribosomal small subunit proteins apart from their functions as house-keeping proteins. A detailed functional analysis of independent genes is required to study their roles in stress tolerance and generating stress-tolerant crops. PMID:28966624
Expression profiles of urbilaterian genes uniquely shared between honey bee and vertebrates
Matsui, Toshiaki; Yamamoto, Toshiyuki; Wyder, Stefan; Zdobnov, Evgeny M; Kadowaki, Tatsuhiko
2009-01-01
Background: Large-scale comparison of metazoan genomes has revealed that a significant fraction of genes of the last common ancestor of Bilateria (Urbilateria) is lost in each animal lineage. This event could be one of the underlying mechanisms involved in generating metazoan diversity. However, the present functions of these ancient genes have not been addressed extensively. To understand the functions and evolutionary mechanisms of such ancient Urbilaterian genes, we carried out comprehensive expression profile analysis of genes shared between vertebrates and honey bees but not with the other sequenced ecdysozoan genomes (honey bee-vertebrate specific, HVS genes) as a model. Results: We identified 30 honey bee and 55 mouse HVS genes. Many HVS genes exhibited tissue-selective expression patterns; intriguingly, the expression of 60% of honey bee HVS genes was found to be brain enriched, and 24% of mouse HVS genes were highly expressed in either or both the brain and testis. Moreover, a minimum of 38% of mouse HVS genes demonstrated neuron-enriched expression patterns, and 62% of them exhibited expression in selective brain areas, particularly the forebrain and cerebellum. Furthermore, gene ontology (GO) analysis of HVS genes predicted that 35% of genes are associated with DNA transcription and RNA processing. Conclusion: These results suggest that HVS genes include genes that are biased towards expression in the brain and gonads. They also demonstrate that at least some of Urbilaterian genes retained in the specific animal lineage may be selectively maintained to support the species-specific phenotypes. PMID:19138430
Mapping the Shh long-range regulatory domain
Anderson, Eve; Devenney, Paul S.; Hill, Robert E.; Lettice, Laura A.
2014-01-01
Coordinated gene expression controlled by long-distance enhancers is orchestrated by DNA regulatory sequences involving transcription factors and layers of control mechanisms. The Shh gene and well-established regulators are an example of genomic composition in which enhancers reside in a large desert extending into neighbouring genes to control the spatiotemporal pattern of expression. Exploiting the local hopping activity of the Sleeping Beauty transposon, the lacZ reporter gene was dispersed throughout the Shh region to systematically map the genomic features responsible for expression activity. We found that enhancer activities are retained inside a genomic region that corresponds to the topological associated domain (TAD) defined by Hi-C. This domain of approximately 900 kb is in an open conformation over its length and is generally susceptible to all Shh enhancers. Similar to the distal enhancers, an enhancer residing within the Shh second intron activates the reporter gene located at distances of hundreds of kilobases away, suggesting that both proximal and distal enhancers have the capacity to survey the Shh topological domain to recognise potential promoters. The widely expressed Rnf32 gene lying within the Shh domain evades enhancer activities by a process that may be common among other housekeeping genes that reside in large regulatory domains. Finally, the boundaries of the Shh TAD do not represent the absolute expression limits of enhancer activity, as expression activity is lost stepwise at a number of genomic positions at the verges of these domains. PMID:25252942
A Search for Parent-of-Origin Effects on Honey Bee Gene Expression.
Kocher, Sarah D; Tsuruda, Jennifer M; Gibson, Joshua D; Emore, Christine M; Arechavaleta-Velasco, Miguel E; Queller, David C; Strassmann, Joan E; Grozinger, Christina M; Gribskov, Michael R; San Miguel, Phillip; Westerman, Rick; Hunt, Greg J
2015-06-05
Parent-specific gene expression (PSGE) is little known outside of mammals and plants. PSGE occurs when the expression level of a gene depends on whether an allele was inherited from the mother or the father. Kin selection theory predicts that there should be extensive PSGE in social insects because social insect parents can gain inclusive fitness benefits by silencing parental alleles in female offspring. We searched for evidence of PSGE in honey bees using transcriptomes from reciprocal crosses between European and Africanized strains. We found 46 transcripts with significant parent-of-origin effects on gene expression, many of which overexpressed the maternal allele. Interestingly, we also found a large proportion of genes showing a bias toward maternal alleles in only one of the reciprocal crosses. These results indicate that PSGE may occur in social insects. The nonreciprocal effects could be largely driven by hybrid incompatibility between these strains. Future work will help to determine if these are indeed parent-of-origin effects that can modulate inclusive fitness benefits. Copyright © 2015 Kocher et al.
Hu, Shimin; Xu-Monette, Zijun Y.; Balasubramanyam, Aarthi; Manyam, Ganiraju C.; Visco, Carlo; Tzankov, Alexander; Liu, Wei-min; Miranda, Roberto N.; Zhang, Li; Montes-Moreno, Santiago; Dybkær, Karen; Chiu, April; Orazi, Attilio; Zu, Youli; Bhagat, Govind; Richards, Kristy L.; Hsi, Eric D.; Choi, William W. L.; Han van Krieken, J.; Huang, Qin; Huh, Jooryung; Ai, Weiyun; Ponzoni, Maurilio; Ferreri, Andrés J. M.; Zhao, Xiaoying; Winter, Jane N.; Zhang, Mingzhi; Li, Ling; Møller, Michael B.; Piris, Miguel A.; Li, Yong; Go, Ronald S.; Wu, Lin; Medeiros, L. Jeffrey; Young, Ken H.
2013-01-01
CD30, originally identified as a cell-surface marker of Reed-Sternberg and Hodgkin cells of classical Hodgkin lymphoma, is also expressed by several types of non-Hodgkin lymphoma, including a subset of diffuse large B-cell lymphoma (DLBCL). However, the prognostic and biological importance of CD30 expression in DLBCL is unknown. Here we report that CD30 expression is a favorable prognostic factor in a cohort of 903 de novo DLBCL patients. CD30 was expressed in ∼14% of DLBCL patients. Patients with CD30+ DLBCL had superior 5-year overall survival (CD30+, 79% vs CD30–, 59%; P = .001) and progression-free survival (P = .003). The favorable outcome of CD30 expression was maintained in both the germinal center B-cell and activated B-cell subtypes. Gene expression profiling revealed the upregulation of genes encoding negative regulators of nuclear factor κB activation and lymphocyte survival, and downregulation of genes encoding B-cell receptor signaling and proliferation, as well as prominent cytokine and stromal signatures in CD30+ DLBCL patients, suggesting a distinct molecular basis for its favorable outcome. Given the superior prognostic value, unique gene expression signature, and significant value of CD30 as a therapeutic target for brentuximab vedotin in ongoing successful clinical trials, it seems appropriate to consider CD30+ DLBCL as a distinct subgroup of DLBCL. PMID:23343832
Sex-specific gene expression during asexual development of Neurospora crassa.
Wang, Zheng; Kin, Koryu; López-Giráldez, Francesc; Johannesson, Hanna; Townsend, Jeffrey P
2012-07-01
The impact of loci that determine sexual identity upon the asexual, dominant stage of fungal life history has been well studied. To investigate their impact, expression differences between strains of different mating type during asexual development were assayed, with RNA sampled from otherwise largely isogenic mat A and mat a strains of Neurospora crassa at early, middle, and late clonal stages of development. We observed significant differences in overall gene expression between mating types across clonal development, especially at late development stages. The expression levels of mating-type genes and pheromone genes were assayed by reverse transcription and quantitative PCR, revealing expression of pheromone and receptor genes in strains of both mating types in all development stages, and revealing that mating type (mat) genes were increasingly expressed over the course of asexual development. Interestingly, among differentially expressed genes, the mat A genotype more frequently exhibited a higher expression level than mat a, and demonstrated greater transcriptional regulatory dynamism. Significant up-regulation of expression was observed for many late light-responsive genes at late asexual development stages. Further investigation of the impact of light and the roles of light response genes in asexual development of both mating types is warranted. Copyright © 2012 Elsevier Inc. All rights reserved.
Choi, Mi-Jin; Kim, Gun-Do; Kim, Jong-Myoung; Lim, Han Kyu
2015-01-01
The Pacific abalone Haliotis discus hannai is used for commercial aquaculture in Korea. We examined the transcriptome of Pacific abalone siblings using NGS technology to identify genes associated with high growth rates. Pacific abalones grown for 200 days post-fertilization were divided into small-, medium-, and large-size groups with mean weights of 0.26 ± 0.09 g, 1.43 ± 0.405 g, and 5.24 ± 1.09 g, respectively. RNA isolated from the soft tissues of each group was subjected to RNA sequencing. Approximately 1%–3% of the transcripts were differentially expressed in abalones, depending on the growth rate. RT-PCR was carried out on thirty-four genes selected to confirm the relative differences in expression detected by RNA sequencing. Six differentially expressed genes were identified as associated with faster growth of the Pacific abalone. These include five up-regulated genes (including one specific to females) encoding transcripts homologous to incilarin A, perlucin, transforming growth factor-beta-induced protein immunoglobulin-heavy chain 3 (ig-h3), vitelline envelope zona pellucida domain 4, and defensin, and one down-regulated gene encoding tomoregulin in large abalones. Most of the transcripts were expressed predominantly in the hepatopancreas. The genes identified in this study will lead to development of markers for identification of high-growth-rate abalones and female abalones. PMID:26593905
Gene expression analysis of bud and leaf color in tea.
Wei, Kang; Zhang, Yazhen; Wu, Liyun; Li, Hailin; Ruan, Li; Bai, Peixian; Zhang, Chengcai; Zhang, Fen; Xu, Liyi; Wang, Liyuan; Cheng, Hao
2016-10-01
Purple shoot tea, which results from high anthocyanin accumulation, is of great interest for its wide health benefits. To better understand potential mechanisms involved in purple bud and leaf formation in tea plants, we performed transcriptome analysis of six green or purple shoot tea individuals from an F1 population using the Illumina sequencing method. In total, 292 million RNA-Seq reads were obtained and assembled into 112,233 unigenes, with an average length of 759 bp and an N50 of 1081 bp. Moreover, 2193 unigenes showed significant differences in expression levels between green and purple tea samples, with 1143 up- and 1050 down-regulated in the purple teas. Further real-time PCR analysis confirmed the RNA-Seq results. Our study identified 28 differentially expressed transcription factors, and a CsMYB gene was found to be highly similar to AtPAP1 in Arabidopsis. Further analysis of differentially expressed genes involved in anthocyanin biosynthesis and transportation showed that the late biosynthetic genes and genes involved in anthocyanin transportation were largely affected, but the early biosynthetic genes were affected less or not at all. Overall, the identification of a large number of differentially expressed genes offers a global view of the potential mechanisms associated with purple bud and leaf formation, which will facilitate molecular breeding in tea plants. Copyright © 2016 Elsevier Masson SAS. All rights reserved.
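The N50 reported for the assembled unigenes is a standard assembly statistic: the length L such that sequences of length L or longer together contain at least half of all assembled bases. A minimal sketch of the calculation follows; the example lengths are hypothetical and are not taken from the study.

```python
def n50(lengths):
    """Return the N50 of a collection of sequence lengths.

    Sequences are visited from longest to shortest; the N50 is the length
    of the sequence at which the running total first reaches half of the
    summed length of all sequences.
    """
    if not lengths:
        raise ValueError("no sequence lengths given")
    total = sum(lengths)
    running = 0
    for length in sorted(lengths, reverse=True):
        running += length
        if running * 2 >= total:
            return length


# Hypothetical unigene lengths in bp.
example_lengths = [100, 200, 300, 400, 500]
example_n50 = n50(example_lengths)
print(example_n50)
```

Because long sequences carry more weight, the N50 is usually larger than the average length, which is consistent with the reported values of 1081 bp and 759 bp.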
DOE Office of Scientific and Technical Information (OSTI.GOV)
Price, Morgan N.; Arkin, Adam P.; Alm, Eric J.
Operons are a major feature of all prokaryotic genomes, but how and why operon structures vary is not well understood. To elucidate the life-cycle of operons, we compared gene order between Escherichia coli K12 and its relatives and identified the recently formed and destroyed operons in E. coli. This allowed us to determine how operons form, how they become closely spaced, and how they die. Our findings suggest that operon evolution is driven by selection on gene expression patterns. First, both operon creation and operon destruction lead to large changes in gene expression patterns. For example, the removal of lysA and ruvA from ancestral operons that contained essential genes allowed their expression to respond to lysine levels and DNA damage, respectively. Second, some operons have undergone accelerated evolution, with multiple new genes being added during a brief period. Third, although most operons are closely spaced because of a neutral bias towards deletion and because of selection against large overlaps, highly expressed operons tend to be widely spaced because of regulatory fine-tuning by intervening sequences. Although operon evolution seems to be adaptive, it need not be optimal: new operons often comprise functionally unrelated genes that were already in proximity before the operon formed.
Birikh, K R; Lebedenko, E N; Boni, I V; Berlin, Y A
1995-10-27
Synthetic intronless genes, coding for human interleukin 1 alpha (IL 1 alpha) and interleukin 1 receptor antagonist (IL1ra), have been expressed efficiently in a specially designed prokaryotic vector, pGMCE (a pGEM1 derivative), where the target gene forms the second part of a two-cistron system. The first part of the system is a translation enhancer-containing mini-cistron, whose termination codon overlaps the start codon of the target gene. In the case of the IL1 alpha gene, the high expression level is largely due to the direct efficient translation initiation at the second cistron, whereas with the IL1ra gene in the same system, the proximal translation initiation region (TIR) provides a high level of coupled expression of the target gene. Thus, pGMCE is a potentially versatile vector for direct prokaryotic expression.
Gao, Ya; Wang, Shu; Fu, Mingjia; Zhong, Guolin
2013-09-04
The aim was to determine blue-light-induced expression of the S-adenosyl-L-homocysteine hydrolase-like (sahhl) gene in the fungus Mucor amphibiorum RCS1. In a random PCR process, a sequence of 555 bp was obtained from M. amphibiorum RCS1. The 555 bp sequence was labeled with digoxin to prepare the probe for northern hybridization. By northern hybridization, the transcription of the sahhl gene was analyzed in M. amphibiorum RCS1 mycelia during culture from darkness to blue light to darkness. In parallel, real-time PCR was used to analyze sahhl gene expression. Comparison with the sahh gene sequences from Homo sapiens, Mus musculus and some fungal species confirmed a high homology of the 555 bp sequence. This provides preliminary support that the 555 bp sequence is the sahhl gene from M. amphibiorum RCS1. After 24 h of dark pre-culture, large amounts of sahhl transcript were detected in the mycelia by northern hybridization and real-time PCR following 24 h of blue light. However, after 48 h of dark pre-culture, large amounts of sahhl transcript were not detected, even though the M. amphibiorum RCS1 mycelia were induced by blue light. Blue light can induce expression of the sahhl gene in vigorously growing M. amphibiorum RCS1 mycelia.
Large-Scale Analysis of Network Bistability for Human Cancers
Shiraishi, Tetsuya; Matsuyama, Shinako; Kitano, Hiroaki
2010-01-01
Protein–protein interaction and gene regulatory networks are likely to be locked in a state corresponding to a disease by the behavior of one or more bistable circuits exhibiting switch-like behavior. Sets of genes could be over-expressed or repressed when anomalies due to disease appear, and the circuits responsible for this over- or under-expression might persist for as long as the disease state continues. This paper shows how a large-scale analysis of network bistability for various human cancers can identify genes that can potentially serve as drug targets or diagnosis biomarkers. PMID:20628618
Cui, Peng; Zhong, Tingyan; Wang, Zhuo; Wang, Tao; Zhao, Hongyu; Liu, Chenglin; Lu, Hui
2018-06-01
Circadian genes are expressed periodically with a period of approximately 24 h, and the identification and study of these genes can provide a deep understanding of circadian control, which plays significant roles in human health. Although many circadian gene identification algorithms have been developed, large numbers of false positives and low coverage are still major problems in this field. In this study we constructed a novel computational framework for circadian gene identification using deep neural networks (DNN), a deep learning algorithm which can represent the raw form of data patterns without imposing assumptions on the expression distribution. Firstly, we transformed time-course gene expression data into categorical-state data to denote the changing trend of gene expression. Two distinct expression patterns emerged after clustering of the state data for circadian genes from our manually created learning dataset. DNN was then applied to discriminate the aperiodic genes and the two subtypes of periodic genes. In order to assess the performance of DNN, four commonly used machine learning methods including k-nearest neighbors, logistic regression, naïve Bayes, and support vector machines were used for comparison. The results show that the DNN model achieves the best balanced precision and recall. Next, we conducted large scale circadian gene detection using the trained DNN model for the remaining transcription profiles. Compared with JTK_CYCLE and a study performed by Möller-Levet et al. (doi: https://doi.org/10.1073/pnas.1217154110), we identified 1132 novel periodic genes. Through the functional analysis of these novel circadian genes, we found that the GTPase superfamily exhibits distinct circadian expression patterns and may provide a molecular switch of circadian control of the functioning of the immune system in human blood.
Our study provides novel insights into both the circadian gene identification field and the study of complex circadian-driven biological control. This article is part of a Special Issue entitled: Accelerating Precision Medicine through Genetic and Genomic Big Data Analysis edited by Yudong Cai & Tao Huang. Copyright © 2017. Published by Elsevier B.V.
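The circadian abstract mentions transforming time-course expression values into categorical states that denote the changing trend. The abstract does not give the encoding, so the following is only a minimal sketch of one plausible scheme: each consecutive pair of time points is labeled up, down, or flat. The function name, the three-state alphabet, and the tolerance value are all assumptions made for illustration.

```python
def to_trend_states(expression, tolerance=0.1):
    """Encode a time course as one letter per step:
    'U' (up), 'D' (down) or 'F' (flat, change within the tolerance)."""
    states = []
    for previous, current in zip(expression, expression[1:]):
        change = current - previous
        if change > tolerance:
            states.append("U")
        elif change < -tolerance:
            states.append("D")
        else:
            states.append("F")
    return "".join(states)


# Hypothetical expression profiles sampled at equal intervals.
periodic = [1.0, 2.0, 3.0, 2.0, 1.0, 2.0, 3.0]
aperiodic = [1.0, 1.05, 1.0, 0.95, 1.0, 1.02, 1.0]
print(to_trend_states(periodic))   # rising and falling runs alternate
print(to_trend_states(aperiodic))  # every step stays within the tolerance
```

A state string of this kind discards absolute expression levels and keeps only the direction of change, which is one way a classifier can avoid assumptions about the expression distribution.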
Parallel human genome analysis: microarray-based expression monitoring of 1000 genes.
Schena, M; Shalon, D; Heller, R; Chai, A; Brown, P O; Davis, R W
1996-01-01
Microarrays containing 1046 human cDNAs of unknown sequence were printed on glass with high-speed robotics. These 1.0-cm² DNA "chips" were used to quantitatively monitor differential expression of the cognate human genes using a highly sensitive two-color hybridization assay. Array elements that displayed differential expression patterns under given experimental conditions were characterized by sequencing. The identification of known and novel heat shock and phorbol ester-regulated genes in human T cells demonstrates the sensitivity of the assay. Parallel gene analysis with microarrays provides a rapid and efficient method for large-scale human gene discovery. PMID:8855227
Huang, Yu-Juan; Zhou, Zai-wei; Xu, Miao; Ma, Qing-wen; Yan, Jing-bin; Wang, Jian-yi; Zhang, Quo-qin; Huang, Min; Bao, Liming
2015-03-01
Vasovagal syncope (VVS) causes accidental harm in susceptible patients. However, the pathophysiology of this disorder remains largely unknown. In an effort to understand the molecular mechanism of VVS, genome-wide gene expression profiling analyses were performed on VVS patients in the syncope state. A total of 66 Type 1 VVS child patients and the same number of healthy controls were enrolled in this study. Peripheral blood RNAs were isolated from all subjects, of which 10 RNA samples were randomly selected from each group for gene expression profile analysis using Gene ST 1.0 arrays (Affymetrix). The results revealed that 103 genes were differentially expressed between the patients and controls. Significantly, two G-protein-related genes, GPR174 and GNG2, which have not previously been related to VVS, were among the differentially expressed genes. The microarray results were confirmed by qRT-PCR in all the tested individuals. Ingenuity pathway analysis and gene ontology annotation showed that the differentially expressed genes are associated with stress response and apoptosis, suggesting that altered expression of some genes, including G-protein-related genes, is associated with VVS. This study provides new insight into the molecular mechanism of VVS and would be helpful to further identify new molecular biomarkers for the disease.
Wang, Tianyu; Nabavi, Sheida
2018-04-24
Differential gene expression analysis is one of the significant efforts in single cell RNA sequencing (scRNAseq) analysis to discover the specific changes in expression levels of individual cell types. Since scRNAseq exhibits multimodality, large amounts of zero counts, and sparsity, it is different from the traditional bulk RNA sequencing (RNAseq) data. The new challenges of scRNAseq data promote the development of new methods for identifying differentially expressed (DE) genes. In this study, we proposed a new method, SigEMD, that combines a data imputation approach, a logistic regression model and a nonparametric method based on the Earth Mover's Distance, to precisely and efficiently identify DE genes in scRNAseq data. The regression model and data imputation are used to reduce the impact of large amounts of zero counts, and the nonparametric method is used to improve the sensitivity of detecting DE genes from multimodal scRNAseq data. By additionally employing gene interaction network information to adjust the final states of DE genes, we further reduce the false positives of calling DE genes. We used simulated datasets and real datasets to evaluate the detection accuracy of the proposed method and to compare its performance with those of other differential expression analysis methods. Results indicate that the proposed method has an overall powerful performance in terms of precision in detection, sensitivity, and specificity. Copyright © 2018 Elsevier Inc. All rights reserved.
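The SigEMD abstract relies on the Earth Mover's Distance (EMD) as its nonparametric measure. As a rough illustration of that distance only, and not of the SigEMD pipeline (imputation, regression and network adjustment are omitted), the sketch below computes the one-dimensional EMD between two samples as the area between their empirical cumulative distribution functions. The function name and the toy expression values are assumptions.

```python
from bisect import bisect_right


def emd_1d(sample_a, sample_b):
    """One-dimensional Earth Mover's Distance between two samples,
    computed as the integral of |CDF_a - CDF_b| over the value axis."""
    if not sample_a or not sample_b:
        raise ValueError("both samples must be non-empty")
    a = sorted(sample_a)
    b = sorted(sample_b)
    grid = sorted(set(a) | set(b))
    total = 0.0
    for left, right in zip(grid, grid[1:]):
        cdf_a = bisect_right(a, left) / len(a)
        cdf_b = bisect_right(b, left) / len(b)
        total += abs(cdf_a - cdf_b) * (right - left)
    return total


# Hypothetical expression of one gene in two cell groups.
# Group 1 is bimodal; group 2 is unimodal with the same mean.
group_1 = [0, 0, 2, 2]
group_2 = [1, 1, 1, 1]
print(emd_1d(group_1, group_2))
```

The two toy groups have identical means, so a mean-based test would see no difference, while the EMD is non-zero because the shapes of the distributions differ. This is the property that makes a distribution-level distance attractive for multimodal single-cell data.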
Zhao, Dejian; Lin, Mingyan; Pedrosa, Erika; Lachman, Herbert M; Zheng, Deyou
2017-11-10
Monoallelic expression of autosomal genes has been implicated in human psychiatric disorders. However, there is a paucity of allelic expression studies in human brain cells at the single cell and genome wide levels. In this report, we reanalyzed a previously published single-cell RNA-seq dataset from several postmortem human brains and observed pervasive monoallelic expression in individual cells, largely in a random manner. Examining single nucleotide variants with a predicted functional disruption, we found that the "damaged" alleles were overall expressed in fewer brain cells than their counterparts, and at a lower level in cells where their expression was detected. We also identified many brain cell type-specific monoallelically expressed genes. Interestingly, many of these cell type-specific monoallelically expressed genes were enriched for functions important for those brain cell types. In addition, function analysis showed that genes displaying monoallelic expression and correlated expression across neuronal cells from different individual brains were implicated in the regulation of synaptic function. Our findings suggest that monoallelic gene expression is prevalent in human brain cells, which may play a role in generating cellular identity and neuronal diversity and thus increasing the complexity and diversity of brain cell functions.
Gradia, Scott D; Ishida, Justin P; Tsai, Miaw-Sheue; Jeans, Chris; Tainer, John A; Fuss, Jill O
2017-01-01
Recombinant expression of large, multiprotein complexes is essential and often rate limiting for determining structural, biophysical, and biochemical properties of DNA repair, replication, transcription, and other key cellular processes. Baculovirus-infected insect cell expression systems are especially well suited for producing large, human proteins recombinantly, and multigene baculovirus systems have facilitated studies of multiprotein complexes. In this chapter, we describe a multigene baculovirus system called MacroBac that uses a Biobricks-type assembly method based on restriction and ligation (Series 11) or ligation-independent cloning (Series 438). MacroBac cloning and assembly is efficient and equally well suited for either single subcloning reactions or high-throughput cloning using 96-well plates and liquid handling robotics. MacroBac vectors are polypromoter with each gene flanked by a strong polyhedrin promoter and an SV40 poly(A) termination signal that minimize gene order expression level effects seen in many polycistronic assemblies. Large assemblies are robustly achievable, and we have successfully assembled as many as 10 genes into a single MacroBac vector. Importantly, we have observed significant increases in expression levels and quality of large, multiprotein complexes using a single, multigene, polypromoter virus rather than coinfection with multiple, single-gene viruses. Given the importance of characterizing functional complexes, we believe that MacroBac provides a critical enabling technology that may change the way that structural, biophysical, and biochemical research is done. © 2017 Elsevier Inc. All rights reserved.
2011-01-01
Background Abiotic stresses, such as water deficit and soil salinity, result in changes in physiology, nutrient use, and vegetative growth in vines, and ultimately, yield and flavor in berries of wine grape, Vitis vinifera L. Large-scale expressed sequence tags (ESTs) were generated, curated, and analyzed to identify major genetic determinants responsible for stress-adaptive responses. Although roots serve as the first site of perception and/or injury for many types of abiotic stress, EST sequencing in root tissues of wine grape exposed to abiotic stresses has been extremely limited to date. To overcome this limitation, large-scale EST sequencing was conducted from root tissues exposed to multiple abiotic stresses. Results A total of 62,236 expressed sequence tags (ESTs) were generated from leaf, berry, and root tissues from vines subjected to abiotic stresses and compared with 32,286 ESTs sequenced from 20 public cDNA libraries. Curation to correct annotation errors, clustering and assembly of the berry and leaf ESTs with currently available V. vinifera full-length transcripts and ESTs yielded a total of 13,278 unique sequences, with 2302 singletons and 10,976 mapped to V. vinifera gene models. Of these, 739 transcripts were found to have significant differential expression in stressed leaves and berries including 250 genes not described previously as being abiotic stress responsive. In a second analysis of 16,452 ESTs from a normalized root cDNA library derived from roots exposed to multiple, short-term, abiotic stresses, 135 genes with root-enriched expression patterns were identified on the basis of their relative EST abundance in roots relative to other tissues. 
Conclusions The large-scale analysis of relative EST frequency counts among a diverse collection of 23 different cDNA libraries from leaf, berry, and root tissues of wine grape exposed to a variety of abiotic stress conditions revealed distinct, tissue-specific expression patterns, previously unrecognized stress-induced genes, and many novel genes with root-enriched mRNA expression for improving our understanding of root biology and manipulation of rootstock traits in wine grape. mRNA abundance estimates based on EST library-enriched expression patterns showed only modest correlations between microarray and quantitative, real-time reverse transcription-polymerase chain reaction (qRT-PCR) methods highlighting the need for deep-sequencing expression profiling methods. PMID:21592389
Zhou, Weichen; Ma, Yanyun; Zhang, Jun; Hu, Jingyi; Zhang, Menghan; Wang, Yi; Li, Yi; Wu, Lijun; Pan, Yida; Zhang, Yitong; Zhang, Xiaonan; Zhang, Xinxin; Zhang, Zhanqing; Zhang, Jiming; Li, Hai; Lu, Lungen; Jin, Li; Wang, Jiucun; Yuan, Zhenghong; Liu, Jie
2017-11-01
Liver biopsy is the gold standard to assess pathological features (e.g., inflammation grades) for hepatitis B virus-infected patients, although it is invasive and traumatic; meanwhile, several gene profiles of chronic hepatitis B (CHB) have been separately described in relatively small hepatitis B virus (HBV)-infected samples. We aimed to analyse correlations among inflammation grades, gene expressions and clinical parameters (serum alanine amino transaminase, aspartate amino transaminase and HBV-DNA) in large-scale CHB samples and to predict inflammation grades by using clinical parameters and/or gene expressions. We analysed gene expressions with three clinical parameters in 122 CHB samples by an improved regression model. Principal component analysis and machine-learning methods including Random Forest, K-nearest neighbour and support vector machine were used for analysis and to build diagnosis models. Six normal samples were used to validate the predictive model. Significant genes related to clinical parameters were found to be enriched in the immune system, interferon-stimulated, regulation of cytokine production and anti-apoptosis categories, among others. A panel of these genes with clinical parameters can effectively predict binary classifications of inflammation grade (area under the ROC curve [AUC]: 0.88, 95% confidence interval [CI]: 0.77-0.93), validated by normal samples. A panel with only clinical parameters was also valuable (AUC: 0.78, 95% CI: 0.65-0.86), indicating that a liquid biopsy method for detecting the pathology of CHB is possible. This is the first study to systematically elucidate the relationships among gene expressions, clinical parameters and pathological inflammation grades in CHB, and to build models predicting inflammation grades by gene expressions and/or clinical parameters as well. © 2017 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.
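The CHB abstract summarizes classifier performance as an area under the ROC curve (AUC). The abstract does not say how the AUC was computed; the sketch below uses the standard rank interpretation, in which the AUC is the probability that a randomly chosen positive sample receives a higher score than a randomly chosen negative one, with ties counted as one half. The function name, labels and scores are hypothetical.

```python
def roc_auc(labels, scores):
    """AUC as the fraction of (positive, negative) pairs that the
    scores rank correctly; tied pairs count as half a correct pair."""
    positives = [s for y, s in zip(labels, scores) if y == 1]
    negatives = [s for y, s in zip(labels, scores) if y == 0]
    if not positives or not negatives:
        raise ValueError("need at least one positive and one negative")
    correct = 0.0
    for p in positives:
        for n in negatives:
            if p > n:
                correct += 1.0
            elif p == n:
                correct += 0.5
    return correct / (len(positives) * len(negatives))


# Hypothetical binary inflammation grades (1 = high, 0 = low)
# and predicted scores from some model.
toy_labels = [1, 1, 1, 0, 0, 0]
toy_scores = [0.9, 0.8, 0.4, 0.7, 0.3, 0.2]
print(round(roc_auc(toy_labels, toy_scores), 3))
```

In the toy data, eight of the nine positive-negative pairs are ordered correctly, so the AUC is 8/9. An AUC of 0.5 corresponds to random ranking and 1.0 to perfect separation.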
Durrani, Zeeshan; Pillai, Sreerekha S.; Baird, Margaret; Shiels, Brian R.
2013-01-01
Theileria annulata, an intracellular parasite of bovine lymphoid cells, induces substantial phenotypic alterations to its host cell including continuous proliferation, cytoskeletal changes and resistance to apoptosis. While parasite induced modulation of host cell signal transduction pathways and NFκB activation are established, there remains considerable speculation on the complexities of the parasite directed control mechanisms that govern these radical changes to the host cell. Our objectives in this study were to provide a comprehensive analysis of the global changes to host cell gene expression with emphasis on those that result from direct intervention by the parasite. By using comparative microarray analysis of an uninfected bovine cell line and its Theileria infected counterpart, in conjunction with use of the specific parasitacidal agent, buparvaquone, we have identified a large number of host cell gene expression changes that result from parasite infection. Our results indicate that the viable parasite can irreversibly modify the transformed phenotype of a bovine cell line. Fifty percent of genes with altered expression failed to show a reversible response to parasite death, a possible contributing factor to initiation of host cell apoptosis. The genes that did show an early predicted response to loss of parasite viability highlighted a sub-group of genes that are likely to be under direct control by parasite infection. Network and pathway analysis demonstrated that this sub-group is significantly enriched for genes involved in regulation of chromatin modification and gene expression. The results provide evidence that the Theileria parasite has the regulatory capacity to generate widespread change to host cell gene expression in a complex and largely irreversible manner. PMID:23840536
Genome-Wide Identification, Evolution and Expression Analysis of mTERF Gene Family in Maize
Zhao, Yanxin; Cai, Manjun; Zhang, Xiaobo; Li, Yurong; Zhang, Jianhua; Zhao, Hailiang; Kong, Fei; Zheng, Yonglian; Qiu, Fazhan
2014-01-01
Plant mitochondrial transcription termination factor (mTERF) genes comprise a large family with important roles in regulating organelle gene expression. In this study, a comprehensive database search yielded 31 potential mTERF genes in maize (Zea mays L.) and most of them were targeted to mitochondria or chloroplasts. Maize mTERF were divided into nine main groups based on phylogenetic analysis, and group IX represented the mitochondria and species-specific clade that diverged from other groups. Tandem and segmental duplication both contributed to the expansion of the mTERF gene family in the maize genome. Comprehensive expression analysis of these genes, using microarray data and RNA-seq data, revealed that these genes exhibit a variety of expression patterns. Environmental stimulus experiments revealed differential up- or down-regulation of maize mTERF genes in seedlings exposed to light/dark, salts and plant hormones, respectively, suggesting various important roles of maize mTERF genes in light acclimation and stress-related responses. These results will be useful for elucidating the roles of mTERF genes in the growth, development and stress response of maize. PMID:24718683
Máximo, Wesley P. F.; Zanetti, Ronald; Paiva, Luciano V.
2018-01-01
Although several ant species are important targets for the development of molecular control strategies, only a few studies focus on identifying and validating reference genes for quantitative reverse transcription polymerase chain reaction (RT-qPCR) data normalization. We provide here an extensive study to identify and validate suitable reference genes for gene expression analysis in the ant Atta sexdens, a threatening agricultural pest in South America. The optimal number of reference genes varies according to each sample, and the results generated by RefFinder differed as to which reference gene is the most suitable. Results suggest that the RPS16, NADH and SDHB genes were the best reference genes in the sample pool according to stability values. The SNF7 gene expression pattern was stable in all evaluated sample sets. In contrast, when using less stable reference genes for normalization, a large variability in SNF7 gene expression was recorded. There is no universal reference gene suitable for all conditions under analysis, since these genes can also participate in different cellular functions, thus requiring a systematic validation of possible reference genes for each specific condition. The effect of reference gene choice on SNF7 normalization confirmed that unstable reference genes might drastically change the expression profile analysis of target candidate genes.
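The Atta sexdens abstract reports that an unstable reference gene can distort the apparent expression of a target gene. The abstract does not state the quantification formula, so the sketch below assumes the widely used 2^-ΔCt approach with roughly 100% amplification efficiency and with several reference genes combined by averaging their Ct values. All Ct values, tuple layouts and function names are hypothetical.

```python
from statistics import mean


def relative_expression(target_ct, reference_cts):
    """Relative quantity 2 ** -(Ct_target - mean Ct_reference).
    Assumes about 100% PCR efficiency (one doubling per cycle)."""
    delta_ct = target_ct - mean(reference_cts)
    return 2 ** (-delta_ct)


def fold_change(treated, control):
    """Ratio of normalized expression, treated over control.
    Each argument is a (target_ct, reference_cts) pair."""
    return relative_expression(*treated) / relative_expression(*control)


# Hypothetical data: the target gene has the same Ct (24.0) in both conditions.
stable = fold_change(treated=(24.0, [20.0, 20.0]), control=(24.0, [20.0, 20.0]))
# Same target data, but normalized to a reference whose Ct drifts by 2 cycles.
unstable = fold_change(treated=(24.0, [22.0]), control=(24.0, [20.0]))
print(stable, unstable)
```

With the stable references the target shows no change, while the drifting reference alone produces an apparent four-fold change in a target whose raw Ct never moved. This is the kind of artifact that systematic reference gene validation is meant to prevent.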
Tran, Frances; Penniket, Carolyn; Patel, Rohan V; Provart, Nicholas J; Laroche, André; Rowland, Owen; Robert, Laurian S
2013-06-01
Despite their importance, there remains a paucity of large-scale gene expression-based studies of reproductive development in species belonging to the Triticeae. As a first step to address this deficiency, a gene expression atlas of triticale reproductive development was generated using the 55K Affymetrix GeneChip(®) wheat genome array. The global transcriptional profiles of the anther/pollen, ovary and stigma were analyzed at concurrent developmental stages, and co-expressed as well as preferentially expressed genes were identified. Data analysis revealed both novel and conserved regulatory factors underlying Triticeae floral development and function. This comprehensive resource rests upon detailed gene annotations, and the expression profiles are readily accessible via a web browser. © 2013 Her Majesty the Queen in Right of Canada as represented by the Minister of Agriculture and Agri-Food Canada.
HOX genes in human lung: altered expression in primary pulmonary hypertension and emphysema.
Golpon, H A; Geraci, M W; Moore, M D; Miller, H L; Miller, G J; Tuder, R M; Voelkel, N F
2001-03-01
HOX genes belong to the large family of homeodomain genes that function as transcription factors. Animal studies indicate that they play an essential role in lung development. We investigated the expression pattern of HOX genes in human lung tissue by using microarray and degenerate reverse transcriptase-polymerase chain reaction survey techniques. HOX genes predominantly from the 3' end of clusters A and B were expressed in normal human adult lung and among them HOXA5 was the most abundant, followed by HOXB2 and HOXB6. In fetal (12 weeks old) and diseased lung specimens (emphysema, primary pulmonary hypertension) additional HOX genes from clusters C and D were expressed. Using in situ hybridization, transcripts for HOXA5 were predominantly found in alveolar septal and epithelial cells, both in normal and diseased lungs. A 2.5-fold increase in HOXA5 mRNA expression was demonstrated by quantitative reverse transcriptase-polymerase chain reaction in primary pulmonary hypertension lung specimens when compared to normal lung tissue. In conclusion, we demonstrate that HOX genes are selectively expressed in the human lung. Differences in the pattern of HOX gene expression exist among fetal, adult, and diseased lung specimens. The altered pattern of HOX gene expression may contribute to the development of pulmonary diseases.
Otoupal, Peter B; Erickson, Keesha E; Escalas-Bordoy, Antoni; Chatterjee, Anushree
2017-01-20
The evolution of antibiotic resistance has engendered an impending global health crisis that necessitates a greater understanding of how resistance emerges. The impact of nongenetic factors and how they influence the evolution of resistance is a largely unexplored area of research. Here we present a novel application of CRISPR-Cas9 technology for investigating how gene expression governs the adaptive pathways available to bacteria during the evolution of resistance. We examine the impact of gene expression changes on bacterial adaptation by constructing a library of deactivated CRISPR-Cas9 synthetic devices to tune the expression of a set of stress-response genes in Escherichia coli. We show that artificially inducing perturbations in gene expression imparts significant synthetic control over fitness and growth during stress exposure. We present evidence that these impacts are reversible; strains with synthetically perturbed gene expression regained wild-type growth phenotypes upon stress removal, while maintaining divergent growth characteristics under stress. Furthermore, we demonstrate a prevailing trend toward negative epistatic interactions when multiple gene perturbations are combined simultaneously, thereby posing an intrinsic constraint on gene expression underlying adaptive trajectories. Together, these results emphasize how CRISPR-Cas9 can be employed to engineer gene expression changes that shape bacterial adaptation, and present a novel approach to synthetically control the evolution of antimicrobial resistance.
Gene expression inference with deep learning.
Chen, Yifei; Li, Yi; Narayan, Rajiv; Subramanian, Aravind; Xie, Xiaohui
2016-06-15
Large-scale gene expression profiling has been widely used to characterize cellular states in response to various disease conditions, genetic perturbations, etc. Although the cost of whole-genome expression profiles has been dropping steadily, generating a compendium of expression profiling over thousands of samples is still very expensive. Recognizing that gene expressions are often highly correlated, researchers from the NIH LINCS program have developed a cost-effective strategy of profiling only ∼1000 carefully selected landmark genes and relying on computational methods to infer the expression of remaining target genes. However, the computational approach adopted by the LINCS program is currently based on linear regression (LR), limiting its accuracy since it does not capture complex nonlinear relationship between expressions of genes. We present a deep learning method (abbreviated as D-GEX) to infer the expression of target genes from the expression of landmark genes. We used the microarray-based Gene Expression Omnibus dataset, consisting of 111K expression profiles, to train our model and compare its performance to those from other methods. In terms of mean absolute error averaged across all genes, deep learning significantly outperforms LR with 15.33% relative improvement. A gene-wise comparative analysis shows that deep learning achieves lower error than LR in 99.97% of the target genes. We also tested the performance of our learned model on an independent RNA-Seq-based GTEx dataset, which consists of 2921 expression profiles. Deep learning still outperforms LR with 6.57% relative improvement, and achieves lower error in 81.31% of the target genes. D-GEX is available at https://github.com/uci-cbcl/D-GEX. Contact: xhx@ics.uci.edu. Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
Gene expression inference with deep learning
Chen, Yifei; Li, Yi; Narayan, Rajiv; Subramanian, Aravind; Xie, Xiaohui
2016-01-01
Motivation: Large-scale gene expression profiling has been widely used to characterize cellular states in response to various disease conditions, genetic perturbations, etc. Although the cost of whole-genome expression profiles has been dropping steadily, generating a compendium of expression profiling over thousands of samples is still very expensive. Recognizing that gene expressions are often highly correlated, researchers from the NIH LINCS program have developed a cost-effective strategy of profiling only ∼1000 carefully selected landmark genes and relying on computational methods to infer the expression of remaining target genes. However, the computational approach adopted by the LINCS program is currently based on linear regression (LR), limiting its accuracy since it does not capture complex nonlinear relationship between expressions of genes. Results: We present a deep learning method (abbreviated as D-GEX) to infer the expression of target genes from the expression of landmark genes. We used the microarray-based Gene Expression Omnibus dataset, consisting of 111K expression profiles, to train our model and compare its performance to those from other methods. In terms of mean absolute error averaged across all genes, deep learning significantly outperforms LR with 15.33% relative improvement. A gene-wise comparative analysis shows that deep learning achieves lower error than LR in 99.97% of the target genes. We also tested the performance of our learned model on an independent RNA-Seq-based GTEx dataset, which consists of 2921 expression profiles. Deep learning still outperforms LR with 6.57% relative improvement, and achieves lower error in 81.31% of the target genes. Availability and implementation: D-GEX is available at https://github.com/uci-cbcl/D-GEX. Contact: xhx@ics.uci.edu Supplementary information: Supplementary data are available at Bioinformatics online. PMID:26873929
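The D-GEX abstract compares methods by mean absolute error (MAE) and reports a relative improvement over linear regression. The sketch below shows these two quantities on toy numbers. It assumes that relative improvement means the reduction in error divided by the baseline error, which the abstract does not spell out, and it does not reproduce D-GEX or the LINCS regression. All values and names are hypothetical.

```python
def mean_absolute_error(truth, predicted):
    """Average of |truth - prediction| over all paired values."""
    if not truth or len(truth) != len(predicted):
        raise ValueError("inputs must be non-empty and of equal length")
    return sum(abs(t - p) for t, p in zip(truth, predicted)) / len(truth)


def relative_improvement(baseline_error, new_error):
    """Percentage by which new_error is lower than baseline_error
    (assumed definition: (baseline - new) / baseline * 100)."""
    return (baseline_error - new_error) / baseline_error * 100.0


# Hypothetical measured expression of one target gene in four samples,
# with predictions from a baseline model and from an alternative model.
measured = [1.0, 2.0, 3.0, 4.0]
baseline_pred = [1.5, 2.5, 2.5, 4.5]
alternative_pred = [1.25, 2.25, 2.75, 4.25]

baseline_mae = mean_absolute_error(measured, baseline_pred)
alternative_mae = mean_absolute_error(measured, alternative_pred)
print(baseline_mae, alternative_mae,
      relative_improvement(baseline_mae, alternative_mae))
```

In a gene-wise comparison of the kind the abstract describes, the same MAE would be computed separately for every target gene and the fraction of genes with lower error under the new model would be reported alongside the averaged figure.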
Jaeger, Emma; Leedham, Simon; Lewis, Annabelle; Segditsas, Stefania; Becker, Martin; Cuadrado, Pedro Rodenas; Davis, Hayley; Kaur, Kulvinder; Heinimann, Karl; Howarth, Kimberley; East, James; Taylor, Jenny; Thomas, Huw; Tomlinson, Ian
2012-05-06
Hereditary mixed polyposis syndrome (HMPS) is characterized by apparent autosomal dominant inheritance of multiple types of colorectal polyp, with colorectal carcinoma occurring in a high proportion of affected individuals. Here, we use genetic mapping, copy-number analysis, exclusion of mutations by high-throughput sequencing, gene expression analysis and functional assays to show that HMPS is caused by a duplication spanning the 3' end of the SCG5 gene and a region upstream of the GREM1 locus. This unusual mutation is associated with increased allele-specific GREM1 expression. Whereas GREM1 is expressed in intestinal subepithelial myofibroblasts in controls, GREM1 is predominantly expressed in the epithelium of the large bowel in individuals with HMPS. The HMPS duplication contains predicted enhancer elements; some of these interact with the GREM1 promoter and can drive gene expression in vitro. Increased GREM1 expression is predicted to cause reduced bone morphogenetic protein (BMP) pathway activity, a mechanism that also underlies tumorigenesis in juvenile polyposis of the large bowel.
A microarray analysis of potential genes underlying the neurosensitivity of mice to propofol.
Lowes, Damon A; Galley, Helen F; Lowe, Peter R; Rikke, Brad A; Johnson, Thomas E; Webster, Nigel R
2005-09-01
Establishing the mechanism of action of general anesthetics at the molecular level is difficult because of the multiple targets with which these drugs are associated. Inbred short sleep (ISS) and long sleep (ILS) mice are differentially sensitive in response to ethanol and other sedative hypnotics and contain a single quantitative trait locus (Lorp1) that accounts for the genetic variance of loss-of-righting reflex in response to propofol (LORP). In this study, we used high-density oligonucleotide microarrays to identify global gene expression and candidate genes differentially expressed within the Lorp1 region that may give insight into the molecular mechanism underlying LORP. Microarray analysis was performed using Affymetrix MG-U74Av2 Genechips and a selection of differentially expressed genes was confirmed by semiquantitative reverse transcription-polymerase chain reaction. Global expression in the brains of ILS and ISS mice revealed 3423 genes that were significantly expressed, of which 139 (4%) were differentially expressed. Analysis of genes located within the Lorp1 region showed that 26 genes were significantly expressed and that just 2 genes (7%) were differentially expressed. These genes encode the proteins AWP1 (associated with protein kinase 1) and "BTB (POZ) domain containing 1," whose functions are largely uncharacterized. Genes differentially expressed outside Lorp1 included seven genes with previously characterized neuronal functions, which thus stand out as additional candidate genes that may be involved in mediating the neurosensitivity differences between ISS and ILS.
Ando, Tatsuya; Suguro, Miyuki; Hanai, Taizo; Kobayashi, Takeshi; Seto, Masao
2002-01-01
Diffuse large B‐cell lymphoma (DLBCL) is the largest category of aggressive lymphomas. Less than 50% of patients can be cured by combination chemotherapy. Microarray technologies have recently shown that the response to chemotherapy reflects the molecular heterogeneity in DLBCL. On the basis of published microarray data, we attempted to develop a long‐overdue method for the precise and simple prediction of survival of DLBCL patients. We developed a fuzzy neural network (FNN) model to analyze gene expression profiling data for DLBCL. From data on 5857 genes, this model identified four genes (CD10, AA807551, AA805611 and IRF‐4) that could be used to predict prognosis with 93% accuracy. FNNs are powerful tools for extracting significant biological markers affecting prognosis, and are applicable to various kinds of expression profiling data for any malignancy. PMID:12460461
Zhong, Jinshun; Kellogg, Elizabeth A
2015-08-01
• CYCLOIDEA2 (CYC2)-like and RADIALIS (RAD)-like genes are needed for the normal development of corolla bilateral symmetry in Antirrhinum majus L. (snapdragon, Plantaginaceae, Lamiales). However, if and how changes in expression of CYC2-like and RAD-like genes correlate with the origin of corolla bilateral symmetry early in Lamiales remains largely unknown. The asymmetrical expression of CYC2-like and/or RAD-like genes during floral meristem development could be ancestral or derived in Plantaginaceae. • We used in situ RNA localization to examine the expression of CYC2-like and RAD-like genes in two early-diverging Lamiales. • CYC2-like and RAD-like genes are expressed broadly in the floral meristems in early-diverging Lamiales with radially symmetrical corollas, in contrast to their restricted expression in adaxial/lateral regions in core Lamiales. The expression pattern of CYC2-like genes has evolved in stepwise fashion, in that CYC2-like genes are likely expressed briefly in the floral meristem during flower development in sampled Oleaceae; prolonged expression of CYC2-like genes in petals originated in the common ancestor of Tetrachondraceae and core Lamiales, and asymmetrical expression in adaxial/lateral petals appeared later, in the common ancestor of the core Lamiales. Likewise, expression of RAD-like genes in petals appeared in early-diverging Lamiales or earlier; asymmetrical expression in adaxial/lateral petals then appeared in core Lamiales. • These data plus published reports of CYC2-like and RAD-like genes show that asymmetrical expression of these two genes is likely derived and correlates with the origins of corolla bilateral symmetry. © 2015 Botanical Society of America, Inc.
Chapman, Joanne R; Waldenström, Jonas
2015-01-01
The choice of reference genes that are stably expressed amongst treatment groups is a crucial step in real-time quantitative PCR gene expression studies. Recent guidelines have specified that a minimum of two validated reference genes should be used for normalisation. However, a quantitative review of the literature showed that the average number of reference genes used across all studies was 1.2. Thus, the vast majority of studies continue to use a single gene, with β-actin (ACTB) and/or glyceraldehyde 3-phosphate dehydrogenase (GAPDH) being commonly selected in studies of vertebrate gene expression. Few studies (15%) tested a panel of potential reference genes for stability of expression before using them to normalise data. Amongst studies specifically testing reference gene stability, few found ACTB or GAPDH to be optimal, whereby these genes were significantly less likely to be chosen when larger panels of potential reference genes were screened. Fewer reference genes were tested for stability in non-model organisms, presumably owing to a dearth of available primers in less well characterised species. Furthermore, the experimental conditions under which real-time quantitative PCR analyses were conducted had a large influence on the choice of reference genes, whereby different studies of rat brain tissue showed different reference genes to be the most stable. These results highlight the importance of validating the choice of normalising reference genes before conducting gene expression studies.
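As a minimal sketch of what normalising against more than one reference gene can look like in practice, the function below uses the common delta-Ct form. Several things here are assumptions rather than content of the review: the amplification efficiency of 2.0 shared by all genes, the Ct values, and the function name. With equal efficiencies, averaging the reference Ct values is equivalent to taking the geometric mean of the reference quantities.

```python
# Hypothetical Ct values and an assumed amplification efficiency of 2.0 for every gene.

def relative_expression(ct_target, ct_references, efficiency=2.0):
    """Target expression relative to the combined reference genes (delta-Ct form)."""
    mean_ref_ct = sum(ct_references) / len(ct_references)
    delta_ct = ct_target - mean_ref_ct
    return efficiency ** (-delta_ct)


# Same target Ct, normalised against two references versus only the first one.
two_refs = relative_expression(25.0, [20.0, 22.0])
one_ref = relative_expression(25.0, [20.0])
```

Here the two-reference result is twice the single-reference result, which shows how strongly the answer depends on the choice of reference genes when they are not equally stable.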
Mapping cis- and trans-regulatory effects across multiple tissues in twins
Grundberg, Elin; Small, Kerrin S.; Hedman, Åsa K.; Nica, Alexandra C.; Buil, Alfonso; Keildson, Sarah; Bell, Jordana T.; Yang, Tsun-Po; Meduri, Eshwar; Barrett, Amy; Nisbett, James; Sekowska, Magdalena; Wilk, Alicja; Shin, So-Youn; Glass, Daniel; Travers, Mary; Min, Josine L.; Ring, Sue; Ho, Karen; Thorleifsson, Gudmar; Kong, Augustine; Thorsteindottir, Unnur; Ainali, Chrysanthi; Dimas, Antigone S.; Hassanali, Neelam; Ingle, Catherine; Knowles, David; Krestyaninova, Maria; Lowe, Christopher E.; Di Meglio, Paola; Montgomery, Stephen B.; Parts, Leopold; Potter, Simon; Surdulescu, Gabriela; Tsaprouni, Loukia; Tsoka, Sophia; Bataille, Veronique; Durbin, Richard; Nestle, Frank O.; O’Rahilly, Stephen; Soranzo, Nicole; Lindgren, Cecilia M.; Zondervan, Krina T.; Ahmadi, Kourosh R.; Schadt, Eric E.; Stefansson, Kari; Smith, George Davey; McCarthy, Mark I.; Deloukas, Panos; Dermitzakis, Emmanouil T.; Spector, Tim D.
2013-01-01
Sequence-based variation in gene expression is a key driver of disease risk. Common variants regulating expression in cis have been mapped in many eQTL studies typically in single tissues from unrelated individuals. Here, we present a comprehensive analysis of gene expression across multiple tissues conducted in a large set of mono- and dizygotic twins that allows systematic dissection of genetic (cis and trans) and non-genetic effects on gene expression. Using identity-by-descent estimates, we show that at least 40% of the total heritable cis-effect on expression cannot be accounted for by common cis-variants, a finding which exposes the contribution of low frequency and rare regulatory variants with respect to both transcriptional regulation and complex trait susceptibility. We show that a substantial proportion of gene expression heritability is trans to the structural gene and identify several replicating trans-variants which act predominantly in a tissue-restricted manner and may regulate the transcription of many genes. PMID:22941192
Reprogramming Microbes for the Remote Detection of Environmental Threats
2013-10-15
Riboswitches consist of an aptamer that recognizes the ligand and an expression platform that couples ligand binding to a change in gene expression. Using in vitro selection, it is possible to screen large (~10^13 member) libraries of RNA sequences to discover new aptamers. However, limitations in...
Knowledge-guided gene prioritization reveals new insights into the mechanisms of chemoresistance.
Emad, Amin; Cairns, Junmei; Kalari, Krishna R; Wang, Liewei; Sinha, Saurabh
2017-08-11
Identification of genes whose basal mRNA expression predicts the sensitivity of tumor cells to cytotoxic treatments can play an important role in individualized cancer medicine. It enables detailed characterization of the mechanism of action of drugs. Furthermore, screening the expression of these genes in the tumor tissue may suggest the best course of chemotherapy or a combination of drugs to overcome drug resistance. We developed a computational method called ProGENI to identify genes most associated with the variation of drug response across different individuals, based on gene expression data. In contrast to existing methods, ProGENI also utilizes prior knowledge of protein-protein and genetic interactions, using random walk techniques. Analysis of two relatively new and large datasets including gene expression data on hundreds of cell lines and their cytotoxic responses to a large compendium of drugs reveals a significant improvement in prediction of drug sensitivity using genes identified by ProGENI compared to other methods. Our siRNA knockdown experiments on ProGENI-identified genes confirmed the role of many new genes in sensitivity to three chemotherapy drugs: cisplatin, docetaxel, and doxorubicin. Based on such experiments and extensive literature survey, we demonstrate that about 73% of our top predicted genes modulate drug response in selected cancer cell lines. In addition, global analysis of genes associated with groups of drugs uncovered pathways of cytotoxic response shared by each group. Our results suggest that knowledge-guided prioritization of genes using ProGENI gives new insight into mechanisms of drug resistance and identifies genes that may be targeted to overcome this phenomenon.
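The abstract mentions random walk techniques over an interaction network. The sketch below shows a generic random walk with restart on a small undirected graph. It is not the ProGENI implementation: the gene names, the network, the restart probability of 0.5, and the function name are hypothetical, chosen only to show how scores spread outward from seed genes.

```python
# Generic random walk with restart (RWR); an illustration, not ProGENI itself.

def random_walk_with_restart(adjacency, seeds, restart=0.5, tol=1e-12, max_iter=10000):
    """Return the stationary visiting probability of each node.

    adjacency: dict mapping node -> list of neighbours (undirected; every node
               is assumed to have at least one neighbour).
    seeds:     nodes the walker jumps back to with probability `restart`.
    """
    nodes = sorted(adjacency)
    start = {n: (1.0 / len(seeds) if n in seeds else 0.0) for n in nodes}
    prob = dict(start)
    for _ in range(max_iter):
        # Restart mass goes back to the seeds.
        nxt = {n: restart * start[n] for n in nodes}
        # The remaining mass moves to a uniformly chosen neighbour.
        for u in nodes:
            share = (1.0 - restart) * prob[u] / len(adjacency[u])
            for v in adjacency[u]:
                nxt[v] += share
        change = sum(abs(nxt[n] - prob[n]) for n in nodes)
        prob = nxt
        if change < tol:
            break
    return prob


# Hypothetical path-shaped network: geneA - geneB - geneC - geneD.
network = {
    "geneA": ["geneB"],
    "geneB": ["geneA", "geneC"],
    "geneC": ["geneB", "geneD"],
    "geneD": ["geneC"],
}
scores = random_walk_with_restart(network, seeds={"geneA"})
```

Genes closer to the seed receive higher scores, so a gene with a modest direct association can still rank highly if it sits next to strongly associated genes in the network.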
A Self-Directed Method for Cell-Type Identification and Separation of Gene Expression Microarrays
Zuckerman, Neta S.; Noam, Yair; Goldsmith, Andrea J.; Lee, Peter P.
2013-01-01
Gene expression analysis is generally performed on heterogeneous tissue samples consisting of multiple cell types. Current methods developed to separate heterogeneous gene expression rely on prior knowledge of the cell-type composition and/or signatures; these are not available in most public datasets. We present a novel method to identify the cell-type composition, signatures and proportions per sample without need for a priori information. The method was successfully tested on controlled and semi-controlled datasets and performed as accurately as current methods that do require additional information. As such, this method enables the analysis of cell-type specific gene expression using existing large pools of publicly available microarray datasets. PMID:23990767
Inducible repression of multiple expansin genes leads to growth suppression during leaf development.
Goh, Hoe-Han; Sloan, Jennifer; Dorca-Fornell, Carmen; Fleming, Andrew
2012-08-01
Expansins are cell wall proteins implicated in the control of plant growth via loosening of the extracellular matrix. They are encoded by a large gene family, and data linked to loss of single gene function to support a role of expansins in leaf growth remain limited. Here, we provide a quantitative growth analysis of transgenics containing an inducible artificial microRNA construct designed to down-regulate the expression of a number of expansin genes that an expression analysis indicated are expressed during the development of Arabidopsis (Arabidopsis thaliana) leaf 6. The results support the hypothesis that expansins are required for leaf growth and show that decreased expansin gene expression leads to a more marked repression of growth during the later stage of leaf development. In addition, a histological analysis of leaves in which expansin gene expression was suppressed indicates that, despite smaller leaves, mean cell size was increased. These data provide functional evidence for a role of expansins in leaf growth, indicate the importance of tissue/organ developmental context for the outcome of altered expansin gene expression, and highlight the separation of the outcome of expansin gene expression at the cellular and organ levels.
Sémon, Marie; Mouchiroud, Dominique; Duret, Laurent
2005-02-01
Mammalian chromosomes are characterized by large-scale variations of DNA base composition (the so-called isochores). In contradiction with previous studies, Lercher et al. (Hum. Mol. Genet., 12, 2411, 2003) recently reported a strong correlation between gene expression breadth and GC-content, suggesting that there might be a selective pressure favoring the concentration of housekeeping genes in GC-rich isochores. We reassessed this issue by examining in human and mouse the correlation between gene expression and GC-content, using different measures of gene expression (EST, SAGE and microarray) and different measures of GC-content. We show that correlations between GC-content and expression are very weak, and may vary according to the method used to measure expression. Such weak correlations have a very low predictive value. The strong correlations reported by Lercher et al. (2003) are because of the fact that they measured variables over neighboring genes windows. We show here that using gene windows artificially enhances the correlation. The assertion that the expression of a given gene depends on the GC-content of the region where it is located is therefore not supported by the data.
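The claim that averaging over windows of neighboring genes inflates a correlation can be illustrated with a small simulation. This is a sketch under stated assumptions, not the authors' analysis: it assumes each window shares a regional component that influences both GC-content and expression, with much larger gene-level noise on top, and all parameter values are arbitrary.

```python
import math
import random

# Simulated data only; window size, noise levels and seed are arbitrary assumptions.

def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


def simulate(n_windows=300, window_size=10, seed=0):
    """Genes in the same window share a regional signal plus large individual noise."""
    rng = random.Random(seed)
    gc, expr = [], []
    for _ in range(n_windows):
        regional = rng.gauss(0.0, 1.0)
        for _ in range(window_size):
            gc.append(regional + rng.gauss(0.0, 2.0))
            expr.append(0.5 * regional + rng.gauss(0.0, 2.0))
    return gc, expr


def window_means(values, window_size):
    """Average consecutive, non-overlapping windows of genes."""
    return [sum(values[i:i + window_size]) / window_size
            for i in range(0, len(values), window_size)]


gc, expr = simulate()
r_gene = pearson(gc, expr)                                        # per-gene correlation
r_window = pearson(window_means(gc, 10), window_means(expr, 10))  # per-window correlation
```

Averaging cancels the gene-level noise but keeps the shared regional signal, so the window-level correlation is much higher than the gene-level one even though the GC-content of a gene's region predicts that gene's expression poorly.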
Identification of Candidate B-Lymphoma Genes by Cross-Species Gene Expression Profiling
Tompkins, Van S.; Han, Seong-Su; Olivier, Alicia; Syrbu, Sergei; Bair, Thomas; Button, Anna; Jacobus, Laura; Wang, Zebin; Lifton, Samuel; Raychaudhuri, Pradip; Morse, Herbert C.; Weiner, George; Link, Brian; Smith, Brian J.; Janz, Siegfried
2013-01-01
Comparative genome-wide expression profiling of malignant tumor counterparts across the human-mouse species barrier has a successful track record as a gene discovery tool in liver, breast, lung, prostate and other cancers, but has been largely neglected in studies on neoplasms of mature B-lymphocytes such as diffuse large B cell lymphoma (DLBCL) and Burkitt lymphoma (BL). We used global gene expression profiles of DLBCL-like tumors that arose spontaneously in Myc-transgenic C57BL/6 mice as a phylogenetically conserved filter for analyzing the human DLBCL transcriptome. The human and mouse lymphomas were found to have 60 concordantly deregulated genes in common, including 8 genes that Cox hazard regression analysis associated with overall survival in a published landmark dataset of DLBCL. Genetic network analysis of the 60 genes followed by biological validation studies indicate FOXM1 as a candidate DLBCL and BL gene, supporting a number of studies contending that FOXM1 is a therapeutic target in mature B cell tumors. Our findings demonstrate the value of the “mouse filter” for genomic studies of human B-lineage neoplasms for which a vast knowledge base already exists. PMID:24130802
Lam, Max; Trampush, Joey W; Yu, Jin; Knowles, Emma; Davies, Gail; Liewald, David C; Starr, John M; Djurovic, Srdjan; Melle, Ingrid; Sundet, Kjetil; Christoforou, Andrea; Reinvang, Ivar; DeRosse, Pamela; Lundervold, Astri J; Steen, Vidar M; Espeseth, Thomas; Räikkönen, Katri; Widen, Elisabeth; Palotie, Aarno; Eriksson, Johan G; Giegling, Ina; Konte, Bettina; Roussos, Panos; Giakoumaki, Stella; Burdick, Katherine E; Payton, Antony; Ollier, William; Chiba-Falek, Ornit; Attix, Deborah K; Need, Anna C; Cirulli, Elizabeth T; Voineskos, Aristotle N; Stefanis, Nikos C; Avramopoulos, Dimitrios; Hatzimanolis, Alex; Arking, Dan E; Smyrnis, Nikolaos; Bilder, Robert M; Freimer, Nelson A; Cannon, Tyrone D; London, Edythe; Poldrack, Russell A; Sabb, Fred W; Congdon, Eliza; Conley, Emily Drabant; Scult, Matthew A; Dickinson, Dwight; Straub, Richard E; Donohoe, Gary; Morris, Derek; Corvin, Aiden; Gill, Michael; Hariri, Ahmad R; Weinberger, Daniel R; Pendleton, Neil; Bitsios, Panos; Rujescu, Dan; Lahti, Jari; Le Hellard, Stephanie; Keller, Matthew C; Andreassen, Ole A; Deary, Ian J; Glahn, David C; Malhotra, Anil K; Lencz, Todd
2017-11-28
Here, we present a large (n = 107,207) genome-wide association study (GWAS) of general cognitive ability ("g"), further enhanced by combining results with a large-scale GWAS of educational attainment. We identified 70 independent genomic loci associated with general cognitive ability. Results showed significant enrichment for genes causing Mendelian disorders with an intellectual disability phenotype. Competitive pathway analysis implicated the biological processes of neurogenesis and synaptic regulation, as well as the gene targets of two pharmacologic agents: cinnarizine, a T-type calcium channel blocker, and LY97241, a potassium channel inhibitor. Transcriptome-wide and epigenome-wide analysis revealed that the implicated loci were enriched for genes expressed across all brain regions (most strongly in the cerebellum). Enrichment was exclusive to genes expressed in neurons but not oligodendrocytes or astrocytes. Finally, we report genetic correlations between cognitive ability and disparate phenotypes including psychiatric disorders, several autoimmune disorders, longevity, and maternal age at first birth. Copyright © 2017 The Author(s). Published by Elsevier Inc. All rights reserved.
β-Cell-Specific Mafk Overexpression Impairs Pancreatic Endocrine Cell Development
Abdellatif, Ahmed M.; Oishi, Hisashi; Itagaki, Takahiro; Jung, Yunshin; Shawki, Hossam H.; Okita, Yukari; Hasegawa, Yoshikazu; Suzuki, Hiroyuki; El-Morsy, Salah E.; El-Sayed, Mesbah A.; Shoaib, Mahmoud B.; Sugiyama, Fumihiro; Takahashi, Satoru
2016-01-01
The MAF family transcription factors are homologs of v-Maf, the oncogenic component of the avian retrovirus AS42. They are subdivided into 2 groups, small and large MAF proteins, according to their structure, function, and molecular size. MAFK is a member of the small MAF family and acts as a dominant negative form of large MAFs. In previous research we generated transgenic mice that overexpress MAFK in order to suppress the function of large MAF proteins in pancreatic β-cells. These mice developed hyperglycemia in adulthood due to impairment of glucose-stimulated insulin secretion. The aim of the current study is to examine the effects of β-cell-specific Mafk overexpression in endocrine cell development. The developing islets of Mafk-transgenic embryos appeared to be disorganized with an inversion of total numbers of insulin+ and glucagon+ cells due to reduced β-cell proliferation. Gene expression analysis by quantitative RT-PCR revealed decreased levels of β-cell-related genes whose expressions are known to be controlled by large MAF proteins. Additionally, these changes were accompanied with a significant increase in key β-cell transcription factors likely due to compensatory mechanisms that might have been activated in response to the β-cell loss. Finally, microarray comparison of gene expression profiles between wild-type and transgenic pancreata revealed alteration of some uncharacterized genes including Pcbd1, Fam132a, Cryba2, and Npy, which might play important roles during pancreatic endocrine development. Taken together, these results suggest that Mafk overexpression impairs endocrine development through a regulation of numerous β-cell-related genes. The microarray analysis provided a unique data set of differentially expressed genes that might contribute to a better understanding of the molecular basis that governs the development and function of endocrine pancreas. PMID:26901059
Gene expression analysis of flax seed development
2011-01-01
Background Flax, Linum usitatissimum L., is an important crop whose seed oil and stem fiber have multiple industrial applications. Flax seeds are also well-known for their nutritional attributes, viz., omega-3 fatty acids in the oil and lignans and mucilage from the seed coat. In spite of the importance of this crop, there are few molecular resources that can be utilized toward improving seed traits. Here, we describe flax embryo and seed development and generation of comprehensive genomic resources for the flax seed. Results We describe a large-scale generation and analysis of expressed sequences in various tissues. Collectively, the 13 libraries we have used provide a broad representation of genes active in developing embryos (globular, heart, torpedo, cotyledon and mature stages), seed coats (globular and torpedo stages) and endosperm (pooled globular to torpedo stages) and genes expressed in flowers, etiolated seedlings, leaves, and stem tissue. A total of 261,272 expressed sequence tags (EST) (GenBank accessions LIBEST_026995 to LIBEST_027011) were generated. These EST libraries included transcription factor genes that are typically expressed at low levels, indicating that the depth is adequate for in silico expression analysis. Assembly of the ESTs resulted in 30,640 unigenes and 82% of these could be identified on the basis of homology to known and hypothetical genes from other plants. When compared with fully sequenced plant genomes, the flax unigenes resembled poplar and castor bean more than grape, sorghum, rice or Arabidopsis. Nearly one-fifth of these (5,152) had no homologs in sequences reported for any organism, suggesting that this category represents genes that are likely unique to flax. Digital analyses revealed gene expression dynamics for the biosynthesis of a number of important seed constituents during seed development.
Conclusions We have developed a foundational database of expressed sequences and collection of plasmid clones that comprise even low-expressed genes such as those encoding transcription factors. This has allowed us to delineate the spatio-temporal aspects of gene expression underlying the biosynthesis of a number of important seed constituents in flax. Flax belongs to a taxonomic group of diverse plants and the large sequence database will allow for evolutionary studies as well. PMID:21529361
Fine mapping of regulatory loci for mammalian gene expression using radiation hybrids
Park, Christopher C; Ahn, Sangtae; Bloom, Joshua S; Lin, Andy; Wang, Richard T; Wu, Tongtong; Sekar, Aswin; Khan, Arshad H; Farr, Christine J; Lusis, Aldons J; Leahy, Richard M; Lange, Kenneth; Smith, Desmond J
2010-01-01
We mapped regulatory loci for nearly all protein-coding genes in mammals using comparative genomic hybridization and expression array measurements from a panel of mouse–hamster radiation hybrid cell lines. The large number of breaks in the mouse chromosomes and the dense genotyping of the panel allowed extremely sharp mapping of loci. As the regulatory loci result from extra gene dosage, we call them copy number expression quantitative trait loci, or ceQTLs. The −2log10P support interval for the ceQTLs was <150 kb, containing an average of <2–3 genes. We identified 29,769 trans ceQTLs with −log10P > 4, including 13 hotspots each regulating >100 genes in trans. Further, this work identifies 2,761 trans ceQTLs harboring no known genes, and provides evidence for a mode of gene expression autoregulation specific to the X chromosome. PMID:18362883
Hopp, Lydia; Löffler-Wirth, Henry; Galle, Jörg; Binder, Hans
2018-06-11
We present here a novel method that enables unraveling the interplay between gene expression and DNA methylation in complex diseases such as cancer. The method is based on self-organizing maps and allows for analysis of data landscapes from 'governed by methylation' to 'governed by expression'. We identified regulatory modules of coexpressed and comethylated genes in high-grade gliomas: two modes are governed by genes hypermethylated and underexpressed in IDH-mutated cases, while two other modes reflect immune and stromal signatures in the classical and mesenchymal subtypes. A fifth mode with proneural characteristics comprises genes of repressed and poised chromatin states active in healthy brain. Two additional modes enrich genes either in active or repressed chromatin states. The method disentangles the interplay between gene expression and methylation. It has the potential to integrate also mutation and copy number data and to apply to large sample cohorts.
Li, Yongxin; Kikuchi, Mani; Li, Xueyan; Gao, Qionghua; Xiong, Zijun; Ren, Yandong; Zhao, Ruoping; Mao, Bingyu; Kondo, Mariko; Irie, Naoki; Wang, Wen
2018-01-01
Sea cucumbers, a major class of echinoderms, undergo a very fast and drastic metamorphosis during their development. However, the molecular basis underlying this process remains largely unknown. Here we systematically examined the gene expression profiles of Japanese common sea cucumber (Apostichopus japonicus) for the first time by RNA sequencing across 16 developmental time points from fertilized egg to juvenile stage. Based on the weighted gene co-expression network analysis (WGCNA), we identified 21 modules. Among them, MEdarkmagenta was highly expressed and correlated with the early metamorphosis process from late auricularia to doliolaria larva. Furthermore, gene enrichment and differentially expressed gene analysis identified several genes in the module that may play key roles in the metamorphosis process. Our results not only provide a molecular basis for experimentally studying the development and morphological complexity of sea cucumber, but also lay a foundation for improving its emergence rate. Copyright © 2017 Elsevier Inc. All rights reserved.
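The first step of a weighted co-expression analysis such as WGCNA can be sketched as follows: pairwise correlations between gene profiles are raised to a soft-threshold power to give a weighted adjacency. This is a plain-Python illustration, not the WGCNA R package. The three toy profiles and the gene labels are hypothetical, and the unsigned form |r|^beta with beta = 6 is one conventional choice that the abstract does not specify.

```python
import math
from itertools import combinations

# Toy expression profiles across five hypothetical time points.
profiles = {
    "g1": [1.0, 2.0, 3.0, 4.0, 5.0],
    "g2": [2.0, 4.0, 6.0, 8.0, 10.0],   # perfectly co-expressed with g1
    "g3": [5.0, 1.0, 4.0, 2.0, 3.0],    # weakly (negatively) related to g1
}


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


def soft_threshold_adjacency(data, beta=6):
    """Unsigned weighted adjacency |r|^beta for every gene pair."""
    return {(a, b): abs(pearson(data[a], data[b])) ** beta
            for a, b in combinations(sorted(data), 2)}


adjacency = soft_threshold_adjacency(profiles)

# Connectivity of a gene: sum of its adjacencies to all other genes.
connectivity = {g: sum(w for pair, w in adjacency.items() if g in pair)
                for g in profiles}
```

Raising correlations to a power keeps strong co-expression close to 1 while pushing weak correlations toward 0, which is what lets tightly connected groups of genes stand out as modules in later clustering steps.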
Expression Atlas: gene and protein expression across multiple studies and organisms
Tang, Y Amy; Bazant, Wojciech; Burke, Melissa; Fuentes, Alfonso Muñoz-Pomer; George, Nancy; Koskinen, Satu; Mohammed, Suhaib; Geniza, Matthew; Preece, Justin; Jarnuczak, Andrew F; Huber, Wolfgang; Stegle, Oliver; Brazma, Alvis; Petryszak, Robert
2018-01-01
Expression Atlas (http://www.ebi.ac.uk/gxa) is an added value database that provides information about gene and protein expression in different species and contexts, such as tissue, developmental stage, disease or cell type. The available public and controlled access data sets from different sources are curated and re-analysed using standardized, open source pipelines and made available for queries, download and visualization. As of August 2017, Expression Atlas holds data from 3,126 studies across 33 different species, including 731 from plants. Data from large-scale RNA sequencing studies including Blueprint, PCAWG, ENCODE, GTEx and HipSci can be visualized next to each other. In Expression Atlas, users can query genes or gene-sets of interest and explore their expression across or within species, tissues, developmental stages in a constitutive or differential context, representing the effects of diseases, conditions or experimental interventions. All processed data matrices are available for direct download in tab-delimited format or as R-data. In addition to the web interface, data sets can now be searched and downloaded through the Expression Atlas R package. Novel features and visualizations include the on-the-fly analysis of gene set overlaps and the option to view gene co-expression in experiments investigating constitutive gene expression across tissues or other conditions. PMID:29165655
Chen, Chao; Zhao, Xinqing; Jin, Yingyu; Zhao, Zongbao Kent; Suh, Joo-Won
2014-11-01
Bacterial artificial chromosomal (BAC) vectors are increasingly being used in cloning large DNA fragments containing complex biosynthetic pathways to facilitate heterologous production of microbial metabolites for drug development. To express inserted genes using Streptomyces species as the production hosts, an integration expression cassette is required to be inserted into the BAC vector, which includes genetic elements encoding a phage-specific attachment site, an integrase, an origin of transfer, a selection marker and a promoter. Due to the large sizes of DNA inserted into the BAC vectors, it is normally inefficient and time-consuming to assemble these fragments by routine PCR amplifications and restriction-ligations. Here we present a rapid method to insert fragments to construct BAC-based expression vectors. A DNA fragment of about 130 bp was designed, which contains upstream and downstream homologous sequences of both BAC vector and pIB139 plasmid carrying the whole integration expression cassette. In-Fusion cloning was performed using the designer DNA fragment to modify pIB139, followed by λ-RED-mediated recombination to obtain the BAC-based expression vector. We demonstrated the effectiveness of this method by rapid construction of a BAC-based expression vector with an insert of about 120 kb that contains the entire gene cluster for biosynthesis of immunosuppressant FK506. The empty BAC-based expression vector constructed in this study can be conveniently used for construction of BAC libraries using either microbial pure culture or environmental DNA, and the selected BAC clones can be directly used for heterologous expression. Alternatively, if a BAC library has already been constructed using a commercial BAC vector, the selected BAC vectors can be manipulated using the method described here to get the BAC-based expression vectors with desired gene clusters for heterologous expression. 
The rapid construction of a BAC-based expression vector facilitates heterologous expression of large gene clusters for drug discovery. Copyright © 2014 Elsevier Inc. All rights reserved.
Distinct types of primary cutaneous large B-cell lymphoma identified by gene expression profiling.
Hoefnagel, Juliette J; Dijkman, Remco; Basso, Katia; Jansen, Patty M; Hallermann, Christian; Willemze, Rein; Tensen, Cornelis P; Vermeer, Maarten H
2005-05-01
In the European Organization for Research and Treatment of Cancer (EORTC) classification 2 types of primary cutaneous large B-cell lymphoma (PCLBCL) are distinguished: primary cutaneous follicle center cell lymphomas (PCFCCL) and PCLBCL of the leg (PCLBCL-leg). Distinction between both groups is considered important because of differences in prognosis (5-year survival > 95% and 52%, respectively) and the first choice of treatment (radiotherapy or systemic chemotherapy, respectively), but is not generally accepted. To establish a molecular basis for this subdivision in the EORTC classification, we investigated the gene expression profiles of 21 PCLBCLs by oligonucleotide microarray analysis. Hierarchical clustering based on a B-cell signature (7450 genes) classified PCLBCL into 2 distinct subgroups consisting of, respectively, 8 PCFCCLs and 13 PCLBCLs-leg. PCLBCLs-leg showed increased expression of genes associated with cell proliferation; the proto-oncogenes Pim-1, Pim-2, and c-Myc; and the transcription factors Mum1/IRF4 and Oct-2. In the group of PCFCCL high expression of SPINK2 was observed. Further analysis suggested that PCFCCLs and PCLBCLs-leg have expression profiles similar to that of germinal center B-cell-like and activated B-cell-like diffuse large B-cell lymphoma, respectively. The results of this study suggest that different pathogenetic mechanisms are involved in the development of PCFCCLs and PCLBCLs-leg and provide molecular support for the subdivision used in the EORTC classification.
Methods to increase reproducibility in differential gene expression via meta-analysis
Sweeney, Timothy E.; Haynes, Winston A.; Vallania, Francesco; Ioannidis, John P.; Khatri, Purvesh
2017-01-01
Findings from clinical and biological studies are often not reproducible when tested in independent cohorts. Due to the testing of a large number of hypotheses and relatively small sample sizes, results from whole-genome expression studies in particular are often not reproducible. Compared to single-study analysis, gene expression meta-analysis can improve reproducibility by integrating data from multiple studies. However, there are multiple choices in designing and carrying out a meta-analysis. Yet, clear guidelines on best practices are scarce. Here, we hypothesized that studying subsets of very large meta-analyses would allow for systematic identification of best practices to improve reproducibility. We therefore constructed three very large gene expression meta-analyses from clinical samples, and then examined meta-analyses of subsets of the datasets (all combinations of datasets with up to N/2 samples and K/2 datasets) compared to a ‘silver standard’ of differentially expressed genes found in the entire cohort. We tested three random-effects meta-analysis models using this procedure. We showed relatively greater reproducibility with more-stringent effect size thresholds with relaxed significance thresholds; relatively lower reproducibility when imposing extraneous constraints on residual heterogeneity; and an underestimation of actual false positive rate by Benjamini–Hochberg correction. In addition, multivariate regression showed that the accuracy of a meta-analysis increased significantly with more included datasets even when controlling for sample size. PMID:27634930
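As an illustration of the kind of computation discussed in this abstract, the following is a minimal sketch of pooling one gene's effect size across studies with a random-effects model, followed by Benjamini-Hochberg adjustment of p-values. The abstract does not name the three random-effects models that were tested; the DerSimonian-Laird estimator used here is one common choice and is an assumption, as are the function names.

```python
import math


def dersimonian_laird(effects, variances):
    """Pool per-study effect sizes for one gene (DerSimonian-Laird, assumed model).

    effects   -- per-study effect sizes (e.g. standardized mean differences)
    variances -- per-study sampling variances of those effect sizes
    Returns (pooled_effect, standard_error, tau_squared).
    """
    k = len(effects)
    w = [1.0 / v for v in variances]              # inverse-variance weights
    sw = sum(w)
    fixed = sum(wi * yi for wi, yi in zip(w, effects)) / sw
    # Cochran's Q measures between-study heterogeneity
    q = sum(wi * (yi - fixed) ** 2 for wi, yi in zip(w, effects))
    c = sw - sum(wi ** 2 for wi in w) / sw
    tau2 = max(0.0, (q - (k - 1)) / c) if c > 0 else 0.0
    # Random-effects weights add the between-study variance tau2
    w_star = [1.0 / (v + tau2) for v in variances]
    pooled = sum(wi * yi for wi, yi in zip(w_star, effects)) / sum(w_star)
    se = math.sqrt(1.0 / sum(w_star))
    return pooled, se, tau2


def benjamini_hochberg(pvalues):
    """Return Benjamini-Hochberg adjusted p-values in the input order."""
    m = len(pvalues)
    order = sorted(range(m), key=lambda i: pvalues[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest p-value down, enforcing monotonicity
    for rank in range(m, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, pvalues[i] * m / rank)
        adjusted[i] = running_min
    return adjusted
```

A gene would then be called differentially expressed when its pooled effect size exceeds an effect-size threshold and its adjusted p-value falls below a significance threshold; the abstract reports that stricter effect-size thresholds combined with relaxed significance thresholds gave relatively greater reproducibility.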
Kudo, Toru; Sasaki, Yohei; Terashima, Shin; Matsuda-Imai, Noriko; Takano, Tomoyuki; Saito, Misa; Kanno, Maasa; Ozaki, Soichi; Suwabe, Keita; Suzuki, Go; Watanabe, Masao; Matsuoka, Makoto; Takayama, Seiji; Yano, Kentaro
2016-10-13
In quantitative gene expression analysis, normalization using a reference gene as an internal control is frequently performed for appropriate interpretation of the results. Efforts have been devoted to exploring superior novel reference genes using microarray transcriptomic data and to evaluating commonly used reference genes by targeting analysis. However, because the number of specifically detectable genes is totally dependent on probe design in the microarray analysis, exploration using microarray data may miss some of the best choices for the reference genes. Recently emerging RNA sequencing (RNA-seq) provides an ideal resource for comprehensive exploration of reference genes since this method is capable of detecting all expressed genes, in principle including even unknown genes. We report the results of a comprehensive exploration of reference genes using public RNA-seq data from plants such as Arabidopsis thaliana (Arabidopsis), Glycine max (soybean), Solanum lycopersicum (tomato) and Oryza sativa (rice). To select reference genes suitable for the broadest experimental conditions possible, candidates were surveyed by the following four steps: (1) evaluation of the basal expression level of each gene in each experiment; (2) evaluation of the expression stability of each gene in each experiment; (3) evaluation of the expression stability of each gene across the experiments; and (4) selection of top-ranked genes, after ranking according to the number of experiments in which the gene was expressed stably. Employing this procedure, 13, 10, 12 and 21 top candidates for reference genes were proposed in Arabidopsis, soybean, tomato and rice, respectively. Microarray expression data confirmed that the expression of the proposed reference genes under broad experimental conditions was more stable than that of commonly used reference genes. 
These novel reference genes will be useful for analyzing gene expression profiles across experiments carried out under various experimental conditions.
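The four-step survey described above can be sketched roughly as follows. This is an illustrative outline only: the abstract does not give the stability measure or cut-offs, so the coefficient of variation, the thresholds `min_mean` and `max_cv`, and the function names are all assumptions.

```python
from statistics import mean, pstdev


def cv(values):
    """Coefficient of variation; infinite when the mean is not positive."""
    m = mean(values)
    return pstdev(values) / m if m > 0 else float("inf")


def rank_reference_genes(experiments, min_mean=10.0, max_cv=0.2):
    """Rank candidate reference genes.

    experiments -- {experiment_id: {gene: [expression values per sample]}}
    min_mean, max_cv -- hypothetical cut-offs, not taken from the abstract
    Returns genes ordered from most to least suitable.
    """
    stable_counts = {}   # gene -> number of experiments with stable expression
    exp_means = {}       # gene -> list of per-experiment mean expression
    for samples in experiments.values():
        for gene, values in samples.items():
            m = mean(values)
            exp_means.setdefault(gene, []).append(m)
            # Step 1: basal expression level; step 2: stability within the experiment
            ok = m >= min_mean and cv(values) <= max_cv
            stable_counts[gene] = stable_counts.get(gene, 0) + (1 if ok else 0)
    # Step 3: stability across experiments (CV of the per-experiment means)
    across_cv = {g: cv(ms) for g, ms in exp_means.items()}
    # Step 4: rank by the number of experiments with stable expression,
    # breaking ties by across-experiment stability, then by gene name
    return sorted(stable_counts, key=lambda g: (-stable_counts[g], across_cv[g], g))
```

Using the across-experiment variation only as a tie-breaker is a design choice made for brevity here; it could equally be applied as a hard filter before ranking.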
Taha, M O; de Oliveira, J V; Dias Borges, M; de Lucca Melo, F; Gualtieri, F G; E Silva Aidar, A L; Pacheco, R L; de Melo Alexandre E Silva, T; Klajner, R K; Iuamoto, L R; Munhoz Torres, L; Morais Mendes de Paula, B J; de Campos, K; Oliveira-Junior, I S; Fagundes, D J
2016-03-01
The goal of this study was to investigate whether exogenous offer of L-arginine (LARG) modulates the gene expression of intestinal dysfunction caused by ischemia and reperfusion. Eighteen Wistar-EPM1 male rats (250-300 g) were anesthetized and subjected to laparotomy. The superior mesenteric vessels were exposed, and the rats were randomized into 3 groups (n = 6): the control group (CG), with no superior mesenteric artery interruption; the ischemia/reperfusion group (IRG), with 60 minutes of ischemia and 120 minutes of reperfusion and saline injections; and the L-arginine group (IRG + LARG), with L-arginine injected in the femoral vein 5 minutes before ischemia, 5 minutes after reperfusion, and after 55 minutes of reperfusion. The total RNA was extracted and purified from samples of the small intestine. The concentration of each total RNA sample was determined by using spectrophotometry. The first-strand complementary DNA (cDNA) was synthesized in equal amounts of cDNA and the Master Mix SYBR Green qPCR Mastermix (SABiosciences, a Qiagen Company, Frederick, Md). Amounts of cDNA and Master Mix SYBR Green qPCR Mastermix were distributed to each well of the polymerase chain reaction microarray plate containing the predispensed gene-specific primer sets for Bax and Bcl2. Each sample was evaluated in triplicate, and the Student t test was applied to validate the homogeneity of each gene expression reaction (P < .05). The gene expression of Bax in IRG (+1.48) was significantly higher than in IRG-LARG (+9.69); the expression of Bcl2L1 in IRG (+1.01) was significantly higher than IRG-LARG (+22.89). The apoptotic cell pathway of 2 protagonists showed that LARG improves the gene expression of anti-apoptotic Bcl2l1 (Bcl2-like 1) more than the pro-apoptotic Bax (Bcl2-associated X protein). Copyright © 2016. Published by Elsevier Inc.
Transcriptome study of differential expression in schizophrenia
Sanders, Alan R.; Göring, Harald H. H.; Duan, Jubao; Drigalenko, Eugene I.; Moy, Winton; Freda, Jessica; He, Deli; Shi, Jianxin; Gejman, Pablo V.
2013-01-01
Schizophrenia genome-wide association studies (GWAS) have identified common SNPs, rare copy number variants (CNVs) and a large polygenic contribution to illness risk, but biological mechanisms remain unclear. Bioinformatic analyses of significantly associated genetic variants point to a large role for regulatory variants. To identify gene expression abnormalities in schizophrenia, we generated whole-genome gene expression profiles using microarrays on lymphoblastoid cell lines (LCLs) from 413 cases and 446 controls. Regression analysis identified 95 transcripts differentially expressed by affection status at a genome-wide false discovery rate (FDR) of 0.05, while simultaneously controlling for confounding effects. These transcripts represented 89 genes with functions such as neurotransmission, gene regulation, cell cycle progression, differentiation, apoptosis, microRNA (miRNA) processing and immunity. This functional diversity is consistent with schizophrenia's likely significant pathophysiological heterogeneity. The overall enrichment of immune-related genes among those differentially expressed by affection status is consistent with hypothesized immune contributions to schizophrenia risk. The observed differential expression of extended major histocompatibility complex (xMHC) region histones (HIST1H2BD, HIST1H2BC, HIST1H2BH, HIST1H2BG and HIST1H4K) converges with the genetic evidence from GWAS, which find the xMHC to be the most significant susceptibility locus. Among the differentially expressed immune-related genes, B3GNT2 is implicated in autoimmune disorders previously tied to schizophrenia risk (rheumatoid arthritis and Graves’ disease), and DICER1 is pivotal in miRNA processing potentially linking to miRNA alterations in schizophrenia (e.g. MIR137, the second strongest GWAS finding). Our analysis provides novel candidate genes for further study to assess their potential contribution to schizophrenia. PMID:23904455
Hepatic gene expression patterns following trauma-hemorrhage: effect of posttreatment with estrogen.
Yu, Huang-Ping; Pang, See-Tong; Chaudry, Irshad H
2013-01-01
The aim of this study was to examine the role of estrogen on hepatic gene expression profiles at an early time point following trauma-hemorrhage in rats. Groups of injured and sham controls receiving estrogen or vehicle were killed 2 h after injury and resuscitation, and liver tissue was harvested. Complementary RNA was synthesized from each RNA sample and hybridized to microarrays. A large number of genes were differentially expressed at the 2-h time point in injured animals with or without estrogen treatment. The upregulation or downregulation of a cohort of 14 of these genes was validated by reverse transcription-polymerase chain reaction. This large-scale microarray analysis shows that at the 2-h time point, there is marked alteration in hepatic gene expression following trauma-hemorrhage. However, estrogen treatment attenuated these changes in injured animals. Pathway analysis demonstrated predominant changes in the expression of genes involved in metabolism, immunity, and apoptosis. Upregulation of low-density lipoprotein receptor, protein phosphatase 1, regulatory subunit 3C, ring-finger protein 11, pyroglutamyl-peptidase I, bactericidal/permeability-increasing protein, integrin, αD, BCL2-like 11, leukemia inhibitory factor receptor, ATPase, Cu transporting, α polypeptide, and Mk1 protein was found in estrogen-treated trauma-hemorrhaged animals. Thus, estrogen produces hepatoprotection following trauma-hemorrhage likely via antiapoptosis and improving/restoring metabolism and immunity pathways.
Positive Selection Underlies Faster-Z Evolution of Gene Expression in Birds
Dean, Rebecca; Harrison, Peter W.; Wright, Alison E.; Zimmer, Fabian; Mank, Judith E.
2015-01-01
The elevated rate of evolution for genes on sex chromosomes compared with autosomes (Fast-X or Fast-Z evolution) can result either from positive selection in the heterogametic sex or from nonadaptive consequences of reduced relative effective population size. Recent work in birds suggests that Fast-Z of coding sequence is primarily due to relaxed purifying selection resulting from reduced relative effective population size. However, gene sequence and gene expression are often subject to distinct evolutionary pressures; therefore, we tested for Fast-Z in gene expression using next-generation RNA-sequencing data from multiple avian species. Similar to studies of Fast-Z in coding sequence, we recover clear signatures of Fast-Z in gene expression; however, in contrast to coding sequence, our data indicate that Fast-Z in expression is due to positive selection acting primarily in females. In the soma, where gene expression is highly correlated between the sexes, we detected Fast-Z in both sexes, although at a higher rate in females, suggesting that many positively selected expression changes in females are also expressed in males. In the gonad, where intersexual correlations in expression are much lower, we detected Fast-Z for female gene expression, but crucially, not males. This suggests that a large amount of expression variation is sex-specific in its effects within the gonad. Taken together, our results indicate that Fast-Z evolution of gene expression is the product of positive selection acting on recessive beneficial alleles in the heterogametic sex. More broadly, our analysis suggests that the adaptive potential of Z chromosome gene expression may be much greater than that of gene sequence, results which have important implications for the role of sex chromosomes in speciation and sexual selection. PMID:26067773
Comparison of gene expression changes induced by biguanides in db/db mice liver.
Heishi, Masayuki; Hayashi, Koji; Ichihara, Junji; Ishikawa, Hironori; Kawamura, Takao; Kanaoka, Masaharu; Taiji, Mutsuo; Kimura, Toru
2008-08-01
Large-scale clinical studies have shown the biguanide drug metformin, widely used for type 2 diabetes, to be very safe. By contrast, another biguanide, phenformin, has been withdrawn from major markets because of a high incidence of serious adverse effects. The difference in mode of action between the two biguanides remains unclear. To gain insight into the different modes of action of the two drugs, we performed global gene expression profiling using the livers of obese diabetic db/db mice after a single administration of phenformin or metformin at levels sufficient to cause a significant reduction in blood glucose level. Metformin induced modest expression changes, including G6pc in the liver as previously reported. By contrast, phenformin caused changes in the expression level of many additional genes. We used a knowledge-based bioinformatic analysis to study the effects of phenformin. Differentially expressed genes identified in this study constitute a large gene network, which may be related to cell death, inflammation or wound response. Our results suggest that the two biguanides show a similar hypoglycemic effect in db/db mice, but phenformin induces a greater stress on the liver even a short time after a single administration. These findings provide a novel insight into the cause of the relatively high occurrence of serious adverse effects after phenformin treatment.
Employing epigenetics to augment the expression of therapeutic proteins in mammalian cells.
Kwaks, Ted H J; Otte, Arie P
2006-03-01
Recombinant proteins form an increasingly large part of the portfolio of biopharmaceutical companies. Production of these often complex transgenic proteins is achieved predominantly in mammalian cell lines but the process is hampered by low yields and unstable expression. Some of these problems are caused by gene silencing at the level of chromatin - so-called epigenetic gene silencing. Here, we describe approaches, which have emerged during the past few years, designed to interfere with epigenetic gene silencing with the aim of enhancing and stabilizing transgene expression. These include targeting histones, the inclusion of specific DNA elements and targeting sites of high gene-expression. We conclude that employing epigenetic gene regulation tools, in combination with further process optimization, might represent the next step forward in the production of therapeutic proteins.
Hacker, David L; Bertschinger, Martin; Baldi, Lucia; Wurm, Florian M
2004-10-27
Human embryonic kidney 293 (HEK293) cells, a widely used host for large-scale transient expression of recombinant proteins, are transformed with the adenovirus E1A and E1B genes. Because the E1A proteins function as transcriptional activators or repressors, they may have a positive or negative effect on transient transgene expression in this cell line. Suspension cultures of HEK293 EBNA (HEK293E) cells were co-transfected with a reporter plasmid expressing the GFP gene and a plasmid expressing a short hairpin RNA (shRNA) targeting the E1A mRNAs for degradation by RNA interference (RNAi). The presence of the shRNA in HEK293E cells reduced the steady state level of E1A mRNA up to 75% and increased transient GFP expression from either the elongation factor-1alpha (EF-1alpha) promoter or the human cytomegalovirus (HCMV) immediate early promoter up to twofold. E1A mRNA depletion also resulted in a twofold increase in transient expression of a recombinant IgG in both small- and large-scale suspension cultures when the IgG light and heavy chain genes were controlled by the EF-1alpha promoter. Finally, transient IgG expression was enhanced 2.5-fold when the anti-E1A shRNA was expressed from the same vector as the IgG light chain gene. These results demonstrated that E1A has a negative effect on transient gene expression in HEK293E cells, and they established that RNAi can be used to enhance recombinant protein expression in mammalian cells.
Feltus, F Alex; Ficklin, Stephen P; Gibson, Scott M; Smith, Melissa C
2013-06-05
Background In genomics, highly relevant gene interaction (co-expression) networks have been constructed by finding significant pair-wise correlations between genes in expression datasets. These networks are then mined to elucidate biological function at the polygenic level. In some cases networks may be constructed from input samples that measure gene expression under a variety of different conditions, such as for different genotypes, environments, disease states and tissues. When large sets of samples are obtained from public repositories it is often unmanageable to associate samples into condition-specific groups, and combining samples from various conditions has a negative effect on network size. A fixed significance threshold is often applied, also limiting the size of the final network. Therefore, we propose pre-clustering of input expression samples to approximate condition-specific grouping of samples and individual network construction of each group as a means for dynamic significance thresholding. The net effect is increased sensitivity, thus maximizing the total co-expression relationships in the final co-expression network compendium. Results A total of 86 Arabidopsis thaliana co-expression networks were constructed after k-means partitioning of 7,105 publicly available ATH1 Affymetrix microarray samples. We term each pre-sorted network a Gene Interaction Layer (GIL). Random Matrix Theory (RMT), an un-supervised thresholding method, was used to threshold each of the 86 networks independently, effectively providing a dynamic (non-global) threshold for the network. The overall gene count across all GILs reached 19,588 genes (94.7% measured gene coverage) and 558,022 unique co-expression relationships. In comparison, network construction without pre-sorting of input samples yielded only 3,297 genes (15.9%) and 129,134 relationships in the global network.
Conclusions Here we show that pre-clustering of microarray samples helps approximate condition-specific networks and allows for dynamic thresholding using un-supervised methods. Because RMT ensures only highly significant interactions are kept, the GIL compendium consists of 558,022 unique high quality A. thaliana co-expression relationships across almost all of the measurable genes on the ATH1 array. For A. thaliana, these networks represent the largest compendium to date of significant gene co-expression relationships, and are a means to explore complex pathway, polygenic, and pleiotropic relationships for this focal model plant. The networks can be explored at sysbio.genome.clemson.edu. Finally, this method is applicable to any large expression profile collection for any organism and is best suited where a knowledge-independent network construction method is desired. PMID:23738693
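To make the idea of building one network per pre-clustered sample group concrete, here is a small sketch. It assumes the sample cluster labels are already available (for example from a k-means run, which is not reproduced here), and it substitutes a fixed absolute Pearson correlation cut-off for the RMT thresholding described in the abstract; the function names and the cut-off value are hypothetical.

```python
import math
from itertools import combinations


def pearson(x, y):
    """Pearson correlation; returns 0.0 when either vector is constant."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        return 0.0
    return sxy / math.sqrt(sxx * syy)


def layered_network(expr, labels, threshold=0.9):
    """Union of co-expression edges found within each sample cluster.

    expr      -- {gene: [expression value per sample]}
    labels    -- cluster label for each sample (assumed precomputed)
    threshold -- fixed |r| cut-off, a simple stand-in for RMT thresholding
    """
    edges = set()
    for label in set(labels):
        idx = [i for i, lab in enumerate(labels) if lab == label]
        if len(idx) < 3:          # too few samples for a meaningful correlation
            continue
        for g1, g2 in combinations(sorted(expr), 2):
            r = pearson([expr[g1][i] for i in idx], [expr[g2][i] for i in idx])
            if abs(r) >= threshold:
                edges.add((g1, g2))
    return edges


# Two genes that are perfectly correlated in cluster 0 but not in cluster 1
expr = {"g1": [1, 2, 3, 1, 2, 3], "g2": [2, 4, 6, 5, 1, 4]}
per_cluster = layered_network(expr, [0, 0, 0, 1, 1, 1])
pooled = layered_network(expr, [0, 0, 0, 0, 0, 0])
```

In this toy example the per-cluster construction recovers the g1-g2 relationship while the pooled construction does not, which mirrors the gain in sensitivity the abstract attributes to pre-sorting the samples.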
Sarcoptes scabiei Mites Modulate Gene Expression in Human Skin Equivalents
Morgan, Marjorie S.; Arlian, Larry G.; Markey, Michael P.
2013-01-01
The ectoparasitic mite Sarcoptes scabiei, which burrows in the epidermis of mammalian skin, has a long co-evolution with its hosts. Phenotypic studies show that the mites have the ability to modulate cytokine secretion and expression of cell adhesion molecules in cells of the skin and other cells of the innate and adaptive immune systems, which may assist the mites to survive in the skin. The purpose of this study was to identify genes in keratinocytes and fibroblasts in human skin equivalents (HSEs) that changed expression in response to the burrowing of live scabies mites. Overall, of the more than 25,800 genes measured, 189 genes were up-regulated >2-fold in response to scabies mite burrowing while 152 genes were down-regulated to the same degree. HSEs differentially expressed large numbers of genes that were related to host protective responses including those involved in immune response, defense response, cytokine activity, taxis, response to other organisms, and cell adhesion. Genes for the expression of interleukin-1α (IL-1α) precursor, IL-1β, granulocyte/macrophage-colony stimulating factor (GM-CSF) precursor, and G-CSF precursor were up-regulated 2.8- to 7.4-fold, paralleling cytokine secretion profiles. A large number of genes involved in epithelium development and keratinization were also differentially expressed in response to live scabies mites. Thus, these skin cells are directly responding as expected in an inflammatory response to products of the mites and the disruption of the skin’s protective barrier caused by burrowing. This suggests that in vivo the interplay among these skin cells and other cell types, including Langerhans cells, dendritic cells, lymphocytes and endothelial cells, is responsible for depressing the host’s protective response allowing these mites to survive in the skin. PMID:23940705
Hi-C Chromatin Interaction Networks Predict Co-expression in the Mouse Cortex
Hulsman, Marc; Lelieveldt, Boudewijn P. F.; de Ridder, Jeroen; Reinders, Marcel
2015-01-01
The three dimensional conformation of the genome in the cell nucleus influences important biological processes such as gene expression regulation. Recent studies have shown a strong correlation between chromatin interactions and gene co-expression. However, predicting gene co-expression from frequent long-range chromatin interactions remains challenging. We address this by characterizing the topology of the cortical chromatin interaction network using scale-aware topological measures. We demonstrate that based on these characterizations it is possible to accurately predict spatial co-expression between genes in the mouse cortex. Consistent with previous findings, we find that the chromatin interaction profile of a gene-pair is a good predictor of their spatial co-expression. However, the accuracy of the prediction can be substantially improved when chromatin interactions are described using scale-aware topological measures of the multi-resolution chromatin interaction network. We conclude that, for co-expression prediction, it is necessary to take into account different levels of chromatin interactions ranging from direct interaction between genes (i.e. small-scale) to chromatin compartment interactions (i.e. large-scale). PMID:25965262
Shang, Feng; Ding, Bi-Yue; Xiong, Ying; Dou, Wei; Wei, Dong; Jiang, Hong-Bo; Wei, Dan-Dan; Wang, Jin-Jun
2016-01-01
Winged and wingless morphs in insects represent a trade-off between dispersal ability and reproduction. We studied key genes associated with apterous and alate morphs in Toxoptera citricida (Kirkaldy) using RNAseq, digital gene expression (DGE) profiling, and RNA interference. The de novo assembly of the transcriptome was obtained through Illumina short-read sequencing technology. A total of 44,199 unigenes were generated and 27,640 were annotated. The transcriptomic differences between alate and apterous adults indicated that 279 unigenes were highly expressed in alate adults, whereas 5,470 were expressed at low levels. Expression patterns of the top 10 highly expressed genes in alate adults agreed with wing bud development trends. Silencing of the lipid synthesis and degradation gene (3-ketoacyl-CoA thiolase, mitochondrial-like) and glycogen genes (Phosphoenolpyruvate carboxykinase [GTP]-like and Glycogen phosphorylase-like isoform 2) resulted in underdeveloped wings. This suggests that both lipid and glycogen metabolism provide energy for aphid wing development. The large number of sequences and expression data produced from the transcriptome and DGE sequencing, respectively, increases our understanding of wing development mechanisms. PMID:27577531
Fox, I J; Chowdhury, N R; Gupta, S; Kondapalli, R; Schilsky, M L; Stockert, R J; Chowdhury, J R
1995-03-01
Viral vectors and protein carriers utilizing asialoglycoprotein receptor (ASGR)-mediated endocytosis are being developed to transfer genes for the correction of bilirubin-UDP-glucuronosyltransferase (bilirubin-UGT) deficiency. Ex vivo evaluation of these gene transfer vectors would be facilitated by a cell system that lacks bilirubin-UGT, but expresses differentiated liver functions, including ASGR. We immortalized primary Gunn rat hepatocytes by transduction with a recombinant Moloney murine leukemia virus expressing a thermolabile mutant SV40 large T antigen (tsA58). At 33 degrees C, the immortalized hepatocyte clones expressed SV40 large T antigen, synthesized DNA, and doubled in number every 2 to 3 days. At this temperature, differentiated hepatocyte markers, e.g., albumin, ASGR, and androsterone-UGT, were expressed at 5% to 10% of the levels found in primary hepatocytes maintained in culture for 24 hours. Glutathione-S-transferase Yp (GST-Yp), an oncofetal protein, was expressed in these cells at 33 degrees C, but was undetectable in primary hepatocytes. In contrast, when the cells were cultured at 39 degrees C or 37 degrees C, the large T antigen was degraded, DNA synthesis and cell growth stopped, and morphologic characteristics of differentiated hepatocytes were observed. The expression of albumin, ASGR, and androsterone-UGT, and their corresponding mRNAs, increased to 25% to 40% of the level in primary hepatocytes, whereas GST-Yp expression decreased. Functionality of ASGR was demonstrated by internalization of Texas red-labeled asialoorosomucoid, and binding and degradation of 125I-asialoorosomucoid. After liposome-mediated transfer of a plasmid containing the coding region of human bilirubin-UGT1, driven by the SV40 large T promoter, active human bilirubin-UGT1 was expressed in these cells. The immortalized cells were not tumorigenic after transplantation into severe combined immunodeficiency mice. 
These conditionally immortalized cells will be useful for ex vivo evaluation of bilirubin-UGT gene transfer vectors.
Golpon, Heiko A.; Geraci, Mark W.; Moore, Mark D.; Miller, Heidi L.; Miller, Gary J.; Tuder, Rubin M.; Voelkel, Norbert F.
2001-01-01
HOX genes belong to the large family of homeodomain genes that function as transcription factors. Animal studies indicate that they play an essential role in lung development. We investigated the expression pattern of HOX genes in human lung tissue by using microarray and degenerate reverse transcriptase-polymerase chain reaction survey techniques. HOX genes predominantly from the 3′ end of clusters A and B were expressed in normal human adult lung and among them HOXA5 was the most abundant, followed by HOXB2 and HOXB6. In fetal (12 weeks old) and diseased lung specimens (emphysema, primary pulmonary hypertension) additional HOX genes from clusters C and D were expressed. Using in situ hybridization, transcripts for HOXA5 were predominantly found in alveolar septal and epithelial cells, both in normal and diseased lungs. A 2.5-fold increase in HOXA5 mRNA expression was demonstrated by quantitative reverse transcriptase-polymerase chain reaction in primary pulmonary hypertension lung specimens when compared to normal lung tissue. In conclusion, we demonstrate that HOX genes are selectively expressed in the human lung. Differences in the pattern of HOX gene expression exist among fetal, adult, and diseased lung specimens. The altered pattern of HOX gene expression may contribute to the development of pulmonary diseases. PMID:11238043
Gottlieb, Assaf; Daneshjou, Roxana; DeGorter, Marianne; Bourgeois, Stephane; Svensson, Peter J; Wadelius, Mia; Deloukas, Panos; Montgomery, Stephen B; Altman, Russ B
2017-11-24
Genome-wide association studies are useful for discovering genotype-phenotype associations but are limited because they require large cohorts to identify a signal, which can be population-specific. Mapping genetic variation to genes improves power and allows the effects of both protein-coding variation as well as variation in expression to be combined into "gene level" effects. Previous work has shown that warfarin dose can be predicted using information from genetic variation that affects protein-coding regions. Here, we introduce a method that improves dose prediction by integrating tissue-specific gene expression. In particular, we use drug pathways and expression quantitative trait loci knowledge to impute gene expression, on the assumption that differential expression of key pathway genes may impact dose requirement. We focus on 116 genes from the pharmacokinetic and pharmacodynamic pathways of warfarin within training and validation sets comprising both European and African-descent individuals. We build gene-tissue signatures associated with warfarin dose in a cohort-specific manner and identify a signature of 11 gene-tissue pairs that significantly augments the International Warfarin Pharmacogenetics Consortium dosage-prediction algorithm in both populations. Our results demonstrate that imputed expression can improve dose prediction and bridge population-specific compositions. MATLAB code is available at https://github.com/assafgo/warfarin-cohort.
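A common way to impute expression from eQTL knowledge is a weighted sum of genotype dosages, and the sketch below shows that idea in its simplest form. It is not the authors' MATLAB implementation; the SNP identifiers, weights, and the choice to treat unobserved SNPs as contributing zero are all assumptions made for illustration.

```python
def impute_expression(dosages, eqtl_weights):
    """Impute one gene-tissue expression value for one individual.

    dosages      -- {snp_id: allele dosage (0, 1 or 2)} for the individual
    eqtl_weights -- {snp_id: effect of one allele on expression} for the
                    gene-tissue pair (hypothetical values)
    SNPs missing from `dosages` are treated as dosage 0 (an assumption).
    """
    return sum(w * dosages.get(snp, 0.0) for snp, w in eqtl_weights.items())


# Hypothetical example: three eQTL SNPs for one gene in one tissue
weights = {"snp_a": 0.5, "snp_b": -0.2, "snp_c": 0.3}
individual = {"snp_a": 2, "snp_b": 1}
imputed = impute_expression(individual, weights)
```

The imputed values for a set of gene-tissue pairs could then serve as additional predictors alongside the clinical and genotype terms of an existing dosing model.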
Mosquera Orgueira, Adrián
2015-01-01
DNA methylation is a frequent epigenetic mechanism that participates in transcriptional repression. Variations in DNA methylation with respect to gene expression are constant, and, for unknown reasons, some genes with highly methylated promoters are sometimes overexpressed. In this study we have analyzed the expression and methylation patterns of thousands of genes in five groups of cancer and normal tissue samples in order to determine local and genome-wide differences. We observed significant changes in global methylation-expression correlation in all the neoplasms, which suggests that differential correlation events are frequent in cancer. A focused analysis in the breast cancer cohort identified 1662 genes whose correlation varies significantly between normal and cancerous breast, but whose DNA methylation and gene expression patterns do not change substantially. These genes were enriched in cancer-related pathways and repressive chromatin features across various model cell lines, such as PRC2 binding and H3K27me3 marks. Substantial changes in methylation-expression correlation indicate that these genes are subject to epigenetic remodeling, where the differential activity of other factors breaks the expected relationship between both variables. Our findings suggest a complex regulatory landscape where a redistribution of local and large-scale chromatin repressive domains at differentially correlated genes (DCGs) creates epigenetic hotspots that modulate cancer-specific gene expression.
Durand-Dubief, Mickaël; Absalon, Sabrina; Menzer, Linda; Ngwabyt, Sandra; Ersfeld, Klaus; Bastin, Philippe
2007-12-01
The protist Trypanosoma brucei possesses a single Argonaute gene called TbAGO1 that is necessary for RNAi silencing. We previously showed that in strain 427, TbAGO1 knock-out leads to a slow growth phenotype and to chromosome segregation defects. Here we report that the slow growth phenotype is linked to defects in segregation of both large and mini-chromosome populations, with large chromosomes being the most affected. These phenotypes are completely reversed upon inducible re-expression of TbAGO1 fused to GFP, demonstrating their link with TbAGO1. Trypanosomes that do not express TbAGO1 show a general increase in the abundance of transcripts derived from the short retroposon RIME (Ribosomal Interspersed Mobile Element). Supplementary large RIME transcripts emerge in the absence of RNAi, a phenomenon coupled to the disappearance of short transcripts. These fluctuations are reversed by inducible expression of GFP::TbAGO1. Furthermore, we use a combination of Northern blots, RT-PCR and sequencing to reveal that RNAi controls expression of transcripts derived from RHS (Retrotransposon Hot Spot) pseudogenes (RHS genes with retro-element(s) integrated within their coding sequence). Absence of RNAi also leads to an increase of steady-state transcripts from regular RHS genes (those without retro-element), indicating a role for pseudogene in control of gene expression. However, analysis of retroposon abundance and arrangement in the genome of multiple clonal cell lines of TbAGO1-/- failed to reveal movement of mobile elements despite the increased amounts of retroposon transcripts.
Tuller, Tamir; Atar, Shimshi; Ruppin, Eytan; Gurevich, Michael; Achiron, Anat
2011-09-15
Multiple sclerosis (MS) is a central nervous system autoimmune inflammatory T-cell-mediated disease with a relapsing-remitting course in the majority of patients. In this study, we performed a high-resolution systems biology analysis of gene expression and physical interactions in MS relapse and remission. To this end, we integrated 164 large-scale measurements of gene expression in peripheral blood mononuclear cells of MS patients in relapse or remission and healthy subjects, with large-scale information about the physical interactions between these genes obtained from public databases. These data were analyzed with a variety of computational methods. We find that there is a clear and significant global network-level signal that is related to the changes in gene expression of MS patients in comparison to healthy subjects. However, despite the clear differences in the clinical symptoms of MS patients in relapse versus remission, the network-level signal is weaker when comparing patients in these two stages of the disease. This result suggests that most of the genes have relatively similar expression levels in the two stages of the disease. In accordance with previous studies, we found that the pathways related to regulation of cell death, chemotaxis and inflammatory response are differentially expressed in the disease in comparison to healthy subjects, while pathways related to cell adhesion, cell migration and cell-cell signaling are activated in relapse in comparison to remission. However, the current study includes a detailed report of the exact set of genes involved in these pathways and the interactions between them. For example, we found that the genes TP53 and IL1 are 'network hubs' that interact with many of the differentially expressed genes in MS patients versus healthy subjects, and the epidermal growth factor receptor is a 'network hub' in the case of MS patients with relapse versus remission.
The statistical approaches employed in this study enabled us to report new sets of genes that according to their gene expression and physical interactions are predicted to be differentially expressed in MS versus healthy subjects, and in MS patients in relapse versus remission. Some of these genes may be useful biomarkers for diagnosing MS and predicting relapses in MS patients.
Van Gelder, R N; Bae, H; Palazzolo, M J; Krasnow, M A
1995-12-01
Although mRNAs expressed with a circadian rhythm have been isolated from many species, the extent and character of circadianly regulated gene expression is unknown for any animal. In Drosophila melanogaster, only the period (per) gene, an essential component of the circadian pacemaker, is known to show rhythmic mRNA expression. Recent work suggests that the encoded Per protein controls its own transcription by an autoregulatory feedback loop. Per might also control the rhythmic expression of other genes to generate circadian behavior and physiology. The goals of this work were to evaluate the extent and character of circadian control of gene expression in Drosophila, and to identify genes dependent on per for circadian expression. A large collection of anonymous, independent cDNA clones was used to screen for transcripts that are rhythmically expressed in the fly head. 20 of the 261 clones tested detected mRNAs with a greater than two-fold daily change in abundance. Three mRNAs were maximally expressed in the morning, whereas 17 mRNAs were most abundant in the evening--when per mRNA is also maximally expressed (but when the flies are inactive). Further analysis of the three 'morning' cDNAs showed that each has a unique dependence on the presence of a light-dark cycle, on timed feeding, and on the function of the per gene for its oscillation. These dependencies were different from those determined for per and for a novel 'evening' gene. Sequence analysis indicated that all but one of the 20 cDNAs identified previously uncloned genes. Diurnal control of gene expression is a significant but limited phenomenon in the fly head, which involves many uncharacterized genes. Diurnal control is mediated by multiple endogenous and exogenous mechanisms, even at the level of individual genes. A subset of circadianly expressed genes are predominantly or exclusively dependent on per for their rhythmic expression. 
The per gene can therefore influence the expression of genes other than itself, but for many rhythmically expressed genes, per functions in conjunction with external inputs to control their daily expression patterns.
Discrete domains of gene expression in germinal layers distinguish the development of gyrencephaly
de Juan Romero, Camino; Bruder, Carl; Tomasello, Ugo; Sanz-Anquela, José Miguel; Borrell, Víctor
2015-01-01
Gyrencephalic species develop folds in the cerebral cortex in a stereotypic manner, but the genetic mechanisms underlying this patterning process are unknown. We present a large-scale transcriptomic analysis of individual germinal layers in the developing cortex of the gyrencephalic ferret, comparing between regions prospective of fold and fissure. We find unique transcriptional signatures in each germinal compartment, where thousands of genes are differentially expressed between regions, including ∼80% of genes mutated in human cortical malformations. These regional differences emerge from the existence of discrete domains of gene expression, which occur at multiple locations across the developing cortex of ferret and human, but not the lissencephalic mouse. Complex expression patterns emerge late during development and map the eventual location of folds or fissures. Protomaps of gene expression within germinal layers may contribute to define cortical folds or functional areas, but our findings demonstrate that they distinguish the development of gyrencephalic cortices. PMID:25916825
Predictable transcriptome evolution in the convergent and complex bioluminescent organs of squid
Pankey, M. Sabrina; Minin, Vladimir N.; Imholte, Greg C.; Suchard, Marc A.; Oakley, Todd H.
2014-01-01
Despite contingency in life’s history, the similarity of evolutionarily convergent traits may represent predictable solutions to common conditions. However, the extent to which overall gene expression levels (transcriptomes) underlying convergent traits are themselves convergent remains largely unexplored. Here, we show strong statistical support for convergent evolutionary origins and massively parallel evolution of the entire transcriptomes in symbiotic bioluminescent organs (bacterial photophores) from two divergent squid species. The gene expression similarities are so strong that regression models of one species’ photophore can predict organ identity of a distantly related photophore from gene expression levels alone. Our results point to widespread parallel changes in gene expression evolution associated with convergent origins of complex organs. Therefore, predictable solutions may drive not only the evolution of novel, complex organs but also the evolution of overall gene expression levels that underlie them. PMID:25336755
NASA Technical Reports Server (NTRS)
Wenck, A. R.; Quinn, M.; Whetten, R. W.; Pullman, G.; Sederoff, R.; Brown, C. S. (Principal Investigator)
1999-01-01
Agrobacterium-mediated gene transfer is the method of choice for many plant biotechnology laboratories; however, large-scale use of this organism in conifer transformation has been limited by difficult propagation of explant material, selection efficiencies and low transformation frequency. We have analyzed co-cultivation conditions and different disarmed strains of Agrobacterium to improve transformation. Additional copies of virulence genes were added to three common disarmed strains. These extra virulence genes included either a constitutively active virG or extra copies of virG and virB, both from pTiBo542. In experiments with Norway spruce, we increased transformation efficiencies 1000-fold from initial experiments where little or no transient expression was detected. Over 100 transformed lines expressing the marker gene beta-glucuronidase (GUS) were generated from rapidly dividing embryogenic suspension-cultured cells co-cultivated with Agrobacterium. GUS activity was used to monitor transient expression and to further test lines selected on kanamycin-containing medium. In loblolly pine, transient expression increased 10-fold utilizing modified Agrobacterium strains. Agrobacterium-mediated gene transfer is a useful technique for large-scale generation of transgenic Norway spruce and may prove useful for other conifer species.
Anterior-posterior regionalized gene expression in the Ciona notochord
Veeman, Michael
2014-01-01
Background In the simple ascidian chordate Ciona the signaling pathways and gene regulatory networks giving rise to initial notochord induction are largely understood and the mechanisms of notochord morphogenesis are being systematically elucidated. The notochord has generally been thought of as a non-compartmentalized or regionalized organ that is not finely patterned at the level of gene expression. Quantitative imaging methods have recently shown, however, that notochord cell size, shape and behavior vary consistently along the anterior-posterior (AP) axis. Results Here we screen candidate genes by whole mount in situ hybridization for potential AP asymmetry. We identify 4 genes that show non-uniform expression in the notochord. Ezrin/radixin/moesin (ERM) is expressed more strongly in the secondary notochord lineage than the primary. CTGF is expressed stochastically in a subset of notochord cells. A novel calmodulin-like gene (BCamL) is expressed more strongly at both the anterior and posterior tips of the notochord. A TGF-β ortholog is expressed in a gradient from posterior to anterior. The asymmetries in ERM, BCamL and TGF-β expression are evident even before the notochord cells have intercalated into a single-file column. Conclusions We conclude that the Ciona notochord is not a homogeneous tissue but instead shows distinct patterns of regionalized gene expression. PMID:24288133
Anterior-posterior regionalized gene expression in the Ciona notochord.
Reeves, Wendy; Thayer, Rachel; Veeman, Michael
2014-04-01
In the simple ascidian chordate Ciona, the signaling pathways and gene regulatory networks giving rise to initial notochord induction are largely understood and the mechanisms of notochord morphogenesis are being systematically elucidated. The notochord has generally been thought of as a non-compartmentalized or regionalized organ that is not finely patterned at the level of gene expression. Quantitative imaging methods have recently shown, however, that notochord cell size, shape, and behavior vary consistently along the anterior-posterior (AP) axis. Here we screen candidate genes by whole mount in situ hybridization for potential AP asymmetry. We identify 4 genes that show non-uniform expression in the notochord. Ezrin/radixin/moesin (ERM) is expressed more strongly in the secondary notochord lineage than the primary. CTGF is expressed stochastically in a subset of notochord cells. A novel calmodulin-like gene (BCamL) is expressed more strongly at both the anterior and posterior tips of the notochord. A TGF-β ortholog is expressed in a gradient from posterior to anterior. The asymmetries in ERM, BCamL, and TGF-β expression are evident even before the notochord cells have intercalated into a single-file column. We conclude that the Ciona notochord is not a homogeneous tissue but instead shows distinct patterns of regionalized gene expression. Copyright © 2013 Wiley Periodicals, Inc.
A transcriptional dynamic network during Arabidopsis thaliana pollen development.
Wang, Jigang; Qiu, Xiaojie; Li, Yuhua; Deng, Youping; Shi, Tieliu
2011-01-01
To understand transcriptional regulatory networks (TRNs), especially the coordinated dynamic regulation between transcription factors (TFs) and their corresponding target genes during development, computational approaches would represent significant advances in genome-wide expression analysis. The major experimental challenges include monitoring the time-specific activities of TFs and identifying the dynamic regulatory relationships between TFs and their target genes, neither of which can currently be done at large scale. However, various methods have been proposed to computationally estimate those activities and regulations. During the past decade, significant progress has been made towards understanding pollen development at each developmental stage at the molecular level, yet the regulatory mechanisms that control the dynamic pollen development processes remain largely unknown. Here, we adopt Network Component Analysis (NCA) to identify TF activities over a time course, and infer their regulatory relationships based on the coexpression of TFs and their target genes during pollen development. We carried out a meta-analysis by integrating several sets of gene expression data related to Arabidopsis thaliana pollen development (stages range from UNM, BCP, TCP, HP to 0.5 hr pollen tube and 4 hr pollen tube). We constructed a regulatory network including 19 TFs, 101 target genes and 319 regulatory interactions. The computationally estimated TF activities were well correlated with the expression of their coordinated genes during the development process. We clustered the expression of their target genes in the context of regulatory influences, and inferred new regulatory relationships between those TFs and their target genes; for example, transcription factor WRKY34 was found to be specifically expressed in pollen and to regulate several new target genes.
Our findings facilitate a more biologically relevant interpretation of the expression patterns, since the clusters corresponding to the activity of a specific TF or a combination of TFs suggest the coordinated regulation of target genes by TFs. Through integrating different resources, we constructed a dynamic regulatory network of Arabidopsis thaliana during pollen development with gene coexpression and NCA. The network illustrates the relationships between the TFs' activities and their target genes' expression, as well as the interactions between TFs, which provide new insight into the molecular mechanisms that control pollen development.
ROTH, STEPHEN M.; FERRELL, ROBERT E.; PETERS, DAVID G.; METTER, E. JEFFREY; HURLEY, BEN F.; ROGERS, MARC A.
2010-01-01
The purpose of this study was to determine the influence of age, sex, and strength training (ST) on large-scale gene expression patterns in vastus lateralis muscle biopsies using high-density cDNA microarrays and quantitative PCR. Muscle samples from sedentary young (20–30 yr) and older (65–75 yr) men and women (5 per group) were obtained before and after a 9-wk unilateral heavy resistance ST program. RNA was hybridized to cDNA filter microarrays representing ~4,000 known human genes and comparisons were made among arrays to determine differential gene expression as a result of age and sex differences, and/or response to ST. Sex had the strongest influence on muscle gene expression, with differential expression (>1.7-fold) observed for ~200 genes between men and women (~75% with higher expression in men). Age contributed to differential expression as well, as ~50 genes were identified as differentially expressed (>1.7-fold) in relation to age, representing structural, metabolic, and regulatory gene classes. Sixty-nine genes were identified as being differentially expressed (>1.7-fold) in all groups in response to ST, and the majority of these were downregulated. Quantitative PCR was employed to validate expression levels for caldesmon, SWI/SNF (BAF60b), and four-and-a-half LIM domains 1. These significant differences suggest that in the analysis of skeletal muscle gene expression issues of sex, age, and habitual physical activity must be addressed, with sex being the most critical variable. PMID:12209020
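The >1.7-fold criterion used in the abstract above can be illustrated with a minimal sketch. This is not the authors' pipeline: the function name, the gene labels, and the expression values are all hypothetical, and the real study also involved array normalization and statistical comparison that are not shown here.

```python
def differentially_expressed(expr_a, expr_b, threshold=1.7):
    """Return {gene: fold_change} for genes whose expression ratio between
    two groups exceeds `threshold` in either direction.

    expr_a, expr_b: dicts mapping gene name -> mean expression (linear scale).
    The 1.7 default mirrors the cut-off quoted in the abstract; everything
    else here is an illustrative assumption.
    """
    hits = {}
    for gene in expr_a.keys() & expr_b.keys():
        a, b = expr_a[gene], expr_b[gene]
        if a <= 0 or b <= 0:
            continue  # a ratio is undefined for non-positive intensities
        fold = max(a / b, b / a)  # symmetric: covers up- and down-regulation
        if fold > threshold:
            hits[gene] = fold
    return hits


# Hypothetical mean intensities for two groups (made-up numbers).
men = {"GENE_A": 3.4, "GENE_B": 1.0, "GENE_C": 2.0}
women = {"GENE_A": 1.0, "GENE_B": 1.2, "GENE_C": 4.0}
hits = differentially_expressed(men, women)
```

Taking the larger of the two ratios treats higher expression in either group the same way, which is one simple reading of "differential expression (>1.7-fold)".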
Xie, Xin-Ping; Xie, Yu-Feng; Wang, Hong-Qiang
2017-08-23
Large-scale accumulation of omics data poses a pressing challenge of integrative analysis of multiple data sets in bioinformatics. An open question of such integrative analysis is how to pinpoint consistent but subtle gene activity patterns across studies. Study heterogeneity needs to be addressed carefully for this goal. This paper proposes a regulation probability model-based meta-analysis, jGRP, for identifying differentially expressed genes (DEGs). The method integrates multiple transcriptomics data sets in a gene regulatory space instead of in a gene expression space, which makes it easy to capture and manage data heterogeneity across studies from different laboratories or platforms. Specifically, we transform gene expression profiles into a united gene regulation profile across studies by mathematically defining two gene regulation events between two conditions and estimating their probabilities of occurrence in a sample. Finally, a novel differential expression statistic is established based on the gene regulation profiles, realizing accurate and flexible identification of DEGs in gene regulation space. We evaluated the proposed method on simulation data and real-world cancer datasets and showed the effectiveness and efficiency of jGRP in identifying DEGs in the context of meta-analysis. Data heterogeneity largely influences the performance of meta-analysis for DEG identification. Existing meta-analysis methods were revealed to exhibit very different degrees of sensitivity to study heterogeneity. The proposed method, jGRP, can be a standalone tool due to its united framework and controllable way to deal with study heterogeneity.
Large-scale analysis of gene expression using cDNA microarrays promises the rapid detection of the mode of toxicity for drugs and other chemicals. cDNA microarrays were used to examine chemically-induced alterations of gene expression in HepG2 cells exposed to oxidative ...
Comparative analyses identify molecular signature of MRI-classified SVZ-associated glioblastoma
Lin, Chin-Hsing Annie; Rhodes, Christopher T.; Lin, ChenWei; Phillips, Joanna J.; Berger, Mitchel S.
2017-01-01
Glioblastoma (GBM) is a highly aggressive brain cancer with limited therapeutic options. While efforts to identify genes responsible for GBM have revealed mutations and aberrant gene expression associated with distinct types of GBM, patients with GBM are often diagnosed and classified based on MRI features. Therefore, we seek to identify molecular representatives in parallel with MRI classification for group I and group II primary GBM associated with the subventricular zone (SVZ). As group I and II GBM contain a stem-like signature, we compared gene expression profiles between these 2 groups of primary GBM and endogenous neural stem progenitor cells to reveal dysregulation of cell cycle, chromatin status, cellular morphogenesis, and signaling pathways in these 2 types of MRI-classified GBM. In the absence of IDH mutation, several genes associated with metabolism are differentially expressed in these subtypes of primary GBM, implying that metabolic reprogramming occurs in the tumor microenvironment. Furthermore, histone lysine methyltransferase EZH2 was upregulated while histone lysine demethylases KDM2 and KDM4 were downregulated in both group I and II primary GBM. Lastly, we identified 9 common genes across large data sets of gene expression profiles among MRI-classified group I/II GBM, a large cohort of GBM subtypes from TCGA, and glioma stem cells by unsupervised clustering comparison. These commonly upregulated genes have known functions in cell cycle, centromere assembly, chromosome segregation, and mitotic progression. Our findings highlight altered expression of genes important in chromosome integrity across all GBM, suggesting a common mechanism of disrupted fidelity of chromosome structure in GBM. PMID:28278055
Barling, Adam; Swaminathan, Kankshita; Mitros, Therese; James, Brandon T; Morris, Juliette; Ngamboma, Ornella; Hall, Megan C; Kirkpatrick, Jessica; Alabady, Magdy; Spence, Ashley K; Hudson, Matthew E; Rokhsar, Daniel S; Moose, Stephen P
2013-12-09
The Miscanthus genus of perennial C4 grasses contains promising biofuel crops for temperate climates. However, few genomic resources exist for Miscanthus, which limits understanding of its interesting biology and future genetic improvement. A comprehensive catalog of expressed sequences was generated from a variety of Miscanthus species and tissue types, with an emphasis on characterizing gene expression changes in spring compared to fall rhizomes. Illumina short read sequencing technology was used to produce transcriptome sequences from different tissues and organs during distinct developmental stages for multiple Miscanthus species, including Miscanthus sinensis, Miscanthus sacchariflorus, and their interspecific hybrid Miscanthus × giganteus. More than fifty billion base-pairs of Miscanthus transcript sequence were produced. Overall, 26,230 Sorghum gene models (i.e., ~96% of predicted Sorghum genes) had at least five Miscanthus reads mapped to them, suggesting that a large portion of the Miscanthus transcriptome is represented in this dataset. The Miscanthus × giganteus data were used to identify genes preferentially expressed in a single tissue, such as the spring rhizome, using Sorghum bicolor as a reference. Quantitative real-time PCR was used to verify examples of preferential expression predicted via RNA-Seq. Contiguous consensus transcript sequences were assembled for each species and annotated using InterProScan. Sequences from the assembled transcriptome were used to amplify genomic segments from a doubled haploid Miscanthus sinensis and from Miscanthus × giganteus to further disentangle the allelic and paralogous variations in genes. This large expressed sequence tag collection creates a valuable resource for the study of Miscanthus biology by providing detailed gene sequence information and tissue-preferred expression patterns.
We have successfully generated a database of transcriptome assemblies and demonstrated its use in the study of genes of interest. Analysis of gene expression profiles revealed biological pathways that exhibit altered regulation in spring compared to fall rhizomes, which is consistent with their different physiological functions. The expression profiles of the subterranean rhizome provide a better understanding of the biological activities of the underground stem structures that are essential for perenniality and the storage or remobilization of carbon and nutrient resources.
A high resolution atlas of gene expression in the domestic sheep (Ovis aries)
Farquhar, Iseabail L.; Young, Rachel; Lefevre, Lucas; Pridans, Clare; Tsang, Hiu G.; Afrasiabi, Cyrus; Watson, Mick; Whitelaw, C. Bruce; Freeman, Tom C.; Archibald, Alan L.; Hume, David A.
2017-01-01
Sheep are a key source of meat, milk and fibre for the global livestock sector, and an important biomedical model. Global analysis of gene expression across multiple tissues has aided genome annotation and supported functional annotation of mammalian genes. We present a large-scale RNA-Seq dataset representing all the major organ systems from adult sheep and from several juvenile, neonatal and prenatal developmental time points. The Ovis aries reference genome (Oar v3.1) includes 27,504 genes (20,921 protein coding), of which 25,350 (19,921 protein coding) had detectable expression in at least one tissue in the sheep gene expression atlas dataset. Network-based cluster analysis of this dataset grouped genes according to their expression pattern. The principle of ‘guilt by association’ was used to infer the function of uncharacterised genes from their co-expression with genes of known function. We describe the overall transcriptional signatures present in the sheep gene expression atlas and assign those signatures, where possible, to specific cell populations or pathways. The findings are related to innate immunity by focusing on clusters with an immune signature, and to the advantages of cross-breeding by examining the patterns of genes exhibiting the greatest expression differences between purebred and crossbred animals. This high-resolution gene expression atlas for sheep is, to our knowledge, the largest transcriptomic dataset from any livestock species to date. It provides a resource to improve the annotation of the current reference genome for sheep, presenting a model transcriptome for ruminants and insight into gene, cell and tissue function at multiple developmental stages. PMID:28915238
A high resolution atlas of gene expression in the domestic sheep (Ovis aries).
Clark, Emily L; Bush, Stephen J; McCulloch, Mary E B; Farquhar, Iseabail L; Young, Rachel; Lefevre, Lucas; Pridans, Clare; Tsang, Hiu G; Wu, Chunlei; Afrasiabi, Cyrus; Watson, Mick; Whitelaw, C Bruce; Freeman, Tom C; Summers, Kim M; Archibald, Alan L; Hume, David A
2017-09-01
Sheep are a key source of meat, milk and fibre for the global livestock sector, and an important biomedical model. Global analysis of gene expression across multiple tissues has aided genome annotation and supported functional annotation of mammalian genes. We present a large-scale RNA-Seq dataset representing all the major organ systems from adult sheep and from several juvenile, neonatal and prenatal developmental time points. The Ovis aries reference genome (Oar v3.1) includes 27,504 genes (20,921 protein coding), of which 25,350 (19,921 protein coding) had detectable expression in at least one tissue in the sheep gene expression atlas dataset. Network-based cluster analysis of this dataset grouped genes according to their expression pattern. The principle of 'guilt by association' was used to infer the function of uncharacterised genes from their co-expression with genes of known function. We describe the overall transcriptional signatures present in the sheep gene expression atlas and assign those signatures, where possible, to specific cell populations or pathways. The findings are related to innate immunity by focusing on clusters with an immune signature, and to the advantages of cross-breeding by examining the patterns of genes exhibiting the greatest expression differences between purebred and crossbred animals. This high-resolution gene expression atlas for sheep is, to our knowledge, the largest transcriptomic dataset from any livestock species to date. It provides a resource to improve the annotation of the current reference genome for sheep, presenting a model transcriptome for ruminants and insight into gene, cell and tissue function at multiple developmental stages.
Ryan, Veronica H; Primiani, Christopher T; Rao, Jagadeesh S; Ahn, Kwangmi; Rapoport, Stanley I; Blanchard, Helene
2014-01-01
The polyunsaturated arachidonic and docosahexaenoic acids (AA and DHA) participate in cell membrane synthesis during neurodevelopment, neuroplasticity, and neurotransmission throughout life. Each is metabolized via coupled enzymatic reactions within separate but interacting metabolic cascades. AA and DHA pathway genes are coordinately expressed and underlie cascade interactions during human brain development and aging. The BrainCloud database for human non-pathological prefrontal cortex gene expression was used to quantify postnatal age changes in mRNA expression of 34 genes involved in AA and DHA metabolism. Expression patterns were split into Development (0 to 20 years) and Aging (21 to 78 years) intervals. Expression of genes for cytosolic phospholipases A2 (cPLA2), cyclooxygenases (COX)-1 and -2, and other AA cascade enzymes correlated closely with age during Development, less so during Aging. Expression of DHA cascade enzymes was less inter-correlated in each period, but often changed in the opposite direction to expression of AA cascade genes. Except for the PLA2G4A (cPLA2 IVA) and PTGS2 (COX-2) genes at 1q25, highly inter-correlated genes were at distant chromosomal loci. Coordinated age-related gene expression during the brain Development and Aging intervals likely underlies coupled changes in enzymes of the AA and DHA cascades and largely occurs through distant transcriptional regulation. Healthy brain aging does not show upregulation of PLA2G4 or PTGS2 expression, which was found in Alzheimer's disease.
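The interval-wise analysis described in the abstract above (expression correlated with age separately for Development, 0 to 20 years, and Aging, 21 to 78 years) can be sketched as follows. This is only an illustration under stated assumptions: the function names and the data are hypothetical, Pearson correlation is assumed as the statistic (the abstract does not name one), and nothing here queries the BrainCloud database.

```python
from math import sqrt


def pearson(xs, ys):
    """Plain Pearson correlation coefficient for two equal-length sequences."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sd_x = sqrt(sum((x - mean_x) ** 2 for x in xs))
    sd_y = sqrt(sum((y - mean_y) ** 2 for y in ys))
    return cov / (sd_x * sd_y)


def correlation_by_interval(ages, expression, split_age=20):
    """Correlate expression with age separately in the two age intervals.

    Samples with age <= split_age fall in 'Development', the rest in 'Aging',
    matching the 0-20 / 21-78 year split quoted in the abstract.
    """
    pairs = list(zip(ages, expression))
    development = [(a, e) for a, e in pairs if a <= split_age]
    aging = [(a, e) for a, e in pairs if a > split_age]
    return {
        "Development": pearson(*zip(*development)),
        "Aging": pearson(*zip(*aging)),
    }


# Made-up values for one hypothetical gene: rising in youth, flat afterwards.
ages = [1, 5, 10, 15, 20, 30, 45, 60, 78]
expression = [1.0, 2.0, 3.1, 3.9, 5.0, 5.0, 5.1, 4.9, 5.0]
r = correlation_by_interval(ages, expression)
```

With these invented numbers the Development coefficient is close to 1 and the Aging coefficient is weak, which is the qualitative pattern the abstract reports for AA cascade genes.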
Filling gaps in PPAR-alpha signaling through comparative nutrigenomics analysis.
Cavalieri, Duccio; Calura, Enrica; Romualdi, Chiara; Marchi, Emmanuela; Radonjic, Marijana; Van Ommen, Ben; Müller, Michael
2009-12-11
The application of high-throughput genomic tools in nutrition research is a widespread practice. However, it is becoming increasingly clear that the outcome of individual expression studies is insufficient for the comprehensive understanding of such a complex field. Currently, the availability of large amounts of expression data in public repositories has opened up new challenges for microarray data analysis. We have focused on PPARalpha, a ligand-activated transcription factor that functions as a fatty acid sensor and controls the expression of a large set of genes in various metabolic organs such as liver, small intestine or heart. The function of PPARalpha is strictly connected to the function of its target genes and, although many of these have already been identified, major elements of its physiological function remain to be uncovered. To further investigate the function of PPARalpha, we have applied a cross-species meta-analysis approach to integrate sixteen microarray datasets studying high fat diet and PPARalpha signal perturbations in different organisms. We identified 164 genes (MDEGs) that were consistently differentially expressed in response to a high fat diet or to perturbations in PPAR signalling. In particular, we found five genes in yeast which were highly conserved and homologous to PPARalpha targets in mammals, potential candidates to be used as models for the equivalent mammalian genes. Moreover, a screening of the MDEGs for all known transcription factor binding sites, and the comparison with a human genome-wide screening of Peroxisome Proliferating Response Elements (PPRE), enabled us to identify 20 new potential candidate genes that show both a binding site and a change in expression in the condition studied. Lastly, we found a non-random localization of the differentially expressed genes in the genome.
The results presented are potentially of great interest for summarizing the currently available expression data, exploiting the power of in silico analysis filtered by evolutionary conservation. The analysis enabled us to indicate potential gene candidates that could fill in the gaps with regard to the signalling of PPARalpha and, moreover, the non-random localization of the differentially expressed genes in the genome suggests that epigenetic mechanisms are of importance in the regulation of the transcription operated by PPARalpha.
Fowl adenovirus serotype 9 vectored vaccine for protection of avian influenza virus
USDA-ARS's Scientific Manuscript database
A fowl adenovirus serotype 9, a non-pathogenic large double stranded DNA virus, was developed as a viral vector to express influenza genes as a potential vaccine. Two separate constructs were developed that expressed either the hemagglutinin gene of A/Chicken/Jalisco/2012 (H7) or A/ Chicken/Iowa/20...
USDA-ARS's Scientific Manuscript database
Salinity is a major environmental stress that affects agricultural productivity worldwide. One approach to improving salt tolerance in crops is through high expression of the Arabidopsis gene AtNHX1, which encodes a vacuolar sodium/proton antiporter that sequesters excess sodium ion into the large i...
Rare Cell Detection by Single-Cell RNA Sequencing as Guided by Single-Molecule RNA FISH.
Torre, Eduardo; Dueck, Hannah; Shaffer, Sydney; Gospocic, Janko; Gupte, Rohit; Bonasio, Roberto; Kim, Junhyong; Murray, John; Raj, Arjun
2018-02-28
Although single-cell RNA sequencing can reliably detect large-scale transcriptional programs, it is unclear whether it accurately captures the behavior of individual genes, especially those that express only in rare cells. Here, we use single-molecule RNA fluorescence in situ hybridization as a gold standard to assess trade-offs in single-cell RNA-sequencing data for detecting rare cell expression variability. We quantified the gene expression distribution for 26 genes that range from ubiquitous to rarely expressed and found that the correspondence between estimates across platforms improved with both transcriptome coverage and increased number of cells analyzed. Further, by characterizing the trade-off between transcriptome coverage and number of cells analyzed, we show that when the number of genes required to answer a given biological question is small, then greater transcriptome coverage is more important than analyzing large numbers of cells. More generally, our report provides guidelines for selecting quality thresholds for single-cell RNA-sequencing experiments aimed at rare cell analyses. Copyright © 2018 Elsevier Inc. All rights reserved.
Che, Ping; Love, Tanzy M; Frame, Bronwyn R; Wang, Kan; Carriquiry, Alicia L; Howell, Stephen H
2006-09-01
Gene expression patterns were profiled during somatic embryogenesis in a regeneration-proficient maize hybrid line, Hi II, in an effort to identify genes that might be used as developmental markers or targets to optimize regeneration steps for recovering maize plants from tissue culture. Gene expression profiles were generated from embryogenic calli induced to undergo embryo maturation and germination. Over 1,000 genes in the 12,060 element arrays showed significant time variation during somatic embryo development. A substantial number of genes were downregulated during embryo maturation, largely histone and ribosomal protein genes, which may result from a slowdown in cell proliferation and growth during embryo maturation. The expression of these genes dramatically recovered at germination. Other genes up-regulated during embryo maturation included genes encoding hydrolytic enzymes (nucleases, glucosidases and proteases) and a few storage genes (an alpha-zein and caleosin), which are good candidates for developmental marker genes. Germination is accompanied by the up-regulation of a number of stress response and membrane transporter genes, and, as expected, greening is associated with the up-regulation of many genes encoding photosynthetic and chloroplast components. Thus, some, but not all genes typically associated with zygotic embryogenesis are significantly up or down-regulated during somatic embryogenesis in Hi II maize line regeneration. Although many genes varied in expression throughout somatic embryo development in this study, no statistically significant gene expression changes were detected between total embryogenic callus and callus enriched for transition stage somatic embryos.
Automated Protocol for Large-Scale Modeling of Gene Expression Data.
Hall, Michelle Lynn; Calkins, David; Sherman, Woody
2016-11-28
With the continued rise of phenotypic- and genotypic-based screening projects, computational methods to analyze, process, and ultimately make predictions in this field take on growing importance. Here we show how automated machine learning workflows can produce models that are predictive of differential gene expression as a function of a compound structure using data from A673 cells as a proof of principle. In particular, we present predictive models with an average accuracy of greater than 70% across a highly diverse ∼1000 gene expression profile. In contrast to the usual in silico design paradigm, where one interrogates a particular target-based response, this work opens the opportunity for virtual screening and lead optimization for desired multitarget gene expression profiles.
Olfactory gene expression in migrating adult sockeye salmon Oncorhynchus nerka.
Bett, N N; Hinch, S G; Kaukinen, K H; Li, S; Miller, K M
2018-04-16
Expression of 12 olfactory genes was analysed in adult sockeye salmon Oncorhynchus nerka nearing spawning grounds and O. nerka that had strayed from their natal migration route. Variation was found in six of these genes, all of which were olfc olfactory receptors and had lower expression levels in salmon nearing spawning grounds. The results may reflect decreased sensitivity to natal water olfactory cues as these fish are no longer seeking the correct migratory route. The expression of olfactory genes during the olfactory-mediated spawning migration of Pacific salmon Oncorhynchus spp. is largely unexplored and these findings demonstrate a link between migratory behaviours and olfactory plasticity that provides a basis for future molecular research on salmon homing. © 2018 The Fisheries Society of the British Isles.
Reyes-Velasco, Jacobo; Card, Daren C; Andrew, Audra L; Shaney, Kyle J; Adams, Richard H; Schield, Drew R; Casewell, Nicholas R; Mackessy, Stephen P; Castoe, Todd A
2015-01-01
Snake venom gene evolution has been studied intensively over the past several decades, yet most previous studies have lacked the context of complete snake genomes and the full context of gene expression across diverse snake tissues. We took a novel approach to studying snake venom evolution by leveraging the complete genome of the Burmese python, including information from tissue-specific patterns of gene expression. We identified the orthologs of snake venom genes in the python genome, and conducted detailed analysis of gene expression of these venom homologs to identify patterns that differ between snake venom gene families and all other genes. We found that venom gene homologs in the python are expressed in many different tissues outside of oral glands, which illustrates the pitfalls of using transcriptomic data alone to define "venom toxins." We hypothesize that the python may represent an ancestral state prior to major venom development, which is supported by our finding that the expansion of venom gene families is largely restricted to highly venomous caenophidian snakes. Therefore, the python provides insight into biases in which genes were recruited for snake venom systems. Python venom homologs are generally expressed at lower levels, have higher variance among tissues, and are expressed in fewer organs compared with all other python genes. We propose a model for the evolution of snake venoms in which venom genes are recruited preferentially from genes with particular expression profile characteristics, which facilitate a nearly neutral transition toward specialized venom system expression. © The Author 2014. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Extraction and analysis of signatures from the Gene Expression Omnibus by the crowd
Wang, Zichen; Monteiro, Caroline D.; Jagodnik, Kathleen M.; Fernandez, Nicolas F.; Gundersen, Gregory W.; Rouillard, Andrew D.; Jenkins, Sherry L.; Feldmann, Axel S.; Hu, Kevin S.; McDermott, Michael G.; Duan, Qiaonan; Clark, Neil R.; Jones, Matthew R.; Kou, Yan; Goff, Troy; Woodland, Holly; Amaral, Fabio M R.; Szeto, Gregory L.; Fuchs, Oliver; Schüssler-Fiorenza Rose, Sophia M.; Sharma, Shvetank; Schwartz, Uwe; Bausela, Xabier Bengoetxea; Szymkiewicz, Maciej; Maroulis, Vasileios; Salykin, Anton; Barra, Carolina M.; Kruth, Candice D.; Bongio, Nicholas J.; Mathur, Vaibhav; Todoric, Radmila D; Rubin, Udi E.; Malatras, Apostolos; Fulp, Carl T.; Galindo, John A.; Motiejunaite, Ruta; Jüschke, Christoph; Dishuck, Philip C.; Lahl, Katharina; Jafari, Mohieddin; Aibar, Sara; Zaravinos, Apostolos; Steenhuizen, Linda H.; Allison, Lindsey R.; Gamallo, Pablo; de Andres Segura, Fernando; Dae Devlin, Tyler; Pérez-García, Vicente; Ma'ayan, Avi
2016-01-01
Gene expression data are accumulating exponentially in public repositories. Reanalysis and integration of themed collections from these studies may provide new insights, but requires further human curation. Here we report a crowdsourcing project to annotate and reanalyse a large number of gene expression profiles from Gene Expression Omnibus (GEO). Through a massive open online course on Coursera, over 70 participants from over 25 countries identify and annotate 2,460 single-gene perturbation signatures, 839 disease versus normal signatures, and 906 drug perturbation signatures. All these signatures are unique and are manually validated for quality. Global analysis of these signatures confirms known associations and identifies novel associations between genes, diseases and drugs. The manually curated signatures are used as a training set to develop classifiers for extracting similar signatures from the entire GEO repository. We develop a web portal to serve these signatures for query, download and visualization. PMID:27667448
Extraction and analysis of signatures from the Gene Expression Omnibus by the crowd.
Wang, Zichen; Monteiro, Caroline D; Jagodnik, Kathleen M; Fernandez, Nicolas F; Gundersen, Gregory W; Rouillard, Andrew D; Jenkins, Sherry L; Feldmann, Axel S; Hu, Kevin S; McDermott, Michael G; Duan, Qiaonan; Clark, Neil R; Jones, Matthew R; Kou, Yan; Goff, Troy; Woodland, Holly; Amaral, Fabio M R; Szeto, Gregory L; Fuchs, Oliver; Schüssler-Fiorenza Rose, Sophia M; Sharma, Shvetank; Schwartz, Uwe; Bausela, Xabier Bengoetxea; Szymkiewicz, Maciej; Maroulis, Vasileios; Salykin, Anton; Barra, Carolina M; Kruth, Candice D; Bongio, Nicholas J; Mathur, Vaibhav; Todoric, Radmila D; Rubin, Udi E; Malatras, Apostolos; Fulp, Carl T; Galindo, John A; Motiejunaite, Ruta; Jüschke, Christoph; Dishuck, Philip C; Lahl, Katharina; Jafari, Mohieddin; Aibar, Sara; Zaravinos, Apostolos; Steenhuizen, Linda H; Allison, Lindsey R; Gamallo, Pablo; de Andres Segura, Fernando; Dae Devlin, Tyler; Pérez-García, Vicente; Ma'ayan, Avi
2016-09-26
Gene expression data are accumulating exponentially in public repositories. Reanalysis and integration of themed collections from these studies may provide new insights, but requires further human curation. Here we report a crowdsourcing project to annotate and reanalyse a large number of gene expression profiles from Gene Expression Omnibus (GEO). Through a massive open online course on Coursera, over 70 participants from over 25 countries identify and annotate 2,460 single-gene perturbation signatures, 839 disease versus normal signatures, and 906 drug perturbation signatures. All these signatures are unique and are manually validated for quality. Global analysis of these signatures confirms known associations and identifies novel associations between genes, diseases and drugs. The manually curated signatures are used as a training set to develop classifiers for extracting similar signatures from the entire GEO repository. We develop a web portal to serve these signatures for query, download and visualization.
oPOSSUM: identification of over-represented transcription factor binding sites in co-expressed genes
Ho Sui, Shannan J.; Mortimer, James R.; Arenillas, David J.; Brumm, Jochen; Walsh, Christopher J.; Kennedy, Brian P.; Wasserman, Wyeth W.
2005-01-01
Targeted transcript profiling studies can identify sets of co-expressed genes; however, identification of the underlying functional mechanism(s) is a significant challenge. Established methods for the analysis of gene annotations, particularly those based on the Gene Ontology, can identify functional linkages between genes. Similar methods for the identification of over-represented transcription factor binding sites (TFBSs) have been successful in yeast, but extension to human genomics has largely proved ineffective. Creation of a system for the efficient identification of common regulatory mechanisms in a subset of co-expressed human genes promises to break a roadblock in functional genomics research. We have developed an integrated system that searches for evidence of co-regulation by one or more transcription factors (TFs). oPOSSUM combines a pre-computed database of conserved TFBSs in human and mouse promoters with statistical methods for identification of sites over-represented in a set of co-expressed genes. The algorithm successfully identified mediating TFs in control sets of tissue-specific genes and in sets of co-expressed genes from three transcript profiling studies. Simulation studies indicate that oPOSSUM produces few false positives using empirically defined thresholds and can tolerate up to 50% noise in a set of co-expressed genes. PMID:15933209
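The over-representation idea in the abstract above can be illustrated with a one-sided hypergeometric test. This is a minimal sketch, not oPOSSUM's own code: the abstract does not say which statistics the tool uses, so the choice of test, the function name `hypergeom_enrichment_p`, and all counts below are assumptions made for illustration.

```python
from math import comb


def hypergeom_enrichment_p(hits_in_set, set_size, hits_in_background, background_size):
    """Probability of seeing at least `hits_in_set` genes carrying a given
    binding site when `set_size` genes are drawn without replacement from a
    background of `background_size` genes, of which `hits_in_background`
    carry the site. Illustrative only; not taken from any published tool."""
    total = comb(background_size, set_size)
    upper = min(set_size, hits_in_background)
    tail = sum(
        comb(hits_in_background, k)
        * comb(background_size - hits_in_background, set_size - k)
        for k in range(hits_in_set, upper + 1)
    )
    return tail / total


# Hypothetical counts: 12 of 40 co-expressed genes carry the site,
# against 900 of 15000 genes in the background.
p_value = hypergeom_enrichment_p(12, 40, 900, 15000)
```

A small p-value here would suggest the site is more frequent in the co-expressed set than expected by chance; in practice the p-values for many sites would also need multiple-testing correction.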
Xoca-Orozco, Luis-Ángel; Cuellar-Torres, Esther Angélica; González-Morales, Sandra; Gutiérrez-Martínez, Porfirio; López-García, Ulises; Herrera-Estrella, Luis; Vega-Arreguín, Julio; Chacón-López, Alejandra
2017-01-01
Avocado (Persea americana) is one of the most important crops in Mexico, which is the main producer, consumer, and exporter of avocado fruit in the world. However, successful avocado commercialization is often reduced by large postharvest losses due to Colletotrichum sp., the causal agent of anthracnose. Chitosan is known to have a direct antifungal effect and also acts as an elicitor capable of stimulating a defense response in plants. However, there is little information regarding the genes that are either activated or repressed in fruits treated with chitosan. The aim of this study was to identify by RNA-seq the genes differentially regulated by the action of low molecular weight chitosan in the avocado-chitosan-Colletotrichum interaction system. The samples for RNA-seq were obtained from fruits treated with chitosan, fruits inoculated with Colletotrichum and fruits both treated with chitosan and inoculated with the fungus. Non-treated and non-inoculated fruits were also analyzed. Expression profiles showed that at early time points, the fruit-chitosan system presented a greater number of differentially expressed genes than the fruit-pathogen system. Gene Ontology analysis of differentially expressed genes showed a large number of metabolic processes regulated by chitosan, including those preventing the spread of Colletotrichum. It was also found that there is a high correlation between in silico gene expression estimates and qPCR results for several genes involved in different metabolic pathways.
Gene expression profile of human Down syndrome leukocytes.
Malagó, Wilson; Sommer, César A; Del Cistia Andrade, Camillo; Soares-Costa, Andrea; Abrao Possik, Patricia; Cassago, Alexandre; Santejo Silveira, Henrique C; Henrique-Silva, Flavio
2005-08-01
Identification of differences in the gene expression patterns of Down syndrome and normal leukocytes. We constructed the first Down syndrome leukocyte serial analysis of gene expression (SAGE) library from a 28-year-old patient. This library was analyzed and compared with a normal leukocyte SAGE library using the eSAGE software. Reverse transcriptase polymerase chain reaction (RT-PCR) was used to validate the results. We found that a large number of unidentified transcripts were overexpressed in Down syndrome leukocytes and some transcripts coding for growth factors (e.g., interleukin 8, IL-8), ribosomal proteins (e.g., L13a, L29, and L37), and transcription factors (e.g., Jun B, Jun D, and C/EBP beta) were underexpressed. The SAGE data were successfully validated for the genes IL-8, CXCR4, BCL2A1, L13a, L29, L37, and GTF3A using RT-PCR. Our analysis identified significant changes in the expression pattern of Down syndrome leukocytes compared with normal ones, including key regulators of growth and proliferation, ribosomal proteins, and a large number of overexpressed transcripts that were not matched in UniGene clusters and that may represent novel genes related to Down syndrome. This study offers a new insight into transcriptional changes in Down syndrome leukocytes and indicates candidate genes for further investigations into the molecular mechanism of Down syndrome pathology.
Promoting gene expression in plants by permissive histone lysine methylation
Millar, Tony; Finnegan, E Jean
2009-01-01
Plants utilize sophisticated epigenetic regulatory mechanisms to coordinate changes in gene expression during development and in response to environmental stimuli. Epigenetics refers to modifications of DNA and chromatin-associated proteins that affect gene expression and cell function without changing the DNA sequence. Such modifications are inherited through mitosis, and in rare instances through meiosis, although they can be reversible and thus regulatory. Epigenetic modifications are controlled by groups of proteins, such as the family of histone lysine methyltransferases (HKMTs). The catalytic core known as the SET domain encodes HKMT activity and either promotes or represses gene expression. A large family of SET domain proteins is present in Arabidopsis where there is growing evidence that two classes of these genes are involved in promoting gene expression in a diverse range of developmental processes. This review will focus on the function of these two classes and the processes that they control, highlighting the huge potential this regulatory mechanism has in plants. PMID:19816124
Ali, Shahin S.; Shao, Jonathan; Lary, David J.; Strem, Mary D.; Meinhardt, Lyndel W.; Bailey, Bryan A.
2017-01-01
Phytophthora megakarya (Pmeg) and Phytophthora palmivora (Ppal) cause black pod rot of Theobroma cacao L. (cacao). Of these two clade 4 species, Pmeg is more virulent and is displacing Ppal in many cacao production areas in Africa. Symptoms and species specific sporangia production were compared when the two species were co-inoculated onto pod pieces in staggered 24 h time intervals. Pmeg sporangia were predominantly recovered from pod pieces with unwounded surfaces even when inoculated 24 h after Ppal. On wounded surfaces, sporangia of Ppal were predominantly recovered if the two species were simultaneously applied or Ppal was applied first but not if Pmeg was applied first. Pmeg demonstrated an advantage over Ppal when infecting un-wounded surfaces while Ppal had the advantage when infecting wounded surfaces. RNA-Seq was carried out on RNA isolated from control and Pmeg and Ppal infected pod pieces 3 days post inoculation to assess their abilities to alter/suppress cacao defense. Expression of 4,482 and 5,264 cacao genes was altered after Pmeg and Ppal infection, respectively, with most genes responding to both species. Neural network self-organizing map analyses separated the cacao RNA-Seq gene expression profiles into 24 classes, 6 of which were largely induced in response to infection. Using KEGG analysis, subsets of genes composing interrelated pathways leading to phenylpropanoid biosynthesis, ethylene and jasmonic acid biosynthesis and action, plant defense signal transduction, and endocytosis showed induction in response to infection. A large subset of genes encoding putative Pr-proteins also showed differential expression in response to infection. A subset of 36 cacao genes was used to validate the RNA-Seq expression data and compare infection induced gene expression patterns in leaves and wounded and unwounded pod husks. Expression patterns between RNA-Seq and RT-qPCR were generally reproducible. 
The level and timing of altered gene expression were influenced by the tissues studied and by wounding. Although gene expression patterns were similar in these susceptible interactions, some genes did show differential expression in a Phytophthora species-dependent manner. The biggest difference was the more intense changes in expression in Ppal-inoculated wounded pod pieces, further demonstrating its rapid progression when penetrating through wounds. PMID:28261234
Integrative analysis of RUNX1 downstream pathways and target genes
Michaud, Joëlle; Simpson, Ken M; Escher, Robert; Buchet-Poyau, Karine; Beissbarth, Tim; Carmichael, Catherine; Ritchie, Matthew E; Schütz, Frédéric; Cannon, Ping; Liu, Marjorie; Shen, Xiaofeng; Ito, Yoshiaki; Raskind, Wendy H; Horwitz, Marshall S; Osato, Motomi; Turner, David R; Speed, Terence P; Kavallaris, Maria; Smyth, Gordon K; Scott, Hamish S
2008-01-01
Background: The RUNX1 transcription factor gene is frequently mutated in sporadic myeloid and lymphoid leukemia through translocation, point mutation or amplification. It is also responsible for a familial platelet disorder with predisposition to acute myeloid leukemia (FPD-AML). The disruption of the largely unknown biological pathways controlled by RUNX1 is likely to be responsible for the development of leukemia. We have used multiple microarray platforms and bioinformatic techniques to help identify these biological pathways to aid in the understanding of why RUNX1 mutations lead to leukemia. Results: Here we report genes regulated either directly or indirectly by RUNX1 based on the study of gene expression profiles generated from 3 different human and mouse platforms. The platforms used were global gene expression profiling of: 1) cell lines with RUNX1 mutations from FPD-AML patients, 2) over-expression of RUNX1 and CBFβ, and 3) Runx1 knockout mouse embryos using either cDNA or Affymetrix microarrays. We observe that our datasets (lists of differentially expressed genes) significantly correlate with published microarray data from sporadic AML patients with mutations in either RUNX1 or its cofactor, CBFβ. A number of biological processes were identified among the differentially expressed genes and functional assays suggest that heterozygous RUNX1 point mutations in patients with FPD-AML impair cell proliferation, microtubule dynamics and possibly genetic stability. In addition, analysis of the regulatory regions of the differentially expressed genes has for the first time systematically identified numerous potential novel RUNX1 target genes. Conclusion: This work is the first large-scale study attempting to identify the genetic networks regulated by RUNX1, a master regulator in the development of the hematopoietic system and leukemia.
The biological pathways and target genes controlled by RUNX1 will have considerable importance in disease progression in both familial and sporadic leukemia as well as therapeutic implications. PMID:18671852
The Effect of Gestational Age on Angiogenic Gene Expression in the Rat Placenta
Vaswani, Kanchan; Hum, Melissa Wen-Ching; Chan, Hsiu-Wen; Ryan, Jennifer; Wood-Bradley, Ryan J.; Nitert, Marloes Dekker; Mitchell, Murray D.; Armitage, James A.; Rice, Gregory E.
2013-01-01
The placenta plays a central role in determining the outcome of pregnancy. It undergoes changes during gestation as the fetus develops and as demands for energy substrate transfer and gas exchange increase. The molecular mechanisms that coordinate these changes have yet to be fully elucidated. The study performed a large scale screen of the transcriptome of the rat placenta throughout mid-late gestation (E14.25–E20) with emphasis on characterizing gestational age associated changes in the expression of genes involved in angiogenic pathways. Sprague Dawley dams were sacrificed at E14.25, E15.25, E17.25 and E20 (n = 6 per group) and RNA was isolated from one placenta per dam. Changes in placental gene expression were identified using Illumina Rat Ref-12 Expression BeadChip Microarrays. Differentially expressed genes (>2-fold change, <1% false discovery rate, FDR) were functionally categorised by gene ontology pathway analysis. A subset of differentially expressed genes identified by microarrays was confirmed using Real-Time qPCR. The expression of thirty one genes involved in the angiogenic pathway was shown to change over time, using microarray analysis (22 genes displayed increased and 9 genes decreased expression). Five genes (4 up regulated: Cd36, Mmp14, Rhob and Angpt4 and 1 down regulated: Foxm1) involved in angiogenesis and blood vessel morphogenesis were subjected to further validation. qPCR confirmed late gestational increased expression of Cd36, Mmp14, Rhob and Angpt4 and a decrease in expression of Foxm1 before labour onset (P<0.0001). The observed acute, pre-labour changes in the expression of the 31 genes during gestation warrant further investigation to elucidate their role in pregnancy. PMID:24391823
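The selection rule quoted in the abstract above (>2-fold change, <1% FDR) can be sketched as follows. The abstract does not name the FDR procedure, so the Benjamini-Hochberg adjustment used here is an assumption, and the function names and example values are hypothetical.

```python
def benjamini_hochberg(p_values):
    """Return Benjamini-Hochberg adjusted p-values in the original order."""
    m = len(p_values)
    order = sorted(range(m), key=lambda i: p_values[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest p-value down so each adjusted value is
    # no larger than the ones ranked above it.
    for rank in range(m, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, p_values[i] * m / rank)
        adjusted[i] = running_min
    return adjusted


def select_de_genes(genes, fold_changes, p_values, min_fold=2.0, max_fdr=0.01):
    """Keep genes whose expression ratio changes more than `min_fold` in
    either direction and whose adjusted p-value is below `max_fdr`."""
    q_values = benjamini_hochberg(p_values)
    return [
        gene
        for gene, ratio, q in zip(genes, fold_changes, q_values)
        if (ratio > min_fold or ratio < 1.0 / min_fold) and q < max_fdr
    ]


# Hypothetical example: fold changes are linear ratios (late vs. early gestation).
selected = select_de_genes(
    ["a", "b", "c", "d"],
    [3.0, 1.5, 0.4, 2.5],
    [0.0001, 0.0001, 0.0002, 0.5],
)
```

Fold changes are treated as linear ratios, so a ratio below 0.5 counts as a more than 2-fold decrease; with log2-transformed data the threshold would instead be an absolute value above 1.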
Optimal consistency in microRNA expression analysis using reference-gene-based normalization.
Wang, Xi; Gardiner, Erin J; Cairns, Murray J
2015-05-01
Normalization of high-throughput molecular expression profiles secures differential expression analysis between samples of different phenotypes or biological conditions, and facilitates comparison between experimental batches. While the same general principles apply to microRNA (miRNA) normalization, there is mounting evidence that global shifts in their expression patterns occur in specific circumstances, which pose a challenge for normalizing miRNA expression data. As an alternative to global normalization, which has the propensity to flatten large trends, normalization against constitutively expressed reference genes presents an advantage through their relative independence. Here we investigated the performance of reference-gene-based (RGB) normalization for differential miRNA expression analysis of microarray expression data, and compared the results with other normalization methods, including: quantile, variance stabilization, robust spline, simple scaling, rank invariant, and Loess regression. The comparative analyses were executed using miRNA expression in tissue samples derived from subjects with schizophrenia and non-psychiatric controls. We proposed a consistency criterion for evaluating methods by examining the overlapping of differentially expressed miRNAs detected using different partitions of the whole data. Based on this criterion, we found that RGB normalization generally outperformed global normalization methods. Thus we recommend the application of RGB normalization for miRNA expression data sets, and believe that this will yield a more consistent and useful readout of differentially expressed miRNAs, particularly in biological conditions characterized by large shifts in miRNA expression.
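The two ideas in the abstract above, normalizing against reference genes and scoring consistency by the overlap of differentially expressed miRNAs across data partitions, can be sketched roughly as below. The abstract gives neither the exact normalization formula nor the overlap measure, so subtracting the per-sample mean of the reference features on a log2 scale and using the Jaccard index are both assumptions; the names `rgb_normalize` and `jaccard_overlap` and the sample data are hypothetical.

```python
def rgb_normalize(log2_expr, reference_ids):
    """Subtract, per sample, the mean log2 signal of the reference features.

    `log2_expr` maps sample name -> {feature name: log2 intensity}.
    """
    normalized = {}
    for sample, values in log2_expr.items():
        offset = sum(values[ref] for ref in reference_ids) / len(reference_ids)
        normalized[sample] = {name: v - offset for name, v in values.items()}
    return normalized


def jaccard_overlap(list_a, list_b):
    """Share of features found in both lists among those found in either."""
    a, b = set(list_a), set(list_b)
    union = a | b
    return len(a & b) / len(union) if union else 1.0


# Hypothetical two-sample data set with two reference features.
raw = {
    "s1": {"miR-1": 10.0, "ref1": 8.0, "ref2": 6.0},
    "s2": {"miR-1": 12.0, "ref1": 10.0, "ref2": 8.0},
}
norm = rgb_normalize(raw, ["ref1", "ref2"])

# Overlap between differentially expressed lists from two data partitions.
consistency = jaccard_overlap(["miR-1", "miR-2", "miR-3"], ["miR-2", "miR-3", "miR-4"])
```

In this toy data the second sample is uniformly 2 log2 units brighter, so after normalization miR-1 has the same value in both samples; a higher overlap score between partitions would indicate a more consistent method under this reading of the criterion.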
Quiapim, Andréa C.; Brito, Michael S.; Bernardes, Luciano A.S.; daSilva, Idalete; Malavazi, Iran; DePaoli, Henrique C.; Molfetta-Machado, Jeanne B.; Giuliatti, Silvana; Goldman, Gustavo H.; Goldman, Maria Helena S.
2009-01-01
The success of plant reproduction depends on pollen-pistil interactions occurring at the stigma/style. These interactions vary depending on the stigma type: wet or dry. Tobacco (Nicotiana tabacum) represents a model of wet stigma, and its stigmas/styles express genes to accomplish the appropriate functions. For a large-scale study of gene expression during tobacco pistil development and preparation for pollination, we generated 11,216 high-quality expressed sequence tags (ESTs) from stigmas/styles and created the TOBEST database. These ESTs were assembled in 6,177 clusters, from which 52.1% are pistil transcripts/genes of unknown function. The 21 clusters with the highest number of ESTs (putative higher expression levels) correspond to genes associated with defense mechanisms or pollen-pistil interactions. The database analysis unraveled tobacco sequences homologous to the Arabidopsis (Arabidopsis thaliana) genes involved in specifying pistil identity or determining normal pistil morphology and function. Additionally, 782 independent clusters were examined by macroarray, revealing 46 stigma/style preferentially expressed genes. Real-time reverse transcription-polymerase chain reaction experiments validated the pistil-preferential expression for nine out of 10 genes tested. A search for these 46 genes in the Arabidopsis pistil data sets demonstrated that only 11 sequences, with putative equivalent molecular functions, are expressed in this dry stigma species. The reverse search for the Arabidopsis pistil genes in the TOBEST exposed a partial overlap between these dry and wet stigma transcriptomes. The TOBEST represents the most extensive survey of gene expression in the stigmas/styles of wet stigma plants, and our results indicate that wet and dry stigmas/styles express common as well as distinct genes in preparation for the pollination process. PMID:19052150
Rode, Tone Mari; Berget, Ingunn; Langsrud, Solveig; Møretrø, Trond; Holck, Askild
2009-07-01
Microorganisms are constantly exposed to new and altered growth conditions, and respond by changing gene expression patterns. Several methods for studying gene expression exist. During the last decade, the analysis of microarrays has been one of the most common approaches applied for large scale gene expression studies. A relatively new method for gene expression analysis is MassARRAY, which combines real competitive-PCR and MALDI-TOF (matrix-assisted laser desorption/ionization time-of-flight) mass spectrometry. In contrast to microarray methods, MassARRAY technology is suitable for analysing a larger number of samples, though for a smaller set of genes. In this study we compare the results from MassARRAY with microarrays on gene expression responses of Staphylococcus aureus exposed to acid stress at pH 4.5. RNA isolated from the same stress experiments was analysed using both the MassARRAY and the microarray methods. The MassARRAY and microarray methods showed good correlation. Both MassARRAY and microarray estimated somewhat lower fold changes compared with quantitative real-time PCR (qRT-PCR). The results confirmed the up-regulation of the urease genes in acidic environments, and also indicated the importance of metal ion regulation. This study shows that the MassARRAY technology is suitable for gene expression analysis in prokaryotes, and has advantages when a set of genes is being analysed for an organism exposed to many different environmental conditions.
Yu, R-L; Liu, A; Liu, Y; Yu, Z; Peng, T; Wu, X; Shen, L; Liu, Y; Li, J; Liu, X; Qiu, G; Chen, M; Zeng, W
2017-06-01
To explore the distribution pattern of alginate on the chalcopyrite concentrate surface during bioleaching. The evolution of alginate secretion by Sulfobacillus thermosulfidooxidans during bioleaching of chalcopyrite concentrate was investigated through gas chromatography coupled with mass spectrometry (GC-MS) and confocal laser scanning microscope (CLSM), and the critical synthetic genes (algA, algC, algD) of alginate were analysed by real-time polymerase chain reaction (RT-PCR). The GC-MS analysis results indicated that only a small amount of alginate formed on the mineral surface at the early stage; the amount increased sharply to its maximum at the intermediate stage and then remained stable at the end stage. The CLSM analysis of chalcopyrite slice showed the same variation trend of alginate content on the mineral surface. Furthermore, the RT-PCR results showed that during the early stage of bioleaching, the algA, algC and algD genes were all overexpressed. However, at the final stage, algD gene expression decreased substantially, and algA and algC expression decreased slightly. This expression pattern was attributed to the fact that the algA and algC genes were involved in several biosynthesis reactions, whereas the algD gene only participated in alginate biosynthesis and was considered the key gene controlling alginate synthesis. The content of alginate on the mineral surface increased substantially at the beginning of bioleaching, and remained stable at the end of bioleaching due to the restriction of algD gene expression. Our findings provide valuable information to explore the relationship between alginate formation and bioleaching of chalcopyrite. © 2017 The Society for Applied Microbiology.
Silk gene expression of theridiid spiders: implications for male-specific silk use.
Correa-Garhwal, Sandra M; Chaw, R Crystal; Clarke, Thomas H; Ayoub, Nadia A; Hayashi, Cheryl Y
2017-06-01
Spiders (order Araneae) rely on their silks for essential tasks, such as dispersal, prey capture, and reproduction. Spider silks are largely composed of spidroins, members of a protein family that are synthesized in silk glands. As needed, silk stored in silk glands is extruded through spigots on the spinnerets. Nearly all studies of spider silks have been conducted on females; thus, little is known about male silk biology. To shed light on silk use by males, we compared silk gene expression profiles of mature males to those of females from three cob-web weaving species (Theridiidae). We de novo assembled species-specific male transcriptomes from Latrodectus hesperus, Latrodectus geometricus, and Steatoda grossa followed by differential gene expression analyses. Consistent with their complement of silk spigots, male theridiid spiders express appreciable amounts of aciniform, major ampullate, minor ampullate, and pyriform spidroin genes but not tubuliform spidroin genes. The relative expression levels of particular spidroin genes varied between sexes and species. Because mature males desert their prey-capture webs and become cursorial in their search for mates, we anticipated that major ampullate (dragline) spidroin genes would be the silk genes most highly expressed by males. Indeed, major ampullate spidroin genes had the highest expression in S. grossa males. However, minor ampullate spidroin genes were the most highly expressed spidroin genes in L. geometricus and L. hesperus males. Our expression profiling results suggest species-specific adaptive divergence of silk use by male theridiids. Copyright © 2017 The Authors. Published by Elsevier GmbH. All rights reserved.
Digital gene expression for non-model organisms
Hong, Lewis Z.; Li, Jun; Schmidt-Küntzel, Anne; Warren, Wesley C.; Barsh, Gregory S.
2011-01-01
Next-generation sequencing technologies offer new approaches for global measurements of gene expression but are mostly limited to organisms for which a high-quality assembled reference genome sequence is available. We present a method for gene expression profiling called EDGE, or EcoP15I-tagged Digital Gene Expression, based on ultra-high-throughput sequencing of 27-bp cDNA fragments that uniquely tag the corresponding gene, thereby allowing direct quantification of transcript abundance. We show that EDGE is capable of assaying for expression in >99% of genes in the genome and achieves saturation after 6–8 million reads. EDGE exhibits very little technical noise, reveals a large (10^6) dynamic range of gene expression, and is particularly suited for quantification of transcript abundance in non-model organisms where a high-quality annotated genome is not available. In a direct comparison with RNA-seq, both methods provide similar assessments of relative transcript abundance, but EDGE does better at detecting gene expression differences for poorly expressed genes and does not exhibit transcript length bias. Applying EDGE to laboratory mice, we show that a loss-of-function mutation in the melanocortin 1 receptor (Mc1r), recognized as a Mendelian determinant of yellow hair color in many different mammals, also causes reduced expression of genes involved in the interferon response. To illustrate the application of EDGE to a non-model organism, we examine skin biopsy samples from a cheetah (Acinonyx jubatus) and identify genes likely to control differences in the color of spotted versus non-spotted regions. PMID:21844123
Quan, Yong; Jin, Yisheng; Faria, Teresa N; Tilford, Charles A; He, Aiqing; Wall, Doris A; Smith, Ronald L; Vig, Balvinder S
2012-06-18
The expression levels of genes involved in drug and nutrient absorption were evaluated in the Madin-Darby Canine Kidney (MDCK) in vitro drug absorption model. MDCK cells were grown on plastic surfaces (for 3 days) or on Transwell® membranes (for 3, 5, 7, and 9 days). The expression profile of genes including ABC transporters, SLC transporters, and cytochrome P450 (CYP) enzymes was determined using the Affymetrix® Canine GeneChip®. Expression of genes whose probe sets passed a stringent confirmation process was examined. Expression of a few transporter (MDR1, PEPT1 and PEPT2) genes in MDCK cells was confirmed by RT-PCR. The overall gene expression profile was strongly influenced by the type of support the cells were grown on. After 3 days of growth, expression of 28% of the genes was statistically different (1.5-fold cutoff, p < 0.05) between the cells grown on plastic and Transwell® membranes. When cells were differentiated on Transwell® membranes, large changes in gene expression profile were observed during the early stages, which then stabilized after 5-7 days. Only a small number of genes encoding drug absorption related SLC, ABC, and CYP were detected in MDCK cells, and most of them exhibited low hybridization signals. Results from this study provide valuable reference information on endogenous gene expression in MDCK cells that could assist in design of drug-transporter and/or drug-enzyme interaction studies, and help interpret the contributions of various transporters and metabolic enzymes in studies with MDCK cells.
Quan, Yong; Jin, Yisheng; Faria, Teresa N.; Tilford, Charles A.; He, Aiqing; Wall, Doris A.; Smith, Ronald L.; Vig, Balvinder S.
2012-01-01
The expression levels of genes involved in drug and nutrient absorption were evaluated in the Madin-Darby Canine Kidney (MDCK) in vitro drug absorption model. MDCK cells were grown on plastic surfaces (for 3 days) or on Transwell® membranes (for 3, 5, 7, and 9 days). The expression profile of genes including ABC transporters, SLC transporters, and cytochrome P450 (CYP) enzymes was determined using the Affymetrix® Canine GeneChip®. Expression of genes whose probe sets passed a stringent confirmation process was examined. Expression of a few transporter (MDR1, PEPT1 and PEPT2) genes in MDCK cells was confirmed by RT-PCR. The overall gene expression profile was strongly influenced by the type of support the cells were grown on. After 3 days of growth, expression of 28% of the genes was statistically different (1.5-fold cutoff, p < 0.05) between the cells grown on plastic and Transwell® membranes. When cells were differentiated on Transwell® membranes, large changes in gene expression profile were observed during the early stages, which then stabilized after 5–7 days. Only a small number of genes encoding drug absorption related SLC, ABC, and CYP were detected in MDCK cells, and most of them exhibited low hybridization signals. Results from this study provide valuable reference information on endogenous gene expression in MDCK cells that could assist in design of drug-transporter and/or drug-enzyme interaction studies, and help interpret the contributions of various transporters and metabolic enzymes in studies with MDCK cells. PMID:24300234
Shi, Kerong; He, Feng; Yuan, Xuefeng; Zhao, Yaofeng; Deng, Xuemei; Hu, Xiaoxiang; Li, Ning
2013-08-01
The ovarian follicle supplies a unique dynamic system for gametes that ensures the propagation of the species. During folliculogenesis, the vast majority of the germ cells are lost or inactivated because of ovarian follicle atresia, resulting in diminished reproductive potency and potential infertility. Understanding the molecular mechanisms that govern folliculogenesis is essential. Primordial (P), preantral (M), and large antral (L) porcine follicles were used to reveal their genome-wide gene expression profiles. Results indicate that primordial follicles (P) have a gene expression pattern distinct from that of growing follicles (M and L). The 5,548 differentially expressed genes display a similar expression mode in M and L, with a correlation coefficient of 0.892. More genes are regulated (both up and down) in M than in L. Also, the fold changes in M (2- to 364-fold) are much larger than in L (2- to 75-fold). Differentially expressed gene groups with different regulation patterns in certain follicular stages are identified and presumed to be closely related to the rules of follicular development. Interestingly, functional annotation analysis revealed that these gene groups feature distinct biological processes or molecular functions. Moreover, the RNA or protein expression of representative candidate genes from these gene groups has been confirmed within follicles. Our study emphasizes genome-scale gene expression characteristics, which provide novel entry points for understanding the rules of folliculogenesis at the molecular level, such as follicular initiation, atresia, and dominance. Transcriptional regulatory circuitries in certain follicular stages are expected to be found among the identified differentially expressed gene groups.
Mancuso, Nicholas; Shi, Huwenbo; Goddard, Pagé; Kichaev, Gleb; Gusev, Alexander; Pasaniuc, Bogdan
2017-03-02
Although genome-wide association studies (GWASs) have identified thousands of risk loci for many complex traits and diseases, the causal variants and genes at these loci remain largely unknown. Here, we introduce a method for estimating the local genetic correlation between gene expression and a complex trait and utilize it to estimate the genetic correlation due to predicted expression between pairs of traits. We integrated gene expression measurements from 45 expression panels with summary GWAS data to perform 30 multi-tissue transcriptome-wide association studies (TWASs). We identified 1,196 genes whose expression is associated with these traits; of these, 168 reside more than 0.5 Mb away from any previously reported GWAS significant variant. We then used our approach to find 43 pairs of traits with significant genetic correlation at the level of predicted expression; of these, eight were not found through genetic correlation at the SNP level. Finally, we used bi-directional regression to find evidence that BMI causally influences triglyceride levels and that triglyceride levels causally influence low-density lipoprotein. Together, our results provide insight into the role of gene expression in the susceptibility of complex traits and diseases. Copyright © 2017 American Society of Human Genetics. Published by Elsevier Inc. All rights reserved.
Integrative Approach to Pain Genetics Identifies Pain Sensitivity Loci across Diseases
Ruau, David; Dudley, Joel T.; Chen, Rong; Phillips, Nicholas G.; Swan, Gary E.; Lazzeroni, Laura C.; Clark, J. David
2012-01-01
Identifying human genes relevant for the processing of pain requires difficult-to-conduct and expensive large-scale clinical trials. Here, we examine a novel integrative paradigm for data-driven discovery of pain gene candidates, taking advantage of the vast amount of existing disease-related clinical literature and gene expression microarray data stored in large international repositories. First, thousands of diseases were ranked according to a disease-specific pain index (DSPI), derived from Medical Subject Heading (MESH) annotations in MEDLINE. Second, gene expression profiles of 121 of these human diseases were obtained from public sources. Third, genes with expression variation significantly correlated with DSPI across diseases were selected as candidate pain genes. Finally, selected candidate pain genes were genotyped in an independent human cohort and prospectively evaluated for significant association between variants and measures of pain sensitivity. The strongest signal was with rs4512126 (5q32, ABLIM3, P = 1.3×10^−10) for the sensitivity to cold pressor pain in males, but not in females. Significant associations were also observed with rs12548828, rs7826700 and rs1075791 on 8q22.2 within NCALD (P = 1.7×10^−4, 1.8×10^−4, and 2.2×10^−4, respectively). Our results demonstrate the utility of a novel paradigm that integrates publicly available disease-specific gene expression data with clinical data curated from MEDLINE to facilitate the discovery of pain-relevant genes. This data-derived list of pain gene candidates enables additional focused and efficient biological studies validating additional candidates. PMID:22685391
Fuentes, Eduardo N; Safian, Diego; Valdés, Juan Antonio; Molina, Alfredo
2013-08-01
In the present study, different reference genes were isolated, and their stability in the skeletal muscle of fine flounder subjected to different nutritional states was assessed using geNorm and NormFinder. The combinations between 18S and ActB; Fau and 18S; and Fau and Tubb were chosen as the most stable gene combinations in feeding, long-term fasting and refeeding, and short-term refeeding conditions, respectively. In all periods, ActB was identified as the single least stable gene. Subsequently, the expression of the myosin heavy chain (MYH) and the insulin-like growth factor-I receptor (IGF-IR) was assessed. A large variation in MYH and IGF-IR expression was found depending on the reference gene that was chosen for normalizing the expression of both genes. Using the most stable reference genes, mRNA levels of MYH decreased and IGF-IR increased during fasting, with both returning to basal levels during refeeding. However, the drop in mRNA levels for IGF-IR occurred during short-term refeeding, in contrast with the observed events in the expression of MYH, which occurred during long-term refeeding. The present study highlights the vast differences incurred when using unsuitable versus suitable reference genes for normalizing gene expression, pointing out that normalization without proper validation could result in a bias of gene expression.
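The reference-gene ranking described in the abstract above can be illustrated with a small sketch of the geNorm stability measure M, which for each candidate gene averages the standard deviation of its log2 expression ratio against every other candidate across samples (lower M means more stable). This is a minimal illustration only: the function name `genorm_m` and all expression values are hypothetical and are not data from the study, and the published geNorm tool additionally applies stepwise exclusion of the least stable gene, which is not shown here.

```python
import math
import statistics


def genorm_m(expr):
    """Return the geNorm-style stability value M for each gene.

    expr maps a gene name to a list of relative expression quantities
    (one per sample, all > 0). Lower M indicates a more stable gene.
    """
    genes = list(expr)
    m_values = {}
    for j in genes:
        pairwise_sd = []
        for k in genes:
            if k == j:
                continue
            # log2 ratio of gene j to gene k in every sample
            log_ratios = [math.log2(a / b) for a, b in zip(expr[j], expr[k])]
            pairwise_sd.append(statistics.stdev(log_ratios))
        # M is the mean pairwise variation against all other candidates
        m_values[j] = sum(pairwise_sd) / len(pairwise_sd)
    return m_values


# Hypothetical relative quantities for three candidates in four samples.
quantities = {
    "18S": [1.0, 1.1, 0.9, 1.0],
    "Fau": [2.0, 2.2, 1.8, 2.0],
    "ActB": [1.0, 3.0, 0.5, 2.0],
}
stability = genorm_m(quantities)
ranked = sorted(stability, key=stability.get)  # most stable first
print(ranked[-1], "is the least stable candidate in this toy data set")
```

In this toy input the first two genes vary in proportion to each other while the third varies independently, so the third receives the highest M value.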
Identification of Cell Cycle-Regulated Genes by Convolutional Neural Network.
Liu, Chenglin; Cui, Peng; Huang, Tao
2017-01-01
Cell cycle-regulated genes are expressed periodically across the cell cycle stages, and identifying and studying these genes can provide a deep understanding of the cell cycle process. High false-positive rates and low overlaps are major problems in cell cycle-regulated gene detection. Here, a computational framework called DLGene was proposed for cell cycle-regulated gene detection. It is based on the convolutional neural network, a deep learning algorithm that represents data patterns in their raw form without assumptions about their distribution. First, the expression data were transformed into categorical state data to denote the changing state of gene expression, and four different expression patterns were revealed for the reported cell cycle-regulated genes. Then, DLGene was applied to discriminate non-cell cycle genes from the four subtypes of cell cycle genes. Its performance was compared with that of six traditional machine learning methods. Finally, the biological functions of representative cell cycle genes for each subtype were analyzed. Our method showed better and more balanced sensitivity and specificity than the other machine learning algorithms. The cell cycle genes had expression patterns very different from those of non-cell cycle genes, and the cell cycle genes fell into four subtypes. Our method not only detects cell cycle genes but also describes their expression patterns, such as when the highest expression level is reached and how expression changes with time. For each subtype, we analyzed the biological functions of the representative genes, and the results provided novel insight into cell cycle mechanisms. Copyright© Bentham Science Publishers; For any queries, please email at epub@benthamscience.org.
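The abstract above does not specify how expression values were converted to categorical states, so the following is only one plausible sketch of such a transformation: each step between consecutive time points is labelled "up", "down", or "flat" according to a tolerance. The function name `to_states`, the three-state scheme, the tolerance, and the example profile are all assumptions made for illustration and should not be read as the DLGene procedure.

```python
def to_states(series, tol=0.1):
    """Label each step between consecutive time points as up, down, or flat.

    tol is an assumed tolerance: changes with absolute value <= tol are
    treated as flat.
    """
    states = []
    for prev, cur in zip(series, series[1:]):
        delta = cur - prev
        if delta > tol:
            states.append("up")
        elif delta < -tol:
            states.append("down")
        else:
            states.append("flat")
    return states


# Hypothetical expression profile of one gene over six time points.
profile = [0.2, 0.9, 1.5, 1.45, 0.6, 0.1]
states = to_states(profile)
peak_index = profile.index(max(profile))  # time point of highest expression
print(states, peak_index)
```

A state sequence of this kind records when expression rises and falls, which is the sort of information the abstract says the method reports (for example, when the highest expression level is reached).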
Differential Sensitivity of Target Genes to Translational Repression by miR-17~92
Jin, Hyun Yong; Oda, Hiroyo; Chen, Pengda; Kang, Seung Goo; Valentine, Elizabeth; Liao, Lujian; Zhang, Yaoyang; Gonzalez-Martin, Alicia; Shepherd, Jovan; Head, Steven R.; Kim, Pyeung-Hyeun; Fu, Guo; Liu, Wen-Hsien; Han, Jiahuai
2017-01-01
MicroRNAs (miRNAs) are thought to exert their functions by modulating the expression of hundreds of target genes and each to a small degree, but it remains unclear how small changes in hundreds of target genes are translated into the specific function of a miRNA. Here, we conducted an integrated analysis of transcriptome and translatome of primary B cells from mutant mice expressing miR-17~92 at three different levels to address this issue. We found that target genes exhibit differential sensitivity to miRNA suppression and that only a small fraction of target genes are actually suppressed by a given concentration of miRNA under physiological conditions. Transgenic expression and deletion of the same miRNA gene regulate largely distinct sets of target genes. miR-17~92 controls target gene expression mainly through translational repression and 5’UTR plays an important role in regulating target gene sensitivity to miRNA suppression. These findings provide molecular insights into a model in which miRNAs exert their specific functions through a small number of key target genes. PMID:28241004
Gene expression profiling of single cells on large-scale oligonucleotide arrays
Hartmann, Claudia H.; Klein, Christoph A.
2006-01-01
Over the last decade, important insights into the regulation of cellular responses to various stimuli were gained by global gene expression analyses of cell populations. More recently, specific cell functions and underlying regulatory networks of rare cells isolated from their natural environment moved to the center of attention. However, low cell numbers still hinder gene expression profiling of rare ex vivo material in biomedical research. Therefore, we developed a robust method for gene expression profiling of single cells on high-density oligonucleotide arrays with excellent coverage of low abundance transcripts. The protocol was extensively tested with freshly isolated single cells of very low mRNA content including single epithelial, mature and immature dendritic cells and hematopoietic stem cells. Quantitative PCR confirmed that the PCR-based global amplification method did not change the relative ratios of transcript abundance and unsupervised hierarchical cluster analysis revealed that the histogenetic origin of an individual cell is correctly reflected by the gene expression profile. Moreover, the gene expression data from dendritic cells demonstrate that cellular differentiation and pathway activation can be monitored in individual cells. PMID:17071717
Dekkers, Bas J. W.; Pearce, Simon P.; van Bolderen-Veldkamp, R. P. M.; Holdsworth, Michael J.; Bentsink, Leónie
2016-01-01
Seed dormancy is a genetically controlled block preventing the germination of imbibed seeds in favorable conditions. It requires a period of dry storage (after-ripening) or certain environmental conditions to be overcome. Dormancy is an important seed trait, which is under selective pressure, to control the seasonal timing of seed germination. Dormant and non-dormant (after-ripened) seeds are characterized by large sets of differentially expressed genes. However, little information is available concerning the temporal and spatial transcriptional changes during early stages of rehydration in dormant and non-dormant seeds. We employed genome-wide transcriptome analysis on seeds of the model plant Arabidopsis thaliana to investigate transcriptional changes in dry seeds upon rehydration. We analyzed gene expression of dormant and after-ripened seeds of the Cvi accession over four time points and two seed compartments (the embryo and surrounding single cell layer endosperm), during the first 24 h after sowing. This work provides a global view of gene expression changes in dormant and non-dormant seeds with temporal and spatial detail, and these may be visualized via a web accessible tool (http://www.wageningenseedlab.nl/resources). A large proportion of transcripts change similarly in both dormant and non-dormant seeds upon rehydration, however, the first differences in transcript abundances become visible shortly after the initiation of imbibition, indicating that changes induced by after-ripening are detected and responded to rapidly upon rehydration. We identified several gene expression profiles which contribute to differential gene expression between dormant and non-dormant samples. Genes with enhanced expression in the endosperm of dormant seeds were overrepresented for stress-related Gene Ontology categories, suggesting a protective role for the endosperm against biotic and abiotic stress to support persistence of the dormant seed in its environment. PMID:27625677
Haney, Robert A.; Clarke, Thomas H.; Gadgil, Rujuta; Fitzpatrick, Ryan; Hayashi, Cheryl Y.; Ayoub, Nadia A.; Garb, Jessica E.
2016-01-01
Gene duplication and positive selection can be important determinants of the evolution of venom, a protein-rich secretion used in prey capture and defense. In a typical model of venom evolution, gene duplicates switch to venom gland expression and change function under the action of positive selection, which together with further duplication produces large gene families encoding diverse toxins. Although these processes have been demonstrated for individual toxin families, high-throughput multitissue sequencing of closely related venomous species can provide insights into evolutionary dynamics at the scale of the entire venom gland transcriptome. By assembling and analyzing multitissue transcriptomes from the Western black widow spider and two closely related species with distinct venom toxicity phenotypes, we do not find that gene duplication and duplicate retention is greater in gene families with venom gland biased expression in comparison with broadly expressed families. Positive selection has acted on some venom toxin families, but does not appear to be in excess for families with venom gland biased expression. Moreover, we find 309 distinct gene families that have single transcripts with venom gland biased expression, suggesting that the switching of genes to venom gland expression in numerous unrelated gene families has been a dominant mode of evolution. We also find ample variation in protein sequences of venom gland–specific transcripts, lineage-specific family sizes, and ortholog expression among species. This variation might contribute to the variable venom toxicity of these species. PMID:26733576
Bil-Lula, Iwona; Woźniak, Mieczysław
2018-03-26
Immunocompromised patients are susceptible to multiple viral infections. Relevant interactions between co-infecting viruses might result from viral regulatory genes which trans-activate or repress the expression of host cell genes as well as the genes of any co-infecting virus. The aim of the current study was to show that the replication of human adenovirus 5 is enhanced by co-infection with BK polyomavirus and is associated with increased expression of proteins including early region 4 open reading frame 1 and both the large tumor antigen and small tumor antigen. Clinical samples of whole blood and urine from 156 hematopoietic stem cell transplant recipients were tested. We also inoculated adenocarcinomic human alveolar basal epithelial cells with both human adenovirus 5 and BK polyomavirus to evaluate if co-infection of viruses affected their replication. Data showed that adenovirus load was significantly higher in the plasma (mean 7.5 × 10^3 ± 8.5 × 10^2 copies/ml) and urine (mean 1.9 × 10^3 ± 8.0 × 10^2 copies/ml) of samples from patients with co-infections, in comparison to samples from patients with isolated adenovirus infection. In vitro co-infection led to an increased (8.6 times) expression of the adenovirus early region 4 open reading frame gene 48 hours post-inoculation. The expression of the early region 4 open reading frame gene positively correlated with the expression of BK polyomavirus large tumor antigen (r = 0.90, p < 0.0001) and small tumor antigen (r = 0.83, p < 0.001) genes. The enhanced expression of the early region 4 open reading frame gene due to co-infection with BK polyomavirus was associated with enhanced adenovirus, but not BK polyomavirus, replication. The current study provides evidence that co-infection of adenovirus and BK polyomavirus contributes to enhanced adenovirus replication. Data obtained from this study may have significant importance in the clinical setting.
An integrated approach to reconstructing genome-scale transcriptional regulatory networks
Imam, Saheed; Noguera, Daniel R.; Donohue, Timothy J.; ...
2015-02-27
Transcriptional regulatory networks (TRNs) program cells to dynamically alter their gene expression in response to changing internal or environmental conditions. In this study, we develop a novel workflow for generating large-scale TRN models that integrates comparative genomics data, global gene expression analyses, and intrinsic properties of transcription factors (TFs). An assessment of this workflow using benchmark datasets for the well-studied γ-proteobacterium Escherichia coli showed that it outperforms expression-based inference approaches, having a significantly larger area under the precision-recall curve. Further analysis indicated that this integrated workflow captures different aspects of the E. coli TRN than expression-based approaches, potentially making them highly complementary. We leveraged this new workflow and observations to build a large-scale TRN model for the α-Proteobacterium Rhodobacter sphaeroides that comprises 120 gene clusters, 1211 genes (including 93 TFs), 1858 predicted protein-DNA interactions and 76 DNA binding motifs. We found that ~67% of the predicted gene clusters in this TRN are enriched for functions ranging from photosynthesis or central carbon metabolism to environmental stress responses. We also found that members of many of the predicted gene clusters were consistent with prior knowledge in R. sphaeroides and/or other bacteria. Experimental validation of predictions from this R. sphaeroides TRN model showed that high precision and recall was also obtained for TFs involved in photosynthesis (PpsR), carbon metabolism (RSP_0489) and iron homeostasis (RSP_3341). In addition, this integrative approach enabled generation of TRNs with increased information content relative to R. sphaeroides TRN models built via other approaches. We also show how this approach can be used to simultaneously produce TRN models for each related organism used in the comparative genomics analysis. Our results highlight the advantages of integrating comparative genomics of closely related organisms with gene expression data to assemble large-scale TRN models with high-quality predictions.
Comparative studies of gene expression and the evolution of gene regulation
Romero, Irene Gallego; Ruvinsky, Ilya; Gilad, Yoav
2014-01-01
The hypothesis that differences in gene regulation play an important role in speciation and adaptation is more than 40 years old. With the advent of new sequencing technologies, we are able to characterize and study gene expression levels and associated regulatory mechanisms in a large number of individuals and species at unprecedented resolution and scale. We have thus gained new insights into the evolutionary pressures that shape gene expression levels, as well as developed an appreciation for the relative importance of evolutionary changes in different regulatory genetic and epigenetic mechanisms. The current challenge is to link gene regulatory changes to adaptive evolution of complex phenotypes. Here we mainly focus on comparative studies in primates, and how they are complemented by studies in model organisms. PMID:22705669
Genomic analysis of expressed sequence tags in American black bear Ursus americanus
2010-01-01
Background: Species of the bear family (Ursidae) are important organisms for research in molecular evolution, comparative physiology and conservation biology, but relatively little genetic sequence information is available for this group. Here we report the development and analyses of the first large scale Expressed Sequence Tag (EST) resource for the American black bear (Ursus americanus). Results: Comprehensive analyses of molecular functions, alternative splicing, and tissue-specific expression of 38,757 black bear EST sequences were conducted using the dog genome as a reference. We identified 18 genes, involved in functions such as lipid catabolism, cell cycle, and vesicle-mediated transport, that are showing rapid evolution in the bear lineage. Three genes, Phospholamban (PLN), cysteine glycine-rich protein 3 (CSRP3) and Troponin I type 3 (TNNI3), are related to heart contraction, and defects in these genes in humans lead to heart disease. Two genes, biphenyl hydrolase-like (BPHL) and CSRP3, contain positively selected sites in bear. Global analysis of evolution rates of hibernation-related genes in bear showed that they are largely conserved and slowly evolving genes, rather than novel and fast-evolving genes. Conclusion: We provide a genomic resource for an important mammalian organism and our study sheds new light on the possible functions and evolution of bear genes. PMID:20338065
Genomic analysis of expressed sequence tags in American black bear Ursus americanus.
Zhao, Sen; Shao, Chunxuan; Goropashnaya, Anna V; Stewart, Nathan C; Xu, Yichi; Tøien, Øivind; Barnes, Brian M; Fedorov, Vadim B; Yan, Jun
2010-03-26
Species of the bear family (Ursidae) are important organisms for research in molecular evolution, comparative physiology and conservation biology, but relatively little genetic sequence information is available for this group. Here we report the development and analyses of the first large scale Expressed Sequence Tag (EST) resource for the American black bear (Ursus americanus). Comprehensive analyses of molecular functions, alternative splicing, and tissue-specific expression of 38,757 black bear EST sequences were conducted using the dog genome as a reference. We identified 18 genes, involved in functions such as lipid catabolism, cell cycle, and vesicle-mediated transport, that are showing rapid evolution in the bear lineage. Three genes, Phospholamban (PLN), cysteine glycine-rich protein 3 (CSRP3) and Troponin I type 3 (TNNI3), are related to heart contraction, and defects in these genes in humans lead to heart disease. Two genes, biphenyl hydrolase-like (BPHL) and CSRP3, contain positively selected sites in bear. Global analysis of evolution rates of hibernation-related genes in bear showed that they are largely conserved and slowly evolving genes, rather than novel and fast-evolving genes. We provide a genomic resource for an important mammalian organism and our study sheds new light on the possible functions and evolution of bear genes.
2014-01-01
Background: Our current knowledge of tooth development derives mainly from studies in mice, which have only one set of non-replaced teeth, compared with the diphyodont dentition in humans. The miniature pig is also diphyodont, making it a valuable alternative model for understanding human tooth development and replacement. However, little is known about gene expression and function during swine odontogenesis. The goal of this study is to survey differential gene expression profiles and perform functional network analysis during morphogenesis of diphyodont dentition in miniature pigs. The identification of genes related to diphyodont development should lead to a better understanding of morphogenetic patterns and the mechanisms of diphyodont replacement in large animal models and humans. Results: The temporal gene expression profiles during early diphyodont development in miniature pigs were detected with the Affymetrix Porcine GeneChip. The gene expression data were further evaluated by ANOVA as well as pathway and STC analyses. A total of 2,053 genes were detected with differential expression. Several signal pathways and 151 genes were then identified through the construction of pathway and signal networks. Conclusions: The gene expression profiles indicated that spatio-temporal down-regulation patterns of gene expression were predominant, while both dynamic activation and inhibition of pathways occurred during the morphogenesis of diphyodont dentition. Our study offers a mechanistic framework for understanding dynamic gene regulation of early diphyodont development and provides a molecular basis for studying tooth development, replacement, and regeneration in miniature pigs. PMID:24498892
Gene Expression Profiling of Soft and Firm Atlantic Salmon Fillet
Larsson, Thomas; Mørkøre, Turid; Kolstad, Kari; Østbye, Tone-Kari; Afanasyev, Sergey; Krasnov, Aleksei
2012-01-01
Texture of salmon fillets is an important quality trait for consumer acceptance as well as for the suitability for processing. In the present work we measured fillet firmness in a population of farmed Atlantic salmon with known pedigree and investigated the relationship between this trait and gene expression. Transcriptomic analyses performed with a 21 K oligonucleotide microarray revealed strong correlations between firmness and a large number of genes. Highly similar expression profiles were observed in several functional groups. Positive regression was found between firmness and genes encoding proteasome components (41 genes) and mitochondrial proteins (129 genes), proteins involved in stress responses (12 genes), and lipid metabolism (30 genes). Coefficients of determination (R2) were in the range of 0.64–0.74. A weaker though highly significant negative regression was seen in sugar metabolism (26 genes, R2 = 0.66) and myofiber proteins (42 genes, R2 = 0.54). Among individual genes that showed a strong association with firmness, there were extracellular matrix proteins (negative correlation), immune genes, and intracellular proteases (positive correlation). Several genes can be regarded as candidate markers of flesh quality (coiled-coil transcriptional coactivator b, AMP deaminase 3, and oligopeptide transporter 15) though their functional roles are unclear. To conclude, fillet firmness of Atlantic salmon depends largely on metabolic properties of the skeletal muscle; where aerobic metabolism using lipids as fuel, and the rapid removal of damaged proteins, appear to play a major role. PMID:22745718
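As a rough illustration of the coefficients of determination reported above, the following sketch computes R2 for a simple least-squares regression of a gene-group expression score on fillet firmness. The function name `r_squared` and the toy values are hypothetical and are not taken from the study; the abstract does not describe the exact regression procedure, so this only shows what an R2 value of the reported kind measures.

```python
def r_squared(x, y):
    """Coefficient of determination for a least-squares line y = a + b*x."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("x and y must have equal length of at least 2")
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    syy = sum((yi - mean_y) ** 2 for yi in y)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    if sxx == 0 or syy == 0:
        raise ValueError("x and y must both vary")
    # For simple linear regression, R^2 equals the squared Pearson correlation.
    return (sxy * sxy) / (sxx * syy)


# Hypothetical toy data (not from the study): firmness in arbitrary units and
# the mean expression of one functional gene group in the same fish.
firmness = [6.1, 7.4, 8.0, 9.2, 10.5]
expression = [0.2, 0.5, 0.4, 0.9, 1.1]
print(round(r_squared(firmness, expression), 2))
```

An R2 of 0.64 to 0.74, as reported for the positively associated groups, would mean that roughly two thirds to three quarters of the variance in the expression score is accounted for by a linear relationship with firmness.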
Patterns of homoeologous gene expression shown by RNA sequencing in hexaploid bread wheat
2014-01-01
Background Bread wheat (Triticum aestivum) has a large, complex and hexaploid genome consisting of A, B and D homoeologous chromosome sets. Therefore each wheat gene potentially exists as a trio of A, B and D homoeoloci, each of which may contribute differentially to wheat phenotypes. We describe a novel approach combining wheat cytogenetic resources (chromosome substitution ‘nullisomic-tetrasomic’ lines) with next generation deep sequencing of gene transcripts (RNA-Seq), to directly and accurately identify homoeologue-specific single nucleotide variants and quantify the relative contribution of individual homoeoloci to gene expression. Results We discover, based on a sample comprising ~5-10% of the total wheat gene content, that at least 45% of wheat genes are expressed from all three distinct homoeoloci. Most of these genes show strikingly biased expression patterns in which expression is dominated by a single homoeolocus. The remaining ~55% of wheat genes are expressed from either one or two homoeoloci only, through a combination of extensive transcriptional silencing and homoeolocus loss. Conclusions We conclude that wheat is tending towards functional diploidy, through a variety of mechanisms causing single homoeoloci to become the predominant source of gene transcripts. This discovery has profound consequences for wheat breeding and our understanding of wheat evolution. PMID:24726045
Sun, Yang; Huang, Shuijin; Wang, Shuping; Guo, Dianhao; Ge, Chang; Xiao, Huamei; Jie, Wencai; Yang, Qiupu; Teng, Xiaolu; Li, Fei
2017-04-01
Insects undergo metamorphosis, involving an abrupt change in body structure through cell growth and differentiation. The rice striped stem borer (SSB), Chilo suppressalis, is one of the most destructive rice pests. However, little is known about the mechanisms that regulate metamorphic development in this notorious insect pest. Here, we studied the expression of 22,197 SSB genes at seven time points during pupal development with a customized microarray, identifying 622 differentially expressed genes (DEGs). Gene ontology (GO) analysis of these DEGs indicated that genes related to substance metabolism were highly expressed in the early pupa, where they participate in the physiological processes of larval tissue disintegration. In comparison, highly expressed genes in the late pupal stages were mainly associated with substance biosynthesis, consistent with adult organ formation at these stages. There were 27 solute carrier (SLC) genes that were highly expressed during pupal development. We knocked down SLC22A3 at the prepupal stage, demonstrating that silencing SLC22A3 induced a deficiency in pupal stiffness and pigmentation. The RNAi-treated individuals had white and soft pupae, suggesting that this gene has an essential role in pupal development. Copyright © 2016 Elsevier Ltd. All rights reserved.
Imprinted gene expression in fetal growth and development.
Lambertini, L; Marsit, C J; Sharma, P; Maccani, M; Ma, Y; Hu, J; Chen, J
2012-06-01
Experimental studies have shown that genomic imprinting is fundamental to fetoplacental development: through timely regulation of imprinted gene expression, it oversees a set of events that determine placental implantation, growth and embryogenesis. We examined the expression profile of 22 imprinted genes which have been linked to pregnancy abnormalities that may ultimately influence childhood development. The study was conducted in a subset of 106 placenta samples, overrepresented with small and large for gestational age cases, from the Rhode Island Child Health Study. We investigated associations between imprinted gene expression and three fetal development parameters: newborn head circumference, birth weight, and size for gestational age. Results from our investigation show that the maternally imprinted/paternally expressed gene ZNF331 inversely associates with each parameter to drive smaller fetal size, while the paternally imprinted/maternally expressed gene SLC22A18 directly associates with newborn head circumference, promoting growth. Multidimensional Scaling analysis revealed two clusters within the 22 imprinted genes which are independently associated with fetoplacental development. Our data suggest that cluster 1 genes work by assuring cell growth and tissue development, while cluster 2 genes act by coordinating these processes. Results from this epidemiologic study offer solid support for the key role of imprinting in fetoplacental development. Copyright © 2012 Elsevier Ltd. All rights reserved.
On the presence and role of human gene-body DNA methylation
Jjingo, Daudi; Conley, Andrew B.; Yi, Soojin V.; Lunyak, Victoria V.; Jordan, I. King
2012-01-01
DNA methylation of promoter sequences is a repressive epigenetic mark that down-regulates gene expression. However, DNA methylation is more prevalent within gene-bodies than seen for promoters, and gene-body methylation has been observed to be positively correlated with gene expression levels. This paradox remains unexplained, and accordingly the role of DNA methylation in gene-bodies is poorly understood. We addressed the presence and role of human gene-body DNA methylation using a meta-analysis of human genome-wide methylation, expression and chromatin data sets. Methylation is associated with transcribed regions as genic sequences have higher levels of methylation than intergenic or promoter sequences. We also find that the relationship between gene-body DNA methylation and expression levels is non-monotonic and bell-shaped. Mid-level expressed genes have the highest levels of gene-body methylation, whereas the most lowly and highly expressed sets of genes both have low levels of methylation. While gene-body methylation can be seen to efficiently repress the initiation of intragenic transcription, the vast majority of methylated sites within genes are not associated with intragenic promoters. In fact, highly expressed genes initiate the most intragenic transcription, which is inconsistent with the previously held notion that gene-body methylation serves to repress spurious intragenic transcription to allow for efficient transcriptional elongation. These observations lead us to propose a model to explain the presence of human gene-body methylation. This model holds that the repression of intragenic transcription by gene-body methylation is largely epiphenomenal, and suggests that gene-body methylation levels are predominantly shaped via the accessibility of the DNA to methylating enzyme complexes. PMID:22577155
Regulatory role of AINTEGUMENTA in organ initiation and growth
DOE Office of Scientific and Technical Information (OSTI.GOV)
Krizek, Beth Allyn; Lebioda, Lukasz
2005-03-01
Although several members of the plant-specific AP2/ERF family of transcription factors are important developmental regulators, many genes in this large protein family remain uncharacterized. Here, we present a phylogenetic analysis of the 18 genes that make up the AP2 subgroup of this family. We report expression analyses of seven Arabidopsis genes most closely related to the floral development gene AINTEGUMENTA and show that all AINTEGUMENTA-like (AIL) genes are transcribed in multiple tissues during development. They are expressed primarily in young actively dividing tissues of a plant and not in mature leaves or stems. The spatial distribution of AIL5, AIL6, and AIL7 mRNA in inflorescences was characterized by in situ hybridization. Each of these genes is expressed in a spatially and temporally distinct pattern within inflorescence meristems and flowers. Ectopic expression of AIL5 resulted in a larger floral organ phenotype, similar to that resulting from ectopic expression of ANT. Our results are consistent with AIL genes having roles in specification of meristematic or division-competent states.
Expressing genes do not forget their LINEs: transposable elements and gene expression
Kines, Kristine J.; Belancio, Victoria P.
2012-01-01
Historically the accumulated mass of mammalian transposable elements (TEs), particularly those located within gene boundaries, was viewed as a genetic burden potentially detrimental to the genomic landscape. This notion has been strengthened by the discovery that transposable sequences can alter the architecture of the transcriptome, not only through insertion, but also long after the integration process is completed. Insertions previously considered harmless are now known to impact the expression of host genes via modification of the transcript quality or quantity, transcriptional interference, or by the control of pathways that affect the mRNA life-cycle. Conversely, several examples of the evolutionarily advantageous impact of TEs on the host gene structure that diversified the cellular transcriptome are reported. TE-induced changes in gene expression can be tissue- or disease-specific, raising the possibility that the impact of TE sequences may vary during development, among normal cell types, and between normal and disease-affected tissues. The understanding of the rules and abundance of TE-interference with gene expression is in its infancy, and its contribution to human disease and/or evolution remains largely unexplored. PMID:22201807
Mo, Delin; Zhu, Zhengmao; te Pas, Marinus F W; Li, Xinyun; Yang, Shulin; Wang, Heng; Wang, Huanling; Li, Kui
2008-06-30
In a previous screen to identify differentially expressed genes associated with embryonic development, the porcine PNAS-4 gene had been found. Considering that differentially expressed genes in early stages of muscle development are potential candidate genes to improve meat quality and production efficiency, we determined how the porcine PNAS-4 gene regulates meat production. Therefore, this gene has been sequenced, its expression analyzed, and its variation associated with meat production traits. We cloned the full-length cDNA of the porcine PNAS-4 gene encoding a protein of 194 amino acids which was expressed in the Golgi complex. This gene was mapped to chromosome 10, q11-16, in a region of conserved synteny with human chromosome 1 where the human homologous gene was localized. Real-time PCR revealed that PNAS-4 mRNA was widely expressed, with highest expression levels in skeletal muscle followed by lymph, liver and other tissues, and showed a down-regulated expression pattern during prenatal development but an up-regulated expression pattern after weaning. Association analysis revealed that allele C of SNP A1813C was prevalent in Chinese indigenous breeds whereas A was the dominant allele in Landrace and Large White, and pigs homozygous for CC had a higher fat content than pigs with other genotypes (P < 0.05). Porcine PNAS-4 protein tagged with green fluorescent protein accumulated in the Golgi complex, and its mRNA showed a widespread expression across many tissues and organs in pigs. It may be an important factor affecting meat production efficiency, because its down-regulated expression pattern during early embryogenesis suggests involvement in the increase of muscle fiber number. In addition, the SNP A1813C associated with fat traits might be a genetic marker for molecular-assisted selection in animal breeding.
Pyeon, Hye-Rim; Nah, Hee-Ju; Kang, Seung-Hoon; Choi, Si-Sun; Kim, Eung-Soo
2017-05-31
Heterologous expression of biosynthetic gene clusters of natural microbial products has become an essential strategy for titer improvement and pathway engineering of various potentially-valuable natural products. A Streptomyces artificial chromosomal conjugation vector, pSBAC, was previously successfully applied for precise cloning and tandem integration of a large polyketide tautomycetin (TMC) biosynthetic gene cluster (Nah et al. in Microb Cell Fact 14(1):1, 2015), implying that this strategy could be employed to develop a custom overexpression scheme of natural product pathway clusters present in actinomycetes. To validate the pSBAC system as a generally-applicable heterologous overexpression system for a large-sized polyketide biosynthetic gene cluster in Streptomyces, another model polyketide compound, the pikromycin biosynthetic gene cluster, was precisely cloned and heterologously expressed using the pSBAC system. A unique HindIII restriction site was precisely inserted at one of the border regions of the pikromycin biosynthetic gene cluster within the chromosome of Streptomyces venezuelae, followed by site-specific recombination of pSBAC into the flanking region of the pikromycin gene cluster. Unlike the previous cloning process, one HindIII site integration step was skipped through pSBAC modification. pPik001, a pSBAC containing the pikromycin biosynthetic gene cluster, was directly introduced into two heterologous hosts, Streptomyces lividans and Streptomyces coelicolor, resulting in the production of 10-deoxymethynolide, a major pikromycin derivative. When two entire pikromycin biosynthetic gene clusters were tandemly introduced into the S. lividans chromosome, overproduction of 10-deoxymethynolide and the presence of pikromycin, which was previously not detected, were both confirmed. Moreover, comparative qRT-PCR results confirmed that the transcription of pikromycin biosynthetic genes was significantly upregulated in S. lividans containing tandem clusters of pikromycin biosynthetic gene clusters. The 60 kb pikromycin biosynthetic gene cluster was isolated in a single integration pSBAC vector. Introduction of the pikromycin biosynthetic gene cluster into the pikromycin non-producing strains resulted in higher pikromycin production. The utility of the pSBAC system as a precise cloning tool for large-sized biosynthetic gene clusters was verified through heterologous expression of the pikromycin biosynthetic gene cluster. Moreover, this pSBAC-driven heterologous expression strategy was confirmed to be an ideal approach for production of low and inconsistent natural products such as pikromycin in S. venezuelae, implying that this strategy could be employed for development of a custom overexpression scheme of natural product biosynthetic gene clusters in actinomycetes.
Zhou, Lei-Lei; Xu, Xiao-Yue; Ni, Jie; Zhao, Xia; Zhou, Jian-Wei; Feng, Ji-Feng
2018-06-01
Due to the low incidence and the heterogeneity of subtypes, the biological process of T-cell lymphomas is largely unknown. Although many genes have been detected in T-cell lymphomas, the role of these genes in the biological process of T-cell lymphomas has not been further analyzed. Two qualified datasets were downloaded from the Gene Expression Omnibus database. The biological functions of differentially expressed genes were evaluated by gene ontology enrichment and KEGG pathway analysis. The network for intersection genes was constructed with the Cytoscape v3.0 software. Kaplan-Meier survival curves and the log-rank test were employed to assess the association between differentially expressed genes and clinical characteristics. The intersection mRNAs were proved to be associated with fundamental processes of T-cell lymphoma cells. These intersection mRNAs were involved in the activation of some cancer-related pathways, including the PI3K/AKT, Ras, JAK-STAT, and NF-kappa B signaling pathways. PDGFRA, CXCL12, and CCL19 were the most significant central genes in the signal-net analysis. The results of survival analysis are not entirely credible. Our findings uncovered aberrantly expressed genes and a complex RNA signal network in T-cell lymphomas and indicated cancer-related pathways involved in disease initiation and progression, providing a new insight for biotargeted therapy in T-cell lymphomas. © 2018 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.
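The Kaplan-Meier survival curves mentioned above can be sketched with the standard product-limit estimator. The function name `kaplan_meier` and the toy follow-up data are hypothetical; the abstract does not state which software the authors used, so this is only a minimal illustration of the estimator itself, not of their pipeline.

```python
def kaplan_meier(times, events):
    """Product-limit survival estimate.

    times  -- follow-up time for each patient
    events -- 1 if the event (e.g. death) was observed, 0 if censored
    Returns a list of (time, survival probability) at each observed event time.
    """
    data = sorted(zip(times, events))
    at_risk = len(data)
    survival = 1.0
    curve = []
    i = 0
    while i < len(data):
        t = data[i][0]
        deaths = 0
        leaving = 0
        # Group all patients who share the same follow-up time.
        while i < len(data) and data[i][0] == t:
            deaths += data[i][1]
            leaving += 1
            i += 1
        if deaths:
            survival *= 1 - deaths / at_risk
            curve.append((t, survival))
        # Both events and censored patients leave the risk set after time t.
        at_risk -= leaving
    return curve


# Hypothetical toy cohort: follow-up in months, 0 marks a censored patient.
for t, s in kaplan_meier([1, 2, 3, 4], [1, 0, 1, 1]):
    print(t, s)
```

In a comparison of the kind described, one such curve would be estimated for each expression group (for example high versus low expression of a gene), and the log-rank test would then assess whether the curves differ.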
DGEM--a microarray gene expression database for primary human disease tissues.
Xia, Yuni; Campen, Andrew; Rigsby, Dan; Guo, Ying; Feng, Xingdong; Su, Eric W; Palakal, Mathew; Li, Shuyu
2007-01-01
Gene expression patterns can reflect gene regulations in human tissues under normal or pathologic conditions. Gene expression profiling data from studies of primary human disease samples are particularly valuable since these studies often span many years in order to collect patient clinical information and achieve a large sample size. Disease-to-Gene Expression Mapper (DGEM) provides a beneficial community resource to access and analyze these data; it currently includes Affymetrix oligonucleotide array datasets for more than 40 human diseases and 1400 samples. The data are normalized to the same scale and stored in a relational database. A statistical-analysis pipeline was implemented to identify genes abnormally expressed in disease tissues or genes whose expressions are associated with clinical parameters such as cancer patient survival. Data-mining results can be queried through a web-based interface at http://dgem.dhcp.iupui.edu/. The query tool enables dynamic generation of graphs and tables that are further linked to major gene and pathway resources that connect the data to relevant biology, including Entrez Gene and Kyoto Encyclopedia of Genes and Genomes (KEGG). In summary, DGEM provides scientists and physicians a valuable tool to study disease mechanisms, to discover potential disease biomarkers for diagnosis and prognosis, and to identify novel gene targets for drug discovery. The source code is freely available for non-profit use, on request to the authors.
Cardiac Gene Therapy: Optimization of Gene Delivery Techniques In Vivo
Katz, Michael G.; Swain, JaBaris D.; White, Jennifer D.; Low, David; Stedman, Hansell
2010-01-01
Vector-mediated cardiac gene therapy holds tremendous promise as a translatable platform technology for treating many cardiovascular diseases. The ideal technique is one that is efficient and practical, allowing for global cardiac gene expression, while minimizing collateral expression in other organs. Here we survey the available in vivo vector-mediated cardiac gene delivery methods (including transcutaneous, intravascular, intramuscular, and cardiopulmonary bypass techniques), with consideration of the relative merits and deficiencies of each. Review of available techniques suggests that an optimal method for vector-mediated gene delivery to the large animal myocardium would ideally employ retrograde and/or anterograde transcoronary gene delivery, extended vector residence time in the coronary circulation, an increased myocardial transcapillary gradient using physical methods, increased endothelial permeability with pharmacological agents, minimal collateral gene expression by isolation of the cardiac circulation from the systemic, and have low immunogenicity. PMID:19947886
USDA-ARS's Scientific Manuscript database
After ovulation, somatic cells of the ovarian follicle (theca and granulosa cells) become the small and large luteal cells of the corpus luteum. Aside from known cell type-specific receptors and steroidogenic enzymes, little is known about the differences in the gene expression profiles of these fou...
Davin, Nicolas; Edger, Patrick P; Hefer, Charles A; Mizrachi, Eshchar; Schuetz, Mathias; Smets, Erik; Myburg, Alexander A; Douglas, Carl J; Schranz, Michael E; Lens, Frederic
2016-06-01
Many plant genes are known to be involved in the development of cambium and wood, but how the expression and functional interaction of these genes determine the unique biology of wood remains largely unknown. We used the soc1ful loss of function mutant - the woodiest genotype known in the otherwise herbaceous model plant Arabidopsis - to investigate the expression and interactions of genes involved in secondary growth (wood formation). Detailed anatomical observations of the stem in combination with mRNA sequencing were used to assess transcriptome remodeling during xylogenesis in wild-type and woody soc1ful plants. To interpret the transcriptome changes, we constructed functional gene association networks of differentially expressed genes using the STRING database. This analysis revealed functionally enriched gene association hubs that are differentially expressed in herbaceous and woody tissues. In particular, we observed the differential expression of genes related to mechanical stress and jasmonate biosynthesis/signaling during wood formation in soc1ful plants that may be an effect of greater tension within woody tissues. Our results suggest that habit shifts from herbaceous to woody life forms observed in many angiosperm lineages could have evolved convergently by genetic changes that modulate the gene expression and interaction network, and thereby redeploy the conserved wood developmental program. © 2016 The Authors. The Plant Journal published by Society for Experimental Biology and John Wiley & Sons Ltd.
Picking Cell Lines for High-Throughput Transcriptomic Toxicity ...
High throughput, whole genome transcriptomic profiling is a promising approach to comprehensively evaluate chemicals for potential biological effects. To be useful for in vitro toxicity screening, gene expression must be quantified in a set of representative cell types that captures the diversity of potential responses across chemicals. The ideal dataset to select these cell types would consist of hundreds of cell types treated with thousands of chemicals, but does not yet exist. However, basal gene expression data may be useful as a surrogate for representing the relevant biological space necessary for cell type selection. The goal of this study was to identify a small (< 20) number of cell types that capture a large, quantifiable fraction of basal gene expression diversity. Three publicly available collections of Affymetrix U133+2.0 cellular gene expression data were used: 1) 59 cell lines from the NCI60 set; 2) 303 primary cell types from the Mabbott et al (2013) expression atlas; and 3) 1036 cell lines from the Cancer Cell Line Encyclopedia. The data were RMA normalized, log-transformed, and the probe sets mapped to HUGO gene identifiers. The results showed that <20 cell lines capture only a small fraction of the total diversity in basal gene expression when evaluated using either the entire set of 20960 HUGO genes or a subset of druggable genes likely to be chemical targets. The fraction of the total gene expression variation explained was consistent when
Prediction of gene expression in embryonic structures of Drosophila melanogaster.
Samsonova, Anastasia A; Niranjan, Mahesan; Russell, Steven; Brazma, Alvis
2007-07-01
Understanding how sets of genes are coordinately regulated in space and time to generate the diversity of cell types that characterise complex metazoans is a major challenge in modern biology. The use of high-throughput approaches, such as large-scale in situ hybridisation and genome-wide expression profiling via DNA microarrays, is beginning to provide insights into the complexities of development. However, in many organisms the collection and annotation of comprehensive in situ localisation data is a difficult and time-consuming task. Here, we present a widely applicable computational approach, integrating developmental time-course microarray data with annotated in situ hybridisation studies, that facilitates the de novo prediction of tissue-specific expression for genes that have no in vivo gene expression localisation data available. Using a classification approach, trained with data from microarray and in situ hybridisation studies of gene expression during Drosophila embryonic development, we made a set of predictions on the tissue-specific expression of Drosophila genes that have not been systematically characterised by in situ hybridisation experiments. The reliability of our predictions is confirmed by literature-derived annotations in FlyBase, by overrepresentation of Gene Ontology biological process annotations, and, in a selected set, by detailed gene-specific studies from the literature. Our novel organism-independent method will be of considerable utility in enriching the annotation of gene function and expression in complex multicellular organisms.
Prediction of Gene Expression in Embryonic Structures of Drosophila melanogaster
Samsonova, Anastasia A; Niranjan, Mahesan; Russell, Steven; Brazma, Alvis
2007-01-01
Understanding how sets of genes are coordinately regulated in space and time to generate the diversity of cell types that characterise complex metazoans is a major challenge in modern biology. The use of high-throughput approaches, such as large-scale in situ hybridisation and genome-wide expression profiling via DNA microarrays, is beginning to provide insights into the complexities of development. However, in many organisms the collection and annotation of comprehensive in situ localisation data is a difficult and time-consuming task. Here, we present a widely applicable computational approach, integrating developmental time-course microarray data with annotated in situ hybridisation studies, that facilitates the de novo prediction of tissue-specific expression for genes that have no in vivo gene expression localisation data available. Using a classification approach, trained with data from microarray and in situ hybridisation studies of gene expression during Drosophila embryonic development, we made a set of predictions on the tissue-specific expression of Drosophila genes that have not been systematically characterised by in situ hybridisation experiments. The reliability of our predictions is confirmed by literature-derived annotations in FlyBase, by overrepresentation of Gene Ontology biological process annotations, and, in a selected set, by detailed gene-specific studies from the literature. Our novel organism-independent method will be of considerable utility in enriching the annotation of gene function and expression in complex multicellular organisms. PMID:17658945
Mosquera Orgueira, Adrián
2015-01-01
DNA methylation is a frequent epigenetic mechanism that participates in transcriptional repression. Variations in DNA methylation with respect to gene expression are constant, and, for unknown reasons, some genes with highly methylated promoters are sometimes overexpressed. In this study we have analyzed the expression and methylation patterns of thousands of genes in five groups of cancer and normal tissue samples in order to determine local and genome-wide differences. We observed significant changes in global methylation-expression correlation in all the neoplasms, which suggests that differential correlation events are frequent in cancer. A focused analysis in the breast cancer cohort identified 1662 genes whose correlation varies significantly between normal and cancerous breast, but whose DNA methylation and gene expression patterns do not change substantially. These genes were enriched in cancer-related pathways and repressive chromatin features across various model cell lines, such as PRC2 binding and H3K27me3 marks. Substantial changes in methylation-expression correlation indicate that these genes are subject to epigenetic remodeling, where the differential activity of other factors breaks the expected relationship between both variables. Our findings suggest a complex regulatory landscape where a redistribution of local and large-scale chromatin repressive domains at differentially correlated genes (DCGs) creates epigenetic hotspots that modulate cancer-specific gene expression. PMID:26029238
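One common way to test whether a methylation-expression correlation differs between two sample groups is to compare Fisher z-transformed Pearson coefficients. The abstract does not say which test the author used, so the sketch below is an assumption chosen for illustration; the function names and the example values are hypothetical.

```python
import math


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


def differential_correlation(r1, n1, r2, n2):
    """Compare two independent correlations with the Fisher z-transform.

    r1, r2 must lie strictly between -1 and 1; n1, n2 are sample sizes (> 3).
    Returns (z statistic, two-sided p-value under a normal approximation).
    """
    z1 = math.atanh(r1)
    z2 = math.atanh(r2)
    se = math.sqrt(1.0 / (n1 - 3) + 1.0 / (n2 - 3))
    z = (z1 - z2) / se
    p = math.erfc(abs(z) / math.sqrt(2.0))
    return z, p


# Hypothetical gene: methylation and expression are anticorrelated in normal
# tissue (r = -0.6, 30 samples) but nearly uncorrelated in tumours (r = 0.1).
z, p = differential_correlation(-0.6, 30, 0.1, 30)
print(round(z, 2), p < 0.05)
```

A gene flagged by a test of this kind would count as differentially correlated even if its mean methylation and mean expression were similar in both groups, which is the situation the abstract describes for the 1662 breast cancer genes.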
Wichmann, Gunnar; Rosolowski, Maciej; Krohn, Knut; Kreuz, Markus; Boehm, Andreas; Reiche, Anett; Scharrer, Ulrike; Halama, Dirk; Bertolini, Julia; Bauer, Ulrike; Holzinger, Dana; Pawlita, Michael; Hess, Jochen; Engel, Christoph; Hasenclever, Dirk; Scholz, Markus; Ahnert, Peter; Kirsten, Holger; Hemprich, Alexander; Wittekind, Christian; Herbarth, Olf; Horn, Friedemann; Dietz, Andreas; Loeffler, Markus
2015-12-15
Stratification of head and neck squamous cell carcinomas (HNSCC) based on HPV16 DNA and RNA status, gene expression patterns, and mutated candidate genes may facilitate patient treatment decision. We characterize head and neck squamous cell carcinomas (HNSCC) with different HPV16 DNA and RNA (E6*I) status from 290 consecutively recruited patients by gene expression profiling and targeted sequencing of 50 genes. We show that tumors with transcriptionally inactive HPV16 (DNA+ RNA-) are similar to HPV-negative (DNA-) tumors regarding gene expression and frequency of TP53 mutations (47%, 8/17 and 43%, 72/167, respectively). We also find that an immune response-related gene expression cluster is associated with lymph node metastasis, independent of HPV16 status and that disruptive TP53 mutations are associated with lymph node metastasis in HPV16 DNA- tumors. We validate each of these associations in another large data set. Four gene expression clusters which we identify differ moderately but significantly in overall survival. Our findings underscore the importance of measuring the HPV16 RNA (E6*I) and TP53-mutation status for patient stratification and identify associations of an immune response-related gene expression cluster and TP53 mutations with lymph node metastasis in HNSCC. © 2015 UICC.
Pandey, Ashutosh; Alok, Anshu; Lakhwani, Deepika; Singh, Jagdeep; Asif, Mehar H.; Trivedi, Prabodh K.
2016-01-01
Flavonoid biosynthesis is largely regulated at the transcriptional level due to the modulated expression of genes related to the phenylpropanoid pathway in plants. Although accumulation of different flavonoids has been reported in banana, a staple fruit crop, no detailed information is available on regulation of the biosynthesis in this important plant. We carried out genome-wide analysis of banana (Musa acuminata, AAA genome) and identified 28 genes belonging to 9 gene families associated with flavonoid biosynthesis. Expression analysis suggested spatial and temporal regulation of the identified genes in different tissues of banana. Analysis revealed enhanced expression of genes related to flavonol and proanthocyanidin (PA) biosynthesis in peel and pulp at the early developmental stages of fruit. Genes involved in anthocyanin biosynthesis were highly expressed during banana fruit ripening. In general, higher accumulation of metabolites was observed in the peel as compared to pulp tissue. A correlation between expression of genes and metabolite content was observed at the early stage of fruit development. Furthermore, this study also suggests regulation of flavonoid biosynthesis, at transcriptional level, under light and dark exposures as well as methyl jasmonate (MJ) treatment in banana. PMID:27539368
Papler, Tanja Burnik; Bokal, Eda Vrtačnik; Tacer, Klementina Fon; Juvan, Peter; Virant Klun, Irma; Devjak, Rok
2014-01-01
The aim of our study was to determine whether there are any differences in the cumulus cell gene expression profile of mature oocytes derived from modified natural IVF and controlled ovarian hyperstimulation cycles and if these changes could help us understand why modified natural IVF has lower success rates. Cumulus cells surrounding mature oocytes that developed to morulae or blastocysts on day 5 after oocyte retrieval were submitted to microarray analysis. The obtained data were then validated using quantitative real-time PCR. There were 66 differentially expressed genes between cumulus cells of modified natural IVF and controlled ovarian hyperstimulation cycles. Gene ontology analysis revealed the oxidation-reduction process, glutathione metabolic process, xenobiotic metabolic process and gene expression were significantly enriched biological processes in MNIVF cycles. Among differentially expressed genes we observed a large group of small nucleolar RNAs whose role in folliculogenesis has not yet been established. The increased expression of genes involved in the oxidation-reduction process probably points to hypoxic conditions in modified natural IVF cycles. This finding opens up new perspectives for the establishment of the potential role that oxidation-reduction processes have in determining success rates of modified natural IVF.
Syal, Kirtimaan; Srinivasan, Anand; Banerjee, Dibyajyoti
2015-07-01
Diabetes and tuberculosis are among the world's most deadly epidemics. People suffering from diabetes are susceptible to tuberculosis. The molecular link between the two is largely unknown. It is known that the Vitamin A receptor (RXR) heterodimerizes with the Vitamin D receptor (VDR) and Peroxisome proliferator-activated receptor-γ (PPARγ) to regulate Tryptophan-aspartate containing coat protein (TACO) expression and fatty acid metabolism, respectively, so it would be interesting to check the expression of these genes in diabetes mellitus (DM) patients, which might explain the susceptibility of diabetics to tuberculosis. In this study, we checked the expression of the RXR, VDR, TACO and Interferon-γ (IFNγ) genes in type-2 DM patients to understand the link between the two diseases. We observed downregulation of the RXR gene and corresponding upregulation of TACO gene expression. We did not observe a significant change in expression of the VDR and IFNγ genes in type-2 DM patients. Repression of the RXR gene could hamper VDR-RXR heterodimer formation and thus upregulate TACO gene expression, which may predispose type-2 DM patients to tuberculosis. Also, a decrease in the RXR-PPARγ heterodimer could be involved in DM.
Loperfido, Mariana; Jarmin, Susan; Dastidar, Sumitava; Di Matteo, Mario; Perini, Ilaria; Moore, Marc; Nair, Nisha; Samara-Kuko, Ermira; Athanasopoulos, Takis; Tedesco, Francesco Saverio; Dickson, George; Sampaolesi, Maurilio; VandenDriessche, Thierry; Chuah, Marinee K.
2016-01-01
Duchenne muscular dystrophy (DMD) is a genetic neuromuscular disorder caused by the absence of dystrophin. We developed a novel gene therapy approach based on the use of the piggyBac (PB) transposon system to deliver the coding DNA sequence (CDS) of either full-length human dystrophin (DYS: 11.1 kb) or truncated microdystrophins (MD1: 3.6 kb; MD2: 4 kb). PB transposons encoding microdystrophins were transfected in C2C12 myoblasts, yielding 65±2% MD1 and 66±2% MD2 expression in differentiated multinucleated myotubes. A hyperactive PB (hyPB) transposase was then deployed to enable transposition of the large-size PB transposon (17 kb) encoding the full-length DYS and green fluorescent protein (GFP). Stable GFP expression attaining 78±3% could be achieved in the C2C12 myoblasts that had undergone transposition. Western blot analysis demonstrated expression of the full-length human DYS protein in myotubes. Subsequently, dystrophic mesoangioblasts from a Golden Retriever muscular dystrophy dog were transfected with the large-size PB transposon, resulting in 50±5% GFP-expressing cells after stable transposition. This was consistent with correction of the differentiated dystrophic mesoangioblasts following expression of full-length human DYS. These results pave the way toward a novel non-viral gene therapy approach for DMD using PB transposons, underscoring their potential to deliver large therapeutic genes. PMID:26682797
Tuller, T; Atar, S; Ruppin, E; Gurevich, M; Achiron, A
2013-03-01
The aim of this study is to understand intracellular regulatory mechanisms in peripheral blood mononuclear cells (PBMCs), which are either common to many autoimmune diseases or specific to some of them. We incorporated large-scale data such as protein-protein interactions, gene expression and demographical information of hundreds of patients and healthy subjects, related to six autoimmune diseases with available large-scale gene expression measurements: multiple sclerosis (MS), systemic lupus erythematosus (SLE), juvenile rheumatoid arthritis (JRA), Crohn's disease (CD), ulcerative colitis (UC) and type 1 diabetes (T1D). These data were analyzed concurrently by statistical and systems biology approaches tailored for this purpose. We found that chemokines such as CXCL1-3, 5, 6 and the interleukin (IL) IL8 tend to be differentially expressed in PBMCs of patients with the analyzed autoimmune diseases. In addition, the anti-apoptotic gene BCL3, interferon-γ (IFNG), and the vitamin D receptor (VDR) gene physically interact with significantly many genes that tend to be differentially expressed in PBMCs of patients with the analyzed autoimmune diseases. In general, similar cellular processes tend to be differentially expressed in PBMC in the analyzed autoimmune diseases. Specifically, the cellular processes related to cell proliferation (for example, epidermal growth factor, platelet-derived growth factor, nuclear factor-κB, Wnt/β-catenin signaling, stress-activated protein kinase c-Jun NH2-terminal kinase), inflammatory response (for example, interleukins IL2 and IL6, the cytokine granulocyte-macrophage colony-stimulating factor and the B-cell receptor), general signaling cascades (for example, mitogen-activated protein kinase, extracellular signal-regulated kinase, p38 and TRK) and apoptosis are activated in most of the analyzed autoimmune diseases. 
However, our results suggest that in each of the analyzed diseases, apoptosis and chemotaxis are activated via different subsignaling pathways. Analyses of the expression levels of dozens of genes and the protein-protein interactions among them demonstrated that CD and UC have relatively similar gene expression signatures, whereas the gene expression signatures of T1D and JRA relatively differ from the signatures of the other autoimmune diseases. These diseases are the only ones activated via the Fcɛ pathway. The relevant genes and pathways reported in this study are discussed at length, and may be helpful in the diagnoses and understanding of autoimmunity and/or specific autoimmune diseases.
Mina, Lida; Soule, Sharon E; Badve, Sunil; Baehner, Fredrick L; Baker, Joffre; Cronin, Maureen; Watson, Drew; Liu, Mei-Lan; Sledge, George W; Shak, Steve; Miller, Kathy D
2007-06-01
Primary chemotherapy provides an ideal opportunity to correlate gene expression with response to treatment. We used paraffin-embedded core biopsies from a completed phase II trial to identify genes that correlate with response to primary chemotherapy. Patients with newly diagnosed stage II or III breast cancer were treated with sequential doxorubicin 75 mg/m² every 2 weeks × 3 and docetaxel 40 mg/m² weekly × 6; treatment order was randomly assigned. Pretreatment core biopsy samples were interrogated for genes that might correlate with pathologic complete response (pCR). In addition to the individual genes, the correlation of the Oncotype DX Recurrence Score with pCR was examined. Of 70 patients enrolled in the parent trial, core biopsy samples with sufficient RNA for gene analyses were available from 45 patients; 9 (20%) had inflammatory breast cancer (IBC). Six (14%) patients achieved a pCR. Twenty-two of the 274 candidate genes assessed correlated with pCR (p < 0.05). Genes correlating with pCR could be grouped into three large clusters: angiogenesis-related genes, proliferation-related genes, and invasion-related genes. Expression of estrogen receptor (ER)-related genes and Recurrence Score did not correlate with pCR. In an exploratory analysis we compared gene expression in IBC to non-inflammatory breast cancer; twenty-four (9%) of the genes were differentially expressed (p < 0.05), 5 were upregulated and 19 were downregulated in IBC. Gene expression analysis on core biopsy samples is feasible and identifies candidate genes that correlate with pCR to primary chemotherapy. Gene expression in IBC differs significantly from non-inflammatory breast cancer.
Transposable elements contribute to activation of maize genes in response to abiotic stress.
Makarevitch, Irina; Waters, Amanda J; West, Patrick T; Stitzer, Michelle; Hirsch, Candice N; Ross-Ibarra, Jeffrey; Springer, Nathan M
2015-01-01
Transposable elements (TEs) account for a large portion of the genome in many eukaryotic species. Despite their reputation as "junk" DNA or genomic parasites deleterious for the host, TEs have complex interactions with host genes and the potential to contribute to regulatory variation in gene expression. It has been hypothesized that TEs and genes they insert near may be transcriptionally activated in response to stress conditions. The maize genome, with many different types of TEs interspersed with genes, provides an ideal system to study the genome-wide influence of TEs on gene regulation. To analyze the magnitude of the TE effect on gene expression response to environmental changes, we profiled gene and TE transcript levels in maize seedlings exposed to a number of abiotic stresses. Many genes exhibit up- or down-regulation in response to these stress conditions. The analysis of TE families inserted within upstream regions of up-regulated genes revealed that between four and nine different TE families are associated with up-regulated gene expression in each of these stress conditions, affecting up to 20% of the genes up-regulated in response to abiotic stress, and as many as 33% of genes that are only expressed in response to stress. Expression of many of these same TE families also responds to the same stress conditions. The analysis of the stress-induced transcripts and proximity of the transposon to the gene suggests that these TEs may provide local enhancer activities that stimulate stress-responsive gene expression. Our data on allelic variation for insertions of several of these TEs show strong correlation between the presence of TE insertions and stress-responsive up-regulation of gene expression. Our findings suggest that TEs provide an important source of allelic regulatory variation in gene response to abiotic stress in maize.
Ryan, Veronica H.; Primiani, Christopher T.; Rao, Jagadeesh S.; Ahn, Kwangmi; Rapoport, Stanley I.; Blanchard, Helene
2014-01-01
Background The polyunsaturated arachidonic and docosahexaenoic acids (AA and DHA) participate in cell membrane synthesis during neurodevelopment, neuroplasticity, and neurotransmission throughout life. Each is metabolized via coupled enzymatic reactions within separate but interacting metabolic cascades. Hypothesis AA and DHA pathway genes are coordinately expressed and underlie cascade interactions during human brain development and aging. Methods The BrainCloud database for human non-pathological prefrontal cortex gene expression was used to quantify postnatal age changes in mRNA expression of 34 genes involved in AA and DHA metabolism. Results Expression patterns were split into Development (0 to 20 years) and Aging (21 to 78 years) intervals. Expression of genes for cytosolic phospholipases A2 (cPLA2), cyclooxygenases (COX)-1 and -2, and other AA cascade enzymes, correlated closely with age during Development, less so during Aging. Expression of DHA cascade enzymes was less inter-correlated in each period, but often changed in the opposite direction to expression of AA cascade genes. Except for the PLA2G4A (cPLA2 IVA) and PTGS2 (COX-2) genes at 1q25, highly inter-correlated genes were at distant chromosomal loci. Conclusions Coordinated age-related gene expression during the brain Development and Aging intervals likely underlies coupled changes in enzymes of the AA and DHA cascades and largely occurs through distant transcriptional regulation. Healthy brain aging does not show upregulation of PLA2G4 or PTGS2 expression, which was found in Alzheimer's disease. PMID:24963629
Mucin gene expression in human male urogenital tract epithelia
Russo, Cindy Leigh; Spurr-Michaud, Sandra; Tisdale, Ann; Pudney, Jeffrey; Anderson, Deborah; Gipson, Ilene K.
2010-01-01
BACKGROUND Mucins are large, hydrophilic glycoproteins that protect wet-surfaced epithelia from pathogen invasion as well as provide lubrication. At least 17 mucin genes have been cloned to date. This study sought to determine the mucin gene expression profile of the human male urogenital tract epithelia, to determine if mucins are present in seminal fluid, and to assess the effect of androgens on mucin expression. METHODS AND RESULTS Testis, epididymis, vas deferens, seminal vesicle, prostate, bladder, urethra and foreskin were assessed for mucin expression by RT-PCR and immunohistochemistry. Epithelia of the vas deferens, prostate and urethra expressed the greatest number of mucins, each expressing 5–8 mucins. Messenger RNA of MUC1 and MUC20, both membrane-associated mucins, were detected in most tissues analyzed. Conversely, MUC6 was predominantly detected in seminal vesicle. MUC1, MUC5B and MUC6 were detected in seminal fluid samples by immunoblot analysis. Androgens had no effect on mucin expression by cultured human prostatic epithelial cells. CONCLUSIONS Each region of urogenital tract epithelium expressed a unique mucin gene repertoire. Secretory mucins are present in seminal fluid, and androgens do not appear to regulate mucin gene expression. PMID:16997931
NASA Astrophysics Data System (ADS)
Noble, Misty L.; Song, Shuxian; Sun, Ryan R.; Fan, Luping; DiBlasi, Robert M.; O'Kelly-Priddy, Colleen; Loeb, Keith R.; Miao, Carol H.
2012-11-01
Ultrasound (US) targeted microbubble (MB) destruction (UTMD) has been shown to be an effective method for delivering drugs and plasmid DNA (pDNA) into cells. We previously reported successful gene transfection of a reporter luciferase gene, pGL4, into livers of mice and rats using UTMD. The challenge is to translate and achieve similar gene expression in large animals, like swine, where the treated tissue volume is substantially larger. The scale-up study requires a proportionally increased amount of pDNA/MBs delivered to tissues and an equivalent increase in US energy. We use different MBs and surgical strategies to retain most of the pDNA/MBs locally during US application in order to maximize the effect of UTMD in gene transfection. Our results show a significant increase in luciferase expression in swine injected with MBs and exposed to 2.7 MPa US. We obtained up to 1800-fold enhancement in the pig experiment using Definity® MBs, and 2000-fold and 6300-fold enhancement in two pig studies using RN18 MBs compared to sham. These results represent an important developmental step towards US-mediated gene delivery in large animals and clinical trials.
Evolution of a Cellular Immune Response in Drosophila: A Phenotypic and Genomic Comparative Analysis
Salazar-Jaramillo, Laura; Paspati, Angeliki; van de Zande, Louis; Vermeulen, Cornelis Joseph; Schwander, Tanja; Wertheim, Bregje
2014-01-01
Understanding the genomic basis of evolutionary adaptation requires insight into the molecular basis underlying phenotypic variation. However, even changes in molecular pathways associated with extreme variation, gains and losses of specific phenotypes, remain largely uncharacterized. Here, we investigate the large interspecific differences in the ability to survive infection by parasitoids across 11 Drosophila species and identify genomic changes associated with gains and losses of parasitoid resistance. We show that a cellular immune defense, encapsulation, and the production of a specialized blood cell, lamellocytes, are restricted to a sublineage of Drosophila, but that encapsulation is absent in one species of this sublineage, Drosophila sechellia. Our comparative analyses of hemopoiesis pathway genes and of genes differentially expressed during the encapsulation response revealed that hemopoiesis-associated genes are highly conserved and present in all species independently of their resistance. In contrast, 11 genes that are differentially expressed during the response to parasitoids are novel genes, specific to the Drosophila sublineage capable of lamellocyte-mediated encapsulation. These novel genes, which are predominantly expressed in hemocytes, arose via duplications, whereby five of them also showed signatures of positive selection, as expected if they were recruited for new functions. Three of these novel genes further showed large-scale and presumably loss-of-function sequence changes in D. sechellia, consistent with the loss of resistance in this species. In combination, these convergent lines of evidence suggest that co-option of duplicated genes in existing pathways and subsequent neofunctionalization are likely to have contributed to the evolution of the lamellocyte-mediated encapsulation in Drosophila. PMID:24443439
Mollah, Mohammad Manir Hossain; Jamal, Rahman; Mokhtar, Norfilza Mohd; Harun, Roslan; Mollah, Md. Nurul Haque
2015-01-01
Background Identifying genes that are differentially expressed (DE) between two or more conditions with multiple patterns of expression is one of the primary objectives of gene expression data analysis. Several statistical approaches, including one-way analysis of variance (ANOVA), are used to identify DE genes. However, most of these methods provide misleading results for two or more conditions with multiple patterns of expression in the presence of outlying genes. In this paper, an attempt is made to develop a hybrid one-way ANOVA approach that unifies the robustness and efficiency of estimation using the minimum β-divergence method to overcome some problems that arise in the existing robust methods for both small- and large-sample cases with multiple patterns of expression. Results The proposed method relies on a β-weight function, which produces values between 0 and 1. The β-weight function with β = 0.2 is used as a measure of outlier detection. It assigns smaller weights (≥ 0) to outlying expressions and larger weights (≤ 1) to typical expressions. The distribution of the β-weights is used to calculate the cut-off point, which is compared to the observed β-weight of an expression to determine whether that gene expression is an outlier. This weight function plays a key role in unifying the robustness and efficiency of estimation in one-way ANOVA. Conclusion Analyses of simulated gene expression profiles revealed that all eight methods (ANOVA, SAM, LIMMA, EBarrays, eLNN, KW, robust BetaEB and proposed) perform almost identically for m = 2 conditions in the absence of outliers. However, the robust BetaEB method and the proposed method exhibited considerably better performance than the other six methods in the presence of outliers. 
In this case, the BetaEB method exhibited slightly better performance than the proposed method for the small-sample cases, but the proposed method exhibited much better performance than the BetaEB method for both the small- and large-sample cases in the presence of more than 50% outlying genes. The proposed method also exhibited better performance than the other methods for m > 2 conditions with multiple patterns of expression, a setting to which BetaEB has not been extended. Therefore, the proposed approach would be more suitable and reliable on average for the identification of DE genes between two or more conditions with multiple patterns of expression. PMID:26413858
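The outlier-detection step described in this abstract can be sketched briefly. The abstract does not give the form of the β-weight function, so the example assumes the Gaussian form often used with the minimum β-divergence method, w = exp(-β (x - μ)² / (2σ²)). As an illustrative shortcut it also plugs in the median and a scaled MAD for μ and σ and uses a fixed cut-off, whereas the paper estimates the parameters by minimum β-divergence and derives the cut-off from the distribution of the weights. All function names are hypothetical.

```python
import math
from statistics import median


def beta_weights(values, beta=0.2):
    """Return one beta-weight in (0, 1] per expression value.

    Assumed form: exp(-beta * (x - mu)^2 / (2 * sigma^2)).
    mu and sigma are robust plug-in estimates (median and scaled MAD),
    used here only for illustration.
    """
    mu = median(values)
    mad = median([abs(v - mu) for v in values])
    sigma = 1.4826 * mad  # scales the MAD to the standard deviation under normality
    if sigma == 0:
        raise ValueError("scale estimate is zero; values are (nearly) constant")
    return [math.exp(-beta * (v - mu) ** 2 / (2 * sigma ** 2)) for v in values]


def flag_outliers(values, beta=0.2, cutoff=0.2):
    """Flag values whose beta-weight falls below a fixed, assumed cut-off."""
    return [w < cutoff for w in beta_weights(values, beta)]


# Toy log-expression values for one gene in one condition; the last is aberrant.
expression = [7.9, 8.1, 8.0, 8.2, 7.8, 8.05, 14.0]
for value, weight, is_outlier in zip(
    expression, beta_weights(expression), flag_outliers(expression)
):
    print(f"{value:6.2f}  weight={weight:.3f}  outlier={is_outlier}")
```

Typical values receive weights near 1 and the aberrant value a weight near 0, so a weighted mean or variance built from these weights is dominated by the typical observations.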
Tian, Jianan; Keller, Mark P.; Oler, Angie T.; Rabaglia, Mary E.; Schueler, Kathryn L.; Stapleton, Donald S.; Broman, Aimee Teo; Zhao, Wen; Kendziorski, Christina; Yandell, Brian S.; Hagenbuch, Bruno; Broman, Karl W.; Attie, Alan D.
2015-01-01
We surveyed gene expression in six tissues in an F2 intercross between mouse strains C57BL/6J (abbreviated B6) and BTBR T+ tf/J (abbreviated BTBR) made genetically obese with the Leptinob mutation. We identified a number of expression quantitative trait loci (eQTL) affecting the expression of numerous genes distal to the locus, called trans-eQTL hotspots. Some of these trans-eQTL hotspots showed effects in multiple tissues, whereas some were specific to a single tissue. An unusually large number of transcripts (∼8% of genes) mapped in trans to a hotspot on chromosome 6, specifically in pancreatic islets. By considering the first two principal components of the expression of genes mapping to this region, we were able to convert the multivariate phenotype into a simple Mendelian trait. Fine mapping the locus by traditional methods reduced the QTL interval to a 298-kb region containing only three genes, including Slco1a6, one member of a large family of organic anion transporters. Direct genomic sequencing of all Slco1a6 exons identified a nonsynonymous coding SNP that converts a highly conserved proline residue at amino acid position 564 to serine. Molecular modeling suggests that Pro564 faces an aqueous pore within this 12-transmembrane domain-spanning protein. When transiently overexpressed in HEK293 cells, BTBR organic anion transporting polypeptide (OATP)1A6-mediated cellular uptake of the bile acid taurocholic acid (TCA) was enhanced compared to B6 OATP1A6. Our results suggest that genetic variation in Slco1a6 leads to altered transport of TCA (and potentially other bile acids) by pancreatic islets, resulting in broad gene regulation. PMID:26385979
Liu, Jingjing; Yin, Tongming; Ye, Ning; Chen, Yingnan; Yin, Tingting; Liu, Min; Hassani, Danial
2013-01-01
Background The dioecious system is relatively rare in plants. Shrub willow is an annual flowering dioecious woody plant, and possesses many characteristics that make it a useful model for tracking the missing pieces of sex determination evolution. To gain a global view of the genes differentially expressed in the male and female shrub willows and to develop a database for further studies, we performed large-scale transcriptome sequencing of flower buds collected separately from the two sexes. Results In total, 1,201,931 high-quality reads were obtained, with an average length of 389 bp and a total length of 467.96 Mb. The ESTs were assembled into 29,048 contigs and 132,709 singletons. These unigenes were further functionally annotated by comparing their sequences to different protein and functional domain databases and were assigned Gene Ontology (GO) terms. A biochemical pathway database containing 291 predicted pathways was also created based on the annotations of the unigenes. Digital expression analysis identified 806 differentially expressed genes between the male and female flower buds. Of these, 33 were located on the incipient sex chromosome of Salicaceae, among which 12 genes might be involved in plant sex determination. These genes warrant special attention in future studies. Conclusions In this study, a large number of EST sequences were generated from the flower buds of a male and a female shrub willow. We also reported the differentially expressed genes between the flowers of the two sexes. This work provides valuable information and sequence resources for uncovering the sex-determining genes and for future functional genomics analysis of Salicaceae spp. PMID:23560075
D'Andrea, M; Dal Monego, S; Pallavicini, A; Modonut, M; Dreos, R; Stefanon, B; Pilla, F
2011-10-01
Using an array consisting of 10 665 70-mer oligonucleotide probes, the longissimus dorsi muscle tissue expression during growth in nine pigs belonging to Casertana (CT), an autochthonous breed characterized by slow growth and a massive accumulation of backfat, was compared with that of two cosmopolitan breeds, Large White (LW) and a crossbreed (CB; Duroc × Landrace × Large White). The results were validated by real-time PCR. All animals were of the same age and were raised under the same environmental conditions. Muscle tissues were collected at 3, 6, 9 and 11 months of age, and a total of 173 genes showed significant differential expression between CT and the cosmopolitan genetic types at 3 months of age. Time series cluster analysis indicated that the CT breed had a different pattern of gene expression compared with that of the LW and the CB. Four of the eight clusters highlighted the gene differences between CT and the other two breeds, which were further supported by statistical analyses: clusters 4 and 5 contained a total of 71 genes that were underexpressed at 3 months of age, and cluster 3 and cluster 7 included 28 and 42 genes respectively that were overexpressed at 3 months of age. As expected, differentially expressed genes belonged to the category of genes coding for contractile fibres and transcription factors involved in muscle development and differentiation. These findings highlight muscle expression genes during pig growth and are useful to understand the genetic meaning of the different developmental rates. © 2011 The Authors, Animal Genetics © 2011 Stichting International Foundation for Animal Genetics.
Zhang, Yu; Peng, Lifang; Wu, Ya; Shen, Yanyue; Wu, Xiaoming; Wang, Jianbo
2014-11-01
Embryo development represents a crucial developmental period in the life cycle of flowering plants. To gain insights into the genetic programs that control embryo development in Brassica rapa L., RNA sequencing technology was used to perform transcriptome profiling analysis of B. rapa developing embryos. The results generated 42,906,229 sequence reads aligned with 32,941 genes. In total, 27,760, 28,871, 28,384, and 25,653 genes were identified from embryos at globular, heart, early cotyledon, and mature developmental stages, respectively, and analysis between stages revealed a subset of stage-specific genes. We next investigated 9,884 differentially expressed genes with more than fivefold changes in expression and false discovery rate ≤ 0.001 from three adjacent-stage comparisons; 1,514, 3,831, and 6,633 genes were detected between globular and heart stage embryo libraries, heart stage and early cotyledon stage, and early cotyledon and mature stage, respectively. Large numbers of genes related to cellular process, metabolism process, response to stimulus, and biological process were expressed during the early and middle stages of embryo development. Fatty acid biosynthesis, biosynthesis of secondary metabolites, and photosynthesis-related genes were expressed predominantly in embryos at the middle stage. Genes for lipid metabolism and storage proteins were highly expressed in the middle and late stages of embryo development. We also identified 911 transcription factor genes that show differential expression across embryo developmental stages. These results increase our understanding of the complex molecular and cellular events during embryo development in B. rapa and provide a foundation for future studies on other oilseed crops.
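The selection rule quoted above (more than fivefold change and false discovery rate ≤ 0.001) can be written as a short filter. The abstract does not state how the false discovery rate was computed, so the sketch below assumes the Benjamini-Hochberg adjustment. Gene names, fold changes and p-values are invented for illustration.

```python
def benjamini_hochberg(pvalues):
    """Return Benjamini-Hochberg adjusted p-values in the input order."""
    m = len(pvalues)
    order = sorted(range(m), key=lambda i: pvalues[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest p-value down so adjusted values stay monotone.
    for rank in range(m, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, pvalues[i] * m / rank)
        adjusted[i] = running_min
    return adjusted


def select_de_genes(genes, fold_changes, pvalues, min_fold=5.0, max_fdr=0.001):
    """Keep genes changed more than min_fold in either direction with FDR <= max_fdr.

    fold_changes are expression ratios between two stages, so a ratio
    below 1 / min_fold counts as a more than min_fold decrease.
    """
    qvalues = benjamini_hochberg(pvalues)
    return [
        gene
        for gene, fc, q in zip(genes, fold_changes, qvalues)
        if (fc > min_fold or fc < 1.0 / min_fold) and q <= max_fdr
    ]


# Hypothetical comparison between two adjacent embryo stages.
genes = ["geneA", "geneB", "geneC", "geneD"]
fold_changes = [8.0, 0.1, 6.0, 1.5]
pvalues = [1e-6, 2e-5, 0.02, 1e-7]

print(select_de_genes(genes, fold_changes, pvalues))
```

Here geneC is dropped for failing the FDR threshold and geneD for changing less than fivefold, which shows that both criteria must hold at once.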
Vital, Marius; Chai, Benli; Østman, Bjørn; Cole, James; Konstantinidis, Konstantinos T; Tiedje, James M
2015-01-01
Escherichia coli spans a genetic continuum from enteric strains to several phylogenetically distinct, atypical lineages that are rare in humans, but more common in extra-intestinal environments. To investigate the link between gene regulation, phylogeny and diversification in this species, we analyzed global gene expression profiles of four strains representing distinct evolutionary lineages, including a well-studied laboratory strain, a typical commensal (enteric) strain and two environmental strains. RNA-Seq was employed to compare the whole transcriptomes of strains grown under batch, chemostat and starvation conditions. Highly differentially expressed genes showed a significantly lower nucleotide sequence identity compared with other genes, indicating that gene regulation and coding sequence conservation are directly connected. Overall, distances between the strains based on gene expression profiles were largely dependent on the culture condition and did not reflect phylogenetic relatedness. Expression differences of commonly shared genes (all four strains) and E. coli core genes were consistently smaller between strains characterized by more similar primary habitats. For instance, environmental strains exhibited increased expression of stress defense genes under carbon-limited growth and entered a more pronounced survival-like phenotype during starvation compared with other strains, which stayed more alert for substrate scavenging and catabolism during no-growth conditions. Since those environmental strains show similar genetic distance to each other and to the other two strains, these findings cannot be simply attributed to genetic relatedness but suggest physiological adaptations. Our study provides new insights into ecologically relevant gene-expression and underscores the role of (differential) gene regulation for the diversification of the model bacterial species. PMID:25343512
Positive Selection Underlies Faster-Z Evolution of Gene Expression in Birds.
Dean, Rebecca; Harrison, Peter W; Wright, Alison E; Zimmer, Fabian; Mank, Judith E
2015-10-01
The elevated rate of evolution for genes on sex chromosomes compared with autosomes (Fast-X or Fast-Z evolution) can result either from positive selection in the heterogametic sex or from nonadaptive consequences of reduced relative effective population size. Recent work in birds suggests that Fast-Z of coding sequence is primarily due to relaxed purifying selection resulting from reduced relative effective population size. However, gene sequence and gene expression are often subject to distinct evolutionary pressures; therefore, we tested for Fast-Z in gene expression using next-generation RNA-sequencing data from multiple avian species. Similar to studies of Fast-Z in coding sequence, we recover clear signatures of Fast-Z in gene expression; however, in contrast to coding sequence, our data indicate that Fast-Z in expression is due to positive selection acting primarily in females. In the soma, where gene expression is highly correlated between the sexes, we detected Fast-Z in both sexes, although at a higher rate in females, suggesting that many positively selected expression changes in females are also expressed in males. In the gonad, where intersexual correlations in expression are much lower, we detected Fast-Z for female gene expression, but crucially, not males. This suggests that a large amount of expression variation is sex-specific in its effects within the gonad. Taken together, our results indicate that Fast-Z evolution of gene expression is the product of positive selection acting on recessive beneficial alleles in the heterogametic sex. More broadly, our analysis suggests that the adaptive potential of Z chromosome gene expression may be much greater than that of gene sequence, results which have important implications for the role of sex chromosomes in speciation and sexual selection. © The Author 2015. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Modeling Bi-modality Improves Characterization of Cell Cycle on Gene Expression in Single Cells
Danaher, Patrick; Finak, Greg; Krouse, Michael; Wang, Alice; Webster, Philippa; Beechem, Joseph; Gottardo, Raphael
2014-01-01
Advances in high-throughput, single cell gene expression are allowing interrogation of cell heterogeneity. However, there is concern that the cell cycle phase of a cell might bias characterizations of gene expression at the single-cell level. We assess the effect of cell cycle phase on gene expression in single cells by measuring 333 genes in 930 cells across three phases and three cell lines. We determine each cell's phase non-invasively without chemical arrest and use it as a covariate in tests of differential expression. We observe bi-modal gene expression, a previously-described phenomenon, wherein the expression of otherwise abundant genes is either strongly positive, or undetectable within individual cells. This bi-modality is likely both biologically and technically driven. Irrespective of its source, we show that it should be modeled to draw accurate inferences from single cell expression experiments. To this end, we propose a semi-continuous modeling framework based on the generalized linear model, and use it to characterize genes with consistent cell cycle effects across three cell lines. Our new computational framework improves the detection of previously characterized cell-cycle genes compared to approaches that do not account for the bi-modality of single-cell data. We use our semi-continuous modelling framework to estimate single cell gene co-expression networks. These networks suggest that in addition to having phase-dependent shifts in expression (when averaged over many cells), some, but not all, canonical cell cycle genes tend to be co-expressed in groups in single cells. We estimate the amount of single cell expression variability attributable to the cell cycle. We find that the cell cycle explains only 5%–17% of expression variability, suggesting that the cell cycle will not tend to be a large nuisance factor in analysis of the single cell transcriptome. PMID:25032992
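The idea of treating bi-modal single-cell data in two parts can be illustrated with a minimal sketch. It is not the authors' generalized-linear-model framework; it only separates a gene's measurements into a discrete part (was the gene detected in the cell?) and a continuous part (how strongly, given detection), then compares two groups of cells on each part. The function names and the detection threshold of 0 are assumptions made for illustration.

```python
import math


def two_part_summary(values, threshold=0.0):
    """Summarise one gene across cells as (detection rate, mean of detected values).

    `threshold` is an assumed cut-off: values at or below it count as undetected.
    """
    n = len(values)
    positive = [v for v in values if v > threshold]
    detection_rate = len(positive) / n if n else math.nan
    mean_positive = sum(positive) / len(positive) if positive else math.nan
    return detection_rate, mean_positive


def compare_groups(group_a, group_b, threshold=0.0):
    """Difference between two groups of cells on each part of the two-part summary."""
    rate_a, mean_a = two_part_summary(group_a, threshold)
    rate_b, mean_b = two_part_summary(group_b, threshold)
    return {
        "detection_rate_diff": rate_a - rate_b,  # discrete part
        "positive_mean_diff": mean_a - mean_b,   # continuous part
    }
```

A gene can differ between phases in how often it is detected, in its level when detected, or both; averaging over all cells, zeros included, mixes the two effects into a single number.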
Extensive variation between tissues in allele specific expression in an outbred mammal.
Chamberlain, Amanda J; Vander Jagt, Christy J; Hayes, Benjamin J; Khansefid, Majid; Marett, Leah C; Millen, Catriona A; Nguyen, Thuy T T; Goddard, Michael E
2015-11-23
Allele specific gene expression (ASE), with the paternal allele more expressed than the maternal allele or vice versa, appears to be a common phenomenon in humans and mice. In other species the extent of ASE is unknown, and even in humans and mice there are several outstanding questions. These include: to what extent is ASE tissue specific? How often does the direction of allele expression imbalance reverse between tissues? How often is only one of the two alleles expressed? Is there a genome-wide bias towards expression of the paternal or maternal allele? And finally, do genes that are nearby on a chromosome share the same direction of ASE? Here we use gene expression data (RNASeq) from 18 tissues from a single cow to investigate each of these questions in turn, and then validate some of these findings in two tissues from 20 cows. Between 40 and 100 million sequence reads were generated per tissue across three replicate samples for each of the eighteen tissues from the single cow (the discovery dataset). A bovine gene expression atlas was created (the first from RNASeq data), and differentially expressed genes in each tissue were identified. To analyse ASE, we had access to unambiguously phased genotypes for all heterozygous variants in the cow's whole genome sequence, where these variants were homozygous in the whole genome sequence of her sire, and as a result we were able to map reads to parental genomes, to determine SNPs and genes showing ASE in each tissue. In total 25,251 heterozygous SNPs within 7985 genes were tested for ASE in at least one tissue. ASE was pervasive: 89 % of genes tested had significant ASE in at least one tissue. This large proportion of genes displaying ASE was confirmed in the two tissues in a validation dataset. For individual tissues the proportion of genes showing significant ASE varied from as low as 8-16 % of those tested in thymus to as high as 71-82 % of those tested in lung.
There were a number of cases where the direction of allele expression imbalance reversed between tissues. For example the gene SPTY2D1 showed almost complete paternal allele expression in kidney and thymus, and almost complete maternal allele expression in the brain caudal lobe and brain cerebellum. Mono allelic expression (MAE) was common, with 1349 of 4856 genes (28 %) tested with more than one heterozygous SNP showing MAE. Across all tissues, 54.17 % of all genes with ASE favoured the paternal allele. Genes that are closely linked on the chromosome were more likely to show higher expression of the same allele (paternal or maternal) than expected by chance. We identified several long runs of neighbouring genes that showed either paternal or maternal ASE, one example was five adjacent genes (GIMAP8, GIMAP7 copy1, GIMAP4, GIMAP7 copy 2 and GIMAP5) that showed almost exclusive paternal expression in brain caudal lobe. Investigating the extent of ASE across 18 bovine tissues in one cow and two tissues in 20 cows demonstrated 1) ASE is pervasive in cattle, 2) the ASE is often MAE but ranges from MAE to slight overexpression of the major allele, 3) the ASE is most often tissue specific and that more than half the time displays divergent allele specific expression patterns across tissues, 4) across all genes there is a slight bias towards expression of the paternal allele and 5) genes expressing the same parental allele are clustered together more than expected by chance, and there are several runs of large numbers of genes expressing the same parental allele.
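One common way to ask whether a heterozygous SNP shows allelic imbalance is to test its paternal and maternal read counts against the 50:50 split expected without ASE. The sketch below is a generic exact two-sided binomial test; the abstract does not say which test the authors used, so this is an assumption for illustration, and the function name and arguments are hypothetical.

```python
import math


def ase_binomial_p(paternal_reads, maternal_reads, p=0.5):
    """Exact two-sided binomial p-value for allelic imbalance at one SNP.

    Sums the probabilities of all outcomes that are no more likely than the
    observed paternal read count under the null proportion `p`.
    """
    n = paternal_reads + maternal_reads
    if n == 0:
        return 1.0  # no reads, no evidence either way

    def pmf(k):
        # Work in log space so deep read coverage does not overflow.
        log_prob = (math.lgamma(n + 1) - math.lgamma(k + 1) - math.lgamma(n - k + 1)
                    + k * math.log(p) + (n - k) * math.log(1 - p))
        return math.exp(log_prob)

    observed = pmf(paternal_reads)
    # Small relative tolerance so symmetric outcomes are not lost to rounding.
    total = sum(pmf(k) for k in range(n + 1) if pmf(k) <= observed * (1 + 1e-7))
    return min(1.0, total)
```

In a genome-wide screen such p-values would still need multiple-testing correction across all SNPs and tissues tested, which this sketch does not attempt.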
Murray, John Isaac
2018-05-01
The convergence of developmental biology and modern genomics tools brings the potential for a comprehensive understanding of developmental systems. This is especially true for the Caenorhabditis elegans embryo because its small size, invariant developmental lineage, and powerful genetic and genomic tools provide the prospect of a cellular resolution understanding of messenger RNA (mRNA) expression and regulation across the organism. We describe here how a systems biology framework might allow large-scale determination of the embryonic regulatory relationships encoded in the C. elegans genome. This framework consists of two broad steps: (a) defining the "parts list", that is, all genes expressed in all cells at each time during development, and (b) iterative steps of computational modeling and refinement of these models by experimental perturbation. Substantial progress has been made towards defining the parts list through imaging methods such as large-scale green fluorescent protein (GFP) reporter analysis. Imaging results are now being augmented by high-resolution transcriptome methods such as single-cell RNA sequencing, and it is likely the complete expression patterns of all genes across the embryo will be known within the next few years. In contrast, the modeling and perturbation experiments performed so far have focused largely on individual cell types or genes, and improved methods will be needed to expand them to the full genome and organism. This emerging comprehensive map of embryonic expression and regulatory function will provide a powerful resource for developmental biologists, and would also allow scientists to ask questions not accessible without a comprehensive picture. This article is categorized under: Invertebrate Organogenesis > Worms; Technologies > Analysis of the Transcriptome; Gene Expression and Transcriptional Hierarchies > Gene Networks and Genomics. © 2018 Wiley Periodicals, Inc.
Liu, Jing; Wang, Qun; Sun, Minying; Zhu, Linlin; Yang, Michael; Zhao, Yu
2014-01-01
Quantitative real-time reverse transcription PCR (qRT-PCR) has become a widely used method for gene expression analysis; however, its data interpretation largely depends on the stability of reference genes. The transcriptomics of Panax ginseng, one of the most popular and traditional ingredients used in Chinese medicines, is increasingly being studied. It is therefore vital to establish a series of reliable reference genes when qRT-PCR is used to assess the gene expression profile of ginseng. In this study, we screened out candidate reference genes for ginseng using gene expression data generated by a high-throughput sequencing platform. Based on the statistical tests, 20 reference genes (10 traditional housekeeping genes and 10 novel genes) were selected. These genes were tested for the normalization of expression levels in five growth stages and three distinct plant organs of ginseng by qPCR. These genes were subsequently ranked and compared according to the stability of their expression using the geNorm, NormFinder, and BestKeeper computational programs. Although the best reference genes were found to vary across different samples, CYP and EF-1α were the most stable genes amongst all samples. GAPDH/30S RPS20, CYP/60S RPL13 and CYP/QCR were the optimum pairs of reference genes in the roots, stems, and leaves, respectively. CYP/60S RPL13, CYP/eIF-5A, aTUB/V-ATP, eIF-5A/SAR1, and aTUB/pol IIa were the most stably expressed combinations in each of the five developmental stages. Our study serves as a foundation for developing an accurate method of qRT-PCR and will benefit future studies on gene expression profiles of Panax ginseng.
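The stability ranking performed by geNorm can be sketched in a few lines. The geNorm stability measure M for a candidate gene is the average, over all other candidates, of the standard deviation of the log2 expression ratio between the two genes across samples; a lower M means more stable expression. The code below is a minimal re-implementation of that single measure, not the geNorm program itself: it omits the stepwise exclusion of the least stable gene, and the input layout (a dict of relative quantities per gene, in a shared sample order) is an assumption for illustration.

```python
import math
from statistics import stdev


def genorm_m(expression):
    """Return the geNorm-style stability measure M for each candidate gene.

    `expression` maps gene name -> list of relative quantities, one per sample,
    with every gene measured in the same samples in the same order.
    Needs at least two genes and at least two samples.
    """
    genes = list(expression)
    m_values = {}
    for gene_j in genes:
        pairwise_variations = []
        for gene_k in genes:
            if gene_k == gene_j:
                continue
            log_ratios = [
                math.log2(a / b)
                for a, b in zip(expression[gene_j], expression[gene_k])
            ]
            # Pairwise variation: spread of the log ratio across samples.
            pairwise_variations.append(stdev(log_ratios))
        m_values[gene_j] = sum(pairwise_variations) / len(pairwise_variations)
    return m_values
```

Two genes whose ratio is constant across samples contribute zero pairwise variation to each other, which is why co-regulated candidates can look deceptively stable and why the abstract's use of several complementary programs is a sensible check.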
Upper airway gene expression in smokers: the mouth as a "window to the soul" of lung carcinogenesis?
Spira, Avrum
2010-03-01
This perspective on Boyle et al. (beginning on page 266 in this issue of the journal) explores transcriptomic profiling of upper airway epithelium as a biomarker of host response to tobacco smoke exposure. Boyle et al. have shown a striking relationship between smoking-related gene expression changes in the mouth and bronchus. This relationship suggests that buccal gene expression may serve as a relatively noninvasive surrogate marker of the physiologic response of the lung to tobacco smoke that could be used in large-scale screening and chemoprevention studies for lung cancer.
Jouffe, Vincent; Rowe, Suzanne; Liaubet, Laurence; Buitenhuis, Bart; Hornshøj, Henrik; SanCristobal, Magali; Mormède, Pierre; de Koning, D J
2009-07-16
Microarray studies can supplement QTL studies by suggesting potential candidate genes in the QTL regions, which by themselves are too large to provide a limited selection of candidate genes. Here we provide a case study where we explore ways to integrate QTL data and microarray data for the pig, which has only a partial genome sequence. We outline various procedures to localize differentially expressed genes on the pig genome and link this with information on published QTL. The starting point is a set of 237 differentially expressed cDNA clones in adrenal tissue from two pig breeds, before and after treatment with adrenocorticotropic hormone (ACTH). Different approaches to localize the differentially expressed (DE) genes to the pig genome showed different levels of success and a clear lack of concordance for some genes between the various approaches. For a focused analysis on 12 genes, overlapping QTL from the public domain were presented. Also, differentially expressed genes underlying QTL for ACTH response were described. Using the latest version of the draft sequence, the differentially expressed genes were mapped to the pig genome. This enabled co-location of DE genes and previously studied QTL regions, but the draft genome sequence is still incomplete and will contain many errors. A further step to explore links between DE genes and QTL at the pathway level was largely unsuccessful due to the lack of annotation of the pig genome. This could be improved by further comparative mapping analyses but this would be time consuming. This paper provides a case study for the integration of QTL data and microarray data for a species with limited genome sequence information and annotation. The results illustrate the challenges that must be addressed but also provide a roadmap for future work that is applicable to other non-model species.
Identification of stable reference genes in differentiating human pluripotent stem cells.
Holmgren, Gustav; Ghosheh, Nidal; Zeng, Xianmin; Bogestål, Yalda; Sartipy, Peter; Synnergren, Jane
2015-06-01
Reference genes, often referred to as housekeeping genes (HKGs), are frequently used to normalize gene expression data based on the assumption that they are expressed at a constant level in the cells. However, several studies have shown that there may be a large variability in the gene expression levels of HKGs in various cell types. In a previous study, employing human embryonic stem cells (hESCs) subjected to spontaneous differentiation, we observed that the expression of commonly used HKG varied to a degree that rendered them inappropriate to use as reference genes under those experimental settings. Here we present a substantially extended study of the HKG signature in human pluripotent stem cells (hPSC), including nine global gene expression datasets from both hESC and human induced pluripotent stem cells, obtained during directed differentiation toward endoderm-, mesoderm-, and ectoderm derivatives. Sets of stably expressed genes were compiled, and a handful of genes (e.g., EID2, ZNF324B, CAPN10, and RABEP2) were identified as generally applicable reference genes in hPSCs across all cell lines and experimental conditions. The stability in gene expression profiles was confirmed by reverse transcription quantitative PCR analysis. Taken together, the current results suggest that differentiating hPSCs have a distinct HKG signature, which in some aspects is different from somatic cell types, and underscore the necessity to validate the stability of reference genes under the actual experimental setup used. In addition, the novel putative HKGs identified in this study can preferentially be used for normalization of gene expression data obtained from differentiating hPSCs. Copyright © 2015 the American Physiological Society.
Vascular Gene Expression in Nonneoplastic and Malignant Brain
Madden, Stephen L.; Cook, Brian P.; Nacht, Mariana; Weber, William D.; Callahan, Michelle R.; Jiang, Yide; Dufault, Michael R.; Zhang, Xiaoming; Zhang, Wen; Walter-Yohrling, Jennifer; Rouleau, Cecile; Akmaev, Viatcheslav R.; Wang, Clarence J.; Cao, Xiaohong; St. Martin, Thia B.; Roberts, Bruce L.; Teicher, Beverly A.; Klinger, Katherine W.; Stan, Radu-Virgil; Lucey, Brenden; Carson-Walter, Eleanor B.; Laterra, John; Walter, Kevin A.
2004-01-01
Malignant gliomas are uniformly lethal tumors whose morbidity is mediated in large part by the angiogenic response of the brain to the invading tumor. This profound angiogenic response leads to aggressive tumor invasion and destruction of surrounding brain tissue as well as blood-brain barrier breakdown and life-threatening cerebral edema. To investigate the molecular mechanisms governing the proliferation of abnormal microvasculature in malignant brain tumor patients, we have undertaken a cell-specific transcriptome analysis from surgically harvested nonneoplastic and tumor-associated endothelial cells. SAGE-derived endothelial cell gene expression patterns from glioma and nonneoplastic brain tissue reveal distinct gene expression patterns and consistent up-regulation of certain glioma endothelial marker genes across patient samples. We define the G-protein-coupled receptor RDC1 as a tumor endothelial marker whose expression is distinctly induced in tumor endothelial cells of both brain and peripheral vasculature. Further, we demonstrate that the glioma-induced gene, PV1, shows expression both restricted to endothelial cells and coincident with endothelial cell tube formation. As PV1 provides a framework for endothelial cell caveolar diaphragms, this protein may serve to enhance glioma-induced disruption of the blood-brain barrier and transendothelial exchange. Additional characterization of this extensive brain endothelial cell gene expression database will provide unique molecular insights into vascular gene expression. PMID:15277233
Construction of two vectors for gene expression in Trichoderma reesei.
Lv, Dandan; Wang, Wei; Wei, Dongzhi
2012-01-01
We report the construction of two filamentous fungi Trichoderma reesei expression vectors, pWEF31 and pWEF32. Both vectors possess the hygromycin phosphotransferase B gene expression cassette and the strong promoter and terminator of the cellobiohydrolase 1 gene (cbh1) from T. reesei. The two newly constructed vectors can be efficiently transformed into T. reesei with Agrobacterium-mediated transformation. The difference between pWEF31 and pWEF32 is that pWEF32 has two longer homologous arms. As a result, pWEF32 easily undergoes homologous recombination. On the other hand, pWEF31 undergoes random recombination. The applicability of both vectors was tested by first generating the expression vectors pWEF31-red and pWEF32-red and then detecting the expression of the DsRed2 gene in T. reesei Rut C30. Additionally, we measured the exo-1,4-β-glucanase activity of the recombinant cells. Our work provides an effective transformation system for homologous and heterologous gene expression and gene knockout in T. reesei. It also provides a method for recombination at a specific chromosomal location. Finally, both vectors will be useful for the large-scale gene expression industry. Copyright © 2011 Elsevier Inc. All rights reserved.
Jesnowski, R; Zubakov, Dmitri; Faissner, Ralf; Ringel, Jörg; Hoheisel, Jörg D; Lösel, Ralf; Schnölzer, Martina; Löhr, Matthias
2007-01-01
Pancreatic carcinoma has an extremely bad prognosis due to lack of early diagnostic markers and lack of effective therapeutic strategies. Recently, we have established an in vitro model recapitulating the first steps in the carcinogenesis of the pancreas. SV40 large T antigen-immortalized bovine pancreatic duct cells formed intrapancreatic adenocarcinoma tumors on k-ras(mut) transfection after orthotopic injection in the nude mouse pancreas. Here we identified genes and proteins differentially expressed in the course of malignant transformation using reciprocal suppression subtractive hybridization and 2D gel electrophoresis and mass spectrometry, respectively. We identified 34 differentially expressed genes, expressed sequence tags, and 15 unique proteins. Differential expression was verified for some of the genes or proteins in samples from pancreatic carcinoma. Among these genes and proteins, the majority had already been described either to be influenced by a mutated ras or to be differentially expressed in pancreatic adenocarcinoma, thus proving the feasibility of our model. Other genes and proteins (e.g., BBC1, GLTSCR2, and rhoGDIα), up to now, have not been implicated in pancreatic tumor development. Thus, we were able to establish an in vitro model of pancreatic carcinogenesis, which enabled us to identify genes and proteins differentially expressed during the early steps of malignant transformation. PMID:17356710
Dominance and Sexual Dimorphism Pervade the Salix purpurea L. Transcriptome
Carlson, Craig H.; Choi, Yongwook; Chan, Agnes P.; ...
2017-09-01
The heritability of gene expression is critical in understanding heterosis and is dependent on allele-specific regulation by local and remote factors in the genome. We used RNA-Seq to test whether variation in gene expression among F1 and F2 intraspecific Salix purpurea progeny is attributable to cis- and trans-regulatory divergence. We assessed the mode of inheritance based on gene expression levels and allele-specific expression for F1 and F2 intraspecific progeny in two distinct tissue types: shoot tip and stem internode. In addition, we explored sexually dimorphic patterns of inheritance and regulatory divergence among F1 progeny individuals. We show that in S. purpurea intraspecific crosses, gene expression inheritance largely exhibits a maternal dominant pattern, regardless of tissue type or pedigree. A significantly greater number of cis- and trans-regulated genes coincided with upregulation of the maternal parent allele in the progeny, irrespective of the magnitude, whereas the paternal allele was higher expressed for genes showing cis × trans or compensatory regulation. Importantly, consistent with previous genetic mapping results for sex in shrub willow, we have delimited sex-biased gene expression to a 2 Mb pericentromeric region on S. purpurea chr15 and further refined the sex determination region. Lastly, altogether, our results offer insight into the inheritance of gene expression in S. purpurea as well as evidence of sexually dimorphic expression which may have contributed to the evolution of dioecy in Salix.
Cirelli, C; Tononi, G
1999-06-01
The consequences of sleep and sleep deprivation at the molecular level are largely unexplored. Knowledge of such molecular events is essential to understand the restorative processes occurring during sleep as well as the cellular mechanisms of sleep regulation. Here we review the available data about changes in neural gene expression across different behavioural states using candidate gene approaches such as in situ hybridization and immunocytochemistry. We then describe new techniques for systematic screening of gene expression in the brain, such as subtractive hybridization, mRNA differential display, and cDNA microarray technology, outlining advantages and disadvantages of these methods. Finally, we summarize our initial results of a systematic screening of gene expression in the rat brain across behavioural states using mRNA differential display and cDNA microarray technology. The expression pattern of approximately 7000 genes was analysed in the cerebral cortex of rats after 3 h of spontaneous sleep, 3 h of spontaneous waking, or 3 h of sleep deprivation. While the majority of transcripts were expressed at the same level among these three conditions, 14 mRNAs were modulated by sleep and waking. Six transcripts, four more expressed in waking and two more expressed in sleep, corresponded to novel genes. The eight known transcripts were all expressed at higher levels in waking than in sleep and included transcription factors and mitochondrial genes. A possible role for these known transcripts in mediating neural plasticity during waking is discussed.
Single-feature polymorphism discovery in the barley transcriptome
Rostoks, Nils; Borevitz, Justin O; Hedley, Peter E; Russell, Joanne; Mudie, Sharon; Morris, Jenny; Cardle, Linda; Marshall, David F; Waugh, Robbie
2005-01-01
A probe-level model for analysis of GeneChip gene-expression data is presented which identified more than 10,000 single-feature polymorphisms (SFP) between two barley genotypes. The method has good sensitivity, as 67% of known single-nucleotide polymorphisms (SNP) were called as SFPs. This method is applicable to all oligonucleotide microarray data, accounts for SNP effects in gene-expression data and represents an efficient and versatile approach for highly parallel marker identification in large genomes. PMID:15960806
The Importance of Normalization on Large and Heterogeneous Microarray Datasets
DNA microarray technology is a powerful functional genomics tool increasingly used for investigating global gene expression in environmental studies. Microarrays can also be used in identifying biological networks, as they give insight on the complex gene-to-gene interactions, ne...
Yamagishi, J; Isobe, R; Takebuchi, T; Bando, H
2003-03-01
We describe, for the first time, the generation of a viral DNA chip for simultaneous expression measurements of nearly all known open reading frames (ORFs) in the best-studied members of the family Baculoviridae, Autographa californica multiple nucleopolyhedrovirus (AcMNPV) and Bombyx mori nucleopolyhedrovirus (BmNPV). In this study, a viral DNA chip (Ac-BmNPV chip) was fabricated and used to characterize the viral gene expression profile for AcMNPV in different cell types. The viral chip is composed of microarrays of viral DNA prepared by robotic deposition of PCR-amplified viral DNA fragments on glass for ORFs in the NPV genome. Viral gene expression was monitored by hybridization to the DNA fragment microarrays with fluorescently labeled cDNAs prepared from infected Spodoptera frugiperda, Sf9 cells and Trichoplusia ni, TnHigh-Five cells, the latter a major producer of baculovirus and recombinant proteins. A comparison of expression profiles of known ORFs in AcMNPV elucidated six genes (ORF150, p10, pk2, and three late gene expression factor genes lef-3, p35 and lef-6) the expression of each of which was regulated differently in the two cell lines. Most of these genes are known to be closely involved in the viral life cycle such as in DNA replication, late gene expression and the release of polyhedra from infected cells. These results imply that the differential expression of these viral genes accounts for the differences in viral replication between these two cell lines. Thus, these fabricated microarrays of NPV DNA which allow a rapid analysis of gene expression at the viral genome level should greatly speed the functional analysis of large genomes of NPV.
Liu, Peng-Cheng; Liu, Kuan; Liu, Jun-Feng; Xia, Kuo; Chen, Li-Yang; Wu, Xing
2016-09-27
The effect of overexpressing the Indian hedgehog (IHH) gene on the chondrogenic differentiation of rabbit bone marrow-derived mesenchymal stem cells (BMSCs) was investigated in a simulated microgravity environment. An adenovirus plasmid encoding the rabbit IHH gene was constructed in vitro and transfected into rabbit BMSCs. Two large groups were used: conventional cell culture and induction model group and simulated microgravity environment group. Each large group was further divided into blank control group, GFP transfection group, and IHH transfection group. During differentiation induction, the expression levels of cartilage-related and cartilage hypertrophy-related genes and proteins in each group were determined. In the conventional model, the IHH transfection group expressed high levels of cartilage-related factors (Coll2 and ANCN) at the early stage of differentiation induction and expressed high levels of cartilage hypertrophy-related factors (Coll10, annexin 5, and ALP) at the late stage. Under the simulated microgravity environment, the IHH transfection group expressed high levels of cartilage-related factors and low levels of cartilage hypertrophy-related factors at all stages of differentiation induction. Under the simulated microgravity environment, transfection of the IHH gene into BMSCs effectively promoted the generation of cartilage and inhibited cartilage aging and osteogenesis. Therefore, this technique is suitable for cartilage tissue engineering.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Vandenberg, P.; Khillan, J.S.; Prockop, D.J.
A minigene version of the human gene for type II procollagen (COL2A1) was prepared that lacked a large central region containing 12 of the 52 exons and therefore 291 of the 1523 codons of the gene. The construct was modeled after sporadic in-frame deletions of collagen genes that cause synthesis of shortened proα chains that associate with normal proα chains and thereby cause degradation of the shortened and normal proα chains through a process called procollagen suicide. The gene construct was used to prepare five lines of transgenic mice expressing the minigene. A large proportion of the mice expressing the minigene developed a phenotype of a chondrodysplasia with dwarfism, short and thick limbs, a short snout, a cranial bulge, a cleft palate, and delayed mineralization of bone. A number of mice died shortly after birth. Microscopic examination of cartilage revealed decreased density and organization of collagen fibrils. In cultured chondrocytes from the transgenic mice, the minigene was expressed as shortened proα1(II) chains that were disulfide-linked to normal mouse proα1(II) chains. Therefore, the phenotype is probably explained by depletion of the endogenous mouse type II procollagen through the phenomenon of procollagen suicide.
Kaufman, Alon; Dror, Gideon; Meilijson, Isaac; Ruppin, Eytan
2006-12-08
The claim that genetic properties of neurons significantly influence their synaptic network structure is a common notion in neuroscience. The nematode Caenorhabditis elegans provides an exciting opportunity to approach this question in a large-scale quantitative manner. Its synaptic connectivity network has been identified, and, combined with cellular studies, we currently have characteristic connectivity and gene expression signatures for most of its neurons. By using two complementary analysis assays we show that the expression signature of a neuron carries significant information about its synaptic connectivity signature, and identify a list of putative genes predicting neural connectivity. The current study rigorously quantifies the relation between gene expression and synaptic connectivity signatures in the C. elegans nervous system and identifies subsets of neurons where this relation is highly marked. The results presented and the genes identified provide a promising starting point for further, more detailed computational and experimental investigations.
Sibout, Richard; Proost, Sebastian; Hansen, Bjoern Oest; Vaid, Neha; Giorgi, Federico M; Ho-Yue-Kuang, Severine; Legée, Frédéric; Cézart, Laurent; Bouchabké-Coussa, Oumaya; Soulhat, Camille; Provart, Nicholas; Pasha, Asher; Le Bris, Philippe; Roujol, David; Hofte, Herman; Jamet, Elisabeth; Lapierre, Catherine; Persson, Staffan; Mutwil, Marek
2017-08-01
While Brachypodium distachyon (Brachypodium) is an emerging model for grasses, no expression atlas or gene coexpression network is available. Such tools are of high importance to provide insights into the function of Brachypodium genes. We present a detailed Brachypodium expression atlas, capturing gene expression in its major organs at different developmental stages. The data were integrated into a large-scale coexpression database (www.gene2function.de), enabling identification of duplicated pathways and conserved processes across 10 plant species, thus allowing genome-wide inference of gene function. We highlight the importance of the atlas and the platform through the identification of duplicated cell wall modules, and show that a lignin biosynthesis module is conserved across angiosperms. We identified and functionally characterised a putative ferulate 5-hydroxylase gene through overexpression of it in Brachypodium, which resulted in an increase in lignin syringyl units and reduced lignin content of mature stems, and led to improved saccharification of the stem biomass. Our Brachypodium expression atlas thus provides a powerful resource to reveal functionally related genes, which may advance our understanding of important biological processes in grasses. © 2017 The Authors. New Phytologist © 2017 New Phytologist Trust.
Plasticity-Related Gene Expression During Eszopiclone-Induced Sleep.
Gerashchenko, Dmitry; Pasumarthi, Ravi K; Kilduff, Thomas S
2017-07-01
Experimental evidence suggests that restorative processes depend on synaptic plasticity changes in the brain during sleep. We used the expression of plasticity-related genes to assess synaptic plasticity changes during drug-induced sleep. We first characterized sleep induced by eszopiclone in mice during baseline conditions and during the recovery from sleep deprivation. We then compared the expression of 18 genes and two miRNAs critically involved in synaptic plasticity in these mice. Gene expression was assessed in the cerebral cortex and hippocampus by the TaqMan reverse transcription polymerase chain reaction and correlated with sleep parameters. Eszopiclone reduced the latency to nonrapid eye movement (NREM) sleep and increased NREM sleep amounts. Eszopiclone had no effect on slow wave activity (SWA) during baseline conditions but reduced the SWA increase during recovery sleep (RS) after sleep deprivation. Gene expression analyses revealed three distinct patterns: (1) four genes had higher expression either in the cortex or hippocampus in the group of mice with increased amounts of wakefulness; (2) a large proportion of plasticity-related genes (7 out of 18 genes) had higher expression during RS in the cortex but not in the hippocampus; and (3) six genes and the two miRNAs showed no significant changes across conditions. Even at a relatively high dose (20 mg/kg), eszopiclone did not reduce the expression of plasticity-related genes during the RS period in the cortex. These results indicate that gene expression associated with synaptic plasticity occurs in the cortex in the presence of a hypnotic medication. © Sleep Research Society 2017. Published by Oxford University Press on behalf of the Sleep Research Society.
NASA Astrophysics Data System (ADS)
Tian, Caihong; Tek Tay, Wee; Feng, Hongqiang; Wang, Ying; Hu, Yongmin; Li, Guoping
2015-06-01
Adelphocoris suturalis is one of the most serious pest insects of Bt cotton in China; however, its molecular genetics, biochemistry and physiology are poorly understood. We used a high-throughput sequencing platform to perform de novo transcriptome assembly and gene expression analyses across different developmental stages (eggs, 2nd and 5th instar nymphs, female and male adults). We obtained 20 GB of clean data and revealed 88,614 unigenes, including 23,830 clusters and 64,784 singletons. These unigene sequences were annotated and classified by Gene Ontology, Clusters of Orthologous Groups, and Kyoto Encyclopedia of Genes and Genomes databases. A large number of differentially expressed genes were discovered through pairwise comparisons between these developmental stages. Gene expression profiles were dramatically different across life-stage transitions, with some of the most differentially expressed genes being associated with sex differences, metabolism and development. Quantitative real-time PCR results confirmed the deep-sequencing findings based on relative expression levels of nine randomly selected genes. Furthermore, over 791,390 single nucleotide polymorphisms and 2,682 potential simple sequence repeats were identified. Our study provides comprehensive transcriptional gene expression information for A. suturalis that will form the basis for a better understanding of developmental pathways, hormone biosynthesis, sex differences and wing formation in mirid bugs. PMID:26047353
Sources of Variance in Baseline Gene Expression in the Rodent Liver
Corton, J. Christopher; Bushel, Pierre R.; Fostel, Jennifer; O'Lone, Raegan B.
2012-01-01
The use of gene expression profiling in both clinical and laboratory settings would be enhanced by better characterization of variation due to individual, environmental, and technical factors. Analysis of microarray data from untreated or vehicle-treated animals within the control arm of toxicogenomics studies has yielded useful information on baseline fluctuations in liver gene expression in the rodent. Here, studies which highlight contributions of different factors to gene expression variability in the rodent liver are discussed including a large meta-analysis of rat liver, which identified genes that vary in control animals in the absence of chemical treatment. Genes and their pathways that are the most and least variable were identified in a number of these studies. Life stage, fasting, sex, diet, circadian rhythm and liver lobe source can profoundly influence gene expression in the liver. Recognition of biological and technical factors that contribute to variability of background gene expression can help the investigator in the design of an experiment that maximizes sensitivity and reduces the influence of confounders that may lead to misinterpretation of genomic changes. The factors that contribute to variability in liver gene expression in rodents are likely analogous to those contributing to human interindividual variability in drug response and chemical toxicity. Identification of batteries of genes that are altered in a variety of background conditions could be used to predict responses to drugs and chemicals in appropriate models of the human liver. PMID:22230429
Lepre, Jorge; Rice, J Jeremy; Tu, Yuhai; Stolovitzky, Gustavo
2004-05-01
Despite the growing literature devoted to finding differentially expressed genes in assays probing different tissue types, little attention has been paid to the combinatorial nature of feature selection inherent to large, high-dimensional gene expression datasets. New flexible data analysis approaches capable of searching relevant subgroups of genes and experiments are needed to understand multivariate associations of gene expression patterns with observed phenotypes. We present in detail a deterministic algorithm to discover patterns of multivariate gene associations in gene expression data. The patterns discovered are differential with respect to a control dataset. The algorithm is exhaustive and efficient, reporting all existent patterns that fit a given input parameter set while avoiding enumeration of the entire pattern space. The value of the pattern discovery approach is demonstrated by finding a set of genes that differentiate between two types of lymphoma. Moreover, these genes are found to behave consistently in an independent dataset produced in a different laboratory using different arrays, thus validating the genes selected using our algorithm. We show that the genes deemed significant in terms of their multivariate statistics will be missed using other methods. Our set of pattern discovery algorithms including a user interface is distributed as a package called Genes@Work. This package is freely available to non-commercial users and can be downloaded from our website (http://www.research.ibm.com/FunGen).
Gurunathan, Rajalakshmi; Van Emden, Bernard; Panchanathan, Sethuraman; Kumar, Sudhir
2004-01-01
Background: Modern developmental biology relies heavily on the analysis of embryonic gene expression patterns. Investigators manually inspect hundreds or thousands of expression patterns to identify those that are spatially similar and to ultimately infer potential gene interactions. However, the rapid accumulation of gene expression pattern data over the last two decades, facilitated by high-throughput techniques, has produced a need for the development of efficient approaches for direct comparison of images, rather than their textual descriptions, to identify spatially similar expression patterns. Results: The effectiveness of the Binary Feature Vector (BFV) and Invariant Moment Vector (IMV) based digital representations of the gene expression patterns in finding biologically meaningful patterns was compared for a small (226 images) and a large (1819 images) dataset. For each dataset, an ordered list of images, with respect to a query image, was generated to identify overlapping and similar gene expression patterns, in a manner comparable to what a developmental biologist might do. The results showed that the BFV representation consistently outperforms the IMV representation in finding biologically meaningful matches when spatial overlap of the gene expression pattern and the genes involved are considered. Furthermore, we explored the value of conducting image-content based searches in a dataset where individual expression components (or domains) of multi-domain expression patterns were also included separately. We found that this technique improves performance of both IMV and BFV based searches. Conclusions: We conclude that the BFV representation consistently produces a more extensive and better list of biologically useful patterns than the IMV representation. The high quality of results obtained scales well as the search database becomes larger, which encourages efforts to build automated image query and retrieval systems for spatial gene expression patterns. PMID:15603586
2012-01-01
Background: DNA cytosine methylation is an epigenetic modification that has been implicated in many biological processes. However, large-scale epigenomic studies have been applied to very few plant species, and variability in methylation among specialized tissues and its relationship to gene expression is poorly understood. Results: We surveyed DNA methylation from seven distinct tissue types (vegetative bud, male inflorescence [catkin], female catkin, leaf, root, xylem, phloem) in the reference tree species black cottonwood (Populus trichocarpa). Using 5-methyl-cytosine DNA immunoprecipitation followed by Illumina sequencing (MeDIP-seq), we mapped a total of 129,360,151 36- or 32-mer reads to the P. trichocarpa reference genome. We validated MeDIP-seq results by bisulfite sequencing, and compared methylation and gene expression using published microarray data. Qualitative DNA methylation differences among tissues were obvious on a chromosome scale. Methylated genes had lower expression than unmethylated genes, but genes with methylation in transcribed regions ("gene body methylation") had even lower expression than genes with promoter methylation. Promoter methylation was more frequent than gene body methylation in all tissues except male catkins. Male catkins differed in demethylation of particular transposable element categories, in level of gene body methylation, and in expression range of genes with methylated transcribed regions. Tissue-specific gene expression patterns were correlated with both gene body and promoter methylation. Conclusions: We found striking differences among tissues in methylation, which were apparent at the chromosomal scale and when genes and transposable elements were examined. In contrast to other studies in plants, gene body methylation had a more repressive effect on transcription than promoter methylation. PMID:22251412
Shanley, Thomas P; Cvijanovich, Natalie; Lin, Richard; Allen, Geoffrey L; Thomas, Neal J; Doctor, Allan; Kalyanaraman, Meena; Tofil, Nancy M; Penfil, Scott; Monaco, Marie; Odoms, Kelli; Barnes, Michael; Sakthivel, Bhuvaneswari; Aronow, Bruce J; Wong, Hector R
2007-01-01
We have conducted longitudinal studies focused on the expression profiles of signaling pathways and gene networks in children with septic shock. Genome-level expression profiles were generated from whole blood-derived RNA of children with septic shock (n = 30) corresponding to day one and day three of septic shock, respectively. Based on sequential statistical and expression filters, day one and day three of septic shock were characterized by differential regulation of 2,142 and 2,504 gene probes, respectively, relative to controls (n = 15). Venn analysis demonstrated 239 unique genes in the day one dataset, 598 unique genes in the day three dataset, and 1,906 genes common to both datasets. Functional analyses demonstrated time-dependent, differential regulation of genes involved in multiple signaling pathways and gene networks primarily related to immunity and inflammation. Notably, multiple and distinct gene networks involving T cell- and MHC antigen-related biology were persistently downregulated on both day one and day three. Further analyses demonstrated large scale, persistent downregulation of genes corresponding to functional annotations related to zinc homeostasis. These data represent the largest reported cohort of patients with septic shock subjected to longitudinal genome-level expression profiling. The data further advance our genome-level understanding of pediatric septic shock and support novel hypotheses. PMID:17932561
Namkung, Junghyun; Nam, Jin-Wu; Park, Taesung
2007-01-01
Many genes with major effects on quantitative traits have been reported to interact with other genes. However, finding a group of interacting genes from thousands of SNPs is challenging. Hence, an efficient and robust algorithm is needed. The genetic algorithm (GA) is useful in searching for the optimal solution from a very large searchable space. In this study, we show that genome-wide interaction analysis using GA and a statistical interaction model can provide a practical method to detect biologically interacting loci. We focus our search on transcriptional regulators by analyzing gene × gene interactions for cancer-related genes. The expression values of three cancer-related genes were selected from the expression data of the Genetic Analysis Workshop 15 Problem 1 data set. We implemented a GA to identify the expression quantitative trait loci that are significantly associated with expression levels of the cancer-related genes. The time complexity of the GA was compared with that of an exhaustive search algorithm. As a result, our GA, which included heuristic methods, such as archive, elitism, and local search, has greatly reduced computational time in a genome-wide search for gene × gene interactions. In general, the GA took one-fifth the computation time of an exhaustive search for the most significant pair of single-nucleotide polymorphisms. PMID:18466570
Protists and the Wild, Wild West of Gene Expression: New Frontiers, Lawlessness, and Misfits.
Smith, David Roy; Keeling, Patrick J
2016-09-08
The DNA double helix has been called one of life's most elegant structures, largely because of its universality, simplicity, and symmetry. The expression of information encoded within DNA, however, can be far from simple or symmetric and is sometimes surprisingly variable, convoluted, and wantonly inefficient. Although exceptions to the rules exist in certain model systems, the true extent to which life has stretched the limits of gene expression is made clear by nonmodel systems, particularly protists (microbial eukaryotes). The nuclear and organelle genomes of protists are subject to the most tangled forms of gene expression yet identified. The complicated and extravagant picture of the underlying genetics of eukaryotic microbial life changes how we think about the flow of genetic information and the evolutionary processes shaping it. Here, we discuss the origins, diversity, and growing interest in noncanonical protist gene expression and its relationship to genomic architecture.
Pridans, Clare; Lillico, Simon; Whitelaw, Bruce; Hume, David A
2014-01-01
The development of macrophages requires signaling through the lineage-restricted receptor Csf1r. Macrophage-restricted expression of transgenic reporters based upon Csf1r requires the highly conserved Fms-intronic regulatory element (FIRE). We have created a lentiviral construct containing mouse FIRE and promoter. The lentivirus is capable of directing macrophage-restricted reporter gene expression in mouse, rat, human, pig, cow, sheep, and even chicken. Rat bone marrow cells transduced with the lentivirus were capable of differentiating into macrophages expressing the reporter gene in vitro. Macrophage-restricted expression may be desirable for immunization or immune response modulation, and for gene therapy for lysosomal storage diseases and some immunodeficiencies. The small size of the Csf1r transcription control elements will allow the insertion of large “cargo” for applications in gene therapy and vaccine delivery. PMID:26015955
Xoca-Orozco, Luis-Ángel; Cuellar-Torres, Esther Angélica; González-Morales, Sandra; Gutiérrez-Martínez, Porfirio; López-García, Ulises; Herrera-Estrella, Luis; Vega-Arreguín, Julio; Chacón-López, Alejandra
2017-01-01
Avocado (Persea americana) is one of the most important crops in Mexico, which is the main producer, consumer, and exporter of avocado fruit in the world. However, successful avocado commercialization is often reduced by large postharvest losses due to Colletotrichum sp., the causal agent of anthracnose. Chitosan is known to have a direct antifungal effect and also acts as an elicitor capable of stimulating a defense response in plants. However, there is little information regarding the genes that are either activated or repressed in fruits treated with chitosan. The aim of this study was to identify by RNA-seq the genes differentially regulated by the action of low molecular weight chitosan in the avocado-chitosan-Colletotrichum interaction system. The samples for RNA-seq were obtained from fruits treated with chitosan, fruits inoculated with Colletotrichum and fruits both treated with chitosan and inoculated with the fungus. Non-treated and non-inoculated fruits were also analyzed. Expression profiles showed that, at short times, the fruit-chitosan system presented a greater number of differentially expressed genes than the fruit-pathogen system. Gene Ontology analysis of differentially expressed genes showed a large number of metabolic processes regulated by chitosan, including those preventing the spread of Colletotrichum. A high correlation was also found between in silico expression and qPCR measurements for several genes involved in different metabolic pathways. PMID:28642771
Radiation Dose-Rate Effects on Gene Expression in a Mouse Biodosimetry Model
Paul, Sunirmal; Smilenov, Lubomir B.; Elliston, Carl D.; Amundson, Sally A.
2015-01-01
In the event of a nuclear accident or radiological terrorist attack, there will be a pressing need for biodosimetry to triage a large, potentially exposed population and to assign individuals to appropriate treatment. Exposures from fallout are likely, resulting in protracted dose delivery that would, in turn, impact the extent of injury. Biodosimetry approaches that can distinguish such low-dose-rate (LDR) exposures from acute exposures have not yet been developed. In this study, we used the C57BL/6 mouse model in an initial investigation of the impact of low-dose-rate delivery on the transcriptomic response in blood. While a large number of the same genes responded to LDR and acute radiation exposures, for many genes the magnitude of response was lower after LDR exposures. Some genes, however, were differentially expressed (P < 0.001, false discovery rate < 5%) in mice exposed to LDR compared with mice exposed to acute radiation. We identified a set of 164 genes that correctly classified 97% of the samples in this experiment as exposed to acute or LDR radiation using a support vector machine algorithm. Gene expression is a promising approach to radiation biodosimetry, enhanced greatly by this first demonstration of its potential for distinguishing between acute and LDR exposures. Further development of this aspect of radiation biodosimetry, either as part of a complete gene expression biodosimetry test or as an adjunct to other methods, could provide vital triage information in a mass radiological casualty event. PMID:26114327
González-González, Andrea; Hug, Shaun M; Rodríguez-Verdugo, Alejandra; Patel, Jagdish Suresh; Gaut, Brandon S
2017-11-01
Modifications to transcriptional regulators play a major role in adaptation. Here, we compared the effects of multiple beneficial mutations within and between Escherichia coli rpoB, the gene encoding the RNA polymerase β subunit, and rho, which encodes a transcriptional terminator. These two genes have harbored adaptive mutations in numerous E. coli evolution experiments but particularly in our previous large-scale thermal stress experiment, where the two genes characterized alternative adaptive pathways. To compare the effects of beneficial mutations, we engineered four advantageous mutations into each of the two genes and measured their effects on fitness, growth, gene expression and transcriptional termination at 42.2 °C. Among the eight mutations, two rho mutations had no detectable effect on relative fitness, suggesting they were beneficial only in the context of epistatic interactions. The remaining six mutations had an average relative fitness benefit of ∼20%. The rpoB mutations affected the expression of ∼1,700 genes; rho mutations affected the expression of fewer genes but most (83%) were a subset of those altered by rpoB mutants. Across the eight mutants, relative fitness correlated with the degree to which a mutation restored gene expression back to the unstressed, 37.0 °C state. The beneficial mutations in the two genes did not have identical effects on fitness, growth or gene expression, but they caused parallel phenotypic effects on gene expression and genome-wide transcriptional termination. © The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Transcriptome architecture across tissues in the pig
Ferraz, André LJ; Ojeda, Ana; López-Béjar, Manel; Fernandes, Lana T; Castelló, Anna; Folch, Josep M; Pérez-Enciso, Miguel
2008-01-01
Background: Artificial selection has resulted in animal breeds with extreme phenotypes. As an organism is made up of many different tissues and organs, each with its own genetic programme, it is pertinent to ask: How relevant is tissue in terms of total transcriptome variability? Which are the genes most distinctly expressed between tissues? Does breed or sex equally affect the transcriptome across tissues? Results: In order to gain insight into these issues, we conducted microarray expression profiling of 16 different tissues from four animals of two extreme pig breeds, Large White and Iberian, two males and two females. Mixed model analysis and neighbor-joining trees showed that tissues with similar developmental origin clustered closer than those with different embryonic origins. Often a sound biological interpretation was possible for overrepresented gene ontology categories within differentially expressed genes between groups of tissues. For instance, an excess of nervous system or muscle development genes was found among tissues of ectoderm or mesoderm origins, respectively. Tissue accounted for ~11 times more variability than sex or breed. Nevertheless, we were able to confidently identify genes with differential expression across tissues between breeds (33 genes) and between sexes (19 genes). The genes primarily affected by sex were overall different from those affected by breed or tissue. Interaction with tissue can be important for differentially expressed genes between breeds but not so much for genes whose expression differs between sexes. Conclusion: Embryonic development leaves an enduring footprint on the transcriptome. The interaction in gene × tissue for differentially expressed genes between breeds suggests that animal breeding has differentially targeted each tissue's transcriptome. PMID:18416811
Beaudoin, Trevor; Zhang, Li; Hinz, Aaron J; Parr, Christopher J; Mah, Thien-Fah
2012-06-01
Bacteria growing in biofilms are responsible for a large number of persistent infections and are often more resistant to antibiotics than are free-floating bacteria. In a previous study, we identified a Pseudomonas aeruginosa gene, ndvB, which is important for the formation of periplasmic glucans. We established that these glucans function in biofilm-specific antibiotic resistance by sequestering antibiotic molecules away from their cellular targets. In this study, we investigate another function of ndvB in biofilm-specific antibiotic resistance. DNA microarray analysis identified 24 genes that were responsive to the presence of ndvB. A subset of 20 genes, including 8 ethanol oxidation genes (ercS', erbR, exaA, exaB, eraR, pqqB, pqqC, and pqqE), was highly expressed in wild-type biofilm cells but not in ΔndvB biofilms, while 4 genes displayed the reciprocal expression pattern. Using quantitative real-time PCR, we confirmed the ndvB-dependent expression of the ethanol oxidation genes and additionally demonstrated that these genes were more highly expressed in biofilms than in planktonic cultures. Expression of erbR in ΔndvB biofilms was restored after the treatment of the biofilm with periplasmic extracts derived from wild-type biofilm cells. Inactivation of ethanol oxidation genes increased the sensitivity of biofilms to tobramycin. Together, these results reveal that ndvB affects the expression of multiple genes in biofilms and that ethanol oxidation genes are linked to biofilm-specific antibiotic resistance.
Tanigaki, Yusuke; Higashi, Takanobu; Takayama, Kotaro; Nagano, Atsushi J.; Honjo, Mie N.; Fukuda, Hirokazu
2015-01-01
In plant factories, measurements of plant conditions are necessary at an early stage of growth to predict harvest times of high value-added crops. Moreover, harvest qualities depend largely on environmental stresses that elicit plant hormone responses. However, the complexities of plant hormone networks have not been characterized under nonstress conditions. In the present study, we determined temporal expression profiles of all genes and then focused on plant hormone pathways using RNA-Seq analyses of gene expression in tomato leaves every 2 h for 48 h. In these experiments, temporally expressed genes were found in the hormone synthesis pathways for salicylic acid, abscisic acid, ethylene, and jasmonic acid. Timing of CAB expression 1 (TOC1), abscisic acid insensitive 1 (ABI1), and open stomata 1 (OST1) control stomatal gating. In this study, the expression patterns of TOC1 were similar between tomato and Arabidopsis thaliana. In contrast, the expression of tomato ABI1 and OST1 peaked at different times. These findings suggest that the regulation of stomatal gating does not depend predominantly on TOC1 and significantly reflects the extracellular environment. The present data provide new insights into relationships between temporally expressed plant hormone-related genes and clock genes under normal sunlight conditions. PMID:26624004
Genes involved in Beauveria bassiana infection to Galleria mellonella.
Chen, Anhui; Wang, Yulong; Shao, Ying; Zhou, Qiumei; Chen, Shanglong; Wu, Yonghua; Chen, Hongwei; Liu, Enqi
2018-05-01
The ascomycete fungus Beauveria bassiana is a natural pathogen of hundreds of insect species and is commercially produced as an environmentally friendly mycoinsecticide. Many genes involved in fungal insecticide infection have been identified but few have been further explored. In this study, we constructed three transcriptomes of B. bassiana at 24, 48 and 72 h post infection of insect pests (BbI) or control (BbC). There were 3148, 3613 and 4922 genes differentially expressed at 24, 48 and 72 h post BbI/BbC infection, respectively. A large number of genes and pathways involved in infection were identified. To further analyze those genes, expression patterns across different infection stages (0, 12, 24, 36, 48, 60, 72 and 84 h) were studied using quantitative RT-PCR. This analysis showed that the infection-related genes could be divided into four patterns: highly expressed throughout the whole infection process (thioredoxin 1); highly expressed during early stages of infection but lowly expressed after the insect death (adhesin protein Mad1); lowly expressed during early infection but highly expressed after insect death (cation transporter, OpS13); or lowly expressed across the entire infection process (catalase protein). The data provide novel insights into the insect-pathogen interaction and help to uncover the molecular mechanisms involved in fungal infection of insect pests.
Tissue-specific NETs alter genome organization and regulation even in a heterologous system.
de Las Heras, Jose I; Zuleger, Nikolaj; Batrakou, Dzmitry G; Czapiewski, Rafal; Kerr, Alastair R W; Schirmer, Eric C
2017-01-02
Different cell types exhibit distinct patterns of 3D genome organization that correlate with changes in gene expression in tissue and differentiation systems. Several tissue-specific nuclear envelope transmembrane proteins (NETs) have been found to influence the spatial positioning of genes and chromosomes that normally occurs during tissue differentiation. Here we study 3 such NETs: NET29, NET39, and NET47, which are expressed preferentially in fat, muscle and liver, respectively. We found that even when exogenously expressed in a heterologous system they can specify particular genome organization patterns and alter gene expression. Each NET affected largely different subsets of genes. Notably, the liver-specific NET47 upregulated many genes in HT1080 fibroblast cells that are normally upregulated in hepatogenesis, showing that tissue-specific NETs can favor expression patterns associated with the tissue where the NET is normally expressed. Similarly, global profiling of peripheral chromatin after exogenous expression of these NETs using lamin B1 DamID revealed that each NET affected the nuclear positioning of distinct sets of genomic regions with a significant tissue-specific component. Thus NET influences on genome organization can contribute to gene expression changes associated with differentiation even in the absence of other factors and overt cellular differentiation changes.
Gaffoor, Iffa; Brown, Daren W.; Plattner, Ron; Proctor, Robert H.; Qi, Weihong; Trail, Frances
2005-01-01
Polyketides are a class of secondary metabolites that exhibit a vast diversity of form and function. In fungi, these compounds are produced by large, multidomain enzymes classified as type I polyketide synthases (PKSs). In this study we identified and functionally disrupted 15 PKS genes from the genome of the filamentous fungus Gibberella zeae. Five of these genes are responsible for producing the mycotoxins zearalenone, aurofusarin, and fusarin C and the black perithecial pigment. A comprehensive expression analysis of the 15 genes revealed diverse expression patterns during grain colonization, plant colonization, sexual development, and mycelial growth. Expression of one of the PKS genes was not detected under any of 18 conditions tested. This is the first study to genetically characterize a complete set of PKS genes from a single organism. PMID:16278459
Ecological transcriptomics of lake-type and riverine sockeye salmon (Oncorhynchus nerka)
2011-01-01
Background There are a growing number of genomes sequenced with tentative functions assigned to a large proportion of the individual genes. Model organisms in laboratory settings form the basis for the assignment of gene function, and the ecological context of gene function is lacking. This work addresses this shortcoming by investigating expressed genes of sockeye salmon (Oncorhynchus nerka) muscle tissue. We compared morphology and gene expression in natural juvenile sockeye populations related to river and lake habitats. Based on previously documented divergent morphology, feeding strategy, and predation in association with these distinct environments, we expect that burst swimming is favored in riverine population and continuous swimming is favored in lake-type population. In turn we predict that morphology and expressed genes promote burst swimming in riverine sockeye and continuous swimming in lake-type sockeye. Results We found the riverine sockeye population had deep, robust bodies and lake-type had shallow, streamlined bodies. Gene expression patterns were measured using a 16K microarray, discovering 141 genes with significant differential expression. Overall, the identity and function of these genes was consistent with our hypothesis. In addition, Gene Ontology (GO) enrichment analyses with a larger set of differentially expressed genes found the "biosynthesis" category enriched for the riverine population and the "metabolism" category enriched for the lake-type population. Conclusions This study provides a framework for understanding sockeye life history from a transcriptomic perspective and a starting point for more extensive, targeted studies determining the ecological context of genes. PMID:22136247
Ecological transcriptomics of lake-type and riverine sockeye salmon (Oncorhynchus nerka).
Pavey, Scott A; Sutherland, Ben J G; Leong, Jong; Robb, Adrienne; von Schalburg, Kris; Hamon, Troy R; Koop, Ben F; Nielsen, Jennifer L
2011-12-02
There are a growing number of genomes sequenced with tentative functions assigned to a large proportion of the individual genes. Model organisms in laboratory settings form the basis for the assignment of gene function, and the ecological context of gene function is lacking. This work addresses this shortcoming by investigating expressed genes of sockeye salmon (Oncorhynchus nerka) muscle tissue. We compared morphology and gene expression in natural juvenile sockeye populations related to river and lake habitats. Based on previously documented divergent morphology, feeding strategy, and predation in association with these distinct environments, we expect that burst swimming is favored in riverine population and continuous swimming is favored in lake-type population. In turn we predict that morphology and expressed genes promote burst swimming in riverine sockeye and continuous swimming in lake-type sockeye. We found the riverine sockeye population had deep, robust bodies and lake-type had shallow, streamlined bodies. Gene expression patterns were measured using a 16 k microarray, discovering 141 genes with significant differential expression. Overall, the identity and function of these genes was consistent with our hypothesis. In addition, Gene Ontology (GO) enrichment analyses with a larger set of differentially expressed genes found the "biosynthesis" category enriched for the riverine population and the "metabolism" category enriched for the lake-type population. This study provides a framework for understanding sockeye life history from a transcriptomic perspective and a starting point for more extensive, targeted studies determining the ecological context of genes.
Effects of seawater acidification on gene expression: resolving broader-scale trends in sea urchins.
Evans, Tyler G; Watson-Wynn, Priscilla
2014-06-01
Sea urchins are ecologically and economically important calcifying organisms threatened by acidification of the global ocean caused by anthropogenic CO2 emissions. Propelled by the sequencing of the purple sea urchin (Strongylocentrotus purpuratus) genome, profiling changes in gene expression during exposure to high pCO2 seawater has emerged as a powerful and increasingly common method to infer the response of urchins to ocean change. However, analyses of gene expression are sensitive to experimental methodology, and comparisons between studies of genes regulated by ocean acidification are most often made in the context of major caveats. Here we perform meta-analyses as a means of minimizing experimental discrepancies and resolving broader-scale trends regarding the effects of ocean acidification on gene expression in urchins. Analyses across eight studies and four urchin species largely support prevailing hypotheses about the impact of ocean acidification on marine calcifiers. The predominant expression pattern involved the down-regulation of genes within energy-producing pathways, a clear indication of metabolic depression. Genes with functions in ion transport were significantly over-represented and are most plausibly contributing to intracellular pH regulation. Expression profiles provided extensive evidence for an impact on biomineralization, epitomized by the down-regulation of seven spicule matrix proteins. In contrast, expression profiles provided limited evidence for CO2-mediated developmental delay or induction of a cellular stress response. Congruence between studies of gene expression and the ocean acidification literature in general validates the accuracy of gene expression in predicting the consequences of ocean change and justifies its continued use in future studies. © 2014 Marine Biological Laboratory.
Lee, Robyn K; Hittel, Dustin S; Nyamandi, Vongai Z; Kang, Li; Soh, Jung; Sensen, Christoph W; Shearer, Jane
2012-04-01
Obesity is a chronic condition involving the excessive accumulation of adipose tissue that adversely affects all systems in the body. The aim of the present study was to employ an unbiased, genome-wide assessment of transcript abundance in order to identify common gene expression pathways within insulin-sensitive tissues in response to dietary-induced diabetes. Following 20 weeks of chow or high-fat feeding (60% kcal), age-matched mice underwent a euglycemic-hyperinsulinemic clamp to assess insulin sensitivity. High-fat-fed animals were obese and highly insulin resistant, disposing of ∼75% less glucose compared with their chow-fed counterparts. Tissues were collected, and gene expression was examined by microarray in 4 tissues known to exhibit obesity-related metabolic disturbances: white adipose tissue, skeletal muscle, liver, and heart. A total of 463 genes were differentially expressed between diets. Analysis of individual tissues showed skeletal muscle to exhibit the largest number of differentially expressed genes (191) in response to high-fat feeding, followed by adipose tissue (169), liver (115), and heart (65). Analyses revealed that the response of individual genes to obesity is distinct and largely tissue specific, with less than 10% of transcripts being shared among tissues. Although transcripts are largely tissue specific, a systems approach shows numerous commonly activated pathways, including those involved in signal transduction, inflammation, oxidative stress, substrate transport, and metabolism. This suggests a coordinated attempt by tissues to limit metabolic perturbations occurring in early-stage obesity. Many identified genes were associated with a variety of disorders, thereby serving as potential links between obesity and its related health risks.
Zeier, Zane; Aguilar, J Santiago; Lopez, Cecilia M; Devi-Rao, G B; Watson, Zachary L; Baker, Henry V; Wagner, Edward K; Bloom, David C
2010-01-01
Herpes simplex virus type 1 (HSV-1)–based vectors readily transduce neurons and have a large payload capacity, making them particularly amenable to gene therapy applications within the central nervous system (CNS). Because aspects of the host responses to HSV-1 vectors in the CNS are largely unknown, we compared the host response of a nonreplicating HSV-1 vector to that of a replication-competent HSV-1 virus using microarray analysis. In parallel, HSV-1 gene expression was tracked using HSV-specific oligonucleotide-based arrays in order to correlate viral gene expression with observed changes in host response. Microarray analysis was performed following stereotactic injection into the right hippocampal formation of mice with either a replication-competent HSV-1 or a nonreplicating recombinant of HSV-1, lacking the ICP4 gene (ICP4−). Genes that demonstrated a significant change (P < .001) in expression in response to the replicating HSV-1 outnumbered those that changed in response to mock or nonreplicating vector by approximately 3-fold. Pathway analysis revealed that both the replicating and nonreplicating vectors induced robust antigen presentation but only mild interferon, chemokine, and cytokine signaling responses. The ICP4− vector was restricted in several of the Toll-like receptor-signaling pathways, indicating reduced stimulation of the innate immune response. These array analyses suggest that although the nonreplicating vector induces detectable activation of immune response pathways, the number and magnitude of the induced response is dramatically restricted compared to the replicating vector, and with the exception of antigen presentation, host gene expression induced by the non-replicating vector largely resembles mock infection. PMID:20095947
Bi-Force: large-scale bicluster editing and its application to gene expression data biclustering
Sun, Peng; Speicher, Nora K.; Röttger, Richard; Guo, Jiong; Baumbach, Jan
2014-01-01
The explosion of biological data has dramatically reformed today's biological research. The need to integrate and analyze high-dimensional biological data on a large scale is driving the development of novel bioinformatics approaches. Biclustering, also known as ‘simultaneous clustering’ or ‘co-clustering’, has been successfully utilized to discover local patterns in gene expression data and similar biomedical data types. Here, we contribute a new heuristic: ‘Bi-Force’. It is based on the weighted bicluster editing model, to perform biclustering on arbitrary sets of biological entities, given any kind of pairwise similarities. We first evaluated the power of Bi-Force to solve dedicated bicluster editing problems by comparing Bi-Force with two existing algorithms in the BiCluE software package. We then followed a biclustering evaluation protocol in a recent review paper from Eren et al. (2013) (A comparative analysis of biclustering algorithms for gene expression data. Brief. Bioinform., 14:279–292.) and compared Bi-Force against eight existing tools: FABIA, QUBIC, Cheng and Church, Plaid, BiMax, Spectral, xMOTIFs and ISA. To this end, a suite of synthetic datasets as well as nine large gene expression datasets from Gene Expression Omnibus were analyzed. All resulting biclusters were subsequently investigated by Gene Ontology enrichment analysis to evaluate their biological relevance. The distinct theoretical foundation of Bi-Force (bicluster editing) is more powerful than strict biclustering. We thus outperformed existing tools with Bi-Force at least when following the evaluation protocols from Eren et al. Bi-Force is implemented in Java and integrated into the open source software package of BiCluE. The software as well as all used datasets are publicly available at http://biclue.mpi-inf.mpg.de. PMID:24682815
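The weighted bicluster editing model named in this abstract can be illustrated with a small sketch. It is not taken from Bi-Force or BiCluE: the function name, the data layout, and the default threshold of 0 are assumptions made for illustration. The idea is that pairs whose similarity exceeds the threshold form edges of a bipartite graph, and a candidate solution is charged the distance to the threshold for every edge it must delete or insert so that each bicluster becomes a complete bipartite block.

```python
def editing_cost(similarity, row_cluster, col_cluster, threshold=0.0):
    """Cost of editing a thresholded bipartite graph into disjoint bicliques.

    similarity  : dict mapping (row, col) -> pairwise similarity score
    row_cluster : dict mapping each row to a (hypothetical) cluster id
    col_cluster : dict mapping each column to a cluster id
    A pair counts as an edge when its similarity exceeds the threshold.
    """
    cost = 0.0
    for (row, col), score in similarity.items():
        same_cluster = row_cluster[row] == col_cluster[col]
        is_edge = score > threshold
        # An edge between clusters must be deleted; a missing edge
        # inside a cluster must be inserted. Both cost |score - threshold|.
        if same_cluster != is_edge:
            cost += abs(score - threshold)
    return cost


# Toy data: three genes (rows) and three conditions (columns).
sim = {
    ("g1", "c1"): 2.0, ("g1", "c2"): 1.5, ("g1", "c3"): -1.0,
    ("g2", "c1"): 1.0, ("g2", "c2"): -0.5, ("g2", "c3"): -2.0,
    ("g3", "c1"): -1.5, ("g3", "c2"): -1.0, ("g3", "c3"): 3.0,
}
rows = {"g1": 0, "g2": 0, "g3": 1}
cols = {"c1": 0, "c2": 0, "c3": 1}
cost = editing_cost(sim, rows, cols)
```

In this toy case only the pair (g2, c2) disagrees with the proposed biclusters, so a single edge insertion is charged. A heuristic in the spirit of the abstract would search over cluster assignments to keep this cost low; that search is not shown here.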
Divergence of Gene Body DNA Methylation and Evolution of Plant Duplicate Genes
Wang, Jun; Marowsky, Nicholas C.; Fan, Chuanzhu
2014-01-01
It has been shown that gene body DNA methylation is associated with gene expression. However, whether and how deviation of gene body DNA methylation between duplicate genes can influence their divergence remains largely unexplored. Here, we aim to elucidate the potential role of gene body DNA methylation in the fate of duplicate genes. We identified paralogous gene pairs from Arabidopsis and rice (Oryza sativa ssp. japonica) genomes and reprocessed their single-base resolution methylome data. We show that methylation in paralogous genes nonlinearly correlates with several gene properties including exon number/gene length, expression level and mutation rate. Further, we demonstrated that divergence of methylation level and pattern in paralogs indeed positively correlate with their sequence and expression divergences. This result held even after controlling for other confounding factors known to influence the divergence of paralogs. We observed that methylation level divergence might be more relevant to the expression divergence of paralogs than methylation pattern divergence. Finally, we explored the mechanisms that might give rise to the divergence of gene body methylation in paralogs. We found that exonic methylation divergence more closely correlates with expression divergence than intronic methylation divergence. We show that genomic environments (e.g., flanked by transposable elements and repetitive sequences) of paralogs generated by various duplication mechanisms are associated with the methylation divergence of paralogs. Overall, our results suggest that the changes in gene body DNA methylation could provide another avenue for duplicate genes to develop differential expression patterns and undergo different evolutionary fates in plant genomes. PMID:25310342
Kameshwar, Ayyappa Kumar Sista; Qin, Wensheng
2017-01-01
The lignin-degrading mechanisms of the popular wood-degrading white rot fungus Phanerochaete chrysosporium have been studied more extensively in the literature than its cellulose- and hemicellulose-degrading abilities. This study delineates cellulose and hemicellulose degrading mechanisms through large scale metadata analysis of P. chrysosporium gene expression data (retrieved from NCBI GEO) to understand the common expression patterns of differentially expressed genes when cultured on different growth substrates. Genes encoding glycoside hydrolase classes commonly expressed during the breakdown of cellulose (GH-5, 6, 7, 9, 44, 45, 48) and hemicellulose (GH-2, 8, 10, 11, 26, 30, 43, 47) were found to be highly expressed across varied growth conditions, including simple customized and complex natural plant biomass growth media. Genes encoding carbohydrate esterase class enzymes CE (1, 4, 8, 9, 15, 16), polysaccharide lyase class enzymes PL-8 and PL-14, and glycosyl transferase classes GT (1, 2, 4, 8, 15, 20, 35, 39, 48) were differentially expressed in natural plant biomass growth media. Based on these results, P. chrysosporium grown on natural plant biomass substrates was found to express lignin- and hemicellulose-degrading enzymes more than cellulolytic enzymes, except GH-61 (LPMO) class enzymes, in early stages. It was observed that the fate of the P. chrysosporium transcriptome is significantly affected by the wood substrate provided. We believe the gene expression findings in this study play a crucial role in developing genetically efficient microbes with effective cellulose and hemicellulose degradation abilities.
Dou, Tengfei; Zhao, Sumei; Rong, Hua; Gu, Dahai; Li, Qihua; Huang, Ying; Xu, Zhiqiang; Chu, Xiaohui; Tao, Linli; Liu, Lixian; Ge, Changrong; Te Pas, Marinus F W; Jia, Junjing
2017-06-20
Intensive selection has resulted in increased growth rates and muscularity in broiler chickens, in addition to adverse effects, including delayed organ development, sudden death syndrome, and altered metabolic rates. The biological mechanisms underlying selection responses remain largely unknown. Non-artificially-selected indigenous Chinese chicken breeds display a wide variety of phenotypes, including differential growth rate, body weight, and muscularity. The Wuding chicken breed is a fast growing large chicken breed, and the Daweishan mini chicken breed is a slow growing small chicken breed. Together they form an ideal model system to study the biological mechanisms underlying broiler chicken selection responses in a natural system. The objective of this study was to study the biological mechanisms underlying differential phenotypes between the two breeds in muscle and liver tissues, and relate these to the growth rate and body development phenotypes of the two breeds. The muscle tissue in the Wuding breed showed higher expression of muscle development genes than muscle tissue in the Daweishan chicken breed. This expression was accompanied by higher expression of acute inflammatory response genes in Wuding chicken than in Daweishan chicken. The muscle tissue of the Daweishan mini chicken breed showed higher expression of genes involved in several metabolic mechanisms including endoplasmic reticulum, protein and lipid metabolism, energy metabolism, as well as specific immune traits than in the Wuding chicken. The liver tissue showed fewer differences between the two breeds. Genes displaying higher expression in the Wuding breed than in the Daweishan breed were not associated with a specific gene network or biological mechanism. Genes highly expressed in the Daweishan mini chicken breed compared to the Wuding breed were enriched for protein metabolism, ABC receptors, signal transduction, and IL6-related mechanisms. 
We conclude that faster growth rates and larger body size are related to increased expression of genes involved in muscle development and immune response in muscle, while slower growth rates and smaller body size are related to increased general cellular metabolism. The liver of the Daweishan breed displayed increased expression of metabolic genes.
Piersma, Sjouke; Denham, Emma L; Drulhe, Samuel; Tonk, Rudi H J; Schwikowski, Benno; van Dijl, Jan Maarten
2013-01-01
Gene expression heterogeneity is a key driver for microbial adaptation to fluctuating environmental conditions, cell differentiation and the evolution of species. This phenomenon has therefore enormous implications, not only for life in general, but also for biotechnological applications where unwanted subpopulations of non-producing cells can emerge in large-scale fermentations. Only time-lapse fluorescence microscopy allows real-time measurements of gene expression heterogeneity. A major limitation in the analysis of time-lapse microscopy data is the lack of fast, cost-effective, open, simple and adaptable protocols. Here we describe TLM-Quant, a semi-automatic pipeline for the analysis of time-lapse fluorescence microscopy data that enables the user to visualize and quantify gene expression heterogeneity. Importantly, our pipeline builds on the open-source packages ImageJ and R. To validate TLM-Quant, we selected three possible scenarios, namely homogeneous expression, highly 'noisy' heterogeneous expression, and bistable heterogeneous expression in the Gram-positive bacterium Bacillus subtilis. This bacterium is both a paradigm for systems-level studies on gene expression and a highly appreciated biotechnological 'cell factory'. We conclude that the temporal resolution of such analyses with TLM-Quant is only limited by the numbers of recorded images.
Nikodemova, Maria; Yee, Jeremiah; Carney, Patrick R; Bradfield, Christopher A; Malecki, Kristen Mc
2018-04-01
Obesity has been shown to alter response to air pollution and smoking but underlying biological mechanisms are largely unknown and few studies have explored mechanisms by which obesity increases human sensitivity to environmental exposures. Overall study goals were to investigate whole blood gene expression in smokers and non-smokers to examine associations between cigarette smoke and changes in gene expression by obesity status and test for effect modification. Relative fold-changes in mRNA expression levels of 84 genes were analyzed using a Toxicity and Stress PCR array among 50 adults aged 21-54 years. Smoking status was confirmed using urinary cotinine levels. Adjusted models included age, gender, white blood cell count and body-mass index. Models comparing gene expression of smokers vs. non-smokers identified six differentially expressed genes associated with smoking after adjustments for covariates. Obesity was associated with 29 genes differentially expressed compared to non-obese. We also identified 9 genes with significant smoking/obesity interactions influencing mRNA levels in adjusted models comparing expression between smokers and non-smokers: four DNA damage related genes (GADD45A, DDB2, RAD51 and P53), two oxidative stress genes (FTH1, TXN), two hypoxia response genes (BNIP3L, ARNT), and one gene associated with unfolded protein response (ATF6B). Findings suggest that obesity alters human sensitivity to smoke exposures through several biological pathways by modifying gene expression. Additional studies are needed to fully understand the clinical impact of these effects, but risk assessments should consider underlying phenotypes, such as obesity, that may modulate sensitivity of vulnerable populations to environmental exposures. Copyright © 2018 Elsevier Ltd. All rights reserved.
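The abstract above reports relative fold-changes from a PCR array but does not say how they were computed. A common convention for such arrays is the comparative Ct (2^-ΔΔCt) calculation, sketched below as an assumption rather than as the authors' method. The function name and the example Ct values are hypothetical, and the formula assumes roughly 100% amplification efficiency for both the target and the reference gene.

```python
def fold_change(ct_target_case, ct_ref_case, ct_target_ctrl, ct_ref_ctrl):
    """Relative expression by the comparative Ct (2^-ddCt) convention.

    Each argument is a threshold-cycle (Ct) value. The 'ref' values come
    from a housekeeping gene used for normalisation (an assumption here).
    """
    # Normalise the target gene to the reference gene within each group.
    delta_case = ct_target_case - ct_ref_case
    delta_ctrl = ct_target_ctrl - ct_ref_ctrl
    # A lower Ct means more template, hence the negative exponent.
    return 2.0 ** -(delta_case - delta_ctrl)


# Hypothetical Ct values: the target crosses threshold two cycles
# earlier in the case group once normalised to the reference gene.
fc = fold_change(ct_target_case=24.0, ct_ref_case=18.0,
                 ct_target_ctrl=26.0, ct_ref_ctrl=18.0)
```

With these made-up values the normalised difference is two cycles, which corresponds to a four-fold higher expression in the case group. Covariate adjustment of the kind described in the abstract would be done in a regression model on top of such values and is not shown.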
Selby, Katja; Mascher, Gerald; Somervuo, Panu; Korkeala, Hannu
2017-01-01
Foodborne pathogenic bacteria are exposed to a number of environmental stresses during food processing, storage, and preparation, and in the human body. In order to improve the safety of food, the understanding of molecular stress response mechanisms foodborne pathogens employ is essential. Many response mechanisms that are activated during heat shock may cross-protect bacteria against other environmental stresses. To better understand the molecular mechanisms Clostridium botulinum, the causative agent of botulism, utilizes during acute heat stress and during adaptation to stressfully high temperature, the C. botulinum Group I strain ATCC 3502 was grown in continuous culture at 39°C and exposed to heat shock at 45°C, followed by prolonged heat stress at 45°C to allow adaptation of the culture to the high temperature. Growth in continuous culture was performed to exclude secondary growth phase effects or other environmental impacts on bacterial gene transcription. Changes in global gene expression profiles were studied using DNA microarray hybridization. During acute heat stress, Class I and III heat shock genes as well as members of the SOS regulon were activated. The neurotoxin gene botA and genes encoding the neurotoxin-associated proteins were suppressed throughout the study. Prolonged heat stress led to suppression of the sporulation machinery whereas genes related to chemotaxis and motility were activated. Induced expression of a large proportion of prophage genes was detected, suggesting an important role of acquired genes in the stress resistance of C. botulinum. Finally, changes in the expression of a large number of genes related to carbohydrate and amino acid metabolism indicated remodeling of the cellular metabolism. PMID:28464023
Selby, Katja; Mascher, Gerald; Somervuo, Panu; Lindström, Miia; Korkeala, Hannu
2017-01-01
Foodborne pathogenic bacteria are exposed to a number of environmental stresses during food processing, storage, and preparation, and in the human body. In order to improve the safety of food, the understanding of molecular stress response mechanisms foodborne pathogens employ is essential. Many response mechanisms that are activated during heat shock may cross-protect bacteria against other environmental stresses. To better understand the molecular mechanisms Clostridium botulinum, the causative agent of botulism, utilizes during acute heat stress and during adaptation to stressfully high temperature, the C. botulinum Group I strain ATCC 3502 was grown in continuous culture at 39°C and exposed to heat shock at 45°C, followed by prolonged heat stress at 45°C to allow adaptation of the culture to the high temperature. Growth in continuous culture was performed to exclude secondary growth phase effects or other environmental impacts on bacterial gene transcription. Changes in global gene expression profiles were studied using DNA microarray hybridization. During acute heat stress, Class I and III heat shock genes as well as members of the SOS regulon were activated. The neurotoxin gene botA and genes encoding the neurotoxin-associated proteins were suppressed throughout the study. Prolonged heat stress led to suppression of the sporulation machinery whereas genes related to chemotaxis and motility were activated. Induced expression of a large proportion of prophage genes was detected, suggesting an important role of acquired genes in the stress resistance of C. botulinum. Finally, changes in the expression of a large number of genes related to carbohydrate and amino acid metabolism indicated remodeling of the cellular metabolism.
Effect of Mild Acid on Gene Expression in Staphylococcus aureus
Weinrick, Brian; Dunman, Paul M.; McAleese, Fionnuala; Murphy, Ellen; Projan, Steven J.; Fang, Yuan; Novick, Richard P.
2004-01-01
During staphylococcal growth in glucose-supplemented medium, the pH of a culture starting near neutrality typically decreases by about 2 units due to the fermentation of glucose. Many species can comfortably tolerate the resulting mildly acidic conditions (pH, ∼5.5) by mounting a cellular response, which serves to defend the intracellular pH and, in principle, to modify gene expression for optimal performance in a mildly acidic infection site. In this report, we show that changes in staphylococcal gene expression formerly thought to represent a glucose effect are largely the result of declining pH. We examine the cellular response to mild acid by microarray analysis and define the affected gene set as the mild acid stimulon. Many of the genes encoding extracellular virulence factors are affected, as are genes involved in regulation of virulence factor gene expression, transport of sugars and peptides, intermediary metabolism, and pH homeostasis. Key results are verified by gene fusion and Northern blot hybridization analyses. The results point to, but do not define, possible regulatory pathways by which the organism senses and responds to a pH stimulus. PMID:15576791
Kanofsky, Konstantin; Lehmeyer, Mona; Schulze, Jutta; Hehl, Reinhard
2016-01-01
Plants recognize pathogens by microbe-associated molecular patterns (MAMPs) and subsequently induce an immune response. The regulation of gene expression during the immune response depends largely on cis-sequences conserved in promoters of MAMP-responsive genes. These cis-sequences can be analyzed by constructing synthetic promoters linked to a reporter gene and by testing these constructs in transient expression systems. Here, the use of the parsley (Petroselinum crispum) protoplast system for analyzing MAMP-responsive synthetic promoters is described. The synthetic promoter consists of four copies of a potential MAMP-responsive cis-sequence cloned upstream of a minimal promoter and the uidA reporter gene. The reporter plasmid contains a second reporter gene, which is constitutively expressed and hence eliminates the requirement of a second plasmid used as a transformation control. The reporter plasmid is transformed into parsley protoplasts that are elicited by the MAMP Pep25. The MAMP responsiveness is validated by comparing the reporter gene activity from MAMP-treated and untreated cells and by normalizing reporter gene activity using the constitutively expressed reporter gene.
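The normalization described in this abstract can be illustrated with a minimal sketch. The function names and the activity readings below are hypothetical and are not taken from the paper; the sketch only assumes that each sample yields one reading for the uidA reporter and one for the constitutively expressed control reporter.

```python
def normalized_activity(reporter, control):
    """Reporter activity divided by the constitutively expressed control."""
    if control <= 0:
        raise ValueError("control reporter activity must be positive")
    return reporter / control


def fold_induction(treated, untreated):
    """Ratio of normalized activity in MAMP-treated versus untreated cells.

    Each argument is a (reporter, control) pair of raw activity readings.
    """
    return normalized_activity(*treated) / normalized_activity(*untreated)


# Hypothetical readings in arbitrary units: (uidA reporter, constitutive control)
treated = (900.0, 300.0)
untreated = (200.0, 400.0)
induction = fold_induction(treated, untreated)
print(induction)
```

Dividing by the control reporter first corrects for differences in transformation efficiency between samples, so the final ratio reflects MAMP responsiveness only.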
Gene expression profiles of changes underlying different-sized human rotator cuff tendon tears.
Chaudhury, Salma; Xia, Zhidao; Thakkar, Dipti; Hakimi, Osnat; Carr, Andrew J
2016-10-01
Progressive cellular and extracellular matrix (ECM) changes related to age and disease severity have been demonstrated in rotator cuff tendon tears. Larger rotator cuff tears demonstrate structural abnormalities that potentially adversely influence healing potential. This study aimed to gain greater insight into the relationship of pathologic changes to tear size by analyzing gene expression profiles from normal rotator cuff tendons, small rotator cuff tears, and large rotator cuff tears. We analyzed gene expression profiles of 28 human rotator cuff tendons using microarrays representing the entire genome; 11 large and 5 small torn rotator cuff tendon specimens were obtained intraoperatively from tear edges, which we compared with 12 age-matched normal controls. We performed real-time polymerase chain reaction and immunohistochemistry for validation. Torn rotator cuff tendons demonstrated upregulation of a number of key genes, such as matrix metalloproteinase 3, 10, 12, 13, 15, 21, and 25; a disintegrin and metalloproteinase (ADAM) 12, 15, and 22; and aggrecan. Amyloid was downregulated in all tears. Small tears displayed upregulation of bone morphogenetic protein 5. Chemokines and cytokines that may play a role in chemotaxis were altered; interleukins 3, 10, 13, and 15 were upregulated in tears, whereas interleukins 1, 8, 11, 18, and 27 were downregulated. The gene expression profiles of normal controls and small and large rotator cuff tear groups differ significantly. Extracellular matrix remodeling genes were found to contribute to rotator cuff tear pathogenesis. Rotator cuff tears displayed upregulation of a number of matrix metalloproteinase (3, 10, 12, 13, 15, 21, and 25), a disintegrin and metalloproteinase (ADAM 12, 15, and 22) genes, and downregulation of some interleukins (1, 8, and 27), which play important roles in chemotaxis. 
These gene products may potentially have a role as biomarkers of failure of healing or therapeutic targets to improve tendon healing. Copyright © 2016 Journal of Shoulder and Elbow Surgery Board of Trustees. Published by Elsevier Inc. All rights reserved.
Sousa, Katiene Régia Silva; Ribeiro, André Mauric Frossard; Dantas, Waleska de Melo Ferreira; Oliveira, Leandro Licursi de; Gasparino, Eliane; Guimarães, Simone Eliza Facioni
2017-10-01
We aimed to compare Toll-like receptors (TLR) and cytokines expression in local Piau breed and a Commercial line (Landrace×Large White crossbred) pigs in response to vaccination against Pasteurella multocida type D. Seronegative gilts for Pasteurella multocida type D and Mycoplasma hyopneumoniae were used, from which peripheral blood mononuclear cells (PBMC) were collected in four time points (T0, T1, T2 and T3; before and after each vaccination dose). For bronchoalveolar lavage fluid cells (BALF), we set groups of vaccinated and unvaccinated animals for both genetic groups. Gene expression was evaluated on PBMC and BALF. In PBMC, when we analyzed time points within breeds, significant differences in expression for TLRs and cytokines, except TGFβ, were observed for Commercial animals. For the Piau pigs, only TGFβ showed differential expression. Comparing the expression among genetic groups, the Commercial pigs showed higher expression for TLRs after first vaccination dose, while for IL2, IL6, IL12 and IL13, higher expression was also observed in T3 and IL8 and IL10, in T1 and T3. Still comparing the breeds, the crossbred animals showed higher expression for TNFα in T1 and T2, while for TGFβ only in T2. For gene expression in BALF, vaccinated Commercial pigs showed higher expression of TLR6, TLR10, IL6, IL8, IL10, TNFα and TGFβ genes than vaccinated Piau pigs. The Commercial line pigs showed higher sensitivity to vaccination, while in local Piau breed lower responsiveness, which may partly explain genetic variability in immune response and will let us better understand the tolerance/susceptibility for pasteurellosis. Copyright © 2017 Elsevier Ltd. All rights reserved.
Macromolecular Crowding Induces Spatial Correlations That Control Gene Expression Bursting Patterns.
Norred, S Elizabeth; Caveney, Patrick M; Chauhan, Gaurav; Collier, Lauren K; Collier, C Patrick; Abel, Steven M; Simpson, Michael L
2018-05-18
Recent superresolution microscopy studies in E. coli demonstrate that the cytoplasm has highly variable local concentrations where macromolecular crowding plays a central role in establishing membrane-less compartmentalization. This spatial inhomogeneity significantly influences molecular transport and association processes central to gene expression. Yet, little is known about how macromolecular crowding influences gene expression bursting, the episodic process where mRNA and proteins are produced in bursts. Here, we simultaneously measured mRNA and protein reporters in cell-free systems, showing that macromolecular crowding decoupled the well-known relationship between fluctuations in the protein population (noise) and mRNA population statistics. Crowded environments led to a 10-fold increase in protein noise even though there were only modest changes in the mRNA population and fluctuations. Instead, cell-like macromolecular crowding created an inhomogeneous spatial distribution of mRNA ("spatial noise") that led to large variability in the protein production burst size. As a result, the mRNA spatial noise created large temporal fluctuations in the protein population. These results highlight the interplay between macromolecular crowding, spatial inhomogeneities, and the resulting dynamics of gene expression, and provide insights into using these organizational principles in both cell-based and cell-free synthetic biology.
USDA-ARS's Scientific Manuscript database
Functional annotations of large plant genome projects mostly provide information on gene function and gene families based on the presence of protein domains and gene homology, but not necessarily in association with gene expression or metabolic and regulatory networks. These additional annotations a...
Mukwaya, Anthony; Lindvall, Jessica M; Xeroudaki, Maria; Peebo, Beatrice; Ali, Zaheer; Lennikov, Anton; Jensen, Lasse Dahl Ejby; Lagali, Neil
2016-11-22
In angiogenesis with concurrent inflammation, many pathways are activated, some linked to VEGF and others largely VEGF-independent. Pathways involving inflammatory mediators, chemokines, and micro-RNAs may play important roles in maintaining a pro-angiogenic environment or mediating angiogenic regression. Here, we describe a gene expression dataset to facilitate exploration of pro-angiogenic, pro-inflammatory, and remodelling/normalization-associated genes during both an active capillary sprouting phase, and in the restoration of an avascular phenotype. The dataset was generated by microarray analysis of the whole transcriptome in a rat model of suture-induced inflammatory corneal neovascularisation. Regions of active capillary sprout growth or regression in the cornea were harvested and total RNA extracted from four biological replicates per group. High quality RNA was obtained for gene expression analysis using microarrays. Fold change of selected genes was validated by qPCR, and protein expression was evaluated by immunohistochemistry. We provide a gene expression dataset that may be re-used to investigate corneal neovascularisation, and may also have implications in other contexts of inflammation-mediated angiogenesis.
Feichtinger, Julia; Larcombe, Lee; McFarlane, Ramsay J
2014-05-15
Evidence is starting to emerge indicating that tumorigenesis in metazoans involves a soma-to-germline transition, which may contribute to the acquisition of neoplastic characteristics. Here, we have meta-analyzed gene expression profiles of the human orthologs of Drosophila melanogaster germline genes that are ectopically expressed in l(3)mbt brain tumors using gene expression datasets derived from a large cohort of human tumors. We find these germline genes, some of which drive oncogenesis in D. melanogaster, are similarly ectopically activated in a wide range of human cancers. Some of these genes normally have expression restricted to the germline, making them of particular clinical interest. Importantly, these analyses provide additional support to the emerging model that proposes a soma-to-germline transition is a general hallmark of a wide range of human tumors. This has implications for our understanding of human oncogenesis and the development of new therapeutic and biomarker targets with clinical potential. © 2013 The Authors. Published by Wiley Periodicals, Inc. on behalf of UICC.
Automation of fluorescent differential display with digital readout.
Meade, Jonathan D; Cho, Yong-Jig; Fisher, Jeffrey S; Walden, Jamie C; Guo, Zhen; Liang, Peng
2006-01-01
Since its invention in 1992, differential display (DD) has become the most commonly used technique for identifying differentially expressed genes because of its many advantages over competing technologies such as DNA microarray, serial analysis of gene expression (SAGE), and subtractive hybridization. Despite the great impact of the method on biomedical research, there has been a lack of automation of DD technology to increase its throughput and accuracy for systematic gene expression analysis. Most of previous DD work has taken a "shot-gun" approach of identifying one gene at a time, with a limited number of polymerase chain reaction (PCR) reactions set up manually, giving DD a low-tech and low-throughput image. We have optimized the DD process with a new platform that incorporates fluorescent digital readout, automated liquid handling, and large-format gels capable of running entire 96-well plates. The resulting streamlined fluorescent DD (FDD) technology offers an unprecedented accuracy, sensitivity, and throughput in comprehensive and quantitative analysis of gene expression. These major improvements will allow researchers to find differentially expressed genes of interest, both known and novel, quickly and easily.
Array data extractor (ADE): a LabVIEW program to extract and merge gene array data.
Kurtenbach, Stefan; Kurtenbach, Sarah; Zoidl, Georg
2013-12-01
Large data sets from gene expression array studies are publicly available offering information highly valuable for research across many disciplines ranging from fundamental to clinical research. Highly advanced bioinformatics tools have been made available to researchers, but a demand for user-friendly software allowing researchers to quickly extract expression information for multiple genes from multiple studies persists. Here, we present a user-friendly LabVIEW program to automatically extract gene expression data for a list of genes from multiple normalized microarray datasets. Functionality was tested for 288 class A G protein-coupled receptors (GPCRs) and expression data from 12 studies comparing normal and diseased human hearts. Results confirmed known regulation of a beta 1 adrenergic receptor and further indicate novel research targets. Although existing software allows for complex data analyses, the LabVIEW based program presented here, "Array Data Extractor (ADE)", provides users with a tool to retrieve meaningful information from multiple normalized gene expression datasets in a fast and easy way. Further, the graphical programming language used in LabVIEW allows applying changes to the program without the need of advanced programming knowledge.
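The extract-and-merge step described here can be sketched in a few lines. ADE itself is a LabVIEW program, so the Python below is only an illustration of the idea, not its implementation; the function name, dataset names, gene names and values are all hypothetical.

```python
def extract_genes(datasets, genes):
    """Return one row per requested gene with its value in every dataset.

    datasets maps a dataset name to a {gene: normalized expression} mapping.
    Genes missing from a dataset are reported as None rather than dropped.
    """
    names = sorted(datasets)
    table = {}
    for gene in genes:
        table[gene] = {name: datasets[name].get(gene) for name in names}
    return table


# Hypothetical normalized expression values from two studies
datasets = {
    "study_1": {"GENE_A": -1.2, "GENE_B": 0.1, "GENE_C": 0.4},
    "study_2": {"GENE_A": -0.8, "GENE_C": 0.0},
}
merged = extract_genes(datasets, ["GENE_A", "GENE_B"])
for gene, row in merged.items():
    print(gene, row)
```

Keeping missing genes as None makes gaps between studies visible in the merged table instead of silently shortening it.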
Clustering approaches to identifying gene expression patterns from DNA microarray data.
Do, Jin Hwan; Choi, Dong-Kug
2008-04-30
The analysis of microarray data is essential for large amounts of gene expression data. In this review we focus on clustering techniques. The biological rationale for this approach is the fact that many co-expressed genes are co-regulated, and identifying co-expressed genes could aid in functional annotation of novel genes, de novo identification of transcription factor binding sites and elucidation of complex biological pathways. Co-expressed genes are usually identified in microarray experiments by clustering techniques. There are many such methods, and the results obtained even for the same datasets may vary considerably depending on the algorithms and metrics for dissimilarity measures used, as well as on user-selectable parameters such as desired number of clusters and initial values. Therefore, biologists who want to interpret microarray data should be aware of the weakness and strengths of the clustering methods used. In this review, we survey the basic principles of clustering of DNA microarray data from crisp clustering algorithms such as hierarchical clustering, K-means and self-organizing maps, to complex clustering algorithms like fuzzy clustering.
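As a small illustration of one of the crisp clustering algorithms named above, the following is a minimal K-means sketch on toy expression profiles. The data and the choice of initial centroids are hypothetical; the initial centroids are passed in explicitly because, as the review notes, results depend on user-selected parameters such as the number of clusters and the initial values.

```python
def squared_distance(a, b):
    """Squared Euclidean distance between two expression profiles."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def mean_profile(profiles):
    """Column-wise mean of a list of equal-length profiles."""
    n = len(profiles)
    return [sum(col) / n for col in zip(*profiles)]


def kmeans(profiles, initial_centroids, max_iter=100):
    """Plain K-means: assign each profile to the nearest centroid, then update."""
    centroids = [list(c) for c in initial_centroids]
    labels = None
    for _ in range(max_iter):
        new_labels = [
            min(range(len(centroids)),
                key=lambda k: squared_distance(p, centroids[k]))
            for p in profiles
        ]
        if new_labels == labels:  # assignments stable: converged
            break
        labels = new_labels
        for k in range(len(centroids)):
            members = [p for p, lab in zip(profiles, labels) if lab == k]
            if members:  # keep the old centroid if a cluster becomes empty
                centroids[k] = mean_profile(members)
    return labels, centroids


# Toy profiles (rows: genes, columns: conditions); values are made up
profiles = [
    [0.1, 1.0, 2.1],
    [0.0, 1.1, 1.9],
    [2.0, 1.0, 0.1],
    [2.2, 0.9, 0.0],
]
labels, centroids = kmeans(profiles, [profiles[0], profiles[2]])
print(labels)
```

Rerunning with different initial centroids or a different number of clusters is a quick way to see how sensitive the grouping is to those choices.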
The many faces of REST oversee epigenetic programming of neuronal genes.
Ballas, Nurit; Mandel, Gail
2005-10-01
Nervous system development relies on a complex signaling network to engineer the orderly transitions that lead to the acquisition of a neural cell fate. Progression from the non-neuronal pluripotent stem cell to a restricted neural lineage is characterized by distinct patterns of gene expression, particularly the restriction of neuronal gene expression to neurons. Concurrently, cells outside the nervous system acquire and maintain a non-neuronal fate that permanently excludes expression of neuronal genes. Studies of the transcriptional repressor REST, which regulates a large network of neuronal genes, provide a paradigm for elucidating the link between epigenetic mechanisms and neurogenesis. REST orchestrates a set of epigenetic modifications that are distinct between non-neuronal cells that give rise to neurons and those that are destined to remain as nervous system outsiders.
Multiscale Embedded Gene Co-expression Network Analysis
Song, Won-Min; Zhang, Bin
2015-01-01
Gene co-expression network analysis has been shown effective in identifying functional co-expressed gene modules associated with complex human diseases. However, existing techniques to construct co-expression networks require some critical prior information such as predefined number of clusters, numerical thresholds for defining co-expression/interaction, or do not naturally reproduce the hallmarks of complex systems such as the scale-free degree distribution of small-worldness. Previously, a graph filtering technique called Planar Maximally Filtered Graph (PMFG) has been applied to many real-world data sets such as financial stock prices and gene expression to extract meaningful and relevant interactions. However, PMFG is not suitable for large-scale genomic data due to several drawbacks, such as the high computation complexity O(|V|^3), the presence of false-positives due to the maximal planarity constraint, and the inadequacy of the clustering framework. Here, we developed a new co-expression network analysis framework called Multiscale Embedded Gene Co-expression Network Analysis (MEGENA) by: i) introducing quality control of co-expression similarities, ii) parallelizing embedded network construction, and iii) developing a novel clustering technique to identify multi-scale clustering structures in Planar Filtered Networks (PFNs). We applied MEGENA to a series of simulated data and the gene expression data in breast carcinoma and lung adenocarcinoma from The Cancer Genome Atlas (TCGA). MEGENA showed improved performance over well-established clustering methods and co-expression network construction approaches. MEGENA revealed not only meaningful multi-scale organizations of co-expressed gene clusters but also novel targets in breast carcinoma and lung adenocarcinoma. PMID:26618778
Multiscale Embedded Gene Co-expression Network Analysis.
Song, Won-Min; Zhang, Bin
2015-11-01
Gene co-expression network analysis has been shown effective in identifying functional co-expressed gene modules associated with complex human diseases. However, existing techniques to construct co-expression networks require some critical prior information such as predefined number of clusters, numerical thresholds for defining co-expression/interaction, or do not naturally reproduce the hallmarks of complex systems such as the scale-free degree distribution of small-worldness. Previously, a graph filtering technique called Planar Maximally Filtered Graph (PMFG) has been applied to many real-world data sets such as financial stock prices and gene expression to extract meaningful and relevant interactions. However, PMFG is not suitable for large-scale genomic data due to several drawbacks, such as the high computation complexity O(|V|^3), the presence of false-positives due to the maximal planarity constraint, and the inadequacy of the clustering framework. Here, we developed a new co-expression network analysis framework called Multiscale Embedded Gene Co-expression Network Analysis (MEGENA) by: i) introducing quality control of co-expression similarities, ii) parallelizing embedded network construction, and iii) developing a novel clustering technique to identify multi-scale clustering structures in Planar Filtered Networks (PFNs). We applied MEGENA to a series of simulated data and the gene expression data in breast carcinoma and lung adenocarcinoma from The Cancer Genome Atlas (TCGA). MEGENA showed improved performance over well-established clustering methods and co-expression network construction approaches. MEGENA revealed not only meaningful multi-scale organizations of co-expressed gene clusters but also novel targets in breast carcinoma and lung adenocarcinoma.
Filling gaps in PPAR-alpha signaling through comparative nutrigenomics analysis
2009-01-01
Background The application of high-throughput genomic tools in nutrition research is a widespread practice. However, it is becoming increasingly clear that the outcome of individual expression studies is insufficient for the comprehensive understanding of such a complex field. Currently, the availability of large amounts of expression data in public repositories has opened up new challenges for microarray data analyses. We have focused on PPARα, a ligand-activated transcription factor functioning as a fatty acid sensor that controls the expression of a large set of genes in various metabolic organs such as liver, small intestine or heart. The function of PPARα is strictly connected to the function of its target genes and, although many of these have already been identified, major elements of its physiological function remain to be uncovered. To further investigate the function of PPARα, we have applied a cross-species meta-analysis approach to integrate sixteen microarray datasets studying high fat diet and PPARα signal perturbations in different organisms. Results We identified 164 genes (MDEGs) that were consistently differentially expressed in response to a high fat diet or to perturbations in PPARs signalling. In particular, we found five genes in yeast which were highly conserved and homologous to PPARα targets in mammals, potential candidates to be used as models for the equivalent mammalian genes. Moreover, a screening of the MDEGs for all known transcription factor binding sites and the comparison with a human genome-wide screening of Peroxisome Proliferator Response Elements (PPRE) enabled us to identify 20 new potential candidate genes that show both a binding site and a change in expression in the conditions studied. Lastly, we found a non-random localization of the differentially expressed genes in the genome.
Conclusion The results presented are potentially of great interest for summarizing the currently available expression data, exploiting the power of in silico analysis filtered by evolutionary conservation. The analysis enabled us to indicate potential gene candidates that could fill in the gaps with regard to the signalling of PPARα and, moreover, the non-random localization of the differentially expressed genes in the genome suggests that epigenetic mechanisms are of importance in the regulation of the transcription operated by PPARα. PMID:20003344
Aben, Nanne; Vis, Daniel J; Michaut, Magali; Wessels, Lodewyk F A
2016-09-01
Clinical response to anti-cancer drugs varies between patients. A large portion of this variation can be explained by differences in molecular features, such as mutation status, copy number alterations, methylation and gene expression profiles. We show that the classic approach for combining these molecular features (Elastic Net regression on all molecular features simultaneously) results in models that are almost exclusively based on gene expression. The gene expression features selected by the classic approach are difficult to interpret as they often represent poorly studied combinations of genes, activated by aberrations in upstream signaling pathways. To utilize all data types in a more balanced way, we developed TANDEM, a two-stage approach in which the first stage explains response using upstream features (mutations, copy number, methylation and cancer type) and the second stage explains the remainder using downstream features (gene expression). Applying TANDEM to 934 cell lines profiled across 265 drugs (GDSC1000), we show that the resulting models are more interpretable, while retaining the same predictive performance as the classic approach. Using the more balanced contributions per data type as determined with TANDEM, we find that response to MAPK pathway inhibitors is largely predicted by mutation data, while predicting response to DNA damaging agents requires gene expression data, in particular SLFN11 expression. TANDEM is available as an R package on CRAN (for more information, see http://ccb.nki.nl/software/tandem). Contact: m.michaut@nki.nl or l.wessels@nki.nl. Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press. All rights reserved.
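The two-stage idea in this abstract (fit the response on upstream features first, then fit what remains on downstream features) can be sketched as follows. This is a deliberately simplified illustration and not the TANDEM package or its API: it swaps Elastic Net for ordinary least squares on a single feature per stage, and all names and values are hypothetical.

```python
def fit_line(x, y):
    """Ordinary least squares fit of y ~ a + b * x; returns (a, b)."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxx = sum((xi - mx) ** 2 for xi in x)
    sxy = sum((xi - mx) * (yi - my) for xi, yi in zip(x, y))
    b = sxy / sxx
    return my - b * mx, b


def two_stage_fit(upstream, downstream, response):
    """Stage 1 explains the response with the upstream feature;
    stage 2 explains the stage-1 residual with the downstream feature."""
    a1, b1 = fit_line(upstream, response)
    residual = [y - (a1 + b1 * u) for u, y in zip(upstream, response)]
    a2, b2 = fit_line(downstream, residual)
    return (a1, b1), (a2, b2)


# Hypothetical toy data: response = 2 * mutation + 1 * expression + 0.5
mutation = [0.0, 0.0, 1.0, 1.0]      # upstream feature
expression = [1.0, 2.0, 1.0, 2.0]    # downstream feature
response = [1.5, 2.5, 3.5, 4.5]

stage1, stage2 = two_stage_fit(mutation, expression, response)
print(stage1, stage2)
```

Because the downstream feature only sees the residual, any variation that the upstream feature can explain is attributed to it first, which is what shifts the model away from being dominated by gene expression.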
Wang, Guofu; Bi, Lechang; Wang, Gaofeng; Huang, Feilai; Lu, Mingjing; Zhu, Kai
2018-06-01
Objectives The expression profile of GSE57691 was analyzed to identify the similarities and differences between aortic occlusive disease and abdominal aortic aneurysm. Methods The expression profile of GSE57691 was downloaded from the Gene Expression Omnibus database, including 20 small abdominal aortic aneurysm samples, 29 large abdominal aortic aneurysm samples, 9 aortic occlusive disease samples, and 10 control samples. Using the limma package in R, the differentially expressed genes were screened. Enrichment analysis was then performed for the differentially expressed genes using the Database for Annotation, Visualization, and Integrated Discovery (DAVID) online tool. Based on the STRING online tool and Cytoscape software, protein-protein interaction network and module analyses were carried out. Moreover, the integrated TF platform database and Cytoscape software were used for constructing transcriptional regulatory networks. Results In total, 1757, 354, and 396 differentially expressed genes were identified in aortic occlusive disease, large abdominal aortic aneurysm, and small abdominal aortic aneurysm samples, respectively. UBB was significantly enriched in proteolysis-related pathways with a high degree in all three groups. SPARCL1 was another gene shared by these groups and regulated by NFIA, which had a high degree in the transcriptional regulatory network. ACTB, a significantly upregulated gene in abdominal aortic aneurysm samples, could be regulated by CLIC4, which was significantly enriched in cell motion. ACLY and NFIB were identified in aortic occlusive disease and small abdominal aortic aneurysm samples, respectively, and were enriched in lipid metabolism and negative regulation of cell proliferation, respectively. Conclusions The downregulated UBB, NFIA, and SPARCL1 might play key roles in both aortic occlusive disease and abdominal aortic aneurysm, while the upregulated ACTB might be involved only in abdominal aortic aneurysm. 
ACLY and NFIB were specifically involved in aortic occlusive disease and small abdominal aortic aneurysm, respectively.
Broi, Michele G Da; Rocha, Carlos V; Meola, Juliana; Martins, Wellington P; Carvalho, Filomena M; Ferriani, Rui A; Navarro, Paula A
2017-09-01
Alterations in endometrial receptivity may be involved in the etiopathogenesis of endometriosis-related infertility. The literature has suggested that patients with endometriosis present progestin resistance, which could affect embryo implantation. We question the presence of alterations in the expression of the progesterone receptor gene (PGR) and the genes related to endometrium-embryo interaction regulated by progesterone. This pilot study compared the expression of PGR, HBEGF, ITGAV, ITGB3, and SPP1 genes in eutopic endometrium during the implantation window (IW) in infertile women with endometriosis with that observed in the endometrium of fertile and infertile controls. In this prospective case-control study, endometrial biopsies were performed during the IW in patients aged between 18 and 45 years old, with regular cycles and without endocrine/systemic dysfunctions, divided into endometriosis (END), infertile control (IC) and fertile control (FC) groups. Total RNA extraction, cDNA synthesis, and gene expression analysis by Real-Time PCR were performed. We assessed the size of the difference that our series was powered to detect. From the 687 patients who underwent diagnostic videolaparoscopy or tubal ligation at the University Hospital, 130 were eligible. Of these, 32 had endometrial samples collected, with 17 confirmed in the IW. Fifteen samples (5 END, 5 IC and 5 FC) were analyzed. There was no significant difference in the expression of any studied gene. Our sample size allowed us to identify or discard large differences (two standard deviations) among the groups. Endometriosis doesn't cause large changes in the endometrial expression of PGR, HBEGF, ITGAV, ITGB3 and SPP1 during the IW.
Broi, Michele G Da; Rocha Junior, Carlos V; Meola, Juliana; Martins, Wellington P; Carvalho, Filomena M; Ferriani, Rui A; Navarro, Paula A
2017-01-01
Objective Alterations in endometrial receptivity may be involved in the etiopathogenesis of endometriosis-related infertility. The literature has suggested that patients with endometriosis present progestin resistance, which could affect embryo implantation. We question the presence of alterations in the expression of the progesterone receptor gene (PGR) and the genes related to endometrium-embryo interaction regulated by progesterone. This pilot study compared the expression of PGR, HBEGF, ITGAV, ITGB3, and SPP1 genes in eutopic endometrium during the implantation window (IW) in infertile women with endometriosis with that observed in the endometrium of fertile and infertile controls. Methods In this prospective case-control study, endometrial biopsies were performed during the IW in patients aged between 18 and 45 years old, with regular cycles and without endocrine/systemic dysfunctions, divided into endometriosis (END), infertile control (IC) and fertile control (FC) groups. Total RNA extraction, cDNA synthesis, and gene expression analysis by Real-Time PCR were performed. We assessed the size of the difference that our series was powered to detect. Results From the 687 patients who underwent diagnostic videolaparoscopy or tubal ligation at the University Hospital, 130 were eligible. Of these, 32 had endometrial samples collected, with 17 confirmed in the IW. Fifteen samples (5 END, 5 IC and 5 FC) were analyzed. There was no significant difference in the expression of any studied gene. Our sample size allowed us to identify or discard large differences (two standard deviations) among the groups. Conclusion Endometriosis doesn't cause large changes in the endometrial expression of PGR, HBEGF, ITGAV, ITGB3 and SPP1 during the IW. PMID:28837027
A stele-enriched gene regulatory network in the Arabidopsis root
Brady, Siobhan M; Zhang, Lifang; Megraw, Molly; Martinez, Natalia J; Jiang, Eric; Yi, Charles S; Liu, Weilin; Zeng, Anna; Taylor-Teeples, Mallorie; Kim, Dahae; Ahnert, Sebastian; Ohler, Uwe; Ware, Doreen; Walhout, Albertha J M; Benfey, Philip N
2011-01-01
Tightly controlled gene expression is a hallmark of multicellular development and is accomplished by transcription factors (TFs) and microRNAs (miRNAs). Although many studies have focused on identifying downstream targets of these molecules, less is known about the factors that regulate their differential expression. We used data from high spatial resolution gene expression experiments and yeast one-hybrid (Y1H) and two-hybrid (Y2H) assays to delineate a subset of interactions occurring within a gene regulatory network (GRN) that determines tissue-specific TF and miRNA expression in plants. We find that upstream TFs are expressed in more diverse cell types than their targets and that promoters that are bound by a relatively large number of TFs correspond to key developmental regulators. The regulatory consequence of many TFs for their target was experimentally determined using genetic analysis. Remarkably, molecular phenotypes were identified for 65% of the TFs, but morphological phenotypes were associated with only 16%. This indicates that the GRN is robust, and that gene expression changes may be canalized or buffered. PMID:21245844
Query-based biclustering of gene expression data using Probabilistic Relational Models.
Zhao, Hui; Cloots, Lore; Van den Bulcke, Tim; Wu, Yan; De Smet, Riet; Storms, Valerie; Meysman, Pieter; Engelen, Kristof; Marchal, Kathleen
2011-02-15
With the availability of large scale expression compendia it is now possible to view one's own findings in the light of what is already available and retrieve genes with an expression profile similar to a set of genes of interest (i.e., a query or seed set) for a subset of conditions. To that end, a query-based strategy is needed that maximally exploits the coexpression behaviour of the seed genes to guide the biclustering, but that at the same time is robust against the presence of noisy genes in the seed set, as seed genes are often assumed, but not guaranteed, to be coexpressed in the queried compendium. Therefore, we developed ProBic, a query-based biclustering strategy based on Probabilistic Relational Models (PRMs) that exploits the use of prior distributions to extract the information contained within the seed set. We applied ProBic on a large scale Escherichia coli compendium to extend partially described regulons with potentially novel members. We compared ProBic's performance with previously published query-based biclustering algorithms, namely ISA and QDB, from the perspective of bicluster expression quality, robustness of the outcome against noisy seed sets and biological relevance. This comparison shows that ProBic is able to retrieve biologically relevant, high quality biclusters that retain their seed genes and that it is particularly strong in handling noisy seeds. ProBic is a query-based biclustering algorithm developed in a flexible framework, designed to detect biologically relevant, high quality biclusters that retain relevant seed genes even in the presence of noise or when dealing with low quality seed sets.
Kadarmideen, Haja N; Watson-haigh, Nathan S
2012-01-01
Gene co-expression networks (GCN), built using high-throughput gene expression data, are fundamental aspects of systems biology. The main aims of this study were to compare two popular approaches to building and analysing GCN. We use real ovine microarray transcriptomics datasets representing four different treatments with Metyrapone, an inhibitor of cortisol biosynthesis. We conducted several microarray quality control checks before applying GCN methods to filtered datasets. Then we compared the outputs of two methods using connectivity as a criterion, as it measures how well a node (gene) is connected within a network. The two GCN construction methods used were Weighted Gene Co-expression Network Analysis (WGCNA) and Partial Correlation and Information Theory (PCIT). Nodes were ranked based on their connectivity measures in each of the four different networks created by WGCNA and PCIT, and node ranks from the two methods were compared to identify those nodes which are highly differentially ranked (HDR). A total of 1,017 HDR nodes were identified across one or more of four networks. We investigated HDR nodes by gene enrichment analyses in relation to their biological relevance to phenotypes. We observed that, in contrast to the WGCNA method, the PCIT algorithm removes many of the edges of the most highly interconnected nodes. Removal of edges of the most highly connected nodes or hub genes will have consequences for downstream analyses and biological interpretations. In general, for large GCN construction (with > 20000 genes), access to large computer clusters, particularly those with larger amounts of shared memory, is recommended. PMID:23144540
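The connectivity criterion used to rank nodes can be sketched for a weighted network. The example below is a minimal illustration, not the study's pipeline: it assumes an unsigned, soft-thresholded adjacency of the kind used in WGCNA (absolute Pearson correlation raised to a power `beta`), and both the toy expression values and the choice `beta=6` are assumptions made for illustration.

```python
from math import sqrt


def pearson(x, y):
    """Pearson correlation of two equal-length profiles."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


def connectivity(expr, beta=6):
    """Sum of soft-thresholded edge weights for each gene (node)."""
    genes = list(expr)
    k = {g: 0.0 for g in genes}
    for i, gi in enumerate(genes):
        for gj in genes[i + 1:]:
            weight = abs(pearson(expr[gi], expr[gj])) ** beta
            k[gi] += weight
            k[gj] += weight
    return k


# Hypothetical expression profiles across four samples.
toy = {
    "g1": [1, 2, 3, 4],
    "g2": [2, 4, 6, 8],
    "g3": [4, 3, 2, 1],
    "g4": [1, 3, 2, 4],
}
k = connectivity(toy)
ranked = sorted(k, key=k.get, reverse=True)
print(ranked)
```

Ranking genes by `k` under two different network constructions and comparing the ranks is the kind of comparison the abstract describes; a method that prunes edges of hub genes would lower their `k` and shift their rank.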
Irizarry, Kristopher J L; Downs, Eileen; Bryden, Randall; Clark, Jory; Griggs, Lisa; Kopulos, Renee; Boettger, Cynthia M; Carr, Thomas J; Keeler, Calvin L; Collisson, Ellen; Drechsler, Yvonne
2017-01-01
Discovering genetic biomarkers associated with disease resistance and enhanced immunity is critical to developing advanced strategies for controlling viral and bacterial infections in different species. Macrophages, important cells of innate immunity, are directly involved in cellular interactions with pathogens, the release of cytokines activating other immune cells and antigen presentation to cells of the adaptive immune response. IFNγ is a potent activator of macrophages and increased production has been associated with disease resistance in several species. This study characterizes the molecular basis for dramatically different nitric oxide production and immune function between the B2 and the B19 haplotype chicken macrophages. A large-scale RNA sequencing approach was employed to sequence the RNA of purified macrophages from each haplotype group (B2 vs. B19) during differentiation and after stimulation. Our results demonstrate that a large number of genes exhibit divergent expression between B2 and B19 haplotype cells both before and after stimulation. These differences in gene expression appear to be regulated by complex epigenetic mechanisms that need further investigation.
Ettensohn, Charles A; Illies, Michele R; Oliveri, Paola; De Jong, Deborah L
2003-07-01
In the sea urchin embryo, the large micromeres and their progeny function as a critical signaling center and execute a complex morphogenetic program. We have identified a new and essential component of the gene network that controls large micromere specification, the homeodomain protein Alx1. Alx1 is expressed exclusively by cells of the large micromere lineage beginning in the first interphase after the large micromeres are born. Morpholino studies demonstrate that Alx1 is essential at an early stage of specification and controls downstream genes required for epithelial-mesenchymal transition and biomineralization. Expression of Alx1 is cell autonomous and regulated maternally through beta-catenin and its downstream effector, Pmar1. Alx1 expression can be activated in other cell lineages at much later stages of development, however, through a regulative pathway of skeletogenesis that is responsive to cell signaling. The Alx1 protein is highly conserved among euechinoid sea urchins and is closely related to the Cart1/Alx3/Alx4 family of vertebrate homeodomain proteins. In vertebrates, these proteins regulate the formation of skeletal elements of the limbs, face and neck. Our findings suggest that the ancestral deuterostome had a population of biomineral-forming mesenchyme cells that expressed an Alx1-like protein.
Wang, Shuaiqun; Aorigele; Kong, Wei; Zeng, Weiming; Hong, Xiaomin
2016-01-01
Gene expression data composed of thousands of genes play an important role in classification platforms and disease diagnosis. Hence, it is vital to select a small subset of salient features from a large volume of gene expression data. Lately, many researchers have devoted themselves to feature selection using diverse computational intelligence methods. However, in the process of selecting informative genes, many computational methods face difficulties in selecting small subsets for cancer classification due to the huge number of genes (high dimension) compared to the small number of samples, noisy genes, and irrelevant genes. In this paper, we propose a new hybrid algorithm, HICATS, that incorporates the imperialist competition algorithm (ICA), which performs global search, and tabu search (TS), which conducts fine-tuned search. In order to verify the performance of the proposed algorithm HICATS, we have tested it on 10 well-known benchmark gene expression classification datasets with dimensions varying from 2308 to 12600. The performance of our proposed method proved to be superior to other related works, including the conventional version of the binary optimization algorithm, in terms of classification accuracy and the number of selected genes.
Aorigele; Zeng, Weiming; Hong, Xiaomin
2016-01-01
Gene expression data composed of thousands of genes play an important role in classification platforms and disease diagnosis. Hence, it is vital to select a small subset of salient features from a large volume of gene expression data. Lately, many researchers have devoted themselves to feature selection using diverse computational intelligence methods. However, in the process of selecting informative genes, many computational methods face difficulties in selecting small subsets for cancer classification due to the huge number of genes (high dimension) compared to the small number of samples, noisy genes, and irrelevant genes. In this paper, we propose a new hybrid algorithm, HICATS, that incorporates the imperialist competition algorithm (ICA), which performs global search, and tabu search (TS), which conducts fine-tuned search. In order to verify the performance of the proposed algorithm HICATS, we have tested it on 10 well-known benchmark gene expression classification datasets with dimensions varying from 2308 to 12600. The performance of our proposed method proved to be superior to other related works, including the conventional version of the binary optimization algorithm, in terms of classification accuracy and the number of selected genes. PMID:27579323
Bickel, David R.; Montazeri, Zahra; Hsieh, Pei-Chun; Beatty, Mary; Lawit, Shai J.; Bate, Nicholas J.
2009-01-01
Motivation: Measurements of gene expression over time enable the reconstruction of transcriptional networks. However, Bayesian networks and many other current reconstruction methods rely on assumptions that conflict with the differential equations that describe transcriptional kinetics. Practical approximations of kinetic models would enable inferring causal relationships between genes from expression data of microarray, tag-based and conventional platforms, but conclusions are sensitive to the assumptions made. Results: The representation of a sufficiently large portion of genome enables computation of an upper bound on how much confidence one may place in influences between genes on the basis of expression data. Information about which genes encode transcription factors is not necessary but may be incorporated if available. The methodology is generalized to cover cases in which expression measurements are missing for many of the genes that might control the transcription of the genes of interest. The assumption that the gene expression level is roughly proportional to the rate of translation led to better empirical performance than did either the assumption that the gene expression level is roughly proportional to the protein level or the Bayesian model average of both assumptions. Availability: http://www.oisb.ca points to R code implementing the methods (R Development Core Team 2004). Contact: dbickel@uottawa.ca Supplementary information: http://www.davidbickel.com PMID:19218351
Ji, Shuiwang
2013-07-11
The structured organization of cells in the brain plays a key role in its functional efficiency. This delicate organization is the consequence of unique molecular identity of each cell gradually established by precise spatiotemporal gene expression control during development. Currently, studies on the molecular-structural association are beginning to reveal how the spatiotemporal gene expression patterns are related to cellular differentiation and structural development. In this article, we aim at a global, data-driven study of the relationship between gene expressions and neuroanatomy in the developing mouse brain. To enable visual explorations of the high-dimensional data, we map the in situ hybridization gene expression data to a two-dimensional space by preserving both the global and the local structures. Our results show that the developing brain anatomy is largely preserved in the reduced gene expression space. To provide a quantitative analysis, we cluster the reduced data into groups and measure the consistency with neuroanatomy at multiple levels. Our results show that the clusters in the low-dimensional space are more consistent with neuroanatomy than those in the original space. Gene expression patterns and developing brain anatomy are closely related. Dimensionality reduction and visual exploration facilitate the study of this relationship.
Genome-wide misexpression of X-linked versus autosomal genes associated with hybrid male sterility
Lu, Xuemei; Shapiro, Joshua A.; Ting, Chau-Ti; Li, Yan; Li, Chunyan; Xu, Jin; Huang, Huanwei; Cheng, Ya-Jen; Greenberg, Anthony J.; Li, Shou-Hsien; Wu, Mao-Lien; Shen, Yang; Wu, Chung-I
2010-01-01
Postmating reproductive isolation is often manifested as hybrid male sterility, for which X-linked genes are overrepresented (the so-called large X effect). In contrast, X-linked genes are significantly under-represented among testis-expressing genes. This seeming contradiction may be germane to the X:autosome imbalance hypothesis on hybrid sterility, in which the X-linked effect is mediated mainly through the misexpression of autosomal genes. In this study, we compared gene expression in fertile and sterile males in the hybrids between two Drosophila species. These hybrid males differ only in a small region of the X chromosome containing the Ods-site homeobox (OdsH) (also known as Odysseus) locus of hybrid sterility. Of genes expressed in the testis, autosomal genes were, indeed, more likely to be misexpressed than X-linked genes under the sterilizing action of OdsH. Since this mechanism of X:autosome interaction is only associated with spermatogenesis, a connection between X:autosome imbalance and the high rate of hybrid male sterility seems plausible. PMID:20511493
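The claim that autosomal genes were more likely to be misexpressed than X-linked genes is a comparison of two proportions, which can be checked with a test on a 2x2 table. The sketch below uses a one-sided Fisher's exact test as one reasonable choice; the abstract does not say which test the authors used, and the counts are hypothetical placeholders, not data from the study.

```python
from math import comb


def fisher_greater(a, b, c, d):
    """One-sided Fisher's exact test for the table [[a, b], [c, d]].

    Returns P(top-left cell >= a) under the hypergeometric null, i.e.
    evidence that row 1 is enriched for column 1.
    """
    row1, row2 = a + b, c + d
    col1 = a + c
    total = comb(row1 + row2, col1)
    upper = min(row1, col1)
    tail = sum(comb(row1, k) * comb(row2, col1 - k) for k in range(a, upper + 1))
    return tail / total


# Hypothetical counts: rows are autosomal vs X-linked testis-expressed genes,
# columns are misexpressed vs not misexpressed.
p_value = fisher_greater(3, 1, 1, 3)
print(round(p_value, 4))
```

With real data, the first row would hold the autosomal counts and the second the X-linked counts; a small p-value would support the abstract's statement that misexpression is concentrated on the autosomes.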
Genome-wide misexpression of X-linked versus autosomal genes associated with hybrid male sterility.
Lu, Xuemei; Shapiro, Joshua A; Ting, Chau-Ti; Li, Yan; Li, Chunyan; Xu, Jin; Huang, Huanwei; Cheng, Ya-Jen; Greenberg, Anthony J; Li, Shou-Hsien; Wu, Mao-Lien; Shen, Yang; Wu, Chung-I
2010-08-01
Postmating reproductive isolation is often manifested as hybrid male sterility, for which X-linked genes are overrepresented (the so-called large X effect). In contrast, X-linked genes are significantly under-represented among testis-expressing genes. This seeming contradiction may be germane to the X:autosome imbalance hypothesis on hybrid sterility, in which the X-linked effect is mediated mainly through the misexpression of autosomal genes. In this study, we compared gene expression in fertile and sterile males in the hybrids between two Drosophila species. These hybrid males differ only in a small region of the X chromosome containing the Ods-site homeobox (OdsH) (also known as Odysseus) locus of hybrid sterility. Of genes expressed in the testis, autosomal genes were, indeed, more likely to be misexpressed than X-linked genes under the sterilizing action of OdsH. Since this mechanism of X:autosome interaction is only associated with spermatogenesis, a connection between X:autosome imbalance and the high rate of hybrid male sterility seems plausible.
Hodgins-Davis, Andrea; Adomas, Aleksandra B.; Warringer, Jonas; Townsend, Jeffrey P.
2012-01-01
Genetic variation for plastic phenotypes potentially contributes phenotypic variation to populations that can be selected during adaptation to novel ecological contexts. However, the basis and extent of plastic variation that manifests in diverse environments remains elusive. Here, we characterize copper reaction norms for mRNA abundance among five Saccharomyces cerevisiae strains to 1) describe population variation across the full range of ecologically relevant copper concentrations, from starvation to toxicity, and 2) to test the hypothesis that plastic networks exhibit increased population variation for gene expression. We find that although the vast majority of the variation is small in magnitude (considerably <2-fold), not just some, but most genes demonstrate variable expression across environments, across genetic backgrounds, or both. Plastically expressed genes included both genes regulated directly by copper-binding transcription factors Mac1 and Ace1 and genes indirectly responding to the downstream metabolic consequences of the copper gradient, particularly genes involved in copper, iron, and sulfur homeostasis. Copper-regulated gene networks exhibited more similar behavior within the population in environments where those networks have a large impact on fitness. Nevertheless, expression variation in genes like Cup1, important to surviving copper stress, was linked with variation in mitotic fitness and in the breadth of differential expression across the genome. By revealing a broader and deeper range of population variation, our results provide further evidence for the interconnectedness of genome-wide mRNA levels, their dependence on environmental context and genetic background, and the abundance of variation in gene expression that can contribute to future evolution. PMID:23019066
Modulation of gene expression in heart and liver of hibernating black bears (Ursus americanus)
2011-01-01
Background Hibernation is an adaptive strategy to survive in highly seasonal or unpredictable environments. The molecular and genetic basis of hibernation physiology in mammals has only recently been studied using large scale genomic approaches. We analyzed gene expression in the American black bear, Ursus americanus, using a custom 12,800 cDNA probe microarray to detect differences in expression that occur in heart and liver during winter hibernation in comparison to summer active animals. Results We identified 245 genes in heart and 319 genes in liver that were differentially expressed between winter and summer. The expression of 24 genes was significantly elevated during hibernation in both heart and liver. These genes are mostly involved in lipid catabolism and protein biosynthesis and include RNA binding protein motif 3 (Rbm3), which enhances protein synthesis at mildly hypothermic temperatures. Elevated expression of protein biosynthesis genes suggests induction of translation that may be related to adaptive mechanisms reducing cardiac and muscle atrophies over extended periods of low metabolism and immobility during hibernation in bears. Coordinated reduction of transcription of genes involved in amino acid catabolism suggests redirection of amino acids from catabolic pathways to protein biosynthesis. We identify transcriptional changes in the liver that are common to black bears and small mammalian hibernators, including induction of genes responsible for fatty acid β oxidation and carbohydrate synthesis and depression of genes involved in lipid biosynthesis, carbohydrate catabolism, cellular respiration and detoxification pathways. Conclusions Our findings show that modulation of gene expression during winter hibernation represents a molecular mechanism of adaptation to extreme environments. PMID:21453527
Cha, Kihoon; Hwang, Taeho; Oh, Kimin; Yi, Gwan-Su
2015-01-01
It has been reported that several brain diseases can be treated in a transnosological manner, implying a possible common molecular basis underlying those diseases. However, molecular level commonality among those brain diseases has been largely unexplored. Gene expression analyses of human brain have been used to find genes associated with brain diseases but most of those studies were restricted either to an individual disease or to a couple of diseases. In addition, identifying significant genes in such brain diseases mostly failed when it used typical methods depending on differentially expressed genes. In this study, we used a correlation-based biclustering approach to find coexpressed gene sets in five neurodegenerative diseases and three psychiatric disorders. By using biclustering analysis, we could efficiently and fairly identify various gene sets expressed specifically in both single and multiple brain diseases. We could find 4,307 gene sets correlatively expressed in multiple brain diseases and 3,409 gene sets exclusively specified in individual brain diseases. The function enrichment analysis of those gene sets showed many new possible functional bases as well as neurological processes that are common or specific for those eight diseases. This study introduces possible common molecular bases for several brain diseases, which opens the opportunity to clarify the transnosological perspective assumed in brain diseases. It also showed the advantages of correlation-based biclustering analysis and accompanying function enrichment analysis for gene expression data in this type of investigation.
2015-01-01
Background It has been reported that several brain diseases can be treated in a transnosological manner, implying a possible common molecular basis underlying those diseases. However, molecular level commonality among those brain diseases has been largely unexplored. Gene expression analyses of human brain have been used to find genes associated with brain diseases but most of those studies were restricted either to an individual disease or to a couple of diseases. In addition, identifying significant genes in such brain diseases mostly failed when it used typical methods depending on differentially expressed genes. Results In this study, we used a correlation-based biclustering approach to find coexpressed gene sets in five neurodegenerative diseases and three psychiatric disorders. By using biclustering analysis, we could efficiently and fairly identify various gene sets expressed specifically in both single and multiple brain diseases. We could find 4,307 gene sets correlatively expressed in multiple brain diseases and 3,409 gene sets exclusively specified in individual brain diseases. The function enrichment analysis of those gene sets showed many new possible functional bases as well as neurological processes that are common or specific for those eight diseases. Conclusions This study introduces possible common molecular bases for several brain diseases, which opens the opportunity to clarify the transnosological perspective assumed in brain diseases. It also showed the advantages of correlation-based biclustering analysis and accompanying function enrichment analysis for gene expression data in this type of investigation. PMID:26043779
Gibbons, Taylor C; Metzger, David C H; Healy, Timothy M; Schulte, Patricia M
2017-05-01
Phenotypic plasticity is thought to facilitate the colonization of novel environments and shape the direction of evolution in colonizing populations. However, the relative prevalence of various predicted patterns of changes in phenotypic plasticity following colonization remains unclear. Here, we use a whole-transcriptome approach to characterize patterns of gene expression plasticity in the gills of a freshwater-adapted and a saltwater-adapted ecotype of threespine stickleback (Gasterosteus aculeatus) exposed to a range of salinities. The response of the gill transcriptome to environmental salinity had a large shared component common to both ecotypes (2159 genes) with significant enrichment of genes involved in transmembrane ion transport and the restructuring of the gill epithelium. This transcriptional response to freshwater acclimation is induced at salinities below two parts per thousand. There was also differentiation in gene expression patterns between ecotypes (2515 genes), particularly in processes important for changes in the gill structure and permeability. Only 508 genes that differed between ecotypes also responded to salinity and no specific processes were enriched among this gene set, and an even smaller number (87 genes) showed evidence of changes in the extent of the response to salinity acclimation between ecotypes. No pattern of relative expression dominated among these genes, suggesting that neither gains nor losses of plasticity dominated the changes in expression patterns between the ecotypes. These data demonstrate that multiple patterns of changes in gene expression plasticity can occur following colonization of novel habitats. © 2017 John Wiley & Sons Ltd.
Yan, Yan; Wang, Lianzhe; Ding, Zehong; Tie, Weiwei; Ding, Xupo; Zeng, Changying; Wei, Yunxie; Zhao, Hongliang; Peng, Ming; Hu, Wei
2016-01-01
Mitogen-activated protein kinases (MAPKs) play central roles in plant developmental processes, hormone signaling transduction, and responses to abiotic stress. However, no data are currently available about the MAPK family in cassava, an important tropical crop. Herein, 21 MeMAPK genes were identified from cassava. Phylogenetic analysis indicated that MeMAPKs could be classified into four subfamilies. Gene structure analysis demonstrated that the number of introns in MeMAPK genes ranged from 1 to 10, suggesting large variation among cassava MAPK genes. Conserved motif analysis indicated that all MeMAPKs had typical protein kinase domains. Transcriptomic analysis suggested that MeMAPK genes showed differential expression patterns in distinct tissues and in response to drought stress between wild subspecies and cultivated varieties. Interaction networks and co-expression analyses revealed that crucial pathways controlled by MeMAPK networks may be involved in the differential response to drought stress in different accessions of cassava. Expression of nine selected MAPK genes showed that these genes could comprehensively respond to osmotic, salt, cold, oxidative stressors, and abscisic acid (ABA) signaling. These findings yield new insights into the transcriptional control of MAPK gene expression, provide an improved understanding of abiotic stress responses and signaling transduction in cassava, and lead to potential applications in the genetic improvement of cassava cultivars. PMID:27625666
DOE Office of Scientific and Technical Information (OSTI.GOV)
Friddle, Carl J; Koga, Teiichiro; Rubin, Edward M.
2000-03-15
While cardiac hypertrophy has been the subject of intensive investigation, regression of hypertrophy has been significantly less studied, precluding large-scale analysis of the relationship between these processes. In the present study, using pharmacological models of hypertrophy in mice, expression profiling was performed with fragments of more than 3,000 genes to characterize and contrast expression changes during induction and regression of hypertrophy. Administration of angiotensin II and isoproterenol by osmotic minipump produced increases in heart weight (15% and 40% respectively) that returned to pre-induction size following drug withdrawal. From multiple expression analyses of left ventricular RNA isolated at daily time-points during cardiac hypertrophy and regression, we identified sets of genes whose expression was altered at specific stages of this process. While confirming the participation of 25 genes or pathways previously known to be altered by hypertrophy, a larger set of 30 genes was identified whose expression had not previously been associated with cardiac hypertrophy or regression. Of the 55 genes that showed reproducible changes during the time course of induction and regression, 32 genes were altered only during induction and 8 were altered only during regression. This study identified both known and novel genes whose expression is affected at different stages of cardiac hypertrophy and regression and demonstrates that cardiac remodeling during regression utilizes a set of genes that are distinct from those used during induction of hypertrophy.
Marbach, Daniel; Roy, Sushmita; Ay, Ferhat; Meyer, Patrick E.; Candeias, Rogerio; Kahveci, Tamer; Bristow, Christopher A.; Kellis, Manolis
2012-01-01
Gaining insights on gene regulation from large-scale functional data sets is a grand challenge in systems biology. In this article, we develop and apply methods for transcriptional regulatory network inference from diverse functional genomics data sets and demonstrate their value for gene function and gene expression prediction. We formulate the network inference problem in a machine-learning framework and use both supervised and unsupervised methods to predict regulatory edges by integrating transcription factor (TF) binding, evolutionarily conserved sequence motifs, gene expression, and chromatin modification data sets as input features. Applying these methods to Drosophila melanogaster, we predict ∼300,000 regulatory edges in a network of ∼600 TFs and 12,000 target genes. We validate our predictions using known regulatory interactions, gene functional annotations, tissue-specific expression, protein–protein interactions, and three-dimensional maps of chromosome conformation. We use the inferred network to identify putative functions for hundreds of previously uncharacterized genes, including many in nervous system development, which are independently confirmed based on their tissue-specific expression patterns. Last, we use the regulatory network to predict target gene expression levels as a function of TF expression, and find significantly higher predictive power for integrative networks than for motif or ChIP-based networks. Our work reveals the complementarity between physical evidence of regulatory interactions (TF binding, motif conservation) and functional evidence (coordinated expression or chromatin patterns) and demonstrates the power of data integration for network inference and studies of gene regulation at the systems level. PMID:22456606
A comparative analysis of biclustering algorithms for gene expression data
Eren, Kemal; Deveci, Mehmet; Küçüktunç, Onur; Çatalyürek, Ümit V.
2013-01-01
The need to analyze high-dimension biological data is driving the development of new data mining methods. Biclustering algorithms have been successfully applied to gene expression data to discover local patterns, in which a subset of genes exhibit similar expression levels over a subset of conditions. However, it is not clear which algorithms are best suited for this task. Many algorithms have been published in the past decade, most of which have been compared only to a small number of algorithms. Surveys and comparisons exist in the literature, but because of the large number and variety of biclustering algorithms, they are quickly outdated. In this article we partially address this problem of evaluating the strengths and weaknesses of existing biclustering methods. We used the BiBench package to compare 12 algorithms, many of which were recently published or have not been extensively studied. The algorithms were tested on a suite of synthetic data sets to measure their performance on data with varying conditions, such as different bicluster models, varying noise, varying numbers of biclusters and overlapping biclusters. The algorithms were also tested on eight large gene expression data sets obtained from the Gene Expression Omnibus. Gene Ontology enrichment analysis was performed on the resulting biclusters, and the best enrichment terms are reported. Our analyses show that the biclustering method and its parameters should be selected based on the desired model, whether that model allows overlapping biclusters, and its robustness to noise. In addition, we observe that the biclustering algorithms capable of finding more than one model are more successful at capturing biologically relevant clusters. PMID:22772837
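The idea of a bicluster model, a subset of genes with coherent expression over a subset of conditions, can be made concrete with a coherence score. The sketch below computes the mean squared residue, a score introduced by Cheng and Church for additive biclusters; it is offered only as one familiar example and is not claimed to be a metric used in this comparison. The toy matrix and the function name are assumptions.

```python
def mean_squared_residue(matrix, rows, cols):
    """Mean squared residue of the submatrix given by rows and cols.

    A value of 0 means the submatrix follows a perfect additive model
    (each entry = row effect + column effect + constant).
    """
    sub = [[matrix[i][j] for j in cols] for i in rows]
    n_r, n_c = len(rows), len(cols)
    row_means = [sum(r) / n_c for r in sub]
    col_means = [sum(sub[i][j] for i in range(n_r)) / n_r for j in range(n_c)]
    overall = sum(row_means) / n_r
    total = sum(
        (sub[i][j] - row_means[i] - col_means[j] + overall) ** 2
        for i in range(n_r)
        for j in range(n_c)
    )
    return total / (n_r * n_c)


# Hypothetical genes-by-conditions matrix: the top-left 3x3 block is additive.
data = [
    [1, 2, 3, 9],
    [2, 3, 4, 1],
    [4, 5, 6, 7],
    [0, 8, 1, 3],
]
coherent = mean_squared_residue(data, [0, 1, 2], [0, 1, 2])
whole = mean_squared_residue(data, [0, 1, 2, 3], [0, 1, 2, 3])
print(coherent, whole)
```

A score like this rewards only one bicluster model (additive shifts), which illustrates the abstract's point that the choice of algorithm should follow from the model one expects to find.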
Nagel, Stefan; Ehrentraut, Stefan; Tomasch, Jürgen; Quentmeier, Hilmar; Meyer, Corinna; Kaufmann, Maren; Drexler, Hans G; MacLeod, Roderick A F
2013-01-01
Homeobox genes encode transcription factors ubiquitously involved in basic developmental processes, deregulation of which promotes cell transformation in multiple cancers including hematopoietic malignancies. In particular, NKL-family homeobox genes TLX1, TLX3 and NKX2-5 are ectopically activated by chromosomal rearrangements in T-cell neoplasias. Here, using transcriptional microarray profiling and RQ-PCR we identified ectopic expression of NKL-family member NKX2-1, in a diffuse large B-cell lymphoma (DLBCL) cell line SU-DHL-5. Moreover, in silico analysis demonstrated NKX2-1 overexpression in 5% of examined DLBCL patient samples. NKX2-1 is physiologically expressed in lung and thyroid tissues where it regulates differentiation. Chromosomal and genomic analyses excluded rearrangements at the NKX2-1 locus in SU-DHL-5, implying alternative activation. Comparative expression profiling implicated several candidate genes in NKX2-1 regulation, variously encoding transcription factors, chromatin modifiers and signaling components. Accordingly, siRNA-mediated knockdown and overexpression studies confirmed involvement of transcription factor HEY1, histone methyltransferase MLL and ubiquitinated histone H2B in NKX2-1 deregulation. Chromosomal aberrations targeting MLL at 11q23 and the histone gene cluster HIST1 at 6p22 which we observed in SU-DHL-5 may, therefore, represent fundamental mutations mediating an aberrant chromatin structure at NKX2-1. Taken together, we identified ectopic expression of NKX2-1 in DLBCL cells, representing the central player in an oncogenic regulative network compromising B-cell differentiation. Thus, our data extend the paradigm of NKL homeobox gene deregulation in lymphoid malignancies.
MethHC: a database of DNA methylation and gene expression in human cancer.
Huang, Wei-Yun; Hsu, Sheng-Da; Huang, Hsi-Yuan; Sun, Yi-Ming; Chou, Chih-Hung; Weng, Shun-Long; Huang, Hsien-Da
2015-01-01
We present MethHC (http://MethHC.mbc.nctu.edu.tw), a database comprising a systematic integration of a large collection of DNA methylation data and mRNA/microRNA expression profiles in human cancer. DNA methylation is an important epigenetic regulator of gene transcription, and genes with high levels of DNA methylation in their promoter regions are transcriptionally silent. Increasing numbers of DNA methylation and mRNA/microRNA expression profiles are being published in different public repositories. These data can help researchers to identify epigenetic patterns that are important for carcinogenesis. MethHC integrates data such as DNA methylation, mRNA expression, DNA methylation of microRNA gene and microRNA expression to identify correlations between DNA methylation and mRNA/microRNA expression from TCGA (The Cancer Genome Atlas), which includes 18 human cancers in more than 6000 samples, 6548 microarrays and 12 567 RNA sequencing data. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.
Human Papillomavirus Genome Integration and Head and Neck Cancer.
Pinatti, L M; Walline, H M; Carey, T E
2018-06-01
We conducted a critical review of human papillomavirus (HPV) integration into the host genome in oral/oropharyngeal cancer, reviewed the literature for HPV-induced cancers, and obtained current data for HPV-related oral and oropharyngeal cancers. In addition, we performed studies to identify HPV integration sites and the relationship of integration to viral-host fusion transcripts and whether integration is required for HPV-associated oncogenesis. Viral integration of HPV into the host genome is not required for the viral life cycle and might not be necessary for cellular transformation, yet HPV integration is frequently reported in cervical and head and neck cancer specimens. Studies of large numbers of early cervical lesions revealed frequent viral integration into gene-poor regions of the host genome with comparatively rare integration into cellular genes, suggesting that integration is a stochastic event and that site of integration may be largely a function of chance. However, more recent studies of head and neck squamous cell carcinomas (HNSCCs) suggest that integration may represent an additional oncogenic mechanism through direct effects on cancer-related gene expression and generation of hybrid viral-host fusion transcripts. In HNSCC cell lines as well as primary tumors, integration into cancer-related genes leading to gene disruption has been reported. The studies have shown that integration-induced altered gene expression may be associated with tumor recurrence. Evidence from several studies indicates that viral integration into genic regions is accompanied by local amplification, increased expression in some cases, interruption of gene expression, and likely additional oncogenic effects. Similarly, reported examples of viral integration near microRNAs suggest that altered expression of these regulatory molecules may also contribute to oncogenesis. Future work is indicated to identify the mechanisms of these events on cancer cell behavior.
2010-01-01
Background Similar to human breast cancer, mammary tumors of the female dog are commonly associated with a fatal outcome due to the development of distant metastases. However, the molecular defects leading to metastasis are largely unknown and the value of canine mammary carcinoma as a model for human breast cancer is unclear. In this study, we analyzed the gene expression signatures associated with mammary tumor metastasis and asked for parallels with the human equivalent. Methods Messenger RNA expression profiles of twenty-seven lymph node metastasis positive or negative canine mammary carcinomas were established by microarray analysis. Differentially expressed genes were functionally characterized and associated with molecular pathways. The findings were also correlated with published data on human breast cancer. Results Metastatic canine mammary carcinomas had 1,011 significantly differentially expressed genes when compared to non-metastatic carcinomas. Metastatic carcinomas had a significant up-regulation of genes associated with cell cycle regulation, matrix modulation, protein folding and proteasomal degradation whereas cell differentiation genes, growth factor pathway genes and regulators of actin organization were significantly down-regulated. Interestingly, 265 of the 1,011 differentially expressed canine genes are also related to human breast cancer and, vice versa, parts of a human prognostic gene signature were identified in the expression profiles of the metastatic canine tumors. Conclusions Metastatic canine mammary carcinomas can be discriminated from non-metastatic carcinomas by their gene expression profiles. More than one third of the differentially expressed genes have also been described as relevant to human breast cancer. Many of the differentially expressed genes are linked to functions and pathways which appear to be relevant for the induction and maintenance of metastatic progression and may represent new therapeutic targets. Furthermore, dogs are in some aspects suitable as a translational model for human breast tumors in order to identify prognostic molecular signatures and potential therapeutic targets. PMID:21062462
Transcription in space--environmental vs. genetic effects on differential immune gene expression.
Lenz, Tobias L
2015-09-01
Understanding how organisms adapt to their local environment is one of the key goals in molecular ecology. Adaptation can be achieved through qualitative changes in the coding sequence and/or quantitative changes in gene expression, where the optimal dosage of a gene's product in a given environment is being selected for. Differences in gene expression among populations inhabiting distinct environments can be suggestive of locally adapted gene regulation and have thus been studied in different species (Whitehead & Crawford ; Hodgins-Davis & Townsend ). However, in contrast to a gene's coding sequence, its expression level at a given point in time may depend on various factors, including the current environment. Although critical for understanding the extent of local adaptation, it is usually difficult to disentangle the heritable differences in gene regulation from environmental effects. In this issue of Molecular Ecology, Stutz et al. () describe an experiment in which they reciprocally transplanted three-spined sticklebacks (Gasterosteus aculeatus) between independent pairs of small and large lakes. Their experimental design allows them to attribute differences in gene expression among sticklebacks either to lake of origin or destination lake. Interestingly, they find that translocated sticklebacks show a pattern of gene expression more similar to individuals from the destination lake than to individuals from the lake of origin, suggesting that expression of the targeted genes is more strongly regulated by environmental effects than by genetics. The environmental effect by itself is not entirely surprising; however, the relative extent of it is. Especially when put in the context of local adaptation and population differentiation, as done here, these findings cast a new light onto the heritability of differential gene expression and specifically its relative importance during population divergence and ultimately ecological speciation. © 2015 John Wiley & Sons Ltd.
A qRT-PCR assay for the expression of all Mal d 1 isoallergen genes
2013-01-01
Background A considerable number of individuals suffer from oral allergy syndrome (OAS) to apple, resulting in the avoidance of apple consumption. Apple cultivars differ greatly in their allergenic properties, but knowledge of the causes for such differences is incomplete. Mal d 1 is considered the major apple allergen. For Mal d 1, a wide range of isoallergens and variants exist, and they are encoded by a large gene family. To identify the specific proteins/genes that are potentially involved in the allergy, we developed a PCR assay to monitor the expression of each individual Mal d 1 gene. Gene-specific primer pairs were designed for the exploitation of sequence differences among Mal d 1 genes. The specificity of these primers was validated using both in silico and in vitro techniques. Subsequently, this assay was applied to the peel and flesh of fruits from the two cultivars ‘Florina’ and ‘Gala’. Results We successfully developed gene-specific primer pairs for each of the 31 Mal d 1 genes and incorporated them into a qRT-PCR assay. The results from the application of the assay showed that 11 genes were not expressed in fruit. In addition, differential expression was observed among the Mal d 1 genes that were expressed in the fruit. Moreover, the expression levels were tissue and cultivar dependent. Conclusion The assay developed in this study facilitated the first characterisation of the expression levels of all known Mal d 1 genes in a gene-specific manner. Using this assay on different fruit tissues and cultivars, we obtained knowledge concerning gene relevance in allergenicity. This study provides new perspectives for research on both plant breeding and immunotherapy. PMID:23522122
A multiplex branched DNA assay for parallel quantitative gene expression profiling.
Flagella, Michael; Bui, Son; Zheng, Zhi; Nguyen, Cung Tuong; Zhang, Aiguo; Pastor, Larry; Ma, Yunqing; Yang, Wen; Crawford, Kimberly L; McMaster, Gary K; Witney, Frank; Luo, Yuling
2006-05-01
We describe a novel method to quantitatively measure messenger RNA (mRNA) expression of multiple genes directly from crude cell lysates and tissue homogenates without the need for RNA purification or target amplification. The multiplex branched DNA (bDNA) assay adapts the bDNA technology to the Luminex fluorescent bead-based platform through the use of cooperative hybridization, which ensures an exceptionally high degree of assay specificity. Using in vitro transcribed RNA as reference standards, we demonstrated that the assay is highly specific, with cross-reactivity less than 0.2%. We also determined that the assay detection sensitivity is 25,000 RNA transcripts with intra- and interplate coefficients of variation of less than 10% and less than 15%, respectively. Using three 10-gene panels designed to measure proinflammatory and apoptosis responses, we demonstrated sensitive and specific multiplex gene expression profiling directly from cell lysates. The gene expression change data demonstrate a high correlation coefficient (R² = 0.94) compared with measurements obtained using the single-plex bDNA assay. Thus, the multiplex bDNA assay provides a powerful means to quantify the gene expression profile of a defined set of target genes in large sample populations.
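The intra- and interplate precision figures quoted above are coefficients of variation. A minimal sketch of how such a figure could be computed from replicate signals is given below; the replicate values and the function name are illustrative assumptions, not data or code from the paper.

```python
from statistics import mean, stdev


def coefficient_of_variation(values):
    """Return the coefficient of variation (sample SD / mean) as a percentage."""
    m = mean(values)
    if m == 0:
        raise ValueError("mean is zero; the coefficient of variation is undefined")
    return 100.0 * stdev(values) / m


# Hypothetical replicate signals for one target on one plate (not from the paper).
replicates = [1020.0, 980.0, 1005.0, 995.0]
cv = coefficient_of_variation(replicates)

# The abstract reports intraplate CVs below 10%; the same cutoff is applied here.
passes_intraplate = cv < 10.0
```

With these made-up replicates the CV is well under the 10% intraplate bound; an interplate check would apply the same function to per-plate means with the 15% bound.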
Control of gene expression by CRISPR-Cas systems
2013-01-01
Clustered regularly interspaced short palindromic repeats (CRISPR) loci and their associated cas (CRISPR-associated) genes provide adaptive immunity against viruses (phages) and other mobile genetic elements in bacteria and archaea. While most of the early work has largely been dominated by examples of CRISPR-Cas systems directing the cleavage of phage or plasmid DNA, recent studies have revealed a more complex landscape where CRISPR-Cas loci might be involved in gene regulation. In this review, we summarize the role of these loci in the regulation of gene expression as well as the recent development of synthetic gene regulation using engineered CRISPR-Cas systems. PMID:24273648
2010-01-01
Background Parkinson's disease is the second most common neurodegenerative disorder. The pathological hallmark of the disease is degeneration of midbrain dopaminergic neurons. Genetic association studies have linked 13 human chromosomal loci to Parkinson's disease. Identification of gene(s), as part of the etiology of Parkinson's disease, within the large number of genes residing in these loci can be achieved through several approaches, including screening methods, and considering appropriate criteria. Since several of the identified Parkinson's disease genes are expressed in substantia nigra pars compacta of the midbrain, expression within the neurons of this area could be a suitable criterion to limit the number of candidates and identify PD genes. Methods In this work we have used the combination of findings from six rodent transcriptome analysis studies on the gene expression profile of midbrain dopaminergic neurons and the PARK loci in OMIM (Online Mendelian Inheritance in Man) database, to identify new candidate genes for Parkinson's disease. Results Merging the two datasets, we identified 20 genes within PARK loci, 7 of which are located in an orphan Parkinson's disease locus and one, which had been identified as a disease gene. In addition to identifying a set of candidates for further genetic association studies, these results show that the criterion of expression in midbrain dopaminergic neurons may be used to narrow down the number of genes in PARK loci for such studies. PMID:20716345
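The merging step described above amounts to intersecting a set of genes expressed in the tissue of interest with the genes residing in each disease locus. A minimal sketch follows, assuming both inputs are already available as gene-symbol lists; the locus and gene names are placeholders, not the PARK loci or genes reported in the study.

```python
def candidates_in_loci(expressed_genes, locus_genes):
    """Map each locus to the sorted list of its genes that are also expressed.

    Loci with no expressed gene are omitted from the result.
    """
    expressed = set(expressed_genes)
    hits = {}
    for locus, genes in locus_genes.items():
        shared = sorted(expressed & set(genes))
        if shared:
            hits[locus] = shared
    return hits


# Placeholder inputs (hypothetical names, for illustration only).
expressed = ["GENE_A", "GENE_B", "GENE_C"]
loci = {
    "LOCUS_1": ["GENE_A", "GENE_X"],
    "LOCUS_2": ["GENE_Y"],
    "LOCUS_3": ["GENE_C", "GENE_B"],
}
hits = candidates_in_loci(expressed, loci)
```

In practice the expressed set would be the union or consensus of the transcriptome studies, and that choice changes how many candidates survive the filter.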
Yu, Fang; Chen, Ming-Hui; Kuo, Lynn; Talbott, Heather; Davis, John S
2015-08-07
Bayesian methods have recently become more popular for analyzing high dimensional gene expression data, as they allow us to borrow information across different genes and provide powerful estimators for evaluating gene expression levels. It is crucial to develop a simple but efficient gene selection algorithm for detecting differentially expressed (DE) genes based on the Bayesian estimators. In this paper, by extending the two-criterion idea of Chen et al. (Chen M-H, Ibrahim JG, Chi Y-Y. A new class of mixture models for differential gene expression in DNA microarray data. J Stat Plan Inference. 2008;138:387-404), we propose two new gene selection algorithms for general Bayesian models and name these new methods the confident difference criterion methods. One is based on the standardized differences between two mean expression values among genes; the other adds the differences between two variances to it. The proposed confident difference criterion methods first evaluate the posterior probability of a gene having different gene expressions between competitive samples and then declare a gene to be DE if the posterior probability is large. The theoretical connection between the proposed first method based on the means and the Bayes factor approach proposed by Yu et al. (Yu F, Chen M-H, Kuo L. Detecting differentially expressed genes using calibrated Bayes factors. Statistica Sinica. 2008;18:783-802) is established under the normal-normal-model with equal variances between two samples. The empirical performance of the proposed methods is examined and compared to those of several existing methods via several simulations. The results from these simulation studies show that the proposed confident difference criterion methods outperform the existing methods when comparing gene expressions across different conditions for both microarray studies and sequence-based high-throughput studies. A real dataset is used to further demonstrate the proposed methodology. In the real data application, the confident difference criterion methods successfully identified more clinically important DE genes than the other methods. The confident difference criterion method proposed in this paper provides a new efficient approach for both microarray studies and sequence-based high-throughput studies to identify differentially expressed genes.
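The mean-based criterion described above can be illustrated with a small Monte Carlo sketch: given posterior draws for one gene, estimate the posterior probability that the standardized difference between the two means exceeds a threshold, and call the gene DE when that probability is large. The abstract gives neither the exact standardization nor the cutoffs, so the form |mu1 - mu2| / sigma and the names `delta` and `gamma` below are assumptions made for illustration only.

```python
def posterior_prob_confident_difference(mu1_draws, mu2_draws, sigma_draws, delta):
    """Estimate P(|mu1 - mu2| / sigma > delta | data) from paired posterior draws."""
    n = len(mu1_draws)
    if n == 0 or len(mu2_draws) != n or len(sigma_draws) != n:
        raise ValueError("draw sequences must be non-empty and of equal length")
    exceed = sum(
        1
        for m1, m2, s in zip(mu1_draws, mu2_draws, sigma_draws)
        if abs(m1 - m2) / s > delta
    )
    return exceed / n


def is_de(mu1_draws, mu2_draws, sigma_draws, delta=1.0, gamma=0.9):
    """Declare a gene DE when the estimated posterior probability reaches gamma.

    delta and gamma are hypothetical tuning values, not taken from the paper.
    """
    prob = posterior_prob_confident_difference(mu1_draws, mu2_draws, sigma_draws, delta)
    return prob >= gamma


# Tiny hand-made set of "posterior draws" for one gene (illustrative only).
mu1 = [2.0, 2.2, 1.8, 2.1]
mu2 = [0.0, 0.1, 1.5, -0.1]
sigma = [1.0, 1.0, 1.0, 1.0]
prob = posterior_prob_confident_difference(mu1, mu2, sigma, delta=1.0)
```

Here three of the four draws exceed the threshold, so the gene would be called DE at a cutoff of 0.7 but not at 0.9. A real analysis would use thousands of MCMC draws per gene and choose the cutoffs to control an error rate.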
Connahs, Heidi; Rhen, Turk; Simmons, Rebecca B
2016-03-31
Butterfly wing color patterns are an important model system for understanding the evolution and development of morphological diversity and animal pigmentation. Wing color patterns develop from a complex network composed of highly conserved patterning genes and pigmentation pathways. Patterning genes are involved in regulating pigment synthesis; however, the temporal expression dynamics of these interacting networks are poorly understood. Here, we employ next generation sequencing to examine expression patterns of the gene network underlying wing development in the nymphalid butterfly, Vanessa cardui. We identified 9,376 differentially expressed transcripts during wing color pattern development, including genes involved in patterning, pigmentation and gene regulation. Differential expression of these genes was highest at the pre-ommochrome stage compared to early pupal and late melanin stages. Overall, an increasing number of genes were down-regulated during the progression of wing development. We observed dynamic expression patterns of a large number of pigment genes from the ommochrome, melanin and also pteridine pathways, including contrasting patterns of expression for paralogs of the yellow gene family. Surprisingly, many patterning genes previously associated with butterfly pattern elements were not significantly up-regulated at any time during pupation, although many other transcription factors were differentially expressed. Several genes involved in Notch signaling were significantly up-regulated during the pre-ommochrome stage including slow border cells, bunched and pebbles; the function of these genes in the development of butterfly wings is currently unknown. Many genes involved in ecdysone signaling were also significantly up-regulated during early pupal and late melanin stages and exhibited opposing patterns of expression relative to the ecdysone receptor. Finally, a comparison across four butterfly transcriptomes revealed 28 transcripts common to all four species that have no known homologs in other metazoans. This study provides a comprehensive list of differentially expressed transcripts during wing development, revealing potential candidate genes that may be involved in regulating butterfly wing patterns. Some differentially expressed genes have no known homologs, possibly representing genes unique to butterflies. Results from this study also indicate that development of nymphalid wing patterns may arise not only from melanin and ommochrome pigments but also the pteridine pigment pathway.
Inferring causal genomic alterations in breast cancer using gene expression data
2011-01-01
Background One of the primary objectives in cancer research is to identify causal genomic alterations, such as somatic copy number variation (CNV) and somatic mutations, during tumor development. Many valuable studies lack genomic data to detect CNV; therefore, methods that are able to infer CNVs from gene expression data would help maximize the value of these studies. Results We developed a framework for identifying recurrent regions of CNV and distinguishing the cancer driver genes from the passenger genes in the regions. By inferring CNV regions across many datasets we were able to identify 109 recurrent amplified/deleted CNV regions. Many of these regions are enriched for genes involved in many important processes associated with tumorigenesis and cancer progression. Genes in these recurrent CNV regions were then examined in the context of gene regulatory networks to prioritize putative cancer driver genes. The cancer driver genes uncovered by the framework include not only well-known oncogenes but also a number of novel cancer susceptibility genes validated via siRNA experiments. Conclusions To our knowledge, this is the first effort to systematically identify and validate drivers for expression-based CNV regions in breast cancer. The framework, in which wavelet analysis of expression-based copy number alteration is coupled with gene regulatory network analysis, provides a blueprint for leveraging genomic data to identify key regulatory components and gene targets. This integrative approach can be applied to many other large-scale gene expression studies and other novel types of cancer data such as next-generation sequencing based expression (RNA-Seq) as well as CNV data. PMID:21806811
Gene expression characterizes different nutritional strategies among three mixotrophic protists.
Liu, Zhenfeng; Campbell, Victoria; Heidelberg, Karla B; Caron, David A
2016-07-01
Mixotrophic protists, i.e. protists that can carry out both phototrophy and heterotrophy, are a group of organisms with a wide range of nutritional strategies. The ecological and biogeochemical importance of these species has recently been recognized. In this study, we investigated and compared the gene expression of three mixotrophic protists, Prymnesium parvum, Dinobryon sp. and Ochromonas sp., under light and dark conditions in the presence of prey using RNA-Seq. Gene expression of the obligately phototrophic P. parvum and Dinobryon sp. changed significantly between light and dark treatments, while that of primarily heterotrophic Ochromonas sp. was largely unchanged. Gene expression of P. parvum and Dinobryon sp. shared many similarities, especially in the expression patterns of genes related to reproduction. However, key genes involved in central carbon metabolism and phagotrophy had different expression patterns between these two species, suggesting differences in prey consumption and heterotrophic nutrition in the dark. Transcriptomic data also offered clues to other physiological traits of these organisms such as preference of nitrogen sources and photo-oxidative stress. These results provide potential target genes for further exploration of the mechanisms of mixotrophic physiology and demonstrate the potential usefulness of molecular approaches in characterizing the nutritional modes of mixotrophic protists. © FEMS 2016. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Microarray-based identification of differentially expressed genes in extramammary Paget’s disease
Lin, Jin-Ran; Liang, Jun; Zhang, Qiao-An; Huang, Qiong; Wang, Shang-Shang; Qin, Hai-Hong; Chen, Lian-Jun; Xu, Jin-Hua
2015-01-01
Extramammary Paget’s disease (EMPD) is a rare cutaneous malignancy accounting for approximately 1-2% of vulvar cancers. The rarity of this disease has caused difficulties in characterization and the molecular mechanism underlying EMPD development remains largely unclear. Here we used microarray analysis to identify differentially expressed genes in EMPD of the scrotum compared with normal epithelium from healthy donors. An Agilent single-channel microarray was used to compare the gene expression between 6 EMPD specimens and 6 normal scrotum epithelium samples. A total of 799 up-regulated genes and 723 down-regulated genes were identified in EMPD tissues. Real-time PCR was conducted to verify the differential expression of some representative genes, including ERBB4, TCF3, PAPSS2, PIK3R3, PRLR, SULT1A1, TCF7L1, and CREB3L4. Generally, the real-time PCR results were consistent with microarray data, and ERBB4, PRLR, TCF3, PIK3R3, SULT1A1, and TCF7L1 were significantly overexpressed in EMPD (P<0.05). Moreover, the overexpression in EMPD of PRLR, a receptor for the anterior pituitary hormone prolactin (PRL), was confirmed by immunohistochemistry. These data demonstrate that the differentially expressed genes from the microarray-based identification are tightly associated with EMPD occurrence. PMID:26221264
DOE Office of Scientific and Technical Information (OSTI.GOV)
Farahani, Poupak; Chiu, Sally; Bowlus, Christopher L.
Obesity is a complex disease. To date, over 100 chromosomal loci for body weight, body fat, regional white adipose tissue weight, and other obesity-related traits have been identified in humans and in animal models. For most loci, the underlying genes are not yet identified; some of these chromosomal loci will be alleles of known obesity genes, whereas many will represent alleles of unknown genes. Microarray analysis allows simultaneous multiple gene and pathway discovery. cDNA and oligonucleotide arrays are commonly used to identify differentially expressed genes by surveys of large numbers of known and unnamed genes. Two papers previously identified genes differentially expressed in adipose tissue of mouse models of obesity and diabetes by analysis of hybridization to Affymetrix oligonucleotide chips.
Liu, Kaidong; Li, Haili; Li, Weijin; Zhong, Jundi; Chen, Yan; Shen, Chenjia; Yuan, Changchun
2017-10-23
Sugar apple (Annona squamosa L.), a popular fruit with high medicinal and nutritional properties, is widely cultivated in tropical South Asia and America. Flower malformation is a major cause of reduced production of sugar apple. However, little information is available on the differences between normal and malformed flowers of sugar apple. To gain a comprehensive perspective on the differences between normal and malformed flowers of sugar apple, cDNA libraries from normal and malformed flowers were prepared independently for Illumina sequencing. The data generated a total of 70,189,896 reads that were integrated and assembled into 55,097 unigenes with a mean length of 783 bp. A large number of differentially expressed genes (DEGs) were identified. Among these DEGs were 701 flower development-associated transcription factor-encoding genes. Furthermore, a large number of flowering- and hormone-related DEGs were also identified, and most of these genes were down-regulated in the malformed flowers. The expression levels of 15 selected genes were validated using quantitative PCR. The contents of several endogenous hormones were measured. The malformed flowers displayed lower endogenous hormone levels compared to the normal flowers. The expression data as well as hormone levels in our study will serve as a comprehensive resource for investigating the regulatory mechanisms involved in floral organ development in sugar apple.
GECKO: a complete large-scale gene expression analysis platform.
Theilhaber, Joachim; Ulyanov, Anatoly; Malanthara, Anish; Cole, Jack; Xu, Dapeng; Nahf, Robert; Heuer, Michael; Brockel, Christoph; Bushnell, Steven
2004-12-10
Gecko (Gene Expression: Computation and Knowledge Organization) is a complete, high-capacity centralized gene expression analysis system, developed in response to the needs of a distributed user community. Based on a client-server architecture, with a centralized repository of typically many tens of thousands of Affymetrix scans, Gecko includes automatic processing pipelines for uploading data from remote sites, a database, a computational engine implementing approximately 50 different analysis tools, and a client application. Among available analysis tools are clustering methods, principal component analysis, supervised classification including feature selection and cross-validation, multi-factorial ANOVA, statistical contrast calculations, and various post-processing tools for extracting data at given error rates or significance levels. On account of its open architecture, Gecko also allows for the integration of new algorithms. The Gecko framework is very general: non-Affymetrix and non-gene expression data can be analyzed as well. A unique feature of the Gecko architecture is the concept of the Analysis Tree (actually, a directed acyclic graph), in which all successive results in ongoing analyses are saved. This approach has proven invaluable in allowing a large (approximately 100 users) and distributed community to share results, and to repeatedly return over a span of years to older and potentially very complex analyses of gene expression data. The Gecko system is being made publicly available as free software at http://sourceforge.net/projects/geckoe. In totality or in parts, the Gecko framework should prove useful to users and system developers with a broad range of analysis needs.
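The Analysis Tree idea, in which every intermediate result is kept as a node of a directed acyclic graph, can be sketched in a few lines. The class, field and step names below are hypothetical and are not taken from the Gecko code base; the sketch only shows how saved results could be linked to the results they were derived from and how the full provenance of a result could be recovered.

```python
from dataclasses import dataclass, field


@dataclass
class AnalysisNode:
    """One saved result; `parents` are the results it was computed from."""
    name: str
    result: object = None
    parents: list = field(default_factory=list)


def lineage(node):
    """Return the names of all ancestors of `node`, then `node` itself.

    Each name appears once and every parent precedes its children,
    even when two branches share an ancestor. Names are assumed unique.
    """
    ordered = []
    seen = set()

    def visit(n):
        if n.name in seen:
            return
        seen.add(n.name)
        for parent in n.parents:
            visit(parent)
        ordered.append(n.name)

    visit(node)
    return ordered


# Hypothetical analysis session: two branches share the normalized data.
raw = AnalysisNode("raw_scans")
norm = AnalysisNode("normalized", parents=[raw])
clusters = AnalysisNode("clustering", parents=[norm])
anova = AnalysisNode("anova", parents=[norm])
report = AnalysisNode("contrast_report", parents=[clusters, anova])
steps = lineage(report)
```

Because `report` has two parents that share an ancestor, the structure is a graph rather than a strict tree, which matches the parenthetical remark in the abstract.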
Koper, Andre; Zeef, Leo A H; Joseph, Leena; Kerr, Keith; Gosney, John; Lindsay, Mark A; Booton, Richard
2017-01-10
Preinvasive squamous cell cancers (PSCC) are local transformations of bronchial epithelia that are frequently observed in current or former smokers. Their different grades and sizes suggest a continuum of dysplastic change with increasing severity, which may culminate in invasive squamous cell carcinoma (ISCC). As a consequence of the difficulty in isolating cancerous cells from biopsies, the molecular pathology that underlies their histological variability remains largely unknown. To address this issue, we have employed microdissection to isolate normal bronchial epithelia and cancerous cells from low- and high-grade PSCC and ISCC, from formalin-fixed, paraffin-embedded (FFPE) biopsies and determined gene expression using Affymetrix Human Exon 1.0 ST arrays. Tests for differential gene expression were performed using the Bioconductor package limma followed by functional analyses of differentially expressed genes in IPA. Examination of differential gene expression showed small differences between low- and high-grade PSCC but substantial changes between PSCC and ISCC samples (184 vs 1200 genes; p-value <0.05, fc ±1.75). However, the majority of the differentially expressed PSCC genes (142 genes: 77%) were shared with those in ISCC samples. Pathway analysis showed that these shared genes are associated with DNA damage response, DNA/RNA metabolism and inflammation as major biological themes. Cluster analysis identified 12 distinct patterns of gene expression including progressive up or down-regulation across PSCC and ISCC. Pathway analysis of incrementally up-regulated genes revealed again significant enrichment of terms related to DNA damage response, DNA/RNA metabolism, inflammation, survival and proliferation. Altered expression of selected genes was confirmed using RT-PCR, as well as immunohistochemistry in an independent set of 45 ISCCs. Gene expression profiles in PSCC and ISCC differ greatly in terms of numbers of genes with altered transcriptional activity. However, altered gene expression in PSCC affects canonical pathways and cellular and biological processes, such as inflammation and DNA damage response, which are highly consistent with hallmarks of cancer.
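The selection thresholds and the overlap figure quoted above can be reproduced with a short sketch. It assumes a signed linear fold change (for example, -2.0 meaning two-fold down), which is one common reading of "fc ±1.75"; the abstract does not spell out the convention, and the function names are illustrative rather than part of limma.

```python
def is_differentially_expressed(p_value, fold_change, p_cutoff=0.05, fc_cutoff=1.75):
    """Apply the p-value and fold-change thresholds quoted in the abstract.

    `fold_change` is assumed to be a signed linear fold change
    (+2.0 = two-fold up, -2.0 = two-fold down).
    """
    return p_value < p_cutoff and abs(fold_change) >= fc_cutoff


def percent_shared(n_shared, n_total):
    """Percentage of one gene list that is also present in another."""
    return 100.0 * n_shared / n_total


# 142 of the 184 PSCC genes were also altered in ISCC.
shared_pct = percent_shared(142, 184)
```

Rounded to the nearest whole number, `shared_pct` gives the 77% reported for the shared PSCC genes.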
Cárdenas-Guerra, Rosa Elena; Figueroa-Angulo, Elisa Elvira; Puente-Rivera, Jonathan; Zamudio-Prieto, Olga; Ortega-López, Jaime
2015-01-01
We focus on the iron response of Trichomonas vaginalis to gene family products such as the cysteine proteinases (CPs) involved in virulence properties. In particular, we examined the effect of iron on the gene expression regulation and function of cathepsin L-like and asparaginyl endopeptidase-like CPs as virulence factors. We addressed some important aspects of CP genomic organization, and we offer possible explanations for the fact that only a few members of this large gene family are expressed at the RNA and protein levels and for the way their proteolytic activity is controlled. We also summarized all known iron regulation of CPs at the transcriptional, posttranscriptional, and posttranslational levels along with new insights into the possible epigenetic and miRNA processes. PMID:26090464
Loperfido, Mariana; Jarmin, Susan; Dastidar, Sumitava; Di Matteo, Mario; Perini, Ilaria; Moore, Marc; Nair, Nisha; Samara-Kuko, Ermira; Athanasopoulos, Takis; Tedesco, Francesco Saverio; Dickson, George; Sampaolesi, Maurilio; VandenDriessche, Thierry; Chuah, Marinee K
2016-01-29
Duchenne muscular dystrophy (DMD) is a genetic neuromuscular disorder caused by the absence of dystrophin. We developed a novel gene therapy approach based on the use of the piggyBac (PB) transposon system to deliver the coding DNA sequence (CDS) of either full-length human dystrophin (DYS: 11.1 kb) or truncated microdystrophins (MD1: 3.6 kb; MD2: 4 kb). PB transposons encoding microdystrophins were transfected in C2C12 myoblasts, yielding 65±2% MD1 and 66±2% MD2 expression in differentiated multinucleated myotubes. A hyperactive PB (hyPB) transposase was then deployed to enable transposition of the large-size PB transposon (17 kb) encoding the full-length DYS and green fluorescence protein (GFP). Stable GFP expression attaining 78±3% could be achieved in the C2C12 myoblasts that had undergone transposition. Western blot analysis demonstrated expression of the full-length human DYS protein in myotubes. Subsequently, dystrophic mesoangioblasts from a Golden Retriever muscular dystrophy dog were transfected with the large-size PB transposon resulting in 50±5% GFP-expressing cells after stable transposition. This was consistent with correction of the differentiated dystrophic mesoangioblasts following expression of full-length human DYS. These results pave the way toward a novel non-viral gene therapy approach for DMD using PB transposons underscoring their potential to deliver large therapeutic genes. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.
Identification of an Imprinted Gene Cluster in the X-Inactivation Center
Kobayashi, Shin; Totoki, Yasushi; Soma, Miki; Matsumoto, Kazuya; Fujihara, Yoshitaka; Toyoda, Atsushi; Sakaki, Yoshiyuki; Okabe, Masaru; Ishino, Fumitoshi
2013-01-01
Mammalian development is strongly influenced by the epigenetic phenomenon called genomic imprinting, in which either the paternal or the maternal allele of imprinted genes is expressed. Paternally expressed Xist, an imprinted gene, has been considered as a single cis-acting factor to inactivate the paternally inherited X chromosome (Xp) in preimplantation mouse embryos. This means that X-chromosome inactivation also entails gene imprinting at a very early developmental stage. However, the precise mechanism of imprinted X-chromosome inactivation remains unknown and there is little information about imprinted genes on X chromosomes. In this study, we examined whether there are other imprinted genes than Xist expressed from the inactive paternal X chromosome and expressed in female embryos at the preimplantation stage. We focused on small RNAs and compared their expression patterns between sexes by tagging the female X chromosome with green fluorescent protein. As a result, we identified two micro (mi)RNAs–miR-374-5p and miR-421-3p–mapped adjacent to Xist that were predominantly expressed in female blastocysts. Allelic expression analysis revealed that these miRNAs were indeed imprinted and expressed from the Xp. Further analysis of the imprinting status of adjacent locus led to the discovery of a large cluster of imprinted genes expressed from the Xp: Jpx, Ftx and Zcchc13. To our knowledge, this is the first identified cluster of imprinted genes in the cis-acting regulatory region termed the X-inactivation center. This finding may help in understanding the molecular mechanisms regulating imprinted X-chromosome inactivation during early mammalian development. PMID:23940725
Wang, Y; Smallwood, P M; Cowan, M; Blesh, D; Lawler, A; Nathans, J
1999-04-27
This study examines the mechanism of mutually exclusive expression of the human X-linked red and green visual pigment genes in their respective cone photoreceptors by asking whether this expression pattern can be produced in a mammal that normally carries only a single X-linked visual pigment gene. To address this question, we generated transgenic mice that carry a single copy of a minimal human X chromosome visual pigment gene array in which the red and green pigment gene transcription units were replaced, respectively, by alkaline phosphatase and beta-galactosidase reporters. As determined by histochemical staining, the reporters are expressed exclusively in cone photoreceptor cells. In 20 transgenic mice carrying any one of three independent transgene insertion events, an average of 63% of expressing cones have alkaline phosphatase activity, 10% have beta-galactosidase activity, and 27% have activity for both reporters. Thus, mutually exclusive expression of red and green pigment transgenes can be achieved in a large fraction of cones in a dichromat mammal, suggesting a facile evolutionary path for the development of trichromacy after visual pigment gene duplication. These observations are consistent with a model of visual pigment expression in which stochastic pairing occurs between a locus control region and either the red or the green pigment gene promotor.
Jin, Erqing; Wong, Lynn; Jiao, Yun; Engel, Jake; Holdridge, Benjamin; Xu, Peng
2017-12-01
Engineering cell factories for producing biofuels and pharmaceuticals has spurred great interest in developing rapid and efficient synthetic biology tools customized for modular pathway engineering. Along the way, combinatorial gene expression control through modification of regulatory elements has offered tremendous opportunities for fine-tuning gene expression and generating digital-like genetic circuits. In this report, we present an efficient evolutionary approach to build a range of regulatory control elements. The reported method allows for rapid construction of promoter, 5'UTR, terminator and trans-activating RNA libraries. Synthetic overlapping oligos with a high proportion of degenerate nucleotides flanking the regulatory element could be efficiently assembled into a vector expressing a fluorescence reporter. This approach combines the high mutation rate of the synthetic DNA with the high assembly efficiency of Gibson Mix. Our constructed library demonstrates a broad range of transcriptional or translational gene expression dynamics. Specifically, both the promoter library and the 5'UTR library exhibit gene expression dynamics spanning three orders of magnitude. The terminator library and the trans-activating RNA library display relatively narrow gene expression patterns. The reported study provides a versatile toolbox for rapidly constructing a large family of prokaryotic regulatory elements. These libraries also facilitate the implementation of combinatorial pathway engineering principles and the engineering of more efficient microbial cell factories for various biomanufacturing applications.
Dardick, Christopher
2007-08-01
Plant viruses cause a wide array of disease symptoms and cytopathic effects. Although some of these changes are virus specific, many appear to be common even among diverse viruses. Currently, little is known about the underlying molecular determinants. To identify gene expression changes that are concomitant with virus symptoms, we performed comparative expression profiling experiments on Nicotiana benthamiana leaves infected with one of three different fruit tree viruses that produce distinct symptoms: Plum pox potyvirus (PPV; leaf distortion and mosaic), Tomato ringspot nepovirus (ToRSV; tissue necrosis and general chlorosis), and Prunus necrotic ringspot ilarvirus (PNRSV; subtle chlorotic mottling). The numbers of statistically significant genes identified were consistent with the severity of the observed symptoms: 1,082 (ToRSV), 744 (PPV), and 89 (PNRSV). In all, 56% of the gene expression changes found in PPV-infected leaves also were altered by ToRSV, 87% of which changed in the same direction. Both PPV- and ToRSV-infected leaves showed widespread repression of genes associated with plastid functions. PPV uniquely induced the expression of large numbers of cytosolic ribosomal genes whereas ToRSV repressed the expression of plastidic ribosomal genes. How these and other observed expression changes might be associated with symptom development are discussed.
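The overlap figures in the abstract above can be turned into approximate gene counts. The short sketch below assumes that "56% of the gene expression changes found in PPV-infected leaves" refers to the 744 significant PPV genes; because the percentages are rounded in the abstract, the resulting counts are estimates only and are not values reported by the study.

```python
# Counts of statistically significant genes reported in the abstract.
significant_genes = {"ToRSV": 1082, "PPV": 744, "PNRSV": 89}

# Fractions reported in the abstract (rounded there, so results are approximate).
shared_fraction = 0.56          # PPV changes that were also altered by ToRSV
same_direction_fraction = 0.87  # share of those that changed in the same direction

# Assumption: the 56% is taken relative to the 744 PPV genes.
shared = shared_fraction * significant_genes["PPV"]
same_direction = same_direction_fraction * shared

print(f"PPV changes shared with ToRSV: about {round(shared)}")
print(f"Shared changes in the same direction: about {round(same_direction)}")
```

Under that assumption, roughly 417 PPV-responsive genes were also altered by ToRSV, and roughly 362 of them changed in the same direction.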
CellLineNavigator: a workbench for cancer cell line analysis
Krupp, Markus; Itzel, Timo; Maass, Thorsten; Hildebrandt, Andreas; Galle, Peter R.; Teufel, Andreas
2013-01-01
The CellLineNavigator database, freely available at http://www.medicalgenomics.org/celllinenavigator, is a web-based workbench for large scale comparisons of a large collection of diverse cell lines. It aims to support experimental design in the fields of genomics, systems biology and translational biomedical research. Currently, this compendium holds genome wide expression profiles of 317 different cancer cell lines, categorized into 57 different pathological states and 28 individual tissues. To enlarge the scope of CellLineNavigator, the database was furthermore closely linked to commonly used bioinformatics databases and knowledge repositories. To ensure easy data access and search ability, a simple data and an intuitive querying interface were implemented. It allows the user to explore and filter gene expression, focusing on pathological or physiological conditions. For a more complex search, the advanced query interface may be used to query for (i) differentially expressed genes; (ii) pathological or physiological conditions; or (iii) gene names or functional attributes, such as Kyoto Encyclopaedia of Genes and Genomes pathway maps. These queries may also be combined. Finally, CellLineNavigator allows additional advanced analysis of differentially regulated genes by a direct link to the Database for Annotation, Visualization and Integrated Discovery (DAVID) Bioinformatics Resources. PMID:23118487
Lu, Chenqi; Liu, Xiaoqin; Wang, Lin; Jiang, Ning; Yu, Jun; Zhao, Xiaobo; Hu, Hairong; Zheng, Saihua; Li, Xuelian; Wang, Guiying
2017-01-10
Due to genetic heterogeneity and variable diagnostic criteria, genetic studies of polycystic ovary syndrome are particularly challenging. Furthermore, the lack of sufficiently large cohorts limits the identification of susceptibility genes contributing to polycystic ovary syndrome. Here, we carried out a systematic search of studies deposited in the Gene Expression Omnibus database through August 31, 2016. The present analyses included studies with: 1) patients with polycystic ovary syndrome and normal controls, 2) gene expression profiling of messenger RNA, and 3) sufficient data for our analysis. Ultimately, a total of 9 studies with 13 datasets met the inclusion criteria and were used for the subsequent integrated analyses. Through comprehensive analyses, 13 genetic factors overlapped in all datasets and were identified as significant specific genes for polycystic ovary syndrome. After quality control assessment, six datasets remained. Further gene ontology enrichment and pathway analyses suggested that differentially expressed genes were mainly enriched in oocyte pathways. These findings provide potential molecular markers for the diagnosis and prognosis of polycystic ovary syndrome, and in-depth studies are needed on their exact function and mechanism in polycystic ovary syndrome.
Zhang, Zhenyu; Zhao, Wei; Li, Deshan; Yang, Jinlong; Zsak, Laszlo; Yu, Qingzhong
2015-08-01
In the present study, we developed a novel approach for foreign gene expression by Newcastle disease virus (NDV) from a second ORF through an internal ribosomal entry site (IRES). Six NDV LaSota strain-based recombinant viruses vectoring the IRES and a red fluorescent protein (RFP) gene behind the nucleocapsid (NP), phosphoprotein (P), matrix (M), fusion (F), haemagglutinin-neuraminidase (HN) or large polymerase (L) gene ORF were generated using reverse genetics technology. The insertion of the second ORF slightly attenuated virus pathogenicity, but did not affect the ability of the virus to grow. Quantitative measurements of RFP expression in virus-infected DF-1 cells revealed that the abundance of viral mRNAs and red fluorescence intensity were positively correlated with the gene order of NDV, 3'-NP-P-M-F-HN-L-5', proving the sequential transcription mechanism for NDV. The results herein suggest that the level of foreign gene expression could be regulated by selecting the second ORF insertion site to maximize the efficacy of vaccine and gene therapy.
Diversification of Root Hair Development Genes in Vascular Plants.
Huang, Ling; Shi, Xinhui; Wang, Wenjia; Ryu, Kook Hui; Schiefelbein, John
2017-07-01
The molecular genetic program for root hair development has been studied intensively in Arabidopsis (Arabidopsis thaliana). To understand the extent to which this program might operate in other plants, we conducted a large-scale comparative analysis of root hair development genes from diverse vascular plants, including eudicots, monocots, and a lycophyte. Combining phylogenetics and transcriptomics, we discovered conservation of a core set of root hair genes across all vascular plants, which may derive from an ancient program for unidirectional cell growth coopted for root hair development during vascular plant evolution. Interestingly, we also discovered preferential diversification in the structure and expression of root hair development genes, relative to other root hair- and root-expressed genes, among these species. These differences enabled the definition of sets of genes and gene functions that were acquired or lost in specific lineages during vascular plant evolution. In particular, we found substantial divergence in the structure and expression of genes used for root hair patterning, suggesting that the Arabidopsis transcriptional regulatory mechanism is not shared by other species. To our knowledge, this study provides the first comprehensive view of gene expression in a single plant cell type across multiple species. © 2017 American Society of Plant Biologists. All Rights Reserved.
Canales, Javier; Moyano, Tomás C.; Villarroel, Eva; Gutiérrez, Rodrigo A.
2014-01-01
Nitrogen (N) is an essential macronutrient for plant growth and development. Plants adapt to changes in N availability partly by changes in global gene expression. We integrated publicly available root microarray data under contrasting nitrate conditions to identify new genes and functions important for adaptive nitrate responses in Arabidopsis thaliana roots. Overall, more than 2000 genes exhibited changes in expression in response to nitrate treatments in Arabidopsis thaliana root organs. Global regulation of gene expression by nitrate depends largely on the experimental context. However, despite significant differences from experiment to experiment in the identity of regulated genes, there is a robust nitrate response of specific biological functions. Integrative gene network analysis uncovered relationships between nitrate-responsive genes and 11 highly co-expressed gene clusters (modules). Four of these gene network modules have robust nitrate-responsive functions such as transport, signaling, and metabolism. Network analysis led to the hypothesis that G2-like transcription factors are key regulatory factors controlling transport and signaling functions. Our meta-analysis highlights the role of biological processes not studied before in the context of the nitrate response, such as root hair development, and provides testable hypotheses to advance our understanding of nitrate responses in plants. PMID:24570678
Diversification of Root Hair Development Genes in Vascular Plants
Shi, Xinhui; Wang, Wenjia; Ryu, Kook Hui
2017-01-01
The molecular genetic program for root hair development has been studied intensively in Arabidopsis (Arabidopsis thaliana). To understand the extent to which this program might operate in other plants, we conducted a large-scale comparative analysis of root hair development genes from diverse vascular plants, including eudicots, monocots, and a lycophyte. Combining phylogenetics and transcriptomics, we discovered conservation of a core set of root hair genes across all vascular plants, which may derive from an ancient program for unidirectional cell growth coopted for root hair development during vascular plant evolution. Interestingly, we also discovered preferential diversification in the structure and expression of root hair development genes, relative to other root hair- and root-expressed genes, among these species. These differences enabled the definition of sets of genes and gene functions that were acquired or lost in specific lineages during vascular plant evolution. In particular, we found substantial divergence in the structure and expression of genes used for root hair patterning, suggesting that the Arabidopsis transcriptional regulatory mechanism is not shared by other species. To our knowledge, this study provides the first comprehensive view of gene expression in a single plant cell type across multiple species. PMID:28487476
Wasito, Ito; Hashim, Siti Zaiton M; Sukmaningrum, Sri
2007-01-01
Gene expression profiling plays an important role in the identification of biological and clinical properties of human solid tumors such as colorectal carcinoma. Profiling is required to reveal underlying molecular features for diagnostic and therapeutic purposes. A non-parametric density-estimation-based approach called iterative local Gaussian clustering (ILGC), was used to identify clusters of expressed genes. We used experimental data from a previous study by Muro and others consisting of 1,536 genes in 100 colorectal cancer and 11 normal tissues. In this dataset, the ILGC finds three clusters, two large and one small gene clusters, similar to their results which used Gaussian mixture clustering. The correlation of each cluster of genes and clinical properties of malignancy of human colorectal cancer was analysed for the existence of tumor or normal, the existence of distant metastasis and the existence of lymph node metastasis. PMID:18305825
Wasito, Ito; Hashim, Siti Zaiton M; Sukmaningrum, Sri
2007-12-30
Gene expression profiling plays an important role in the identification of biological and clinical properties of human solid tumors such as colorectal carcinoma. Profiling is required to reveal underlying molecular features for diagnostic and therapeutic purposes. A non-parametric density-estimation-based approach called iterative local Gaussian clustering (ILGC), was used to identify clusters of expressed genes. We used experimental data from a previous study by Muro and others consisting of 1,536 genes in 100 colorectal cancer and 11 normal tissues. In this dataset, the ILGC finds three clusters, two large and one small gene clusters, similar to their results which used Gaussian mixture clustering. The correlation of each cluster of genes and clinical properties of malignancy of human colorectal cancer was analysed for the existence of tumor or normal, the existence of distant metastasis and the existence of lymph node metastasis.
Atallah, Nadia M; Vitek, Olga; Gaiti, Federico; Tanurdzic, Milos; Banks, Jo Ann
2018-05-02
The fern Ceratopteris richardii is an important model for studies of sex determination and gamete differentiation in homosporous plants. Here we use RNA-seq to de novo assemble a transcriptome and identify genes differentially expressed in young gametophytes as their sex is determined by the presence or absence of the male-inducing pheromone called antheridiogen. Of the 1,163 consensus differentially expressed genes identified, the vast majority (1,030) are up-regulated in gametophytes treated with antheridiogen. GO term enrichment analyses of these DEGs reveals that a large number of genes involved in epigenetic reprogramming of the gametophyte genome are up-regulated by the pheromone. Additional hormone response and development genes are also up-regulated by the pheromone. This C. richardii gametophyte transcriptome and gene expression dataset will prove useful for studies focusing on sex determination and differentiation in plants. Copyright © 2018, G3: Genes, Genomes, Genetics.
BRAIN NETWORKS. Correlated gene expression supports synchronous activity in brain networks.
Richiardi, Jonas; Altmann, Andre; Milazzo, Anna-Clare; Chang, Catie; Chakravarty, M Mallar; Banaschewski, Tobias; Barker, Gareth J; Bokde, Arun L W; Bromberg, Uli; Büchel, Christian; Conrod, Patricia; Fauth-Bühler, Mira; Flor, Herta; Frouin, Vincent; Gallinat, Jürgen; Garavan, Hugh; Gowland, Penny; Heinz, Andreas; Lemaître, Hervé; Mann, Karl F; Martinot, Jean-Luc; Nees, Frauke; Paus, Tomáš; Pausova, Zdenka; Rietschel, Marcella; Robbins, Trevor W; Smolka, Michael N; Spanagel, Rainer; Ströhle, Andreas; Schumann, Gunter; Hawrylycz, Mike; Poline, Jean-Baptiste; Greicius, Michael D
2015-06-12
During rest, brain activity is synchronized between different regions widely distributed throughout the brain, forming functional networks. However, the molecular mechanisms supporting functional connectivity remain undefined. We show that functional brain networks defined with resting-state functional magnetic resonance imaging can be recapitulated by using measures of correlated gene expression in a post mortem brain tissue data set. The set of 136 genes we identify is significantly enriched for ion channels. Polymorphisms in this set of genes significantly affect resting-state functional connectivity in a large sample of healthy adolescents. Expression levels of these genes are also significantly associated with axonal connectivity in the mouse. The results provide convergent, multimodal evidence that resting-state functional networks correlate with the orchestrated activity of dozens of genes linked to ion channel activity and synaptic function. Copyright © 2015, American Association for the Advancement of Science.
Shu, Xianghua; Liu, Yonggang; Yang, Liangyu; Song, Chunlian; Hou, Jiafa
2008-01-01
The complete coding sequences of 3 porcine genes - ASPA, NAGA, and HEXA - were amplified by the reverse transcriptase polymerase chain reaction (RT-PCR) based on the conserved sequence information of the mouse or other mammals and referenced pig ESTs. These 3 novel porcine genes were then deposited in the NCBI database and assigned GeneIDs: 100142661, 100142664 and 100142667. The phylogenetic tree analysis revealed that the porcine ASPA, NAGA, and HEXA all have closer genetic relationships with the ASPA, NAGA, and HEXA of cattle. Tissue expression profile analysis was also carried out and results revealed that swine ASPA, NAGA, and HEXA genes were differentially expressed in various organs, including skeletal muscle, the heart, liver, fat, kidney, lung, and small and large intestines. Our experiment is the first one to establish the foundation for further research on these 3 swine genes.
Teste, Marie-Ange; Duquenne, Manon; François, Jean M; Parrou, Jean-Luc
2009-01-01
Background: Real-time RT-PCR is the recommended method for quantitative gene expression analysis. A compulsory step is the selection of good reference genes for normalization. A few genes often referred to as HouseKeeping Genes (HSK), such as ACT1, RDN18 or PDA1 are among the most commonly used, as their expression is assumed to remain unchanged over a wide range of conditions. Since this assumption is very unlikely, a geometric averaging of multiple, carefully selected internal control genes is now strongly recommended for normalization to avoid this problem of expression variation of single reference genes. The aim of this work was to search for a set of reference genes for reliable gene expression analysis in Saccharomyces cerevisiae. Results: From public microarray datasets, we selected potential reference genes whose expression remained apparently invariable during long-term growth on glucose. Using the algorithm geNorm, ALG9, TAF10, TFC1 and UBC6 turned out to be genes whose expression remained stable, independent of the growth conditions and the strain backgrounds tested in this study. We then showed that the geometric averaging of any subset of three genes among the six most stable genes resulted in very similar normalized data, which contrasted with inconsistent results among various biological samples when the normalization was performed with ACT1. Normalization with multiple selected genes was therefore applied to transcriptional analysis of genes involved in glycogen metabolism. We determined an induction ratio of 100-fold for GPH1 and 20-fold for GSY2 between the exponential phase and the diauxic shift on glucose. There was no induction of these two genes at this transition phase on galactose, although in both cases, the kinetics of glycogen accumulation was similar. In contrast, SGA1 expression was independent of the carbon source and increased by 3-fold in stationary phase.
Conclusion: In this work, we provided a set of genes that are suitable reference genes for quantitative gene expression analysis by real-time RT-PCR in yeast biological samples covering a large panel of physiological states. In contrast, we invalidated and discourage the use of ACT1 as well as other commonly used reference genes (PDA1, TDH3, RDN18, etc) as internal controls for quantitative gene expression analysis in yeast. PMID:19874630
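The geometric averaging of several reference genes described in the abstract above can be sketched in a few lines. This is a minimal illustration, not the geNorm algorithm itself (which ranks genes by stability); the function names and the relative quantities are hypothetical, and only the gene names ALG9, TAF10 and TFC1 come from the abstract.

```python
import math

def normalization_factor(reference_quantities):
    """Return the geometric mean of the reference genes' relative quantities."""
    if not reference_quantities or any(q <= 0 for q in reference_quantities):
        raise ValueError("reference quantities must be non-empty and positive")
    # Average in log space, then exponentiate: the definition of a geometric mean.
    log_sum = sum(math.log(q) for q in reference_quantities)
    return math.exp(log_sum / len(reference_quantities))

def normalize(target_quantity, reference_quantities):
    """Divide a target gene's relative quantity by the normalization factor."""
    return target_quantity / normalization_factor(reference_quantities)

# Hypothetical relative quantities for one sample (illustrative values only).
references = {"ALG9": 2.0, "TAF10": 4.0, "TFC1": 8.0}
factor = normalization_factor(list(references.values()))
normalized_target = normalize(12.0, list(references.values()))

print(f"normalization factor: {factor:.3f}")
print(f"normalized target expression: {normalized_target:.3f}")
```

A geometric rather than arithmetic mean is used because it damps the influence of a single reference gene whose abundance differs greatly from the others.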
Teste, Marie-Ange; Duquenne, Manon; François, Jean M; Parrou, Jean-Luc
2009-10-30
Real-time RT-PCR is the recommended method for quantitative gene expression analysis. A compulsory step is the selection of good reference genes for normalization. A few genes often referred to as HouseKeeping Genes (HSK), such as ACT1, RDN18 or PDA1 are among the most commonly used, as their expression is assumed to remain unchanged over a wide range of conditions. Since this assumption is very unlikely, a geometric averaging of multiple, carefully selected internal control genes is now strongly recommended for normalization to avoid this problem of expression variation of single reference genes. The aim of this work was to search for a set of reference genes for reliable gene expression analysis in Saccharomyces cerevisiae. From public microarray datasets, we selected potential reference genes whose expression remained apparently invariable during long-term growth on glucose. Using the algorithm geNorm, ALG9, TAF10, TFC1 and UBC6 turned out to be genes whose expression remained stable, independent of the growth conditions and the strain backgrounds tested in this study. We then showed that the geometric averaging of any subset of three genes among the six most stable genes resulted in very similar normalized data, which contrasted with inconsistent results among various biological samples when the normalization was performed with ACT1. Normalization with multiple selected genes was therefore applied to transcriptional analysis of genes involved in glycogen metabolism. We determined an induction ratio of 100-fold for GPH1 and 20-fold for GSY2 between the exponential phase and the diauxic shift on glucose. There was no induction of these two genes at this transition phase on galactose, although in both cases, the kinetics of glycogen accumulation was similar. In contrast, SGA1 expression was independent of the carbon source and increased by 3-fold in stationary phase. 
In this work, we provided a set of genes that are suitable reference genes for quantitative gene expression analysis by real-time RT-PCR in yeast biological samples covering a large panel of physiological states. In contrast, we invalidated and discourage the use of ACT1 as well as other commonly used reference genes (PDA1, TDH3, RDN18, etc) as internal controls for quantitative gene expression analysis in yeast.
Simonini, Sara; Roig-Villanova, Irma; Gregis, Veronica; Colombo, Bilitis; Colombo, Lucia; Kater, Martin M.
2012-01-01
BASIC PENTACYSTEINE (BPC) transcription factors have been identified in a large variety of plant species. In Arabidopsis thaliana there are seven BPC genes, which, except for BPC5, are expressed ubiquitously. BPC genes are functionally redundant in a wide range of developmental processes. Recently, we reported that BPC1 binds to guanine and adenine (GA)–rich consensus sequences in the SEEDSTICK (STK) promoter in vitro and induces conformational changes. Here we show by chromatin immunoprecipitation experiments that in vivo BPCs also bind to the consensus boxes, and when these were mutated, expression from the STK promoter was derepressed, resulting in ectopic expression in the inflorescence. We also reveal that SHORT VEGETATIVE PHASE (SVP) is a direct regulator of STK. SVP is a floral meristem identity gene belonging to the MADS box gene family. The SVP-APETALA1 (AP1) dimer recruits the SEUSS (SEU)-LEUNIG (LUG) transcriptional cosuppressor to repress floral homeotic gene expression in the floral meristem. Interestingly, we found that GA consensus sequences in the STK promoter to which BPCs bind are essential for recruitment of the corepressor complex to this promoter. Our data suggest that we have identified a new regulatory mechanism controlling plant gene expression that is probably generally used, when considering BPCs’ wide expression profile and the frequent presence of consensus binding sites in plant promoters. PMID:23054472
Jung, Ki-Hong; Dardick, Christopher; Bartley, Laura E; Cao, Peijian; Phetsom, Jirapa; Canlas, Patrick; Seo, Young-Su; Shultz, Michael; Ouyang, Shu; Yuan, Qiaoping; Frank, Bryan C; Ly, Eugene; Zheng, Li; Jia, Yi; Hsia, An-Ping; An, Kyungsook; Chou, Hui-Hsien; Rocke, David; Lee, Geun Cheol; Schnable, Patrick S; An, Gynheung; Buell, C Robin; Ronald, Pamela C
2008-10-06
Studies of gene function are often hampered by gene-redundancy, especially in organisms with large genomes such as rice (Oryza sativa). We present an approach for using transcriptomics data to focus functional studies and address redundancy. To this end, we have constructed and validated an inexpensive and publicly available rice oligonucleotide near-whole genome array, called the rice NSF45K array. We generated expression profiles for light- vs. dark-grown rice leaf tissue and validated the biological significance of the data by analyzing sources of variation and confirming expression trends with reverse transcription polymerase chain reaction. We examined trends in the data by evaluating enrichment of gene ontology terms at multiple false discovery rate thresholds. To compare data generated with the NSF45K array with published results, we developed publicly available, web-based tools (www.ricearray.org). The Oligo and EST Anatomy Viewer enables visualization of EST-based expression profiling data for all genes on the array. The Rice Multi-platform Microarray Search Tool facilitates comparison of gene expression profiles across multiple rice microarray platforms. Finally, we incorporated gene expression and biochemical pathway data to reduce the number of candidate gene products putatively participating in the eight steps of the photorespiration pathway from 52 to 10, based on expression levels of putatively functionally redundant genes. We confirmed the efficacy of this method to cope with redundancy by correctly predicting participation in photorespiration of a gene with five paralogs. Applying these methods will accelerate rice functional genomics.
Parabolic flight induces changes in gene expression patterns in Arabidopsis thaliana.
Paul, Anna-Lisa; Manak, Michael S; Mayfield, John D; Reyes, Matthew F; Gurley, William B; Ferl, Robert J
2011-10-01
Our primary objective was to evaluate gene expression changes in Arabidopsis thaliana in response to parabolic flight as part of a comprehensive approach to the molecular biology of spaceflight-related adaptations. In addition, we wished to establish parabolic flight as a tractable operations platform for molecular biology studies. In a succession of experiments on NASA's KC-135 and C-9 parabolic aircraft, Arabidopsis plants were presented with replicated exposure to parabolic flight. Transcriptome profiling revealed that parabolic flight caused changes in gene expression patterns that stood the statistical tests of replication on three different flight days. The earliest response, after 20 parabolas, was characterized by a prominence of genes associated with signal transduction. After 40 parabolas, this prominence was largely replaced by genes associated with biotic and abiotic stimuli and stress. Among these responses, three metabolic processes stand out in particular: the induction of auxin metabolism and signaling, the differential expression of genes associated with calcium-mediated signaling, and the repression of genes associated with disease resistance and cell wall biochemistry. Many, but not all, of these responses are known to be involved in gravity sensing in plants. Changes in auxin-related gene expression were also recorded by reporter genes tuned to auxin signal pathways. These data demonstrate that the parabolic flight environment is appropriate for molecular biology research involving the transition to microgravity, in that with replication, proper controls, and analyses, gene expression changes can be observed in the time frames of typical parabolic flight experiments.
Ståhlberg, Anders; Elbing, Karin; Andrade-Garda, José Manuel; Sjögreen, Björn; Forootan, Amin; Kubista, Mikael
2008-04-16
The large sensitivity, high reproducibility and essentially unlimited dynamic range of real-time PCR to measure gene expression in complex samples provides the opportunity for powerful multivariate and multiway studies of biological phenomena. In multiway studies samples are characterized by their expression profiles to monitor changes over time, effect of treatment, drug dosage etc. Here we perform a multiway study of the temporal response of four yeast Saccharomyces cerevisiae strains with different glucose uptake rates upon altered metabolic conditions. We measured the expression of 18 genes as a function of time after addition of glucose to four strains of yeast grown in ethanol. The data are analyzed by matrix-augmented PCA, which is a generalization of PCA for 3-way data, and the results are confirmed by hierarchical clustering and clustering by Kohonen self-organizing map. Our approach identifies gene groups that respond similarly to the change of nutrient, and genes that behave differently in mutant strains. Of particular interest is our finding that ADH4 and ADH6 show a behavior typical of glucose-induced genes, while ADH3 and ADH5 are repressed after glucose addition. Multiway real-time PCR gene expression profiling is a powerful technique which can be utilized to characterize functions of new genes by, for example, comparing their temporal response after perturbation in different genetic variants of the studied subject. The technique also identifies genes that show perturbed expression in specific strains.
Ståhlberg, Anders; Elbing, Karin; Andrade-Garda, José Manuel; Sjögreen, Björn; Forootan, Amin; Kubista, Mikael
2008-01-01
Background: The large sensitivity, high reproducibility and essentially unlimited dynamic range of real-time PCR to measure gene expression in complex samples provides the opportunity for powerful multivariate and multiway studies of biological phenomena. In multiway studies samples are characterized by their expression profiles to monitor changes over time, effect of treatment, drug dosage etc. Here we perform a multiway study of the temporal response of four yeast Saccharomyces cerevisiae strains with different glucose uptake rates upon altered metabolic conditions. Results: We measured the expression of 18 genes as a function of time after addition of glucose to four strains of yeast grown in ethanol. The data are analyzed by matrix-augmented PCA, which is a generalization of PCA for 3-way data, and the results are confirmed by hierarchical clustering and clustering by Kohonen self-organizing map. Our approach identifies gene groups that respond similarly to the change of nutrient, and genes that behave differently in mutant strains. Of particular interest is our finding that ADH4 and ADH6 show a behavior typical of glucose-induced genes, while ADH3 and ADH5 are repressed after glucose addition. Conclusion: Multiway real-time PCR gene expression profiling is a powerful technique which can be utilized to characterize functions of new genes by, for example, comparing their temporal response after perturbation in different genetic variants of the studied subject. The technique also identifies genes that show perturbed expression in specific strains. PMID:18412983
The evolution of duplicate gene expression in mammalian organs
Guschanski, Katerina; Warnefors, Maria; Kaessmann, Henrik
2017-01-01
Gene duplications generate genomic raw material that allows the emergence of novel functions, likely facilitating adaptive evolutionary innovations. However, global assessments of the functional and evolutionary relevance of duplicate genes in mammals were until recently limited by the lack of appropriate comparative data. Here, we report a large-scale study of the expression evolution of DNA-based functional gene duplicates in three major mammalian lineages (placental mammals, marsupials, egg-laying monotremes) and birds, on the basis of RNA sequencing (RNA-seq) data from nine species and eight organs. We observe dynamic changes in tissue expression preference of paralogs with different duplication ages, suggesting differential contribution of paralogs to specific organ functions during vertebrate evolution. Specifically, we show that paralogs that emerged in the common ancestor of bony vertebrates are enriched for genes with brain-specific expression and provide evidence for differential forces underlying the preferential emergence of young testis- and liver-specific expressed genes. Further analyses uncovered that the overall spatial expression profiles of gene families tend to be conserved, with several exceptions of pronounced tissue specificity shifts among lineage-specific gene family expansions. Finally, we trace new lineage-specific genes that may have contributed to the specific biology of mammalian organs, including the little-studied placenta. Overall, our study provides novel and taxonomically broad evidence for the differential contribution of duplicate genes to tissue-specific transcriptomes and for their importance for the phenotypic evolution of vertebrates. PMID:28743766
Wozniak, Magdalena B.; Le Calvez-Kelm, Florence; Abedi-Ardekani, Behnoush; Byrnes, Graham; Durand, Geoffroy; Carreira, Christine; Michelon, Jocelyne; Janout, Vladimir; Holcatova, Ivana; Foretova, Lenka; Brisuda, Antonin; Lesueur, Fabienne; McKay, James; Brennan, Paul; Scelo, Ghislaine
2013-01-01
Gene expression microarray and next generation sequencing efforts on conventional, clear cell renal cell carcinoma (ccRCC) have been mostly performed in North American and Western European populations, while the highest incidence rates are found in Central/Eastern Europe. We conducted whole-genome expression profiling on 101 pairs of ccRCC tumours and adjacent non-tumour renal tissue from Czech patients recruited within the “K2 Study”, using the Illumina HumanHT-12 v4 Expression BeadChips to explore the molecular variations underlying the biological and clinical heterogeneity of this cancer. Differential expression analysis identified 1650 significant probes (fold change ≥2 and false discovery rate <0.05) mapping to 630 up- and 720 down-regulated unique genes. We performed similar statistical analysis on the RNA sequencing data of 65 ccRCC cases from the Cancer Genome Atlas (TCGA) project and identified 60% (402) of the downregulated and 74% (469) of the upregulated genes found in the K2 series. The biological characterization of the significantly deregulated genes demonstrated involvement of downregulated genes in metabolic and catabolic processes, excretion, oxidation reduction, ion transport and response to chemical stimulus, while simultaneously upregulated genes were associated with immune and inflammatory responses, response to hypoxia, stress, wounding, vasculature development and cell activation. Furthermore, genome-wide DNA methylation analysis of 317 TCGA ccRCC/adjacent non-tumour renal tissue pairs indicated that deregulation of approximately 7% of genes could be explained by epigenetic changes. Finally, survival analysis conducted on 89 K2 and 464 TCGA cases identified 8 genes associated with differential prognostic outcomes. In conclusion, a large proportion of ccRCC molecular characteristics were common to the two populations and several may have clinical implications when validated further through large clinical cohorts. PMID:23526956
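The probe-selection rule quoted in the abstract above (fold change ≥2 and false discovery rate <0.05) can be illustrated with a minimal Python sketch. The abstract does not say how the false discovery rate was estimated, so the Benjamini-Hochberg adjustment used here is an assumption, and the function names, the ratio-style fold change, and the example probes are all hypothetical.

```python
def benjamini_hochberg(pvalues):
    """Return Benjamini-Hochberg adjusted p-values, in the input order."""
    m = len(pvalues)
    order = sorted(range(m), key=lambda i: pvalues[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest p-value down so adjusted values stay monotone.
    for rank in range(m, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, pvalues[i] * m / rank)
        adjusted[i] = running_min
    return adjusted


def significant_probes(probes, min_fold=2.0, max_fdr=0.05):
    """probes: list of (probe_id, fold_change, p_value).

    fold_change is assumed to be a positive tumour/non-tumour ratio,
    so 0.25 means four-fold down-regulation.
    """
    qvalues = benjamini_hochberg([p for _, _, p in probes])
    hits = []
    for (probe_id, fold, _), q in zip(probes, qvalues):
        magnitude = fold if fold >= 1 else 1 / fold
        if magnitude >= min_fold and q < max_fdr:
            hits.append((probe_id, "up" if fold >= 1 else "down"))
    return hits


# Hypothetical probes, not data from the study.
example = [("p1", 4.0, 0.001), ("p2", 0.25, 0.004),
           ("p3", 1.2, 0.0005), ("p4", 3.0, 0.4)]
hits = significant_probes(example)
```

In this toy input, p3 fails the fold-change threshold and p4 fails the adjusted p-value threshold, so only p1 and p2 are retained.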
2008-01-01
Background: Sox genes encode transcription factors that function in a wide range of developmental processes across the animal kingdom. To better understand both the evolution of the Sox family and the roles of these genes in cnidarians, we are studying the Sox gene complement of the coral, Acropora millepora (Class Anthozoa). Results: Based on overall domain structures and HMG box sequences, the Acropora Sox genes considered here clearly fall into four of the five major Sox classes. AmSoxC is expressed in the ectoderm during development, in cells whose morphology is consistent with their assignment as sensory neurons. The expression pattern of the Nematostella ortholog of this gene is broadly similar to that of AmSoxC, but there are subtle differences – for example, expression begins significantly earlier in Acropora than in Nematostella. During gastrulation, AmSoxBb and AmSoxB1 transcripts are detected only in the presumptive ectoderm while AmSoxE1 transcription is restricted to the presumptive endoderm, suggesting that these Sox genes might play roles in germ layer specification. A third type B Sox gene, AmSoxBa, and a Sox F gene AmSoxF also have complex and specific expression patterns during early development. Each of these genes has a clear Nematostella ortholog, but in several cases the expression pattern observed in Acropora differs significantly from that reported in Nematostella. Conclusion: These differences in expression patterns between Acropora and Nematostella largely reflect fundamental differences in developmental processes, underscoring the diversity of mechanisms within the anthozoan Sub-Class Hexacorallia (Zoantharia). PMID:19014479
Wu, Shuanghua; Lei, Jianjun; Chen, Guoju; Chen, Hancai; Cao, Bihao; Chen, Changming
2017-01-01
Chinese kale, a vegetable of the cruciferous family, is a popular crop in southern China and Southeast Asia due to its high glucosinolate content and nutritional qualities. However, there is little research on the molecular genetics and genes involved in glucosinolate metabolism and its regulation in Chinese kale. In this study, we sequenced and characterized the transcriptomes and expression profiles of genes expressed in 11 tissues of Chinese kale. A total of 216 million 150-bp clean reads were generated using RNA-sequencing technology. From the sequences, 98,180 unigenes were assembled for the whole plant, and 49,582 to 98,423 unigenes were assembled for each tissue. Blast analysis indicated that a total of 80,688 (82.18%) unigenes exhibited similarity to known proteins. The functional annotation and classification tools used in this study suggested that genes principally expressed in Chinese kale were mostly involved in fundamental processes, such as cellular and molecular functions, signal transduction, and biosynthesis of secondary metabolites. The expression levels of all unigenes were analyzed in various tissues of Chinese kale. A large number of candidate genes involved in glucosinolate metabolism and its regulation were identified, and the expression patterns of these genes were analyzed. We found that most of the genes involved in glucosinolate biosynthesis were highly expressed in the root, petiole, and senescent leaves. The expression patterns of ten glucosinolate biosynthetic genes from RNA-seq were validated by quantitative RT-PCR in different tissues. These results provide an initial and global overview of Chinese kale gene functions and expression activities in different tissues. PMID:28228764
2017-01-01
Although in recent years the study of gene expression variation in the absence of genetic or environmental cues, or gene expression heterogeneity, has intensified considerably, many basic and applied biological fields still remain unaware of how useful the study of gene expression heterogeneity patterns might be for the characterization of biological systems and/or processes. Largely based on the modulatory effect chromatin compaction has on gene expression heterogeneity and the extensive changes in chromatin compaction known to occur for specialized cells that are naturally or artificially induced to revert to less specialized states or dedifferentiate, I recently hypothesized that processes that concur with cell dedifferentiation would show an extensive reduction in gene expression heterogeneity. The confirmation of the existence of such a trend could be of wide interest because of the biomedical and biotechnological relevance of cell dedifferentiation-based processes, i.e., regenerative development, cancer, human induced pluripotent stem cells, or plant somatic embryogenesis. Here, I report the first empirical evidence consistent with the existence of an extensive reduction in gene expression heterogeneity for processes that concur with cell dedifferentiation by analyzing transcriptome dynamics along forearm regenerative development in Ambystoma mexicanum (axolotl). Also, I briefly discuss the utility that the study of gene expression heterogeneity dynamics might have for the characterization of cell dedifferentiation-based processes and for the engineering of tools that afford better monitoring and modulation of such processes. Finally, I reflect on how a transitional reduction in gene expression heterogeneity for dedifferentiated cells can promote a long-term increase in phenotypic heterogeneity following cell dedifferentiation with potential adverse effects for biomedical and biotechnological applications. PMID:29134148
Genome-wide screen identifies a novel prognostic signature for breast cancer survival
Mao, Xuan Y.; Lee, Matthew J.; Zhu, Jeffrey; ...
2017-01-21
Large genomic datasets in combination with clinical data can be used as an unbiased tool to identify genes important in patient survival and discover potential therapeutic targets. We used a genome-wide screen to identify 587 genes significantly and robustly deregulated across four independent breast cancer (BC) datasets compared to normal breast tissue. Gene expression of 381 genes was significantly associated with relapse-free survival (RFS) in BC patients. We used a gene co-expression network approach to visualize the genetic architecture in normal breast and BCs. In normal breast tissue, co-expression cliques were identified enriched for cell cycle, gene transcription, cell adhesion, cytoskeletal organization and metabolism. In contrast, in BC, only two major co-expression cliques were identified enriched for cell cycle-related processes or blood vessel development, cell adhesion and mammary gland development processes. Interestingly, gene expression levels of 7 genes were found to be negatively correlated with many cell cycle related genes, highlighting these genes as potential tumor suppressors and novel therapeutic targets. A forward-conditional Cox regression analysis was used to identify a 12-gene signature associated with RFS. A prognostic scoring system was created based on the 12-gene signature. This scoring system robustly predicted BC patient RFS in 60 sampling test sets and was further validated in TCGA and METABRIC BC data. Our integrated study identified a 12-gene prognostic signature that could guide adjuvant therapy for BC patients and includes novel potential molecular targets for therapy.
Genome-wide screen identifies a novel prognostic signature for breast cancer survival
DOE Office of Scientific and Technical Information (OSTI.GOV)
Mao, Xuan Y.; Lee, Matthew J.; Zhu, Jeffrey
Large genomic datasets in combination with clinical data can be used as an unbiased tool to identify genes important in patient survival and discover potential therapeutic targets. We used a genome-wide screen to identify 587 genes significantly and robustly deregulated across four independent breast cancer (BC) datasets compared to normal breast tissue. Gene expression of 381 genes was significantly associated with relapse-free survival (RFS) in BC patients. We used a gene co-expression network approach to visualize the genetic architecture in normal breast and BCs. In normal breast tissue, co-expression cliques were identified enriched for cell cycle, gene transcription, cell adhesion, cytoskeletal organization and metabolism. In contrast, in BC, only two major co-expression cliques were identified enriched for cell cycle-related processes or blood vessel development, cell adhesion and mammary gland development processes. Interestingly, gene expression levels of 7 genes were found to be negatively correlated with many cell cycle related genes, highlighting these genes as potential tumor suppressors and novel therapeutic targets. A forward-conditional Cox regression analysis was used to identify a 12-gene signature associated with RFS. A prognostic scoring system was created based on the 12-gene signature. This scoring system robustly predicted BC patient RFS in 60 sampling test sets and was further validated in TCGA and METABRIC BC data. Our integrated study identified a 12-gene prognostic signature that could guide adjuvant therapy for BC patients and includes novel potential molecular targets for therapy.
Smith, Maria W.; Herfort, Lydie; Tyrol, Kaitlin; Suciu, Dominic; Campbell, Victoria; Crump, Byron C.; Peterson, Tawnya D.; Zuber, Peter; Baptista, Antonio M.; Simon, Holly M.
2010-01-01
Through their metabolic activities, microbial populations mediate the impact of high gradient regions on ecological function and productivity of the highly dynamic Columbia River coastal margin (CRCM). A 2226-probe oligonucleotide DNA microarray was developed to investigate expression patterns for microbial genes involved in nitrogen and carbon metabolism in the CRCM. Initial experiments with the environmental microarrays were directed toward validation of the platform and yielded high reproducibility in multiple tests. Bioinformatic and experimental validation also indicated that >85% of the microarray probes were specific for their corresponding target genes and for a few homologs within the same microbial family. The validated probe set was used to query gene expression responses by microbial assemblages to environmental variability. Sixty-four samples from the river, estuary, plume, and adjacent ocean were collected in different seasons and analyzed to correlate the measured variability in chemical, physical and biological water parameters to differences in global gene expression profiles. The method produced robust seasonal profiles corresponding to pre-freshet spring (April) and late summer (August). Overall relative gene expression was high in both seasons and was consistent with high microbial abundance measured by total RNA, heterotrophic bacterial production, and chlorophyll a. Both seasonal patterns involved large numbers of genes that were highly expressed relative to background, yet each produced very different gene expression profiles. April patterns revealed high differential gene expression in the coastal margin samples (estuary, plume and adjacent ocean) relative to freshwater, while little differential gene expression was observed along the river-to-ocean transition in August. Microbial gene expression profiles appeared to relate, in part, to seasonal differences in nutrient availability and potential resource competition. 
Furthermore, our results suggest that highly-active particle-attached microbiota in the Columbia River water column may perform dissimilatory nitrate reduction (both denitrification and DNRA) within anoxic particle microniches. PMID:20967204
Genome-wide methylation analysis identifies genes silenced in non-seminoma cell lines
Noor, Dzul Azri Mohamed; Jeyapalan, Jennie N; Alhazmi, Safiah; Carr, Matthew; Squibb, Benjamin; Wallace, Claire; Tan, Christopher; Cusack, Martin; Hughes, Jaime; Reader, Tom; Shipley, Janet; Sheer, Denise; Scotting, Paul J
2016-01-01
Silencing of genes by DNA methylation is a common phenomenon in many types of cancer. However, the genome-wide effect of DNA methylation on gene expression has been analysed in relatively few cancers. Germ cell tumours (GCTs) are a complex group of malignancies. They are unique in developing from a pluripotent progenitor cell. Previous analyses have suggested that non-seminomas exhibit much higher levels of DNA methylation than seminomas. The genomic targets that are methylated, the extent to which this results in gene silencing and the identity of the silenced genes most likely to play a role in the tumours’ biology have not yet been established. In this study, genome-wide methylation and expression analysis of GCT cell lines was combined with gene expression data from primary tumours to address this question. Genome methylation was analysed using the Illumina infinium HumanMethylome450 bead chip system and gene expression was analysed using Affymetrix GeneChip Human Genome U133 Plus 2.0 arrays. Regulation by methylation was confirmed by demethylation using 5-aza-2-deoxycytidine and reverse transcription–quantitative PCR. Large differences in the level of methylation of the CpG islands of individual genes between tumour cell lines correlated well with differential gene expression. Treatment of non-seminoma cells with 5-aza-2-deoxycytidine verified that methylation of all genes tested played a role in their silencing in yolk sac tumour cells and many of these genes were also differentially expressed in primary tumours. Genes silenced by methylation in the various GCT cell lines were identified. Several pluripotency-associated genes were identified as a major functional group of silenced genes. PMID:29263807
Liu, Kaidong; Yuan, Changchun; Feng, Shaoxian; Zhong, Shuting; Li, Haili; Zhong, Jundi; Shen, Chenjia; Liu, Jinxiang
2017-05-05
Auxin/indole-3-acetic acid (Aux/IAA) family genes encode short-lived nuclear proteins that mediate the responses of auxin-related genes and are involved in several plant developmental and growth processes. However, how Aux/IAA genes function in the fruit development and ripening of papaya (Carica papaya L.) is largely unknown. In this study, a comprehensive identification and a distinctive expression analysis of 18 C. papaya Aux/IAA (CpIAA) genes were performed using newly updated papaya reference genome data. The Aux/IAA gene family in papaya is slightly smaller than that in Arabidopsis, but all of the phylogenetic subfamilies are represented. Most of the CpIAA genes are responsive to various phytohormones and expressed in a tissue-specific manner. To understand the putative biological functions of the CpIAA genes involved in fruit development and ripening, quantitative real-time PCR was used to profile the expression of CpIAA genes at different stages. Furthermore, an IAA treatment significantly delayed the ripening process in papaya fruit at the early stages. The expression changes of CpIAA genes in ACC and 1-MCP treatments suggested a crosstalk between auxin and ethylene during the fruit ripening process of papaya. Our study provided comprehensive information on the Aux/IAA family in papaya, including gene structures, phylogenetic relationships and expression profiles. The involvement of CpIAA gene expression changes in fruit development and ripening gives us an opportunity to understand the roles of auxin signaling in the maturation of papaya reproductive organs.
Cellular Factors Shape 3D Genome Landscape
Researchers, using novel large-scale imaging technology, have mapped the spatial location of individual genes in the nucleus of human cells and identified 50 cellular factors required for the proper 3D positioning of genes. These spatial locations play important roles in gene expression, DNA repair, genome stability, and other cellular activities.
Aguirre von Wobeser, Eneas; Ibelings, Bas W.; Bok, Jasper; Krasikov, Vladimir; Huisman, Jef; Matthijs, Hans C.P.
2011-01-01
Physiological adaptation and genome-wide expression profiles of the cyanobacterium Synechocystis sp. strain PCC 6803 in response to gradual transitions between nitrogen-limited and light-limited growth conditions were measured in continuous cultures. Transitions induced changes in pigment composition, light absorption coefficient, photosynthetic electron transport, and specific growth rate. Physiological changes were accompanied by reproducible changes in the expression of several hundred open reading frames, genes with functions in photosynthesis and respiration, carbon and nitrogen assimilation, protein synthesis, phosphorus metabolism, and overall regulation of cell function and proliferation. Cluster analysis of the nearly 1,600 regulated open reading frames identified eight clusters, each showing a different temporal response during the transitions. Two large clusters mirrored each other. One cluster included genes involved in photosynthesis, which were up-regulated during light-limited growth but down-regulated during nitrogen-limited growth. Conversely, genes in the other cluster were down-regulated during light-limited growth but up-regulated during nitrogen-limited growth; this cluster included several genes involved in nitrogen uptake and assimilation. These results demonstrate complementary regulation of gene expression for two major metabolic activities of cyanobacteria. Comparison with batch-culture experiments revealed interesting differences in gene expression between batch and continuous culture and illustrates that continuous-culture experiments can pick up subtle changes in cell physiology and gene expression. PMID:21205618
Houtz, Robert L.
1998-01-01
The gene sequence for ribulose-1,5-bisphosphate carboxylase/oxygenase (Rubisco) large subunit (LS) εN-methyltransferase (protein methylase III or Rubisco LSMT) is disclosed. This enzyme catalyzes methylation of the ε-amine of lysine-14 in the large subunit of Rubisco. In addition, a full-length cDNA clone for Rubisco LSMT is disclosed. Transgenic plants and methods of producing same which (1) have the Rubisco LSMT gene inserted into the DNA, and (2) have the Rubisco LSMT gene product or the action of the gene product deleted from the DNA are also provided. Further, methods of using the gene to selectively deliver desired agents to a plant are also disclosed.
Houtz, Robert L.
1999-01-01
The gene sequence for ribulose-1,5-bisphosphate carboxylase/oxygenase (Rubisco) large subunit (LS) εN-methyltransferase (protein methylase III or Rubisco LSMT) is disclosed. This enzyme catalyzes methylation of the ε-amine of lysine-14 in the large subunit of Rubisco. In addition, a full-length cDNA clone for Rubisco LSMT is disclosed. Transgenic plants and methods of producing same which (1) have the Rubisco LSMT gene inserted into the DNA, and (2) have the Rubisco LSMT gene product or the action of the gene product deleted from the DNA are also provided. Further, methods of using the gene to selectively deliver desired agents to a plant are also disclosed.
Houtz, R.L.
1998-03-03
The gene sequence for ribulose-1,5-bisphosphate carboxylase/oxygenase (Rubisco) large subunit (LS) εN-methyltransferase (protein methylase III or Rubisco LSMT) is disclosed. This enzyme catalyzes methylation of the ε-amine of lysine-14 in the large subunit of Rubisco. In addition, a full-length cDNA clone for Rubisco LSMT is disclosed. Transgenic plants and methods of producing same which (1) have the Rubisco LSMT gene inserted into the DNA, and (2) have the Rubisco LSMT gene product or the action of the gene product deleted from the DNA are also provided. Further, methods of using the gene to selectively deliver desired agents to a plant are also disclosed. 5 figs.
Houtz, R.L.
1999-02-02
The gene sequence for ribulose-1,5-bisphosphate carboxylase/oxygenase (Rubisco) large subunit (LS) εN-methyltransferase (protein methylase III or Rubisco LSMT) is disclosed. This enzyme catalyzes methylation of the ε-amine of lysine-14 in the large subunit of Rubisco. In addition, a full-length cDNA clone for Rubisco LSMT is disclosed. Transgenic plants and methods of producing same which (1) have the Rubisco LSMT gene inserted into the DNA, and (2) have the Rubisco LSMT gene product or the action of the gene product deleted from the DNA are also provided. Further, methods of using the gene to selectively deliver desired agents to a plant are also disclosed. 8 figs.
Lee, Ann-Ying; Chen, Chun-Yi; Chang, Yao-Chien Alex; Chao, Ya-Ting; Shih, Ming-Che
2013-01-01
Previously we developed genomic resources for orchids, including transcriptomic analyses using next-generation sequencing techniques and construction of a web-based orchid genomic database. Here, we report a modified molecular model of flower development in the Orchidaceae based on functional analysis of gene expression profiles in Phalaenopsis aphrodite (a moth orchid) that revealed novel roles for the transcription factors involved in floral organ pattern formation. Phalaenopsis orchid floral organ-specific genes were identified by microarray analysis. Several critical transcription factors, including AP3, PI, AP1 and AGL6, displayed distinct spatial distribution patterns. Phylogenetic analysis of orchid MADS box genes was conducted to infer the evolutionary relationship among floral organ-specific genes. The results suggest that duplication of MADS box genes in orchids may have resulted in their gaining novel functions during evolution. Based on these analyses, a modified model of orchid flowering was proposed. Comparison of the expression profiles of flowers of a peloric mutant and wild-type Phalaenopsis orchid further identified genes associated with lip morphology and peloric effects. Large scale investigation of gene expression profiles revealed that homeotic genes from the ABCDE model of flower development classes A and B in the Phalaenopsis orchid have novel functions due to evolutionary diversification, and display differential expression patterns. PMID:24265826
Selective modes determine evolutionary rates, gene compactness and expression patterns in Brassica.
Guo, Yue; Liu, Jing; Zhang, Jiefu; Liu, Shengyi; Du, Jianchang
2017-07-01
It has been well documented that most nuclear protein-coding genes in organisms can be classified into two categories: positively selected genes (PSGs) and negatively selected genes (NSGs). The characteristics and evolutionary fates of different types of genes, however, have been poorly understood. In this study, the rates of nonsynonymous substitution (Ka) and the rates of synonymous substitution (Ks) were investigated by comparing the orthologs between the two sequenced Brassica species, Brassica rapa and Brassica oleracea, and the evolutionary rates, gene structures, expression patterns, and codon bias were compared between PSGs and NSGs. The resulting data show that PSGs have higher protein evolutionary rates, lower synonymous substitution rates, shorter gene length, fewer exons, higher functional specificity, lower expression level, higher tissue-specific expression and stronger codon bias than NSGs. Although the quantities and values are different, the relative features of PSGs and NSGs have been largely verified in the model species Arabidopsis. These data suggest that PSGs and NSGs differ not only under selective pressure (Ka/Ks), but also in their evolutionary, structural and functional properties, indicating that selective modes may serve as a determinant factor for measuring evolutionary rates, gene compactness and expression patterns in Brassica. © 2017 The Authors The Plant Journal © 2017 John Wiley & Sons Ltd.
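As a rough illustration of how the ratio of nonsynonymous to synonymous substitution rates separates the two gene categories named in the abstract above, the following Python sketch labels ortholog pairs by their ratio. The abstract does not state the cutoff the authors used, so the conventional threshold of 1 is an assumption here, and the function name and example values are hypothetical.

```python
def classify_selection(ka: float, ks: float) -> str:
    """Label an ortholog pair from its Ka and Ks estimates.

    Assumed convention (not taken from the abstract):
    Ka/Ks > 1 -> positively selected gene (PSG),
    Ka/Ks < 1 -> negatively selected gene (NSG).
    """
    if ks <= 0:
        # The ratio is undefined without synonymous substitutions.
        return "undefined"
    ratio = ka / ks
    if ratio > 1:
        return "PSG"
    if ratio < 1:
        return "NSG"
    return "neutral"


# Hypothetical (Ka, Ks) estimates, not values from the study.
pairs = {"geneA": (0.30, 0.20), "geneB": (0.02, 0.40), "geneC": (0.10, 0.0)}
labels = {gene: classify_selection(ka, ks) for gene, (ka, ks) in pairs.items()}
```

Pairs with no synonymous substitutions are reported separately rather than forced into either class, since the ratio cannot be computed for them.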
Singh, R; Upadhyay, G; Kumar, S; Kapoor, A; Kumar, A; Tiwari, M; Godbole, M M
2003-01-01
Thyroid hormone (TH) deficiency results in delayed proliferation and migration of cerebellar granule cells. Although extensive cell loss during the development of the cerebellum under hypothyroid conditions is known, its nature and its mechanism are poorly understood. Bcl-2 family gene expression is known to determine the fate of cells to undergo apoptosis. We evaluated the effect of hypothyroidism on Bcl-2 family gene expression in the developing rat cerebellum. Electrophoresis and Western blotting were used to analyze DNA fragmentation and expression of DNA fragmentation factor (DFF-45), Bcl-2, Bcl-xL and Bax genes respectively. In the hypothyroid condition, extensive DNA fragmentation and enhanced cleavage of DFF-45 were seen throughout development (postnatal day 0 to day 24) and adulthood whereas they were absent in the euthyroid state. The anti-apoptotic genes Bcl-2 and Bcl-xL were down-regulated and the pro-apoptotic gene Bax was expressed at higher levels compared with the euthyroid state. These results suggest that normal levels of TH prevent cerebellar apoptosis to a large extent, whereas hypothyroidism not only increases the extent but also the duration of apoptosis by down-regulating the anti-apoptotic genes and maintaining a high level of the pro-apoptotic gene Bax.
Domestication Effects on Stress Induced Steroid Secretion and Adrenal Gene Expression in Chickens.
Fallahsharoudi, Amir; de Kock, Neil; Johnsson, Martin; Ubhayasekera, S J Kumari A; Bergquist, Jonas; Wright, Dominic; Jensen, Per
2015-10-16
Understanding the genetic basis of phenotypic diversity is a challenge in contemporary biology. Domestication provides a model for unravelling aspects of the genetic basis of stress sensitivity. The ancestral Red Junglefowl (RJF) exhibits greater fear-related behaviour and a more pronounced HPA-axis reactivity than its domesticated counterpart, the White Leghorn (WL). By comparing plasma hormone levels and global adrenal gene transcription profiles between WL and RJF in response to an acute stress event, we investigated the molecular basis for the altered physiological stress responsiveness in domesticated chickens. Basal levels of pregnenolone and dehydroepiandrosterone as well as corticosterone response were lower in WL. Microarray analysis of gene expression in adrenal glands showed a significant breed effect in a large number of transcripts with over-representation of genes in the channel activity pathway. The expression of the best-known steroidogenesis genes was similar across the breeds used. Transcription levels of acute stress response genes such as StAR, CH25 and POMC were upregulated in response to acute stress. Dampened HPA reactivity in domesticated chickens was associated with changes in the expression of several genes with potentially minor regulatory effects, rather than with changes in the expression of critical steroidogenic genes in the adrenal.
Arkusz, Joanna; Stępnik, Maciej; Sobala, Wojciech; Dastych, Jarosław
2010-11-10
The aim of this study was to find differentially regulated genes in THP-1 monocytic cells exposed to sensitizers and nonsensitizers and to investigate if such genes could be reliable markers for an in vitro predictive method for the identification of skin sensitizing chemicals. Changes in expression of 35 genes in the THP-1 cell line following treatment with chemicals of different sensitizing potential (from nonsensitizers to extreme sensitizers) were assessed using real-time PCR. Verification of 13 candidate genes by testing a large number of chemicals (an additional 22 sensitizers and 8 nonsensitizers) revealed that prediction of contact sensitization potential was possible based on evaluation of changes in three genes: IL8, HMOX1 and PMAIP1. In total, changes in expression of these genes allowed correct detection of sensitization potential of 21 out of 27 (78%) test sensitizers. The gene expression levels inside potency groups varied and did not allow estimation of sensitization potency of test chemicals. Results of this study indicate that evaluation of changes in expression of proposed biomarkers in THP-1 cells could be a valuable model for preliminary screening of chemicals to discriminate an appreciable majority of sensitizers from nonsensitizers. Copyright © 2010 Elsevier Ireland Ltd. All rights reserved.
Gene expression profiling of mesenteric lymph nodes from sheep with natural scrapie
2014-01-01
Background Prion diseases are characterized by the accumulation of the pathogenic PrPSc protein, mainly in the brain and the lymphoreticular system. Although prions multiply/accumulate in the lymph nodes without any detectable pathology, transcriptional changes in this tissue may reflect biological processes that contribute to the molecular pathogenesis of prion diseases. Little is known about the molecular processes that occur in the lymphoreticular system in early and late stages of prion disease. We performed a microarray-based study to identify genes that are differentially expressed at different disease stages in the mesenteric lymph node of sheep naturally infected with scrapie. Oligo DNA microarrays were used to identify gene-expression profiles in the early/middle (preclinical) and late (clinical) stages of the disease. Results In the clinical stage of the disease, we detected 105 genes that were differentially expressed (≥2-fold change in expression). Of these, 43 were upregulated and 62 downregulated as compared with age-matched negative controls. Fewer genes (50) were differentially expressed in the preclinical stage of the disease. Gene Ontology enrichment analysis revealed that the differentially expressed genes were largely associated with the following terms: glycoprotein, extracellular region, disulfide bond, cell cycle and extracellular matrix. Moreover, some of the annotated genes could be grouped into 3 specific signaling pathways: focal adhesion, PPAR signaling and ECM-receptor interaction. We discuss the relationship between the observed gene expression profiles and PrPSc deposition and the potential involvement in the pathogenesis of scrapie of 7 specific differentially expressed genes whose expression levels were confirmed by real time-PCR. 
Conclusions The present findings identify new genes that may be involved in the pathogenesis of natural scrapie infection in the lymphoreticular system, and confirm previous reports describing scrapie-induced alterations in the expression of genes involved in protein misfolding, angiogenesis and the oxidative stress response. Further studies will be necessary to determine the role of these genes in prion replication, dissemination and in the response of the organism to this disease. PMID:24450868
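The ≥2-fold criterion quoted in the Results of the abstract above can be sketched as a simple per-gene call. This is a minimal illustration, not the authors' pipeline: it assumes the fold change is a plain ratio of infected to control expression on a linear scale, and the gene names and values are hypothetical.

```python
def fold_change_call(case: float, control: float, threshold: float = 2.0) -> str:
    """Call a gene up, down or unchanged from a case/control expression ratio."""
    if case <= 0 or control <= 0:
        raise ValueError("expression values must be positive")
    ratio = case / control
    if ratio >= threshold:
        return "up"            # at least `threshold`-fold higher in cases
    if ratio <= 1.0 / threshold:
        return "down"          # at least `threshold`-fold lower in cases
    return "unchanged"


# Hypothetical (case, control) intensities, not data from the study.
samples = {"g1": (400.0, 100.0), "g2": (50.0, 200.0), "g3": (120.0, 100.0)}
calls = {gene: fold_change_call(c, k) for gene, (c, k) in samples.items()}
print(calls)

# The counts reported for the clinical stage are internally consistent:
# 43 upregulated + 62 downregulated = 105 differentially expressed genes.
assert 43 + 62 == 105
```

Microarray analyses usually combine a fold-change cutoff like this with a statistical test; the abstract does not say which test was applied, so none is assumed here.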
Composite transcriptome assembly of RNA-seq data in a sheep model for delayed bone healing.
Jäger, Marten; Ott, Claus-Eric; Grünhagen, Johannes; Hecht, Jochen; Schell, Hanna; Mundlos, Stefan; Duda, Georg N; Robinson, Peter N; Lienau, Jasmin
2011-03-24
The sheep is an important model organism for many types of medically relevant research, but molecular genetic experiments in the sheep have been limited by the lack of knowledge about ovine gene sequences. Prior to our study, mRNA sequences for only 1,556 partial or complete ovine genes were publicly available. Therefore, we developed a composite de novo transcriptome assembly method for next-generation sequence data to combine known ovine mRNA and EST sequences, mRNA sequences from mouse and cow, and sequences assembled de novo from short read RNA-Seq data into a composite reference transcriptome, and identified transcripts from over 12 thousand previously undescribed ovine genes. Gene expression analysis based on these data revealed substantially different expression profiles in standard versus delayed bone healing in an ovine tibial osteotomy model. Hundreds of transcripts were differentially expressed between standard and delayed healing and between the time points of the standard and delayed healing groups. We used the sheep sequences to design quantitative RT-PCR assays with which we validated the differential expression of 26 genes that had been identified by RNA-seq analysis. A number of clusters of characteristic expression profiles could be identified, some of which showed striking differences between the standard and delayed healing groups. Gene Ontology (GO) analysis showed that the differentially expressed genes were enriched in terms including extracellular matrix, cartilage development, contractile fiber, and chemokine activity. Our results provide a first atlas of gene expression profiles and differentially expressed genes in standard and delayed bone healing in a large-animal model and provide a number of clues as to the shifts in gene expression that underlie delayed bone healing. In the course of our study, we identified transcripts of 13,987 ovine genes, including 12,431 genes for which no sequence information was previously available. 
This information will provide a basis for future molecular research involving the sheep as a model organism. PMID:21435219
Expression and assembly of a fully active antibody in algae
NASA Astrophysics Data System (ADS)
Mayfield, Stephen P.; Franklin, Scott E.; Lerner, Richard A.
2003-01-01
Although combinatorial antibody libraries have solved the problem of access to large immunological repertoires, efficient production of these complex molecules remains a problem. Here we demonstrate the efficient expression of a unique large single-chain (lsc) antibody in the chloroplast of the unicellular, green alga, Chlamydomonas reinhardtii. We achieved high levels of protein accumulation by synthesizing the lsc gene in chloroplast codon bias and by driving expression of the chimeric gene using either of two C. reinhardtii chloroplast promoters and 5' and 3' RNA elements. This lsc antibody, directed against glycoprotein D of the herpes simplex virus, is produced in a soluble form by the alga and assembles into higher order complexes in vivo. Aside from dimerization by disulfide bond formation, the antibody undergoes no detectable posttranslational modification. We further demonstrate that accumulation of the antibody can be modulated by the specific growth regime used to culture the alga, and by the choice of 5' and 3' elements used to drive expression of the antibody gene. These results demonstrate the utility of alga as an expression platform for recombinant proteins, and describe a new type of single chain antibody containing the entire heavy chain protein, including the Fc domain.
Quantification of differential gene expression by multiplexed targeted resequencing of cDNA
Arts, Peer; van der Raadt, Jori; van Gestel, Sebastianus H.C.; Steehouwer, Marloes; Shendure, Jay; Hoischen, Alexander; Albers, Cornelis A.
2017-01-01
Whole-transcriptome or RNA sequencing (RNA-Seq) is a powerful and versatile tool for functional analysis of different types of RNA molecules, but sample reagent and sequencing cost can be prohibitive for hypothesis-driven studies where the aim is to quantify differential expression of a limited number of genes. Here we present an approach for quantification of differential mRNA expression by targeted resequencing of complementary DNA using single-molecule molecular inversion probes (cDNA-smMIPs) that enable highly multiplexed resequencing of cDNA target regions of ∼100 nucleotides and counting of individual molecules. We show that accurate estimates of differential expression can be obtained from molecule counts for hundreds of smMIPs per reaction and that smMIPs are also suitable for quantification of relative gene expression and allele-specific expression. Compared with low-coverage RNA-Seq and a hybridization-based targeted RNA-Seq method, cDNA-smMIPs are a cost-effective high-throughput tool for hypothesis-driven expression analysis in large numbers of genes (10 to 500) and samples (hundreds to thousands). PMID:28474677
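The idea of estimating expression from counts of individual molecules, as described in the abstract above, can be sketched as follows. The sketch assumes each read carries a probe identifier and a molecular tag, so that reads sharing both are collapsed into one molecule; the function names, tags and the pseudocount are illustrative assumptions, not the published cDNA-smMIP pipeline.

```python
import math
from collections import defaultdict


def count_molecules(reads):
    """Count unique molecular tags per probe.

    `reads` is an iterable of (probe_id, molecular_tag) pairs; duplicate
    reads of the same tagged molecule are counted once.
    """
    tags = defaultdict(set)
    for probe, tag in reads:
        tags[probe].add(tag)
    return {probe: len(seen) for probe, seen in tags.items()}


def log2_fold_change(count_a: int, count_b: int, pseudocount: float = 1.0) -> float:
    """Log2 ratio of two molecule counts; the pseudocount avoids division by zero."""
    return math.log2((count_a + pseudocount) / (count_b + pseudocount))


# Hypothetical reads for one probe in two samples.
sample_a = [("probe1", "AAT"), ("probe1", "AAT"), ("probe1", "CGT"), ("probe1", "GGA")]
sample_b = [("probe1", "TTC"), ("probe1", "TTC")]

counts_a = count_molecules(sample_a)   # three distinct tags
counts_b = count_molecules(sample_b)   # one distinct tag
lfc = log2_fold_change(counts_a["probe1"], counts_b["probe1"])
print(counts_a, counts_b, lfc)
```

A real analysis would also normalize for sequencing depth and probe efficiency before comparing samples; those steps are left out to keep the counting step visible.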
Xu, Min; Wang, Yemin; Zhao, Zhilong; Gao, Guixi; Huang, Sheng-Xiong; Kang, Qianjin; He, Xinyi; Lin, Shuangjun; Pang, Xiuhua; Deng, Zixin
2016-01-01
ABSTRACT Genome sequencing projects in the last decade revealed numerous cryptic biosynthetic pathways for unknown secondary metabolites in microbes, revitalizing drug discovery from microbial metabolites by approaches called genome mining. In this work, we developed a heterologous expression and functional screening approach for genome mining from genomic bacterial artificial chromosome (BAC) libraries in Streptomyces spp. We demonstrate mining from a strain of Streptomyces rochei, which is known to produce streptothricins and borrelidin, by expressing its BAC library in the surrogate host Streptomyces lividans SBT5, and screening for antimicrobial activity. In addition to the successful capture of the streptothricin and borrelidin biosynthetic gene clusters, we discovered two novel linear lipopeptides and their corresponding biosynthetic gene cluster, as well as a novel cryptic gene cluster for an unknown antibiotic from S. rochei. This high-throughput functional genome mining approach can be easily applied to other streptomycetes, and it is very suitable for the large-scale screening of genomic BAC libraries for bioactive natural products and the corresponding biosynthetic pathways. IMPORTANCE Microbial genomes encode numerous cryptic biosynthetic gene clusters for unknown small metabolites with potential biological activities. Several genome mining approaches have been developed to activate and bring these cryptic metabolites to biological tests for future drug discovery. Previous sequence-guided procedures relied on bioinformatic analysis to predict potentially interesting biosynthetic gene clusters. In this study, we describe an efficient approach based on heterologous expression and functional screening of a whole-genome library for the mining of bioactive metabolites from Streptomyces. 
The usefulness of this function-driven approach was demonstrated by the capture of four large biosynthetic gene clusters for metabolites of various chemical types, including streptothricins, borrelidin, two novel lipopeptides, and one unknown antibiotic from Streptomyces rochei Sal35. The transfer, expression, and screening of the library were all performed in a high-throughput way, so that this approach is scalable and adaptable to industrial automation for next-generation antibiotic discovery. PMID:27451447
Wang, Siwen; Xing, Zheng; Pascuzzi, Pete E; Tran, Elizabeth J
2017-07-05
Cells fine-tune their metabolic programs according to nutrient availability in order to maintain homeostasis. This is achieved largely through integrating signaling pathways and the gene expression program, allowing cells to adapt to nutritional change. Dbp2, a member of the DEAD-box RNA helicase family in Saccharomyces cerevisiae, has been proposed to integrate gene expression with cellular metabolism. Prior work from our laboratory has reported the necessity of DBP2 in proper gene expression, particularly for genes involved in glucose-dependent regulation. Here, by comparing differentially expressed genes in dbp2∆ to those of 700 other deletion strains from other studies, we find that CYC8 and TUP1, which form a complex and inhibit transcription of numerous genes, corepress a common set of genes with DBP2. Gene ontology (GO) annotations reveal that these corepressed genes are related to cellular metabolism, including respiration, gluconeogenesis, and alternative carbon-source utilization genes. Consistent with a direct role in metabolic gene regulation, loss of either DBP2 or CYC8 results in increased cellular respiration rates. Furthermore, we find that corepressed genes have a propensity to be associated with overlapping long noncoding RNAs and that upregulation of these genes in the absence of DBP2 correlates with decreased binding of Cyc8 to these gene promoters. Taken together, this suggests that Dbp2 integrates nutrient availability with energy homeostasis by maintaining repression of glucose-repressed, Cyc8-targeted genes across the genome. Copyright © 2017 Wang et al.
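Comparing the differentially expressed genes of one deletion strain with those of another, as the abstract above describes, comes down to asking whether two gene sets overlap more than chance would predict. One common way to score this is a hypergeometric tail probability, sketched below. The abstract does not say which statistic the authors used, so this is an assumed stand-in, and the gene names and universe size are hypothetical.

```python
from math import comb


def overlap_p_value(universe: int, set_a: int, set_b: int, overlap: int) -> float:
    """Probability of seeing at least `overlap` shared genes by chance.

    Treats set B as `set_b` genes drawn without replacement from a universe
    of `universe` genes, `set_a` of which belong to set A.
    """
    upper = min(set_a, set_b)
    total = comb(universe, set_b)
    tail = sum(
        comb(set_a, k) * comb(universe - set_a, set_b - k)
        for k in range(overlap, upper + 1)
    )
    return tail / total


# Hypothetical differentially expressed gene sets from two deletion strains.
genes_a = {"g1", "g2", "g3", "g4"}
genes_b = {"g3", "g4", "g5"}
shared = genes_a & genes_b

p = overlap_p_value(universe=20, set_a=len(genes_a), set_b=len(genes_b), overlap=len(shared))
print(sorted(shared), p)
```

When one strain is compared against hundreds of others, the resulting p-values would also need multiple-testing correction, which is not shown here.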
Identification and expression profiles of the WRKY transcription factor family in Ricinus communis.
Li, Hui-Liang; Zhang, Liang-Bo; Guo, Dong; Li, Chang-Zhu; Peng, Shi-Qing
2012-07-25
In plants, WRKY proteins constitute a large family of transcription factors. They are involved in many biological processes, such as plant development, metabolism, and responses to biotic and abiotic stresses. A large number of WRKY transcription factors have been reported from Arabidopsis, rice, and other higher plants. The recent publication of the draft genome sequence of castor bean (Ricinus communis) has allowed a genome-wide search for R. communis WRKY (RcWRKY) transcription factors and the comparison of these positively identified proteins with their homologs in model plants. A total of 47 WRKY genes were identified in the castor bean genome. According to the structural features of the WRKY domain, the RcWRKY are classified into seven main phylogenetic groups. Furthermore, putative orthologs of RcWRKY proteins in Arabidopsis and rice could now be assigned. An analysis of expression profiles of RcWRKY genes indicates that 47 WRKY genes display differential expressions either in their transcript abundance or expression patterns under normal growth conditions. Copyright © 2012 Elsevier B.V. All rights reserved.
The Plant Genome Integrative Explorer Resource: PlantGenIE.org.
Sundell, David; Mannapperuma, Chanaka; Netotea, Sergiu; Delhomme, Nicolas; Lin, Yao-Cheng; Sjödin, Andreas; Van de Peer, Yves; Jansson, Stefan; Hvidsten, Torgeir R; Street, Nathaniel R
2015-12-01
Accessing and exploring large-scale genomics data sets remains a significant challenge to researchers without specialist bioinformatics training. We present the integrated PlantGenIE.org platform for exploration of Populus, conifer and Arabidopsis genomics data, which includes expression networks and associated visualization tools. Standard features of a model organism database are provided, including genome browsers, gene list annotation, Blast homology searches and gene information pages. Community annotation updating is supported via integration of WebApollo. We have produced an RNA-sequencing (RNA-Seq) expression atlas for Populus tremula and have integrated these data within the expression tools. An updated version of the ComPlEx resource for performing comparative plant expression analyses of gene coexpression network conservation between species has also been integrated. The PlantGenIE.org platform provides intuitive access to large-scale and genome-wide genomics data from model forest tree species, facilitating both community contributions to annotation improvement and tools supporting use of the included data resources to inform biological insight. © 2015 The Authors. New Phytologist © 2015 New Phytologist Trust.
An elt-3/elt-5/elt-6 GATA Transcription Circuit Guides Aging in C. elegans
Budovskaya, Yelena V.; Wu, Kendall; Southworth, Lucinda K.; Jiang, Min; Tedesco, Patricia; Johnson, Thomas E.; Kim, Stuart K.
2016-01-01
SUMMARY To define the C. elegans aging process at the molecular level, we used DNA microarray experiments to identify a set of 1294 age-regulated genes and found that the GATA transcription factors ELT-3, ELT-5, and ELT-6 are responsible for age regulation of a large fraction of these genes. Expression of elt-5 and elt-6 increases during normal aging, and both of these GATA factors repress expression of elt-3, which shows a corresponding decrease in expression in old worms. elt-3 regulates a large number of downstream genes that change expression in old age, including ugt-9, col-144, and sod-3. elt-5(RNAi) and elt-6(RNAi) worms have extended longevity, indicating that elt-3, elt-5, and elt-6 play an important functional role in the aging process. These results identify a transcriptional circuit that guides the rapid aging process in C. elegans and indicate that this circuit is driven by drift of developmental pathways rather than accumulation of damage. PMID:18662544