Gene Expression Profiling of Gastric Cancer
Marimuthu, Arivusudar; Jacob, Harrys K.C.; Jakharia, Aniruddha; Subbannayya, Yashwanth; Keerthikumar, Shivakumar; Kashyap, Manoj Kumar; Goel, Renu; Balakrishnan, Lavanya; Dwivedi, Sutopa; Pathare, Swapnali; Dikshit, Jyoti Bajpai; Maharudraiah, Jagadeesha; Singh, Sujay; Sameer Kumar, Ghantasala S; Vijayakumar, M.; Veerendra Kumar, Kariyanakatte Veeraiah; Premalatha, Chennagiri Shrinivasamurthy; Tata, Pramila; Hariharan, Ramesh; Roa, Juan Carlos; Prasad, T.S.K; Chaerkady, Raghothama; Kumar, Rekha Vijay; Pandey, Akhilesh
2015-01-01
Gastric cancer is the second leading cause of cancer death worldwide in both men and women. A genome-wide gene expression analysis was carried out to identify differentially expressed genes in gastric adenocarcinoma tissues as compared to adjacent normal tissues. We used Agilent’s whole human genome oligonucleotide microarray platform representing ~41,000 genes to carry out gene expression analysis. Two-color microarray analysis was employed to directly compare the expression of genes between tumor and normal tissues. Through this approach, we identified several previously known candidate genes along with a number of novel candidate genes in gastric cancer. Testican-1 (SPOCK1) was one of the novel molecules and was 10-fold upregulated in tumors. Using tissue microarrays, we validated the expression of testican-1 by immunohistochemical staining. It was overexpressed in 56% (160/282) of the cases tested. Pathway analysis identified several networks of interacting genes, and SPOCK1 belonged to one of the topmost networks. By gene enrichment analysis, we found that several genes involved in cell adhesion and cell proliferation were significantly upregulated, while those corresponding to metabolic pathways were significantly downregulated. The differentially expressed genes identified in this study are candidate biomarkers for gastric adenocarcinoma. PMID:27030788
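The two-color comparison described in this abstract is usually summarised per probe as a log2 ratio of the two channel intensities. The sketch below illustrates that arithmetic only; the function names, the assignment of tumor and normal samples to channels, and the intensity values are assumptions made for illustration and do not come from the study.

```python
import math

def log2_ratio(tumor_intensity, normal_intensity):
    """Log2 ratio of tumor over normal signal for a single probe."""
    return math.log2(tumor_intensity / normal_intensity)

def fold_change(log_ratio):
    """Convert a log2 ratio back to a linear fold change."""
    return 2.0 ** log_ratio

# Hypothetical background-corrected intensities: (tumor, normal).
probes = {
    "PROBE_UP": (8000.0, 800.0),     # 10-fold higher in tumor
    "PROBE_DOWN": (500.0, 2000.0),   # 4-fold lower in tumor
}

for name, (tumor, normal) in probes.items():
    lr = log2_ratio(tumor, normal)
    print(f"{name}: log2 ratio = {lr:.2f}, fold change = {fold_change(lr):.2f}")
```

A 10-fold upregulation, as reported for SPOCK1, corresponds to a log2 ratio of about 3.32 under this convention.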
Kim, Eunjung; Kim, Eun Jung; Seo, Seung-Won; Hur, Cheol-Goo; McGregor, Robin A; Choi, Myung-Sook
2014-01-01
Worldwide obesity and related comorbidities are increasing, but identifying new therapeutic targets remains a challenge. A plethora of microarray studies in diet-induced obesity models has provided large datasets of obesity associated genes. In this review, we describe an approach to examine the underlying molecular network regulating obesity, and we discuss interactions between obesity candidate genes. We conducted network analysis on functional protein-protein interactions associated with 25 obesity candidate genes identified in a literature-driven approach based on published microarray studies of diet-induced obesity. The obesity candidate genes were closely associated with lipid metabolism and inflammation. Peroxisome proliferator activated receptor gamma (Pparg) appeared to be a core obesity gene, and obesity candidate genes were highly interconnected, suggesting a coordinately regulated molecular network in adipose tissue. In conclusion, the current network analysis approach may help elucidate the underlying molecular network regulating obesity and identify anti-obesity targets for therapeutic intervention.
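One simple way to see why a gene such as Pparg can appear as a "core" node is to count how many interaction partners each gene has in the network. The following is a minimal sketch with a made-up edge list; the partner names are placeholders, and the review may have used a different centrality measure.

```python
from collections import Counter

def degree_by_gene(edges):
    """Count interaction partners per gene in an undirected edge list."""
    degree = Counter()
    for gene_a, gene_b in edges:
        degree[gene_a] += 1
        degree[gene_b] += 1
    return degree

# Hypothetical interactions; only the name Pparg is taken from the abstract.
edges = [
    ("Pparg", "GeneA"),
    ("Pparg", "GeneB"),
    ("Pparg", "GeneC"),
    ("GeneA", "GeneB"),
]

degree = degree_by_gene(edges)
hub, hub_degree = degree.most_common(1)[0]
print(hub, hub_degree)
```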
McArt, Darragh G.; Dunne, Philip D.; Blayney, Jaine K.; Salto-Tellez, Manuel; Van Schaeybroeck, Sandra; Hamilton, Peter W.; Zhang, Shu-Dong
2013-01-01
The advent of next generation sequencing (NGS) technologies has expanded the area of genomic research, offering high coverage and increased sensitivity over older microarray platforms. Although the current cost of next generation sequencing still exceeds that of microarray approaches, the rapid advances in NGS will likely make it the platform of choice for future research in differential gene expression. Connectivity mapping is a procedure for examining the connections among diseases, genes and drugs by differential gene expression. It was initially based on microarray technology, with which a large collection of compound-induced reference gene expression profiles has been accumulated. In this work, we aim to test the feasibility of incorporating NGS RNA-Seq data into the current connectivity mapping framework by utilizing the microarray-based reference profiles and constructing a differentially expressed gene signature from an NGS dataset. This would allow for the establishment of connections between the NGS gene signature and those microarray reference profiles, avoiding the cost of re-creating drug profiles with NGS technology. We examined the connectivity mapping approach on a publicly available NGS dataset with androgen stimulation of LNCaP cells in order to extract candidate compounds that could inhibit the proliferative phenotype of LNCaP cells and to elucidate their potential in a laboratory setting. In addition, we also analyzed an independent microarray dataset of similar experimental settings. We found a high level of concordance between the top compounds identified using the gene signatures from the two datasets. The nicotine derivative cotinine was returned as the top candidate among the overlapping compounds with potential to suppress this proliferative phenotype. Subsequent lab experiments validated this connectivity mapping hit, showing that cotinine inhibits cell proliferation in an androgen-dependent manner. Thus the results in this study suggest a promising prospect of integrating NGS data with connectivity mapping. PMID:23840550
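The idea of connecting a gene signature to a reference profile can be illustrated with a simplified signed-rank score. This is a sketch under stated assumptions, not necessarily the scoring used in the study: the reference profile is taken to be a mapping from gene to signed rank (sign gives direction, magnitude gives strength of change), the signature maps each gene to +1 (up) or -1 (down), and all gene names are placeholders.

```python
def connection_score(reference_ranks, signature):
    """Normalised score in [-1, 1] between a signature and a reference profile.

    reference_ranks: gene -> signed rank in the compound-induced profile.
    signature: gene -> +1 (upregulated) or -1 (downregulated).
    """
    raw = sum(reference_ranks.get(gene, 0) * sign
              for gene, sign in signature.items())
    # Largest score achievable by any signature of the same size.
    top = sorted((abs(r) for r in reference_ranks.values()), reverse=True)
    max_raw = sum(top[:len(signature)])
    return raw / max_raw if max_raw else 0.0

# Hypothetical reference profile and query signatures.
reference = {"G1": 4, "G2": -3, "G3": 2, "G4": -1}
same_direction = {"G1": 1, "G2": -1}
reversed_direction = {"G1": -1, "G2": 1}

print(connection_score(reference, same_direction))
print(connection_score(reference, reversed_direction))
```

Under this convention, a compound whose profile scores strongly negative against a disease or phenotype signature is a candidate for reversing that phenotype, which is the kind of hit the abstract describes for cotinine.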
Ron, Micha; Israeli, Galit; Seroussi, Eyal; Weller, Joel I; Gregg, Jeffrey P; Shani, Moshe; Medrano, Juan F
2007-01-01
Background Many studies have found segregating quantitative trait loci (QTL) for milk production traits in different dairy cattle populations. However, even for relatively large effects with a saturated marker map, the confidence interval for QTL location by linkage analysis spans tens of map units, or hundreds of genes. Combining mapping and arraying has been suggested as an approach to identify candidate genes. Thus, expression analysis in the mammary gland of genes positioned in the confidence interval of the QTL can bridge the gap between fine mapping and quantitative trait nucleotide (QTN) determination. Results We hybridized the Affymetrix microarray (MG-U74v2), containing 12,488 murine probes, with RNA derived from the mammary gland of virgin, pregnant, lactating and involuting C57BL/6J mice in a total of nine biological replicates. We combined these data with microarray data from two additional studies that used the same design in mice, with a total of 75 biological replicates. The same filtering and normalization were applied to each microarray data set using GeneSpring software. Analysis of variance identified 249 differentially expressed probe sets common to the three experiments along the four developmental stages of puberty, pregnancy, lactation and involution. 212 genes were assigned to their bovine map positions through comparative mapping, and thus form a list of candidate genes for previously identified QTLs for milk production traits. A total of 82 of the genes showed mammary gland-specific expression with at least 3-fold expression over the median representing all tissues tested in GeneAtlas. Conclusion This work presents a web tool for candidate genes for QTL (cgQTL) that allows navigation between the map of bovine milk production QTL, potential candidate genes and their level of expression in mammary gland arrays and in GeneAtlas. Three out of four confirmed genes that affect QTL in livestock (ABCG2, DGAT1, GDF8, IGF2) were overexpressed in the target organ. Thus, cgQTL can be used to determine priority of candidate genes for QTN analysis based on differential expression in the target organ. PMID:17584498
Novel Biomarker Candidates for Colorectal Cancer Metastasis: A Meta-analysis of In Vitro Studies
Long, Nguyen Phuoc; Lee, Wun Jun; Huy, Nguyen Truong; Lee, Seul Ji; Park, Jeong Hill; Kwon, Sung Won
2016-01-01
Colorectal cancer (CRC) is one of the most common and lethal cancers. Although numerous studies have evaluated potential biomarkers for early diagnosis, current biomarkers have failed to reach an acceptable level of accuracy for distant metastasis. In this paper, we performed a gene set meta-analysis of in vitro microarray studies and combined the results from this study with previously published proteomic data to validate and suggest prognostic candidates for CRC metastasis. The two included microarray data sets yielded 21 significant genes. Of these significant genes, ALDOA, IL8 (CXCL8), and PARP4 had strong potential as prognostic candidates. LAMB2, MCM7, CXCL23A, SERPINA3, ABCA3, ALDH3A2, and POLR2I also have potential. Other candidates were more controversial, possibly because of the biologic heterogeneity of tumor cells, which is a major obstacle to predicting metastasis. In conclusion, we demonstrated a meta-analysis approach and successfully suggested ten biomarker candidates for future investigation. PMID:27688707
Gene expression profiling in respond to TBT exposure in small abalone Haliotis diversicolor.
Jia, Xiwei; Zou, Zhihua; Wang, Guodong; Wang, Shuhong; Wang, Yilei; Zhang, Ziping
2011-10-01
In this study, we investigated gene expression profiles of the small abalone, Haliotis diversicolor, in response to tributyltin (TBT) exposure using a cDNA microarray containing 2473 unique transcripts. In total, 107 up-regulated genes and 41 down-regulated genes were found. For further investigation of candidate genes from microarray data and EST analysis, quantitative real-time PCR was performed at 6 h, 24 h, 48 h, 96 h and 192 h of TBT exposure. Twenty-six genes were found to be significantly differentially expressed over the time course; 3 of them were unknown. Some gene homologues, such as cellulose, endo-beta-1,4-glucanase, ferritin subunit 1 and thiolester containing protein II CG7052-PB, might be good biomarker candidates for TBT monitoring. The identification of stress response genes and their expression profiles will permit detailed investigation of the defense responses of small abalone genes. Published by Elsevier Ltd.
Identification of candidate genes in osteoporosis by integrated microarray analysis.
Li, J J; Wang, B Q; Fei, Q; Yang, Y; Li, D
2016-12-01
In order to screen the altered gene expression profile in peripheral blood mononuclear cells of patients with osteoporosis, we performed an integrated analysis of the online microarray studies of osteoporosis. We searched the Gene Expression Omnibus (GEO) database for microarray studies of peripheral blood mononuclear cells in patients with osteoporosis. Subsequently, we integrated gene expression data sets from multiple microarray studies to obtain differentially expressed genes (DEGs) between patients with osteoporosis and normal controls. Gene function analysis was performed to uncover the functions of identified DEGs. A total of three microarray studies were selected for integrated analysis. In all, 1125 genes were found to be significantly differentially expressed between osteoporosis patients and normal controls, with 373 upregulated and 752 downregulated genes. Positive regulation of the cellular amino metabolic process (gene ontology (GO): 0033240, false discovery rate (FDR) = 1.00E+00) was significantly enriched under the GO category for biological processes, while for molecular functions, flavin adenine dinucleotide binding (GO: 0050660, FDR = 3.66E-01) and androgen receptor binding (GO: 0050681, FDR = 6.35E-01) were significantly enriched. DEGs were enriched in many osteoporosis-related signalling pathways, including those of mitogen-activated protein kinase (MAPK) and calcium. Protein-protein interaction (PPI) network analysis showed that the significant hub proteins included ubiquitin specific peptidase 9, X-linked (Degree = 99), ubiquitin specific peptidase 19 (Degree = 57) and ubiquitin conjugating enzyme E2 B (Degree = 57). Analysis of the function of identified differentially expressed genes may expand our understanding of fundamental mechanisms leading to osteoporosis. Moreover, significantly enriched pathways, such as MAPK and calcium, may be involved in osteoporosis through osteoblastic differentiation and bone formation.
Cite this article: J. J. Li, B. Q. Wang, Q. Fei, Y. Yang, D. Li. Identification of candidate genes in osteoporosis by integrated microarray analysis. Bone Joint Res 2016;5:594-601. DOI: 10.1302/2046-3758.512.BJR-2016-0073.R1. © 2016 Fei et al.
2014-01-01
Background Kidney stone disease (KSD) is a complex disorder with unknown etiology in the majority of patients. Genetic and environmental factors may cause the disease. In the present study, we used DNA microarray to genotype single nucleotide polymorphisms (SNPs) and performed candidate gene association analysis to determine genetic variations associated with the disease. Methods Whole-genome SNP genotyping by DNA microarray was initially conducted in 101 patients and 105 control subjects. A set of 104 candidate genes reported to be involved in KSD, gathered from public databases and candidate gene association study databases, was evaluated for variations associated with KSD. Results Altogether, 82 SNPs distributed within 22 candidate gene regions showed significant differences in SNP allele frequencies between the patient and control groups (P < 0.05). Of these, 4 genes, BGLAP, AHSG, CD44, and HAO1, encoding osteocalcin, fetuin-A, CD44-molecule and glycolate oxidase 1, respectively, were further assessed for their associations with the disease because they carried a high proportion of SNPs with statistically different allele frequencies between the patient and control groups within the gene. A total of 26 SNPs showed significant differences in allele frequencies between the patient and control groups, and haplotypes associated with disease risk were identified. The SNP rs759330, located 144 bp downstream of BGLAP at a predicted microRNA binding site in the 3′UTR of PAQR6 (a gene encoding progestin and adipoQ receptor family member VI), was genotyped in 216 patients and 216 control subjects and found to have significant differences in its genotype and allele frequencies (P = 0.0007, OR 2.02 and P = 0.0001, OR 2.02, respectively). Conclusions Our results suggest that these candidate genes are associated with KSD, and PAQR6 emerges as the strongest candidate since the associated SNP rs759330 is located in the miRNA binding site and may affect mRNA expression level. PMID:24886237
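The odds ratios quoted in this abstract compare how often an allele occurs in patients versus controls. As a minimal sketch of that calculation, the code below computes an allelic odds ratio with a Woolf-type 95% confidence interval from a 2x2 table. The counts are invented for illustration and are not the study's data; the function name is likewise hypothetical.

```python
import math

def allelic_odds_ratio(case_risk, case_other, ctrl_risk, ctrl_other, z=1.96):
    """Odds ratio and approximate 95% CI from allele counts in a 2x2 table."""
    odds_ratio = (case_risk * ctrl_other) / (case_other * ctrl_risk)
    # Standard error of ln(OR) by the Woolf (logit) method.
    se = math.sqrt(1 / case_risk + 1 / case_other + 1 / ctrl_risk + 1 / ctrl_other)
    lower = math.exp(math.log(odds_ratio) - z * se)
    upper = math.exp(math.log(odds_ratio) + z * se)
    return odds_ratio, lower, upper

# Hypothetical allele counts (not taken from the study).
or_value, ci_low, ci_high = allelic_odds_ratio(
    case_risk=40, case_other=60, ctrl_risk=25, ctrl_other=75
)
print(f"OR = {or_value:.2f}, 95% CI = ({ci_low:.2f}, {ci_high:.2f})")
```

An interval that excludes 1 indicates that the difference in allele frequency is unlikely to be due to chance alone at roughly the 5% level.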
González-Plaza, Juan J.; Ortiz-Martín, Inmaculada; Muñoz-Mérida, Antonio; García-López, Carmen; Sánchez-Sevilla, José F.; Luque, Francisco; Trelles, Oswaldo; Bejarano, Eduardo R.; De La Rosa, Raúl; Valpuesta, Victoriano; Beuzón, Carmen R.
2016-01-01
Plant architecture is a critical trait in fruit crops that can significantly influence yield, pruning, planting density and harvesting. Little is known about how plant architecture is genetically determined in olive, where most of the existing varieties are traditional, with an architecture poorly suited for modern growing and harvesting systems. In the present study, we have carried out microarray analysis of meristematic tissue to compare expression profiles of olive varieties displaying differences in architecture, as well as seedlings from their cross, pooled on the basis of shared architecture-related phenotypes. The microarray used, previously developed by our group, has already been applied to identify candidate genes involved in regulating the juvenile-to-adult transition in the shoot apex of seedlings. Varieties with distinct architecture phenotypes and individuals from segregating progenies displaying opposite architecture features were used to link phenotype to expression. Here, we identify 2252 differentially expressed genes (DEGs) associated with differences in plant architecture. Microarray results were validated by quantitative RT-PCR carried out on genes with functional annotation likely related to plant architecture. Twelve of these genes were further analyzed in individual seedlings of the corresponding pool. We also examined Arabidopsis mutants in putative orthologs of these targeted candidate genes, finding altered architecture for most of them. This supports functional conservation between species and the potential biological relevance of the candidate genes identified. This study is the first to identify genes associated with plant architecture in olive, and the results obtained could be of great help in future programs aimed at selecting phenotypes adapted to modern cultivation practices in this species. PMID:26973682
Jouffe, Vincent; Rowe, Suzanne; Liaubet, Laurence; Buitenhuis, Bart; Hornshøj, Henrik; SanCristobal, Magali; Mormède, Pierre; de Koning, D J
2009-07-16
Microarray studies can supplement QTL studies by suggesting potential candidate genes in the QTL regions, which by themselves are too large to provide a limited selection of candidate genes. Here we provide a case study where we explore ways to integrate QTL data and microarray data for the pig, which has only a partial genome sequence. We outline various procedures to localize differentially expressed genes on the pig genome and link this with information on published QTL. The starting point is a set of 237 differentially expressed cDNA clones in adrenal tissue from two pig breeds, before and after treatment with adrenocorticotropic hormone (ACTH). Different approaches to localize the differentially expressed (DE) genes to the pig genome showed different levels of success and a clear lack of concordance for some genes between the various approaches. For a focused analysis on 12 genes, overlapping QTL from the public domain were presented. Also, differentially expressed genes underlying QTL for ACTH response were described. Using the latest version of the draft sequence, the differentially expressed genes were mapped to the pig genome. This enabled co-location of DE genes and previously studied QTL regions, but the draft genome sequence is still incomplete and will contain many errors. A further step to explore links between DE genes and QTL at the pathway level was largely unsuccessful due to the lack of annotation of the pig genome. This could be improved by further comparative mapping analyses but this would be time consuming. This paper provides a case study for the integration of QTL data and microarray data for a species with limited genome sequence information and annotation. The results illustrate the challenges that must be addressed but also provide a roadmap for future work that is applicable to other non-model species.
Detecting novel genes with sparse arrays
Haiminen, Niina; Smit, Bart; Rautio, Jari; Vitikainen, Marika; Wiebe, Marilyn; Martinez, Diego; Chee, Christine; Kunkel, Joe; Sanchez, Charles; Nelson, Mary Anne; Pakula, Tiina; Saloheimo, Markku; Penttilä, Merja; Kivioja, Teemu
2014-01-01
Species-specific genes play an important role in defining the phenotype of an organism. However, current gene prediction methods can only efficiently find genes that share features such as sequence similarity or general sequence characteristics with previously known genes. Novel sequencing methods and tiling arrays can be used to find genes without prior information and they have demonstrated that novel genes can still be found from extensively studied model organisms. Unfortunately, these methods are expensive and thus are not easily applicable, e.g., to finding genes that are expressed only in very specific conditions. We demonstrate a method for finding novel genes with sparse arrays, applying it on the 33.9 Mb genome of the filamentous fungus Trichoderma reesei. Our computational method does not require normalisations between arrays and it takes into account the multiple-testing problem typical for analysis of microarray data. In contrast to tiling arrays, that use overlapping probes, only one 25mer microarray oligonucleotide probe was used for every 100 b. Thus, only relatively little space on a microarray slide was required to cover the intergenic regions of a genome. The analysis was done as a by-product of a conventional microarray experiment with no additional costs. We found at least 23 good candidates for novel transcripts that could code for proteins and all of which were expressed at high levels. Candidate genes were found to neighbour ire1 and cre1 and many other regulatory genes. Our simple, low-cost method can easily be applied to finding novel species-specific genes without prior knowledge of their sequence properties. PMID:20691772
Tiwari, Jagesh Kumar; Devi, Sapna; Sundaresha, S; Chandel, Poonam; Ali, Nilofer; Singh, Brajesh; Bhardwaj, Vinay; Singh, Bir Pal
2015-06-01
Genes involved in photoassimilate partitioning and changes in hormonal balance are important for potato tuberization. In the present study, we investigated gene expression patterns in the tuber-bearing potato somatic hybrid (E1-3) and control non-tuberous wild species Solanum etuberosum (Etb) by microarray. Plants were grown under controlled conditions and leaves were collected at eight tuber developmental stages for microarray analysis. A t-test analysis identified a total of 468 genes (94 up-regulated and 374 down-regulated) that were statistically significant (p ≤ 0.05) and differentially expressed in E1-3 and Etb. Gene Ontology (GO) characterization of the 468 genes revealed that 145 were annotated and 323 were of unknown function. Further, these 145 genes were grouped based on GO biological processes followed by molecular function and (or) PGSC description into 15 gene sets, namely (1) transport, (2) metabolic process, (3) biological process, (4) photosynthesis, (5) oxidation-reduction, (6) transcription, (7) translation, (8) binding, (9) protein phosphorylation, (10) protein folding, (11) ubiquitin-dependent protein catabolic process, (12) RNA processing, (13) negative regulation of protein, (14) methylation, and (15) mitosis. RT-PCR analysis of 10 selected highly significant genes (p ≤ 0.01) confirmed the microarray results. Overall, we show that candidate genes induced in leaves of E1-3 were implicated in tuberization processes such as transport, carbohydrate metabolism, phytohormones, and transcription/translation/binding functions. Hence, our results provide an insight into the candidate genes induced in leaf tissues during tuberization in E1-3.
Constitutional downregulation of SEMA5A expression in autism.
Melin, M; Carlsson, B; Anckarsater, H; Rastam, M; Betancur, C; Isaksson, A; Gillberg, C; Dahl, N
2006-01-01
There is strong evidence for the importance of genetic factors in idiopathic autism. The results from independent twin and family studies suggest that the disorder is caused by the action of several genes, possibly acting epistatically. We have used cDNA microarray technology for the identification of constitutional changes in the gene expression profile associated with idiopathic autism. Samples were obtained and analyzed from 6 affected subjects belonging to multiplex autism families and from 6 healthy controls. We assessed the expression levels for approximately 7,700 genes by cDNA microarrays using mRNA derived from Epstein-Barr virus-transformed B lymphocytes. The microarray data were analyzed in order to identify up- or downregulation of specific genes. A common pattern with nine downregulated genes was identified among samples derived from individuals with autism when compared to controls. Four of these nine genes encode proteins involved in biological processes associated with brain function or the immune system, and are consequently considered as candidates for genes associated with autism. Quantitative real-time PCR confirms the downregulation of the gene encoding SEMA5A, a protein involved in axonal guidance. Epstein-Barr virus should be considered as a possible source for altered expression, but our consistent results make us suggest SEMA5A as a candidate gene in the etiology of idiopathic autism.
Constitutional downregulation of SEMA5A expression in autism
Melin, Malin; Carlsson, Birgit; Anckarsäter, Henrik; Rastam, Maria; Betancur, Catalina; Isaksson, Anders; Gillberg, Christopher; Dahl, Niklas
2006-01-01
There is strong evidence for the importance of genetic factors in idiopathic autism. The results from independent twin and family studies suggest that the disorder is caused by the action of several genes, possibly acting epistatically. We have used cDNA microarray technology for the identification of constitutional changes in the gene expression profile associated with idiopathic autism. Samples were obtained and analyzed from six affected subjects belonging to multiplex autism families and from six healthy controls. We assessed the expression levels for approximately 7,700 genes by cDNA microarrays using mRNA derived from Epstein-Barr virus (EBV)-transformed B-lymphocytes. The microarray data were analyzed in order to identify up- or down-regulation of specific genes. A common pattern with nine down-regulated genes was identified among samples derived from individuals with autism when compared to controls. Four of these nine genes encode proteins involved in biological processes associated with brain function or the immune system, and are consequently considered as candidates for genes associated with autism. Quantitative real-time PCR confirms the down-regulation of the gene encoding SEMA5A, a protein involved in axonal guidance. EBV should be considered as a possible source for altered expression, but our consistent results make us suggest SEMA5A as a candidate gene in the etiology of idiopathic autism. PMID:17028446
Seliger, Barbara; Dressler, Sven P.; Wang, Ena; Kellner, Roland; Recktenwald, Christian V.; Lottspeich, Friedrich; Marincola, Francesco M.; Baumgärtner, Maja; Atkins, Derek; Lichtenfels, Rudolf
2012-01-01
Results obtained from expression profilings of renal cell carcinoma using different “ome”-based approaches and comprehensive data analysis demonstrated that proteome-based technologies and cDNA microarray analyses complement each other during the discovery phase for disease-related candidate biomarkers. The integration of the respective data revealed the uniqueness and complementarities of the different technologies. While comparative cDNA microarray analyses though restricted to upregulated targets largely revealed genes involved in controlling gene/protein expression (19%) and signal transduction processes (13%), proteomics/PROTEOMEX-defined candidate biomarkers include enzymes of the cellular metabolism (36%), transport proteins (12%) and cell motility/structural molecules (10%). Candidate biomarkers defined by proteomics and PROTEOMEX are frequently shared, whereas the sharing rate between cDNA microarray and proteome-based profilings is limited. Putative candidate biomarkers provide insights into their cellular (dys)function and their diagnostic/prognostic value but still warrant further validation in larger patient numbers. Based on the fact that merely 3 candidate biomarkers were shared by all applied technologies, namely annexin A4, tubulin alpha-1A chain and ubiquitin carboxyl-terminal hydrolase L1 the analysis at a single hierarchical level of biological regulation seems to provide only limited results thus emphasizing the importance and benefit of performing rather combinatorial screenings which can complement the standard clinical predictors. PMID:19235166
Cross-platform method for identifying candidate network biomarkers for prostate cancer.
Jin, G; Zhou, X; Cui, K; Zhang, X-S; Chen, L; Wong, S T C
2009-11-01
Discovering biomarkers using mass spectrometry (MS) and microarray expression profiles is a promising strategy in molecular diagnosis. Here, the authors proposed a new pipeline for biomarker discovery that integrates disease information for proteins and genes, expression profiles at both the genomic and proteomic levels, and protein-protein interactions (PPIs) to discover high-confidence network biomarkers. Using this pipeline, a total of 474 molecules (genes and proteins) related to prostate cancer were identified and a prostate-cancer-related network (PCRN) was derived from the integrative information. Thus, a set of candidate network biomarkers was identified from multiple expression profiles comprising eight microarray datasets and one proteomics dataset. The network biomarkers with PPIs can accurately distinguish prostate cancer patients from normal subjects, potentially providing more reliable biomarker candidates than conventional biomarker discovery methods.
Kudo, Toru; Sasaki, Yohei; Terashima, Shin; Matsuda-Imai, Noriko; Takano, Tomoyuki; Saito, Misa; Kanno, Maasa; Ozaki, Soichi; Suwabe, Keita; Suzuki, Go; Watanabe, Masao; Matsuoka, Makoto; Takayama, Seiji; Yano, Kentaro
2016-10-13
In quantitative gene expression analysis, normalization using a reference gene as an internal control is frequently performed for appropriate interpretation of the results. Efforts have been devoted to exploring superior novel reference genes using microarray transcriptomic data and to evaluating commonly used reference genes by targeting analysis. However, because the number of specifically detectable genes is totally dependent on probe design in the microarray analysis, exploration using microarray data may miss some of the best choices for the reference genes. Recently emerging RNA sequencing (RNA-seq) provides an ideal resource for comprehensive exploration of reference genes since this method is capable of detecting all expressed genes, in principle including even unknown genes. We report the results of a comprehensive exploration of reference genes using public RNA-seq data from plants such as Arabidopsis thaliana (Arabidopsis), Glycine max (soybean), Solanum lycopersicum (tomato) and Oryza sativa (rice). To select reference genes suitable for the broadest experimental conditions possible, candidates were surveyed by the following four steps: (1) evaluation of the basal expression level of each gene in each experiment; (2) evaluation of the expression stability of each gene in each experiment; (3) evaluation of the expression stability of each gene across the experiments; and (4) selection of top-ranked genes, after ranking according to the number of experiments in which the gene was expressed stably. Employing this procedure, 13, 10, 12 and 21 top candidates for reference genes were proposed in Arabidopsis, soybean, tomato and rice, respectively. Microarray expression data confirmed that the expression of the proposed reference genes under broad experimental conditions was more stable than that of commonly used reference genes. 
These novel reference genes will be useful for analyzing gene expression profiles across experiments carried out under various experimental conditions.
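The four-step survey described in this abstract can be sketched roughly as follows. This is an illustrative sketch only: the abstract does not state the stability measure or any cut-offs, so the coefficient of variation, the thresholds `min_mean`, `max_cv` and `max_cv_across`, the function names and the toy data are all assumptions.

```python
from statistics import mean, pstdev


def cv(values):
    """Coefficient of variation (population SD divided by the mean)."""
    m = mean(values)
    return pstdev(values) / m if m > 0 else float("inf")


def rank_reference_genes(expr, min_mean=10.0, max_cv=0.2, max_cv_across=0.3):
    """Rank candidate reference genes.

    expr maps gene -> {experiment -> list of expression values}.
    All thresholds are hypothetical, chosen only for illustration.
    """
    ranked = []
    for gene, experiments in expr.items():
        # Steps 1-2: within each experiment, require an adequate basal
        # expression level and low variability.
        stable = [
            name
            for name, values in experiments.items()
            if mean(values) >= min_mean and cv(values) <= max_cv
        ]
        # Step 3: require the per-experiment means to agree across experiments.
        across = cv([mean(values) for values in experiments.values()])
        if stable and across <= max_cv_across:
            ranked.append((gene, len(stable)))
    # Step 4: rank by the number of experiments with stable expression.
    ranked.sort(key=lambda item: (-item[1], item[0]))
    return ranked


# Toy data with made-up gene names and values.
toy = {
    "geneA": {"exp1": [100, 105, 98], "exp2": [102, 99, 101]},
    "geneB": {"exp1": [100, 300, 20], "exp2": [110, 100, 105]},
    "geneC": {"exp1": [2, 2, 3], "exp2": [2, 3, 2]},
}
print(rank_reference_genes(toy))
```

In this toy input, geneC is dropped because its basal expression never reaches the assumed minimum, and geneB ranks below geneA because it is stable in only one of the two experiments.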
A microarray analysis of potential genes underlying the neurosensitivity of mice to propofol.
Lowes, Damon A; Galley, Helen F; Lowe, Peter R; Rikke, Brad A; Johnson, Thomas E; Webster, Nigel R
2005-09-01
Establishing the mechanism of action of general anesthetics at the molecular level is difficult because of the multiple targets with which these drugs are associated. Inbred short sleep (ISS) and long sleep (ILS) mice are differentially sensitive in response to ethanol and other sedative hypnotics and contain a single quantitative trait locus (Lorp1) that accounts for the genetic variance of loss-of-righting reflex in response to propofol (LORP). In this study, we used high-density oligonucleotide microarrays to identify global gene expression and candidate genes differentially expressed within the Lorp1 region that may give insight into the molecular mechanism underlying LORP. Microarray analysis was performed using Affymetrix MG-U74Av2 Genechips and a selection of differentially expressed genes was confirmed by semiquantitative reverse transcription-polymerase chain reaction. Global expression in the brains of ILS and ISS mice revealed 3423 genes that were significantly expressed, of which 139 (4%) were differentially expressed. Analysis of genes located within the Lorp1 region showed that 26 genes were significantly expressed and that just 2 genes (7%) were differentially expressed. These genes encoded for the proteins AWP1 (associated with protein kinase 1) and "BTB (POZ) domain containing 1," whose functions are largely uncharacterized. Genes differentially expressed outside Lorp1 included seven genes with previously characterized neuronal functions and thus stand out as additional candidate genes that may be involved in mediating the neurosensitivity differences between ISS and ILS.
Klangnurak, Wanlada; Fukuyo, Taketo; Rezanujjaman, M D; Seki, Masahide; Sugano, Sumio; Suzuki, Yutaka; Tokumoto, Toshinobu
2018-01-01
We previously reported the microarray-based selection of three ovulation-related genes in zebrafish. We used a different selection method in this study, RNA sequencing analysis. An additional eight up-regulated candidates were found as specifically up-regulated genes in ovulation-induced samples. Changes in gene expression were confirmed by qPCR analysis. Furthermore, up-regulation prior to ovulation during natural spawning was verified in samples from natural pairing. Gene knock-out zebrafish strains of one of the candidates, the starmaker gene (stm), were established by CRISPR genome editing techniques. Unexpectedly, homozygous mutants were fertile and could spawn eggs. However, a high percentage of unfertilized eggs and abnormal embryos were produced from these homozygous females. The results suggest that the stm gene is necessary for fertilization. In this study, we selected additional ovulation-inducing candidate genes, and a novel function of the stm gene was investigated.
Łastowska, M; Viprey, V; Santibanez-Koref, M; Wappler, I; Peters, H; Cullinane, C; Roberts, P; Hall, A G; Tweddle, D A; Pearson, A D J; Lewis, I; Burchill, S A; Jackson, M S
2007-11-22
Identifying genes, whose expression is consistently altered by chromosomal gains or losses, is an important step in defining genes of biological relevance in a wide variety of tumour types. However, additional criteria are needed to discriminate further among the large number of candidate genes identified. This is particularly true for neuroblastoma, where multiple genomic copy number changes of proven prognostic value exist. We have used Affymetrix microarrays and a combination of fluorescent in situ hybridization and single nucleotide polymorphism (SNP) microarrays to establish expression profiles and delineate copy number alterations in 30 primary neuroblastomas. Correlation of microarray data with patient survival and analysis of expression within rodent neuroblastoma cell lines were then used to define further genes likely to be involved in the disease process. Using this approach, we identify >1000 genes within eight recurrent genomic alterations (loss of 1p, 3p, 4p, 10q and 11q, 2p gain, 17q gain, and the MYCN amplicon) whose expression is consistently altered by copy number change. Of these, 84 correlate with patient survival, with the minimal regions of 17q gain and 4p loss being enriched significantly for such genes. These include genes involved in RNA and DNA metabolism, and apoptosis. Orthologues of all but one of these genes on 17q are overexpressed in rodent neuroblastoma cell lines. A significant excess of SNPs whose copy number correlates with survival is also observed on proximal 4p in stage 4 tumours, and we find that deletion of 4p is associated with improved outcome in an extended cohort of tumours. These results define the major impact of genomic copy number alterations upon transcription within neuroblastoma, and highlight genes on distal 17q and proximal 4p for downstream analyses. 
They also suggest that integration of discriminators, such as survival and comparative gene expression, with microarray data may be useful in the identification of critical genes within regions of loss or gain in many human cancers.
Microarray-assisted fine-mapping of quantitative trait loci for cold tolerance in rice.
Liu, Fengxia; Xu, Wenying; Song, Qian; Tan, Lubin; Liu, Jiayong; Zhu, Zuofeng; Fu, Yongcai; Su, Zhen; Sun, Chuanqing
2013-05-01
Many important agronomic traits, including cold stress resistance, are complex and controlled by quantitative trait loci (QTLs). Isolation of these QTLs will greatly benefit the agricultural industry but it is a challenging task. This study explored an integrated strategy by combining microarray with QTL-mapping in order to identify cold-tolerant QTLs from a cold-tolerant variety IL112 at early-seedling stage. All the early seedlings of IL112 survived normally for 9 d at 4-5°C, while Guichao2 (GC2), an indica cultivar, died after 4 d under the same conditions. Using the F2:3 population derived from the progeny of GC2 and IL112, we identified seven QTLs for cold tolerance. Furthermore, we performed Affymetrix rice whole-genome array hybridization and obtained the expression profiles of IL112 and GC2 under both low-temperature and normal conditions. Four genes were selected as cold QTL-related candidates, based on microarray data mining and QTL-mapping. One candidate gene, LOC_Os07g22494, was shown to be highly associated with cold tolerance in a number of rice varieties and in the F2:3 population, and its overexpression transgenic rice plants displayed strong tolerance to low temperature at early-seedling stage. The results indicated that overexpression of this gene (LOC_Os07g22494) could increase cold tolerance in rice seedlings. Therefore, this study provides a promising strategy for identifying candidate genes in defined QTL regions.
Identification and characterization of nuclear genes involved in photosynthesis in Populus
2014-01-01
Background The gap between the real and potential photosynthetic rate under field conditions suggests that photosynthesis could potentially be improved. Nuclear genes provide possible targets for improving photosynthetic efficiency. Hence, genome-wide identification and characterization of the nuclear genes affecting photosynthetic traits in woody plants would provide key insights on genetic regulation of photosynthesis and identify candidate processes for improvement of photosynthesis. Results Using microarray and bulked segregant analysis strategies, we identified differentially expressed nuclear genes for photosynthesis traits in a segregating population of poplar. We identified 515 differentially expressed genes in this population (FC ≥ 2 or FC ≤ 0.5, P < 0.05), 163 up-regulated and 352 down-regulated. Real-time PCR expression analysis confirmed the microarray data. Singular Enrichment Analysis identified 48 significantly enriched GO terms for molecular functions (28), biological processes (18) and cell components (2). Furthermore, we selected six candidate genes for functional examination by a single-marker association approach, which demonstrated that 20 SNPs in five candidate genes significantly associated with photosynthetic traits, and the phenotypic variance explained by each SNP ranged from 2.3% to 12.6%. This revealed that regulation of photosynthesis by the nuclear genome mainly involves transport, metabolism and response to stimulus functions. Conclusions This study provides new genome-scale strategies for the discovery of potential candidate genes affecting photosynthesis in Populus, and for identification of the functions of genes involved in regulation of photosynthesis. This work also suggests that improving photosynthetic efficiency under field conditions will require the consideration of multiple factors, such as stress responses. PMID:24673936
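The selection rule quoted in this abstract (FC ≥ 2 or FC ≤ 0.5, P < 0.05) can be written as a small filter. The sketch below assumes that fold change is expressed as a ratio rather than on a log scale; the function name, gene identifiers and values are hypothetical.

```python
def classify_gene(fold_change, p_value, fc_cutoff=2.0, alpha=0.05):
    """Label a gene as 'up', 'down' or 'unchanged'.

    fold_change is assumed to be a ratio (not log2-transformed).
    """
    if p_value >= alpha:
        return "unchanged"
    if fold_change >= fc_cutoff:
        return "up"
    if fold_change <= 1.0 / fc_cutoff:
        return "down"
    return "unchanged"


# Hypothetical toy records: (gene id, fold change, P value).
records = [
    ("gene1", 3.1, 0.01),   # passes both criteria, up-regulated
    ("gene2", 0.4, 0.03),   # passes both criteria, down-regulated
    ("gene3", 2.5, 0.20),   # large change but not significant
    ("gene4", 1.2, 0.001),  # significant but change too small
]
calls = {gene: classify_gene(fc, p) for gene, fc, p in records}
print(calls)

# The reported counts are internally consistent: 163 up + 352 down = 515.
print(163 + 352)
```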
Microarray characterization of gene expression changes in blood during acute ethanol exposure
2013-01-01
Background As part of the civil aviation safety program to define the adverse effects of ethanol on flying performance, we performed a DNA microarray analysis of human whole blood samples from a five-time-point study of subjects administered ethanol orally, with breathalyzer analysis used to monitor blood alcohol concentration (BAC), to discover significant gene expression changes in response to the ethanol exposure. Methods Subjects were administered either orange juice or orange juice with ethanol. Blood samples were taken based on BAC and total RNA was isolated from PaxGene™ blood tubes. The amplified cDNA was used in microarray and quantitative real-time polymerase chain reaction (RT-qPCR) analyses to evaluate differential gene expression. Microarray data were summarized and normalized in a pipeline fashion, and the results were evaluated for relative expression across time points with multiple methods. Candidate genes showing distinctive expression patterns in response to ethanol were clustered by pattern and further analyzed for related function, pathway membership and common transcription factor binding within and across clusters. RT-qPCR was used with representative genes to confirm that relative transcript levels across time matched those detected in microarrays. Results Microarray analysis of samples representing 0%, 0.04%, 0.08%, return to 0.04%, and 0.02% wt/vol BAC showed that changes in gene expression could be detected across the time course. The expression changes were verified by RT-qPCR. The candidate genes of interest (GOI) identified from the microarray analysis and clustered by expression pattern across the five BAC points showed seven coordinately expressed groups. Analysis showed function-based networks, shared transcription factor binding sites and signaling pathways for members of the clusters. 
These include hematological functions, innate immunity and inflammation functions, metabolic functions expected of ethanol metabolism, and pancreatic and hepatic function. Five of the seven clusters showed links to the p38 MAPK pathway. Conclusions The results of this study provide a first look at changing gene expression patterns in human blood during an acute rise in blood ethanol concentration and its depletion because of metabolism and excretion, and demonstrate that it is possible to detect changes in gene expression using total RNA isolated from whole blood. The analysis approach for this study serves as a workflow to investigate the biology linked to expression changes across a time course and from these changes, to identify target genes that could serve as biomarkers linked to pilot performance. PMID:23883607
Seok, Junhee; Kaushal, Amit; Davis, Ronald W; Xiao, Wenzhong
2010-01-18
Background The large amount of high-throughput genomic data has facilitated the discovery of the regulatory relationships between transcription factors and their target genes. While early methods for discovery of transcriptional regulation relationships from microarray data often focused on the high-throughput experimental data alone, more recent approaches have explored the integration of external knowledge bases of gene interactions. Results In this work, we develop an algorithm that provides improved performance in the prediction of transcriptional regulatory relationships by supplementing the analysis of microarray data with a new method of integrating information from an existing knowledge base. Using a well-known dataset of yeast microarrays and the Yeast Proteome Database, a comprehensive collection of known information of yeast genes, we show that knowledge-based predictions demonstrate better sensitivity and specificity in inferring new transcriptional interactions than predictions from microarray data alone. We also show that comprehensive, direct and high-quality knowledge bases provide better prediction performance. Comparison of our results with ChIP-chip data and growth fitness data suggests that our predicted genome-wide regulatory pairs in yeast are reasonable candidates for follow-up biological verification. Conclusion High quality, comprehensive, and direct knowledge bases, when combined with appropriate bioinformatic algorithms, can significantly improve the discovery of gene regulatory relationships from high throughput gene expression data. PMID:20122245
Costa, Fabrizio; Alba, Rob; Schouten, Henk; Soglio, Valeria; Gianfranceschi, Luca; Serra, Sara; Musacchi, Stefano; Sansavini, Silviero; Costa, Guglielmo; Fei, Zhangjun; Giovannoni, James
2010-10-25
Fruit development, maturation and ripening consist of a complex series of biochemical and physiological changes that in climacteric fruits, including apple and tomato, are coordinated by the gaseous hormone ethylene. These changes lead to final fruit quality, and understanding of the functional machinery underlying these processes is of both biological and practical importance. To date many reports have been made on the analysis of gene expression in apple. In this study we focused our investigation on the role of ethylene during apple maturation, specifically comparing transcriptomics of normal ripening with changes resulting from application of the hormone receptor competitor 1-methylcyclopropene. To gain insight into the molecular process regulating ripening in apple, and to compare to tomato (model species for ripening studies), we utilized both homologous and heterologous (tomato) microarrays to profile transcriptome dynamics of genes involved in fruit development and ripening, emphasizing those which are ethylene regulated. The use of both types of microarrays facilitated transcriptome comparison between apple and tomato (for the latter using data previously published and available at TED, the Tomato Expression Database) and highlighted genes conserved during ripening of both species, which in turn represent a foundation for further comparative genomic studies. The cross-species analysis had the secondary aim of examining the efficiency of heterologous (specifically tomato) microarray hybridization for candidate gene identification as related to the ripening process. The resulting transcriptomics data revealed coordinated gene expression during fruit ripening of a subset of ripening-related and ethylene responsive genes, further facilitating the analysis of ethylene response during fruit maturation and ripening. 
Our combined strategy based on microarray hybridization enabled transcriptome characterization during normal climacteric apple ripening, as well as definition of ethylene-dependent transcriptome changes. Comparison with tomato fruit maturation and ethylene responsive transcriptome activity facilitated identification of putative conserved orthologous ripening-related genes, which serve as an initial set of candidates for assessing conservation of gene activity across genomes of fruit bearing plant species.
Feng, Yinling; Wang, Xuefeng
2017-03-01
To investigate commonly disturbed genes and pathways in various brain regions of patients with Parkinson's disease (PD), microarray datasets from previous studies were collected and systematically analyzed. Different normalization methods were applied to microarray datasets from different platforms. A strategy combining gene co‑expression networks and clinical information was adopted, using weighted gene co‑expression network analysis (WGCNA) to screen for commonly disturbed genes in different brain regions of patients with PD. Functional enrichment analysis of commonly disturbed genes was performed using the Database for Annotation, Visualization, and Integrated Discovery (DAVID). Co‑pathway relationships were identified with Pearson's correlation coefficient tests and a hypergeometric distribution‑based test. Genes common to both pathways of a pair were selected and regarded as risk genes. A total of 17 microarray datasets from 7 platforms were retained for further analysis. Five gene co‑expression modules were identified, containing 9,745, 736, 233, 101 and 93 genes, respectively. One module was significantly correlated with PD samples and thus the 736 genes it contained were considered to be candidate PD‑associated genes. Functional enrichment analysis demonstrated that these genes were implicated in oxidative phosphorylation and PD. A total of 44 pathway pairs and 52 risk genes were revealed, and a risk gene pathway relationship network was constructed. Eight modules were identified and were revealed to be associated with PD, cancers and metabolism. A number of disturbed pathways and risk genes were unveiled in PD, and these findings may help advance understanding of PD pathogenesis.
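A hypergeometric distribution-based test of the kind mentioned in this abstract can be sketched as an upper-tail probability. The abstract does not say exactly what was tested, so the framing below (whether two pathways share more genes than expected by chance) is an assumption, as are the function name and the example numbers.

```python
from math import comb


def hypergeom_sf(k, N, K, n):
    """Return P(X >= k) for X ~ Hypergeometric(N, K, n).

    N: size of the gene universe
    K: number of genes in pathway A
    n: number of genes in pathway B
    k: number of genes observed in both pathways (k >= 0)
    """
    upper = min(K, n)
    total = comb(N, n)
    # comb() returns 0 for impossible configurations, so no extra bounds
    # check is needed on the lower limit of the sum.
    tail = sum(comb(K, i) * comb(N - K, n - i) for i in range(k, upper + 1))
    return tail / total


# Hypothetical example: 20,000 genes in the universe, pathways of
# 100 and 80 genes sharing 10 genes.
p_value = hypergeom_sf(10, 20000, 100, 80)
print(p_value)
```

A small P value here would indicate that the overlap between the two pathways is larger than expected for two random gene sets of those sizes.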
Microarray analysis identifies candidate genes for key roles in coral development
Grasso, Lauretta C; Maindonald, John; Rudd, Stephen; Hayward, David C; Saint, Robert; Miller, David J; Ball, Eldon E
2008-01-01
Background Anthozoan cnidarians are amongst the simplest animals at the tissue level of organization, but are surprisingly complex and vertebrate-like in terms of gene repertoire. As major components of tropical reef ecosystems, the stony corals are anthozoans of particular ecological significance. To better understand the molecular bases of both cnidarian development in general and coral-specific processes such as skeletogenesis and symbiont acquisition, microarray analysis was carried out through the period of early development – when skeletogenesis is initiated, and symbionts are first acquired. Results Of 5081 unique peptide coding genes, 1084 were differentially expressed (P ≤ 0.05) in comparisons between four different stages of coral development, spanning key developmental transitions. Genes of likely relevance to the processes of settlement, metamorphosis, calcification and interaction with symbionts were characterised further and their spatial expression patterns investigated using whole-mount in situ hybridization. Conclusion This study is the first large-scale investigation of developmental gene expression for any cnidarian, and has provided candidate genes for key roles in many aspects of coral biology, including calcification, metamorphosis and symbiont uptake. One surprising finding is that some of these genes have clear counterparts in higher animals but are not present in the closely-related sea anemone Nematostella. Secondly, coral-specific processes (i.e. traits which distinguish corals from their close relatives) may be analogous to similar processes in distantly related organisms. This first large-scale application of microarray analysis demonstrates the potential of this approach for investigating many aspects of coral biology, including the effects of stress and disease. PMID:19014561
Park, Soomin; Baek, Seung-Hun; Cho, Sang-Nae; Jang, Young-Saeng; Kim, Ahreum; Choi, In-Hong
2017-01-01
There is a substantial need for biomarkers to distinguish latent stage from active Mycobacterium tuberculosis infections, for predicting disease progression. We present a new experimental animal model for inducing the reactivation of tuberculosis, modified from the model previously established by our group. In the new model, the reactivation of tuberculosis is induced without administration of immunosuppressive agents, which might disturb immune responses. To identify the immunological status of the persistent and chronic stages, we analyzed immunological genes in lung tissues from mice infected with M. tuberculosis. Gene expression was screened using cDNA microarray analysis and confirmed by quantitative RT-PCR. Based on the cDNA microarray results, 11 candidate cytokine genes, which were clearly up-regulated during the chronic stage compared with the persistent stage, were selected and clustered into three groups: (1) chemokine genes other than those of monocyte chemoattractant proteins (MCPs): CXCL9, CXCL10, CXCL11, CCL5 and CCL19; (2) MCP genes: CCL2, CCL7, CCL8 and CCL12; and (3) TNF and IFN-γ genes. Results from the cDNA microarray and quantitative RT-PCR analyses revealed that the mRNA expression of the selected cytokine genes was significantly higher in lung tissues of the chronic stage than of the persistent stage. Three chemokines (CCL5, CCL19, and CXCL9) and three MCPs (CCL7, CCL2, and CCL12) were noticeably increased in the chronic stage compared with the persistent stage by cDNA microarray (p < 0.01, except CCL12) or RT-PCR (p < 0.01). Therefore, these six significantly increased cytokines in lung tissue from the mouse tuberculosis model might be candidates for biomarkers to distinguish the two disease stages. This information can be combined with already reported potential biomarkers to construct a network of more efficient tuberculosis markers.
Mehrian-Shai, Ruty; Yalon, Michal; Moshe, Itai; Barshack, Iris; Nass, Dvorah; Jacob, Jasmine; Dor, Chen; Reichardt, Juergen K V; Constantini, Shlomi; Toren, Amos
2016-01-14
The genetic mechanisms underlying hemangioblastoma development are still largely unknown. We used high-resolution single nucleotide polymorphism microarrays and droplet digital PCR analysis to detect copy number variations (CNVs) in a total of 45 hemangioblastoma tumors. We identified 94 CNVs with a median of 18 CNVs per sample. The most frequently gained regions were on chromosomes 1 (p36.32) and 7 (p11.2). These regions contain the PRDM16 and EGFR genes, respectively. Recurrent losses were located at chromosome 12 (q24.13), which includes the gene PTPN11. Our findings provide the first high-resolution genome-wide view of chromosomal changes in hemangioblastoma and identify 23 candidate genes for hemangioblastoma pathogenesis: EGFR, PRDM16, PTPN11, HOXD11, HOXD13, FLT3, PTCH, FGFR1, FOXP1, GPC3, HOXC13, HOXC11, MKL1, CHEK2, IRF4, GPHN, IKZF1, RB1, HOXA9, and microRNAs such as hsa-mir-196a-2. Furthermore, our data suggest that pathways promoting cell proliferation and angiogenesis may be involved in the molecular pathogenesis of hemangioblastoma.
Genome-wide prediction and analysis of human tissue-selective genes using microarray expression data
2013-01-01
Background Understanding how genes are expressed specifically in particular tissues is a fundamental question in developmental biology. Many tissue-specific genes are involved in the pathogenesis of complex human diseases. However, experimental identification of tissue-specific genes is time consuming and difficult. The accurate predictions of tissue-specific gene targets could provide useful information for biomarker development and drug target identification. Results In this study, we have developed a machine learning approach for predicting the human tissue-specific genes using microarray expression data. The lists of known tissue-specific genes for different tissues were collected from UniProt database, and the expression data retrieved from the previously compiled dataset according to the lists were used for input vector encoding. Random Forests (RFs) and Support Vector Machines (SVMs) were used to construct accurate classifiers. The RF classifiers were found to outperform SVM models for tissue-specific gene prediction. The results suggest that the candidate genes for brain or liver specific expression can provide valuable information for further experimental studies. Our approach was also applied for identifying tissue-selective gene targets for different types of tissues. Conclusions A machine learning approach has been developed for accurately identifying the candidate genes for tissue specific/selective expression. The approach provides an efficient way to select some interesting genes for developing new biomedical markers and improve our knowledge of tissue-specific expression. PMID:23369200
Kim, Yong-June; Yoon, Hyung-Yoon; Kim, Seon-Kyu; Kim, Young-Won; Kim, Eun-Jung; Kim, Isaac Yi; Kim, Wun-Jae
2011-07-01
Abnormal DNA methylation is associated with many human cancers. The aim of the present study was to identify novel methylation markers in prostate cancer (PCa) by microarray analysis and to test whether these markers could discriminate normal and PCa cells. Microarray-based DNA methylation and gene expression profiling was carried out using a panel of PCa cell lines and a control normal prostate cell line. The methylation status of candidate genes in prostate cell lines was confirmed by real-time reverse transcriptase-PCR, bisulfite sequencing analysis, and treatment with a demethylation agent. DNA methylation and gene expression analysis in 203 human prostate specimens, including 106 PCa and 97 benign prostate hyperplasia (BPH), were carried out. Further validation using microarray gene expression data from the Gene Expression Omnibus (GEO) was carried out. Epidermal growth factor-containing fibulin-like extracellular matrix protein 1 (EFEMP1) was identified as a lead candidate methylation marker for PCa. The gene expression level of EFEMP1 was significantly higher in tissue samples from patients with BPH than in those with PCa (P < 0.001). The sensitivity and specificity of EFEMP1 methylation status in discriminating between PCa and BPH reached 95.3% (101 of 106) and 86.6% (84 of 97), respectively. From the GEO data set, we confirmed that the expression level of EFEMP1 was significantly different between PCa and BPH. Genome-wide characterization of DNA methylation profiles enabled the identification of EFEMP1 aberrant methylation patterns in PCa. EFEMP1 might be a useful indicator for the detection of PCa.
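The sensitivity and specificity reported in this abstract follow directly from the counts it gives (101 of 106 PCa specimens and 84 of 97 BPH specimens classified correctly). The short check below recomputes them; the function names are illustrative, and the false negative and false positive counts are derived by subtraction from the reported totals.

```python
def sensitivity(true_pos, false_neg):
    """Fraction of diseased samples correctly identified."""
    return true_pos / (true_pos + false_neg)


def specificity(true_neg, false_pos):
    """Fraction of non-diseased samples correctly identified."""
    return true_neg / (true_neg + false_pos)


# Counts from the abstract: 101 of 106 PCa and 84 of 97 BPH specimens.
sens = sensitivity(true_pos=101, false_neg=106 - 101)
spec = specificity(true_neg=84, false_pos=97 - 84)
print(f"sensitivity = {100 * sens:.1f}%")  # sensitivity = 95.3%
print(f"specificity = {100 * spec:.1f}%")  # specificity = 86.6%
```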
Prediction of Human Disease Genes by Human-Mouse Conserved Coexpression Analysis
Grassi, Elena; Damasco, Christian; Silengo, Lorenzo; Oti, Martin; Provero, Paolo; Di Cunto, Ferdinando
2008-01-01
Background Even in the post-genomic era, the identification of candidate genes within loci associated with human genetic diseases is a very demanding task, because the critical region may typically contain hundreds of positional candidates. Since genes implicated in similar phenotypes tend to share very similar expression profiles, high throughput gene expression data may represent a very important resource to identify the best candidates for sequencing. However, so far, gene coexpression has not been used very successfully to prioritize positional candidates. Methodology/Principal Findings We show that it is possible to reliably identify disease-relevant relationships among genes from massive microarray datasets by concentrating only on genes sharing similar expression profiles in both human and mouse. Moreover, we show systematically that the integration of human-mouse conserved coexpression with a phenotype similarity map allows the efficient identification of disease genes in large genomic regions. Finally, using this approach on 850 OMIM loci characterized by an unknown molecular basis, we propose high-probability candidates for 81 genetic diseases. Conclusion Our results demonstrate that conserved coexpression, even at the human-mouse phylogenetic distance, represents a very strong criterion to predict disease-relevant relationships among human genes. PMID:18369433
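The idea of conserved coexpression can be illustrated with a toy filter that keeps only gene pairs whose expression profiles are correlated in both species. This is a simplified sketch and not the authors' pipeline: the use of a fixed Pearson threshold, the threshold value, the function names and the toy profiles (keyed by hypothetical ortholog identifiers) are all assumptions.

```python
from itertools import combinations
from math import sqrt


def pearson(x, y):
    """Pearson correlation of two equal-length, non-constant profiles."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sd_x = sqrt(sum((a - mean_x) ** 2 for a in x))
    sd_y = sqrt(sum((b - mean_y) ** 2 for b in y))
    return cov / (sd_x * sd_y)


def conserved_pairs(human, mouse, threshold=0.9):
    """Return gene pairs coexpressed in both species.

    human and mouse map an ortholog identifier to an expression profile.
    """
    shared = sorted(set(human) & set(mouse))
    return [
        (g1, g2)
        for g1, g2 in combinations(shared, 2)
        if pearson(human[g1], human[g2]) >= threshold
        and pearson(mouse[g1], mouse[g2]) >= threshold
    ]


# Made-up profiles: G1 and G2 track each other in both species,
# while G3 tracks G1 only in mouse.
human = {"G1": [1, 2, 3, 4], "G2": [2, 4, 6, 8.5], "G3": [4, 3, 2, 1]}
mouse = {"G1": [1, 2, 3, 4], "G2": [1.1, 2.0, 3.2, 3.9], "G3": [1, 2, 3, 4.2]}
print(conserved_pairs(human, mouse))
```

Requiring the correlation in both species discards the G1/G3 pair, which looks coexpressed in mouse alone.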
2009-01-01
Background Soybeans grown in the upper Midwestern United States often suffer from iron deficiency chlorosis, which results in yield loss at the end of the season. To better understand the effect of iron availability on soybean yield, we identified genes in two near isogenic lines with changes in expression patterns when plants were grown in iron sufficient and iron deficient conditions. Results Transcriptional profiles of soybean (Glycine max, L. Merr) near isogenic lines Clark (PI548553, iron efficient) and IsoClark (PI547430, iron inefficient) grown under Fe-sufficient and Fe-limited conditions were analyzed and compared using the Affymetrix® GeneChip® Soybean Genome Array. There were 835 candidate genes in the Clark (PI548553) genotype and 200 candidate genes in the IsoClark (PI547430) genotype putatively involved in soybean's iron stress response. Of these candidate genes, fifty-eight genes in the Clark genotype were identified with a genetic location within known iron efficiency QTL and 21 in the IsoClark genotype. The arrays also identified 170 single feature polymorphisms (SFPs) specific to either Clark or IsoClark. A sliding window analysis of the microarray data and the 7X genome assembly coupled with an iterative model of the data showed the candidate genes are clustered in the genome. An analysis of 5' untranslated regions in the promoter of candidate genes identified 11 conserved motifs in 248 differentially expressed genes, all from the Clark genotype, representing 129 clusters identified earlier, confirming the cluster analysis results. Conclusion These analyses have identified the first genes with expression patterns that are affected by iron stress and are located within QTL specific to iron deficiency stress. The genetic location and promoter motif analysis results support the hypothesis that the differentially expressed genes are co-regulated. 
The combined results of all analyses lead us to postulate iron inefficiency in soybean is a result of a mutation in a transcription factor(s), which controls the expression of genes required in inducing an iron stress response. PMID:19678937
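The sliding window analysis mentioned in the abstract above can be sketched roughly as follows. This is only an illustration of window counting along one chromosome; the positions, window size, and step are hypothetical, and the iterative model the authors coupled with the window analysis is not reproduced here.

```python
def window_counts(positions, window, step):
    """Count candidate genes falling in each half-open window [start, start + window).

    positions: base-pair coordinates of candidate genes on one chromosome.
    Returns a list of (window_start, count) tuples, starting at coordinate 0.
    """
    if not positions:
        return []
    positions = sorted(positions)
    last = positions[-1]
    counts = []
    start = 0
    while start <= last:
        n = sum(1 for p in positions if start <= p < start + window)
        counts.append((start, n))
        start += step
    return counts


# Hypothetical coordinates: three genes close together and one far away.
for start, n in window_counts([100, 150, 180, 900], window=200, step=100):
    print(start, n)
```

Windows holding far more candidates than the chromosome-wide average would be the kind of clusters the abstract refers to; deciding what counts as "far more" needs a statistical model that this sketch leaves out.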
USDA-ARS's Scientific Manuscript database
Cotton productivity is affected by water deficit, and little is known about the molecular basis of drought tolerance in cotton. In this study, microarray analysis was conducted to identify drought-responsive genes in the third topmost leaves of the field-grown drought-tolerant cotton (Gossypium hirs...
Takahashi, Hiro; Nemoto, Takeshi; Yoshida, Teruhiko; Honda, Hiroyuki; Hasegawa, Tadashi
2006-01-01
Background Recent advances in genome technologies have provided an excellent opportunity to determine the complete biological characteristics of neoplastic tissues, resulting in improved diagnosis and selection of treatment. To accomplish this objective, it is important to establish a sophisticated algorithm that can deal with large quantities of data such as gene expression profiles obtained by DNA microarray analysis. Results Previously, we developed the projective adaptive resonance theory (PART) filtering method as a gene filtering method. This is one of the clustering methods that can select specific genes for each subtype. In this study, we applied the PART filtering method to analyze microarray data that were obtained from soft tissue sarcoma (STS) patients for the extraction of subtype-specific genes. The performance of the filtering method was evaluated by comparison with other widely used methods, such as signal-to-noise, significance analysis of microarrays, and nearest shrunken centroids. In addition, various combinations of filtering and modeling methods were used to extract essential subtype-specific genes. The combination of the PART filtering method and boosting – the PART-BFCS method – showed the highest accuracy. Seven genes among the 15 genes that are frequently selected by this method – MIF, CYFIP2, HSPCB, TIMP3, LDHA, ABR, and RGS3 – are known prognostic marker genes for other tumors. These genes are candidate marker genes for the diagnosis of STS. Correlation analysis was performed to extract marker genes that were not selected by PART-BFCS. Sixteen genes among those extracted are also known prognostic marker genes for other tumors, and they could be candidate marker genes for the diagnosis of STS. Conclusion We propose a two-step procedure consisting of the PART-BFCS method followed by correlation analysis. 
The results suggest that novel diagnostic and therapeutic targets for STS can be extracted by a procedure that includes the PART filtering method. PMID:16948864
MIPHENO: Data normalization for high throughput metabolic analysis.
High throughput methodologies such as microarrays, mass spectrometry and plate-based small molecule screens are increasingly used to facilitate discoveries from gene function to drug candidate identification. These large-scale experiments are typically carried out over the course...
2010-01-01
Background Fruit development, maturation and ripening consist of a complex series of biochemical and physiological changes that in climacteric fruits, including apple and tomato, are coordinated by the gaseous hormone ethylene. These changes lead to final fruit quality and understanding of the functional machinery underlying these processes is of both biological and practical importance. To date many reports have been made on the analysis of gene expression in apple. In this study we focused our investigation on the role of ethylene during apple maturation, specifically comparing transcriptomics of normal ripening with changes resulting from application of the hormone receptor competitor 1-Methylcyclopropene. Results To gain insight into the molecular process regulating ripening in apple, and to compare to tomato (model species for ripening studies), we utilized both homologous and heterologous (tomato) microarray to profile transcriptome dynamics of genes involved in fruit development and ripening, emphasizing those which are ethylene regulated. The use of both types of microarrays facilitated transcriptome comparison between apple and tomato (for the latter, using data previously published and available at the TED: tomato expression database) and highlighted genes conserved during ripening of both species, which in turn represent a foundation for further comparative genomic studies. The cross-species analysis had the secondary aim of examining the efficiency of heterologous (specifically tomato) microarray hybridization for candidate gene identification as related to the ripening process. The resulting transcriptomics data revealed coordinated gene expression during fruit ripening of a subset of ripening-related and ethylene responsive genes, further facilitating the analysis of ethylene response during fruit maturation and ripening. 
Conclusion Our combined strategy based on microarray hybridization enabled transcriptome characterization during normal climacteric apple ripening, as well as definition of ethylene-dependent transcriptome changes. Comparison with tomato fruit maturation and ethylene responsive transcriptome activity facilitated identification of putative conserved orthologous ripening-related genes, which serve as an initial set of candidates for assessing conservation of gene activity across genomes of fruit bearing plant species. PMID:20973957
Hou, Qi; Bing, Zhi-Tong; Hu, Cheng; Li, Mao-Yin; Yang, Ke-Hu; Mo, Zu; Xie, Xiang-Wei; Liao, Ji-Lin; Lu, Yan; Horie, Shigeo; Lou, Ming-Wu
2018-06-01
Prostate cancer (PCa) is the most commonly diagnosed cancer in males in the Western world. Although prostate-specific antigen (PSA) has been widely used as a biomarker for PCa diagnosis, its results can be controversial. Therefore, new biomarkers are needed to enhance the clinical management of PCa. From publicly available microarray data, differentially expressed genes (DEGs) were identified by meta-analysis with RankProd. Genetic algorithm optimized artificial neural network (GA-ANN) was introduced to establish a diagnostic prediction model and to filter candidate genes. The diagnostic and prognostic capability of the prediction model and candidate genes were investigated in both GEO and TCGA datasets. Candidate genes were further validated by qPCR, Western Blot and Tissue microarray. By RankProd meta-analyses, 2306 significantly up- and 1311 down-regulated probes were found in 133 cases and 30 controls microarray data. The overall accuracy rate of the PCa diagnostic prediction model, consisting of a 15-gene signature, reached up to 100% in both the training and test dataset. The prediction model also showed good results for the diagnosis (AUC = 0.953) and prognosis (AUC of 5 years overall survival time = 0.808) of PCa in the TCGA database. The expression levels of three genes, FABP5, C1QTNF3 and LPHN3, were validated by qPCR. C1QTNF3 high expression was further validated in PCa tissue by Western Blot and Tissue microarray. In the GEO datasets, C1QTNF3 was a good predictor for the diagnosis of PCa (GSE6956: AUC = 0.791; GSE8218: AUC = 0.868; GSE26910: AUC = 0.972). In the TCGA database, C1QTNF3 was significantly associated with PCa patient recurrence free survival (P < .001, AUC = 0.57). In this study, we have developed a diagnostic and prognostic prediction model for PCa. C1QTNF3 was revealed as a promising biomarker for PCa. 
This approach can be applied to other high-throughput data from different platforms for the discovery of oncogenes or biomarkers in different kinds of diseases. Copyright © 2018. Published by Elsevier B.V.
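The rank-based meta-analysis step described in the abstract above can be illustrated with a simplified rank product calculation. This is a sketch of the general rank product statistic, not the RankProd package itself: the gene names and fold-change values are hypothetical, and the permutation-based significance estimate that a full analysis would need is omitted.

```python
from math import prod


def rank_product(fold_changes):
    """Geometric mean of per-study ranks for each gene.

    fold_changes: dict mapping gene -> list of log fold changes, one per study.
    Rank 1 is the most up-regulated gene within a study, so a small rank
    product marks genes that are consistently up-regulated across studies.
    """
    genes = list(fold_changes)
    n_studies = len(next(iter(fold_changes.values())))
    ranks = {g: [] for g in genes}
    for k in range(n_studies):
        ordered = sorted(genes, key=lambda g: fold_changes[g][k], reverse=True)
        for rank, gene in enumerate(ordered, start=1):
            ranks[gene].append(rank)
    return {g: prod(rs) ** (1.0 / len(rs)) for g, rs in ranks.items()}


# Hypothetical log fold changes from three studies.
data = {
    "gene1": [2.0, 1.5, 1.8],
    "gene2": [0.1, 0.3, -0.2],
    "gene3": [-1.0, -0.5, -1.2],
}
for gene, rp in sorted(rank_product(data).items(), key=lambda item: item[1]):
    print(gene, round(rp, 3))
```

Because only ranks are combined, studies run on different platforms with different intensity scales can be pooled without first normalizing them to a common scale.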
Yanagawa, Rempei; Furukawa, Yoichi; Tsunoda, Tatsuhiko; Kitahara, Osamu; Kameyama, Masao; Murata, Kohei; Ishikawa, Osamu; Nakamura, Yusuke
2001-01-01
Abstract In spite of intensive and increasingly successful attempts to determine the multiple steps involved in colorectal carcinogenesis, the mechanisms responsible for metastasis of colorectal tumors to the liver remain to be clarified. To identify genes that are candidates for involvement in the metastatic process, we analyzed genome-wide expression profiles of 10 primary colorectal cancers and their corresponding metastatic lesions by means of a cDNA microarray consisting of 9121 human genes. This analysis identified 40 genes whose expression was commonly upregulated in metastatic lesions, and 7 that were commonly downregulated. The upregulated genes encoded proteins involved in cell adhesion, or remodeling of the actin cytoskeleton. Investigation of the functions of more of the altered genes should improve our understanding of metastasis and may identify diagnostic markers and/or novel molecular targets for prevention or therapy of metastatic lesions. PMID:11687950
Song, Yajian; Xue, Yanfen; Ma, Yanhe
2013-01-01
The alkaliphilic hemicellulolytic bacterium Bacillus sp. N16-5 has a broad substrate spectrum and exhibits the capacity to utilize complex carbohydrates such as galactomannan, xylan, and pectin. In the monosaccharide mixture, sequential utilization by Bacillus sp. N16-5 was observed. Glucose appeared to be its preferential monosaccharide, followed by fructose, mannose, arabinose, xylose, and galactose. Global transcription profiles of the strain were determined separately for growth on six monosaccharides (glucose, fructose, mannose, galactose, arabinose, and xylose) and four polysaccharides (galactomannan, xylan, pectin, and sodium carboxymethylcellulose) using one-color microarrays. Numerous genes potentially related to polysaccharide degradation, sugar transport, and monosaccharide metabolism were found to respond to a specific substrate. Putative gene clusters for different carbohydrates were identified according to transcriptional patterns and genome annotation. Identification and analysis of these gene clusters contributed to pathway reconstruction for carbohydrate utilization in Bacillus sp. N16-5. Several genes encoding putative sugar transporters were highly expressed during growth on specific sugars, suggesting their functional roles. Two phosphoenolpyruvate-dependent phosphotransferase systems were identified as candidate transporters for mannose and fructose, and a major facilitator superfamily transporter was identified as a candidate transporter for arabinose and xylose. Five carbohydrate uptake transporter 1 family ATP-binding cassette transporters were predicted to participate in the uptake of hemicellulose and pectin degradation products. Collectively, microarray data improved the pathway reconstruction involved in carbohydrate utilization of Bacillus sp. N16-5 and revealed that the organism precisely regulates gene transcription in response to fluctuations in energy resources. PMID:23326578
Jupiter, Daniel; Chen, Hailin; VanBuren, Vincent
2009-01-01
Background Although expression microarrays have become a standard tool used by biologists, analysis of data produced by microarray experiments may still present challenges. Comparison of data from different platforms, organisms, and labs may involve complicated data processing, and inferring relationships between genes remains difficult. Results STARNET 2 is a new web-based tool that allows post hoc visual analysis of correlations that are derived from expression microarray data. STARNET 2 facilitates user discovery of putative gene regulatory networks in a variety of species (human, rat, mouse, chicken, zebrafish, Drosophila, C. elegans, S. cerevisiae, Arabidopsis and rice) by graphing networks of genes that are closely co-expressed across a large heterogeneous set of preselected microarray experiments. For each of the represented organisms, raw microarray data were retrieved from NCBI's Gene Expression Omnibus for a selected Affymetrix platform. All pairwise Pearson correlation coefficients were computed for expression profiles measured on each platform, respectively. These precompiled results were stored in a MySQL database, and supplemented by additional data retrieved from NCBI. A web-based tool allows user-specified queries of the database, centered at a gene of interest. The result of a query includes graphs of correlation networks, graphs of known interactions involving genes and gene products that are present in the correlation networks, and initial statistical analyses. Two analyses may be performed in parallel to compare networks, which is facilitated by the new HEATSEEKER module. Conclusion STARNET 2 is a useful tool for developing new hypotheses about regulatory relationships between genes and gene products, and has coverage for 10 species. 
Interpretation of the correlation networks is supported with a database of previously documented interactions, a test for enrichment of Gene Ontology terms, and heat maps of correlation distances that may be used to compare two networks. The list of genes in a STARNET network may be useful in developing a list of candidate genes to use for the inference of causal networks. The tool is freely available and does not require user registration. PMID:19828039
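A query centered at a gene of interest, as described in the abstract above, can be sketched as a ranking of all other genes by their Pearson correlation with the query. This is an illustration only and does not reflect how STARNET 2 is implemented: the function names, the toy profiles, and the choice to rank by signed correlation are assumptions.

```python
from math import sqrt


def pearson(x, y):
    """Pearson correlation of two equal-length profiles (0.0 if either is constant)."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    vx = sum((a - mx) ** 2 for a in x)
    vy = sum((b - my) ** 2 for b in y)
    if vx == 0 or vy == 0:
        return 0.0
    return cov / sqrt(vx * vy)


def correlation_neighbors(profiles, query, top_n=2):
    """Return the top_n genes most positively correlated with the query gene."""
    scores = [
        (gene, pearson(profiles[query], profile))
        for gene, profile in profiles.items()
        if gene != query
    ]
    scores.sort(key=lambda item: item[1], reverse=True)
    return scores[:top_n]


# Hypothetical expression profiles across four experiments.
profiles = {
    "query": [1, 2, 3, 4],
    "a": [2, 4, 6, 8],
    "b": [1, 3, 2, 4],
    "c": [4, 3, 2, 1],
}
print(correlation_neighbors(profiles, "query"))
```

A precomputed database of all pairwise coefficients, as the abstract describes, would replace the on-the-fly `pearson` calls with a lookup, which matters once thousands of genes are involved.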
A Discovery Resource of Rare Copy Number Variations in Individuals with Autism Spectrum Disorder
Prasad, Aparna; Merico, Daniele; Thiruvahindrapuram, Bhooma; Wei, John; Lionel, Anath C.; Sato, Daisuke; Rickaby, Jessica; Lu, Chao; Szatmari, Peter; Roberts, Wendy; Fernandez, Bridget A.; Marshall, Christian R.; Hatchwell, Eli; Eis, Peggy S.; Scherer, Stephen W.
2012-01-01
The identification of rare inherited and de novo copy number variations (CNVs) in human subjects has proven a productive approach to highlight risk genes for autism spectrum disorder (ASD). A variety of microarrays are available to detect CNVs, including single-nucleotide polymorphism (SNP) arrays and comparative genomic hybridization (CGH) arrays. Here, we examine a cohort of 696 unrelated ASD cases using a high-resolution one-million feature CGH microarray, the majority of which were previously genotyped with SNP arrays. Our objective was to discover new CNVs in ASD cases that were not detected by SNP microarray analysis and to delineate novel ASD risk loci via combined analysis of CGH and SNP array data sets on the ASD cohort and CGH data on an additional 1000 control samples. Of the 615 ASD cases analyzed on both SNP and CGH arrays, we found that 13,572 of 21,346 (64%) of the CNVs were exclusively detected by the CGH array. Several of the CGH-specific CNVs are rare in population frequency and impact previously reported ASD genes (e.g., NRXN1, GRM8, DPYD), as well as novel ASD candidate genes (e.g., CIB2, DAPP1, SAE1), and all were inherited except for a de novo CNV in the GPHN gene. A functional enrichment test of gene-sets in ASD cases over controls revealed nucleotide metabolism as a potential novel pathway involved in ASD, which includes several candidate genes for follow-up (e.g., DPYD, UPB1, UPP1, TYMP). Finally, this extensively phenotyped and genotyped ASD clinical cohort serves as an invaluable resource for the next step of genome sequencing for complete genetic variation detection. PMID:23275889
Lee, Chu-I; Chou, An-Kuo; Lin, Ching-Chih; Chou, Chia-Hua; Loh, Joon-Khim; Lieu, Ann-Shung; Wang, Chih-Jen; Huang, Chi-Ying F; Howng, Shen-Long; Hong, Yi-Ren
2012-01-01
Cerebral vasospasm following subarachnoid hemorrhage (SAH) has been studied in terms of a contraction of the major cerebral arteries, but the effect of cerebrum tissue in SAH is not yet well understood. To gain insight into the biology of SAH-expressing cerebrum, we employed oligonucleotide microarrays to characterize the gene expression profiles of cerebrum tissue at the early stage of SAH. Functional gene expression in the cerebrum was analyzed 2 h following stage 1-hemorrhage in Sprague-Dawley rats. mRNA was investigated by performing microarray and quantitative real-time PCR analyses, and protein expression was determined by Western blot analysis. In this study, 18 upregulated and 18 downregulated genes displayed at least a 1.5-fold change. Five genes were verified by real-time PCR, including three upregulated genes [prostaglandin E synthase (PGES), CD14 antigen, and tissue inhibitor of metalloproteinase 1 (TIMP1)] as well as two downregulated genes [KRAB-zinc finger protein-2 (KZF-2) and γ-aminobutyric acid B receptor 1 (GABA B receptor)]. Notably, there were functional implications for the three upregulated genes involved in the inflammatory SAH process. However, the mechanisms leading to decreased KZF-2 and GABA B receptor expression in SAH have never been characterized. We conclude that oligonucleotide microarrays have the potential for use as a method to identify candidate genes associated with SAH and to provide novel investigational targets, including genes involved in the immune and inflammatory response. Furthermore, understanding the regulation of MMP9/TIMP1 during the early stages of SAH may elucidate the pathophysiological mechanisms in SAH rats.
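The 1.5-fold change criterion used in the abstract above can be sketched as a simple ratio filter. The gene labels and intensity values below are hypothetical, and the sketch assumes positive, already-normalized intensities; it does not include the statistical testing that a full microarray analysis would normally add.

```python
def classify_by_fold_change(treated, control, min_fold=1.5):
    """Split genes into up- and down-regulated lists by a fold-change cutoff.

    treated, control: dicts mapping gene -> positive normalized intensity.
    A gene is "up" if treated/control >= min_fold and "down" if the ratio
    is <= 1/min_fold; genes missing from either dict are skipped.
    """
    up, down = [], []
    for gene, value in treated.items():
        if gene not in control:
            continue
        ratio = value / control[gene]
        if ratio >= min_fold:
            up.append(gene)
        elif ratio <= 1.0 / min_fold:
            down.append(gene)
    return sorted(up), sorted(down)


# Hypothetical intensities for three genes.
sah = {"geneA": 300.0, "geneB": 100.0, "geneC": 50.0}
ctrl = {"geneA": 100.0, "geneB": 90.0, "geneC": 100.0}
print(classify_by_fold_change(sah, ctrl))
```

Using the reciprocal cutoff for down-regulation keeps the criterion symmetric, so a 1.5-fold decrease is treated the same way as a 1.5-fold increase.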
Role of skeletal muscle in ear development.
Rot, Irena; Baguma-Nibasheka, Mark; Costain, Willard J; Hong, Paul; Tafra, Robert; Mardesic-Brakus, Snjezana; Mrduljas-Djujic, Natasa; Saraga-Babic, Mirna; Kablar, Boris
2017-10-01
The current paper is a continuation of our work described in Rot and Kablar, 2010. Here, we show lists of 10 up- and 87 down-regulated genes obtained by a cDNA microarray analysis that compared developing Myf5-/-:Myod-/- (and Mrf4-/-) petrous part of the temporal bone, containing middle and inner ear, to the control, at embryonic day 18.5. Myf5-/-:Myod-/- fetuses entirely lack skeletal myoblasts and muscles. They are unable to move their head, which interferes with the perception of angular acceleration. Previously, we showed that the inner ear areas most affected in Myf5-/-:Myod-/- fetuses were the vestibular cristae ampullaris, sensitive to angular acceleration. Our finding that the type I hair cells were absent in the mutants' cristae was further used here to identify a profile of genes specific to the lacking cell type. Microarrays followed by a detailed consultation of web-accessible mouse databases allowed us to identify 6 candidate genes with a possible role in the development of the inner ear sensory organs: Actc1, Pgam2, Ldb3, Eno3, Hspb7 and Smpx. Additionally, we searched for human homologues of the candidate genes since a number of syndromes in humans have associated inner ear abnormalities. Mutations in one of our candidate genes, Smpx, have been reported as the cause of X-linked deafness in humans. Our current study suggests an epigenetic role that mechanical, and potentially other, stimuli originating from muscle, play in organogenesis, and offers an approach to finding novel genes responsible for altered inner ear phenotypes.
Guerra-Laso, José M; Raposo-García, Sara; García-García, Silvia; Diez-Tascón, Cristina; Rivero-Lezcano, Octavio M
2015-02-01
Differences in the activity of monocytes/macrophages, important target cells of Mycobacterium tuberculosis, might influence tuberculosis progression. With the purpose of identifying candidate genes for tuberculosis susceptibility we infected monocytes from both healthy elderly individuals (a tuberculosis susceptibility group) and elderly tuberculosis patients with M. tuberculosis, and performed a microarray experiment. We detected 78 differentially expressed transcripts and confirmed these results by quantitative PCR of selected genes. We found that monocytes from tuberculosis patients showed similar expression patterns for these genes, regardless of whether they were obtained from younger or older patients. Only one of the detected genes corresponded to a cytokine: IL26, a member of the interleukin-10 (IL-10) cytokine family which we found to be down-regulated in infected monocytes from tuberculosis patients. Non-infected monocytes secreted IL-26 constitutively but they reacted strongly to M. tuberculosis infection by decreasing IL-26 production. Furthermore, IL-26 serum concentrations appeared to be lower in the tuberculosis patients. When whole blood was infected, IL-26 inhibited the observed pathogen-killing capability. Although lymphocytes expressed IL26R, the receptor mRNA was not detected in either monocytes or neutrophils, suggesting that the inhibition of anti-mycobacterial activity may be mediated by lymphocytes. Additionally, IL-2 concentrations in infected blood were lower in the presence of IL-26. The negative influence of IL-26 on the anti-mycobacterial activity and its constitutive presence in both serum and monocyte supernatants prompt us to propose IL26 as a candidate gene for tuberculosis susceptibility. © 2014 John Wiley & Sons Ltd.
NASA Technical Reports Server (NTRS)
Weitzel, A. J.; Wyatt, S. E.; Parsons-Wingerter, P.
2016-01-01
Venation patterning in leaves is a major determinant of photosynthesis efficiency because of its dependency on vascular transport of photo-assimilates, water, and minerals. Arabidopsis thaliana grown in microgravity show delayed growth and leaf maturation. Gene expression data from the roots, hypocotyl, and leaves of A. thaliana grown during spaceflight vs. ground control analyzed by Affymetrix microarray are available through NASA's GeneLab (GLDS-7). We analyzed the data for differential expression of genes in leaves resulting from the effects of spaceflight on vascular patterning. Two genes were found by preliminary analysis to be up-regulated during spaceflight that may be related to vascular formation. The genes are responsible for coding an ARGOS (Auxin-Regulated Gene Involved in Organ Size)-like protein (potentially affecting cell elongation in the leaves), and an F-box/kelch-repeat protein (possibly contributing to protoxylem specification). Further analysis that will focus on raw data quality assessment and a moderated t-test may further confirm up-regulation of the two genes and/or identify other gene candidates. Plants defective in these genes will then be assessed for phenotype by the mapping and quantification of leaf vascular patterning by NASA's VESsel GENeration (VESGEN) software to model specific vascular differences of plants grown in spaceflight.
Integration of QTL and bioinformatic tools to identify candidate genes for triglycerides in mice
Leduc, Magalie S.; Hageman, Rachael S.; Verdugo, Ricardo A.; Tsaih, Shirng-Wern; Walsh, Kenneth; Churchill, Gary A.; Paigen, Beverly
2011-01-01
To identify genetic loci influencing lipid levels, we performed quantitative trait loci (QTL) analysis between inbred mouse strains MRL/MpJ and SM/J, measuring triglyceride levels at 8 weeks of age in F2 mice fed a chow diet. We identified one significant QTL on chromosome (Chr) 15 and three suggestive QTL on Chrs 2, 7, and 17. We also carried out microarray analysis on the livers of parental strains of 282 F2 mice and used these data to find cis-regulated expression QTL. We then narrowed the list of candidate genes under significant QTL using a “toolbox” of bioinformatic resources, including haplotype analysis; parental strain comparison for gene expression differences and nonsynonymous coding single nucleotide polymorphisms (SNP); cis-regulated eQTL in livers of F2 mice; correlation between gene expression and phenotype; and conditioning of expression on the phenotype. We suggest Slc25a7 as a candidate gene for the Chr 7 QTL and, based on expression differences, five genes (Polr3h, Cyp2d22, Cyp2d26, Tspo, and Ttll12) as candidate genes for Chr 15 QTL. This study shows how bioinformatics can be used effectively to reduce candidate gene lists for QTL related to complex traits. PMID:21622629
Mutation spectrum and differential gene expression in cystic and solid vestibular schwannoma.
Zhang, Zhihua; Wang, Zhaoyan; Sun, Lianhua; Li, Xiaohua; Huang, Qi; Yang, Tao; Wu, Hao
2014-03-01
We sought to characterize the mutation spectrum of NF2 and the differential gene expression in cystic and solid vestibular schwannomas. We collected tumor tissue and blood samples of 31 cystic vestibular schwannomas and 114 solid vestibular schwannomas. Mutation screening of NF2 was performed in both tumor and blood DNA samples of all patients. cDNA microarray was used to analyze the differential gene expression between 11 cystic vestibular schwannomas and 6 solid vestibular schwannomas. Expression levels of top candidate genes were verified by quantitative reverse transcription PCR. NF2 mutations were identified in 34.5% of sporadic vestibular schwannomas, with all mutations being exclusively somatic. No significant difference was found between the mutation detection rates of cystic vestibular schwannoma (35.5%) and solid vestibular schwannoma (34.2%). cDNA microarray analysis detected a total of 46 differentially expressed genes between the cystic vestibular schwannoma and solid vestibular schwannoma samples. The significantly decreased expression of four top candidate genes, C1orf130, CNTF, COL4A3, and COL4A4, was verified by quantitative reverse transcription PCR. NF2 mutations are not directly involved in the cystic formation of vestibular schwannoma. In addition, the differential gene expression of cystic vestibular schwannoma reported in our study may provide useful insights into the molecular mechanism underlying this process.
Harripaul, R; Vasli, N; Mikhailov, A; Rafiq, M A; Mittal, K; Windpassinger, C; Sheikh, T I; Noor, A; Mahmood, H; Downey, S; Johnson, M; Vleuten, K; Bell, L; Ilyas, M; Khan, F S; Khan, V; Moradi, M; Ayaz, M; Naeem, F; Heidari, A; Ahmed, I; Ghadami, S; Agha, Z; Zeinali, S; Qamar, R; Mozhdehipanah, H; John, P; Mir, A; Ansar, M; French, L; Ayub, M; Vincent, J B
2018-04-01
Approximately 1% of the global population is affected by intellectual disability (ID), and the majority receive no molecular diagnosis. Previous studies have indicated high levels of genetic heterogeneity, with estimates of more than 2500 autosomal ID genes, the majority of which are autosomal recessive (AR). Here, we combined microarray genotyping, homozygosity-by-descent (HBD) mapping, copy number variation (CNV) analysis, and whole exome sequencing (WES) to identify disease genes/mutations in 192 multiplex Pakistani and Iranian consanguineous families with non-syndromic ID. We identified definite or candidate mutations (or CNVs) in 51% of families in 72 different genes, including 26 not previously reported for ARID. The new ARID genes include nine with loss-of-function mutations (ABI2, MAPK8, MPDZ, PIDD1, SLAIN1, TBC1D23, TRAPPC6B, UBA7 and USP44), and missense mutations include the first reports of variants in BDNF or TET1 associated with ID. The genes identified also showed overlap with de novo gene sets for other neuropsychiatric disorders. Transcriptional studies showed prominent expression in the prenatal brain. The high yield of AR mutations for ID indicated that this approach has excellent clinical potential and should inform clinical diagnostics, including clinical whole exome and genome sequencing, for populations in which consanguinity is common. As with other AR disorders, the relevance will also apply to outbred populations.
Jones, Kevin; Weiss, Shelly K; Minassian, Berge
2016-07-01
Patients presenting with infantile spasms, dysmorphic features, and periventricular nodular heterotopia may benefit from genetic copy number variation microarray, or whole-exome sequencing to identify candidate genes. This will allow personalized diagnosis and prognostication and the eventual understanding of single and combined gene functions in brain health and disease.
Jin, Yulan; Sharma, Ashok; Bai, Shan; Davis, Colleen; Liu, Haitao; Hopkins, Diane; Barriga, Kathy; Rewers, Marian; She, Jin-Xiong
2014-07-01
There is tremendous scientific and clinical value to further improving the predictive power of autoantibodies because autoantibody-positive (AbP) children have heterogeneous rates of progression to clinical diabetes. This study explored the potential of gene expression profiles as biomarkers for risk stratification among 104 AbP subjects from the Diabetes Autoimmunity Study in the Young (DAISY) using a discovery data set based on microarray and a validation data set based on real-time RT-PCR. The microarray data identified 454 candidate genes with expression levels associated with various type 1 diabetes (T1D) progression rates. RT-PCR analyses of the top-27 candidate genes confirmed 5 genes (BACH2, IGLL3, EIF3A, CDC20, and TXNDC5) associated with differential progression and implicated in lymphocyte activation and function. Multivariate analyses of these five genes in the discovery and validation data sets identified and confirmed four multigene models (BI, ICE, BICE, and BITE, with each letter representing a gene) that consistently stratify high- and low-risk subsets of AbP subjects with hazard ratios >6 (P < 0.01). The results suggest that these genes may be involved in T1D pathogenesis and potentially serve as excellent gene expression biomarkers to predict the risk of progression to clinical diabetes for AbP subjects. © 2014 by the American Diabetes Association.
EBF factors drive expression of multiple classes of target genes governing neuronal development.
Green, Yangsook S; Vetter, Monica L
2011-04-30
Early B cell factor (EBF) family members are transcription factors known to have important roles in several aspects of vertebrate neurogenesis, including commitment, migration and differentiation. Knowledge of how EBF family members contribute to neurogenesis is limited by a lack of detailed understanding of genes that are transcriptionally regulated by these factors. We performed a microarray screen in Xenopus animal caps to search for targets of EBF transcriptional activity, and identified candidate targets with multiple roles, including transcription factors of several classes. We determined that, among the most upregulated candidate genes with expected neuronal functions, most require EBF activity for some or all of their expression, and most have overlapping expression with ebf genes. We also found that the candidate target genes that had the most strongly overlapping expression patterns with ebf genes were predicted to be direct transcriptional targets of EBF transcriptional activity. The identification of candidate targets that are transcription factor genes, including nscl-1, emx1 and aml1, improves our understanding of how EBF proteins participate in the hierarchy of transcription control during neuronal development, and suggests novel mechanisms by which EBF activity promotes migration and differentiation. Other candidate targets, including pcdh8 and kcnk5, expand our knowledge of the types of terminal differentiated neuronal functions that EBF proteins regulate.
Esibizione, Diana; Cui, Chang-Yi; Schlessinger, David
2009-01-01
EDA, the gene mutated in anhidrotic ectodermal dysplasia, encodes ectodysplasin, a TNF superfamily member that activates NF-kB mediated transcription. To identify EDA target genes, we have earlier used expression profiling to infer genes differentially expressed at various developmental time points in Tabby (Eda-deficient) compared to wild-type mouse skin. To increase the resolution to find genes whose expression may be restricted to epidermal cells, we have now extended studies to primary keratinocyte cultures established from E19 wild-type and Tabby skin. Using microarrays bearing 44,000 gene probes, we found 385 preliminary candidate genes whose expression was significantly affected by Eda loss. By comparing expression profiles to those from Eda-A1 transgenic skin, we restricted the list to 38 “candidate EDA targets”, 14 of which were already known to be expressed in hair follicles or epidermis. We confirmed expression changes for 3 selected genes, Tbx1, Bmp7, and Jag1, both in keratinocytes and in whole skin, by Q-PCR and Western blotting analyses. Thus, by the analysis of keratinocytes, novel candidate pathways downstream of EDA were detected. PMID:18848976
2011-01-01
Background The mechanical properties of wood are largely determined by the orientation of cellulose microfibrils in secondary cell walls. Several genes and their allelic variants have previously been found to affect microfibril angle (MFA) and wood stiffness; however, the molecular mechanisms controlling microfibril orientation and mechanical strength are largely uncharacterised. In the present study, cDNA microarrays were used to compare gene expression in developing xylem with contrasting stiffness and MFA in juvenile Pinus radiata trees in order to gain further insights into the molecular mechanisms underlying microfibril orientation and cell wall mechanics. Results Juvenile radiata pine trees with higher stiffness (HS) had lower MFA in the earlywood and latewood of each ring compared to low stiffness (LS) trees. Approximately 3.4 to 14.5% out of 3,320 xylem unigenes on cDNA microarrays were differentially regulated in juvenile wood with contrasting stiffness and MFA. Greater variation in MFA and stiffness was observed in earlywood compared to latewood, suggesting earlywood contributes most to differences in stiffness; however, 3-4 times more genes were differentially regulated in latewood than in earlywood. A total of 108 xylem unigenes were differentially regulated in juvenile wood with HS and LS in at least two seasons, including 43 unigenes with unknown functions. Many genes involved in cytoskeleton development and secondary wall formation (cellulose and lignin biosynthesis) were preferentially transcribed in wood with HS and low MFA. In contrast, several genes involved in cell division and primary wall synthesis were more abundantly transcribed in LS wood with high MFA. Conclusions Microarray expression profiles in Pinus radiata juvenile wood with contrasting stiffness have shed more light on the transcriptional control of microfibril orientation and the mechanical properties of wood. 
The identified candidate genes provide an invaluable resource for further gene function and association genetics studies aimed at deepening our understanding of cell wall biomechanics with a view to improving the mechanical properties of wood. PMID:21962175
From genomes to vaccines: Leishmania as a model.
Almeida, Renata; Norrish, Alan; Levick, Mark; Vetrie, David; Freeman, Tom; Vilo, Jaak; Ivens, Alasdair; Lange, Uta; Stober, Carmel; McCann, Sharon; Blackwell, Jenefer M
2002-01-01
The 35 Mb genome of Leishmania should be sequenced by late 2002. It contains approximately 8500 genes that will probably translate into more than 10 000 proteins. In the laboratory we have been piloting strategies to try to harness the power of the genome-proteome for rapid screening of new vaccine candidates. To this end, microarray analysis of 1094 unique genes identified using an EST analysis of 2091 cDNA clones from spliced leader libraries prepared from different developmental stages of Leishmania has been employed. The plan was to identify amastigote-expressed genes that could be used in high-throughput DNA-vaccine screens to identify potential new vaccine candidates. Despite the lack of transcriptional regulation that polycistronic transcription in Leishmania dictates, the data provide evidence for a high level of post-transcriptional regulation of RNA abundance during the developmental cycle of promastigotes in culture and in lesion-derived amastigotes of Leishmania major. This has provided 147 candidates from the 1094 unique genes that are specifically upregulated in amastigotes and are being used in vaccine studies. Using DNA vaccination, it was demonstrated that pooling strategies can work to identify protective vaccines, but it was found that some potentially protective antigens are masked by other disease-exacerbatory antigens in the pool. A total of 100 new vaccine candidates are currently being tested separately and in pools to extend this analysis, and to facilitate retrospective bioinformatic analysis to develop predictive algorithms for sequences that constitute potentially protective antigens. We are also working with other members of the Leishmania Genome Network to determine whether RNA expression determined by microarray analyses parallels expression at the protein level. 
We believe we are making good progress in developing strategies that will allow rapid translation of the sequence of Leishmania into potential interventions for disease control in humans. PMID:11839176
DOE Office of Scientific and Technical Information (OSTI.GOV)
Tholouli, Eleni; MacDermott, Sarah; Hoyland, Judith
2012-08-24
Highlights: Development of a quantitative high throughput in situ expression profiling method. Application to a tissue microarray of 242 AML bone marrow samples. Identification of HOXA4, HOXA9, Meis1 and DNMT3A as prognostic markers in AML. Abstract: Measurement and validation of microarray gene signatures in routine clinical samples is problematic and a rate limiting step in translational research. In order to facilitate measurement of microarray identified gene signatures in routine clinical tissue a novel method combining quantum dot based oligonucleotide in situ hybridisation (QD-ISH) and post-hybridisation spectral image analysis was used for multiplex in-situ transcript detection in archival bone marrow trephine samples from patients with acute myeloid leukaemia (AML). Tissue-microarrays were prepared into which white cell pellets were spiked as a standard. Tissue microarrays were made using routinely processed bone marrow trephines from 242 patients with AML. QD-ISH was performed for six candidate prognostic genes using triplex QD-ISH for DNMT1, DNMT3A, DNMT3B, and for HOXA4, HOXA9, Meis1. Scrambled oligonucleotides were used to correct for background staining followed by normalisation of expression against the expression values for the white cell pellet standard. Survival analysis demonstrated that low expression of HOXA4 was associated with poorer overall survival (p = 0.009), whilst high expression of HOXA9 (p < 0.0001), Meis1 (p = 0.005) and DNMT3A (p = 0.04) were associated with early treatment failure. These results demonstrate application of a standardised, quantitative multiplex QD-ISH method for identification of prognostic markers in formalin-fixed paraffin-embedded clinical samples, facilitating measurement of gene expression signatures in routine clinical samples.
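The two-step signal processing described in this abstract (background correction with scrambled oligonucleotides, then normalisation against the white cell pellet standard) might be sketched as follows. The abstract does not give the arithmetic, so the sketch assumes simple subtraction for the background step and a ratio for the normalisation step; the function names and the numbers are hypothetical.

```python
def background_corrected(signal, scrambled_signal):
    """Subtract the scrambled-probe signal (assumed background); floor at zero."""
    return max(signal - scrambled_signal, 0.0)


def normalised_expression(sample_signal, sample_scrambled,
                          standard_signal, standard_scrambled):
    """Express a background-corrected sample signal relative to the standard.

    Assumption: normalisation is a plain ratio against the background-corrected
    signal of the spiked-in standard on the same array.
    """
    standard = background_corrected(standard_signal, standard_scrambled)
    if standard == 0.0:
        raise ValueError("standard signal does not exceed background")
    return background_corrected(sample_signal, sample_scrambled) / standard


# Made-up intensities: sample 150 over background 30, standard 90 over background 30.
value = normalised_expression(150.0, 30.0, 90.0, 30.0)
print(value)
```

Under these assumptions a value above 1 would mean the sample expresses the transcript more strongly than the standard does.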
Mixture models for detecting differentially expressed genes in microarrays.
Jones, Liat Ben-Tovim; Bean, Richard; McLachlan, Geoffrey J; Zhu, Justin Xi
2006-10-01
An important and common problem in microarray experiments is the detection of genes that are differentially expressed in a given number of classes. As this problem concerns the selection of significant genes from a large pool of candidate genes, it needs to be carried out within the framework of multiple hypothesis testing. In this paper, we focus on the use of mixture models to handle the multiplicity issue. With this approach, a measure of the local FDR (false discovery rate) is provided for each gene. An attractive feature of the mixture model approach is that it provides a framework for the estimation of the prior probability that a gene is not differentially expressed, and this probability can subsequently be used in forming a decision rule. The rule can also be formed to take the false negative rate into account. We apply this approach to a well-known publicly available data set on breast cancer, and discuss our findings with reference to other approaches.
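The local FDR mentioned in this abstract can be illustrated with a two-component mixture, where the local FDR for a gene is the posterior probability that it belongs to the null (not differentially expressed) component. The sketch below is a minimal illustration, not the authors' implementation: it assumes normal densities for both components, fixed made-up parameters instead of fitted ones, and a hypothetical decision threshold of 0.2.

```python
import math


def normal_pdf(z, mean, sd):
    """Density of a normal distribution at z."""
    return math.exp(-0.5 * ((z - mean) / sd) ** 2) / (sd * math.sqrt(2.0 * math.pi))


def local_fdr(z, pi0, null=(0.0, 1.0), alt=(2.5, 1.0)):
    """Posterior probability that a gene with score z is NOT differentially expressed.

    pi0 is the prior probability of the null component; `null` and `alt` are
    (mean, sd) pairs chosen here for illustration only.
    """
    f0 = pi0 * normal_pdf(z, *null)
    f1 = (1.0 - pi0) * normal_pdf(z, *alt)
    return f0 / (f0 + f1)


def select_genes(scores, pi0, threshold=0.2):
    """Decision rule: keep genes whose local FDR is at or below the threshold."""
    return [gene for gene, z in scores.items() if local_fdr(z, pi0) <= threshold]


# Hypothetical gene-level test statistics.
scores = {"geneA": 0.1, "geneB": 3.2, "geneC": 1.0, "geneD": 4.0}
selected = select_genes(scores, pi0=0.9)
print(selected)
```

In practice the mixture parameters, including the prior probability of the null, would be estimated from the data rather than fixed in advance.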
Defining the Human Macula Transcriptome and Candidate Retinal Disease Genes Using EyeSAGE
Rickman, Catherine Bowes; Ebright, Jessica N.; Zavodni, Zachary J.; Yu, Ling; Wang, Tianyuan; Daiger, Stephen P.; Wistow, Graeme; Boon, Kathy; Hauser, Michael A.
2009-01-01
Purpose To develop large-scale, high-throughput annotation of the human macula transcriptome and to identify and prioritize candidate genes for inherited retinal dystrophies, based on ocular-expression profiles using serial analysis of gene expression (SAGE). Methods Two human retina and two retinal pigment epithelium (RPE)/choroid SAGE libraries made from matched macula or midperipheral retina and adjacent RPE/choroid of morphologically normal 28- to 66-year-old donors and a human central retina longSAGE library made from 41- to 66-year-old donors were generated. Their transcription profiles were entered into a relational database, EyeSAGE, including microarray expression profiles of retina and publicly available normal human tissue SAGE libraries. EyeSAGE was used to identify retina- and RPE-specific and -associated genes, and candidate genes for retina and RPE disease loci. Differential and/or cell-type specific expression was validated by quantitative and single-cell RT-PCR. Results Cone photoreceptor-associated gene expression was elevated in the macula transcription profiles. Analysis of the longSAGE retina tags enhanced tag-to-gene mapping and revealed alternatively spliced genes. Analysis of candidate gene expression tables for the identified Bardet-Biedl syndrome disease gene (BBS5) in the BBS5 disease region table yielded BBS5 as the top candidate. Compelling candidates for inherited retina diseases were identified. Conclusions The EyeSAGE database, combining three different gene-profiling platforms including the authors’ multidonor-derived retina/RPE SAGE libraries and existing single-donor retina/RPE libraries, is a powerful resource for definition of the retina and RPE transcriptomes. It can be used to identify retina-specific genes, including alternatively spliced transcripts and to prioritize candidate genes within mapped retinal disease regions. PMID:16723438
Defining the human macula transcriptome and candidate retinal disease genes using EyeSAGE.
Bowes Rickman, Catherine; Ebright, Jessica N; Zavodni, Zachary J; Yu, Ling; Wang, Tianyuan; Daiger, Stephen P; Wistow, Graeme; Boon, Kathy; Hauser, Michael A
2006-06-01
To develop large-scale, high-throughput annotation of the human macula transcriptome and to identify and prioritize candidate genes for inherited retinal dystrophies, based on ocular-expression profiles using serial analysis of gene expression (SAGE). Two human retina and two retinal pigment epithelium (RPE)/choroid SAGE libraries made from matched macula or midperipheral retina and adjacent RPE/choroid of morphologically normal 28- to 66-year-old donors and a human central retina longSAGE library made from 41- to 66-year-old donors were generated. Their transcription profiles were entered into a relational database, EyeSAGE, including microarray expression profiles of retina and publicly available normal human tissue SAGE libraries. EyeSAGE was used to identify retina- and RPE-specific and -associated genes, and candidate genes for retina and RPE disease loci. Differential and/or cell-type specific expression was validated by quantitative and single-cell RT-PCR. Cone photoreceptor-associated gene expression was elevated in the macula transcription profiles. Analysis of the longSAGE retina tags enhanced tag-to-gene mapping and revealed alternatively spliced genes. Analysis of candidate gene expression tables for the identified Bardet-Biedl syndrome disease gene (BBS5) in the BBS5 disease region table yielded BBS5 as the top candidate. Compelling candidates for inherited retina diseases were identified. The EyeSAGE database, combining three different gene-profiling platforms including the authors' multidonor-derived retina/RPE SAGE libraries and existing single-donor retina/RPE libraries, is a powerful resource for definition of the retina and RPE transcriptomes. It can be used to identify retina-specific genes, including alternatively spliced transcripts and to prioritize candidate genes within mapped retinal disease regions.
Bae, Yun Jung; Kim, Sung-Eun; Hong, Seong Yeon; Park, Taesun; Lee, Sang Gyu; Choi, Myung-Sook; Sung, Mi-Kyung
2016-01-01
Obesity is known to increase the risk of colorectal cancer. However, mechanisms underlying the pathogenesis of obesity-induced colorectal cancer are not completely understood. The purposes of this study were to identify differentially expressed genes in the colon of mice with diet-induced obesity and to select candidate genes as early markers of obesity-associated abnormal cell growth in the colon. C57BL/6N mice were fed normal diet (11% fat energy) or high-fat diet (40% fat energy) and were euthanized at different time points. Genome-wide expression profiles of the colon were determined at 2, 4, 8, and 12 weeks. Cluster analysis was performed using expression data of genes showing log2 fold change of ≥1 or ≤-1 (twofold change), based on time-dependent expression patterns, followed by virtual network analysis. High-fat diet-fed mice showed significant increase in body weight and total visceral fat weight over 12 weeks. Time-course microarray analysis showed that 50, 47, 36, and 411 genes were differentially expressed at 2, 4, 8, and 12 weeks, respectively. Ten cluster profiles representing distinguishable patterns of genes differentially expressed over time were determined. Cluster 4, which consisted of genes showing the most significant alterations in expression in response to high-fat diet over 12 weeks, included Apoa4 (apolipoprotein A-IV), Ppap2b (phosphatidic acid phosphatase type 2B), Cel (carboxyl ester lipase), and Clps (colipase, pancreatic), which interacted strongly with surrounding genes associated with colorectal cancer or obesity. Our data indicate that Apoa4, Ppap2b, Cel, and Clps are candidate early marker genes associated with obesity-related pathological changes in the colon. Genome-wide analyses performed in the present study provide new insights on selecting novel genes that may be associated with the development of diseases of the colon.
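The selection criterion stated in this abstract, a log2 fold change of at least 1 or at most -1 (a twofold change in either direction), can be sketched as below. The gene names and expression values are invented for illustration, and the abstract does not say how the fold change was computed, so a plain ratio of high-fat to normal-diet signal is assumed.

```python
import math


def log2_fold_change(treated, control):
    """log2 of the treated/control expression ratio (assumed definition)."""
    return math.log2(treated / control)


def differentially_expressed(expression, cutoff=1.0):
    """Return genes whose absolute log2 fold change meets the cutoff."""
    hits = {}
    for gene, (treated, control) in expression.items():
        lfc = log2_fold_change(treated, control)
        if abs(lfc) >= cutoff:
            hits[gene] = lfc
    return hits


# Hypothetical (high-fat, normal-diet) signal pairs for three genes.
expression = {
    "gene1": (400.0, 100.0),  # fourfold up
    "gene2": (120.0, 100.0),  # 1.2-fold up, below the cutoff
    "gene3": (50.0, 100.0),   # twofold down, exactly at the cutoff
}
hits = differentially_expressed(expression)
print(hits)
```

Working on the log2 scale makes up- and downregulation symmetric: a twofold increase and a twofold decrease sit at +1 and -1 respectively.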
Identification of the TFII-I family target genes in the vertebrate genome.
Chimge, Nyam-Osor; Makeyev, Aleksandr V; Ruddle, Frank H; Bayarsaihan, Dashzeveg
2008-07-01
GTF2I and GTF2IRD1 encode members of the TFII-I transcription factor family and are prime candidates in the Williams syndrome, a complex neurodevelopmental disorder. Our previous expression microarray studies implicated TFII-I proteins in the regulation of a number of genes critical in various aspects of cell physiology. Here, we combined bioinformatics and microarray results to identify TFII-I downstream targets in the vertebrate genome. These results were validated by chromatin immunoprecipitation and siRNA analysis. The collected evidence revealed the complexity of TFII-I-mediated processes that involve distinct regulatory networks. Altogether, these results lead to a better understanding of specific molecular events, some of which may be responsible for the Williams syndrome phenotype.
Xia, Yu; Yang, Yongchao; Huang, Shufang; Wu, Yueheng; Li, Ping; Zhuang, Jian
2018-03-24
This study aimed to determine chromosomal abnormalities and copy number variations (CNVs) in fetuses with congenital heart disease (CHD) by chromosomal microarray analysis (CMA). One hundred and ten cases with CHD detected by prenatal echocardiography were enrolled in the study; 27 cases were simple CHDs, and 83 were complex CHDs. Chromosomal microarray analysis was performed on the Affymetrix CytoScan HD platform. All annotated CNVs were validated by quantitative PCR. Chromosomal microarray analysis identified 6 cases with chromosomal abnormalities, including 2 cases with trisomy 21, 2 cases with trisomy 18, 1 case with trisomy 13, and 1 unusual case of mosaic trisomy 21. Pathogenic CNVs were detected in 15.5% (17/110) of the fetuses with CHDs, including 13 cases with CHD-associated CNVs. We further identified 10 genes as likely novel CHD candidate genes through gene functional enrichment analysis. We also found that pathogenic CMA results impacted the rate of pregnancy termination. This study shows that CMA is particularly effective for identifying chromosomal abnormalities and CNVs in fetuses with CHDs as well as having an effect on obstetrical outcomes. The elucidation of the genetic basis of CHDs will continue to expand our understanding of the etiology of CHDs. © 2018 John Wiley & Sons, Ltd.
Gene-Expression Biomarkers for Application to High-Throughput Radiation Biodosimetry
2005-01-01
nuclear disaster. Even with the delayed onset of symptoms, sometimes several days after exposure, gene-expression biomarkers can identify these exposed individuals very early after exposure, allowing for prompt medical intervention. This early assessment of a radiation dose after exposure would enhance the operational commander’s situational awareness of the radiation exposure status of deployed units and increase the prospect of reduced morbidity and mortality through early medical intervention. Candidate gene targets were selected from microarray studies of ex
Bayne, Christopher J.; Camara, Mark D.; Cunningham, Charles; Jenny, Matthew J.; Langdon, Christopher J.
2010-01-01
Sessile inhabitants of marine intertidal environments commonly face heat stress, an important component of summer mortality syndrome in the Pacific oyster Crassostrea gigas. Marker-aided selection programs would be useful for developing oyster strains that resist summer mortality; however, there is currently a need to identify candidate genes associated with stress tolerance and to develop molecular markers associated with those genes. To identify candidate genes for further study, we used cDNA microarrays to test the hypothesis that oyster families that had high (>64%) or low (<29%) survival of heat shock (43°C, 1 h) differ in their transcriptional responses to stress. Based upon data generated by the microarray and by real-time quantitative PCR, we found that transcription after heat shock increased for genes putatively encoding heat shock proteins and genes for proteins that synthesize lipids, protect against bacterial infection, and regulate spawning, whereas transcription decreased for genes for proteins that mobilize lipids and detoxify reactive oxygen species. RNAs putatively identified as heat shock protein 27, collagen, peroxinectin, S-crystallin, and two genes with no match in GenBank had higher transcript concentrations in low-surviving families than in high-surviving families, whereas concentration of putative cystatin B mRNA was greater in high-surviving families. These ESTs should be studied further for use in marker-aided selection programs. Low survival of heat shock could result from a complex interaction of cell damage, opportunistic infection, and metabolic exhaustion. PMID:19205802
Reif, David M.; Israel, Mark A.; Moore, Jason H.
2007-01-01
The biological interpretation of gene expression microarray results is a daunting challenge. For complex diseases such as cancer, wherein the body of published research is extensive, the incorporation of expert knowledge provides a useful analytical framework. We have previously developed the Exploratory Visual Analysis (EVA) software for exploring data analysis results in the context of annotation information about each gene, as well as biologically relevant groups of genes. We present EVA as a flexible combination of statistics and biological annotation that provides a straightforward visual interface for the interpretation of microarray analyses of gene expression in the most commonly occurring class of brain tumors, glioma. We demonstrate the utility of EVA for the biological interpretation of statistical results by analyzing publicly available gene expression profiles of two important glial tumors. The results of a statistical comparison between 21 malignant, high-grade glioblastoma multiforme (GBM) tumors and 19 indolent, low-grade pilocytic astrocytomas were analyzed using EVA. By using EVA to examine the results of a relatively simple statistical analysis, we were able to identify tumor class-specific gene expression patterns having both statistical and biological significance. Our interactive analysis highlighted the potential importance of genes involved in cell cycle progression, proliferation, signaling, adhesion, migration, motility, and structure, as well as candidate gene loci on a region of Chromosome 7 that has been implicated in glioma. Because EVA does not require statistical or computational expertise and has the flexibility to accommodate any type of statistical analysis, we anticipate EVA will prove a useful addition to the repertoire of computational methods used for microarray data analysis. EVA is available at no charge to academic users and can be found at http://www.epistasis.org. PMID:19390666
Transcriptome Analysis of Early Responsive Genes in Rice during Magnaporthe oryzae Infection.
Wang, Yiming; Kwon, Soon Jae; Wu, Jingni; Choi, Jaeyoung; Lee, Yong-Hwan; Agrawal, Ganesh Kumar; Tamogami, Shigeru; Rakwal, Randeep; Park, Sang-Ryeol; Kim, Beom-Gi; Jung, Ki-Hong; Kang, Kyu Young; Kim, Sang Gon; Kim, Sun Tae
2014-12-01
Rice blast disease caused by Magnaporthe oryzae is one of the most serious diseases of cultivated rice (Oryza sativa L.) in most rice-growing regions of the world. In order to investigate early response genes in rice, we applied transcriptome analysis using a 300 K tiling microarray to rice leaves infected with compatible and incompatible M. oryzae strains. Prior to the microarray experiment, total RNA was validated by measuring the differential expression of rice defense-related marker genes (chitinase 2, barwin, PBZ1, and PR-10) by RT-PCR, and phytoalexins (sakuranetin and momilactone A) with HPLC. Microarray analysis revealed that 231 genes were up-regulated (>2 fold change, p < 0.05) in the incompatible interaction compared to the compatible one. Highly expressed genes were functionally characterized into metabolic processes and oxidation-reduction categories. The oxidative stress response was induced in both early and later infection stages. Biotic stress overview from MapMan analysis revealed that the phytohormone ethylene as well as the signaling molecules jasmonic acid and salicylic acid are important for defense gene regulation. WRKY and Myb transcription factors were also involved in signal transduction processes. Additionally, receptor-like kinases were more likely associated with the defense response, and their expression patterns were validated by RT-PCR. Our results suggest that candidate genes, including receptor-like protein kinases, may play a key role in disease resistance against M. oryzae attack.
TOM: a web-based integrated approach for identification of candidate disease genes.
Rossi, Simona; Masotti, Daniele; Nardini, Christine; Bonora, Elena; Romeo, Giovanni; Macii, Enrico; Benini, Luca; Volinia, Stefano
2006-07-01
The massive production of biological data by means of highly parallel devices like microarrays for gene expression has paved the way to new possible approaches in molecular genetics. Among them is the possibility of inferring biological answers by querying large amounts of expression data. Based on this principle, we present here TOM, a web-based resource for the efficient extraction of candidate genes for hereditary diseases. The service requires prior knowledge of at least one other gene responsible for the disease and the linkage area, or else of two disease-associated genetic intervals. The algorithm uses the information stored in public resources, including mapping, expression and functional databases. Given the queries, TOM will select and list one or more candidate genes. This approach allows the geneticist to bypass the costly and time-consuming tracing of genetic markers through entire families and might improve the chance of identifying disease genes, particularly for rare diseases. We present here the tool and the results obtained on a known benchmark and on hereditary predisposition to familial thyroid cancer. Our algorithm is available at http://www-micrel.deis.unibo.it/~tom/.
Tsimakouridze, Elena V; Straume, Marty; Podobed, Peter S; Chin, Heather; LaMarre, Jonathan; Johnson, Ron; Antenos, Monica; Kirby, Gordon M; Mackay, Allison; Huether, Patsy; Simpson, Jeremy A; Sole, Michael; Gadal, Gerard; Martino, Tami A
2012-08-01
There is critical demand in contemporary medicine for gene expression markers in all areas of human disease, for early detection of disease, classification, prognosis, and response to therapy. The integrity of circadian gene expression underlies cardiovascular health and disease; however, time-of-day profiling in heart disease has never been examined. We hypothesized that a time-of-day chronomic approach using samples collected across 24-h cycles and analyzed by microarrays and bioinformatics advances contemporary approaches, because it includes sleep-time and/or wake-time molecular responses. As proof of concept, we demonstrate the value of this approach in cardiovascular disease using a murine Transverse Aortic Constriction (TAC) model of pressure overload-induced cardiac hypertrophy. First, microarrays and a novel algorithm termed DeltaGene were used to identify time-of-day differences in gene expression in cardiac hypertrophy 8 wks post-TAC. The top 300 candidates were further analyzed using knowledge-based platforms, paring the list to 20 candidates, which were then validated by real-time polymerase chain reaction (RTPCR). Next, we tested whether the time-of-day gene expression profiles could be indicative of disease progression by comparing the 1- vs. 8-wk TAC. Lastly, since protein expression is functionally relevant, we monitored time-of-day cycling for the analogous cardiac proteins. This approach is generally applicable and can lead to new understanding of disease.
Tamplin, Owen J; Cox, Brian J; Rossant, Janet
2011-12-15
The node and notochord are key tissues required for patterning of the vertebrate body plan. Understanding the gene regulatory network that drives their formation and function is therefore important. Foxa2 is a key transcription factor at the top of this genetic hierarchy and finding its targets will help us to better understand node and notochord development. We performed an extensive microarray-based gene expression screen using sorted embryonic notochord cells to identify early notochord-enriched genes. We validated their specificity to the node and notochord by whole mount in situ hybridization. This provides the largest available resource of notochord-expressed genes, and therefore candidate Foxa2 target genes in the notochord. Using existing Foxa2 ChIP-seq data from adult liver, we were able to identify a set of genes expressed in the notochord that had associated regions of Foxa2-bound chromatin. Given that Foxa2 is a pioneer transcription factor, we reasoned that these sites might represent notochord-specific enhancers. Candidate Foxa2-bound regions were tested for notochord specific enhancer function in a zebrafish reporter assay and 7 novel notochord enhancers were identified. Importantly, sequence conservation or predictive models could not have readily identified these regions. Mutation of putative Foxa2 binding elements in two of these novel enhancers abrogated reporter expression and confirmed their Foxa2 dependence. The combination of highly specific gene expression profiling and genome-wide ChIP analysis is a powerful means of understanding developmental pathways, even for small cell populations such as the notochord. Copyright © 2011 Elsevier Inc. All rights reserved.
Lake, Jennifer; Gravel, Catherine; Koko, Gabriel Koffi D; Robert, Claude; Vandenberg, Grant W
2010-03-01
Phosphorus (P)-responsive genes and how they regulate renal adaptation to phosphorus-deficient diets in animals, including fish, are not well understood. RNA abundance profiling using cDNA microarrays is an efficient approach to study nutrient-gene interactions and identify these dietary P-responsive genes. To test the hypothesis that dietary P-responsive genes are differentially expressed in fish fed varying P levels, rainbow trout were fed a practical high-P diet (R20: 0.96% P) or a low-P diet (R0: 0.38% P) for 7 weeks. The differentially-expressed genes between dietary groups were identified and compared from the kidney by combining suppressive subtractive hybridization (SSH) with cDNA microarray analysis. A number of genes were confirmed by real-time PCR, and correlated with plasma and bone P concentrations. Approximately 54 genes were identified as potential dietary P-responsive after 7 weeks on a diet deficient in P according to cDNA microarray analysis. Of 18 selected genes, 13 genes were confirmed to be P-responsive at 7 weeks by real-time PCR analysis, including: iNOS, cytochrome b, cytochrome c oxidase subunit II, alpha-globin I, beta-globin, ATP synthase, hyperosmotic protein 21, COL1A3, Nkef, NDPK, glucose phosphate isomerase 1, Na+/H+ exchange protein and GDP dissociation inhibitor 2. Many of these dietary P-responsive genes responded in a moderate way (R0/R20 ratio: <2-3 or >0.5) and in a transient manner to dietary P limitation. In summary, renal adaptation to dietary P deficiency in trout involves changes in the expression of several genes, suggesting a profile of metabolic stress, since many of these differentially-expressed candidates are associated with the cellular adaptive responses. Crown Copyright 2009. Published by Elsevier Inc. All rights reserved.
Persson, Anna-Karin; Gebauer, Mathias; Jordan, Suzana; Metz-Weidmann, Christiane; Schulte, Anke M; Schneider, Hans-Christoph; Ding-Pfennigdorff, Danping; Thun, Jonas; Xu, Xiao-Jun; Wiesenfeld-Hallin, Zsuzsanna; Darvasi, Ariel; Fried, Kaj; Devor, Marshall
2009-01-01
Background Nerve injury-triggered hyperexcitability in primary sensory neurons is considered a major source of chronic neuropathic pain. The hyperexcitability, in turn, is thought to be related to transcriptional switching in afferent cell somata. Analysis using expression microarrays has revealed that many genes are regulated in the dorsal root ganglion (DRG) following axotomy. But which contribute to pain phenotype versus other nerve injury-evoked processes such as nerve regeneration? Using the L5 spinal nerve ligation model of neuropathy we examined differential changes in gene expression in the L5 (and L4) DRGs in five mouse strains with contrasting susceptibility to neuropathic pain. We sought genes for which the degree of regulation correlates with strain-specific pain phenotype. Results In an initial experiment six candidate genes previously identified as important in pain physiology were selected for in situ hybridization to DRG sections. Among these, regulation of the Na+ channel α subunit Scn11a correlated with levels of spontaneous pain behavior, and regulation of the cool receptor Trpm8 correlated with heat hypersensibility. In a larger scale experiment, mRNA extracted from individual mouse DRGs was processed on Affymetrix whole-genome expression microarrays. Overall, 2552 ± 477 transcripts were significantly regulated in the axotomized L5DRG 3 days postoperatively. However, in only a small fraction of these was the degree of regulation correlated with pain behavior across strains. Very few genes in the "uninjured" L4DRG showed altered expression (24 ± 28). Conclusion Correlational analysis based on in situ hybridization provided evidence that differential regulation of Scn11a and Trpm8 contributes to across-strain variability in pain phenotype. This does not, of course, constitute evidence that the others are unrelated to pain. 
Correlational analysis based on microarray data yielded a larger "look-up table" of genes whose regulation likely contributes to pain variability. While this list is enriched in genes of potential importance for pain physiology, and is relatively free of the bias inherent in the candidate gene approach, additional steps are required to clarify which transcripts on the list are in fact of functional importance. PMID:19228393
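The across-strain correlational analysis described in this abstract can be illustrated with a Pearson correlation between a gene's degree of regulation and a pain score, one value per strain. This is a sketch under stated assumptions: the abstract does not name the correlation statistic used, and the strain scores, gene labels and fold-change values below are invented.

```python
import math


def pearson_r(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)


# Hypothetical pain scores for five strains, and per-strain regulation
# (for example, log2 fold change after nerve injury) for two made-up genes.
pain_scores = [1.0, 2.0, 3.0, 4.0, 5.0]
regulation = {
    "geneX": [0.5, 1.0, 1.5, 2.0, 2.5],  # tracks the pain score closely
    "geneY": [2.0, 0.5, 1.8, 0.6, 1.9],  # strongly regulated, but unrelated to pain
}

# Rank genes by how strongly their regulation correlates with the phenotype.
correlations = {gene: pearson_r(values, pain_scores) for gene, values in regulation.items()}
ranked = sorted(correlations, key=lambda gene: abs(correlations[gene]), reverse=True)
print(ranked)
```

The second made-up gene shows the point made in the abstract: a transcript can be clearly regulated after injury without its degree of regulation following the pain phenotype across strains. With only five strains, any such correlation would need cautious interpretation.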
Cirelli, C; Tononi, G
1999-06-01
The consequences of sleep and sleep deprivation at the molecular level are largely unexplored. Knowledge of such molecular events is essential to understand the restorative processes occurring during sleep as well as the cellular mechanisms of sleep regulation. Here we review the available data about changes in neural gene expression across different behavioural states using candidate gene approaches such as in situ hybridization and immunocytochemistry. We then describe new techniques for systematic screening of gene expression in the brain, such as subtractive hybridization, mRNA differential display, and cDNA microarray technology, outlining advantages and disadvantages of these methods. Finally, we summarize our initial results of a systematic screening of gene expression in the rat brain across behavioural states using mRNA differential display and cDNA microarray technology. The expression pattern of approximately 7000 genes was analysed in the cerebral cortex of rats after 3 h of spontaneous sleep, 3 h of spontaneous waking, or 3 h of sleep deprivation. While the majority of transcripts were expressed at the same level among these three conditions, 14 mRNAs were modulated by sleep and waking. Six transcripts, four more expressed in waking and two more expressed in sleep, corresponded to novel genes. The eight known transcripts were all expressed at higher levels in waking than in sleep and included transcription factors and mitochondrial genes. A possible role for these known transcripts in mediating neural plasticity during waking is discussed.
Identifying novel glioma associated pathways based on systems biology level meta-analysis.
Hu, Yangfan; Li, Jinquan; Yan, Wenying; Chen, Jiajia; Li, Yin; Hu, Guang; Shen, Bairong
2013-01-01
Recent advances in microarray technology and in genomics, proteomics, and metabolomics pose a great challenge: integrating these "-omics" data to analyze complex disease. Glioma is an extremely aggressive and lethal form of brain tumor, and thus the study of the molecular mechanism underlying glioma remains very important. To date, most studies have focused on detecting the differentially expressed genes in glioma. However, meta-analysis at the pathway level based on multiple microarray datasets has not been systematically pursued. In this study, we therefore developed a systems biology based approach that integrates three types of omics data to identify common pathways in glioma. First, a meta-analysis was performed to study the overlap of signatures at different levels based on the microarray gene expression data of glioma. Across these gene expression datasets, 12 pathways in the GeneGO database were found to be shared by four stages. Then, microRNA expression profiles and ChIP-seq data were integrated for further pathway enrichment analysis. As a result, we suggest that 5 of these pathways could serve as putative pathways in glioma. Among them, the pathway of TGF-beta-dependent induction of EMT via SMAD is of particular importance. Our results demonstrate that meta-analysis at the systems biology level provides a more useful approach to studying the molecular mechanism of complex disease. The integration of different types of omics data, including gene expression microarrays, microRNA and ChIP-seq data, suggests some common pathways correlated with glioma. These findings will offer useful potential candidates for targeted therapeutic intervention of glioma.
Drost, Derek R; Novaes, Evandro; Boaventura-Novaes, Carolina; Benedict, Catherine I; Brown, Ryan S; Yin, Tongming; Tuskan, Gerald A; Kirst, Matias
2009-06-01
Microarrays have demonstrated significant power for genome-wide analyses of gene expression, and recently have also revolutionized the genetic analysis of segregating populations by genotyping thousands of loci in a single assay. Although microarray-based genotyping approaches have been successfully applied in yeast and several inbred plant species, their power has not been proven in an outcrossing species with extensive genetic diversity. Here we have developed methods for high-throughput microarray-based genotyping in such species using a pseudo-backcross progeny of 154 individuals of Populus trichocarpa and P. deltoides analyzed with long-oligonucleotide in situ-synthesized microarray probes. Our analysis resulted in high-confidence genotypes for 719 single-feature polymorphism (SFP) and 1014 gene expression marker (GEM) candidates. Using these genotypes and an established microsatellite (SSR) framework map, we produced a high-density genetic map comprising over 600 SFPs, GEMs and SSRs. The abundance of gene-based markers allowed us to localize over 35 million base pairs of previously unplaced whole-genome shotgun (WGS) scaffold sequence to putative locations in the genome of P. trichocarpa. A high proportion of sampled scaffolds could be verified for their placement with independently mapped SSRs, demonstrating the previously un-utilized power that high-density genotyping can provide in the context of map-based WGS sequence reassembly. Our results provide a substantial contribution to the continued improvement of the Populus genome assembly, while demonstrating the feasibility of microarray-based genotyping in a highly heterozygous population. The strategies presented are applicable to genetic mapping efforts in all plant species with similarly high levels of genetic diversity.
Construction of diagnosis system and gene regulatory networks based on microarray analysis.
Hong, Chun-Fu; Chen, Ying-Chen; Chen, Wei-Chun; Tu, Keng-Chang; Tsai, Meng-Hsiun; Chan, Yung-Kuan; Yu, Shyr Shen
2018-05-01
A microarray analysis generally contains expression data of thousands of genes, but most of them are irrelevant to the disease of interest, making analyzing the genes concerning specific diseases complicated. Therefore, filtering out a few essential genes as well as their regulatory networks is critical, and a disease can be easily diagnosed just depending on the expression profiles of a few critical genes. In this study, a target gene screening (TGS) system, which is a microarray-based information system that integrates F-statistics, pattern recognition matching, a two-layer K-means classifier, a Parameter Detection Genetic Algorithm (PDGA), a genetic-based gene selector (GBG selector) and the association rule, was developed to screen out a small subset of genes that can discriminate malignant stages of cancers. During the first stage, F-statistic, pattern recognition matching, and a two-layer K-means classifier were applied in the system to filter out the 20 critical genes most relevant to ovarian cancer from 9600 genes, and the PDGA was used to decide the fittest values of the parameters for these critical genes. Among the 20 critical genes, 15 are associated with cancer progression. In the second stage, we further employed a GBG selector and the association rule to screen out seven target gene sets, each with only four to six genes, and each of which can precisely identify the malignancy stage of ovarian cancer based on their expression profiles. We further deduced the gene regulatory networks of the 20 critical genes by applying the Pearson correlation coefficient to evaluate the correlationship between the expression of each gene at the same stages and at different stages. Correlationships between gene pairs were calculated, and then, three regulatory networks were deduced. Their correlationships were further confirmed by the Ingenuity pathway analysis. 
The prognostic significances of the genes identified via regulatory networks were examined using online tools, and most represented biomarker candidates. In summary, our proposed system provides a new strategy to identify critical genes or biomarkers, as well as their regulatory networks, from microarray data. Copyright © 2018. Published by Elsevier Inc.
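The network-deduction step above relies on the Pearson correlation coefficient between gene expression profiles. A minimal Python sketch of that calculation is given below; the gene names, expression values, and the 0.9 cutoff are assumptions chosen for illustration and are not taken from the study.

```python
import math
from itertools import combinations


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length numeric sequences."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mx, my = sum(x) / len(x), sum(y) / len(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    if sx == 0 or sy == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return cov / (sx * sy)


# Hypothetical expression values across four samples (illustrative only).
expression = {
    "GENE_A": [1.0, 2.0, 3.0, 4.0],
    "GENE_B": [2.0, 4.0, 6.0, 8.0],
    "GENE_C": [4.0, 3.0, 2.0, 1.0],
}


def correlated_pairs(expr, threshold=0.9):
    """List gene pairs whose |r| meets the (assumed) threshold."""
    pairs = []
    for g1, g2 in combinations(sorted(expr), 2):
        r = pearson(expr[g1], expr[g2])
        if abs(r) >= threshold:
            pairs.append((g1, g2, r))
    return pairs


for g1, g2, r in correlated_pairs(expression):
    print(f"{g1} - {g2}: r = {r:.3f}")
```

Strong correlation alone does not establish regulation, which is presumably why the abstract reports a separate pathway-analysis confirmation.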
Coral Reef Genomics: Developing tools for functional genomics of coral symbiosis
DOE Office of Scientific and Technical Information (OSTI.GOV)
Schwarz, Jodi; Brokstein, Peter; Manohar, Chitra
Symbioses between cnidarians and dinoflagellates in the genus Symbiodinium are widespread in the marine environment. The importance of this symbiosis to reef-building corals and reef nutrient and carbon cycles is well documented, but little is known about the mechanisms by which the partners establish and regulate the symbiosis. Because the dinoflagellate symbionts live inside the cells of their host coral, the interactions between the partners occur on cellular and molecular levels, as each partner alters the expression of genes and proteins to facilitate the partnership. These interactions can be examined using high-throughput techniques that allow thousands of genes to be examined simultaneously. We are developing the groundwork so that we can use DNA microarray profiling to identify genes involved in the Montastraea faveolata and Acropora palmata symbioses. Here we report results from the initial steps in this microarray initiative, that is, the construction of cDNA libraries from 4 of 16 target stages, sequencing of 3450 cDNA clones to generate Expressed Sequence Tags (ESTs), and annotation of the ESTs to identify candidate genes to include in the microarrays. An understanding of how the coral-dinoflagellate symbiosis is regulated will have implications for atmospheric and ocean sciences, conservation biology, the study and diagnosis of coral bleaching and disease, and comparative studies of animal-protist interactions.
Microarray-based DNA methylation study of Ewing's sarcoma of the bone.
Park, Hye-Rim; Jung, Woon-Won; Kim, Hyun-Sook; Park, Yong-Koo
2014-10-01
Alterations in DNA methylation patterns are a hallmark of malignancy. However, the majority of epigenetic studies of Ewing’s sarcoma have focused on the analysis of only a few candidate genes. Comprehensive studies are thus lacking and are required. The aim of the present study was to identify novel methylation markers in Ewing’s sarcoma using microarray analysis. The current study reports the microarray-based DNA methylation study of 1,505 CpG sites of 807 cancer-related genes from 69 Ewing’s sarcoma samples. The Illumina GoldenGate Methylation Cancer Panel I microarray was used, and with the appropriate controls (n=14), a total of 92 hypermethylated genes were identified in the Ewing’s sarcoma samples. The majority of the hypermethylated genes were associated with cell adhesion, cell regulation, development and signal transduction. The overall methylation mean values were compared between patients who survived and those that did not. The overall methylation mean was significantly higher in the patients who did not survive (0.25±0.03) than in those who did (0.22±0.05) (P=0.0322). However, the overall methylation mean was not found to significantly correlate with age, gender or tumor location. GDF10, OSM, APC and HOXA11 were the most significant differentially-methylated genes, however, their methylation levels were not found to significantly correlate with the survival rate. The DNA methylation profile of Ewing’s sarcoma was characterized and 92 genes that were significantly hypermethylated were detected. A trend towards a more aggressive behavior was identified in the methylated group. The results of this study indicated that methylation may be significant in the development of Ewing’s sarcoma. PMID:25202378
NASA Technical Reports Server (NTRS)
Weitzeal, A. J.; Wyatt, S. E.; Parsons-Wingerter, P.
2016-01-01
Venation patterning in leaves is a major determinant of photosynthesis efficiency because of its dependency on vascular transport of photoassimilates, water, and minerals. Arabidopsis thaliana grown in microgravity show delayed growth and leaf maturation. Gene expression data from the roots, hypocotyl, and leaves of A. thaliana grown during spaceflight vs. ground control analyzed by Affymetrix microarray are available through NASA's GeneLab (GLDS-7). We analyzed the data for differential expression of genes in leaves resulting from the effects of spaceflight on vascular patterning. Two genes were found by preliminary analysis to be upregulated during spaceflight that may be related to vascular formation. The genes are responsible for coding an ARGOS like protein (potentially affecting cell elongation in the leaves), and an F-box/kelch-repeat protein (possibly contributing to protoxylem specification). Further analysis that will focus on raw data quality assessment and a moderated t-test may further confirm upregulation of the two genes and/or identify other gene candidates. Plants defective in these genes will then be assessed for phenotype by the mapping and quantification of leaf vascular patterning by NASA's VESsel GENeration (VESGEN) software to model specific vascular differences of plants grown in spaceflight.
Microarray analysis of retinal gene expression in chicks during imposed myopic defocus.
Schippert, Ruth; Schaeffel, Frank; Feldkaemper, Marita Pauline
2008-08-31
The retina plays an important regulatory role in ocular growth. To screen for new retinal candidate genes that could be involved in the inhibition of ocular growth, we used chick microarrays to analyze the changes in retinal mRNA expression after myopic defocus was imposed by positive lens wear. Four male white leghorn chicks, aged nine days, wore +6.9D spectacle lenses over both eyes for 24 h. Four untreated age-matched male chicks from the same batch served as controls. The chicks were euthanized, and retinas from both eyes of each chick were pooled. RNA was isolated and labeled cRNA was prepared. These samples were hybridized to Affymetrix GeneChip Chicken Genome arrays with more than 28,000 characterized genes. After comparison of multiple normalization methods, GC-RMA and a false-discovery rate of 6% were chosen for normalization of the data. The expression of 16 candidate genes was further studied using semiquantitative real-time RT-PCR. In addition, the mRNA expression of some of these candidate genes was assessed in chicks that wore either +6.9D lenses for 4 h or -7D lenses for 24 h. In total, 123 transcripts were found to be differentially expressed (p<0.05; at least 1.5-fold change in expression level), with an absolute mean fold-change of 1.97±1.16 (mean±standard deviation). Nine of the sixteen genes that were examined by real-time RT-PCR were validated. Regardless of whether positive or negative lenses were worn, six of these nine genes were regulated in the same direction after 24 h: arginyltransferase 1 (ATE1), E74-like factor 1 (ELF1), growth factor receptor-bound protein 2 (GRB2), SHQ1 homolog (S. cerevisiae) (SHQ1), spectrin, beta, non-erythrocytic 1 (SPTBN1), and prepro-urotensin II-related peptide (pp-URP). Three genes responded differently to positive and negative lens treatment after 24 h: ATP-binding cassette, sub-family C, member 10 (ABCC10), CD226 molecule (CD226) and oxysterol binding protein 2 (OSBP2).
The validated genes that were regulated only by myopic defocus may represent elements in a pathway generating a "stop-signal" for eye growth. Some of the genes identified in this study have so far not been described in the retina. Further investigation of their function may improve the understanding of the signaling cascades in emmetropization. More generally, published microarray data are variable among different animal models (mouse, chick, monkeys), tissues (retina, retina/retinal pigment epithelium), treatments (diffusers, lenses, lid-suture), as well as different treatment durations (hours, days), and comparisons remain difficult. That only a small number of common genes were found emphasizes the need for careful normalization of the experimental parameters.
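The selection rule quoted in this abstract (p<0.05 and at least a 1.5-fold change in either direction) can be sketched in a few lines of Python. The probe names and values below are invented for illustration, and treating fold change as a treated/control ratio is an assumption about how the data would be represented.

```python
from dataclasses import dataclass


@dataclass
class Transcript:
    name: str
    fold_change: float  # treated / control ratio; values below 1 mean down-regulation
    p_value: float


def abs_fold(fold_change):
    """Express a ratio as a magnitude >= 1 regardless of direction."""
    if fold_change <= 0:
        raise ValueError("fold change must be a positive ratio")
    return fold_change if fold_change >= 1 else 1 / fold_change


def is_differential(t, min_fold=1.5, alpha=0.05):
    """Apply the two thresholds stated in the abstract."""
    return t.p_value < alpha and abs_fold(t.fold_change) >= min_fold


# Hypothetical probes (illustrative values only).
transcripts = [
    Transcript("probe_1", 2.0, 0.01),   # up 2-fold, significant
    Transcript("probe_2", 0.5, 0.03),   # down 2-fold, significant
    Transcript("probe_3", 1.2, 0.001),  # significant but below the fold cutoff
    Transcript("probe_4", 3.0, 0.20),   # large change but not significant
]

selected = [t.name for t in transcripts if is_differential(t)]
print(selected)
```

The sketch leaves out the false-discovery-rate step mentioned in the abstract, since the source does not say which procedure was used.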
Differential gene expression of wheat progeny with contrasting levels of transpiration efficiency.
Xue, Gang-Ping; McIntyre, C Lynne; Chapman, Scott; Bower, Neil I; Way, Heather; Reverter, Antonio; Clarke, Bryan; Shorter, Ray
2006-08-01
High water use efficiency or transpiration efficiency (TE) in wheat is a desirable physiological trait for increasing grain yield under water-limited environments. The identification of genes associated with this trait would facilitate the selection for genotypes with higher TE using molecular markers. We performed an expression profiling (microarray) analysis of approximately 16,000 unique wheat ESTs to identify genes that were differentially expressed between wheat progeny lines with contrasting TE levels from a cross between Quarrion (high TE) and Genaro 81 (low TE). We also conducted a second microarray analysis to identify genes responsive to drought stress in wheat leaves. Ninety-three genes that were differentially expressed between high and low TE progeny lines were identified. One fifth of these genes were markedly responsive to drought stress. Several potential growth-related regulatory genes, which were down-regulated by drought, were expressed at a higher level in the high TE lines than the low TE lines and are potentially associated with a biomass production component of the Quarrion-derived high TE trait. Eighteen of the TE differentially expressed genes were further analysed using quantitative RT-PCR on a separate set of plant samples from those used for microarray analysis. The expression levels of 11 of the 18 genes were positively correlated with the high TE trait, measured as carbon isotope discrimination (Delta(13)C). These data indicate that some of these TE differentially expressed genes are candidates for investigating processes that underlie the high TE trait or for use as expression quantitative trait loci (eQTLs) for TE.
Integrative Approach to Pain Genetics Identifies Pain Sensitivity Loci across Diseases
Ruau, David; Dudley, Joel T.; Chen, Rong; Phillips, Nicholas G.; Swan, Gary E.; Lazzeroni, Laura C.; Clark, J. David
2012-01-01
Identifying human genes relevant for the processing of pain requires difficult-to-conduct and expensive large-scale clinical trials. Here, we examine a novel integrative paradigm for data-driven discovery of pain gene candidates, taking advantage of the vast amount of existing disease-related clinical literature and gene expression microarray data stored in large international repositories. First, thousands of diseases were ranked according to a disease-specific pain index (DSPI), derived from Medical Subject Headings (MeSH) annotations in MEDLINE. Second, gene expression profiles of 121 of these human diseases were obtained from public sources. Third, genes with expression variation significantly correlated with DSPI across diseases were selected as candidate pain genes. Finally, selected candidate pain genes were genotyped in an independent human cohort and prospectively evaluated for significant association between variants and measures of pain sensitivity. The strongest signal was with rs4512126 (5q32, ABLIM3, P = 1.3×10^-10) for the sensitivity to cold pressor pain in males, but not in females. Significant associations were also observed with rs12548828, rs7826700 and rs1075791 on 8q22.2 within NCALD (P = 1.7×10^-4, 1.8×10^-4, and 2.2×10^-4, respectively). Our results demonstrate the utility of a novel paradigm that integrates publicly available disease-specific gene expression data with clinical data curated from MEDLINE to facilitate the discovery of pain-relevant genes. This data-derived list of pain gene candidates enables additional focused and efficient biological studies validating additional candidates. PMID:22685391
Ross, Kenneth Andrew
2011-02-03
During gene conversion, genetic information is transferred unidirectionally between highly homologous but non-allelic regions of DNA. While germ-line gene conversion has been implicated in the pathogenesis of some diseases, somatic gene conversion has remained technically difficult to investigate on a large scale. A novel analysis technique is proposed for detecting the signature of somatic gene conversion from SNP microarray data. The Wellcome Trust Case Control Consortium has gathered SNP microarray data for two control populations and cohorts for bipolar disorder (BD), cardiovascular disease (CAD), Crohn's disease (CD), hypertension (HT), rheumatoid arthritis (RA), type-1 diabetes (T1D) and type-2 diabetes (T2D). Using the new analysis technique, the seven disease cohorts are analyzed to identify cohort-specific SNPs at which conversion is predicted. The quality of the predictions is assessed by identifying known disease associations for genes in the homologous duplicons, and comparing the frequency of such associations with background rates. Of 28 disease/locus pairs meeting stringent conditions, 22 show various degrees of disease association, compared with only 8 of 70 in a mock study designed to measure the background association rate (P < 10^-9). Additional candidate genes are identified using less stringent filtering conditions. In some cases, somatic deletions appear likely. RA has a distinctive pattern of events relative to other diseases. Similarities in patterns are apparent between BD and HT. The associations derived represent the first evidence that somatic gene conversion could be a significant causative factor in each of the seven diseases. The specific genes provide potential insights about disease mechanisms, and are strong candidates for further study. PMID:21291537
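The comparison of 22 of 28 associated pairs against 8 of 70 in the mock study can be checked with a one-sided Fisher's exact test. The abstract does not say which test produced its P value, so the choice of test here is an assumption, and the function name is hypothetical. A minimal Python sketch using only the standard library:

```python
from math import comb


def fisher_one_sided(a, b, c, d):
    """Upper-tail Fisher's exact test for the 2x2 table [[a, b], [c, d]].

    Returns P(X >= a) under the hypergeometric null, where X is the count
    in the top-left cell with all row and column totals held fixed.
    """
    row1 = a + b           # size of the first group
    col1 = a + c           # total "associated" across both groups
    total = a + b + c + d
    denom = comb(total, row1)
    upper = min(row1, col1)
    num = sum(
        comb(col1, k) * comb(total - col1, row1 - k)
        for k in range(a, upper + 1)
    )
    return num / denom


# Counts from the abstract: 22 of 28 associated vs. 8 of 70 in the mock study.
p = fisher_one_sided(22, 28 - 22, 8, 70 - 8)
print(f"one-sided p = {p:.3g}")
```

Under this assumed test the result is far below conventional significance thresholds, in line with the order of magnitude the abstract reports.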
Identification of Quantitative Trait Loci (QTL) and Candidate Genes for Cadmium Tolerance in Populus
DOE Office of Scientific and Technical Information (OSTI.GOV)
Induri, Brahma R; Ellis, Danielle R; Slavov, Gancho
2012-01-01
Knowledge of genetic variation in response of Populus to heavy metals like cadmium (Cd) is an important step in understanding the underlying mechanisms of tolerance. In this study, a pseudo-backcross pedigree of Populus trichocarpa and Populus deltoides was characterized for Cd exposure. The pedigree showed significant variation for Cd tolerance thus enabling the identification of relatively tolerant and susceptible genotypes for intensive characterization. A total of 16 QTLs at logarithm of odds (LOD) ratio > 2.5, were found to be associated with total dry weight, its components, and root volume. Four major QTLs for total dry weight were mapped to different linkage groups in control (LG III) and Cd conditions (LG XVI) and had opposite allelic effects on Cd tolerance, suggesting that these genomic regions were differentially controlled. The phenotypic variation explained by Cd QTL for all traits under study varied from 5.9% to 11.6% and averaged 8.2% across all QTL. Leaf Cd contents also showed significant variation suggesting the phytoextraction potential of Populus genotypes, though heritability of this trait was low (0.22). A whole-genome microarray study was conducted by using two genotypes with extreme responses for Cd tolerance in the above study and differentially expressed genes were identified. Candidate genes including CAD2 (CADMIUM SENSITIVE 2), HMA5 (HEAVY METAL ATPase5), ATGTST1 (Arabidopsis thaliana Glutathione S-Transferase1), ATGPX6 (Glutathione peroxidase 6), and ATMRP 14 (Arabidopsis thaliana Multidrug Resistance associated Protein 14) were identified from QTL intervals and microarray study. Functional characterization of these candidate genes could enhance phytoremediation capabilities of Populus.
Integrative Assessment of Chlorine-Induced Acute Lung Injury in Mice
Pope-Varsalona, Hannah; Concel, Vincent J.; Liu, Pengyuan; Bein, Kiflai; Berndt, Annerose; Martin, Timothy M.; Ganguly, Koustav; Jang, An Soo; Brant, Kelly A.; Dopico, Richard A.; Upadhyay, Swapna; Di, Y. P. Peter; Hu, Zhen; Vuga, Louis J.; Medvedovic, Mario; Kaminski, Naftali; You, Ming; Alexander, Danny C.; McDunn, Jonathan E.; Prows, Daniel R.; Knoell, Daren L.
2012-01-01
The genetic basis for the underlying individual susceptibility to chlorine-induced acute lung injury is unknown. To uncover the genetic basis and pathophysiological processes that could provide additional homeostatic capacities during lung injury, 40 inbred murine strains were exposed to chlorine, and haplotype association mapping was performed. The identified single-nucleotide polymorphism (SNP) associations were evaluated through transcriptomic and metabolomic profiling. Using ≥ 10% allelic frequency and ≥ 10% phenotype explained as threshold criteria, promoter SNPs that could eliminate putative transcriptional factor recognition sites in candidate genes were assessed by determining transcript levels through microarray and reverse real-time PCR during chlorine exposure. The mean survival time varied by approximately 5-fold among strains, and SNP associations were identified for 13 candidate genes on chromosomes 1, 4, 5, 9, and 15. Microarrays revealed several differentially enriched pathways, including protein transport (decreased more in the sensitive C57BLKS/J lung) and protein catabolic process (increased more in the resistant C57BL/10J lung). Lung metabolomic profiling revealed 95 of the 280 metabolites measured were altered by chlorine exposure, and included alanine, which decreased more in the C57BLKS/J than in the C57BL/10J strain, and glutamine, which increased more in the C57BL/10J than in the C57BLKS/J strain. Genetic associations from haplotype mapping were strengthened by an integrated assessment using transcriptomic and metabolomic profiling. The leading candidate genes associated with increased susceptibility to acute lung injury in mice included Klf4, Sema7a, Tns1, Aacs, and a gene that encodes an amino acid carrier, Slc38a4. PMID:22447970
Using expression genetics to study the neurobiology of ethanol and alcoholism.
Farris, Sean P; Wolen, Aaron R; Miles, Michael F
2010-01-01
Recent simultaneous progress in human and animal model genetics and the advent of microarray whole genome expression profiling have produced prodigious data sets on genetic loci, potential candidate genes, and differential gene expression related to alcoholism and ethanol behaviors. Validated target genes or gene networks functioning in alcoholism are still of meager proportions. Genetical genomics, which combines genetic analysis of both traditional phenotypes and whole genome expression data, offers a potential methodology for characterizing brain gene networks functioning in alcoholism. This chapter will describe concepts, approaches, and recent findings in the field of genetical genomics as it applies to alcohol research. Copyright 2010 Elsevier Inc. All rights reserved.
Farber, Charles R; van Nas, Atila; Ghazalpour, Anatole; Aten, Jason E; Doss, Sudheer; Sos, Brandon; Schadt, Eric E; Ingram-Drake, Leslie; Davis, Richard C; Horvath, Steve; Smith, Desmond J; Drake, Thomas A; Lusis, Aldons J
2009-01-01
Numerous quantitative trait loci (QTLs) affecting bone traits have been identified in the mouse; however, few of the underlying genes have been discovered. To improve the process of transitioning from QTL to gene, we describe an integrative genetics approach, which combines linkage analysis, expression QTL (eQTL) mapping, causality modeling, and genetic association in outbred mice. In C57BL/6J × C3H/HeJ (BXH) F2 mice, nine QTLs regulating femoral BMD were identified. To select candidate genes from within each QTL region, microarray gene expression profiles from individual F2 mice were used to identify 148 genes whose expression was correlated with BMD and regulated by local eQTLs. Many of the genes that were the most highly correlated with BMD have been previously shown to modulate bone mass or skeletal development. Candidates were further prioritized by determining whether their expression was predicted to underlie variation in BMD. Using network edge orienting (NEO), a causality modeling algorithm, 18 of the 148 candidates were predicted to be causally related to differences in BMD. To fine-map QTLs, markers in outbred MF1 mice were tested for association with BMD. Three chromosome 11 SNPs were identified that were associated with BMD within the Bmd11 QTL. Finally, our approach provides strong support for Wnt9a, Rasd1, or both underlying Bmd11. Integration of multiple genetic and genomic data sets can substantially improve the efficiency of QTL fine-mapping and candidate gene identification. PMID:18767929
Carlson, Kimberly A.; Gardner, Kylee; Pashaj, Anjeza; Carlson, Darby J.; Yu, Fang; Eudy, James D.; Zhang, Chi; Harshman, Lawrence G.
2015-01-01
Aging is a complex process characterized by a steady decline in an organism's ability to perform life-sustaining tasks. In the present study, two cages of approximately 12,000 mated Drosophila melanogaster females were used as a source of RNA from individuals sampled frequently as a function of age. A linear model for microarray data method was used for the microarray analysis to adjust for the box effect; it identified 1,581 candidate aging genes. Cluster analyses using a self-organizing map algorithm on the 1,581 significant genes identified gene expression patterns across different ages. Genes involved in immune system function and regulation, chorion assembly and function, and metabolism were all significantly differentially expressed as a function of age. The temporal pattern of data indicated that gene expression related to aging is affected relatively early in life span. In addition, the temporal variance in gene expression in immune function genes was compared to a random set of genes. There was an increase in the variance of gene expression within each cohort, which was not observed in the set of random genes. This observation is compatible with the hypothesis that D. melanogaster immune function genes lose control of gene expression as flies age. PMID:26090231
A Genomics Approach to Deciphering Lignin Biosynthesis in Switchgrass
Shen, Hui; Mazarei, Mitra; Hisano, Hiroshi; Escamilla-Trevino, Luis; Fu, Chunxiang; Pu, Yunqiao; Rudis, Mary R.; Tang, Yuhong; Xiao, Xirong; Jackson, Lisa; Li, Guifen; Hernandez, Tim; Chen, Fang; Ragauskas, Arthur J.; Stewart, C. Neal; Wang, Zeng-Yu; Dixon, Richard A.
2013-01-01
It is necessary to overcome recalcitrance of the biomass to saccharification (sugar release) to make switchgrass (Panicum virgatum) economically viable as a feedstock for liquid biofuels. Lignin content correlates negatively with sugar release efficiency in switchgrass, but selecting the right gene candidates for engineering lignin biosynthesis in this tetraploid outcrossing species is not straightforward. To assist this endeavor, we have used an inducible switchgrass cell suspension system for studying lignin biosynthesis in response to exogenous brassinolide. By applying a combination of protein sequence phylogeny with whole-genome microarray analyses of induced cell cultures and developing stem internode sections, we have generated a list of candidate monolignol biosynthetic genes for switchgrass. Several genes that were strongly supported through our bioinformatics analysis as involved in lignin biosynthesis were confirmed by gene silencing studies, in which lignin levels were reduced as a result of targeting a single gene. However, candidate genes encoding enzymes involved in the early steps of the currently accepted monolignol biosynthesis pathway in dicots may have functionally redundant paralogues in switchgrass and therefore require further evaluation. This work provides a blueprint and resources for the systematic genome-wide study of the monolignol pathway in switchgrass, as well as other C4 monocot species. PMID:24285795
Filling gaps in PPAR-alpha signaling through comparative nutrigenomics analysis.
Cavalieri, Duccio; Calura, Enrica; Romualdi, Chiara; Marchi, Emmanuela; Radonjic, Marijana; Van Ommen, Ben; Müller, Michael
2009-12-11
The application of high-throughput genomic tools in nutrition research is a widespread practice. However, it is becoming increasingly clear that the outcome of individual expression studies is insufficient for the comprehensive understanding of such a complex field. Currently, the availability of large amounts of expression data in public repositories has opened up new challenges for microarray data analysis. We have focused on PPARalpha, a ligand-activated transcription factor that functions as a fatty acid sensor and controls the expression of a large set of genes in various metabolic organs such as liver, small intestine or heart. The function of PPARalpha is strictly connected to the function of its target genes and, although many of these have already been identified, major elements of its physiological function remain to be uncovered. To further investigate the function of PPARalpha, we have applied a cross-species meta-analysis approach to integrate sixteen microarray datasets studying high fat diet and PPARalpha signal perturbations in different organisms. We identified 164 genes (MDEGs) that were differentially expressed in a constant way in response to a high fat diet or to perturbations in PPAR signalling. In particular, we found five genes in yeast which were highly conserved and homologous to PPARalpha targets in mammals, potential candidates to be used as models for the equivalent mammalian genes. Moreover, a screening of the MDEGs for all known transcription factor binding sites and the comparison with a human genome-wide screening of peroxisome proliferator response elements (PPREs) enabled us to identify 20 new potential candidate genes that show both a binding site and a change in expression in the conditions studied. Lastly, we found a non-random localization of the differentially expressed genes in the genome.
The results presented are potentially of great interest for summarizing the currently available expression data, exploiting the power of in silico analysis filtered by evolutionary conservation. The analysis enabled us to indicate potential gene candidates that could fill in the gaps with regard to the signalling of PPARalpha. Moreover, the non-random localization of the differentially expressed genes in the genome suggests that epigenetic mechanisms are of importance in the regulation of the transcription operated by PPARalpha.
Washio, Kana; Oka, Takashi; Abdalkader, Lamia; Muraoka, Michiko; Shimada, Akira; Oda, Megumi; Sato, Hiaki; Takata, Katsuyoshi; Kagami, Yoshitoyo; Shimizu, Norio; Kato, Seiichi; Kimura, Hiroshi; Nishizaki, Kazunori; Yoshino, Tadashi; Tsukahara, Hirokazu
2017-11-01
The human herpes virus, Epstein-Barr virus (EBV), is a known oncogenic virus and plays important roles in life-threatening T/NK-cell lymphoproliferative disorders (T/NK-cell LPD) such as hypersensitivity to mosquito bite (HMB), chronic active EBV infection (CAEBV), and NK/T-cell lymphoma/leukemia. During the clinical courses of HMB and CAEBV, patients frequently develop malignant lymphomas and the diseases passively progress sequentially. In the present study, gene expression of CD16 (-) CD56 (+) -, EBV (+) HMB, CAEBV, NK-lymphoma, and NK-leukemia cell lines, which were established from patients, was analyzed using oligonucleotide microarrays and compared to that of CD56 bright CD16 dim/- NK cells from healthy donors. Principal components analysis showed that CAEBV and NK-lymphoma cells were relatively closely located, indicating that they had similar expression profiles. Unsupervised hierarchical clustering analyses of microarray data and gene ontology analysis revealed specific gene clusters and identified several candidate genes responsible for disease that can be used to discriminate each category of NK-LPD and NK-cell lymphoma/leukemia.
Ingham, Victoria A; Jones, Christopher M; Pignatelli, Patricia; Balabanidou, Vasileia; Vontas, John; Wagstaff, Simon C; Moore, Jonathan D; Ranson, Hilary
2014-11-25
The elevated expression of enzymes with insecticide metabolism activity can lead to high levels of insecticide resistance in the malaria vector, Anopheles gambiae. In this study, adult female mosquitoes from an insecticide susceptible and resistant strain were dissected into four different body parts. RNA from each of these samples was used in microarray analysis to determine the enrichment patterns of the key detoxification gene families within the mosquito and to identify additional candidate insecticide resistance genes that may have been overlooked in previous experiments on whole organisms. A general enrichment in the transcription of genes from the four major detoxification gene families (carboxylesterases, glutathione transferases, UDP glucuronyltransferases and cytochrome P450s) was observed in the midgut and malpighian tubules. Yet the subset of P450 genes that have previously been implicated in insecticide resistance in An. gambiae shows a surprisingly varied profile of tissue enrichment, confirmed by qPCR and, for three candidates, by immunostaining. A stringent selection process was used to define a list of 105 genes that are significantly (p ≤0.001) overexpressed in body parts from the resistant versus susceptible strain. Over half of these, including all the cytochrome P450s on this list, were identified in previous whole organism comparisons between the strains, but several new candidates were detected, notably from comparisons of the transcriptomes from dissected abdomen integuments. The use of RNA extracted from the whole organism to identify candidate insecticide resistance genes has a risk of missing candidates if key genes responsible for the phenotype have restricted expression within the body and/or are overexpressed only in certain tissues.
However, as transcription of genes implicated in metabolic resistance to insecticides is not enriched in any one single organ, comparison of the transcriptome of individual dissected body parts cannot be recommended as a preferred means to identify new candidate insecticide resistance genes. Instead, the rich data set on in vivo sites of transcription should be consulted when designing follow-up qPCR validation steps, or for screening known candidates in field populations.
IFRD1 Is a Candidate Gene for SMNA on Chromosome 7q22-q23
Brkanac, Zoran; Spencer, David; Shendure, Jay; Robertson, Peggy D.; Matsushita, Mark; Vu, Tiffany; Bird, Thomas D.; Olson, Maynard V.; Raskind, Wendy H.
2009-01-01
We have established strong linkage evidence that supports mapping autosomal-dominant sensory/motor neuropathy with ataxia (SMNA) to chromosome 7q22-q32. SMNA is a rare neurological disorder whose phenotype encompasses both the central and the peripheral nervous system. In order to identify a gene responsible for SMNA, we have undertaken a comprehensive genomic evaluation of the region of linkage, including evaluation for repeat expansion and small deletions or duplications, capillary sequencing of candidate genes, and massively parallel sequencing of all coding exons. We excluded repeat expansion and small deletions or duplications as causative, and through microarray-based hybrid capture and massively parallel short-read sequencing, we identified a nonsynonymous variant in the human interferon-related developmental regulator gene 1 (IFRD1) as a disease-causing candidate. Sequence conservation, animal models, and protein structure evaluation support the involvement of IFRD1 in SMNA. Mutation analysis of IFRD1 in additional patients with similar phenotypes is needed for demonstration of causality and further evaluation of its importance in neurological diseases. PMID:19409521
Suh, Yun-Suhk; Yu, Jieun; Kim, Byung Chul; Choi, Boram; Han, Tae-Su; Ahn, Hye Seong; Kong, Seong-Ho; Lee, Hyuk-Joon; Kim, Woo Ho; Yang, Han-Kwang
2015-01-01
Purpose The purpose of this study is to investigate differentially expressed genes using DNA microarray between advanced gastric cancer (AGC) with aggressive lymph node (LN) metastasis and that with a more advanced tumor stage but without LN metastasis. Materials and Methods Five sample pairs of gastric cancer tissue and normal gastric mucosa were taken from three patients with T3N3 stage (highN) and two with T4N0 stage (lowN). Data from triplicate DNA microarray experiments were analyzed, and candidate genes were identified using a volcano plot that showed ≥ 2-fold differential expression and were significant by Welch's t test (p < 0.05) between highN and lowN. Those selected genes were validated independently by reverse-transcriptase–polymerase chain reaction (RT-PCR) using five AGC patients, and tissue-microarray (TMA) comprising 47 AGC patients. Results CFTR, LAMC2, SERPINE2, F2R, MMP7, FN1, TIMP1, plasminogen activator inhibitor-1 (PAI-1), ITGB8, SDS, and TMPRSS4 were commonly up-regulated over 2-fold in highN. REG3A, CD24, ITLN1, and WBP5 were commonly down-regulated over 2-fold in lowN. Among these genes, overexpression of PAI-1 was validated by RT-PCR, and TMA showed 16.7% (7/42) PAI-1 expression in T3N3, but none (0/5) in T4N0 (p=0.393). Conclusion DNA microarray analysis and validation by RT-PCR and TMA showed that overexpression of PAI-1 is related to aggressive LN metastasis in AGC. PMID:25687870
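The volcano-plot selection described in this abstract (at least 2-fold differential expression combined with p < 0.05) can be sketched as a simple two-threshold filter. The following is a minimal illustration only: it assumes the log2 fold changes and Welch's t test p-values have already been computed elsewhere, the function name `volcano_select` is hypothetical, and the gene labels and numbers are made up rather than taken from the study.

```python
import math


def volcano_select(rows, min_fold=2.0, max_p=0.05):
    """Split genes passing both thresholds into (up, down) lists.

    rows: iterable of (gene, log2_fold_change, p_value) tuples.
    A gene passes if |log2 fold change| >= log2(min_fold) and p < max_p.
    """
    min_log2 = math.log2(min_fold)
    up, down = [], []
    for gene, log2_fc, p in rows:
        if p >= max_p or abs(log2_fc) < min_log2:
            continue  # fails the significance or the fold-change cut
        (up if log2_fc > 0 else down).append(gene)
    return up, down


# Made-up values for illustration only; not data from the study.
rows = [
    ("geneA", 1.8, 0.01),   # up-regulated and significant
    ("geneB", -1.2, 0.03),  # down-regulated and significant
    ("geneC", 2.5, 0.20),   # large change, not significant
    ("geneD", 0.4, 0.001),  # significant, change too small
]
up, down = volcano_select(rows)
print(up, down)
```

Working on the log2 scale keeps the fold-change cut symmetric for up- and down-regulated genes, which is presumably why volcano plots use it on the x-axis.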
Identifying positive selection candidate loci for high-altitude adaptation in Andean populations
2009-01-01
High-altitude environments (>2,500 m) provide scientists with a natural laboratory to study the physiological and genetic effects of low ambient oxygen tension on human populations. One approach to understanding how life at high altitude has affected human metabolism is to survey genome-wide datasets for signatures of natural selection. In this work, we report on a study to identify selection-nominated candidate genes involved in adaptation to hypoxia in one highland group, Andeans from the South American Altiplano. We analysed dense microarray genotype data using four test statistics that detect departures from neutrality. Using a candidate gene, single nucleotide polymorphism-based approach, we identified genes exhibiting preliminary evidence of recent genetic adaptation in this population. These included genes that are part of the hypoxia-inducible transcription factor (HIF) pathway, a biochemical pathway involved in oxygen homeostasis, as well as three other genomic regions previously not known to be associated with high-altitude phenotypes. In addition to identifying selection-nominated candidate genes, we also tested whether the HIF pathway shows evidence of natural selection. Our results indicate that the genes of this biochemical pathway as a group show no evidence of having evolved in response to hypoxia in Andeans. Results from particular HIF-targeted genes, however, suggest that genes in this pathway could play a role in Andean adaptation to high altitude, even if the pathway as a whole does not show higher relative rates of evolution. These data suggest a genetic role in high-altitude adaptation and provide a basis for genotype/phenotype association studies that are necessary to confirm the role of putative natural selection candidate genes and gene regions in adaptation to altitude. PMID:20038496
Genetic Dissection of Learning and Memory in Mice
Mineur, Yann S.; Crusio, Wim E.; Sluyter, Frans
2004-01-01
In this minireview, we discuss different strategies to dissect genetically the keystones of learning and memory. First, we broadly sketch the neurogenetic analysis of complex traits in mice. We then discuss two general strategies to find genes affecting learning and memory: candidate gene studies and whole genome searches. Next, we briefly review more recently developed techniques, such as microarrays and RNA interference. In addition, we focus on gene-environment interactions and endophenotypes. All sections are illustrated with examples from the learning and memory field, including a table summarizing the latest information about genes that have been shown to have effects on learning and memory. PMID:15656270
Graubner, Felix R.; Gram, Aykut; Kautz, Ewa; Bauersachs, Stefan; Aslan, Selim; Agaoglu, Ali R.; Boos, Alois
2017-01-01
In the dog, there is no luteolysis in the absence of pregnancy. Thus, this species lacks any anti-luteolytic endocrine signal as found in other species that modulate uterine function during the critical period of pregnancy establishment. Nevertheless, in the dog an embryo-maternal communication must occur in order to prevent rejection of embryos. Based on this hypothesis, we performed microarray analysis of canine uterine samples collected during pre-attachment phase (days 10-12) and in corresponding non-pregnant controls, in order to elucidate the embryo attachment signal. An additional goal was to identify differences in uterine responses to pre-attachment embryos between dogs and other mammalian species exhibiting different reproductive patterns with regard to luteolysis, implantation, and preparation for placentation. Therefore, the canine microarray data were compared with gene sets from pigs, cattle, horses, and humans. We found 412 genes differentially regulated between the two experimental groups. The functional terms most strongly enriched in response to pre-attachment embryos related to extracellular matrix function and remodeling, and to immune and inflammatory responses. Several candidate genes were validated by semi-quantitative PCR. When compared with other species, best matches were found with human and equine counterparts. Especially for the pig, the majority of overlapping genes showed opposite expression patterns. Interestingly, 1926 genes did not pair with any of the other gene sets. Using a microarray approach, we report the uterine changes in the dog driven by the presence of embryos and compare these results with datasets from other mammalian species, finding common-, contrary-, and exclusively canine-regulated genes. PMID:28651344
Initiation of follicular atresia: gene networks during early atresia in pig ovaries.
Zhang, Jinbi; Liu, Yang; Yao, Wang; Li, Qifa; Liu, Hong-Lin; Pan, Zengxiang
2018-05-09
In mammals, more than 99% of ovarian follicles undergo a degenerative process known as atresia. The molecular events involved in atresia initiation remain incompletely understood. The objective of this study was to analyze differential gene expression profiles of medium antral ovarian follicles during early atresia in pig. The transcriptome evaluation was performed on cDNA microarrays using healthy and early atretic follicle samples and was validated by quantitative PCR. Annotation analysis applying the current database (Sus scrofa 11.1) revealed 450 significantly differentially expressed genes between healthy and early atretic follicles. Among them, 142 were significantly up-regulated in the early atretic group with respect to the healthy group and 308 were down-regulated. Similar expression trends were observed between microarray data and qRT-PCR confirmation, which indicated the reliability of the microarray analysis. Further analysis of the differentially expressed genes revealed the most significantly affected biological functions during early atresia, including blood vessel development, regulation of DNA-templated transcription in response to stress and negative regulation of cell adhesion. The pathway and interaction analysis suggested that atresia initiation is associated with 1) a crosstalk of cell apoptosis, autophagy, and ferroptosis rather than a change of typical apoptosis markers, 2) a dramatic shift of steroidogenic enzymes, 3) deficient glutathione metabolism, and 4) vascular degeneration. The novel gene candidates and pathways identified in the current study will lead to a comprehensive view of the molecular regulation of ovarian follicular atresia and a new understanding of atresia initiation.
Microarray analysis of retinal gene expression in chicks during imposed myopic defocus
Schippert, Ruth; Schaeffel, Frank
2008-01-01
Purpose The retina plays an important regulatory role in ocular growth. To screen for new retinal candidate genes that could be involved in the inhibition of ocular growth, we used chick microarrays to analyze the changes in retinal mRNA expression after myopic defocus was imposed by positive lens wear. Methods Four male white leghorn chicks, aged nine days, wore +6.9D spectacle lenses over both eyes for 24 h. Four untreated age-matched male chicks from the same batch served as controls. The chicks were euthanized, and retinas from both eyes of each chick were pooled. RNA was isolated and labeled cRNA was prepared. These samples were hybridized to Affymetrix GeneChip Chicken Genome arrays with more than 28,000 characterized genes. After comparison of multiple normalization methods, GC-RMA and a false-discovery rate of 6% were chosen for normalization of the data. The expression of 16 candidate genes was further studied, using semiquantitative real-time RT–PCR. In addition, the expression of the mRNA of some of these candidate genes was assessed in chicks that wore either +6.9D lenses for 4 h or −7D lenses for 24 h. Results 123 transcripts were found to be differentially expressed (p<0.05; at least 1.5-fold change in expression level), with an absolute mean fold-change of 1.97±1.16 (mean±standard deviation). Nine of the sixteen genes that were examined by real-time RT–PCR were validated. Regardless of whether positive or negative lenses were worn, six of these nine genes were regulated in the same direction after 24 h: arginyltransferase 1 (ATE1), E74-like factor 1 (ELF1), growth factor receptor-bound protein 2 (GRB2), SHQ1 homolog (S. cerevisiae) (SHQ1), spectrin, beta, non-erythrocytic 1 (SPTBN1), and prepro-urotensin II-related peptide (pp-URP). Three genes responded differently to positive and negative lens treatment after 24 h: ATP-binding cassette, sub-family C, member 10 (ABCC10), CD226 molecule (CD226) and oxysterol binding protein 2 (OSBP2).
Conclusions The validated genes that were regulated only by myopic defocus may represent elements in a pathway generating a “stop-signal” for eye growth. Some of the genes identified in this study have so far not been described in the retina. Further investigation of their function may improve the understanding of the signaling cascades in emmetropization. More generally, published microarray data are variable among different animal models (mouse, chick, monkeys), tissues (retina, retina/retinal pigment epithelium), treatments (diffusers, lenses, lid-suture), as well as different treatment durations (hours, days), and comparisons remain difficult. That only a small number of common genes were found emphasizes the need for careful normalization of the experimental parameters. PMID:18769560
EgoNet: identification of human disease ego-network modules
2014-01-01
Background Mining novel biomarkers from gene expression profiles for accurate disease classification is challenging due to small sample size and high noise in gene expression measurements. Several studies have proposed integrated analyses of microarray data and protein-protein interaction (PPI) networks to find diagnostic subnetwork markers. However, the neighborhood relationship among network member genes has not been fully considered by those methods, leaving many potential gene markers unidentified. The main idea of this study is to take full advantage of the biological observation that genes associated with the same or similar diseases commonly reside in the same neighborhood of molecular networks. Results We present EgoNet, a novel method based on egocentric network-analysis techniques, to exhaustively search and prioritize disease subnetworks and gene markers from a large-scale biological network. When applied to a triple-negative breast cancer (TNBC) microarray dataset, the top selected modules contain both known gene markers in TNBC and novel candidates, such as RAD51 and DOK1, which play a central role in their respective ego-networks by connecting many differentially expressed genes. Conclusions Our results suggest that EgoNet, which is based on the ego network concept, allows the identification of novel biomarkers and provides a deeper understanding of their roles in complex diseases. PMID:24773628
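The ego-network concept mentioned in this abstract can be illustrated with a small sketch: for a chosen "ego" gene, take the gene, its direct interaction partners, and the edges among them. This is not the EgoNet implementation. The function names, the toy network, and the scoring by fraction of differentially expressed members are all assumptions made for illustration.

```python
def ego_network(adjacency, ego):
    """Return (nodes, edges) of the radius-1 ego network around `ego`.

    adjacency: dict mapping each gene to the set of its interaction partners
    (an undirected network stored symmetrically).
    """
    nodes = {ego} | set(adjacency.get(ego, ()))
    edges = set()
    for u in nodes:
        for v in adjacency.get(u, ()):
            if v in nodes and u != v:
                edges.add(tuple(sorted((u, v))))  # store each edge once
    return nodes, edges


def de_fraction(nodes, de_genes):
    """Hypothetical module score: share of members that are differentially expressed."""
    return len(nodes & de_genes) / len(nodes)


# Toy network with made-up gene labels; not data from the study.
ppi = {
    "g1": {"g2", "g3", "g4"},
    "g2": {"g1", "g3"},
    "g3": {"g1", "g2"},
    "g4": {"g1", "g5"},
    "g5": {"g4"},
}
de = {"g2", "g3", "g5"}
nodes, edges = ego_network(ppi, "g1")
print(sorted(nodes), sorted(edges), de_fraction(nodes, de))
```

In this toy case the ego `g1` is not itself differentially expressed but connects two genes that are, which is the kind of hub role the abstract attributes to its novel candidates.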
Arias, Carlos Roberto; Yeh, Hsiang-Yuan; Soo, Von-Wun
2012-01-01
Finding a gene related to a genetic disease is not a trivial task. Therefore, computational methods are needed to give the biomedical community clues about which genes are more likely to be related to a specific disease and could serve as biomarkers. We address the biomarker identification problem using a gene prioritization method called gene prioritization from microarray data based on shortest paths, extended with structural and biological properties and edge flux using voting scheme (GP-MIDAS-VXEF). The method is based on finding relevant interactions on protein interaction networks, then scoring the genes using shortest paths and topological analysis, and integrating the results using a voting scheme and a biological boosting. We ran two experiments: one compares primary prostate tumor and normal samples, and the other compares primary prostate tumors with and without lymph node metastasis. We used 137 known prostate cancer genes as a benchmark. In the first experiment, GP-MIDAS-VXEF outperforms all the other state-of-the-art methods in the benchmark by retrieving the most truly related genes from the candidate set in the top 50 scores found. We applied the same technique to infer the significant biomarkers in prostate cancer with lymph node metastasis, which is not well established. PMID:22654636
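The general idea of scoring genes by shortest paths on an interaction network can be sketched as follows. This is a heavily simplified stand-in, not GP-MIDAS-VXEF: it assumes an unweighted network, uses breadth-first search for one shortest path per seed pair, and gives one "vote" to every intermediate gene on that path. All names and the toy network are hypothetical.

```python
from collections import Counter, deque
from itertools import combinations


def shortest_path(adjacency, source, target):
    """Return one shortest path as a list of nodes (BFS), or None if unreachable."""
    parent = {source: None}
    queue = deque([source])
    while queue:
        node = queue.popleft()
        if node == target:
            path = []
            while node is not None:  # walk back to the source
                path.append(node)
                node = parent[node]
            return path[::-1]
        for nxt in sorted(adjacency.get(node, ())):  # sorted for determinism
            if nxt not in parent:
                parent[nxt] = node
                queue.append(nxt)
    return None


def path_votes(adjacency, seeds):
    """Count how often each gene lies strictly between two seed genes."""
    votes = Counter()
    for a, b in combinations(sorted(seeds), 2):
        path = shortest_path(adjacency, a, b)
        if path:
            votes.update(path[1:-1])  # exclude the two endpoints
    return votes


# Toy network with made-up labels; not data from the study.
ppi = {
    "s1": {"h"},
    "s2": {"h"},
    "s3": {"h", "x"},
    "h": {"s1", "s2", "s3"},
    "x": {"s3"},
}
votes = path_votes(ppi, {"s1", "s2", "s3"})
print(votes.most_common())
```

Genes that repeatedly bridge differentially expressed seeds rise to the top of such a ranking; the published method additionally uses topological properties, edge flux, and biological boosting, none of which are modeled here.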
Microarray Detection of Duplex and Triplex DNA Binders with DNA-Modified Gold Nanoparticles
Lytton-Jean, Abigail K. R.; Han, Min Su; Mirkin, Chad A.
2008-01-01
We have designed a chip-based assay, using microarray technology, for determining the relative binding affinities of duplex and triplex DNA binders. This assay combines the high discrimination capabilities afforded by DNA-modified Au nanoparticles with the high-throughput capabilities of DNA microarrays. The detection and screening of duplex DNA binders are important because these molecules, in many cases, are potential anticancer agents as well as toxins. Triplex DNA binders are also promising drug candidates. These molecules, in conjunction with triplex forming oligonucleotides, could potentially be used to achieve control of gene expression by interfering with transcription factors that bind to DNA. Therefore, the ability to screen for these molecules in a high-throughput fashion could dramatically improve the drug screening process. The assay reported here provides excellent discrimination between strong, intermediate, and weak duplex and triplex DNA binders in a high-throughput fashion. PMID:17614366
Jeukens, Julie; Bittner, David; Knudsen, Rune; Bernatchez, Louis
2009-01-01
In the past 40 years, there has been increasing acceptance that variation in levels of gene expression represents a major source of evolutionary novelty. Gene expression divergence is therefore likely to be involved in the emergence of incipient species, namely, in a context of adaptive radiation. In the lake whitefish species complex (Coregonus clupeaformis), previous microarray experiments have led to the identification of candidate genes potentially implicated in the parallel evolution of the limnetic dwarf lake whitefish, which is highly distinct from the benthic normal lake whitefish in life history, morphology, metabolism, and behavior, and yet diverged from it only approximately 15,000 years before present. The aim of the present study was to address transcriptional divergence for six candidate genes among lake whitefish and European whitefish (Coregonus lavaretus) species pairs, as well as lake cisco (Coregonus artedi) and vendace (Coregonus albula). The main goal was to test the hypothesis that parallel phenotypic adaptation toward the use of the limnetic niche in coregonine fishes is accompanied by parallelism in candidate gene transcription as measured by quantitative real-time polymerase chain reaction. Results obtained for three candidate genes, whereby parallelism in expression was observed across all whitefish species pairs, provide strong support for the hypothesis that divergent natural selection plays an important role in the adaptive radiation of whitefish species. However, this parallelism in expression did not extend to cisco and vendace, thereby infirming transcriptional convergence between limnetic whitefish species and their limnetic congeners for these genes. As recently proposed (Lynch 2007a. The evolution of genetic networks by non-adaptive processes. Nat Rev Genet. 8:803-813), these results may suggest that convergent phenotypic evolution can result from nonadaptive shaping of genome architecture in independently evolved coregonine lineages.
Mohr, Roland; Neckel, Peter; Zhang, Ying; Stachon, Susanne; Nothelfer, Katharina; Schaeferhoff, Karin; Obermayr, Florian; Bonin, Michael; Just, Lothar
2013-11-01
Thyroid hormones play important roles in the development of neural cells in the central nervous system. Even minor changes to normal thyroid hormone levels affect dendritic and axonal outgrowth, sprouting and myelination and might even lead to irreversible damage such as cretinism. Despite our knowledge of the influence on the mammalian CNS, the role of thyroid hormones in the development of the enteric nervous system (ENS) still needs to be elucidated. In this study we have analyzed for the first time the influence of 3,5,3'-triiodothyronine (T3) on ENS progenitor cells using cell biological assays and a microarray technique. In our in vitro model, T3 inhibited cell proliferation and stimulated neurite outgrowth of differentiating ENS progenitor cells. Microarray analysis revealed a group of 338 genes that were regulated by T3 in differentiating enterospheres. 67 of these genes are involved in function and development of the nervous system. 14 of them belong to genes that are involved in axonal guidance or neurite outgrowth. Interestingly, T3 regulated the expression of netrin G1 and endothelin 3, two guidance molecules that are involved in human enteric dysganglionoses. The results of our study give first insights into how T3 may affect the enteric nervous system. T3 is involved in proliferation and differentiation processes in enterospheres. Microarray analysis revealed several interesting gene candidates that might be involved in the observed effects on enterosphere differentiation. Future studies need to be conducted to better understand the gene-to-gene interactions.
Morton, Nicholas M.; Nelson, Yvonne B.; Michailidou, Zoi; Di Rollo, Emma M.; Ramage, Lynne; Hadoke, Patrick W. F.; Seckl, Jonathan R.; Bunger, Lutz; Horvat, Simon; Kenyon, Christopher J.; Dunbar, Donald R.
2011-01-01
Background Obesity and metabolic syndrome result from a complex interaction between genetic and environmental factors. In addition to brain-regulated processes, recent genome wide association studies have indicated that genes highly expressed in adipose tissue affect the distribution and function of fat and thus contribute to obesity. Using a stratified transcriptome gene enrichment approach we attempted to identify adipose tissue-specific obesity genes in the unique polygenic Fat (F) mouse strain generated by selective breeding over 60 generations for divergent adiposity from a comparator Lean (L) strain. Results To enrich for adipose tissue obesity genes a ‘snap-shot’ pooled-sample transcriptome comparison of key fat depots and non adipose tissues (muscle, liver, kidney) was performed. Known obesity quantitative trait loci (QTL) information for the model allowed us to further filter genes for increased likelihood of being causal or secondary for obesity. This successfully identified several genes previously linked to obesity (C1qr1 and Npr3) as positional QTL candidate genes elevated specifically in F line adipose tissue. A number of novel obesity candidate genes were also identified (Thbs1, Ppp1r3d, Tmepai, Trp53inp2, Ttc7b, Tuba1a, Fgf13, Fmr) that have inferred roles in fat cell function. Quantitative microarray analysis was then applied to the most phenotypically divergent adipose depot after exaggerating F and L strain differences with chronic high fat feeding, which revealed a distinct gene expression profile of line, fat depot and diet-responsive inflammatory, angiogenic and metabolic pathways. Selected candidate genes Npr3 and Thbs1, as well as Gys2, a non-QTL gene that otherwise passed our enrichment criteria, were characterised, revealing novel functional effects consistent with a contribution to obesity.
Conclusions A focussed candidate gene enrichment strategy in the unique F and L model has identified novel adipose tissue-enriched genes contributing to obesity. PMID:21915269
Nectoux, J; Fichou, Y; Rosas-Vargas, H; Cagnard, N; Bahi-Buisson, N; Nusbaum, P; Letourneur, F; Chelly, J; Bienvenu, T
2010-07-01
More than 90% of Rett syndrome (RTT) patients have heterozygous mutations in the X-linked methyl-CpG binding protein 2 (MECP2) gene that encodes the methyl-CpG-binding protein 2, a transcriptional modulator. Because MECP2 is subject to X chromosome inactivation (XCI), girls with RTT express either the wild-type or the mutant allele in each individual cell. To test the consequences of MECP2 mutations resulting from a genome-wide transcriptional dysregulation and to identify its target genes in a system that circumvents the functional mosaicism resulting from XCI, we carried out gene expression profiling of clonal populations derived from fibroblast primary cultures expressing exclusively either the wild-type or the mutant MECP2 allele. Clonal cultures were obtained from skin biopsies of three RTT patients carrying either a non-sense or a frameshift MECP2 mutation. For each patient, gene expression profiles of wild-type and mutant clones were compared by oligonucleotide expression microarray analysis. Firstly, clustering analysis classified the RTT patients according to their genetic background and MECP2 mutation. Secondly, expression profiling by microarray analysis and quantitative RT-PCR indicated four up-regulated genes and five down-regulated genes significantly dysregulated in all our statistical analyses, including excellent potential candidate genes for the understanding of the pathophysiology of this neurodevelopmental disease. Thirdly, chromatin immunoprecipitation analysis confirmed MeCP2 binding to respective CpG islands in three out of four up-regulated candidate genes and sequencing of bisulphite-converted DNA indicated that MeCP2 preferentially binds to methylated-DNA sequences. Most importantly, the finding that at least two of these genes (BMCC1 and RNF182) were shown to be involved in cell survival and/or apoptosis may suggest that impaired MeCP2 function could alter the survival of neurons thus compromising brain function without inducing cell death.
Christiansen, Helena E.; Mehinto, Alvine C.; Yu, Fahong; Perry, Russell W.; Denslow, Nancy D.; Maule, Alec G.; Mesa, Matthew G.
2014-01-01
Toxic compounds such as organochlorine pesticides (OCs), polychlorinated biphenyls (PCBs), and polybrominated diphenyl ether flame retardants (PBDEs) have been detected in fish, birds, and aquatic mammals that live in the Columbia River or use food resources from within the river. We developed a custom microarray for largescale suckers (Catostomus macrocheilus) and used it to investigate the molecular effects of contaminant exposure on wild fish in the Columbia River. Using Significance Analysis of Microarrays (SAM) we identified 72 probes representing 69 unique genes with expression patterns that correlated with hepatic tissue levels of OCs, PCBs, or PBDEs. These genes were involved in many biological processes previously shown to respond to contaminant exposure, including drug and lipid metabolism, apoptosis, cellular transport, oxidative stress, and cellular chaperone function. The relation between gene expression and contaminant concentration suggests that these genes may respond to environmental contaminant exposure and are promising candidates for further field and laboratory studies to develop biomarkers for monitoring exposure of wild fish to contaminant mixtures found in the Columbia River Basin. The array developed in this study could also be a useful tool for studies involving endangered sucker species and other sucker species used in contaminant research.
Mustroph, Angelika; Bailey-Serres, Julia
2010-03-01
Plants consist of distinct cell types distinguished by position, morphological features and metabolic activities. We recently developed a method to extract cell-type specific mRNA populations by immunopurification of ribosome-associated mRNAs. Microarray profiles of 21 cell-specific mRNA populations from seedling roots and shoots comprise the Arabidopsis Translatome dataset. This gene expression atlas provides a new tool for the study of cell-specific processes. Here we provide an example of how genes involved in a pathway limited to one or few cell-types can be further characterized and new candidate genes can be predicted. Cells of the root endodermis produce suberin as an inner barrier between the cortex and stele, whereas the shoot epidermal cells form cutin as a barrier to the external environment. Both polymers consist of fatty acid derivates, and share biosynthetic origins. We use the Arabidopsis Translatome dataset to demonstrate the significant cell-specific expression patterns of genes involved in those biosynthetic processes and suggest new candidate genes in the biosynthesis of suberin and cutin.
Identification of a transcriptional signature for the wound healing continuum
Peake, Matthew A; Caley, Mathew; Giles, Peter J; Wall, Ivan; Enoch, Stuart; Davies, Lindsay C; Kipling, David; Thomas, David W; Stephens, Phil
2014-01-01
There is a spectrum/continuum of adult human wound healing outcomes ranging from the enhanced (nearly scarless) healing observed in oral mucosa to scarring within skin and the nonhealing of chronic skin wounds. Central to these outcomes is the role of the fibroblast. Global gene expression profiling utilizing microarrays is starting to give insight into the role of such cells during the healing process, but no studies to date have produced a gene signature for this wound healing continuum. Microarray analysis of adult oral mucosal fibroblast (OMF), normal skin fibroblast (NF), and chronic wound fibroblast (CWF) at 0 and 6 hours post-serum stimulation was performed. Genes whose expression increases following serum exposure in the order OMF < NF < CWF are candidates for a negative/impaired healing phenotype (the dysfunctional healing group), whereas genes with the converse pattern are potentially associated with a positive/preferential healing phenotype (the enhanced healing group). Sixty-six genes in the enhanced healing group and 38 genes in the dysfunctional healing group were identified. Overrepresentation analysis revealed pathways directly and indirectly associated with wound healing and aging and additional categories associated with differentiation, development, and morphogenesis. Knowledge of this wound healing continuum gene signature may in turn assist in the therapeutic assessment/treatment of a patient's wounds. PMID:24844339
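The ordering rule described in this abstract (serum-induced change increasing in the order OMF < NF < CWF for the dysfunctional healing group, and the converse for the enhanced healing group) can be illustrated with a minimal sketch. The gene names and values below are invented for illustration, and the strict-inequality test is an assumption; the abstract does not state the statistical criteria the authors actually applied.

```python
# Hypothetical serum-induced log2 changes (6 h vs 0 h) per fibroblast type.
# Gene names and numbers are made up; they are not data from the study.
changes = {
    "geneA": {"OMF": 0.2, "NF": 0.9, "CWF": 1.8},
    "geneB": {"OMF": 1.5, "NF": 0.7, "CWF": 0.1},
    "geneC": {"OMF": 0.4, "NF": 1.1, "CWF": 0.6},
}


def classify(change):
    """Assign a gene to a healing group from its ordering across cell types."""
    omf, nf, cwf = change["OMF"], change["NF"], change["CWF"]
    if omf < nf < cwf:
        return "dysfunctional"  # increases along OMF < NF < CWF
    if omf > nf > cwf:
        return "enhanced"       # converse pattern
    return "unclassified"       # no monotone ordering


groups = {gene: classify(c) for gene, c in changes.items()}
print(groups)
```

A real analysis would also need replicate-aware significance testing before a gene is assigned to either group; the sketch only shows the ordering logic.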
Huerta, Mario; Munyi, Marc; Expósito, David; Querol, Enric; Cedano, Juan
2014-06-15
The number of microarray experiments performed by scientific teams is growing exponentially. These microarray data could be useful for researchers around the world, but unfortunately they are underused. To fully exploit these data, it is necessary (i) to extract them from a repository of high-throughput gene expression data such as Gene Expression Omnibus (GEO) and (ii) to make the data from different microarrays comparable using tools that are easy for scientists to use. We have developed these two solutions in our server, implementing a database of microarray marker genes (Marker Genes Data Base). This database contains the marker genes of all GEO microarray datasets and it is updated monthly with the new microarrays from GEO. Thus, researchers can see whether the marker genes of their microarray are marker genes in other microarrays in the database, expanding the analysis of their microarray to the rest of the public microarrays. This solution helps not only to corroborate the conclusions regarding a researcher's microarray but also to identify the phenotype of different subsets of individuals under investigation, to frame the results with microarray experiments from other species, pathologies or tissues, to search for drugs that promote the transition between the studied phenotypes, to detect undesirable side effects of the treatment applied, etc. Thus, the researcher can quickly add relevant information to his/her studies from all of the previous analyses performed in other studies as long as they have been deposited in public repositories. Marker-gene database tool: http://ibb.uab.es/mgdb © The Author 2014. Published by Oxford University Press.
Molecular Targeted Therapies of Childhood Choroid Plexus Carcinoma
2013-10-01
Microarray intensities were analyzed in PGS, using the benign human choroid plexus papilloma (CPP) samples as an expression baseline reference. This...additional human and mouse CPC genomic profiles (timeframe: months 1-5). The goal of these studies is to expand our number of genomic profiles (DNA and...mRNA arrays) of both human and mouse CPCs to provide a comprehensive dataset with which to identify key candidate oncogenes, tumor suppressor genes
Filling gaps in PPAR-alpha signaling through comparative nutrigenomics analysis
2009-01-01
Background The application of high-throughput genomic tools in nutrition research is a widespread practice. However, it is becoming increasingly clear that the outcome of individual expression studies is insufficient for the comprehensive understanding of such a complex field. Currently, the availability of large amounts of expression data in public repositories has opened up new challenges in microarray data analysis. We have focused on PPARα, a ligand-activated transcription factor functioning as a fatty acid sensor that regulates the expression of a large set of genes in various metabolic organs such as liver, small intestine or heart. The function of PPARα is strictly connected to the function of its target genes and, although many of these have already been identified, major elements of its physiological function remain to be uncovered. To further investigate the function of PPARα, we have applied a cross-species meta-analysis approach to integrate sixteen microarray datasets studying high fat diet and PPARα signal perturbations in different organisms. Results We identified 164 genes (MDEGs) that were consistently differentially expressed in response to a high fat diet or to perturbations in PPARs signalling. In particular, we found five genes in yeast which were highly conserved and homologous to PPARα targets in mammals, potential candidates to be used as models for the equivalent mammalian genes. Moreover, a screening of the MDEGs for all known transcription factor binding sites and the comparison with a human genome-wide screening of Peroxisome Proliferating Response Elements (PPRE) enabled us to identify 20 new potential candidate genes that show both a binding site and a change in expression in the conditions studied. Lastly, we found a non-random localization of the differentially expressed genes in the genome.
Conclusion The results presented are potentially of great interest for summarizing the currently available expression data, exploiting the power of in silico analysis filtered by evolutionary conservation. The analysis enabled us to indicate potential gene candidates that could fill in the gaps with regard to the signalling of PPARα and, moreover, the non-random localization of the differentially expressed genes in the genome suggests that epigenetic mechanisms are of importance in the regulation of the transcription operated by PPARα. PMID:20003344
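One simple way to read "differentially expressed in a consistent way" across several datasets is a direction-agreement filter, sketched below. This is an illustrative assumption, not the authors' meta-analysis method: the function name, the threshold, and the toy values are hypothetical, and the sketch presumes that gene identifiers from different species have already been mapped to shared ortholog IDs.

```python
def consistent_genes(datasets, min_abs_lfc=1.0):
    """Return genes whose log2 fold change has the same direction in every dataset.

    datasets: list of dicts mapping a shared gene/ortholog ID to a log2 fold change.
    min_abs_lfc: hypothetical magnitude cut-off applied in each dataset.
    """
    shared = set.intersection(*(set(d) for d in datasets))
    calls = {}
    for gene in sorted(shared):
        values = [d[gene] for d in datasets]
        if all(v >= min_abs_lfc for v in values):
            calls[gene] = "up"
        elif all(v <= -min_abs_lfc for v in values):
            calls[gene] = "down"
    return calls


# Toy log2 fold changes for three hypothetical datasets.
toy = [
    {"orth1": 1.4, "orth2": -2.0, "orth3": 1.2},
    {"orth1": 2.1, "orth2": -1.3, "orth3": -1.5},
    {"orth1": 1.1, "orth2": -1.8, "orth3": 1.9, "orth4": 3.0},
]
print(consistent_genes(toy))
```

Requiring agreement in every dataset is deliberately strict; a vote-counting or effect-size-combining rule would be a reasonable alternative.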
2013-01-01
Background Helicobacter pylori (H. pylori) infection and excessive salt intake are known as important risk factors for stomach cancer in humans. However, interactions of these two factors with gene expression profiles during gastric carcinogenesis remain unclear. In the present study, we investigated the global gene expression associated with stomach carcinogenesis and prognosis of human gastric cancer using a mouse model. Methods To find candidate genes involved in stomach carcinogenesis, we first constructed a carcinogen-induced mouse gastric tumor model combined with H. pylori infection and high-salt diet. C57BL/6J mice were given N-methyl-N-nitrosourea in their drinking water and sacrificed after 40 weeks. Animals of a combination group were inoculated with H. pylori and fed a high-salt diet. Gene expression profiles in glandular stomach of the mice were investigated by oligonucleotide microarray. Second, we examined the utility of the candidate gene as a prognostic factor for human patients. Immunohistochemical analysis of CD177, one of the up-regulated genes, was performed in human advanced gastric cancer specimens to evaluate the association with prognosis. Results The multiplicity of gastric tumors in carcinogen-treated mice was significantly increased by the combination of H. pylori infection and high-salt diet. In the microarray analysis, 35 genes up-regulated and 31 genes down-regulated more than two-fold were detected in the H. pylori-infection and high-salt diet combined group compared with the other groups. Quantitative RT-PCR confirmed significant over-expression of two candidate genes including Cd177 and Reg3g. On immunohistochemical analysis of CD177 in human advanced gastric cancer specimens, over-expression was evident in 33 (60.0%) of 55 cases, significantly correlating with a favorable prognosis (P = 0.0294).
Multivariate analysis including clinicopathological factors as covariates revealed high expression of CD177 to be an independent prognostic factor for overall survival. Conclusions These results suggest that our mouse model combined with H. pylori infection and high-salt diet is useful for gene expression profiling in gastric carcinogenesis, providing evidence that CD177 is a novel prognostic factor for stomach cancer. This is the first report showing a prognostic correlation between CD177 expression and solid tumor behavior. PMID:23899160
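The "more than two-fold" criterion mentioned in the Results can be sketched as a simple ratio filter. The function name, gene labels, and intensities below are hypothetical, and the sketch assumes already-normalized, strictly positive expression values; it does not reproduce the study's normalization or group comparisons.

```python
def fold_change_calls(expr_test, expr_ref, threshold=2.0):
    """Split genes into up- and down-regulated lists by a fold-change cut-off.

    expr_test, expr_ref: dicts of gene -> normalized, positive expression value.
    threshold: a ratio above it counts as up, a ratio below 1/threshold as down.
    """
    up, down = [], []
    for gene in sorted(set(expr_test) & set(expr_ref)):
        ratio = expr_test[gene] / expr_ref[gene]
        if ratio > threshold:
            up.append(gene)
        elif ratio < 1.0 / threshold:
            down.append(gene)
    return up, down


# Toy values for a combined-treatment group versus a reference group.
combined = {"g1": 900.0, "g2": 120.0, "g3": 40.0, "g4": 210.0}
reference = {"g1": 300.0, "g2": 100.0, "g3": 100.0, "g4": 105.0}
up, down = fold_change_calls(combined, reference)
print(up, down)
```

Note that g4 has a ratio of exactly 2.0 and is therefore not called, which matches a strict "more than two-fold" reading.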
VTCdb: a gene co-expression database for the crop species Vitis vinifera (grapevine).
Wong, Darren C J; Sweetman, Crystal; Drew, Damian P; Ford, Christopher M
2013-12-16
Gene expression datasets in model plants such as Arabidopsis have contributed to our understanding of gene function and how a single underlying biological process can be governed by a diverse network of genes. The accumulation of publicly available microarray data encompassing a wide range of biological and environmental conditions has enabled the development of additional capabilities including gene co-expression analysis (GCA). GCA is based on the understanding that genes encoding proteins involved in similar and/or related biological processes may exhibit comparable expression patterns over a range of experimental conditions, developmental stages and tissues. We present an open access database for the investigation of gene co-expression networks within the cultivated grapevine, Vitis vinifera. The new gene co-expression database, VTCdb (http://vtcdb.adelaide.edu.au/Home.aspx), offers an online platform for transcriptional regulatory inference in the cultivated grapevine. Using condition-independent and condition-dependent approaches, grapevine co-expression networks were constructed using the latest publicly available microarray datasets from diverse experimental series, utilising the Affymetrix Vitis vinifera GeneChip (16 K) and the NimbleGen Grape Whole-genome microarray chip (29 K), thus making it possible to profile approximately 29,000 genes (95% of the predicted grapevine transcriptome). Applications available with the online platform include the use of gene names, probesets, modules or biological processes to query the co-expression networks, with the option to choose between Affymetrix or Nimblegen datasets and between multiple co-expression measures. Alternatively, the user can browse existing network modules using interactive network visualisation and analysis via CytoscapeWeb. 
To demonstrate the utility of the database, we present examples from three fundamental biological processes (berry development, photosynthesis and flavonoid biosynthesis) whereby the recovered sub-networks reconfirm established plant gene functions and also identify novel associations. Together, we present valuable insights into grapevine transcriptional regulation by developing network models applicable to researchers in their prioritisation of gene candidates, for on-going study of biological processes related to grapevine development, metabolism and stress responses.
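The idea behind gene co-expression analysis, that genes with comparable expression patterns across conditions are linked in a network, can be sketched with pairwise Pearson correlation. Pearson correlation and the 0.9 cut-off are assumptions chosen for illustration (the abstract mentions multiple co-expression measures without naming them), and the profiles are toy values, not grapevine data.

```python
from itertools import combinations
from math import sqrt


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length numeric sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sqrt(sum((a - mx) ** 2 for a in x))
    sy = sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)


def coexpression_edges(profiles, min_r=0.9):
    """Return gene pairs whose profiles correlate at or above min_r."""
    return [
        (a, b)
        for a, b in combinations(sorted(profiles), 2)
        if pearson(profiles[a], profiles[b]) >= min_r
    ]


# Toy expression profiles over four hypothetical conditions.
profiles = {
    "g1": [1.0, 2.0, 3.0, 4.0],
    "g2": [2.0, 4.0, 6.0, 8.0],
    "g3": [4.0, 3.0, 2.0, 1.0],
    "g4": [1.0, 3.0, 2.0, 4.0],
}
print(coexpression_edges(profiles))
```

Only positively correlated pairs are kept here; using the absolute value of the correlation would also capture anti-correlated genes such as g1 and g3.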
Graubner, Felix R; Gram, Aykut; Kautz, Ewa; Bauersachs, Stefan; Aslan, Selim; Agaoglu, Ali R; Boos, Alois; Kowalewski, Mariusz P
2017-08-01
In the dog, there is no luteolysis in the absence of pregnancy. Thus, this species lacks any anti-luteolytic endocrine signal as found in other species that modulate uterine function during the critical period of pregnancy establishment. Nevertheless, in the dog an embryo-maternal communication must occur in order to prevent rejection of embryos. Based on this hypothesis, we performed microarray analysis of canine uterine samples collected during pre-attachment phase (days 10-12) and in corresponding non-pregnant controls, in order to elucidate the embryo attachment signal. An additional goal was to identify differences in uterine responses to pre-attachment embryos between dogs and other mammalian species exhibiting different reproductive patterns with regard to luteolysis, implantation, and preparation for placentation. Therefore, the canine microarray data were compared with gene sets from pigs, cattle, horses, and humans. We found 412 genes differentially regulated between the two experimental groups. The functional terms most strongly enriched in response to pre-attachment embryos related to extracellular matrix function and remodeling, and to immune and inflammatory responses. Several candidate genes were validated by semi-quantitative PCR. When compared with other species, best matches were found with human and equine counterparts. Especially for the pig, the majority of overlapping genes showed opposite expression patterns. Interestingly, 1926 genes did not pair with any of the other gene sets. Using a microarray approach, we report the uterine changes in the dog driven by the presence of embryos and compare these results with datasets from other mammalian species, finding common-, contrary-, and exclusively canine-regulated genes. © The Authors 2017. Published by Oxford University Press on behalf of Society for the Study of Reproduction.
2013-01-01
Background Apoptosis is a critical process in endothelial cell (EC) biology and pathology, which has been extensively studied at the protein level. Numerous gene expression studies of EC apoptosis have also been performed; however, few attempts have been made to use gene expression data to identify the molecular relationships and master regulators that underlie EC apoptosis. Therefore, we sought to understand these relationships by generating a Bayesian gene regulatory network (GRN) model. Results ECs were induced to undergo apoptosis using serum withdrawal and followed over a time course in triplicate, using microarrays. When generating the GRN, this EC time course data was supplemented by a library of microarray data from EC treated with siRNAs targeting over 350 signalling molecules. The GRN model proposed Vasohibin-1 (VASH1) as one of the candidate master-regulators of EC apoptosis with numerous downstream mRNAs. To evaluate the role played by VASH1 in EC, we used siRNA to reduce the expression of VASH1. Of 10 mRNAs downstream of VASH1 in the GRN that were examined, 7 were significantly up- or down-regulated in the direction predicted by the GRN. Further supporting an important biological role of VASH1 in EC, targeted reduction of VASH1 mRNA abundance conferred resistance to serum withdrawal-induced EC death. Conclusion We have utilised Bayesian GRN modelling to identify a novel candidate master regulator of EC apoptosis. This study demonstrates how GRN technology can complement traditional methods to hypothesise the regulatory relationships that underlie important biological processes. PMID:23324451
Adan, Aysun; Baran, Yusuf
2016-05-01
Fisetin and hesperetin, naturally occurring flavonoids, have been reported as novel antioxidants with chemopreventive/chemotherapeutic potential against various types of cancer. However, their mechanism of action in CML is still unknown. This study aims to evaluate the therapeutic potential of fisetin and hesperetin and their effects on cell proliferation, apoptosis, and cell cycle progression in human K562 CML cells. The results indicated that fisetin and hesperetin inhibited cell proliferation and triggered programmed cell death in these cells. The latter was confirmed by mitochondrial membrane depolarization and an increase in caspase-3 activation. In addition, we detected S and G2/M cell cycle arrests and G0/G1 arrest upon fisetin and hesperetin treatment, respectively. To identify the altered genes and genetic networks in response to fisetin and hesperetin, whole-genome microarray analysis was performed. The microarray gene profiling analysis revealed some important signaling pathways including JAK/STAT pathway, KIT receptor signaling, and growth hormone receptor signaling that were altered upon fisetin and hesperetin treatment. Moreover, microarray data suggested potential candidate genes for targeted CML therapy. Fisetin and hesperetin significantly modulated the expression of genes involved in cell proliferation and division, apoptosis, cell cycle regulation, and other significant cellular processes such as replication, transcription, and translation. In conclusion, our results suggest fisetin and hesperetin as potential natural agents for CML therapy.
Screening key candidate genes and pathways involved in insulinoma by microarray analysis.
Zhou, Wuhua; Gong, Li; Li, Xuefeng; Wan, Yunyan; Wang, Xiangfei; Li, Huili; Jiang, Bin
2018-06-01
Insulinoma is a rare type of tumor and its genetic features remain largely unknown. This study aimed to search for potential key genes and relevant enriched pathways of insulinoma. The gene expression data from GSE73338 were downloaded from Gene Expression Omnibus database. Differentially expressed genes (DEGs) were identified between insulinoma tissues and normal pancreas tissues, followed by pathway enrichment analysis, protein-protein interaction (PPI) network construction, and module analysis. The expressions of candidate key genes were validated by quantitative real-time polymerase chain reaction (RT-PCR) in insulinoma tissues. A total of 1632 DEGs were obtained, including 1117 upregulated genes and 514 downregulated genes. Pathway enrichment results showed that upregulated DEGs were significantly implicated in insulin secretion, and downregulated DEGs were mainly enriched in pancreatic secretion. PPI network analysis revealed 7 hub genes with degrees more than 10, including GCG (glucagon), GCGR (glucagon receptor), PLCB1 (phospholipase C, beta 1), CASR (calcium sensing receptor), F2R (coagulation factor II thrombin receptor), GRM1 (glutamate metabotropic receptor 1), and GRM5 (glutamate metabotropic receptor 5). DEGs involved in the significant modules were enriched in calcium signaling pathway, protein ubiquitination, and platelet degranulation. Quantitative RT-PCR data confirmed that the expression trends of these hub genes were similar to the results of bioinformatic analysis. The present study demonstrated that candidate DEGs and enriched pathways were the potential critical molecular events involved in the development of insulinoma, and these findings were useful for better understanding of insulinoma genesis.
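The hub-gene criterion in this abstract (degree greater than 10 in the PPI network) amounts to counting each node's distinct interaction partners. The sketch below shows that calculation on an invented toy network; the node names and edges are hypothetical and are not interactions reported by the study, and the abstract does not say which tool the authors used to build or analyze their network.

```python
from collections import Counter


def node_degrees(edges):
    """Count distinct neighbours per node in an undirected edge list."""
    # Treat (a, b) and (b, a) as the same edge and ignore self-loops.
    unique = {frozenset(e) for e in edges if e[0] != e[1]}
    degree = Counter()
    for pair in unique:
        for node in pair:
            degree[node] += 1
    return degree


def hub_genes(edges, min_degree=10):
    """Return nodes whose degree is strictly greater than min_degree."""
    return sorted(g for g, d in node_degrees(edges).items() if d > min_degree)


# Toy network: HUB1 touches 11 partners plus HUB2; one edge is duplicated.
edges = [("HUB1", f"P{i}") for i in range(11)]
edges += [("HUB1", "HUB2"), ("HUB2", "P0"), ("HUB2", "HUB1")]
print(hub_genes(edges))
```

Deduplicating edges before counting matters in practice, because interaction databases often list the same pair more than once.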
Fan, Qing-Jie; Yan, Feng-Xia; Qiao, Guang; Zhang, Bing-Xue; Wen, Xiao-Peng
2014-01-01
Drought is one of the most severe threats to the growth, development and yield of plants. In order to unravel the molecular basis underlying the high tolerance of pitaya (Hylocereus undatus) to drought stress, suppression subtractive hybridization (SSH) and cDNA microarray approaches were firstly combined to identify the potential important or novel genes involved in the plant responses to drought stress. The forward (drought over drought-free) and reverse (drought-free over drought) suppression subtractive cDNA libraries were constructed using in vitro shoots of cultivar 'Zihonglong' exposed to drought stress and drought-free (control) conditions. A total of 2112 clones, among which half were from either forward or reverse SSH library, were randomly picked to construct a pitaya cDNA microarray. Microarray analysis was carried out to verify the expression fluctuations of this set of clones upon drought treatment compared with the controls. A total of 309 expressed sequence tags (ESTs), 153 from the forward library and 156 from the reverse library, were obtained, and 138 unique ESTs were identified after sequencing by clustering and BLAST analyses, which included genes that had been previously reported as responsive to water stress as well as some functionally unknown genes. Thirty-six genes were mapped to 47 KEGG pathways, including carbohydrate metabolism, lipid metabolism, energy metabolism, nucleotide metabolism, and amino acid metabolism of pitaya. Expression analysis of the selected ESTs by reverse transcriptase polymerase chain reaction (RT-PCR) corroborated the results of differential screening. Moreover, time-course expression patterns of these selected ESTs further confirmed that they were closely responsive to drought treatment. Among the differentially expressed genes (DEGs), many are related to stress tolerances including drought tolerance.
Thus, the mechanism of drought tolerance of this pitaya genotype is a very complex physiological and biochemical process, in which multiple metabolic pathways and many genes are implicated. The data gained herein provide an insight into the mechanism underlying the drought stress tolerance of pitaya, and may facilitate the screening of candidate genes for drought tolerance. © 2013 Elsevier B.V. All rights reserved.
Azumi, Kaoru; Usami, Takeshi; Kamimura, Akiko; Sabau, Sorin V; Miki, Yasufumi; Fujie, Manabu; Jung, Sung-Ju; Kitamura, Shin-Ichi; Suzuki, Satoru; Yokosawa, Hideyoshi
2007-12-01
A serious disease of the ascidian Halocynthia roretzi has been spread extensively among Korean aquaculture sites. To reveal the cause of the disease and establish a monitoring system for it, we constructed a cDNA microarray spotted with 2,688 cDNAs derived from H. roretzi hemocyte cDNA libraries to detect genes differentially expressed in hemocytes between diseased and non-diseased ascidians. We detected 21 genes showing increased expression and 16 genes showing decreased expression in hemocytes from diseased ascidians compared with those from non-diseased ascidians. RT-PCR analyses confirmed that the expression levels of genes encoding astacin, lysozyme, ribosomal protein PO, and ubiquitin-ribosomal protein L40e fusion protein were increased in hemocytes from diseased ascidians, while those of genes encoding HSP40, HSP70, fibronectin, carboxypeptidase and lactate dehydrogenase were decreased. These genes were expressed not only in hemocytes but also in various other tissues in ascidians. Furthermore, the expression of glutathione-S transferase omega, which is known to be up-regulated in H. roretzi hemocytes during inflammatory responses, was strongly increased in hemocytes from diseased ascidians. These gene expression profiles suggest that immune and inflammatory reactions occur in the hemocytes of diseased ascidians. These genes will be good markers for detecting and monitoring this disease of ascidians in Korean aquaculture sites.
Berkovic, Samuel F.; Dibbens, Leanne M.; Oshlack, Alicia; Silver, Jeremy D.; Katerelos, Marina; Vears, Danya F.; Lüllmann-Rauch, Renate; Blanz, Judith; Zhang, Ke Wei; Stankovich, Jim; Kalnins, Renate M.; Dowling, John P.; Andermann, Eva; Andermann, Frederick; Faldini, Enrico; D'Hooge, Rudi; Vadlamudi, Lata; Macdonell, Richard A.; Hodgson, Bree L.; Bayly, Marta A.; Savige, Judy; Mulley, John C.; Smyth, Gordon K.; Power, David A.; Saftig, Paul; Bahlo, Melanie
2008-01-01
Action myoclonus-renal failure syndrome (AMRF) is an autosomal-recessive disorder with the remarkable combination of focal glomerulosclerosis, frequently with glomerular collapse, and progressive myoclonus epilepsy associated with storage material in the brain. Here, we employed a novel combination of molecular strategies to find the responsible gene and show its effects in an animal model. Utilizing only three unrelated affected individuals and their relatives, we used homozygosity mapping with single-nucleotide polymorphism chips to localize AMRF. We then used microarray-expression analysis to prioritize candidates prior to sequencing. The disorder was mapped to 4q13-21, and microarray-expression analysis identified SCARB2/Limp2, which encodes a lysosomal-membrane protein, as the likely candidate. Mutations in SCARB2/Limp2 were found in all three families used for mapping and subsequently confirmed in two other unrelated AMRF families. The mutations were associated with lack of SCARB2 protein. Reanalysis of an existing Limp2 knockout mouse showed intracellular inclusions in cerebral and cerebellar cortex, and the kidneys showed subtle glomerular changes. This study highlights that recessive genes can be identified with a very small number of subjects. The ancestral lysosomal-membrane protein SCARB2/LIMP-2 is responsible for AMRF. The heterogeneous pathology in the kidney and brain suggests that SCARB2/Limp2 has pleiotropic effects that may be relevant to understanding the pathogenesis of other forms of glomerulosclerosis or collapse and myoclonic epilepsies. PMID:18308289
2010-01-01
Introduction Various multigene predictors of breast cancer clinical outcome have been commercialized, but proved to be prognostic only for hormone receptor (HR) subsets overexpressing estrogen or progesterone receptors. Hormone receptor negative (HRneg) breast cancers, particularly those lacking HER2/ErbB2 overexpression and known as triple-negative (Tneg) cases, are heterogeneous and generally aggressive breast cancer subsets in need of prognostic subclassification, since most early stage HRneg and Tneg breast cancer patients are cured with conservative treatment yet invariably receive aggressive adjuvant chemotherapy. Methods An unbiased search for genes predictive of distant metastatic relapse was undertaken using a training cohort of 199 node-negative, adjuvant treatment naïve HRneg (including 154 Tneg) breast cancer cases curated from three public microarray datasets. Prognostic gene candidates were subsequently validated using a different cohort of 75 node-negative, adjuvant naïve HRneg cases curated from three additional datasets. The HRneg/Tneg gene signature was prognostically compared with eight other previously reported gene signatures, and evaluated for cancer network associations by two commercial pathway analysis programs. Results A novel set of 14 prognostic gene candidates was identified as outcome predictors: CXCL13, CLIC5, RGS4, RPS28, RFX7, EXOC7, HAPLN1, ZNF3, SSX3, HRBL, PRRG3, ABO, PRTN3, MATN1. A composite HRneg/Tneg gene signature index proved more accurate than any individual candidate gene or other reported multigene predictors in identifying cases likely to remain free of metastatic relapse. Significant positive correlations between the HRneg/Tneg index and three independent immune-related signatures (STAT1, IFN, and IR) were observed, as were consistent negative associations between the three immune-related signatures and five other proliferation module-containing signatures (MS-14, ONCO-RS, GGI, CSR/wound and NKI-70). 
Network analysis identified 8 genes within the HRneg/Tneg signature as being functionally linked to immune/inflammatory chemokine regulation. Conclusions A multigene HRneg/Tneg signature linked to immune/inflammatory cytokine regulation was identified from pooled expression microarray data and shown to be superior to other reported gene signatures in predicting the metastatic outcome of early stage and conservatively managed HRneg and Tneg breast cancer. Further validation of this prognostic signature may lead to new therapeutic insights and spare many newly diagnosed breast cancer patients the need for aggressive adjuvant chemotherapy. PMID:20946665
Sheu, Jim Jinn-Chyuan; Lee, Chia-Huei; Ko, Jenq-Yuh; Tsao, George S W; Wu, Chung-Chun; Fang, Chih-Yeu; Tsai, Fuu-Jen; Hua, Chun-Hung; Chen, Chi-Long; Chen, Jen-Yang
2009-10-01
Nasopharyngeal carcinoma is an epithelial malignancy with a remarkable racial and geographic distribution. Previous cytogenetic studies have shown nasopharyngeal carcinoma to be characterized by gross genomic aberrations. However, identification of susceptible gene loci in advanced nasopharyngeal carcinoma has been poorly discussed. A genome-wide survey of gene copy number changes was initiated with two nasopharyngeal carcinoma cell lines by array-based comparative genomic hybridization analysis. These alterations were confirmed by a parallel analysis with the data from the gene expression microarray and were validated by quantitative PCR. Clinical association of the defined target genes was analyzed by fluorescence in situ hybridization on 48 metastatic tumors. A high percentage of genes were consistently altered in dosage and expression levels with gain on 3q26.2-q26.32 and losses on 3p12.3-p14.2 and 9p21.3-p23. Six candidate genes, GPR160 (3q26.2-q27), SKIL (3q26), ADAMTS9 (3p14.2-p14.3), LRIG1 (3p14), MPDZ (9p22-p24), and ADFP (9p22.1) were validated by quantitative PCR. Fluorescence in situ hybridization studies revealed amplification of GPR160 (in 25% of cases) and SKIL (33%); and deletion of ADAMTS9 (30%), LRIG1 (35%), MPDZ (15%), and ADFP (15%). Clinical association analyses indicated a poor survival rate with genetic alterations at the defined 3p deletion (P = 0.0012) and the 3q amplification regions (P = 0.0114). The combined microarray technologies suggested novel candidate oncogenes, amplification of GPR160 and SKIL at 3q26.2-q26.32, and deletion of tumor suppressor genes ADAMTS9 and LRIG1 at 3p12.3-p14.2. Altered expression of these genes may be responsible for malignant progression and could be used as potential markers for nasopharyngeal carcinoma.
Lim, Hye-Sun; Ha, Hyekyung; Shin, Hyeun-Kyoo; Jeong, Soo-Jin
2015-09-01
Saussurea lappa has been reported to possess anti-atopic properties. In this study, we confirmed the anti-atopic properties of S. lappa in Nc/Nga mice and used microarray analysis to investigate candidate genes related to these properties. We then examined the target genes by real-time PCR in an in vitro experiment. S. lappa significantly reduced the atopic dermatitis (AD) score and immunoglobulin E levels compared with AD-induced Nc/Nga mice. Microarray analysis of back skin obtained from the animals showed that the effects of S. lappa are closely associated with cytokine-cytokine receptor interaction and the JAK-STAT signaling pathway. Consistent with the microarray data, real-time RT-PCR confirmed this modulation at the mRNA level in skin tissues from S. lappa-treated mice. Among these genes, PI3Kca and IL20Rβ were significantly downregulated by S. lappa treatment in the Nc/Nga mouse model. In an in vitro experiment using HaCaT cells, the S. lappa components alantolactone, caryophyllene, costic acid, costunolide and dehydrocostus lactone significantly decreased the expression of PI3Kca but not IL20Rβ. Therefore, our study suggests that PI3Kca-related signaling is closely related to the protective effects of S. lappa against the development of atopic dermatitis.
2011-01-01
Background Technological advances are progressively increasing the application of genomics to a wider array of economically and ecologically important species. High-density maps enriched for transcribed genes facilitate the discovery of connections between genes and phenotypes. We report the construction of a high-density linkage map of expressed genes for the heterozygous genome of Eucalyptus using Single Feature Polymorphism (SFP) markers. Results SFP discovery and mapping was achieved using pseudo-testcross screening and selective mapping to simultaneously optimize linkage mapping and microarray costs. SFP genotyping was carried out by hybridizing complementary RNA prepared from the xylem of 4.5-year-old trees to an SFP array containing 103,000 25-mer oligonucleotide probes representing 20,726 unigenes derived from a modest-sized expressed sequence tag collection. An SFP-mapping microarray with 43,777 selected candidate SFP probes representing 15,698 genes was subsequently designed and used to genotype SFPs in a larger subset of the segregating population drawn by selective mapping. A total of 1,845 genes were mapped, with 884 of them ordered with high likelihood support on a framework map anchored to 180 microsatellites with an average density of 1.2 cM. Using more probes per unigene increased two-fold the likelihood of detecting segregating SFPs, eventually resulting in more genes mapped. In silico validation showed that 87% of the SFPs map to the expected location on the 4.5X draft sequence of the Eucalyptus grandis genome. Conclusions The Eucalyptus 1,845 gene map is the most highly enriched map for transcriptional information for any forest tree species to date. It represents a major improvement on the number of genes previously positioned on Eucalyptus maps and provides an initial glimpse at the gene space for this global tree genome.
A general protocol is proposed to build high-density transcript linkage maps in less characterized plant species by SFP genotyping with a concurrent objective of reducing microarray costs. High-density gene-rich maps represent a powerful resource to assist gene discovery endeavors when used in combination with QTL and association mapping and should be especially valuable to assist the assembly of reference genome sequences soon to come for several plant and animal species. PMID:21492453
The molecular genetic makeup of acute lymphoblastic leukemia | Office of Cancer Genomics
Abstract: Genomic profiling has transformed our understanding of the genetic basis of acute lymphoblastic leukemia (ALL). Recent years have seen a shift from microarray analysis and candidate gene sequencing to next-generation sequencing. Together, these approaches have shown that many ALL subtypes are characterized by constellations of structural rearrangements, submicroscopic DNA copy number alterations, and sequence mutations, several of which have clear implications for risk stratification and targeted therapeutic intervention.
Identification of quantitative trait loci and candidate genes for cadmium tolerance in Populus
DOE Office of Scientific and Technical Information (OSTI.GOV)
Induri, Brahma R; Ellis, Danielle R; Slavov, Goncho T.
2012-01-01
Understanding genetic variation for the response of Populus to heavy metals like cadmium (Cd) is an important step in elucidating the underlying mechanisms of tolerance. In this study, a pseudo-backcross pedigree of Populus trichocarpa Torr. & Gray and Populus deltoides Bart. was characterized for growth and performance traits after Cd exposure. A total of 16 quantitative trait loci (QTL) at logarithm of odds (LOD) ratio 2.5 were detected for total dry weight, its components and root volume. Major QTL for Cd responses were mapped to two different linkage groups and the relative allelic effects were in opposing directions on the two chromosomes, suggesting differential mechanisms at these two loci. The phenotypic variance explained by Cd QTL ranged from 5.9 to 11.6% and averaged 8.2% across all QTL. A whole-genome microarray study led to the identification of nine Cd-responsive genes from these QTL. Promising candidates for Cd tolerance include an NHL repeat membrane-spanning protein, a metal transporter and a putative transcription factor. Additional candidates in the QTL intervals include a putative homolog of a glutamate cysteine ligase, and a glutathione-S-transferase. Functional characterization of these candidate genes should enhance our understanding of Cd metabolism and transport and phytoremediation capabilities of Populus.
Feltus, F Alex
2014-06-01
Understanding the control of any trait optimally requires the detection of causal genes, gene interaction, and mechanism of action to discover and model the biochemical pathways underlying the expressed phenotype. Functional genomics techniques, including RNA expression profiling via microarray and high-throughput DNA sequencing, allow for the precise genome localization of biological information. Powerful genetic approaches, including quantitative trait locus (QTL) and genome-wide association study mapping, link phenotype with genome positions, yet genetics is less precise in localizing the relevant mechanistic information encoded in DNA. The coupling of salient functional genomic signals with genetically mapped positions is an appealing approach to discover meaningful gene-phenotype relationships. Techniques used to define this genetic-genomic convergence comprise the field of systems genetics. This short review will address an application of systems genetics where RNA profiles are associated with genetically mapped genome positions of individual genes (eQTL mapping) or as gene sets (co-expression network modules). Both approaches can be applied for knowledge independent selection of candidate genes (and possible control mechanisms) underlying complex traits where multiple, likely unlinked, genomic regions might control specific complex traits. Copyright © 2014 Elsevier Ireland Ltd. All rights reserved.
Comprehensive genomic analysis of patients with disorders of cerebral cortical development.
Wiszniewski, Wojciech; Gawlinski, Pawel; Gambin, Tomasz; Bekiesinska-Figatowska, Monika; Obersztyn, Ewa; Antczak-Marach, Dorota; Akdemir, Zeynep Hande Coban; Harel, Tamar; Karaca, Ender; Jurek, Marta; Sobecka, Katarzyna; Nowakowska, Beata; Kruk, Malgorzata; Terczynska, Iwona; Goszczanska-Ciuchta, Alicja; Rudzka-Dybala, Mariola; Jamroz, Ewa; Pyrkosz, Antoni; Jakubiuk-Tomaszuk, Anna; Iwanowski, Piotr; Gieruszczak-Bialek, Dorota; Piotrowicz, Malgorzata; Sasiadek, Maria; Kochanowska, Iwona; Gurda, Barbara; Steinborn, Barbara; Dawidziuk, Mateusz; Castaneda, Jennifer; Wlasienko, Pawel; Bezniakow, Natalia; Jhangiani, Shalini N; Hoffman-Zacharska, Dorota; Bal, Jerzy; Szczepanik, Elzbieta; Boerwinkle, Eric; Gibbs, Richard A; Lupski, James R
2018-04-30
Malformations of cortical development (MCDs) manifest with structural brain anomalies that lead to neurologic sequelae, including epilepsy, cerebral palsy, developmental delay, and intellectual disability. To investigate the underlying genetic architecture of patients with disorders of cerebral cortical development, a cohort of 54 patients demonstrating neuroradiologic signs of MCDs was investigated. Individual genomes were interrogated for single-nucleotide variants (SNV) and copy number variants (CNV) with whole-exome sequencing and chromosomal microarray studies. Variation affecting known MCDs-associated genes was found in 16/54 cases, including 11 patients with SNV, 2 patients with CNV, and 3 patients with both CNV and SNV, at distinct loci. Diagnostic pathogenic SNV and potentially damaging variants of unknown significance (VUS) were identified in two groups of seven individuals each. We demonstrated that de novo variants are important among patients with MCDs as they were identified in 10/16 individuals with a molecular diagnosis. Three patients showed changes in known MCDs genes and a clinical phenotype beyond the usual characteristics observed, i.e., phenotypic expansion, for a particular known disease gene clinical entity. We also discovered 2 likely candidate genes, CDH4, and ASTN1, with human and animal studies supporting their roles in brain development, and 5 potential candidate genes. Our findings emphasize genetic heterogeneity of MCDs disorders and postulate potential novel candidate genes involved in cerebral cortical development.
Moon, Sunok; Oo, Moe Moe; Kim, Backki; Koh, Hee-Jong; Oh, Sung Aeong; Yi, Gihwan; An, Gynheung; Park, Soon Ki; Jung, Ki-Hong
2018-04-23
Understanding late pollen development, including the maturation and pollination process, is a key component in maintaining crop yields. Transcriptome data obtained through microarray or RNA-seq technologies can provide useful insight into those developmental processes. Six series of microarray data from a public transcriptome database, the Gene Expression Omnibus of the National Center for Biotechnology Information, are related to anther and pollen development. We performed a systematic and functional study across the rice genome of genes that are preferentially expressed in the late stages of pollen development, including maturation and germination. By comparing the transcriptomes of sporophytes and male gametes over time, we identified 627 late pollen-preferred genes that are conserved among japonica and indica rice cultivars. Functional classification analysis with a MapMan tool kit revealed a significant association between cell wall organization/metabolism and mature pollen grains. Comparative analysis of rice and Arabidopsis demonstrated that genes involved in cell wall modifications and the metabolism of major carbohydrates are unique to rice. We used the GUS reporter system to monitor the expression of eight of those genes. In addition, we evaluated the significance of our candidate genes, using T-DNA insertional mutant population and the CRISPR/Cas9 system. Mutants from T-DNA insertion and CRISPR/Cas9 systems of a rice gene encoding glycerophosphoryl diester phosphodiesterase are defective in their male gamete transfer. Through the global analyses of the late pollen-preferred genes from rice, we found several biological features of these genes. First, biological process related to cell wall organization and modification is over-represented in these genes to support rapid tube growth. 
Second, comparative analysis of late pollen-preferred genes between rice and Arabidopsis provides significant insight into the evolutionary divergence in cell wall biogenesis and storage reserves of pollen. In addition, these candidates might be useful targets for future examinations of late pollen development, and will be a valuable resource for accelerating the understanding of molecular mechanisms for pollen maturation and germination processes in rice.
Baumann, Antoine; Devaux, Yvan; Audibert, Gérard; Zhang, Lu; Bracard, Serge; Colnat-Coulbois, Sophie; Klein, Olivier; Zannad, Faiez; Charpentier, Claire; Longrois, Dan; Mertes, Paul-Michel
2013-01-01
Delayed cerebral ischemia (DCI) is a potentially devastating complication after intracranial aneurysm rupture and its mechanisms remain poorly elucidated. Early identification of the patients prone to developing DCI after rupture may represent a major breakthrough in its prevention and treatment. The single-gene approach to DCI has proved of interest in humans. We hypothesized that the whole genome expression profile of blood cells may be useful for better comprehension and prediction of aneurysmal DCI. Over a 35-month period, 218 patients with aneurysm rupture were included in this study. DCI was defined as the occurrence of a new delayed neurological deficit within 2 weeks after aneurysm rupture with evidence of ischemia either on perfusion-diffusion MRI, CT angiography or CT perfusion imaging, or with cerebral angiography. DCI patients were matched against controls based on 4 out of 5 criteria (age, sex, Fisher grade, aneurysm location and smoking status). Genome-wide expression analysis of blood cells obtained at admission was performed using long oligonucleotide microarrays representing 25,000 genes. For quantitative PCR, 1 µg of extracted total RNA was reverse-transcribed, and the resulting cDNA was diluted 10-fold before performing quantitative PCR. Microarray data were first analyzed by 'Significance Analysis of Microarrays' software which includes the Benjamini correction for multiple testing. In a second step, microarray data fold change was compared using a two-tailed, paired t test. Analysis of receiver-operating characteristic (ROC) curves and the area under the ROC curves were used for prediction analysis. Logistic regression models were used to investigate the additive value of multiple biomarkers. A total of 16 patients demonstrated DCI.
Significance Analysis of Microarrays software failed to retrieve significant genes, most probably because of the heterogeneity of the patients included in the microarray experiments and the small size of the DCI population sample. Standard two-tailed paired t test and C-statistic revealed significant associations between gene expression and the occurrence of DCI: in particular, the expression of neuroregulin 1 was 1.6-fold upregulated in patients with DCI (p = 0.01) and predicted DCI with an area under the ROC curve of 0.96. Logistic regression analyses revealed a significant association between neuroregulin 1 and DCI (odds ratio 1.46, 95% confidence interval 1.02-2.09, p = 0.02). This pilot study suggests that blood cells may be a reservoir of prognostic biomarkers of DCI in patients with intracranial aneurysm rupture. Despite an evident lack of power, this study elicited neuroregulin 1, a vasoreactivity-, inflammation- and angiogenesis-related gene, as a possible candidate predictor of DCI. Larger cohort studies are needed but genome-wide microarray-based studies are promising research tools for the understanding of DCI after intracranial aneurysm rupture. © 2013 S. Karger AG, Basel.
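The area under the ROC curve reported above can be read as the probability that a randomly chosen DCI patient has a higher expression value than a randomly chosen control. The following is a minimal sketch of that calculation using the Mann-Whitney formulation; the function name and the expression values are invented for illustration and are not data from the study.

```python
def roc_auc(scores, labels):
    """Area under the ROC curve via the Mann-Whitney U statistic.

    Equals the fraction of (case, control) pairs in which the case has the
    higher score, counting ties as one half.
    """
    cases = [s for s, y in zip(scores, labels) if y == 1]
    controls = [s for s, y in zip(scores, labels) if y == 0]
    if not cases or not controls:
        raise ValueError("need at least one case and one control")
    wins = 0.0
    for c in cases:
        for k in controls:
            if c > k:
                wins += 1.0
            elif c == k:
                wins += 0.5
    return wins / (len(cases) * len(controls))


# Made-up expression values (arbitrary units); 1 = DCI, 0 = no DCI.
expression = [2.9, 3.4, 3.1, 1.8, 2.0, 3.0, 1.5, 2.2]
has_dci = [1, 1, 1, 0, 0, 0, 0, 0]
auc = roc_auc(expression, has_dci)
```

With only a handful of cases, as in a 16-patient DCI group, a single reordered pair shifts the estimate noticeably, which is one reason the abstract calls for larger cohorts.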
DISSECTING THE GENETICS OF HUMAN HIGH MYOPIA: A MOLECULAR BIOLOGIC APPROACH
Young, Terri L
2004-01-01
ABSTRACT Purpose Despite the plethora of experimental myopia animal studies that demonstrate biochemical factor changes in various eye tissues, and limited human studies utilizing pharmacologic agents to thwart axial elongation, we have little knowledge of the basic physiology that drives myopic development. Identifying the implicated genes for myopia susceptibility will provide a fundamental molecular understanding of how myopia occurs and may lead to directed physiologic (ie, pharmacologic, gene therapy) interventions. The purpose of this proposal is to describe the results of positional candidate gene screening of selected genes within the autosomal dominant high-grade myopia-2 locus (MYP2) on chromosome 18p11.31. Methods A physical map of a contracted MYP2 interval was compiled, and gene expression studies in ocular tissues using complementary DNA library screens, microarray matches, and reverse-transcription techniques aided in prioritizing gene selection for screening. The TGIF, EMLIN-2, MLCB, and CLUL1 genes were screened in DNA samples from unrelated controls and in high-myopia affected and unaffected family members from the original seven MYP2 pedigrees. All candidate genes were screened by direct base pair sequence analysis. Results Consistent segregation of a gene sequence alteration (polymorphism) with myopia was not demonstrated in any of the seven families. Novel single nucleotide polymorphisms were found. Conclusion The positional candidate genes TGIF, EMLIN-2, MLCB, and CLUL1 are not associated with MYP2-linked high-grade myopia. Base change polymorphisms discovered with base sequence screening of these genes were submitted to an Internet database. Other genes that also map within the interval are currently undergoing mutation screening. PMID:15747770
Robinson, Joshua F; Port, Jesse A; Yu, Xiaozhong; Faustman, Elaine M
2010-10-01
To understand the complex etiology of developmental disorders, an understanding of both genetic and environmental risk factors is needed. Human and rodent genetic studies have identified a multitude of gene candidates for specific developmental disorders such as neural tube defects (NTDs). With the emergence of toxicogenomic-based assessments, scientists now also have the ability to compare and understand the expression of thousands of genes simultaneously across strain, time, and exposure in developmental models. Using a systems-based approach in which we are able to evaluate information from various parts and levels of the developing organism, we propose a framework for integrating genetic information with toxicogenomic-based studies to better understand gene-environmental interactions critical for developmental disorders. This approach has allowed us to characterize candidate genes in the context of variables critical for determining susceptibility such as strain, time, and exposure. Using a combination of toxicogenomic studies and complementary bioinformatic tools, we characterize NTD candidate genes during normal development by function (gene ontology), linked phenotype (disease outcome), location, and expression (temporally and strain-dependent). In addition, we show how environmental exposures (cadmium, methylmercury) can influence expression of these genes in a strain-dependent manner. Using NTDs as an example of developmental disorder, we show how simple integration of genetic information from previous studies into the standard microarray design can enhance analysis of gene-environment interactions to better define environmental exposure-disease pathways in sensitive and resistant mouse strains. © Wiley-Liss, Inc.
Sandhu, Maninder; Sureshkumar, V; Prakash, Chandra; Dixit, Rekha; Solanke, Amolkumar U; Sharma, Tilak Raj; Mohapatra, Trilochan; S V, Amitha Mithra
2017-09-30
Genome-wide microarray studies have enabled the development of robust databases for functional genomics studies in rice. However, such databases do not directly cater to the needs of breeders. Here, we have attempted to develop a web interface which combines the information from functional genomic studies across different genetic backgrounds with DNA markers so that they can be readily deployed in crop improvement. In the current version of the database, we have included drought and salinity stress studies since these two are the major abiotic stresses in rice. RiceMetaSys, a user-friendly and freely available web interface, provides comprehensive information on salt responsive genes (SRGs) and drought responsive genes (DRGs) across genotypes, crop development stages and tissues, identified from multiple microarray datasets. 'Physical position search' is an attractive tool for those using a QTL-based approach for dissecting tolerance to salt and drought stress since it can provide the list of SRGs and DRGs in any physical interval. To identify robust candidate genes for use in crop improvement, the 'common genes across varieties' search tool is useful. Graphical visualization of expression profiles across genes and rice genotypes has been enabled to assist the user and make comparisons more informative. Simple Sequence Repeat (SSR) search in the SRGs and DRGs is a valuable tool for fine mapping and marker assisted selection since it provides primers for surveying polymorphism. An external link to intron specific markers is also provided for this purpose. Bulk retrieval of data without any limit has been enabled for locus and SSR searches. The aim of this database is to provide users with simple and straightforward search options for identifying robust candidate genes from among thousands of SRGs and DRGs, so as to facilitate linking variation in expression profiles to variation in phenotype. Database URL: http://14.139.229.201.
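The 'physical position search' described above amounts to an interval-overlap query over annotated gene coordinates. The sketch below shows one way such a lookup could work; it is not the RiceMetaSys implementation, and the function name, record layout, gene names and coordinates are all hypothetical.

```python
def genes_in_interval(genes, chromosome, start, end):
    """Return names of genes overlapping [start, end] on a chromosome.

    Each gene is a tuple (name, chromosome, gene_start, gene_end) with
    inclusive base-pair coordinates. Results are ordered by gene start.
    """
    if start > end:
        raise ValueError("start must not exceed end")
    hits = [
        g for g in genes
        if g[1] == chromosome and g[2] <= end and g[3] >= start
    ]
    return [g[0] for g in sorted(hits, key=lambda g: g[2])]


# Hypothetical stress-responsive genes; names and positions are placeholders.
genes = [
    ("gene_a", "chr1", 1_000, 2_500),
    ("gene_b", "chr1", 9_000, 9_800),
    ("gene_c", "chr1", 4_800, 5_200),
    ("gene_d", "chr2", 1_200, 1_900),
]

# Genes overlapping a hypothetical QTL interval chr1:2,000-5,000.
in_qtl = genes_in_interval(genes, "chr1", 2_000, 5_000)
```

A gene that only partly overlaps the interval is still returned here; a stricter containment rule would be an equally reasonable choice for a QTL region with uncertain boundaries.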
Cuykendall, Tawny N.; Houston, Douglas W.
2011-01-01
RNA localization is a common mechanism for regulating cell structure and function. Localized RNAs in Xenopus oocytes are critical for early development, including germline specification by the germ plasm. Despite the importance of these localized RNAs, only approximately 25 have been identified and fewer are functionally characterized. Using microarrays, we identified a large set of localized RNAs from the vegetal cortex. Overall, our results indicate a minimum of 275 localized RNAs in oocytes, or 2–3% of maternal transcripts, which are in general agreement with previous findings. We further validated vegetal localization for 24 candidates and further characterized three genes expressed in the germ plasm. We identified novel germ plasm expression for reticulon 3.1, exd2 (a novel exonuclease-domain encoding gene), and a putative noncoding RNA. Further analysis of these and other localized RNAs will likely identify new functions of germ plasm and facilitate the identification of cis-acting RNA localization elements. PMID:20503379
Quinn, Patrick; Bowers, Robert M; Zhang, Xiaoyu; Wahlund, Thomas M; Fanelli, Michael A; Olszova, Daniela; Read, Betsy A
2006-08-01
Marine unicellular coccolithophore algae produce species-specific calcite scales otherwise known as coccoliths. While the coccoliths and their elaborate architecture have attracted the attention of investigators from various scientific disciplines, our knowledge of the underpinnings of the process of biomineralization in this alga is still in its infancy. The processes of calcification and coccolithogenesis are highly regulated and likely to be complex, requiring coordinated expression of many genes and pathways. In this study, we have employed cDNA microarrays to investigate changes in gene expression associated with biomineralization in the most abundant coccolithophorid, Emiliania huxleyi. Expression profiling of cultures grown under calcifying and noncalcifying conditions has been carried out using cDNA microarrays corresponding to approximately 2,300 expressed sequence tags. A total of 127 significantly up- or down-regulated transcripts were identified using a P value of 0.01 and a change of >2.0-fold. Real-time reverse transcriptase PCR was used to test the overall validity of the microarray data, as well as the relevance of many of the proteins predicted to be associated with biomineralization, including a novel gamma-class carbonic anhydrase (A. R. Soto, H. Zheng, D. Shoemaker, J. Rodriguez, B. A. Read, and T. M. Wahlund, Appl. Environ. Microbiol. 72:5500-5511, 2006). Differentially regulated genes include those related to cellular metabolism, ion channels, transport proteins, vesicular trafficking, and cell signaling. The putative function of the vast majority of candidate transcripts could not be defined. Nonetheless, the data described herein represent profiles of the transcription changes associated with biomineralization-related pathways in E. huxleyi and have identified novel and potentially useful targets for more detailed analysis. PMID:16885305
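The selection rule quoted above (a P value of 0.01 and a change of >2.0-fold) can be sketched as a simple two-condition filter. This is an illustration only: the abstract does not say whether the P value cutoff is strict, so `p < 0.01` is an assumption, and the function name, transcript IDs and values are invented.

```python
import math


def select_differential(records, p_cutoff=0.01, min_fold=2.0):
    """Keep transcripts passing both a P value and a fold-change cutoff.

    Each record is (transcript_id, fold_change, p_value), where fold_change
    is the positive ratio calcifying / noncalcifying. Working in log2 space
    treats a 2.5-fold increase (2.5) and a 2.5-fold decrease (0.4) alike.
    """
    log_cut = math.log2(min_fold)
    selected = []
    for tid, fold, p in records:
        if p < p_cutoff and abs(math.log2(fold)) > log_cut:
            selected.append((tid, "up" if fold > 1 else "down"))
    return selected


# Invented example values, not data from the study.
records = [
    ("est_001", 3.2, 0.002),  # passes: up-regulated
    ("est_002", 0.4, 0.004),  # passes: 2.5-fold down-regulated
    ("est_003", 2.5, 0.030),  # fails the P value cutoff
    ("est_004", 1.6, 0.001),  # fails the fold-change cutoff
    ("est_005", 0.5, 0.001),  # exactly 2-fold, so not ">2.0-fold"
]
selected = select_differential(records)
```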
Jung, Ki-Hong; Dardick, Christopher; Bartley, Laura E; Cao, Peijian; Phetsom, Jirapa; Canlas, Patrick; Seo, Young-Su; Shultz, Michael; Ouyang, Shu; Yuan, Qiaoping; Frank, Bryan C; Ly, Eugene; Zheng, Li; Jia, Yi; Hsia, An-Ping; An, Kyungsook; Chou, Hui-Hsien; Rocke, David; Lee, Geun Cheol; Schnable, Patrick S; An, Gynheung; Buell, C Robin; Ronald, Pamela C
2008-10-06
Studies of gene function are often hampered by gene-redundancy, especially in organisms with large genomes such as rice (Oryza sativa). We present an approach for using transcriptomics data to focus functional studies and address redundancy. To this end, we have constructed and validated an inexpensive and publicly available rice oligonucleotide near-whole genome array, called the rice NSF45K array. We generated expression profiles for light- vs. dark-grown rice leaf tissue and validated the biological significance of the data by analyzing sources of variation and confirming expression trends with reverse transcription polymerase chain reaction. We examined trends in the data by evaluating enrichment of gene ontology terms at multiple false discovery rate thresholds. To compare data generated with the NSF45K array with published results, we developed publicly available, web-based tools (www.ricearray.org). The Oligo and EST Anatomy Viewer enables visualization of EST-based expression profiling data for all genes on the array. The Rice Multi-platform Microarray Search Tool facilitates comparison of gene expression profiles across multiple rice microarray platforms. Finally, we incorporated gene expression and biochemical pathway data to reduce the number of candidate gene products putatively participating in the eight steps of the photorespiration pathway from 52 to 10, based on expression levels of putatively functionally redundant genes. We confirmed the efficacy of this method to cope with redundancy by correctly predicting participation in photorespiration of a gene with five paralogs. Applying these methods will accelerate rice functional genomics.
Cracking the genomic piggy bank: identifying secrets of the pig genome.
Mote, B E; Rothschild, M F
2006-01-01
Though researchers are uncovering valuable information about the pig genome at unprecedented speed, the porcine genome community is barely scratching the surface as to understanding interactions of the biological code. The pig genetic linkage map has nearly 5,000 loci comprised of genes, microsatellites, and amplified fragment length polymorphism markers. Likewise, the physical map is becoming denser with nearly 6,000 markers. The long awaited sequencing efforts are providing multidimensional benefits with sequence available for comparative genomics and identifying single nucleotide polymorphisms for use in linkage and trait association studies. Scientists are using exotic and commercial breeds for quantitative trait loci scans. Additionally, candidate gene studies continue to identify chromosomal regions or genes associated with economically important traits such as growth rate, leanness, feed intake, meat quality, litter size, and disease resistance. The commercial pig industry is actively incorporating these markers in marker-assisted selection along with traditional performance information to improve said traits. Researchers are utilizing novel tools including pig microarrays along with advanced bioinformatics to identify new candidate genes, understand gene function, and piece together gene networks involved in important biological processes. Advances in pig genomics and implications to the pork industry as well as human health are reviewed.
PGMapper: a web-based tool linking phenotype to genes.
Xiong, Qing; Qiu, Yuhui; Gu, Weikuan
2008-04-01
With the availability of whole genome sequence in many species, linkage analysis, positional cloning and microarray are gradually becoming powerful tools for investigating the links between phenotype and genotype or genes. However, in these methods, causative genes underlying a quantitative trait locus, or a disease, are usually located within a large genomic region or a large set of genes. Examining the function of every gene is very time consuming and requires retrieving and integrating information from multiple databases or genome resources. PGMapper is a software tool for automatically matching phenotype to genes from a defined genome region or a group of given genes by combining the mapping information from the Ensembl database and gene function information from the OMIM and PubMed databases. PGMapper is currently available for candidate gene search of human, mouse, rat, zebrafish and 12 other species. Available online at http://www.genediscovery.org/pgmapper/index.jsp.
Song, Minyan; He, Yanghua; Zhou, Huangkai; Zhang, Yi; Li, Xizhi; Yu, Ying
2016-07-14
Subclinical mastitis is a widely spread disease of lactating cows. Its major pathogen is Staphylococcus aureus (S. aureus). In this study, we performed genome-wide integrative analysis of DNA methylation and transcriptional expression to identify candidate genes and pathways relevant to bovine S. aureus subclinical mastitis. The genome-scale DNA methylation profiles of peripheral blood lymphocytes in cows with S. aureus subclinical mastitis (SA group) and healthy controls (CK) were generated by methylated DNA immunoprecipitation combined with microarrays. We identified 1078 differentially methylated genes in SA cows compared with the controls. By integrating DNA methylation and transcriptome data, 58 differentially methylated genes were shared with differently expressed genes, in which 20.7% distinctly hypermethylated genes showed down-regulated expression in SA versus CK, whereas 14.3% dramatically hypomethylated genes showed up-regulated expression. Integrated pathway analysis suggested that these genes were related to inflammation, ErbB signalling pathway and mismatch repair. Further functional analysis revealed that three genes, NRG1, MST1 and NAT9, were strongly correlated with the progression of S. aureus subclinical mastitis and could be used as powerful biomarkers for the improvement of bovine mastitis resistance. Our studies lay the groundwork for epigenetic modification and mechanistic studies on susceptibility of bovine mastitis. PMID:27411928
Bekiaris, Pavlos Stephanos; Tekath, Tobias; Staiger, Dorothee; Danisman, Selahattin
2018-01-01
Understanding the effect of cis-regulatory elements (CRE) and clusters of CREs, which are called cis-regulatory modules (CRM), in eukaryotic gene expression is a challenge of computational biology. We developed two programs that allow simple, fast and reliable analysis of candidate CREs and CRMs that may affect specific gene expression and that determine positional features between individual CREs within a CRM. The first program, "Exploration of Distinctive CREs and CRMs" (EDCC), correlates candidate CREs and CRMs with specific gene expression patterns. For pairs of CREs, EDCC also determines positional preferences of the single CREs in relation to each other and to the transcriptional start site. The second program, "CRM Network Generator" (CNG), prioritizes these positional preferences using a neural network and thus allows unbiased rating of the positional preferences that were determined by EDCC. We tested these programs with data from a microarray study of circadian gene expression in Arabidopsis thaliana. Analyzing more than 1.5 million pairwise CRE combinations, we found 22 candidate combinations, of which several contained known clock promoter elements together with elements that had not been identified as relevant to circadian gene expression before. CNG analysis further identified positional preferences of these CRE pairs, hinting at positional information that may be relevant for circadian gene expression. Future wet lab experiments will have to determine which of these combinations confer daytime specific circadian gene expression.
Staiger, Dorothee
2018-01-01
Understanding the effect of cis-regulatory elements (CRE) and clusters of CREs, which are called cis-regulatory modules (CRM), in eukaryotic gene expression is a challenge of computational biology. We developed two programs that allow simple, fast and reliable analysis of candidate CREs and CRMs that may affect specific gene expression and that determine positional features between individual CREs within a CRM. The first program, “Exploration of Distinctive CREs and CRMs” (EDCC), correlates candidate CREs and CRMs with specific gene expression patterns. For pairs of CREs, EDCC also determines positional preferences of the single CREs in relation to each other and to the transcriptional start site. The second program, “CRM Network Generator” (CNG), prioritizes these positional preferences using a neural network and thus allows unbiased rating of the positional preferences that were determined by EDCC. We tested these programs with data from a microarray study of circadian gene expression in Arabidopsis thaliana. Analyzing more than 1.5 million pairwise CRE combinations, we found 22 candidate combinations, of which several contained known clock promoter elements together with elements that had not been identified as relevant to circadian gene expression before. CNG analysis further identified positional preferences of these CRE pairs, hinting at positional information that may be relevant for circadian gene expression. Future wet lab experiments will have to determine which of these combinations confer daytime specific circadian gene expression. PMID:29298348
Dorts, Jennifer; Richter, Catherine A.; Wright-Osment, Maureen K.; Ellersieck, Mark R.; Carter, Barbara J.; Tillitt, Donald E.
2009-01-01
We investigated the genomic transcriptional response of female fathead minnows (Pimephales promelas) to an acute (4 days) exposure to 0.1 or 1.0 µg/L of 17β-trenbolone (TB), the active metabolite of an anabolic androgenic steroid used as a growth promoter in cattle and a contaminant of concern in aquatic systems. Our objectives were to investigate the gene expression profile induced by TB, define biomarkers of exposure to TB, and increase our understanding of the mechanisms of adverse effects of TB on fish reproduction. In female gonad tissue, microarray analysis using a 22 K oligonucleotide microarray (EcoArray Inc., Gainesville, FL) showed 99 significantly upregulated genes and 741 significantly downregulated genes in response to 1 µg TB/L. In particular, hydroxysteroid (17β) dehydrogenase 12a (hsd17b12a), zona pellucida glycoprotein 2.2 (zp2.2), and protein inhibitor of activated STAT, 2 (pias2) were all downregulated in gonad. Q-PCR measurements in a larger sample set were consistent with the microarray results in the direction and magnitude of these changes in gene expression. However, several novel potential biomarkers were verified by Q-PCR in the same samples, but could not be validated in independent samples. In liver, Q-PCR measurements showed a significant decrease in vitellogenin 1 (vtg1) mRNA expression. In brain, cytochrome P450, family 19, subfamily A, polypeptide 1b (cyp19a1b, previously known as aromatase B) transcript levels were significantly reduced following TB exposure. Our study provides a candidate gene involved in mediating the action of TB, hsd17b12a, and two potential biomarkers sensitive to acute TB exposure, hepatic vtg1 and brain cyp19a1b.
Analysis of Gene Regulatory Networks of Maize in Response to Nitrogen.
Jiang, Lu; Ball, Graham; Hodgman, Charlie; Coules, Anne; Zhao, Han; Lu, Chungui
2018-03-08
Nitrogen (N) fertilizer has a major influence on the yield and quality. Understanding and optimising the response of crop plants to nitrogen fertilizer usage is of central importance in enhancing food security and agricultural sustainability. In this study, the analysis of gene regulatory networks reveals multiple genes and biological processes in response to N. Two microarray studies have been used to infer components of the nitrogen-response network. Since they used different array technologies, a map linking the two probe sets to the maize B73 reference genome has been generated to allow comparison. Putative Arabidopsis homologues of maize genes were used to query the Biological General Repository for Interaction Datasets (BioGRID) network, which yielded the potential involvement of three transcription factors (TFs) (GLK5, MADS64 and bZIP108) and a Calcium-dependent protein kinase. An Artificial Neural Network was used to identify influential genes and retrieved bZIP108 and WRKY36 as significant TFs in both microarray studies, along with genes for Asparagine Synthetase, a dual-specific protein kinase and a protein phosphatase. The output from one study also suggested roles for microRNA (miRNA) 399b and Nin-like Protein 15 (NLP15). Co-expression-network analysis of TFs with closely related profiles to known Nitrate-responsive genes identified GLK5, GLK8 and NLP15 as candidate regulators of genes repressed under low Nitrogen conditions, while bZIP108 might play a role in gene activation.
2014-01-01
Background Brassica vegetables contain a class of secondary metabolites, the glucosinolates (GS), whose specific degradation products determine the characteristic flavor and smell. While some of the respective degradation products of particular GS are recognized as health promoting substances for humans, recent studies also show evidence that namely the 1-methoxy-indol-3-ylmethyl GS might be deleterious by forming characteristic DNA adducts. Therefore, a deeper knowledge of aspects involved in the biosynthesis of indole GS is crucial to design vegetables with an improved secondary metabolite profile. Results Initially the leafy Brassica vegetable pak choi (Brassica rapa ssp. chinensis) was established as suitable tool to elicit very high concentrations of 1-methoxy-indol-3-ylmethyl GS by application of methyl jasmonate. Differentially expressed candidate genes were discovered in a comparative microarray analysis using the 2 × 104 K format Brassica Array and compared to available gene expression data from the Arabidopsis AtGenExpress effort. Arabidopsis knock out mutants of the respective candidate gene homologs were subjected to a comprehensive examination of their GS profiles and confirmed the exclusive involvement of polypeptide 4 of the cytochrome P450 monooxygenase subfamily CYP81F in 1-methoxy-indol-3-ylmethyl GS biosynthesis. Functional characterization of the two identified isoforms coding for CYP81F4 in the Brassica rapa genome was performed using expression analysis and heterologous complementation of the respective Arabidopsis mutant. Conclusions Specific differences discovered in a comparative microarray and glucosinolate profiling analysis enables the functional attribution of Brassica rapa ssp. chinensis genes coding for polypeptide 4 of the cytochrome P450 monooxygenase subfamily CYP81F to their metabolic role in indole glucosinolate biosynthesis. 
These newly identified Brassica genes will enable the development of genetic tools for breeding vegetables with improved GS composition in the near future. PMID:24886080
Qiu, Chongying; Cheng, Shuqun; Xia, Yinyin; Peng, Bin; Tang, Qian; Tu, Baijie
2011-11-18
Exposure of laboratory rats to benzo(a)pyrene (BaP), a highly lipophilic environmental contaminant that is widely dispersed in the environment and readily crosses the blood-brain barrier into the central nervous system, is associated with impaired learning and memory. The purpose of the research was to examine whether subchronic exposure to BaP affects spatial learning and memory, how it alters normal gene expression in the hippocampus, and which neurotransmitter receptor genes related to learning and memory can be selected as candidates. The Morris water maze (MWM) was used to evaluate behavioral differences between BaP-treated and vehicle-treated groups. To gain a better insight into the mechanism of BaP-induced neurotoxicity on learning and memory, we used whole genome oligo microarrays as well as Polymerase Chain Reaction (PCR) to assess the global impact on gene expression. Male Sprague-Dawley rats were intraperitoneally injected with 6.25 mg/kg of BaP or vehicle for 14 weeks. The results from the MWM test showed that rats treated with BaP exhibited significantly higher mean latencies as compared to vehicle controls. BaP exposure significantly decreased the number of platform crossings and the time spent in the target area. After the hippocampus was collected from each rat, total RNA was isolated. Microarray and PCR revealed that exposure to BaP affected mRNA expression of neurotransmitter receptors. The web tool DAVID was used to analyze the significantly enriched gene ontology (GO) and KEGG pathways in the differentially expressed genes. Analysis showed that the most significantly affected gene ontology category was behavior. Furthermore, the fourth highest significantly affected gene ontology category was learning and memory. 
KEGG molecular pathway analysis showed that "neuroactive ligand-receptor interaction" was affected by BaP with highest statistical significance, and 9 candidate neurotransmitter receptor genes involving learning and memory were selected out. Our results revealed a close link between behavioral changes and altered neurotransmitter receptor gene expression in BaP-treated rats.
Cody, N A L; Ouellet, V; Manderson, E N; Quinn, M C J; Filali-Mouhim, A; Tellis, P; Zietarska, M; Provencher, D M; Mes-Masson, A-M; Chevrette, M; Tonin, P N
2007-01-25
Multiple chromosome 3p tumor suppressor genes (TSG) have been proposed in the pathogenesis of ovarian cancer based on complex patterns of 3p loss. To attain functional evidence in support of TSGs and identify candidate regions, we applied a chromosome transfer method involving cell fusions of the tumorigenic OV90 human ovarian cancer cell line, monoallelic for 3p and an irradiated mouse cell line containing a human chromosome 3 in order to derive OV90 hybrids containing normal 3p fragments. The resulting hybrids showed complete or incomplete suppression of tumorigenicity in nude mouse xenograft assays, and varied in their ability to form colonies in soft agarose and three-dimensional spheroids in a manner consistent with alteration of their in vivo tumorigenic phenotypes. Expression microarray analysis identified a set of common differentially expressed genes, such as SPARC, DAB2 and VEGF, some of which have been shown to be implicated in ovarian cancer. Genotyping assays revealed that they harbored normal 3p fragments, some of which overlapped candidate TSG regions (3p25-p26, 3p24 and 3p14-pcen) identified previously in loss of heterozygosity analyses of ovarian cancers. However, only the 3p12-pcen region was acquired in common by all hybrids where expression microarray analysis identified differentially expressed genes. The correlation of 3p12-pcen transfer and tumor suppression with a concerted re-programming of the cellular transcriptome suggests that the putative TSG may have affected key underlying events in ovarian cancer.
2011-01-01
Background Understanding the genetic elements that contribute to key aspects of coffee biology will have an impact on future agronomical improvements for this economically important tree. During the past years, EST collections were generated in Coffee, opening the possibility to create new tools for functional genomics. Results The "PUCE CAFE" Project, organized by the scientific consortium NESTLE/IRD/CIRAD, has developed an oligo-based microarray using 15,721 unigenes derived from published coffee EST sequences mostly obtained from different stages of fruit development and leaves in Coffea Canephora (Robusta). Hybridizations for two independent experiments served to compare global gene expression profiles in three types of tissue matter (mature beans, leaves and flowers) in C. canephora as well as in the leaves of three different coffee species (C. canephora, C. eugenoides and C. arabica). Microarray construction, statistical analyses and validation by Q-PCR analysis are presented in this study. Conclusion We have generated the first 15 K coffee array during this PUCE CAFE project, granted by Génoplante (the French consortium for plant genomics). This new tool will help study functional genomics in a wide range of experiments on various plant tissues, such as analyzing bean maturation or resistance to pathogens or drought. Furthermore, the use of this array has proven to be valid in different coffee species (diploid or tetraploid), drastically enlarging its impact for high-throughput gene expression in the community of coffee research. PMID:21208403
Privat, Isabelle; Bardil, Amélie; Gomez, Aureliano Bombarely; Severac, Dany; Dantec, Christelle; Fuentes, Ivanna; Mueller, Lukas; Joët, Thierry; Pot, David; Foucrier, Séverine; Dussert, Stéphane; Leroy, Thierry; Journot, Laurent; de Kochko, Alexandre; Campa, Claudine; Combes, Marie-Christine; Lashermes, Philippe; Bertrand, Benoit
2011-01-05
Understanding the genetic elements that contribute to key aspects of coffee biology will have an impact on future agronomical improvements for this economically important tree. During the past years, EST collections were generated in Coffee, opening the possibility to create new tools for functional genomics. The "PUCE CAFE" Project, organized by the scientific consortium NESTLE/IRD/CIRAD, has developed an oligo-based microarray using 15,721 unigenes derived from published coffee EST sequences mostly obtained from different stages of fruit development and leaves in Coffea Canephora (Robusta). Hybridizations for two independent experiments served to compare global gene expression profiles in three types of tissue matter (mature beans, leaves and flowers) in C. canephora as well as in the leaves of three different coffee species (C. canephora, C. eugenoides and C. arabica). Microarray construction, statistical analyses and validation by Q-PCR analysis are presented in this study. We have generated the first 15 K coffee array during this PUCE CAFE project, granted by Génoplante (the French consortium for plant genomics). This new tool will help study functional genomics in a wide range of experiments on various plant tissues, such as analyzing bean maturation or resistance to pathogens or drought. Furthermore, the use of this array has proven to be valid in different coffee species (diploid or tetraploid), drastically enlarging its impact for high-throughput gene expression in the community of coffee research.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Ueda, Kohei; Fujiki, Katsunori; Shirahige, Katsuhiko
Highlights: • We define a target gene of MR as that with MR-binding to the adjacent region of DNA. • We use ChIP-seq analysis in combination with microarray. • We, for the first time, explore the genome-wide binding profile of MR. • We reveal 5 genes as the direct target genes of MR in the renal epithelial cell-line. - Abstract: Background and objective: Mineralocorticoid receptor (MR) is a member of nuclear receptor family proteins and contributes to fluid homeostasis in the kidney. Although aldosterone-MR pathway induces several gene expressions in the kidney, it is often unclear whether the gene expressions are accompanied by direct regulations of MR through its binding to the regulatory region of each gene. The purpose of this study is to identify the direct target genes of MR in a murine distal convoluted tubular epithelial cell-line (mDCT). Methods: We analyzed the DNA samples of mDCT cells overexpressing 3xFLAG-hMR after treatment with 10⁻⁷ M aldosterone for 1 h by chromatin immunoprecipitation with deep-sequence (ChIP-seq) and mRNA of the cell-line with treatment of 10⁻⁷ M aldosterone for 3 h by microarray. Results: 3xFLAG-hMR overexpressed in mDCT cells accumulated in the nucleus in response to 10⁻⁹ M aldosterone. Twenty-five genes were indicated as the candidate target genes of MR by ChIP-seq and microarray analyses. Five genes, Sgk1, Fkbp5, Rasl12, Tns1 and Tsc22d3 (Gilz), were validated as the direct target genes of MR by quantitative RT-qPCR and ChIP-qPCR. MR binding regions adjacent to Ctgf and Serpine1 were also validated. Conclusions: We, for the first time, captured the genome-wide distribution of MR in mDCT cells and, furthermore, identified five MR target genes in the cell-line. These results will contribute to further studies on the mechanisms of kidney diseases.
A hierarchical two-phase framework for selecting genes in cancer datasets with a neuro-fuzzy system.
Lim, Jongwoo; Wang, Bohyun; Lim, Joon S
2016-04-29
Finding the minimum number of appropriate biomarkers for specific targets such as a lung cancer has been a challenging issue in bioinformatics. We propose a hierarchical two-phase framework for selecting appropriate biomarkers that extracts candidate biomarkers from the cancer microarray datasets and then selects the minimum number of appropriate biomarkers from the extracted candidate biomarkers datasets with a specific neuro-fuzzy algorithm, which is called a neural network with weighted fuzzy membership function (NEWFM). In this context, as the first phase, the proposed framework is to extract candidate biomarkers by using a Bhattacharyya distance method that measures the similarity of two discrete probability distributions. Finally, the proposed framework is able to reduce the cost of finding biomarkers by not receiving medical supplements and improve the accuracy of the biomarkers in specific cancer target datasets.
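The first phase described above ranks genes with the Bhattacharyya distance between two discrete probability distributions. A minimal sketch of that measure follows; it assumes each gene's expression values have already been binned into normalized histograms for the two classes, and the function name is illustrative rather than taken from the paper. It does not cover the NEWFM second phase.

```python
import math


def bhattacharyya_distance(p, q):
    """Bhattacharyya distance between two discrete distributions.

    p and q are sequences of bin probabilities over the same bins,
    each summing to 1 (an assumption of this sketch).
    """
    if len(p) != len(q):
        raise ValueError("distributions must have the same number of bins")
    # Bhattacharyya coefficient: overlap between the two distributions.
    bc = sum(math.sqrt(pi * qi) for pi, qi in zip(p, q))
    if bc == 0.0:
        return math.inf  # no shared probability mass at all
    # Clamp to 1.0 so rounding error cannot yield a negative distance.
    return max(0.0, -math.log(min(bc, 1.0)))
```

A gene whose class-wise histograms overlap little gets a large distance and would be kept as a candidate biomarker; identical histograms give a distance of 0.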
Prediction of regulatory gene pairs using dynamic time warping and gene ontology.
Yang, Andy C; Hsu, Hui-Huang; Lu, Ming-Da; Tseng, Vincent S; Shih, Timothy K
2014-01-01
Selecting informative genes is the most important task for data analysis on microarray gene expression data. In this work, we aim at identifying regulatory gene pairs from microarray gene expression data. However, microarray data often contain multiple missing expression values. Missing value imputation is thus needed before further processing for regulatory gene pairs becomes possible. We develop a novel approach to first impute missing values in microarray time series data by combining k-Nearest Neighbour (KNN), Dynamic Time Warping (DTW) and Gene Ontology (GO). After missing values are imputed, we then perform gene regulation prediction based on our proposed DTW-GO distance measurement of gene pairs. Experimental results show that our approach is more accurate when compared with existing missing value imputation methods on real microarray data sets. Furthermore, our approach can also discover more regulatory gene pairs that are known in the literature than other methods.
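Dynamic Time Warping, one of the building blocks named above, aligns two time series that may be shifted or stretched relative to each other. The sketch below is the textbook dynamic-programming form with an absolute-difference local cost; it is an illustration only and does not include the Gene Ontology term or the KNN imputation step that the authors combine with it.

```python
def dtw_distance(a, b):
    """Classic DTW distance between two numeric sequences.

    Uses |x - y| as the local cost and allows match, insertion and
    deletion steps; no warping-window constraint is applied.
    """
    n, m = len(a), len(b)
    inf = float("inf")
    # cost[i][j] = minimal cumulative cost of aligning a[:i] with b[:j]
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            local = abs(a[i - 1] - b[j - 1])
            cost[i][j] = local + min(
                cost[i - 1][j],      # a[i-1] repeats against b[j-1]
                cost[i][j - 1],      # b[j-1] repeats against a[i-1]
                cost[i - 1][j - 1],  # one-to-one step
            )
    return cost[n][m]
```

Two expression profiles with the same shape but a delayed response, such as `[1, 2, 3]` and `[1, 2, 2, 3]`, have a DTW distance of 0, which is why the measure suits regulator and target genes whose profiles are offset in time.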
Gene selection for microarray data classification via subspace learning and manifold regularization.
Tang, Chang; Cao, Lijuan; Zheng, Xiao; Wang, Minhui
2017-12-19
With the rapid development of DNA microarray technology, a large amount of genomic data has been generated. Classification of these microarray data is a challenging task since gene expression data often contain thousands of genes but only a small number of samples. In this paper, an effective gene selection method is proposed to select the best subset of genes for microarray data with the irrelevant and redundant genes removed. Compared with original data, the selected gene subset can benefit the classification task. We formulate the gene selection task as a manifold regularized subspace learning problem. In detail, a projection matrix is used to project the original high dimensional microarray data into a lower dimensional subspace, with the constraint that the original genes can be well represented by the selected genes. Meanwhile, the local manifold structure of original data is preserved by a Laplacian graph regularization term on the low-dimensional data space. The projection matrix can serve as an importance indicator of different genes. An iterative update algorithm is developed for solving the problem. Experimental results on six publicly available microarray datasets and one clinical dataset demonstrate that the proposed method performs better when compared with other state-of-the-art methods in terms of microarray data classification.
Findeisen, Peter; Röckel, Matthias; Nees, Matthias; Röder, Christian; Kienle, Peter; Von Knebel Doeberitz, Magnus; Kalthoff, Holger; Neumaier, Michael
2008-11-01
The presence of tumor cells in peripheral blood is being regarded increasingly as a clinically relevant prognostic factor for colorectal cancer patients. Current molecular methods are very sensitive but due to low specificity their diagnostic value is limited. This study was undertaken in order to systematically identify and validate new colorectal cancer (CRC) marker genes for improved detection of minimal residual disease in peripheral blood mononuclear cells of colorectal cancer patients. Marker genes with upregulated gene expression in colorectal cancer tissue and cell lines were identified using microarray experiments and publicly available gene expression data. A systematic iterative approach was used to reduce a set of 346 candidate genes, reportedly associated with CRC, to a selection of candidate genes that were then further validated by relative quantitative real-time RT-PCR. Analytical sensitivity of RT-PCR assays was determined by spiking experiments with CRC cells. Diagnostic sensitivity as well as specificity was tested on a group of 18 CRC patients compared to 12 individuals without malignant disease. From a total of 346 screened genes, only serine (or cysteine) proteinase inhibitor, clade B (ovalbumin), member 5 (SERPINB5) showed significantly elevated transcript levels in peripheral venous blood specimens of tumor patients when compared to the nonmalignant control group. These results were confirmed by analysis of an enlarged collective consisting of 63 CRC patients and 36 control individuals without malignant disease. In conclusion, SERPINB5 seems to be a promising marker for detection of circulating tumor cells in peripheral blood of colorectal cancer patients.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Lee, Seoung Hoon; Kim, Taesoo; Park, Eui-Soon
2008-05-02
Bone homeostasis is tightly regulated by the balanced actions of osteoblasts (OBs) and osteoclasts (OCs). We previously analyzed the gene expression profile of OC differentiation using a cDNA microarray, and identified a novel osteoclastogenic gene candidate, clone OCL-1-E7 [J. Rho, C.R. Altmann, N.D. Socci, L. Merkov, N. Kim, H. So, O. Lee, M. Takami, A.H. Brivanlou, Y. Choi, Gene expression profiling of osteoclast differentiation by combined suppression subtractive hybridization (SSH) and cDNA microarray analysis, DNA Cell Biol. 21 (2002) 541-549]. In this study, we have isolated full-length cDNAs corresponding to this clone from mice and humans to determine the functional roles of this gene in osteoclastogenesis. The full-length cDNA of OCL-1-E7 encodes 12 membrane-spanning domains that are typical of isoforms of the Na⁺/H⁺ exchangers (NHEs), indicating that this clone is a novel member of the NHE family (hereafter referred to as NHE10). Here, we show that NHE10 is highly expressed in OCs in response to receptor activator of nuclear factor-κB ligand signaling and is required for OC differentiation and survival.
Musser, Richard O.; Hum-Musser, Sue M.; Gallucci, Matthew; DesRochers, Brittany; Brown, Judith K.
2014-01-01
Plants are routinely exposed to biotic and abiotic stresses to which they have evolved by synthesizing constitutive and induced defense compounds. Induced defense compounds are usually made, initially, at low levels; however, following further stimulation by specific kinds of biotic and abiotic stresses, they can be synthesized in relatively large amounts to abate the particular stress. cDNA microarray hybridization was used to identify an array of genes that were differentially expressed in tomato plants 15 d after they were exposed to feeding by nonviruliferous whiteflies or by viruliferous whiteflies carrying Pepper golden mosaic virus (PepGMV) (Begomovirus, Geminiviridae). Tomato plants inoculated by viruliferous whiteflies developed symptoms characteristic of PepGMV, whereas plants exposed to nonviruliferous whitefly feeding or nonwounded (negative) control plants exhibited no disease symptoms. The microarray analysis yielded over 290 spotted probes, with significantly altered expression of 161 putative annotated gene targets, and 129 spotted probes of unknown identities. The majority of the differentially regulated “known” genes were associated with the plants exposed to viruliferous compared with nonviruliferous whitefly feeding. Overall, significant differences in gene expression were represented by major physiological functions including defense-, pathogen-, photosynthesis-, and signaling-related responses and were similar to genes identified for other insect–plant systems. Viruliferous whitefly-stimulated gene expression was validated by real-time quantitative polymerase chain reaction of selected, representative candidate genes (messenger RNA): arginase, dehydrin, pathogenesis-related proteins 1 and -4, polyphenol oxidase, and several protease inhibitors. 
This is the first comparative profiling of the expression of tomato plants portraying different responses to biotic stress induced by viruliferous whitefly feeding (with resultant virus infection) compared with whitefly feeding only and negative control nonwounded plants exposed to neither. These results may be applicable to many other plant–insect–pathogen system interactions. PMID:25525099
Teaching bioinformatics and neuroinformatics by using free web-based tools.
Grisham, William; Schottler, Natalie A; Valli-Marill, Joanne; Beck, Lisa; Beatty, Jackson
2010-01-01
This completely computer-based module's purpose is to introduce students to bioinformatics resources. We present an easy-to-adopt module that weaves together several important bioinformatic tools so students can grasp how these tools are used in answering research questions. Students integrate information gathered from websites dealing with anatomy (Mouse Brain Library), quantitative trait locus analysis (WebQTL from GeneNetwork), bioinformatics and gene expression analyses (University of California, Santa Cruz Genome Browser, National Center for Biotechnology Information's Entrez Gene, and the Allen Brain Atlas), and information resources (PubMed). Instructors can use these various websites in concert to teach genetics from the phenotypic level to the molecular level, aspects of neuroanatomy and histology, statistics, quantitative trait locus analysis, and molecular biology (including in situ hybridization and microarray analysis), and to introduce bioinformatic resources. Students use these resources to discover 1) the region(s) of chromosome(s) influencing the phenotypic trait, 2) a list of candidate genes-narrowed by expression data, 3) the in situ pattern of a given gene in the region of interest, 4) the nucleotide sequence of the candidate gene, and 5) articles describing the gene. Teaching materials such as a detailed student/instructor's manual, PowerPoints, sample exams, and links to free Web resources can be found at http://mdcune.psych.ucla.edu/modules/bioinformatics.
Contributions to Statistical Problems Related to Microarray Data
ERIC Educational Resources Information Center
Hong, Feng
2009-01-01
Microarray is a high-throughput technology for measuring gene expression. Analysis of microarray data brings many interesting and challenging problems. This thesis consists of three studies related to microarray data. First, we propose a Bayesian model for microarray data and use Bayes Factors to identify differentially expressed genes. Second, we…
Mining biological databases for candidate disease genes
NASA Astrophysics Data System (ADS)
Braun, Terry A.; Scheetz, Todd; Webster, Gregg L.; Casavant, Thomas L.
2001-07-01
The publicly-funded effort to sequence the complete nucleotide sequence of the human genome, the Human Genome Project (HGP), has currently produced more than 93% of the 3 billion nucleotides of the human genome into a preliminary `draft' format. In addition, several valuable sources of information have been developed as direct and indirect results of the HGP. These include the sequencing of model organisms (rat, mouse, fly, and others), gene discovery projects (ESTs and full-length), and new technologies such as expression analysis and resources (micro-arrays or gene chips). These resources are invaluable for the researchers identifying the functional genes of the genome that transcribe and translate into the transcriptome and proteome, both of which potentially contain orders of magnitude more complexity than the genome itself. Preliminary analyses of this data identified approximately 30,000 - 40,000 human `genes.' However, the bulk of the effort still remains -- to identify the functional and structural elements contained within the transcriptome and proteome, and to associate function in the transcriptome and proteome to genes. A fortuitous consequence of the HGP is the existence of hundreds of databases containing biological information that may contain relevant data pertaining to the identification of disease-causing genes. The task of mining these databases for information on candidate genes is a commercial application of enormous potential. We are developing a system to acquire and mine data from specific databases to aid our efforts to identify disease genes. A high-speed cluster of Linux workstations is used to analyze sequence and perform distributed sequence alignments as part of our data mining and processing. This system has been used to mine GeneMap99 sequences within specific genomic intervals to identify potential candidate disease genes associated with Bardet-Biedl Syndrome (BBS).
A microarray analysis of retinal transcripts that are controlled by image contrast in mice.
Brand, Christine; Schaeffel, Frank; Feldkaemper, Marita Pauline
2007-06-18
The development of myopia is controlled by still largely unknown retinal signals. The aim of this study was to investigate the changes in retinal mRNA expression after different periods of visual deprivation in mice, while controlling for retinal illuminance. Each group consisted of three male C57BL/6 mice. Treatment periods were 30 min, 4 h, and 6+6 h. High spatial frequencies were filtered from the retinal image by frosted diffusers over one eye while the fellow eyes were covered by clear neutral density (ND) filters that exhibited similar light attenuating properties (0.1 log units) as the diffusers. For the final 30 min of the respective treatment period mice were individually placed in a clear Perspex cylinder that was positioned in the center of a rotating (60 degrees) large drum. The inside of the drum was covered with a 0.1 cyc/degree vertical square wave grating. This visual environment was chosen to standardize illuminances and contrasts seen by the mice. Labeled cRNA was prepared and hybridized to Affymetrix GeneChip Mouse Genome 430 2.0 arrays. Alterations in mRNA expression levels of candidate genes with potential biological relevance were confirmed by semi-quantitative real-time reverse transcription polymerase chain reaction (RT-PCR). In all groups, Egr-1 mRNA expression was reduced in diffuser-treated eyes. Furthermore, the degradation of the spatial frequency spectrum also changed the cFos mRNA level, with reduced expression after 4 h of diffuser treatment. Other interesting candidates were Akt2, which was up-regulated after 30 min of deprivation and Mapk8ip3, a neuron specific JNK binding and scaffolding protein that was temporally regulated in the diffuser-treated eyes only. The microarray analysis demonstrated a pattern of differential transcriptional changes, even though differences in the retinal images were restricted to spatial features. 
The candidate genes may provide further insight into the biochemical short-term changes following retinal image degradation in mice. Because deprivation of spatial vision leads to increased eye growth and myopia in both animals and humans, it is believed some of the identified genes play a role in myopia development.
Wang, Shih-Han; Cheng, Chuen-Yu; Tang, Pin-Chi; Chen, Chih-Feng; Chen, Hsin-Hsin; Lee, Yen-Pai; Huang, San-Yuan
2013-01-15
Acute heat stress affects genes involved in spermatogenesis in mammals. However, there is apparently no elaborate research on the effects of acute heat stress on gene expression in avian testes. The purpose of this study was to investigate global gene expression in testes of the L2 strain of Taiwan country chicken after acute heat stress. Twelve roosters, 45 weeks old, were allocated into four groups, including control roosters kept at 25 °C, roosters subjected to 38 °C acute heat stress for 4 hours without recovery, with 2-hour recovery, and with 6-hour recovery, respectively. Testis samples were collected for RNA isolation and microarray analysis. Based on gene expression profiles, 169 genes were upregulated and 140 genes were downregulated after heat stress using a cutoff value of twofold or greater change. Based on gene ontology analysis, differentially expressed genes were mainly related to response to stress, transport, signal transduction, and metabolism. A functional network analysis displayed that heat shock protein genes and related chaperones were the major upregulated groups in chicken testes after acute heat stress. A quantitative real-time polymerase chain reaction analysis of mRNA expressions of HSP70, HSP90AA1, BAG3, SERPINB2, HSP25, DNAJA4, CYP3A80, CIRBP, and TAGLN confirmed the results of the microarray analysis. Because the HSP genes (HSP25, HSP70, and HSP90AA1) and the antiapoptotic BAG3 gene were dramatically altered in heat-stressed chicken testes, we concluded that these genes were important factors in the avian testes under acute heat stress. Whether these genes could be candidate genes for thermotolerance in roosters requires further investigation. Copyright © 2013 Elsevier Inc. All rights reserved.
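The twofold cutoff mentioned in the abstract above can be illustrated with a minimal sketch. The function name, gene labels, and intensity values are hypothetical and chosen only for illustration; they are not data from the study, and the authors' actual analysis pipeline is not described in the abstract.

```python
import math

def classify_by_fold_change(control, treated, cutoff=2.0):
    """Label each gene 'up', 'down', or 'unchanged' using a fold-change cutoff."""
    threshold = math.log2(cutoff)
    labels = {}
    for gene, base in control.items():
        # Work on the log2 scale so up- and down-regulation are symmetric.
        log_ratio = math.log2(treated[gene] / base)
        if log_ratio >= threshold:
            labels[gene] = "up"
        elif log_ratio <= -threshold:
            labels[gene] = "down"
        else:
            labels[gene] = "unchanged"
    return labels

# Hypothetical intensities (arbitrary units), not values from the study.
control = {"geneA": 100.0, "geneB": 80.0, "geneC": 50.0}
treated = {"geneA": 450.0, "geneB": 30.0, "geneC": 60.0}
labels = classify_by_fold_change(control, treated)
print(labels)
```

Working on the log2 scale means a gene at half its control level is treated the same way as a gene at twice its control level, which is the usual reading of a "twofold or greater change" criterion.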
Emerging Use of Gene Expression Microarrays in Plant Physiology
Wullschleger, Stan D.; Difazio, Stephen P.
2003-01-01
Microarrays have become an important technology for the global analysis of gene expression in humans, animals, plants, and microbes. Implemented in the context of a well-designed experiment, cDNA and oligonucleotide arrays can provide high-throughput, simultaneous analysis of transcript abundance for hundreds, if not thousands, of genes. However, despite widespread acceptance, the use of microarrays as a tool to better understand processes of interest to the plant physiologist is still being explored. To help illustrate current uses of microarrays in the plant sciences, several case studies that we believe demonstrate the emerging application of gene expression arrays in plant physiology were selected from among the many posters and presentations at the 2003 Plant and Animal Genome XI Conference. Based on this survey, microarrays are being used to assess gene expression in plants exposed to the experimental manipulation of air temperature, soil water content and aluminium concentration in the root zone. Analysis often includes characterizing transcript profiles for multiple post-treatment sampling periods and categorizing genes with common patterns of response using hierarchical clustering techniques. In addition, microarrays are also providing insights into developmental changes in gene expression associated with fibre and root elongation in cotton and maize, respectively. Technical and analytical limitations of microarrays are discussed and projects attempting to advance areas of microarray design and data analysis are highlighted. Finally, although much work remains, we conclude that microarrays are a valuable tool for the plant physiologist interested in the characterization and identification of individual genes and gene families with potential application in the fields of agriculture, horticulture and forestry.
Satapathy, Lopamudra; Singh, Dharmendra; Ranjan, Prashant; Kumar, Dhananjay; Kumar, Manish; Prabhu, Kumble Vinod; Mukhopadhyay, Kunal
2014-12-01
WRKY, a plant-specific transcription factor family, has important roles in pathogen defense, abiotic cues and phytohormone signaling, yet little is known about their roles and molecular mechanism of function in response to rust diseases in wheat. We identified 100 TaWRKY sequences using the wheat Expressed Sequence Tag database, of which 22 WRKY sequences were novel. Identified proteins were characterized based on their zinc finger motifs, and phylogenetic analysis clustered them into six clades consisting of class IIc and class III WRKY proteins. Functional annotation revealed major functions in metabolic and cellular processes in control plants, and in response to stimuli, signaling and defense in pathogen-inoculated plants, their major molecular function being binding to DNA. Tag-based expression analysis of the identified genes revealed differential expression between mock and Puccinia triticina inoculated wheat near isogenic lines. Gene expression analysis was also performed with six rust-related microarray experiments from the Gene Expression Omnibus database. TaWRKY10, 15, 17 and 56 were common to both tag-based and microarray-based differential expression analysis and may represent rust-specific WRKY genes. The obtained results provide insight into the functional characterization of WRKY transcription factors responsive to leaf rust pathogenesis that can be used as candidate genes in molecular breeding programs to improve biotic stress tolerance in wheat.
LING, SHIZHANG; RETTIG, ELENI M.; TAN, MARIETTA; CHANG, XIAOFEI; WANG, ZHIMING; BRAIT, MARIANA; BISHOP, JUSTIN A.; FERTIG, ELANA J.; CONSIDINE, MICHAEL; WICK, MICHAEL J.; HA, PATRICK K.
2016-01-01
Salivary gland adenoid cystic carcinoma (ACC) is a rare head and neck malignancy without molecular biomarkers that can be used to predict the chemotherapeutic response or prognosis of ACC. The regulation of gene expression of oncogenes and tumor suppressor genes (TSGs) through DNA promoter methylation may play a role in the carcinogenesis of ACC. To identify differentially methylated genes in ACC, a global demethylating agent, 5-aza-2′-deoxycytidine (5-AZA) was utilized to unmask putative TSG silencing in ACC xenograft models in mice. Fresh xenografts were passaged, implanted in triplicate in mice that were treated with 5-AZA daily for 28 days. These xenografts were then evaluated for genome-wide DNA methylation patterns using the Illumina Infinium HumanMethylation27 BeadChip array. Validation of the 32 candidate genes was performed by bisulfite sequencing (BS-seq) in a separate cohort of 6 ACC primary tumors and 6 normal control salivary gland tissues. Hypermethylation was identified in the HCN2 gene promoter in all 6 control tissues, but hypomethylation was found in all 6 ACC tumor tissues. Quantitative validation of HCN2 promoter methylation level in the region detected by BS-seq was performed in a larger cohort of primary tumors (n=32) confirming significant HCN2 hypomethylation in ACCs compared with normal samples (n=10; P=0.04). HCN2 immunohistochemical staining was performed on an ACC tissue microarray. HCN2 staining intensity and H-score, but not percentage of the positively stained cells, were significantly stronger in normal tissues than those of ACC tissues. With our novel screening and sequencing methods, we identified several gene candidates that were methylated. The most significant of these genes, HCN2, was actually hypomethylated in tumors. However, promoter methylation status does not appear to be a major determinant of HCN2 expression in normal and ACC tissues. 
HCN2 hypomethylation is a biomarker of ACC and may play an important role in the carcinogenesis of ACC. PMID:27212063
2015-01-01
Cancer is a disease characterized largely by the accumulation of out-of-control somatic mutations during the lifetime of a patient. Distinguishing driver mutations from passenger mutations has posed a challenge in modern cancer research. With the advance of microarray experiments and clinical studies, a large number of candidate cancer genes have been extracted, and distinguishing the informative genes among them is essential. We propose a framework for finding informative cancer genes using mutation data from ovarian cancers. In our model we use the patient gene mutation profile, gene expression data and a gene-gene interaction network to construct a graphical representation of genes and patients. Markov processes for mutations and patients are run separately. After this process, cancer genes are prioritized automatically by examining their scores in the eigenvector of the stationary distribution. Extensive experiments demonstrate that the integration of heterogeneous sources of information is essential in finding important cancer genes. PMID:26328548
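The idea of ranking genes by their scores in the stationary distribution of a Markov process can be sketched as follows. This is a generic PageRank-style random walk on a small hypothetical gene network, not the authors' model, which couples separate processes for mutations and patients; the damping value, function name, and adjacency matrix are all assumptions made for illustration.

```python
import numpy as np

def stationary_scores(adjacency, damping=0.85, tol=1e-10, max_iter=1000):
    """Power iteration for the stationary distribution of a damped random walk."""
    adjacency = np.asarray(adjacency, dtype=float)
    n = adjacency.shape[0]
    col_sums = adjacency.sum(axis=0)
    safe_sums = np.where(col_sums > 0, col_sums, 1.0)
    # Column-normalize; a node with no edges jumps uniformly to any node.
    transition = np.where(col_sums > 0, adjacency / safe_sums, 1.0 / n)
    scores = np.full(n, 1.0 / n)
    for _ in range(max_iter):
        updated = damping * (transition @ scores) + (1.0 - damping) / n
        if np.abs(updated - scores).sum() < tol:
            return updated
        scores = updated
    return scores

# Hypothetical network: gene 0 interacts with genes 1, 2 and 3.
network = [
    [0, 1, 1, 1],
    [1, 0, 0, 0],
    [1, 0, 0, 0],
    [1, 0, 0, 0],
]
scores = stationary_scores(network)
ranking = [int(i) for i in np.argsort(-scores)]
print(ranking[0], scores.round(4))
```

The stationary vector is the leading eigenvector of the damped transition matrix, so sorting genes by it gives the prioritization. The damping term is included here only to guarantee convergence on periodic graphs such as this star-shaped example.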
Vuillaume, Marie-Laure; Naudion, Sophie; Banneau, Guillaume; Diene, Gwenaelle; Cartault, Audrey; Cailley, Dorothée; Bouron, Julie; Toutain, Jérôme; Bourrouillou, Georges; Vigouroux, Adeline; Bouneau, Laurence; Nacka, Fabienne; Kieffer, Isabelle; Arveiler, Benoit; Knoll-Gellida, Anja; Babin, Patrick J; Bieth, Eric; Jouret, Béatrice; Julia, Sophie; Sarda, Pierre; Geneviève, David; Faivre, Laurence; Lacombe, Didier; Barat, Pascal; Tauber, Maithé; Delrue, Marie-Ange; Rooryck, Caroline
2014-08-01
Syndromic obesity is defined by the association of obesity with one or more feature(s) including developmental delay, dysmorphic traits, and/or congenital malformations. Over 25 syndromic forms of obesity have been identified. However, most cases remain of unknown etiology. The aim of this study was to identify new candidate loci associated with syndromic obesity to find new candidate genes and to better understand molecular mechanisms involved in this pathology. We performed oligonucleotide microarray-based comparative genomic hybridization in a cohort of 100 children presenting with syndromic obesity of unknown etiology, after exhaustive clinical, biological, and molecular studies. Chromosomal copy number variations were detected in 42% of the children in our cohort, with 23% of patients with potentially pathogenic copy number variants. Our results support that chromosomal rearrangements are frequently associated with syndromic obesity with a variety of contributory genes having relevance to either obesity or developmental delay. A list of inherited or apparently de novo duplications and deletions including their enclosed genes and not previously linked to syndromic obesity was established. Proteins encoded by several of these genes are involved in lipid metabolism (ACOXL, MSMO1, MVD, and PDZK1) linked with nervous system function (BDH1 and LINGO2), neutral lipid storage (PLIN2), energy homeostasis and metabolic processes (CDH13, CNTNAP2, CPPED1, NDUFA4, PTGS2, and SOCS6). © 2014 Wiley Periodicals, Inc.
Marcolino-Gomes, Juliana; Rodrigues, Fabiana Aparecida; Fuganti-Pagliarini, Renata; Nakayama, Thiago Jonas; Ribeiro Reis, Rafaela; Bouças Farias, Jose Renato; Harmon, Frank G; Correa Molinari, Hugo Bruno; Correa Molinari, Mayla Daiane; Nepomuceno, Alexandre
2015-01-01
The soybean transcriptome displays strong variation along the day in optimal growth conditions and also in response to adverse circumstances, like drought stress. However, no study conducted to date has presented suitable reference genes, with stable expression along the day, for relative gene expression quantification in combined studies on drought stress and diurnal oscillations. Recently, water deficit responses have been associated with circadian clock oscillations at the transcription level, revealing the existence of hitherto unknown processes and increasing the demand for studies on plant responses to drought stress and its oscillation during the day. We performed data mining from a transcriptome-wide background using microarrays and RNA-seq databases to select an unpublished set of candidate reference genes, specifically chosen for the normalization of gene expression in studies on soybean under both drought stress and diurnal oscillations. Experimental validation and stability analysis in soybean plants submitted to drought stress and sampled during a 24 h timecourse showed that four of these newer reference genes (FYVE, NUDIX, Golgin-84 and CYST) indeed exhibited greater expression stability than the conventionally used housekeeping genes (ELF1-β and β-actin) under these conditions. We also demonstrated the effect of using reference candidate genes with different stability values to normalize the relative expression data from a drought-inducible soybean gene (DREB5) evaluated in different periods of the day.
Rao, Xiaolan; Shen, Hui; Pattathil, Sivakumar; Hahn, Michael G; Gelineo-Albersheim, Ivana; Mohnen, Debra; Pu, Yunqiao; Ragauskas, Arthur J; Chen, Xin; Chen, Fang; Dixon, Richard A
2017-01-01
Plant cell walls contribute the majority of plant biomass that can be used to produce transportation fuels. However, the complexity and variability in composition and structure of cell walls, particularly the presence of lignin, negatively impacts their deconstruction for bioenergy. Metabolic and genetic changes associated with secondary wall development in the biofuel crop switchgrass ( Panicum virgatum ) have yet to be reported. Our previous studies have established a cell suspension system for switchgrass, in which cell wall lignification can be induced by application of brassinolide (BL). We have now collected cell wall composition and microarray-based transcriptome profiles for BL-induced and non-induced suspension cultures to provide an overview of the dynamic changes in transcriptional reprogramming during BL-induced cell wall modification. From this analysis, we have identified changes in candidate genes involved in cell wall precursor synthesis, cellulose, hemicellulose, and pectin formation and ester-linkage generation. We have also identified a large number of transcription factors with expression correlated with lignin biosynthesis genes, among which are candidates for control of syringyl (S) lignin accumulation. Together, this work provides an overview of the dynamic compositional changes during brassinosteroid-induced cell wall remodeling, and identifies candidate genes for future plant genetic engineering to overcome cell wall recalcitrance.
GeneXplorer: an interactive web application for microarray data visualization and analysis.
Rees, Christian A; Demeter, Janos; Matese, John C; Botstein, David; Sherlock, Gavin
2004-10-01
When publishing large-scale microarray datasets, it is of great value to create supplemental websites where either the full data, or selected subsets corresponding to figures within the paper, can be browsed. We set out to create a CGI application containing many of the features of some of the existing standalone software for the visualization of clustered microarray data. We present GeneXplorer, a web application for interactive microarray data visualization and analysis in a web environment. GeneXplorer allows users to browse a microarray dataset in an intuitive fashion. It provides simple access to microarray data over the Internet and uses only HTML and JavaScript to display graphic and annotation information. It provides radar and zoom views of the data, allows display of the nearest neighbors to a gene expression vector based on their Pearson correlations and provides the ability to search gene annotation fields. The software is released under the permissive MIT Open Source license, and the complete documentation and the entire source code are freely available for download from CPAN http://search.cpan.org/dist/Microarray-GeneXplorer/.
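The nearest-neighbor display described above ranks genes by the Pearson correlation of their expression vectors with a query gene. A minimal sketch of that ranking step is given below; GeneXplorer itself is a CGI application, so this Python function, its name, and the toy expression matrix are illustrative assumptions and not part of the software.

```python
import numpy as np

def nearest_neighbors(expression, query_index, k=2):
    """Return the k genes most Pearson-correlated with the query gene."""
    # np.corrcoef treats each row (gene) as one variable.
    corr = np.corrcoef(np.asarray(expression, dtype=float))[query_index]
    order = [int(i) for i in np.argsort(-corr) if i != query_index]
    return [(i, float(corr[i])) for i in order[:k]]

# Hypothetical expression vectors: rows are genes, columns are arrays.
expression = [
    [1.0, 2.0, 3.0, 4.0],  # gene 0 (query)
    [2.0, 4.0, 6.0, 8.5],  # gene 1, rises with gene 0
    [4.0, 3.0, 2.0, 1.0],  # gene 2, anti-correlated with gene 0
    [1.0, 3.0, 2.0, 4.0],  # gene 3, partly correlated with gene 0
]
neighbors = nearest_neighbors(expression, query_index=0, k=2)
print(neighbors)
```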
Yang, N; Xie, W; Jones, CM; Bass, C; Jiao, X; Yang, X; Liu, B; Li, R; Zhang, Y
2013-01-01
Bemisia tabaci has developed high levels of resistance to many insecticides including the neonicotinoids and there is strong evidence that for some compounds resistance is stage-specific. To investigate the molecular basis of B. tabaci resistance to the neonicotinoid thiamethoxam we used a custom whitefly microarray to compare gene expression in the egg, nymph and adult stages of a thiamethoxam-resistant strain (TH-R) with a susceptible strain (TH-S). Gene ontology and bioinformatic analyses revealed that in all life stages many of the differentially expressed transcripts encoded enzymes involved in metabolic processes and/or metabolism of xenobiotics. Several of these are candidate resistance genes and include the cytochrome P450 CYP6CM1, which has been shown to confer resistance to several neonicotinoids previously, a P450 belonging to the Cytochrome P450s 4 family and a glutathione S-transferase (GST) belonging to the sigma class. Finally several ATP-binding cassette transporters of the ABCG subfamily were highly over-expressed in the adult stage of the TH-R strain and may play a role in resistance by active efflux. Here, we evaluated both common and stage-specific gene expression signatures and identified several candidate resistance genes that may underlie B. tabaci resistance to thiamethoxam. PMID:23889345
Goonesekere, Nalin C W; Andersen, Wyatt; Smith, Alex; Wang, Xiaosheng
2018-02-01
The lack of specific symptoms at early tumor stages, together with a high biological aggressiveness of the tumor contribute to the high mortality rate for pancreatic cancer (PC), which has a 5-year survival rate of about 7%. Recent failures of targeted therapies inhibiting kinase activity in clinical trials have highlighted the need for new approaches towards combating this deadly disease. In this study, we have identified genes that are significantly downregulated in PC, through a meta-analysis of large number of microarray datasets. We have used qRT-PCR to confirm the downregulation of selected genes in a panel of PC cell lines. This study has yielded several novel candidate tumor-suppressor genes (TSGs) including GNMT, CEL, PLA2G1B and SERPINI2. We highlight the role of GNMT, a methyl transferase associated with the methylation potential of the cell, and CEL, a lipase, as potential therapeutic targets. We have uncovered genetic links to risk factors associated with PC such as smoking and obesity. Genes important for patient survival and prognosis are also discussed, and we confirm the dysregulation of metabolic pathways previously observed in PC. While many of the genes downregulated in our dataset are associated with protein products normally produced by the pancreas for excretion, we have uncovered some genes whose downregulation appear to play a more causal role in PC. These genes will assist in providing a better understanding of the disease etiology of PC, and in the search for new therapeutic targets and biomarkers.
Le, Mai Q; Pagter, Majken; Hincha, Dirk K
2015-01-01
During cold acclimation plants increase in freezing tolerance in response to low non-freezing temperatures. This is accompanied by many physiological, biochemical and molecular changes that have been extensively investigated. In addition, plants of many species, including Arabidopsis thaliana, become more freezing tolerant during exposure to mild, non-damaging sub-zero temperatures after cold acclimation. There is hardly any information available about the molecular basis of this adaptation. Here, we have used microarrays and a qRT-PCR primer platform covering 1,880 genes encoding transcription factors (TFs) to monitor changes in gene expression in the Arabidopsis accessions Columbia-0, Rschew and Tenela during the first 3 days of sub-zero acclimation at -3 °C. The results indicate that gene expression during sub-zero acclimation follows a tightly controlled time-course. AP2/EREBP and WRKY TFs in particular may be important regulators of sub-zero acclimation, although the CBF signal transduction pathway seems to be less important during sub-zero than during cold acclimation. Globally, we estimate that approximately 5% of all Arabidopsis genes are regulated during sub-zero acclimation. Photosynthesis-related genes in particular are down-regulated, and genes belonging to the functional classes of cell wall biosynthesis, hormone metabolism and RNA regulation of transcription are up-regulated. Collectively, these data provide the first global analysis of gene expression during sub-zero acclimation and allow the identification of candidate genes for forward and reverse genetic studies into the molecular mechanisms of sub-zero acclimation.
Wu, Tao; Yang, Chunyan; Ding, Baoxu; Feng, Zhiming; Wang, Qian; He, Jun; Tong, Jianhua; Xiao, Langtao; Jiang, Ling; Wan, Jianmin
2016-02-01
Seed dormancy in rice is an important trait related to pre-harvest sprouting resistance. In order to understand the molecular mechanisms of seed dormancy, gene expression was investigated by transcriptome analysis using seeds of the strongly dormant cultivar N22 and its less dormant mutants Q4359 and Q4646 at 24 days after heading (DAH). Microarray data revealed more differentially expressed genes in Q4359 than in Q4646 compared to N22. Most genes differing between Q4646 and N22 also differed between Q4359 and N22. GO analysis of genes differentially expressed in both Q4359 and Q4646 revealed that some genes such as those for starch biosynthesis were repressed, whereas metabolic genes such as those for carbohydrate metabolism were enhanced in Q4359 and Q4646 seeds relative to N22. Expression of some genes involved in cell redox homeostasis and chromatin remodeling differed significantly only between Q4359 and N22. The results suggested a close correlation between cell redox homeostasis, chromatin remodeling and seed dormancy. In addition, some genes involved in ABA signaling were down-regulated, and several genes involved in GA biosynthesis and signaling were up-regulated. These observations suggest that reduced seed dormancy in Q4359 was regulated by ABA-GA antagonism. A few differentially expressed genes were located in the regions containing qSdn-1 and qSdn-5, suggesting that they could be candidate genes underlying seed dormancy. Our work provides useful leads for further determining the underlying mechanisms of seed dormancy and for cloning seed dormancy genes from N22. Copyright © 2015 Elsevier Masson SAS. All rights reserved.
Reliable pre-eclampsia pathways based on multiple independent microarray data sets.
Kawasaki, Kaoru; Kondoh, Eiji; Chigusa, Yoshitsugu; Ujita, Mari; Murakami, Ryusuke; Mogami, Haruta; Brown, J B; Okuno, Yasushi; Konishi, Ikuo
2015-02-01
Pre-eclampsia is a multifactorial disorder characterized by heterogeneous clinical manifestations. Gene expression profiling of preeclamptic placenta has provided different and even opposite results, partly due to data compromised by various experimental artefacts. Here we aimed to identify reliable pre-eclampsia-specific pathways using multiple independent microarray data sets. Gene expression data of control and preeclamptic placentas were obtained from Gene Expression Omnibus. Single-sample gene-set enrichment analysis was performed to generate gene-set activation scores of 9707 pathways obtained from the Molecular Signatures Database. Candidate pathways were identified by t-test-based screening using the data sets GSE10588, GSE14722 and GSE25906. Additionally, recursive feature elimination was applied to arrive at a further reduced set of pathways. To assess the validity of the pre-eclampsia pathways, a statistically-validated protocol was executed using five data sets including two other independent validation data sets, GSE30186 and GSE44711. Quantitative real-time PCR was performed for genes in a panel of potential pre-eclampsia pathways using placentas of 20 women with normal or severe preeclamptic singleton pregnancies (n = 10 each). A panel of ten pathways was found to discriminate women with pre-eclampsia from controls with high accuracy. Among these were pathways not previously associated with pre-eclampsia, such as the GABA receptor pathway, as well as pathways that have already been linked to pre-eclampsia, such as the glutathione and CDKN1C pathways. mRNA expression of GABRA3 (GABA receptor pathway), GCLC and GCLM (glutathione metabolic pathway), and CDKN1C was significantly reduced in the preeclamptic placentas. In conclusion, ten accurate and reliable pre-eclampsia pathways were identified based on multiple independent microarray data sets.
A pathway-based classification may be a worthwhile approach to elucidate the pathogenesis of pre-eclampsia. © The Author 2014. Published by Oxford University Press on behalf of the European Society of Human Reproduction and Embryology.
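The t-test-based screening of pathway activation scores described in this abstract can be sketched as below. This is an illustration under stated assumptions: the abstract does not give the test variant or cutoff, so the Welch statistic, the `min_abs_t` threshold, the function names, the pathway labels, and the scores are all hypothetical. The recursive feature elimination step is not shown.

```python
import numpy as np

def welch_t(scores_a, scores_b):
    """Per-pathway Welch t statistic; rows are pathways, columns are samples."""
    a = np.asarray(scores_a, dtype=float)
    b = np.asarray(scores_b, dtype=float)
    # Squared standard error of each group mean, using sample variances.
    se_a = a.var(axis=1, ddof=1) / a.shape[1]
    se_b = b.var(axis=1, ddof=1) / b.shape[1]
    return (a.mean(axis=1) - b.mean(axis=1)) / np.sqrt(se_a + se_b)

def screen_pathways(names, scores_a, scores_b, min_abs_t=3.0):
    """Keep pathways whose absolute t statistic reaches the (assumed) cutoff."""
    t_values = welch_t(scores_a, scores_b)
    return [name for name, t in zip(names, t_values) if abs(t) >= min_abs_t]

# Hypothetical activation scores for two pathways in four samples per group.
names = ["pathway_1", "pathway_2"]
control = [[0.10, 0.20, 0.15, 0.12], [0.50, 0.40, 0.60, 0.45]]
preeclampsia = [[0.60, 0.70, 0.65, 0.62], [0.48, 0.52, 0.41, 0.58]]
candidates = screen_pathways(names, control, preeclampsia)
print(candidates)
```

In practice a p-value with multiple-testing correction would normally replace the fixed threshold on the statistic; the fixed cutoff is used here only to keep the sketch dependency-light.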
Two Novel Determinants of Etoposide Resistance in Small Cell Lung Cancer
Lawson, Malcolm H; Cummings, Natalie M; Rassl, Doris M; Russell, Roslin; Brenton, James D; Rintoul, Robert C; Murphy, Gillian
2011-01-01
Patient survival in small cell lung cancer (SCLC) is limited by acquired chemoresistance. Here we report the use of a biologically relevant model to identify novel candidate genes mediating in vivo acquired resistance to etoposide. Candidate genes derived from a cDNA microarray analysis were cloned and transiently overexpressed to evaluate their potential functional roles. We identified two promising genes in the DNA repair enzyme DNA Polymerase β and in the neuroendocrine transcription factor NKX2.2. Specific inhibition of DNA Polymerase β reduced the numbers of cells surviving treatment with etoposide and increased the amount of DNA damage in cells. Conversely, stable overexpression of NKX2.2 increased cell survival in response to etoposide in SCLC cell lines. Consistent with these findings, we found that an absence of nuclear staining for NKX2.2 in SCLC primary tumors was an independent predictor of improved outcomes in chemotherapy-treated patients. Taken together, our findings justify future prospective studies to confirm the roles of these molecules in mediating chemotherapy resistance in SCLC. PMID:21642373
Importing MAGE-ML format microarray data into BioConductor.
Durinck, Steffen; Allemeersch, Joke; Carey, Vincent J; Moreau, Yves; De Moor, Bart
2004-12-12
The microarray gene expression markup language (MAGE-ML) is a widely used XML (eXtensible Markup Language) standard for describing and exchanging information about microarray experiments. It can describe microarray designs, microarray experiment designs, gene expression data and data analysis results. We describe RMAGEML, a new Bioconductor package that provides a link between cDNA microarray data stored in MAGE-ML format and the Bioconductor framework for preprocessing, visualization and analysis of microarray experiments. http://www.bioconductor.org. Open Source.
The Importance of Normalization on Large and Heterogeneous Microarray Datasets
DNA microarray technology is a powerful functional genomics tool increasingly used for investigating global gene expression in environmental studies. Microarrays can also be used in identifying biological networks, as they give insight on the complex gene-to-gene interactions, ne...
Plant-pathogen interactions: what microarray tells about it?
Lodha, T D; Basak, J
2012-01-01
Plant defense responses are mediated by elementary regulatory proteins that affect expression of thousands of genes. Over the last decade, microarray technology has played a key role in deciphering the underlying networks of gene regulation in plants that lead to a wide variety of defence responses. Microarray is an important tool to quantify and profile the expression of thousands of genes simultaneously, with two main aims: (1) gene discovery and (2) global expression profiling. Several microarray technologies are currently in use; most include a glass slide platform with spotted cDNA or oligonucleotides. Till date, microarray technology has been used in the identification of regulatory genes, end-point defence genes, to understand the signal transduction processes underlying disease resistance and its intimate links to other physiological pathways. Microarray technology can be used for in-depth, simultaneous profiling of host/pathogen genes as the disease progresses from infection to resistance/susceptibility at different developmental stages of the host, which can be done in different environments, for clearer understanding of the processes involved. A thorough knowledge of plant disease resistance using successful combination of microarray and other high throughput techniques, as well as biochemical, genetic, and cell biological experiments is needed for practical application to secure and stabilize yield of many crop plants. This review starts with a brief introduction to microarray technology, followed by the basics of plant-pathogen interaction, the use of DNA microarrays over the last decade to unravel the mysteries of plant-pathogen interaction, and ends with the future prospects of this technology.
Microarray analysis of potential genes in the pathogenesis of recurrent oral ulcer.
Han, Jingying; He, Zhiwei; Li, Kun; Hou, Lu
2015-01-01
Recurrent oral ulcer seriously threatens patients' daily life and health. This study investigated potential genes and pathways that participate in the pathogenesis of recurrent oral ulcer by high-throughput bioinformatic analysis. RT-PCR and Western blot were applied to further verify the effect of the screened interleukins. Recurrent oral ulcer related genes were collected from websites and papers, and further identified from Human Genome 280 6.0 microarray data. The pathways of recurrent oral ulcer related genes were obtained through chip hybridization. RT-PCR was applied to test four recurrent oral ulcer related genes to verify the microarray data. Data transformation, scatter plots, clustering analysis, and expression pattern analysis were used to analyze recurrent oral ulcer related gene expression changes. The recurrent oral ulcer gene microarray was successfully established. The microarray showed that 551 genes were involved in recurrent oral ulcer activity and 196 genes were recurrent oral ulcer related genes. Of these, 76 genes were up-regulated, 62 genes were down-regulated, and 58 genes were up-/down-regulated. In total, expression was up-regulated 752 times (60%) and down-regulated 485 times (40%). IL-2 plays an important role in the occurrence, development and recurrence of recurrent oral ulcer at the mRNA and protein levels. Gene microarrays can be used to analyze potential genes and pathways in recurrent oral ulcer. IL-2 may be involved in the pathogenesis of recurrent oral ulcer.
Analyzing gene perturbation screens with nested effects models in R and bioconductor.
Fröhlich, Holger; Beissbarth, Tim; Tresch, Achim; Kostka, Dennis; Jacob, Juby; Spang, Rainer; Markowetz, F
2008-11-01
Nested effects models (NEMs) are a class of probabilistic models introduced to analyze the effects of gene perturbation screens visible in high-dimensional phenotypes like microarrays or cell morphology. NEMs reverse engineer upstream/downstream relations of cellular signaling cascades. NEMs take as input a set of candidate pathway genes and phenotypic profiles of perturbing these genes. NEMs return a pathway structure explaining the observed perturbation effects. Here, we describe the package nem, open-source software to efficiently infer NEMs from data. Our software implements several search algorithms for model fitting and is applicable to a wide range of different data types and representations. The methods we present summarize the current state-of-the-art in NEMs. Our software is written in the R language and freely available via the Bioconductor project at http://www.bioconductor.org.
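The nesting idea behind NEMs (perturbing an upstream gene should reproduce the effects seen when a downstream gene is perturbed) can be illustrated with a toy subset check. The Python sketch below is deterministic and is not the probabilistic scoring or the search algorithms of the nem package; the gene names and effect sets are invented for illustration.

```python
from itertools import permutations

def infer_upstream_pairs(effects):
    """Return (upstream, downstream) pairs in which the downstream gene's
    observed effect set is a proper subset of the upstream gene's set."""
    pairs = []
    for a, b in permutations(sorted(effects), 2):
        if effects[b] < effects[a]:  # proper subset: b's effects nest inside a's
            pairs.append((a, b))
    return pairs

# Hypothetical perturbation effects (not taken from any real screen).
effects = {
    "geneA": {"e1", "e2", "e3", "e4"},
    "geneB": {"e2", "e3"},
    "geneC": {"e3"},
    "geneD": {"e5"},
}

pairs = infer_upstream_pairs(effects)
for upstream, downstream in pairs:
    print(f"{upstream} -> {downstream}")
```

Real screens are noisy, so exact subset relations rarely hold; that is why NEMs score candidate structures probabilistically rather than testing set inclusion directly.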
α-amanitin resistance in Drosophila melanogaster: A genome-wide association approach.
Mitchell, Chelsea L; Latuszek, Catrina E; Vogel, Kara R; Greenlund, Ian M; Hobmeier, Rebecca E; Ingram, Olivia K; Dufek, Shannon R; Pecore, Jared L; Nip, Felicia R; Johnson, Zachary J; Ji, Xiaohui; Wei, Hairong; Gailing, Oliver; Werner, Thomas
2017-01-01
We investigated the mechanisms of mushroom toxin resistance in the Drosophila Genetic Reference Panel (DGRP) fly lines, using genome-wide association studies (GWAS). While Drosophila melanogaster avoids mushrooms in nature, some lines are surprisingly resistant to α-amanitin, a toxin found solely in mushrooms. This resistance may represent a pre-adaptation, which might enable this species to invade the mushroom niche in the future. Although our previous microarray study had strongly suggested that pesticide-metabolizing detoxification genes confer α-amanitin resistance in a Taiwanese D. melanogaster line Ama-KTT, none of the traditional detoxification genes were among the top candidate genes resulting from the GWAS in the current study. Instead, we identified Megalin, Tequila, and widerborst as candidate genes underlying the α-amanitin resistance phenotype in the North American DGRP lines, all three of which are connected to the Target of Rapamycin (TOR) pathway. Both widerborst and Tequila are upstream regulators of TOR, and TOR is a key regulator of autophagy and Megalin-mediated endocytosis. We suggest that endocytosis and autophagy of α-amanitin, followed by lysosomal degradation of the toxin, is one of the mechanisms that confer α-amanitin resistance in the DGRP lines.
Genomic expression analysis of rat chromosome 4 for skeletal traits at femoral neck.
Alam, Imranul; Sun, Qiwei; Liu, Lixiang; Koller, Daniel L; Liu, Yunlong; Edenberg, Howard J; Econs, Michael J; Foroud, Tatiana; Turner, Charles H
2008-10-08
Hip fracture is the most devastating osteoporotic fracture type with significant morbidity and mortality. Several studies in humans and animal models identified chromosomal regions linked to hip size and bone mass. Previously, we identified that the region of 4q21-q41 on rat chromosome (Chr) 4 harbors multiple femoral neck quantitative trait loci (QTLs) in inbred Fischer 344 (F344) and Lewis (LEW) rats. The purpose of this study is to identify the candidate genes for femoral neck structure and density by correlating gene expression in the proximal femur with the femoral neck phenotypes linked to the QTLs on Chr 4. RNA was extracted from proximal femora of 4-wk-old rats from F344 and LEW strains, and two other strains, Copenhagen 2331 and Dark Agouti, were used as a negative control. Microarray analysis was performed using Affymetrix Rat Genome 230 2.0 arrays. A total of 99 genes in the 4q21-q41 region were differentially expressed (P < 0.05) among all strains of rats with a false discovery rate <10%. These 99 genes were then ranked based on the strength of correlation between femoral neck phenotypes measured in F2 animals, homozygous for a particular strain's allele at the Chr 4 QTL and the expression level of the gene in that strain. A total of 18 candidate genes were strongly correlated (r² > 0.50) with femoral neck width and prioritized for further analysis. Quantitative PCR analysis confirmed 14 of 18 of the candidate genes. Ingenuity pathway analysis revealed several direct or indirect relationships among the candidate genes related to angiogenesis (VEGF), bone growth (FGF2), bone formation (IGF2 and IGF2BP3), and resorption (TNF). This study provides a shortened list of genetic determinants of skeletal traits at the hip and may lead to novel approaches for prevention and treatment of hip fracture.
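The ranking step described above, ordering genes by the squared correlation between expression and a phenotype and keeping those with r² > 0.50, might look like the following Python sketch. The per-strain values and gene names are hypothetical, and the study's actual statistical pipeline is not specified in the abstract.

```python
from math import sqrt

def pearson_r(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    return sxy / sqrt(sxx * syy)

def rank_by_r2(expression, phenotype, threshold=0.50):
    """Keep genes whose r^2 with the phenotype exceeds the threshold,
    sorted from strongest to weakest correlation."""
    scored = [(gene, pearson_r(values, phenotype) ** 2)
              for gene, values in expression.items()]
    kept = [(gene, r2) for gene, r2 in scored if r2 > threshold]
    return sorted(kept, key=lambda item: item[1], reverse=True)

# Hypothetical values, one per strain (four strains).
phenotype = [1.0, 2.0, 3.0, 4.0]          # e.g. femoral neck width
expression = {
    "gene_x1": [2.0, 4.0, 6.0, 8.0],      # tracks the phenotype closely
    "gene_x2": [1.0, 3.0, 2.0, 4.0],      # partial correlation
    "gene_x3": [1.0, -1.0, -1.0, 1.0],    # uncorrelated
}

ranked = rank_by_r2(expression, phenotype)
for gene, r2 in ranked:
    print(gene, round(r2, 2))
```

With only a handful of strains, r² is a coarse prioritization score rather than a significance test, which fits the abstract's use of it to shortlist genes for quantitative PCR follow-up.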
Genomic expression analysis of rat chromosome 4 for skeletal traits at femoral neck
Alam, Imranul; Sun, Qiwei; Liu, Lixiang; Koller, Daniel L.; Liu, Yunlong; Edenberg, Howard J.; Econs, Michael J.; Foroud, Tatiana; Turner, Charles H.
2008-01-01
Hip fracture is the most devastating osteoporotic fracture type with significant morbidity and mortality. Several studies in humans and animal models identified chromosomal regions linked to hip size and bone mass. Previously, we identified that the region of 4q21-q41 on rat chromosome (Chr) 4 harbors multiple femoral neck quantitative trait loci (QTLs) in inbred Fischer 344 (F344) and Lewis (LEW) rats. The purpose of this study is to identify the candidate genes for femoral neck structure and density by correlating gene expression in the proximal femur with the femoral neck phenotypes linked to the QTLs on Chr 4. RNA was extracted from proximal femora of 4-wk-old rats from F344 and LEW strains, and two other strains, Copenhagen 2331 and Dark Agouti, were used as a negative control. Microarray analysis was performed using Affymetrix Rat Genome 230 2.0 arrays. A total of 99 genes in the 4q21-q41 region were differentially expressed (P < 0.05) among all strains of rats with a false discovery rate <10%. These 99 genes were then ranked based on the strength of correlation between femoral neck phenotypes measured in F2 animals, homozygous for a particular strain's allele at the Chr 4 QTL and the expression level of the gene in that strain. A total of 18 candidate genes were strongly correlated (r² > 0.50) with femoral neck width and prioritized for further analysis. Quantitative PCR analysis confirmed 14 of 18 of the candidate genes. Ingenuity pathway analysis revealed several direct or indirect relationships among the candidate genes related to angiogenesis (VEGF), bone growth (FGF2), bone formation (IGF2 and IGF2BP3), and resorption (TNF). This study provides a shortened list of genetic determinants of skeletal traits at the hip and may lead to novel approaches for prevention and treatment of hip fracture. PMID:18728226
Multiplex cDNA quantification method that facilitates the standardization of gene expression data
Gotoh, Osamu; Murakami, Yasufumi; Suyama, Akira
2011-01-01
Microarray-based gene expression measurement is one of the major methods for transcriptome analysis. However, current microarray data are substantially affected by microarray platforms and RNA references because the microarray method can provide only the relative amounts of gene expression levels. Therefore, valid comparisons of microarray data require standardized platforms, internal and/or external controls and complicated normalizations. These requirements impose limitations on the extensive comparison of gene expression data. Here, we report an effective approach to removing these limitations by measuring the absolute amounts of gene expression levels on common DNA microarrays. We have developed a multiplex cDNA quantification method called GEP-DEAN (Gene expression profiling by DCN-encoding-based analysis). The method was validated by using chemically synthesized DNA strands of known quantities and cDNA samples prepared from mouse liver, demonstrating that the absolute amounts of cDNA strands were successfully measured with a sensitivity of 18 zmol in a highly multiplexed manner in 7 h. PMID:21415008
X-linked intellectual disability update 2017.
Neri, Giovanni; Schwartz, Charles E; Lubs, Herbert A; Stevenson, Roger E
2018-04-25
The X-chromosome comprises only about 5% of the human genome but accounts for about 15% of the genes currently known to be associated with intellectual disability. The early progress in identifying the X-linked intellectual disability (XLID)-associated genes through linkage analysis and candidate gene sequencing has been accelerated with the use of high-throughput technologies. In the 10 years since the last update, the number of genes associated with XLID has increased by 96% from 72 to 141 and duplications of all 141 XLID genes have been described, primarily through the application of high-resolution microarrays and next generation sequencing. The progress in identifying genetic and genomic alterations associated with XLID has not been matched with insights that improve the clinician's ability to form differential diagnoses, that bring into view the possibility of curative therapies for patients, or that inform scientists of the impact of the genetic alterations on cell organization and function. © 2018 Wiley Periodicals, Inc.
Construction of a cDNA microarray derived from the ascidian Ciona intestinalis.
Azumi, Kaoru; Takahashi, Hiroki; Miki, Yasufumi; Fujie, Manabu; Usami, Takeshi; Ishikawa, Hisayoshi; Kitayama, Atsusi; Satou, Yutaka; Ueno, Naoto; Satoh, Nori
2003-10-01
A cDNA microarray was constructed from a basal chordate, the ascidian Ciona intestinalis. The draft genome of Ciona has been read and inferred to contain approximately 16,000 protein-coding genes, and cDNAs for transcripts of 13,464 genes have been characterized and compiled as the "Ciona intestinalis Gene Collection Release I". In the present study, we constructed a cDNA microarray of these 13,464 Ciona genes. A preliminary experiment with Cy3- and Cy5-labeled probes showed extensive differential gene expression between fertilized eggs and larvae. In addition, there was a good correlation between results obtained by the present microarray analysis and those from previous EST analyses. This first microarray of a large collection of Ciona intestinalis cDNA clones should facilitate the analysis of global gene expression and gene networks during the embryogenesis of basal chordates.
Wang, Hong; Bi, Yongyi; Tao, Ning; Wang, Chunhong
2005-08-01
To detect the differential expression of cell signal transduction genes associated with benzene poisoning, and to explore the pathogenic mechanisms of blood system damage induced by benzene. The peripheral white blood cell gene expression profiles of 7 benzene poisoning patients, including one with aplastic anemia, were determined by cDNA microarray. Seven chips from normal workers served as controls. Cluster analysis of the gene expression profiles was performed. Among the 4265 target genes, 176 genes associated with cell signal transduction were differentially expressed. Thirty-five up-regulated genes, including PTPRC, STAT4, and IFITM1, were found in at least 6 microarrays; 45 down-regulated genes, including ARHB, PPP3CB, and CDC37, were found in at least 5 microarrays. cDNA microarray technology is an effective technique for screening differentially expressed cell signal transduction genes. Disorder in cell signal transduction may play a role in the pathogenic mechanism of benzene poisoning.
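The consistency criterion used above (a gene counts as up-regulated if it is called up in at least 6 of the 7 patient arrays, and as down-regulated if called down in at least 5) can be sketched in Python as follows. The per-array calls and gene labels are invented; how each per-array call was made is not described in the abstract.

```python
def consistent_genes(calls, direction, min_arrays):
    """Return genes called `direction` ('up' or 'down') on at least
    `min_arrays` of the arrays, in alphabetical order."""
    return sorted(
        gene for gene, per_array in calls.items()
        if per_array.count(direction) >= min_arrays
    )

# Hypothetical per-array calls for 7 patient arrays.
calls = {
    "GENE_U1": ["up"] * 7,
    "GENE_U2": ["up"] * 6 + ["none"],
    "GENE_U3": ["up"] * 5 + ["none"] * 2,
    "GENE_D1": ["down"] * 5 + ["none"] * 2,
    "GENE_D2": ["down"] * 4 + ["up"] * 3,
}

up = consistent_genes(calls, "up", min_arrays=6)
down = consistent_genes(calls, "down", min_arrays=5)
print("up:", up)
print("down:", down)
```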
Clark, Melody S; Thorne, Michael AS; Purać, Jelena; Burns, Gavin; Hillyard, Guy; Popović, Željko D; Grubor-Lajšić, Gordana; Worland, M Roger
2009-01-01
Background Insects provide tractable models for enhancing our understanding of the physiological and cellular processes that enable survival at extreme low temperatures. They possess three main strategies to survive the cold: freeze tolerance, freeze avoidance or cryoprotective dehydration, of which the latter method is exploited by our model species, the Arctic springtail Megaphorura arctica, formerly Onychiurus arcticus (Tullberg 1876). The physiological mechanisms underlying cryoprotective dehydration have been well characterised in M. arctica and to date this process has been described in only a few other species: the Antarctic nematode Panagrolaimus davidi, an enchytraeid worm, the larvae of the Antarctic midge Belgica antarctica and the cocoons of the earthworm Dendrobaena octaedra. There are no in-depth molecular studies on the underlying cold survival mechanisms in any species. Results A cDNA microarray was generated using 6,912 M. arctica clones printed in duplicate. Analysis of clones up-regulated during dehydration procedures (using both cold- and salt-induced dehydration) has identified a number of significant cellular processes, namely the production and mobilisation of trehalose, protection of cellular systems via small heat shock proteins and tissue/cellular remodelling during the dehydration process. Energy production, initiation of protein translation and cell division, plus potential tissue repair processes dominate genes identified during recovery. Heat map analysis identified a duplication of the trehalose-6-phosphate synthase (TPS) gene in M. arctica and also 53 clones co-regulated with TPS, including a number of membrane associated and cell signalling proteins. Q-PCR on selected candidate genes has also contributed to our understanding with glutathione-S-transferase identified as the major antioxidant enzyme protecting the cells during these stressful procedures, and a number of protein kinase signalling molecules involved in recovery.
Conclusion Microarray analysis has proved to be a powerful technique for understanding the processes and genes involved in cryoprotective dehydration, beyond the few candidate genes identified in the current literature. Dehydration is associated with the mobilisation of trehalose, cell protection and tissue remodelling. Energy production, leading to protein production, and cell division characterise the recovery process. Novel membrane proteins, along with aquaporins and desaturases, have been identified as promising candidates for future functional analyses to better understand membrane remodelling during cellular dehydration. PMID:19622137
Gouré, Julien; Findlay, Wendy A; Deslandes, Vincent; Bouevitch, Anne; Foote, Simon J; MacInnes, Janet I; Coulton, James W; Nash, John HE; Jacques, Mario
2009-01-01
Background Actinobacillus pleuropneumoniae, the causative agent of porcine pleuropneumonia, is a highly contagious respiratory pathogen that causes severe losses to the swine industry worldwide. Current commercially-available vaccines are of limited value because they do not induce cross-serovar immunity and do not prevent development of the carrier state. Microarray-based comparative genomic hybridizations (M-CGH) were used to estimate whole genomic diversity of representative Actinobacillus pleuropneumoniae strains. Our goal was to identify conserved genes, especially those predicted to encode outer membrane proteins and lipoproteins because of their potential for the development of more effective vaccines. Results Using hierarchical clustering, our M-CGH results showed that the majority of the genes in the genome of the serovar 5 A. pleuropneumoniae L20 strain were conserved in the reference strains of all 15 serovars and in representative field isolates. Fifty-eight conserved genes predicted to encode for outer membrane proteins or lipoproteins were identified. As well, there were several clusters of diverged or absent genes including those associated with capsule biosynthesis, toxin production as well as genes typically associated with mobile elements. Conclusion Although A. pleuropneumoniae strains are essentially clonal, M-CGH analysis of the reference strains of the fifteen serovars and representative field isolates revealed several classes of genes that were divergent or absent. Not surprisingly, these included genes associated with capsule biosynthesis as the capsule is associated with sero-specificity. Several of the conserved genes were identified as candidates for vaccine development, and we conclude that M-CGH is a valuable tool for reverse vaccinology. PMID:19239696
Radeke, Monte J; Peterson, Katie E; Johnson, Lincoln V; Anderson, Don H
2007-09-01
The discoveries of gene variants associated with macular diseases have provided valuable insight into their molecular mechanisms, but they have not clarified why the macula is particularly vulnerable to degenerative disease. Its predisposition may be attributable to specialized structural features and/or functional properties of the underlying macular RPE/choroid. To examine the molecular basis for the macula's disease susceptibility, we compared the gene expression profile of the human RPE/choroid in the macula with the profile in the extramacular region using DNA microarrays. Seventy-five candidate genes with differences in macular:extramacular expression levels were identified by microarray analysis, of which 29 were selected for further analysis. Quantitative PCR confirmed that 21 showed statistically significant differences in expression. Five genes were expressed at higher levels in the macula. Two showed significant changes in the macular:extramacular expression ratio; another two exhibited changes in absolute expression level, as a function of age or AMD. Several of the differentially expressed genes have potential relevance to AMD pathobiology. One is an RPE cell growth factor (TFPI2), five are extracellular matrix components (DCN, MYOC, OGN, SMOC2, TFPI2), and six are related to inflammation (CCL19, CCL26, CXCL14, SLIT2) and/or angiogenesis (CXCL14, SLIT2, TFPI2, WFDC1). The identification of regional differences in gene expression in the RPE/choroid is a first step in clarifying the macula's propensity for degeneration. These findings lay the groundwork for further studies into the roles of the corresponding gene products in the normal, aged, and diseased macula.
Ramirez-Córdova, Jesús; Drnevich, Jenny; Madrigal-Pulido, Jaime Alberto; Arrizon, Javier; Allen, Kirk; Martínez-Velázquez, Moisés; Alvarez-Maya, Ikuri
2012-08-01
During ethanol fermentation, yeast cells are exposed to stress due to the accumulation of ethanol, cell growth is altered and the output of the target product is reduced. For Agave beverages, like tequila, no reports have been published on the global gene expression under ethanol stress. In this work, we used microarray analysis to identify Saccharomyces cerevisiae genes involved in the ethanol response. Gene expression of a tequila yeast strain of S. cerevisiae (AR5) was explored by comparing global gene expression with that of laboratory strain S288C, both after ethanol exposure. Additionally, we used two different culture conditions, cells grown in Agave tequilana juice as a natural fermentation media or grown in yeast-extract peptone dextrose as artificial media. Of the 6368 S. cerevisiae genes in the microarray, 657 genes were identified that had different expression responses to ethanol stress due to strain and/or media. A cluster of 28 genes was found over-expressed specifically in the AR5 tequila strain that could be involved in the adaptation to tequila yeast fermentation, 14 of which are unknown such as yor343c, ylr162w, ygr182c, ymr265c, yer053c-a or ydr415c. These could be the most suitable genes for transforming tequila yeast to increase ethanol tolerance in the tequila fermentation process. Other genes involved in response to stress (RFC4, TSA1, MLH1, PAU3, RAD53) or transport (CYB2, TIP20, QCR9) were expressed in the same cluster. Unknown genes could be good candidates for the development of recombinant yeasts with ethanol tolerance for use in industrial tequila fermentation.
Luque-Almagro, V M; Escribano, M P; Manso, I; Sáez, L P; Cabello, P; Moreno-Vivián, C; Roldán, M D
2015-11-20
Pseudomonas pseudoalcaligenes CECT5344 is an alkaliphilic bacterium that can use cyanide as nitrogen source for growth, becoming a suitable candidate to be applied in biological treatment of cyanide-containing wastewaters. The assessment of the whole genome sequence of the strain CECT5344 has allowed the generation of DNA microarrays to analyze the response to different nitrogen sources. The mRNA of P. pseudoalcaligenes CECT5344 cells grown under nitrogen limiting conditions showed considerable changes when compared against the transcripts from cells grown with ammonium; up-regulated genes were, among others, the glnK gene encoding the nitrogen regulatory protein PII, the two-component ntrBC system involved in global nitrogen regulation, and the ammonium transporter-encoding amtB gene. The protein coding transcripts of P. pseudoalcaligenes CECT5344 cells grown with sodium cyanide or an industrial jewelry wastewater that contains high concentration of cyanide and metals like iron, copper and zinc, were also compared against the transcripts of cells grown with ammonium as nitrogen source. This analysis revealed the induction by cyanide and the cyanide-rich wastewater of four nitrilase-encoding genes, including the nitC gene that is essential for cyanide assimilation, the cyanase cynS gene involved in cyanate assimilation, the cioAB genes required for the cyanide-insensitive respiration, and the ahpC gene coding for an alkyl-hydroperoxide reductase that could be related with iron homeostasis and oxidative stress. The nitC and cynS genes were also induced in cells grown under nitrogen starvation conditions. In cells grown with the jewelry wastewater, a malate quinone:oxidoreductase mqoB gene and several genes coding for metal extrusion systems were specifically induced. Copyright © 2015 The Authors. Published by Elsevier B.V. All rights reserved.
Terabayashi, Yasunobu; Sano, Motoaki; Yamane, Noriko; Marui, Junichiro; Tamano, Koichi; Sagara, Junichi; Dohmoto, Mitsuko; Oda, Ken; Ohshima, Eiji; Tachibana, Kuniharu; Higa, Yoshitaka; Ohashi, Shinichi; Koike, Hideaki; Machida, Masayuki
2010-12-01
Kojic acid is produced in large amounts by Aspergillus oryzae as a secondary metabolite and is widely used in the cosmetic industry. Glucose can be converted to kojic acid, perhaps by only a few steps, but no genes for the conversion have thus far been revealed. Using a DNA microarray, gene expression profiles under three pairs of conditions significantly affecting kojic acid production were compared. All genes were ranked using an index parameter reflecting both high amounts of transcription and a high induction ratio under producing conditions. After disruption of nine candidate genes selected from the top of the list, two genes of unknown function were found to be responsible for kojic acid biosynthesis, one having an oxidoreductase motif and the other a transporter motif. These two genes are closely associated in the genome, showing typical characteristics of genes involved in secondary metabolism. Copyright © 2010 Elsevier Inc. All rights reserved.
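A ranking index that rewards both a high transcript level and a high induction ratio could be sketched as below. The abstract does not give the formula the authors used, so the product of log2 level and log2 ratio, the pseudocount, and the gene labels are all assumptions made for illustration; a single pair of conditions is shown rather than the three pairs used in the study.

```python
from math import log2

def production_index(producing, non_producing, pseudocount=1.0):
    """Hypothetical index: log2 expression level under producing conditions
    multiplied by the log2 induction ratio (producing / non-producing)."""
    level = log2(producing + pseudocount)
    induction = log2((producing + pseudocount) / (non_producing + pseudocount))
    return level * induction

def rank_genes(table):
    """Sort gene names by descending index; `table` maps each gene to a
    (producing, non_producing) pair of signal intensities."""
    return sorted(table, key=lambda gene: production_index(*table[gene]),
                  reverse=True)

# Invented signal intensities.
table = {
    "gene_a": (7.0, 1.0),    # moderately expressed, strongly induced
    "gene_b": (15.0, 15.0),  # highly expressed, not induced
    "gene_c": (1.0, 0.0),    # weakly expressed, induced
}

ranking = rank_genes(table)
print(ranking)
```

Multiplying the two terms pushes a gene down the list when either component is weak, which is one simple way to favor genes that are both abundant and induced.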
EBF proteins participate in transcriptional regulation of Xenopus muscle development.
Green, Yangsook Song; Vetter, Monica L
2011-10-01
EBF proteins have diverse functions in the development of multiple lineages, including neurons, B cells and adipocytes. During Drosophila muscle development EBF proteins are expressed in muscle progenitors and are required for muscle cell differentiation, but there is no known function of EBF proteins in vertebrate muscle development. In this study, we examine the expression of ebf genes in Xenopus muscle tissue and show that EBF activity is necessary for aspects of Xenopus skeletal muscle development, including somite organization, migration of hypaxial muscle anlagen toward the ventral abdomen, and development of jaw muscle. From a microarray screen, we have identified multiple candidate targets of EBF activity with known roles in muscle development. The candidate targets we have verified are MYOD, MYF5, M-Cadherin and SEB-4. In vivo overexpression of the ebf2 and ebf3 genes leads to ectopic expression of these candidate targets, and knockdown of EBF activity causes downregulation of the endogenous expression of the candidate targets. Furthermore, we found that MYOD and MYF5 are likely to be direct targets. Finally we show that MYOD can upregulate the expression of ebf genes, indicating the presence of a positive feedback loop between EBF and MYOD that we find to be important for maintenance of MYOD expression in Xenopus. These results suggest that EBF activity is important for both stabilizing commitment and driving aspects of differentiation in Xenopus muscle cells. Copyright © 2010 Elsevier Inc. All rights reserved.
2011-01-01
Background Global transcriptional analysis of loblolly pine (Pinus taeda L.) is challenging due to limited molecular tools. PtGen2, a 26,496 feature cDNA microarray, was fabricated and used to assess drought-induced gene expression in loblolly pine propagule roots. Statistical analysis of differential expression and weighted gene correlation network analysis were used to identify drought-responsive genes and further characterize the molecular basis of drought tolerance in loblolly pine. Results Microarrays were used to interrogate root cDNA populations obtained from 12 genotype × treatment combinations (four genotypes, three watering regimes). Comparison of drought-stressed roots with roots from the control treatment identified 2445 genes displaying at least a 1.5-fold expression difference (false discovery rate = 0.01). Genes commonly associated with drought response in pine and other plant species, as well as a number of abiotic and biotic stress-related genes, were up-regulated in drought-stressed roots. Only 76 genes were identified as differentially expressed in drought-recovered roots, indicating that the transcript population can return to the pre-drought state within 48 hours. Gene correlation analysis predicts a scale-free network topology and identifies eleven co-expression modules that ranged in size from 34 to 938 members. Network topological parameters identified a number of central nodes (hubs) including those with significant homology (E-values ≤ 2 × 10⁻³⁰) to 9-cis-epoxycarotenoid dioxygenase, zeatin O-glucosyltransferase, and ABA-responsive protein. Identified hubs also include genes that have been associated previously with osmotic stress, phytohormones, enzymes that detoxify reactive oxygen species, and several genes of unknown function. Conclusion PtGen2 was used to evaluate transcriptome responses in loblolly pine and was leveraged to identify 2445 differentially expressed genes responding to severe drought stress in roots. Many of the genes identified are known to be up-regulated in response to osmotic stress in pine and other plant species and encode proteins involved in both signal transduction and stress tolerance. Gene expression levels returned to control values within a 48-hour recovery period in all but 76 transcripts. Correlation network analysis indicates a scale-free network topology for the pine root transcriptome and identifies central nodes that may serve as drivers of drought-responsive transcriptome dynamics in the roots of loblolly pine. PMID:21609476
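Selecting genes by a fold-change cutoff combined with a false discovery rate threshold, as in the 1.5-fold / FDR = 0.01 criterion above, can be sketched in Python. The Benjamini-Hochberg adjustment is used here as a common choice; the abstract does not say which FDR procedure the study applied, and the p-values and expression ratios are invented.

```python
from math import log2

def benjamini_hochberg(pvalues):
    """Return Benjamini-Hochberg adjusted p-values in the input order."""
    m = len(pvalues)
    order = sorted(range(m), key=lambda i: pvalues[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest p-value down, enforcing monotonicity.
    for rank in range(m, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, pvalues[i] * m / rank)
        adjusted[i] = running_min
    return adjusted

def select_genes(genes, ratios, pvalues, min_fold=1.5, fdr=0.01):
    """Keep genes changed at least `min_fold` in either direction
    whose adjusted p-value is at most `fdr`."""
    qvalues = benjamini_hochberg(pvalues)
    cutoff = log2(min_fold)
    return [g for g, ratio, q in zip(genes, ratios, qvalues)
            if abs(log2(ratio)) >= cutoff and q <= fdr]

# Hypothetical stressed/control expression ratios and raw p-values.
genes = ["g1", "g2", "g3", "g4", "g5"]
ratios = [2.0, 0.5, 3.0, 1.1, 1.0]
pvalues = [0.001, 0.002, 0.039, 0.041, 0.6]

selected = select_genes(genes, ratios, pvalues)
print(selected)
```

Taking the absolute log2 ratio treats a 2-fold decrease (ratio 0.5) the same as a 2-fold increase, so both up- and down-regulated genes pass the same cutoff.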
Prescott, Meagan A; Pastey, Manoj K
2010-12-05
Each year, there are estimated to be approximately 200,000 hospitalizations and 36,000 deaths due to influenza in the United States. Reports have indicated that most deaths are not directly due to influenza virus, but to secondary bacterial pneumonia, predominantly staphylococcal in origin. Here we identify the presence of candidate blood and urine biomarkers in mice with Staphylococcus aureus and influenza virus co-infection. In this pilot study, mice were grouped into four treatments: co-infected with influenza virus and S. aureus, singly infected with influenza virus or S. aureus, and a control group of uninfected mice (PBS treated). Gene expression changes were identified by DNA microarrays from blood samples taken at day five post infection. Proteomic changes were obtained from urine samples collected at three and five days post infection using 2-D DIGE followed by protein identification by mass spectrometry. Differentially expressed genes and/or proteins were identified as candidate biomarkers for future validation in larger studies.
Richard, Arianne C; Lyons, Paul A; Peters, James E; Biasci, Daniele; Flint, Shaun M; Lee, James C; McKinney, Eoin F; Siegel, Richard M; Smith, Kenneth G C
2014-08-04
Although numerous investigations have compared gene expression microarray platforms, preprocessing methods and batch correction algorithms using constructed spike-in or dilution datasets, there remains a paucity of studies examining the properties of microarray data using diverse biological samples. Most microarray experiments seek to identify subtle differences between samples with variable background noise, a scenario poorly represented by constructed datasets. Thus, microarray users lack important information regarding the complexities introduced in real-world experimental settings. The recent development of a multiplexed, digital technology for nucleic acid measurement enables counting of individual RNA molecules without amplification and, for the first time, permits such a study. Using a set of human leukocyte subset RNA samples, we compared previously acquired microarray expression values with RNA molecule counts determined by the nCounter Analysis System (NanoString Technologies) in selected genes. We found that gene measurements across samples correlated well between the two platforms, particularly for high-variance genes, while genes deemed unexpressed by the nCounter generally had both low expression and low variance on the microarray. Confirming previous findings from spike-in and dilution datasets, this "gold-standard" comparison demonstrated signal compression that varied dramatically by expression level and, to a lesser extent, by dataset. Most importantly, examination of three different cell types revealed that noise levels differed across tissues. Microarray measurements generally correlate with relative RNA molecule counts within optimal ranges but suffer from expression-dependent accuracy bias and precision that varies across datasets. We urge microarray users to consider expression-level effects in signal interpretation and to evaluate noise properties in each dataset independently.
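Signal compression of the kind described above can be illustrated by regressing log-scale microarray values on log-scale molecule counts: a slope below 1 means the microarray spans a narrower range than the counts. The Python sketch below uses invented numbers and ordinary least squares; it is not the analysis performed in the study.

```python
def least_squares_slope(xs, ys):
    """Slope of the ordinary least-squares line of ys on xs."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    return sxy / sxx

# Hypothetical log2 values for one gene across five samples.
log2_counts = [2.0, 4.0, 6.0, 8.0, 10.0]   # digital molecule counts
log2_array = [7.0, 8.0, 9.0, 10.0, 11.0]   # microarray intensities

slope = least_squares_slope(log2_counts, log2_array)
print(f"slope = {slope:.2f}")  # a value below 1 indicates compression
```

In this toy data a 2-fold change in counts appears as only about a 1.4-fold change on the array, because each unit of log2 count maps to half a unit of log2 intensity.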
Microarray technology is a powerful tool to investigate the gene expression profiles for thousands of genes simultaneously. In recent years, microarrays have been used to characterize environmental pollutants and identify molecular mode(s) of action of chemicals including endocri...
Principles of gene microarray data analysis.
Mocellin, Simone; Rossi, Carlo Riccardo
2007-01-01
The development of several gene expression profiling methods, such as comparative genomic hybridization (CGH), differential display, serial analysis of gene expression (SAGE), and gene microarray, together with the sequencing of the human genome, has provided an opportunity to monitor and investigate the complex cascade of molecular events leading to tumor development and progression. The availability of such large amounts of information has shifted the attention of scientists towards a nonreductionist approach to biological phenomena. High throughput technologies can be used to follow changing patterns of gene expression over time. Among them, gene microarray has become prominent because it is easier to use, does not require large-scale DNA sequencing, and allows for the parallel quantification of thousands of genes from multiple samples. Gene microarray technology is rapidly spreading worldwide and has the potential to drastically change the therapeutic approach to patients affected with tumor. Therefore, it is of paramount importance for both researchers and clinicians to know the principles underlying the analysis of the huge amount of data generated with microarray technology.
Maratou, Klio; Behmoaras, Jacques; Fewings, Chris; Srivastava, Prashant; D’Souza, Zelpha; Smith, Jennifer; Game, Laurence; Cook, Terence; Aitman, Tim
2010-01-01
Crescentic glomerulonephritis (CRGN) is a major cause of rapidly progressive renal failure for which the underlying genetic basis is unknown. WKY rats show marked susceptibility to CRGN, while Lewis rats are resistant. Glomerular injury and crescent formation are macrophage-dependent and mainly explained by seven quantitative trait loci (Crgn1-7). Here, we used microarray analysis in basal and lipopolysaccharide (LPS)-stimulated macrophages to identify genes that reside on pathways predisposing WKY rats to CRGN. We detected 97 novel positional candidates for the uncharacterised Crgn3-7. We identified 10 additional secondary effector genes with profound differences in expression between the two strains (>5-fold change, <1% False Discovery Rate) for basal and LPS-stimulated macrophages. Moreover, we identified 8 genes with differentially expressed alternatively spliced isoforms, by using an in depth analysis at probe-level that allowed us to discard false positives due to polymorphisms between the two rat strains. Pathway analysis identified several common linked pathways, enriched for differentially expressed genes, which affect macrophage activation. In summary, our results identify distinct macrophage transcriptome profiles between two rat strains that differ in susceptibility to glomerulonephritis, provide novel positional candidates for Crgn3-7, and define groups of genes that play a significant role in differential regulation of macrophage activity. PMID:21179115
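The selection criteria quoted above (>5-fold change, <1% False Discovery Rate) can be sketched as a simple filter. This is an illustration under stated assumptions: the abstract does not say which FDR procedure was used, so the Benjamini-Hochberg adjustment is assumed here, and the gene names, fold changes and p-values are hypothetical.

```python
def benjamini_hochberg(pvalues):
    """Return Benjamini-Hochberg adjusted p-values in the input order."""
    m = len(pvalues)
    order = sorted(range(m), key=lambda i: pvalues[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest p-value down, enforcing monotonicity.
    for rank in range(m, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, pvalues[i] * m / rank)
        adjusted[i] = running_min
    return adjusted

# Hypothetical records: (gene, fold change between strains, raw p-value).
genes = [
    ("gene_1", 6.0, 0.0001),
    ("gene_2", 1.5, 0.0002),
    ("gene_3", 0.1, 0.0003),  # 10-fold lower in the first strain
    ("gene_4", 8.0, 0.5),
]

qvalues = benjamini_hochberg([p for _, _, p in genes])
selected = [
    name
    for (name, fold, _), q in zip(genes, qvalues)
    if q < 0.01 and (fold > 5.0 or fold < 1.0 / 5.0)
]
print(selected)
```

Only genes that pass both thresholds are retained; a large fold change with a poor adjusted p-value (gene_4) or a small change with a good one (gene_2) is discarded.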
Characterization of the OmyY1 region on the rainbow trout Y chromosome
Phillips, Ruth B.; DeKoning, Jenefer J.; Brunelli, Joseph P.; Faber-Hammond, Joshua J.; Hansen, John D.; Christensen, Kris A.; Renn, Suzy C.P.; Thorgaard, Gary H.
2013-01-01
We characterized the male-specific region on the Y chromosome of rainbow trout, which contains both sdY (the sex-determining gene) and the male-specific genetic marker, OmyY1. Several clones containing the OmyY1 marker were screened from a BAC library from a YY clonal line and found to be part of an 800 kb BAC contig. Using fluorescence in situ hybridization (FISH), these clones were localized to the end of the short arm of the Y chromosome in rainbow trout, with an additional signal on the end of the X chromosome in many cells. We sequenced a minimum tiling path of these clones using Illumina and 454 pyrosequencing. The region is rich in transposons and rDNA, but also appears to contain several single-copy protein-coding genes. Most of these genes are also found on the X chromosome; and in several cases sex-specific SNPs in these genes were identified between the male (YY) and female (XX) homozygous clonal lines. Additional genes were identified by hybridization of the BACs to the cGRASP salmonid 4x44K oligo microarray. By BLASTn evaluations using hypothetical transcripts of OmyY1-linked candidate genes as query against several EST databases, we conclude at least 12 of these candidate genes are likely functional, and expressed.
Candidate Genes for Inherited Autism Susceptibility in the Lebanese Population.
Kourtian, Silva; Soueid, Jihane; Makhoul, Nadine J; Guisso, Dikran Richard; Chahrour, Maria; Boustany, Rose-Mary N
2017-03-30
Autism spectrum disorder (ASD) is characterized by ritualistic-repetitive behaviors and impaired verbal/non-verbal communication. Many ASD susceptibility genes implicated in neuronal pathways/brain development have been identified. The Lebanese population is ideal for uncovering recessive genes because of shared ancestry and a high rate of consanguineous marriages. The aims here are to screen for published ASD genes and to uncover novel inherited ASD susceptibility genes specific to the Lebanese. We recruited 36 ASD families (ASD: 37, unaffected parents: 36, unaffected siblings: 33) and 100 unaffected Lebanese controls. Cytogenetics 2.7 M Microarrays/CytoScan™ HD arrays allowed mapping of homozygous regions of the genome. The CNTNAP2 gene was screened by Sanger sequencing. Homozygosity mapping uncovered DPP4, TRHR, and MLF1 as novel candidate susceptibility genes for ASD in the Lebanese. Sequencing of hot spot exons in CNTNAP2 led to discovery of a 5 bp insertion in 23/37 ASD patients. This mutation was also present in unaffected family members and unaffected Lebanese controls. Although a slight increase in number was observed in ASD patients and family members compared to controls, there were no significant differences in allele frequencies between affected individuals and controls (C/TTCTG: χ² value = 0.014; p = 0.904). The CNTNAP2 polymorphism identified in this population, hence, is not linked to the ASD phenotype.
Candidate Genes for Inherited Autism Susceptibility in the Lebanese Population
Kourtian, Silva; Soueid, Jihane; Makhoul, Nadine J.; Guisso, Dikran Richard; Chahrour, Maria; Boustany, Rose-Mary N.
2017-01-01
Autism spectrum disorder (ASD) is characterized by ritualistic-repetitive behaviors and impaired verbal/non-verbal communication. Many ASD susceptibility genes implicated in neuronal pathways/brain development have been identified. The Lebanese population is ideal for uncovering recessive genes because of shared ancestry and a high rate of consanguineous marriages. The aims here are to screen for published ASD genes and to uncover novel inherited ASD susceptibility genes specific to the Lebanese. We recruited 36 ASD families (ASD: 37, unaffected parents: 36, unaffected siblings: 33) and 100 unaffected Lebanese controls. Cytogenetics 2.7 M Microarrays/CytoScan™ HD arrays allowed mapping of homozygous regions of the genome. The CNTNAP2 gene was screened by Sanger sequencing. Homozygosity mapping uncovered DPP4, TRHR, and MLF1 as novel candidate susceptibility genes for ASD in the Lebanese. Sequencing of hot spot exons in CNTNAP2 led to discovery of a 5 bp insertion in 23/37 ASD patients. This mutation was also present in unaffected family members and unaffected Lebanese controls. Although a slight increase in number was observed in ASD patients and family members compared to controls, there were no significant differences in allele frequencies between affected individuals and controls (C/TTCTG: χ² value = 0.014; p = 0.904). The CNTNAP2 polymorphism identified in this population, hence, is not linked to the ASD phenotype. PMID:28358038
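The reported allele-frequency comparison can be checked with a short calculation. The sketch below assumes a chi-square test with one degree of freedom and no continuity correction, which the abstract does not state explicitly; the 2x2 allele counts are hypothetical, and only the statistic 0.014 comes from the text.

```python
import math

def chi_square_2x2(a, b, c, d):
    """Pearson chi-square statistic for the 2x2 table [[a, b], [c, d]]."""
    n = a + b + c + d
    denominator = (a + b) * (c + d) * (a + c) * (b + d)
    return n * (a * d - b * c) ** 2 / denominator

def p_value_df1(chi2):
    """Upper-tail p-value of a chi-square statistic with 1 degree of freedom."""
    return math.erfc(math.sqrt(chi2 / 2.0))

# Statistic quoted in the abstract; the p-value is recomputed from it.
recomputed_p = p_value_df1(0.014)
print(f"p for chi-square 0.014 (1 df) = {recomputed_p:.3f}")

# Hypothetical allele counts (insertion, reference) in cases and controls.
example_chi2 = chi_square_2x2(10, 20, 10, 20)
print(f"example statistic = {example_chi2:.3f}, p = {p_value_df1(example_chi2):.3f}")
```

Under these assumptions the recomputed p-value lies close to the reported 0.904, with the small difference attributable to rounding of the published statistic.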
Costin, Blair N.; Wolen, Aaron R.; Fitting, Sylvia; Shelton, Keith L.; Miles, Michael F.
2012-01-01
Background Glucocorticoid hormones modulate acute and chronic behavioral and molecular responses to drugs of abuse including psychostimulants and opioids. There is growing evidence that glucocorticoids might also modulate behavioral responses to ethanol. Acute ethanol activates the HPA axis, causing release of adrenal glucocorticoid hormones. Our prior genomic studies suggest glucocorticoids play a role in regulating gene expression in the prefrontal cortex (PFC) of DBA2/J (D2) mice following acute ethanol administration. However, few studies have analyzed the role of glucocorticoid signaling in behavioral responses to acute ethanol. Such work could be significant, given the predictive value for level of response to acute ethanol in the risk for alcoholism. Methods We studied whether the glucocorticoid receptor (GR) antagonist, RU-486, or adrenalectomy (ADX) altered male D2 mouse behavioral responses to acute (locomotor activation, anxiolysis or loss-of-righting reflex (LORR)) or repeated (sensitization) ethanol treatment. Whole genome microarray analysis and bioinformatics approaches were used to identify PFC candidate genes possibly responsible for altered behavioral responses to ethanol following ADX. Results ADX and RU-486 both impaired acute ethanol (2 g/kg) induced locomotor activation in D2 mice without affecting basal locomotor activity. However, neither ADX nor RU-486 altered initiation of ethanol sensitization (locomotor activation or jump counts), ethanol-induced anxiolysis or LORR. ADX mice showed microarray gene expression changes in PFC that significantly overlapped with acute ethanol-responsive gene sets derived by our prior microarray studies. Q-rtPCR analysis verified that ADX decreased PFC expression of Fkbp5 while significantly increasing Gpr6 expression. In addition, high dose RU-486 pre-treatment blunted ethanol-induced Fkbp5 expression. 
Conclusions Our studies suggest that ethanol’s activation of adrenal glucocorticoid release and subsequent GR activation may partially modulate ethanol’s acute locomotor activation in male D2 mice. Furthermore, since adrenal glucocorticoid basal tone regulated PFC gene expression, including a significant set of acute ethanol-responsive genes, this suggests that glucocorticoid regulated PFC gene expression may be an important factor modulating acute behavioral responses to ethanol. PMID:22671426
Vawter, Marquis P.; Harvey, Philip D.; DeLisi, Lynn E.
2007-01-01
Klinefelter’s Syndrome (KS) is a chromosomal karyotype with one or more extra X chromosomes. KS individuals often show language impairment and the phenotype might be due to overexpression of genes on the extra X chromosome(s). We profiled mRNA derived from lymphoblastoid cell lines from males with documented KS and control males using the Affymetrix U133P microarray platform. There were 129 differentially expressed genes (DEGs) in KS group compared with controls after Benjamini–Hochberg false discovery adjustment. The DEGs included 14 X chromosome genes which were significantly over-represented. The Y chromosome had zero DEGs. In exploratory analysis of gene expression–cognition relationships, 12 DEGs showed significant correlation of expression with measures of verbal cognition in KS. Overexpression of one pseudoautosomal gene, GTPBP6 (GTP binding protein 6, putative) was inversely correlated with verbal IQ (r = −0.86, P < 0.001) and four other measures of verbal ability. Overexpression of XIST was found in KS compared to XY controls suggesting that silencing of many genes on the X chromosome might occur in KS similar to XX females. The microarray findings for eight DEGs were validated by quantitative PCR. The 14 X chromosome DEGs were not differentially expressed in prior studies comparing female and male brains suggesting a dysregulation profile unique to KS. Examination of X-linked DEGs, such as GTPBP6, TAF9L, and CXORF21, that show verbal cognition–gene expression correlations may establish a causal link between these genes, neurodevelopment, and language function. A screen of candidate genes may serve as biomarkers of KS for early diagnosis. PMID:17347996
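The statement that X chromosome genes were over-represented among the DEGs can be illustrated with an enrichment test. This is a sketch under assumptions: the abstract does not name the test, so a one-sided hypergeometric test is used here, and the totals of 20,000 assayed genes and 800 X-linked genes are hypothetical placeholders. Only the counts 129 and 14 come from the text.

```python
from math import comb

def hypergeom_upper_tail(population, successes, draws, observed):
    """P(X >= observed) when drawing `draws` items without replacement."""
    total = comb(population, draws)
    upper = min(draws, successes)
    favourable = sum(
        comb(successes, i) * comb(population - successes, draws - i)
        for i in range(observed, upper + 1)
    )
    return favourable / total

# 129 DEGs, 14 of them X-linked (from the abstract); array totals are assumed.
assayed_genes = 20000   # hypothetical
x_linked_genes = 800    # hypothetical
p_x_enrichment = hypergeom_upper_tail(assayed_genes, x_linked_genes, 129, 14)
print(f"one-sided enrichment p = {p_x_enrichment:.2e}")
```

With these placeholder totals about five X-linked DEGs would be expected by chance, so observing 14 gives a small p-value; the actual value depends on the real gene counts for the platform.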
Vawter, Marquis P; Harvey, Philip D; DeLisi, Lynn E
2007-09-05
Klinefelter's Syndrome (KS) is a chromosomal karyotype with one or more extra X chromosomes. KS individuals often show language impairment and the phenotype might be due to overexpression of genes on the extra X chromosome(s). We profiled mRNA derived from lymphoblastoid cell lines from males with documented KS and control males using the Affymetrix U133P microarray platform. There were 129 differentially expressed genes (DEGs) in KS group compared with controls after Benjamini-Hochberg false discovery adjustment. The DEGs included 14 X chromosome genes which were significantly over-represented. The Y chromosome had zero DEGs. In exploratory analysis of gene expression-cognition relationships, 12 DEGs showed significant correlation of expression with measures of verbal cognition in KS. Overexpression of one pseudoautosomal gene, GTPBP6 (GTP binding protein 6, putative) was inversely correlated with verbal IQ (r = -0.86, P < 0.001) and four other measures of verbal ability. Overexpression of XIST was found in KS compared to XY controls suggesting that silencing of many genes on the X chromosome might occur in KS similar to XX females. The microarray findings for eight DEGs were validated by quantitative PCR. The 14 X chromosome DEGs were not differentially expressed in prior studies comparing female and male brains suggesting a dysregulation profile unique to KS. Examination of X-linked DEGs, such as GTPBP6, TAF9L, and CXORF21, that show verbal cognition-gene expression correlations may establish a causal link between these genes, neurodevelopment, and language function. A screen of candidate genes may serve as biomarkers of KS for early diagnosis. Copyright 2007 Wiley-Liss, Inc.
Kawaura, Kanako; Mochida, Keiichi; Yamazaki, Yukiko; Ogihara, Yasunari
2006-04-01
In this study, we constructed a 22k wheat oligo-DNA microarray. A total of 148,676 expressed sequence tags of common wheat were collected from the database of the Wheat Genomics Consortium of Japan. These were grouped into 34,064 contigs, which were then used to design an oligonucleotide DNA microarray. Following a multistep selection of the sense strand, 21,939 60-mer oligo-DNA probes were selected for attachment on the microarray slide. This 22k oligo-DNA microarray was used to examine the transcriptional response of wheat to salt stress. More than 95% of the probes gave reproducible hybridization signals when targeted with RNAs extracted from salt-treated wheat shoots and roots. With the microarray, we identified 1,811 genes whose expressions changed more than 2-fold in response to salt. These included genes known to mediate response to salt, as well as unknown genes, and they were classified into 12 major groups by hierarchical clustering. These gene expression patterns were also confirmed by real-time reverse transcription-PCR. Many of the genes with unknown function were clustered together with genes known to be involved in response to salt stress. Thus, analysis of gene expression patterns combined with gene ontology should help identify the function of the unknown genes. Also, functional analysis of these wheat genes should provide new insight into the response to salt stress. Finally, these results indicate that the 22k oligo-DNA microarray is a reliable method for monitoring global gene expression patterns in wheat.
Hayashi, Ken-Go; Hosoe, Misa; Kizaki, Keiichiro; Fujii, Shiori; Kanahara, Hiroko; Takahashi, Toru; Sakumoto, Ryosuke
2017-03-23
Repeat breeding directly affects reproductive efficiency in cattle due to an increase in services per conception and calving interval. This study aimed to investigate whether changes in endometrial gene expression profile are involved in repeat breeding in cows. Differential gene expression profiles of the endometrium were investigated during the mid-luteal phase of the estrous cycle between repeat breeder (RB) and non-RB cows using microarray analysis. The caruncular (CAR) and intercaruncular (ICAR) endometrium of both ipsilateral and contralateral uterine horns to the corpus luteum were collected from RB (inseminated at least three times but not pregnant) and non-RB cows on Day 15 of the estrous cycle (4 cows/group). Global gene expression profiles of these endometrial samples were analyzed with a 15 K custom-made oligo-microarray for cattle. Immunohistochemistry was performed to investigate the cellular localization of proteins of three identified transcripts in the endometrium. Microarray analysis revealed that 405 and 397 genes were differentially expressed in the CAR and ICAR of the ipsilateral uterine horn of RB, respectively, when compared with non-RB cows. In the contralateral uterine horn, 443 and 257 differentially expressed genes were identified in the CAR and ICAR of RB, respectively, when compared with non-RB cows. Gene ontology analysis revealed that genes involved in development and morphogenesis were mainly up-regulated in the CAR of RB cows. In the ICAR of both the ipsilateral and contralateral uterine horns, genes related to the metabolic process were predominantly enriched in the RB cows when compared with non-RB cows. In the analysis of the whole uterus (combining the data from the above four endometrial compartments), RB cows showed up-regulation of 37 genes including PRSS2, GSTA3 and PIPOX and down-regulation of 39 genes including CHGA, KRT35 and THBS4 when compared with non-RB cows.
Immunohistochemistry revealed that CHGA, GSTA3 and PRSS2 proteins were localized in luminal and glandular epithelial cells and stroma of the endometrium. The present study showed that endometrial gene expression profiles are different between RB and non-RB cows. The identified candidate endometrial genes and functions in each endometrial compartment may contribute to bovine reproductive performance.
Sysol, Justin R.; Abbasi, Taimur; Patel, Amit R.; Lang, Roberto M.; Gupta, Akash; Garcia, Joe G. N.; Gordeuk, Victor R.; Machado, Roberto F.
2016-01-01
Background Diastolic dysfunction is common in sickle cell disease (SCD), and is associated with an increased risk of mortality. However, the molecular pathogenesis underlying this development is poorly understood. The aim of this study was to identify a gene expression profile that is associated with diastolic function in SCD, potentially elucidating molecular mechanisms behind diastolic dysfunction development. Methods Diastolic function was measured via echocardiography in 65 patients with SCD from two independent study populations. Gene expression microarray data was compared with diastolic function in both study cohorts. Candidate genes that associated in both analyses were tested for validation in a murine SCD model. Lastly, genotyping array data from the replication cohort was used to derive cis-expression quantitative trait loci (cis-eQTLs) and genetic associations within the candidate gene regions. Results Transcriptome data from both patient cohorts implicated 7 genes associated with diastolic function, and mouse SCD myocardial expression validated 3 of these genes. Genetic associations and eQTLs were detected in 2 of the 3 genes, FUCA2 and IL18. Conclusions FUCA2 and IL18 are associated with diastolic function in SCD patients, and may be involved in the pathogenesis of the disease. Genetic polymorphisms within the FUCA2 and IL18 gene regions are also associated with diastolic function in SCD, likely by affecting expression levels of the genes. PMID:27636371
Stankiewicz, Adrian M; Goscik, Joanna; Dyr, Wanda; Juszczak, Grzegorz R; Ryglewicz, Danuta; Swiergiel, Artur H; Wieczorek, Marek; Stefanski, Roman
2015-12-01
Animal models provide an opportunity to study neurobiological aspects of human alcoholism. Changes in gene expression have been implicated in mediating brain functions, including the reward system and addiction. The current study aimed to identify genes that may underlie differential ethanol preference in Warsaw High Preferring (WHP) and Warsaw Low Preferring (WLP) rats. Microarray analysis comparing gene expression in nucleus accumbens (NAc), hippocampus (HP) and medial prefrontal cortex (mPFC) was performed in male WHP and WLP rats bred for differences in ethanol preference. Differential expression that was stable between biological repeats was detected for 345, 254 and 129 transcripts in NAc, HP and mPFC, respectively. Identified genes and processes included known mediators of ethanol response (Mx2, Fam111a, Itpr1, Gabra4, Agtr1a, LTP/LTD, renin-angiotensin signaling pathway), toxicity (Sult1c2a, Ces1, inflammatory response), as well as genes involved in regulation of important addiction-related brain systems such as dopamine, tachykinin or acetylcholine (Gng7, Tac4, Slc5a7). The identified candidate genes may underlie differential ethanol preference in an animal model of alcoholism. Names of genes are written in italics, while names of proteins are written in standard font. Names of human genes/proteins are written in all capital letters. Names of rodent genes/proteins are written in capital letter followed by small letters. Copyright © 2015 Elsevier Inc. All rights reserved.
Probing the Xenopus laevis inner ear transcriptome for biological function
2012-01-01
Background The senses of hearing and balance depend upon mechanoreception, a process that originates in the inner ear and shares features across species. Amphibians have been widely used for physiological studies of mechanotransduction by sensory hair cells. In contrast, much less is known of the genetic basis of auditory and vestibular function in this class of animals. Among amphibians, the genus Xenopus is a well-characterized genetic and developmental model that offers unique opportunities for inner ear research because of the amphibian capacity for tissue and organ regeneration. For these reasons, we implemented a functional genomics approach as a means to undertake a large-scale analysis of the Xenopus laevis inner ear transcriptome through microarray analysis. Results Microarray analysis uncovered genes within the X. laevis inner ear transcriptome associated with inner ear function and impairment in other organisms, thereby supporting the inclusion of Xenopus in cross-species genetic studies of the inner ear. The use of gene categories (inner ear tissue; deafness; ion channels; ion transporters; transcription factors) facilitated the assignment of functional significance to probe set identifiers. We enhanced the biological relevance of our microarray data by using a variety of curation approaches to increase the annotation of the Affymetrix GeneChip® Xenopus laevis Genome array. In addition, annotation analysis revealed the prevalence of inner ear transcripts represented by probe set identifiers that lack functional characterization. Conclusions We identified an abundance of targets for genetic analysis of auditory and vestibular function. The orthologues to human genes with known inner ear function and the highly expressed transcripts that lack annotation are particularly interesting candidates for future analyses. 
We used informatics approaches to impart biologically relevant information to the Xenopus inner ear transcriptome, thereby addressing the impediment imposed by insufficient gene annotation. These findings heighten the relevance of Xenopus as a model organism for genetic investigations of inner ear organogenesis, morphogenesis, and regeneration. PMID:22676585
DEVELOPMENT AND VALIDATION OF A 2,000 GENE MICROARRAY FOR THE FATHEAD MINNOW, PIMEPHALES PROMELAS
The development of the gene microarray has provided the field of ecotoxicology a new tool to identify modes of action (MOA) of chemicals and chemical mixtures. Herein we describe the development and application of a 2,000 gene oligonucleotide microarray for the fathead minnow (P...
Talke, Ina N; Hanikenne, Marc; Krämer, Ute
2006-09-01
The metal hyperaccumulator Arabidopsis halleri exhibits naturally selected zinc (Zn) and cadmium (Cd) hypertolerance and accumulates extraordinarily high Zn concentrations in its leaves. With these extreme physiological traits, A. halleri phylogenetically belongs to the sister clade of Arabidopsis thaliana. Using a combination of genome-wide cross species microarray analysis and real-time reverse transcription-PCR, a set of candidate genes is identified for Zn hyperaccumulation, Zn and Cd hypertolerance, and the adjustment of micronutrient homeostasis in A. halleri. Eighteen putative metal homeostasis genes are newly identified to be more highly expressed in A. halleri than in A. thaliana, and 11 previously identified candidate genes are confirmed. The encoded proteins include HMA4, known to contribute to root-shoot transport of Zn in A. thaliana. Expression of either AtHMA4 or AhHMA4 confers cellular Zn and Cd tolerance to yeast (Saccharomyces cerevisiae). Among further newly implicated proteins are IRT3 and ZIP10, which have been proposed to contribute to cytoplasmic Zn influx, and FRD3 required for iron partitioning in A. thaliana. In A. halleri, the presence of more than a single genomic copy is a hallmark of several highly expressed candidate genes with possible roles in metal hyperaccumulation and metal hypertolerance. Both A. halleri and A. thaliana exert tight regulatory control over Zn homeostasis at the transcript level. Zn hyperaccumulation in A. halleri involves enhanced partitioning of Zn from roots into shoots. The transcriptional regulation of marker genes suggests that in the steady state, A. halleri roots, but not the shoots, act as physiologically Zn deficient under conditions of moderate Zn supply.
Sinha, Ranjita; Gupta, Aarti; Senthil-Kumar, Muthappa
2017-01-01
Chickpea (Cicer arietinum), the second largest legume grown worldwide, is prone to drought and various pathogen infections. These drought and pathogen stresses often occur concurrently under field conditions. However, the molecular events underlying the response to such combined stress are largely unknown. The present study examines the transcriptome dynamics in chickpea plants exposed to a combination of water-deficit stress and Ralstonia solanacearum infection. R. solanacearum is a potential wilt-causing pathogen in chickpea. Drought-stressed chickpea plants were infected with this pathogen and the plants were allowed to experience progressive drought with 2 and 4 days of R. solanacearum infection, called short duration stress (SD stresses) and long duration stress (LD stresses), respectively. Our study showed that R. solanacearum multiplication decreased under SD-combined stress compared to SD-pathogen stress, but there was no significant change in LD-combined stress compared to LD-pathogen stress. The microarray analysis during these conditions showed that 821 and 1039 differentially expressed genes (DEGs) were unique to SD- and LD-combined stresses, respectively, when compared with individual stress conditions. Three and fifteen genes were common among all the SD-stress treatments and LD-stress treatments, respectively. Genes involved in secondary cell wall biosynthesis, alkaloid biosynthesis, defense related proteins, and osmo-protectants were up-regulated during combined stress. The expression of genes involved in lignin and cellulose biosynthesis was specifically up-regulated in SD-combined, LD-combined, and LD-pathogen stress. A close transcriptomic association of LD-pathogen stress with SD-combined stress was observed in this study, which indicates that R. solanacearum infection also exerts drought stress along with pathogen stress and thus mimics the combined stress effect. Furthermore, expression profiling of candidate genes using real-time quantitative PCR validated the microarray data.
The study showed that down-regulation of defense-related genes during LD-combined stress resulted in an increased bacterial multiplication as compared to SD-combined stress. Overall, our study highlights a sub-set of DEGs uniquely expressed in response to combined stress, which serve as potential candidates for further functional characterization to delineate the molecular response of the plant to concurrent drought-pathogen stress. PMID:28382041
Validation of MIMGO: a method to identify differentially expressed GO terms in a microarray dataset
2012-01-01
Background We previously proposed an algorithm for the identification of GO terms that commonly annotate genes whose expression is upregulated or downregulated in some microarray data compared with in other microarray data. We call these “differentially expressed GO terms” and have named the algorithm “matrix-assisted identification method of differentially expressed GO terms” (MIMGO). MIMGO can also identify microarray data in which genes annotated with a differentially expressed GO term are upregulated or downregulated. However, MIMGO has not yet been validated on a real microarray dataset using all available GO terms. Findings We combined Gene Set Enrichment Analysis (GSEA) with MIMGO to identify differentially expressed GO terms in a yeast cell cycle microarray dataset. GSEA followed by MIMGO (GSEA + MIMGO) correctly identified (p < 0.05) microarray data in which genes annotated to differentially expressed GO terms are upregulated. We found that GSEA + MIMGO was slightly less effective than, or comparable to, GSEA (Pearson), a method that uses Pearson’s correlation as a metric, at detecting true differentially expressed GO terms. However, unlike other methods including GSEA (Pearson), GSEA + MIMGO can comprehensively identify the microarray data in which genes annotated with a differentially expressed GO term are upregulated or downregulated. Conclusions MIMGO is a reliable method to identify differentially expressed GO terms comprehensively. PMID:23232071
2012-01-01
Background Discovering new biomarkers has a great role in improving early diagnosis of Hepatocellular carcinoma (HCC). The experimental determination of biomarkers needs a lot of time and money. This motivates the use of in-silico prediction of biomarkers to reduce the number of experiments required for detecting new ones. This is achieved by extracting the most representative genes in microarrays of HCC. Results In this work, we provide a method for extracting the differentially expressed genes, specifically the up-regulated ones, that can be considered candidate biomarkers in high throughput microarrays of HCC. We examine the power of several gene selection methods (such as Pearson’s correlation coefficient, Cosine coefficient, Euclidean distance, Mutual information and Entropy with different estimators) in selecting informative genes. A biological interpretation of the highly ranked genes is done using KEGG (Kyoto Encyclopedia of Genes and Genomes) pathways, ENTREZ and DAVID (Database for Annotation, Visualization, and Integrated Discovery) databases. The top ten genes selected using Pearson’s correlation coefficient and Cosine coefficient contained six genes that have been implicated in cancer (often multiple cancers) genesis in previous studies. Fewer such genes were obtained by the other methods (4 genes using Mutual information, 3 genes using Euclidean distance and only one gene using Entropy). A better result was obtained by the utilization of a hybrid approach based on intersecting the highly ranked genes in the output of all investigated methods. This hybrid combination yielded seven genes (2 genes for HCC and 5 genes in different types of cancer) in the top ten genes of the list of intersected genes. Conclusions To strengthen the effectiveness of the univariate selection methods, we propose a hybrid approach by intersecting several of these methods in a cascaded manner.
This approach surpasses all of the univariate selection methods when used individually according to biological interpretation and the examination of gene expression signal profiles. PMID:22867264
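The hybrid idea of intersecting the top-ranked genes from several univariate measures can be sketched as follows. This is not the authors' implementation: it covers only three of the listed measures (Pearson's correlation, cosine coefficient and Euclidean distance), scores each gene against a 0/1 class label vector as an assumed design choice, and uses hypothetical gene names and expression values.

```python
import math

def pearson(x, y):
    mx, my = sum(x) / len(x), sum(y) / len(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy) if sxx and syy else 0.0

def cosine(x, y):
    dot = sum(a * b for a, b in zip(x, y))
    norms = math.sqrt(sum(a * a for a in x)) * math.sqrt(sum(b * b for b in y))
    return dot / norms if norms else 0.0

def euclidean(x, y):
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(x, y)))

def top_k(scores, k, largest_first=True):
    """Return the set of the k best-scoring gene names."""
    ranked = sorted(scores, key=scores.get, reverse=largest_first)
    return set(ranked[:k])

labels = [0.0, 0.0, 1.0, 1.0]  # hypothetical: 0 = normal, 1 = tumor
expression = {                  # hypothetical scaled expression values
    "G1": [0.0, 0.1, 1.0, 0.9],
    "G2": [0.5, 0.5, 0.5, 0.6],
    "G3": [0.1, 0.0, 0.9, 1.0],
    "G4": [1.0, 0.9, 0.0, 0.1],
}

k = 2
by_pearson = top_k({g: pearson(v, labels) for g, v in expression.items()}, k)
by_cosine = top_k({g: cosine(v, labels) for g, v in expression.items()}, k)
# For a distance, smaller is better, so rank in ascending order.
by_distance = top_k({g: euclidean(v, labels) for g, v in expression.items()}, k, False)

hybrid = by_pearson & by_cosine & by_distance
print(sorted(hybrid))
```

A gene survives only if every measure ranks it near the top, which trades sensitivity for agreement across methods.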
Protein profiles associated with survival in lung adenocarcinoma
Chen, Guoan; Gharib, Tarek G; Wang, Hong; Huang, Chiang-Ching; Kuick, Rork; Thomas, Dafydd G.; Shedden, Kerby A.; Misek, David E.; Taylor, Jeremy M. G.; Giordano, Thomas J.; Kardia, Sharon L. R.; Iannettoni, Mark D.; Yee, John; Hogg, Philip J.; Orringer, Mark B.; Hanash, Samir M.; Beer, David G.
2003-01-01
Morphologic assessment of lung tumors is informative but insufficient to adequately predict patient outcome. We previously identified transcriptional profiles that predict patient survival, and here we identify proteins associated with patient survival in lung adenocarcinoma. A total of 682 individual protein spots were quantified in 90 lung adenocarcinomas by using quantitative two-dimensional polyacrylamide gel electrophoresis analysis. A leave-one-out cross-validation procedure using the top 20 survival-associated proteins identified by Cox modeling indicated that protein profiles as a whole can predict survival in stage I tumor patients (P = 0.01). Thirty-three of 46 survival-associated proteins were identified by using mass spectrometry. Expression of 12 candidate proteins was confirmed as tumor-derived with immunohistochemical analysis and tissue microarrays. Oligonucleotide microarray results from both the same tumors and from an independent study showed mRNAs associated with survival for 11 of 27 encoded genes. Combined analysis of protein and mRNA data revealed 11 components of the glycolysis pathway as associated with poor survival. Among these candidates, phosphoglycerate kinase 1 was associated with survival in the protein study, in both mRNA studies and in an independent validation set of 117 adenocarcinomas and squamous lung tumors using tissue microarrays. Elevated levels of phosphoglycerate kinase 1 in the serum were also significantly correlated with poor outcome in a validation set of 107 patients with lung adenocarcinomas using ELISA analysis. These studies identify new prognostic biomarkers and indicate that protein expression profiles can predict the outcome of patients with early-stage lung cancer. PMID:14573703
Thiel, Cora S; Hauschild, Swantje; Tauber, Svantje; Paulsen, Katrin; Raig, Christiane; Raem, Arnold; Biskup, Josefine; Gutewort, Annett; Hürlimann, Eva; Unverdorben, Felix; Buttron, Isabell; Lauber, Beatrice; Philpot, Claudia; Lier, Hartwin; Engelmann, Frank; Layer, Liliana E; Ullrich, Oliver
2015-01-01
Gene expression studies are indispensable for investigation and elucidation of molecular mechanisms. For the process of normalization, reference genes ("housekeeping genes") are essential to verify gene expression analysis. Thus, it is assumed that these reference genes demonstrate similar expression levels over all experimental conditions. However, common recommendations about reference genes were established during 1 g conditions and therefore their applicability in studies with altered gravity has not been demonstrated yet. The microarray technology is frequently used to generate expression profiles under defined conditions and to determine the relative difference in expression levels between two or more different states. In our study, we searched for potential reference genes with stable expression during different gravitational conditions (microgravity, normogravity, and hypergravity) which are additionally not altered in different hardware systems. We were able to identify eight genes (ALB, B4GALT6, GAPDH, HMBS, YWHAZ, ABCA5, ABCA9, and ABCC1) which demonstrated no altered gene expression levels in all tested conditions and therefore represent good candidates for the standardization of gene expression studies in altered gravity.
Rode, Tone Mari; Berget, Ingunn; Langsrud, Solveig; Møretrø, Trond; Holck, Askild
2009-07-01
Microorganisms are constantly exposed to new and altered growth conditions, and respond by changing gene expression patterns. Several methods for studying gene expression exist. During the last decade, the analysis of microarrays has been one of the most common approaches applied for large scale gene expression studies. A relatively new method for gene expression analysis is MassARRAY, which combines real competitive-PCR and MALDI-TOF (matrix-assisted laser desorption/ionization time-of-flight) mass spectrometry. In contrast to microarray methods, MassARRAY technology is suitable for analysing a larger number of samples, though for a smaller set of genes. In this study we compare the results from MassARRAY with microarrays on gene expression responses of Staphylococcus aureus exposed to acid stress at pH 4.5. RNA isolated from the same stress experiments was analysed using both the MassARRAY and the microarray methods. The MassARRAY and microarray methods showed good correlation. Both MassARRAY and microarray estimated somewhat lower fold changes compared with quantitative real-time PCR (qRT-PCR). The results confirmed the up-regulation of the urease genes in acidic environments, and also indicated the importance of metal ion regulation. This study shows that the MassARRAY technology is suitable for gene expression analysis in prokaryotes, and has advantages when a set of genes is being analysed for an organism exposed to many different environmental conditions.
Kano, Akihito; Gomi, Kenji; Yamasaki-Kokudo, Yumiko; Satoh, Masaru; Fukumoto, Takeshi; Ohtani, Kouhei; Tajima, Shigeyuki; Izumori, Ken; Tanaka, Keiji; Ishida, Yutaka; Tada, Yasuomi; Nishizawa, Yoko; Akimitsu, Kazuya
2010-01-01
We investigated the responses of rice plants to three rare sugars, d-altrose, d-sorbose, and d-allose, now that mass production methods for these rare sugars have been established. Root growth and shoot growth were significantly inhibited by d-allose but not by the other rare sugars. A large-scale gene expression analysis using a rice microarray revealed that d-allose treatment causes strong upregulation of many defense-related, pathogenesis-related (PR) protein genes in rice. The PR protein genes were not upregulated by the other rare sugars. Furthermore, d-allose treatment conferred limited resistance in rice plants against the pathogen Xanthomonas oryzae pv. oryzae, but the other tested sugars did not. These results indicate that d-allose has a growth inhibitory effect but might prove to be a candidate elicitor for reducing disease development in rice.
The application of DNA microarrays in gene expression analysis.
van Hal, N L; Vorst, O; van Houwelingen, A M; Kok, E J; Peijnenburg, A; Aharoni, A; van Tunen, A J; Keijer, J
2000-03-31
DNA microarray technology is a new and powerful technology that will substantially increase the speed of molecular biological research. This paper gives a survey of DNA microarray technology and its use in gene expression studies. The technical aspects and their potential improvements are discussed. These comprise array manufacturing and design, array hybridisation, scanning, and data handling. Furthermore, it is discussed how DNA microarrays can be applied in two working fields: the safety, functionality and health of food; and gene discovery and pathway engineering in plants.
Bikel, Shirley; Jacobo-Albavera, Leonor; Sánchez-Muñoz, Fausto; Cornejo-Granados, Fernanda; Canizales-Quinteros, Samuel; Soberón, Xavier; Sotelo-Mundo, Rogerio R; Del Río-Navarro, Blanca E; Mendoza-Vargas, Alfredo; Sánchez, Filiberto; Ochoa-Leyva, Adrian
2017-01-01
In spite of the emergence of RNA sequencing (RNA-seq), microarrays remain in widespread use for gene expression analysis in the clinic. There are over 767,000 RNA microarrays from human samples in public repositories, which are an invaluable resource for biomedical research and personalized medicine. Absolute gene expression analysis allows the transcriptome profiling of all expressed genes under a specific biological condition without the need for a reference sample. However, background fluorescence makes it challenging to determine absolute gene expression in microarrays. Given that the Y chromosome is absent in female subjects, we used it as a new approach for absolute gene expression analysis in which the fluorescence of the Y chromosome genes of female subjects was used as the background fluorescence for all the probes in the microarray. This fluorescence was used to establish an absolute gene expression threshold, allowing the differentiation between expressed and non-expressed genes in microarrays. We extracted RNA from leukocyte samples of 16 children (nine males and seven females, ages 6-10 years). Each sample was run on an Affymetrix GeneChip Human Gene 1.0 ST Array, and the fluorescence of 124 genes of the Y chromosome was used to calculate the absolute gene expression threshold. After that, several expressed and non-expressed genes according to our absolute gene expression threshold were compared against the expression obtained using real-time quantitative polymerase chain reaction (RT-qPCR). Of the 124 genes of the Y chromosome, three genes (DDX3Y, TXLNG2P and EIF1AY) that displayed significant differences between sexes were used to calculate the absolute gene expression threshold. Using this threshold, we selected 13 expressed and non-expressed genes and confirmed their expression level by RT-qPCR. Then, we selected the top 5% most expressed genes and found that several KEGG pathways were significantly enriched. 
Interestingly, these pathways were related to the typical functions of leukocytes, such as antigen processing and presentation and natural killer cell mediated cytotoxicity. We also applied this method to obtain the absolute gene expression threshold in already published microarray data of liver cells, where the top 5% expressed genes showed an enrichment of typical KEGG pathways for liver cells. Our results suggest that the three selected genes of the Y chromosome can be used to calculate an absolute gene expression threshold, allowing transcriptome profiling of microarray data without the need for an additional reference experiment. Our approach, based on establishing a threshold for absolute gene expression analysis, offers a new way to analyze thousands of microarrays from public databases. This allows the study of different human diseases without the need for additional samples for relative expression experiments.
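The thresholding idea in the abstract above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the abstract does not give the formula used to turn the female Y-chromosome signal into a threshold, so the mean plus two standard deviations rule, the function names, and all intensity values below are hypothetical.

```python
from statistics import mean, stdev


def absolute_expression_threshold(female_y_intensities, n_sd=2.0):
    """Estimate a background threshold from Y-probe signal in female samples.

    ASSUMPTION: threshold = mean + n_sd * sample SD. The abstract does not
    state the actual formula; this rule is chosen only for illustration.
    """
    return mean(female_y_intensities) + n_sd * stdev(female_y_intensities)


def call_expressed(gene_intensities, threshold):
    """Label each gene as expressed (True) if its signal exceeds the threshold."""
    return {gene: value > threshold for gene, value in gene_intensities.items()}


# Hypothetical log2 intensities of Y-chromosome probes in female samples,
# where any signal must be background because the genes are absent.
female_y = [3.1, 2.9, 3.4, 3.0, 3.2, 2.8, 3.3]
threshold = absolute_expression_threshold(female_y)

# Hypothetical genes: one well above background, one within it.
calls = call_expressed({"GENE_A": 8.7, "GENE_B": 3.0}, threshold)
```

Because the threshold comes from probes that cannot be expressed in the sample, no separate reference hybridization is needed, which is the point the authors make.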
Severgnini, Marco; Bicciato, Silvio; Mangano, Eleonora; Scarlatti, Francesca; Mezzelani, Alessandra; Mattioli, Michela; Ghidoni, Riccardo; Peano, Clelia; Bonnal, Raoul; Viti, Federica; Milanesi, Luciano; De Bellis, Gianluca; Battaglia, Cristina
2006-06-01
Meta-analysis of microarray data is increasingly important, considering both the availability of multiple platforms using disparate technologies and the accumulation in public repositories of data sets from different laboratories. We addressed the issue of comparing gene expression profiles from two microarray platforms by devising a standardized investigative strategy. We tested this procedure by studying MDA-MB-231 cells, which undergo apoptosis on treatment with resveratrol. Gene expression profiles were obtained using high-density, short-oligonucleotide, single-color microarray platforms: GeneChip (Affymetrix) and CodeLink (Amersham). Interplatform analyses were carried out on 8414 common transcripts represented on both platforms, as identified by LocusLink ID, representing 70.8% and 88.6% of annotated GeneChip and CodeLink features, respectively. We identified 105 differentially expressed genes (DEGs) on CodeLink and 42 DEGs on GeneChip. Among them, only 9 DEGs were commonly identified by both platforms. Multiple analyses (BLAST alignment of probes with target sequences, gene ontology, literature mining, and quantitative real-time PCR) permitted us to investigate the factors contributing to the generation of platform-dependent results in single-color microarray experiments. An effective approach to cross-platform comparison involves microarrays of similar technologies, samples prepared by identical methods, and a standardized battery of bioinformatic and statistical analyses.
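The size of the cross-platform disagreement reported above can be made concrete with a small calculation. The sketch below uses only the counts given in the abstract (105 and 42 DEGs, 9 shared); the function name and the choice of the Jaccard index as a summary are assumptions made for illustration, not the authors' method.

```python
def overlap_summary(n_a, n_b, n_common):
    """Summarize agreement between two DEG lists from their sizes.

    n_a, n_b: number of DEGs called on platforms A and B.
    n_common: number of DEGs called on both.
    """
    union = n_a + n_b - n_common
    return {
        "jaccard": n_common / union,   # shared / all DEGs called anywhere
        "frac_of_a": n_common / n_a,   # share of A's DEGs confirmed by B
        "frac_of_b": n_common / n_b,   # share of B's DEGs confirmed by A
    }


# Counts from the abstract: CodeLink (A) = 105, GeneChip (B) = 42, shared = 9.
summary = overlap_summary(105, 42, 9)
```

With these counts, roughly 9% of the CodeLink DEGs and 21% of the GeneChip DEGs are shared, which illustrates why the authors call for standardized sample preparation and analysis.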
Rodd, Z A; Bertsch, B A; Strother, W N; Le-Niculescu, H; Balaraman, Y; Hayden, E; Jerome, R E; Lumeng, L; Nurnberger, J I; Edenberg, H J; McBride, W J; Niculescu, A B
2007-08-01
We describe a comprehensive translational approach for identifying candidate genes for alcoholism. The approach relies on the cross-matching of animal model brain gene expression data with human genetic linkage data, as well as human tissue data and biological roles data, an approach termed convergent functional genomics. An analysis of three animal model paradigms, based on inbred alcohol-preferring (iP) and alcohol-non-preferring (iNP) rats, and their response to treatments with alcohol, was used. A comprehensive analysis of microarray gene expression data from five key brain regions (frontal cortex, amygdala, caudate-putamen, nucleus accumbens and hippocampus) was carried out. The Bayesian-like integration of multiple independent lines of evidence, each by itself lacking sufficient discriminatory power, led to the identification of high probability candidate genes, pathways and mechanisms for alcoholism. These data reveal that alcohol has pleiotropic effects on multiple systems, which may explain the diverse neuropsychiatric and medical pathology in alcoholism. Some of the pathways identified suggest avenues for pharmacotherapy of alcoholism with existing agents, such as angiotensin-converting enzyme (ACE) inhibitors. Experiments we carried out in alcohol-preferring rats with an ACE inhibitor show a marked modulation of alcohol intake. Other pathways are new potential targets for drug development. The emergent overall picture is that physical and physiological robustness may permit alcohol-preferring individuals to withstand the aversive effects of alcohol. In conjunction with a higher reactivity to its rewarding effects, they may be able to ingest enough of this nonspecific drug for a strong hedonic and addictive effect to occur.
Xu, H; Li, C; Zeng, Q; Agrawal, I; Zhu, X; Gong, Z
2016-06-01
In this study, to systematically identify the most stably expressed genes for internal reference in zebrafish Danio rerio investigations, 37 D. rerio transcriptomic datasets (both RNA sequencing and microarray data) were collected from gene expression omnibus (GEO) database and unpublished data, and gene expression variations were analysed under three experimental conditions: tissue types, developmental stages and chemical treatments. Forty-four putative candidate genes were identified with the c.v. <0·2 from all datasets. Following clustering into different functional groups, 21 genes, in addition to four conventional housekeeping genes (eef1a1l1, b2m, hrpt1l and actb1), were selected from different functional groups for further quantitative real-time (qrt-)PCR validation using 25 RNA samples from different adult tissues, developmental stages and chemical treatments. The qrt-PCR data were then analysed using the statistical algorithm refFinder for gene expression stability. Several new candidate genes showed better expression stability than the conventional housekeeping genes in all three categories. It was found that sep15 and metap1 were the top two stable genes for tissue types, ube2a and tmem50a the top two for different developmental stages, and rpl13a and rp1p0 the top two for chemical treatments. Thus, based on the extensive transcriptomic analyses and qrt-PCR validation, these new reference genes are recommended for normalization of D. rerio qrt-PCR data respectively for the three different experimental conditions. © 2016 The Fisheries Society of the British Isles.
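The first screening step described above, keeping genes whose coefficient of variation (c.v.) is below 0.2, can be sketched as follows. This is a minimal illustration: the gene names and expression values are hypothetical, and using the sample standard deviation is an assumption, since the abstract does not say how the c.v. was computed.

```python
from statistics import mean, stdev


def coefficient_of_variation(values):
    """c.v. = standard deviation / mean (sample SD assumed here)."""
    return stdev(values) / mean(values)


def stable_genes(expression, max_cv=0.2):
    """Return genes whose c.v. across samples is below max_cv, sorted by name."""
    return sorted(
        gene
        for gene, values in expression.items()
        if coefficient_of_variation(values) < max_cv
    )


# Hypothetical normalized expression values across four samples.
expression = {
    "gene_stable": [100.0, 105.0, 95.0, 102.0],
    "gene_variable": [10.0, 80.0, 200.0, 40.0],
}
candidates = stable_genes(expression)
```

A filter like this only nominates candidates; in the study the shortlisted genes were then validated by quantitative PCR and ranked with refFinder.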
Nigam, Deepti; Sawant, Samir V
2013-01-01
Technological developments have led to increased interest in systems biology approaches in plants to characterize developmental mechanisms and candidate genes relevant to specific tissue or cell morphology. AUX-IAA proteins are important plant-specific putative transcription factors. There are several reports on the physiological response of this family in Arabidopsis, but in cotton fiber the transcriptional network through which AUX-IAA regulates its target genes is still unknown. In silico modelling of cotton fiber development-specific gene expression data (108 microarrays and 22,737 genes) using the Algorithm for the Reconstruction of Accurate Cellular Networks (ARACNe) reveals 3690 putative AUX-IAA target genes, of which 139 genes were known to be AUX-IAA co-regulated within Arabidopsis. Furthermore, the AUX-IAA targeted gene regulatory network (GRN) had a substantial impact on the transcriptional dynamics of cotton fiber, as shown by altered TF networks and by the Gene Ontology (GO) biological processes and metabolic pathways associated with its target genes. Analysis of the AUX-IAA-correlated gene network reveals multiple functions for AUX-IAA target genes such as unidimensional cell growth, cellular nitrogen compound metabolic process, nucleosome organization, DNA-protein complex and processes related to the cell wall. These candidate networks/pathways have a variety of profound impacts on such cellular functions as stress response, cell proliferation, and cell differentiation. While these functions are fairly broad, their underlying TF networks may provide a global view of AUX-IAA regulated gene expression and a GRN that guides future studies in understanding the role of AUX-IAA box protein and its targets in regulating fiber development. PMID:24497725
Kohno, Takashi; Otsuka, Ayaka; Girard, Luc; Sato, Masanori; Iwakawa, Reika; Ogiwara, Hideaki; Sanchez-Cespedes, Montse; Minna, John D.; Yokota, Jun
2010-01-01
A total of 176 genes homozygously deleted in human lung cancer were identified by DNA array-based whole genome scanning of 52 lung cancer cell lines and subsequent genomic PCR in 74 cell lines, including the 52 cell lines scanned. One or more exons of these genes were homozygously deleted in one (1%) to 20 (27%) cell lines. These genes included known tumor suppressor genes, e.g., CDKN2A/p16, RB1, and SMAD4, and candidate tumor suppressor genes whose hemizygous or homozygous deletions were reported in several types of human cancers, such as FHIT, KEAP1, and LRP1B/LRP-DIP. CDKN2A/p16 and p14ARF located in 9p21 were most frequently deleted (20/74, 27%). The PTPRD gene was most frequently deleted (8/74, 11%) among genes mapping to regions other than 9p21. Somatic mutations, including a nonsense mutation, of the PTPRD gene were detected in 8/74 (11%) of cell lines and 4/95 (4%) of surgical specimens of lung cancer. Reduced PTPRD expression was observed in the majority (>80%) of cell lines and surgical specimens of lung cancer. Therefore, PTPRD is a candidate tumor suppressor gene in lung cancer. Microarray-based expression profiling of 19 lung cancer cell lines also indicated that some of the 176 genes, such as KANK and ADAMTS1, are preferentially inactivated by epigenetic alterations. Genetic/epigenetic as well as functional studies of these 176 genes will increase our understanding of molecular mechanisms behind lung carcinogenesis. PMID:20073072
Thiel, Cora S.; Hauschild, Swantje; Tauber, Svantje; Paulsen, Katrin; Raig, Christiane; Raem, Arnold; Biskup, Josefine; Gutewort, Annett; Hürlimann, Eva; Philpot, Claudia; Lier, Hartwin; Engelmann, Frank; Layer, Liliana E.
2015-01-01
Gene expression studies are indispensable for investigation and elucidation of molecular mechanisms. For the process of normalization, reference genes (“housekeeping genes”) are essential to verify gene expression analysis. Thus, it is assumed that these reference genes demonstrate similar expression levels over all experimental conditions. However, common recommendations about reference genes were established during 1 g conditions and therefore their applicability in studies with altered gravity has not been demonstrated yet. The microarray technology is frequently used to generate expression profiles under defined conditions and to determine the relative difference in expression levels between two or more different states. In our study, we searched for potential reference genes with stable expression during different gravitational conditions (microgravity, normogravity, and hypergravity) which are additionally not altered in different hardware systems. We were able to identify eight genes (ALB, B4GALT6, GAPDH, HMBS, YWHAZ, ABCA5, ABCA9, and ABCC1) which demonstrated no altered gene expression levels in all tested conditions and therefore represent good candidates for the standardization of gene expression studies in altered gravity. PMID:25654098
Xu, Xiaodan; Li, Yingcong; Zhao, Heng; Wen, Si-yuan; Wang, Sheng-qi; Huang, Jian; Huang, Kun-lun; Luo, Yun-bo
2005-05-18
To devise a rapid and reliable method for the detection and identification of genetically modified (GM) events, we developed a multiplex polymerase chain reaction (PCR) coupled with a DNA microarray system simultaneously aiming at many targets in a single reaction. The system included probes for screening gene, species reference gene, specific gene, construct-specific gene, event-specific gene, and internal and negative control genes. 18S rRNA was combined with species reference genes as internal controls to assess the efficiency of all reactions and to eliminate false negatives. Two sets of the multiplex PCR system were used to amplify four and five targets, respectively. Eight different structure genes could be detected and identified simultaneously for Roundup Ready soybean in a single microarray. The microarray specificity was validated by its ability to discriminate two GM maizes Bt176 and Bt11. The advantages of this method are its high specificity and greatly reduced false-positives and -negatives. The multiplex PCR coupled with microarray technology presented here is a rapid and reliable tool for the simultaneous detection of GM organism ingredients.
Perdiguero, Pedro; Barbero, María Del Carmen; Cervera, María Teresa; Collada, Carmen; Soto, Alvaro
2013-06-01
Adaptation to water stress has determined the evolution and diversification of vascular plants. Water stress is forecasted to increase drastically in the next decades in certain regions, such as in the Mediterranean basin. Consequently, a proper knowledge of the response and adaptations to drought stress is essential for the correct management of plant genetic resources. However, most of the advances in the understanding of the molecular response to water stress have been attained in angiosperms, and are not always applicable to gymnosperms. In this work we analyse the transcriptional response of two emblematic Mediterranean pines, Pinus pinaster and Pinus pinea, which show noticeable differences in their performance under water stress. Using microarray analysis, up to 113 genes have been detected as significantly induced by drought in both species. Reliability of expression patterns has been confirmed by RT-PCR. While induced genes with similar profiles in both species can be considered as general candidate genes for the study of drought response in conifers, genes with diverging expression patterns can underpin the differences displayed by these species under water stress. Most promising candidate genes for drought stress response include genes related to carbohydrate metabolism, such as glycosyltransferases or galactosidases, sugar transporters, dehydrins and transcription factors. Additionally, differences in the molecular response to drought and polyethylene-glycol-induced water stress are also discussed. Copyright © 2013 Elsevier Masson SAS. All rights reserved.
Vukmirovic, Milica; Herazo-Maya, Jose D; Blackmon, John; Skodric-Trifunovic, Vesna; Jovanovic, Dragana; Pavlovic, Sonja; Stojsic, Jelena; Zeljkovic, Vesna; Yan, Xiting; Homer, Robert; Stefanovic, Branko; Kaminski, Naftali
2017-01-12
Idiopathic Pulmonary Fibrosis (IPF) is a lethal lung disease of unknown etiology. A major limitation in transcriptomic profiling of lung tissue in IPF has been a dependence on snap-frozen fresh tissues (FF). In this project we sought to determine whether genome scale transcript profiling using RNA Sequencing (RNA-Seq) could be applied to archived Formalin-Fixed Paraffin-Embedded (FFPE) IPF tissues. We isolated total RNA from 7 IPF and 5 control FFPE lung tissues and performed 50 base pair paired-end sequencing on an Illumina HiSeq 2000. TopHat2 was used to map sequencing reads to the human genome. On average ~62 million reads (53.4% of ~116 million reads) were mapped per sample. 4,131 genes were differentially expressed between IPF and controls (1,920 increased and 2,211 decreased, FDR < 0.05). We compared our results to differentially expressed genes calculated from a previously published dataset generated from FF tissues analyzed on Agilent microarrays (GSE47460). The overlap of differentially expressed genes was very high (760 increased and 1,413 decreased, FDR < 0.05). Only 92 differentially expressed genes changed in opposite directions. Pathway enrichment analysis performed using MetaCore confirmed numerous IPF relevant genes and pathways including extracellular remodeling, TGF-beta, and WNT. Gene network analysis of MMP7, a highly differentially expressed gene in both datasets, revealed the same canonical pathways and gene network candidates in RNA-Seq and microarray data. For validation by NanoString nCounter® we selected 35 genes that had a fold change of 2 in at least one dataset (10 discordant, 10 significantly differentially expressed in one dataset only and 15 concordant genes). High concordance of fold change and FDR was observed for each type of the samples (FF vs FFPE) with both microarrays (r = 0.92) and RNA-Seq (r = 0.90) and the number of discordant genes was reduced to four. 
Our results demonstrate that RNA sequencing of RNA obtained from archived FFPE lung tissues is feasible. The results obtained from FFPE tissue are highly comparable to FF tissues. The ability to perform RNA-Seq on archived FFPE IPF tissues should greatly enhance the availability of tissue biopsies for research in IPF.
Saccharomyces cerevisiae gene expression changes during rotating wall vessel suspension culture
NASA Technical Reports Server (NTRS)
Johanson, Kelly; Allen, Patricia L.; Lewis, Fawn; Cubano, Luis A.; Hyman, Linda E.; Hammond, Timothy G.
2002-01-01
This study utilizes Saccharomyces cerevisiae to study genetic responses to suspension culture. The suspension culture system used in this study is the high-aspect-ratio vessel, one type of the rotating wall vessel, that provides a high rate of gas exchange necessary for rapidly dividing cells. Cells were grown in the high-aspect-ratio vessel, and DNA microarray and metabolic analyses were used to determine the resulting changes in yeast gene expression. A significant number of genes were found to be up- or downregulated by at least twofold as a result of rotational growth. By using Gibbs promoter alignment, clusters of genes were examined for promoter elements mediating these genetic changes. Candidate binding motifs similar to the Rap1p binding site and the stress-responsive element were identified in the promoter regions of differentially regulated genes. This study shows that, as in higher order organisms, S. cerevisiae changes gene expression in response to rotational culture and also provides clues for investigations into the signaling pathways involved in gravitational response.
Uchida, Masaya; Hirano, Masashi; Ishibashi, Hiroshi; Kobayashi, Jun; Kagami, Yoshihiro; Koyanagi, Akiko; Kusano, Teruhiko; Koga, Minoru; Arizono, Koji
2016-11-01
Nonylphenol (NP) has been classified as an endocrine-disrupting chemical. In this study, we used a mysid DNA microarray carrying 2240 oligo DNA probes to observe differential gene expression in the mysid crustacean (Americamysis bahia) exposed to 1, 3, 10 and 30 μg/l of NP for 14 days. As a result, we found that 31, 27, 39 and 68 genes were differentially expressed at the respective concentrations. Among these genes, the expression of five particular genes was regulated in a similar manner at all concentrations of NP exposure. We therefore focused on one gene encoding cuticle protein and another encoding cuticular protein analogous to peritrophins 1-H precursor. These genes were down-regulated by NP exposure in a dose-dependent manner, which suggests that they are related to a reduction in the number of molts in mysids. Thus, they might become useful molecular biomarker candidates to evaluate molting inhibition in mysids. Copyright © 2016 Elsevier Inc. All rights reserved.
2005-01-01
Gene expression databases contain a wealth of information, but current data mining tools are limited in their speed and effectiveness in extracting meaningful biological knowledge from them. Online analytical processing (OLAP) can be used as a supplement to cluster analysis for fast and effective data mining of gene expression databases. We used Analysis Services 2000, a product that ships with SQL Server 2000, to construct an OLAP cube that was used to mine a time series experiment designed to identify genes associated with resistance of soybean to the soybean cyst nematode, a devastating pest of soybean. The data for these experiments are stored in the soybean genomics and microarray database (SGMD). A number of candidate resistance genes and pathways were found. Compared to traditional cluster analysis of gene expression data, OLAP was more effective and faster in finding biologically meaningful information. OLAP is available from a number of vendors and can work with any relational database management system through OLE DB. PMID:16046824
A genome-wide 20 K citrus microarray for gene expression analysis
Martinez-Godoy, M Angeles; Mauri, Nuria; Juarez, Jose; Marques, M Carmen; Santiago, Julia; Forment, Javier; Gadea, Jose
2008-01-01
Background Understanding of genetic elements that contribute to key aspects of citrus biology will impact future improvements in this economically important crop. Global gene expression analysis demands microarray platforms with high genome coverage. In recent years, genome-wide EST collections have been generated in citrus, opening the possibility to create new tools for functional genomics in this crop plant. Results We have designed and constructed a publicly available genome-wide cDNA microarray that includes 21,081 putative unigenes of citrus. As a functional companion to the microarray, a web-browsable database [1] was created and populated with information about the unigenes represented in the microarray, including cDNA libraries, isolated clones, raw and processed nucleotide and protein sequences, and results of all the structural and functional annotation of the unigenes, such as general description, BLAST hits, putative Arabidopsis orthologs, microsatellites, putative SNPs, GO classification and PFAM domains. We have performed a Gene Ontology comparison with the full set of Arabidopsis proteins to estimate the genome coverage of the microarray. We have also performed microarray hybridizations to check its usability. Conclusion This new cDNA microarray replaces the first 7K microarray generated two years ago and allows gene expression analysis at a more global scale. We have followed a rational design to minimize cross-hybridization while maintaining its utility for different citrus species. Furthermore, we also provide access to a website with full structural and functional annotation of the unigenes represented in the microarray, along with the ability to use this site to directly perform gene expression analysis using standard tools at different publicly available servers. 
Furthermore, we show how this microarray offers a good representation of the citrus genome and present the usefulness of this genomic tool for global studies in citrus by using it to catalogue genes expressed in citrus globular embryos. PMID:18598343
Sima, Chao; Amundson, Sally A.; Zenhausern, Frederic
2018-01-01
Purpose To compile a list of genes that have been reported to be affected by external ionizing radiation (IR) and to assess their performance as candidate biomarkers for individual human radiation dosimetry. Methods Eligible studies were identified through extensive searches of the online databases from 1978 to 2017. Original English-language publications of microarray studies assessing radiation-induced changes in gene expression levels in human blood after external IR were included. Genes identified in at least half of the selected studies were retained for bio-statistical analysis in order to evaluate their diagnostic ability. Results 24 studies met the criteria and were included in this study. Radiation-induced expression of 10,170 unique genes was identified and the 31 genes that have been identified in at least 50% of studies (12/24 studies) were selected for diagnostic power analysis. Twenty-seven genes showed a significant Spearman’s correlation with radiation dose. Individually, TNFSF4, FDXR, MYC, ZMAT3 and GADD45A provided the best discrimination of radiation dose < 2 Gy and dose ≥ 2 Gy according to their maximized Youden’s index (0.67, 0.55, 0.55, 0.55 and 0.53 respectively). Moreover, 12 combinations of three genes display an area under the receiver operating characteristic (ROC) curve (AUC) = 1, reinforcing the concept of biomarker combinations instead of looking for an ideal and unique biomarker. Conclusion Gene expression is a promising approach for radiation dosimetry assessment. A list of robust candidate biomarkers has been identified from analysis of the studies published to date, confirming for example the potential of well-known genes such as FDXR and TNFSF4 or highlighting other promising genes such as ZMAT3. However, heterogeneity in protocols and analysis methods will require additional studies to confirm these results. PMID:29879226
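Youden's index, used above to rank single-gene classifiers, is J = sensitivity + specificity − 1, maximized over candidate cut-offs. The sketch below shows that calculation on made-up data; the scores, labels, and function names are hypothetical, and the code assumes both dose classes are present in the input.

```python
def youden_index(scores, labels, threshold):
    """J = sensitivity + specificity - 1 at one cut-off.

    labels: True for the positive class (here, dose >= 2 Gy).
    A sample is predicted positive when its score is >= threshold.
    Assumes at least one positive and one negative sample.
    """
    pairs = list(zip(scores, labels))
    tp = sum(1 for s, y in pairs if y and s >= threshold)
    fn = sum(1 for s, y in pairs if y and s < threshold)
    tn = sum(1 for s, y in pairs if not y and s < threshold)
    fp = sum(1 for s, y in pairs if not y and s >= threshold)
    sensitivity = tp / (tp + fn)
    specificity = tn / (tn + fp)
    return sensitivity + specificity - 1


def max_youden(scores, labels):
    """Return (best J, cut-off), trying every observed score as a cut-off."""
    return max((youden_index(scores, labels, t), t) for t in sorted(set(scores)))


# Hypothetical expression scores for one gene; True marks dose >= 2 Gy.
scores = [0.2, 0.5, 0.9, 1.4, 1.1, 1.8, 2.3, 2.9]
labels = [False, False, False, False, True, True, True, True]
best_j, best_cutoff = max_youden(scores, labels)
```

When two cut-offs give the same J, this sketch returns the higher one, simply because tuples are compared element by element; a real analysis would choose a tie-breaking rule deliberately.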
Ishibashi, Osamu; Akagi, Ichiro; Ogawa, Yota; Inui, Takashi
2018-05-11
The phosphatidylinositol-3-kinase (PI3K)/AKT pathway is frequently activated in various human cancers and plays essential roles in their development and progression. Accumulating evidence suggests that dysregulated expression of microRNAs (miRNAs) is closely associated with cancer progression and metastasis. Here, we focused on miRNAs that could regulate genes related to the PI3K/AKT pathway in esophageal squamous cell carcinoma (ESCC). To identify upregulated miRNAs and their possible target genes in ESCC, we performed microarray-based integrative analyses of miRNA and mRNA expression levels in three human ESCC cell lines and a normal esophageal epithelial cell line. The miRNA microarray analysis revealed that miR-31-5p, miR-141-3p, miR-200b-3p, miR-200c-3p, and miR-205-5p were expressed at higher levels in the ESCC cell lines than in the normal esophageal epithelial cell line. Bioinformatic analyses of mRNA microarray data identified several AKT/PI3K pathway-related genes as candidate targets of these miRNAs, which include tumor suppressors such as DNA-damage-inducible transcript 4 and pleckstrin homology domain leucine-rich repeat protein phosphatase-2 (PHLPP2). To validate the targets of relevant miRNAs experimentally, synthetic mimics of the miRNAs were transfected into the esophageal epithelial cell line. Here, we report that miR-141-3p suppresses the expression of its target PHLPP2, a negative regulator of the AKT/PI3K pathway, in ESCC. Copyright © 2018 Elsevier Inc. All rights reserved.
Millson, Alison; Lagrave, Danielle; Willis, Mary J H; Rowe, Leslie R; Lyon, Elaine; South, Sarah T
2012-01-01
Neuroligin 1 (NLGN1) is one of five members of the neuroligin gene family and may represent a candidate gene for neurological disorders, as members of this family are involved in formation and remodeling of central nervous system synapses. NLGN1 is expressed predominantly in the central nervous system, where it dimerizes and then binds with β-neurexin to form a functional synapse. Mutations in neurexin 1 (NRXN1) as well as two other members of the neuroligin family, NLGN3 and NLGN4, have been associated with autism, and mutations in NLGN4 have also been associated with intellectual disability, seizures, and EEG abnormalities. Genomic microarray is recommended for the detection of chromosomal gains or losses in patients with intellectual disability and multiple congenital anomalies. Results of uncertain significance are not uncommon. Parental studies can provide additional information by demonstrating that the imbalance is either de novo or inherited, and therefore is more or less likely to be causative of the clinical phenotype. However, the possibility that even inherited deletions and duplications may play a role in the phenotype of the proband cannot be excluded, as many copy number variants associated with neurodevelopmental conditions show incomplete penetrance and may be inherited from an unaffected parent. Here, we report on a patient with a 2.2 Mb deletion at 3q26.3-3q26.32, encompassing the terminal end of NLGN1 and the entire NAALADL2 gene, detected by genomic microarray and confirmed by FISH and real-time quantitative PCR. The same size deletion was subsequently found in her healthy, asymptomatic, adult mother. Copyright © 2011 Wiley Periodicals, Inc.
Ring, Ludwig; Yeh, Su-Ying; Hücherig, Stephanie; Hoffmann, Thomas; Blanco-Portales, Rosario; Fouche, Mathieu; Villatoro, Carmen; Denoyes, Béatrice; Monfort, Amparo; Caballero, José Luis; Muñoz-Blanco, Juan; Gershenson, Jonathan; Schwab, Wilfried
2013-01-01
Plant phenolics have drawn increasing attention due to their potential nutritional benefits. Although the basic reactions of the phenolics biosynthetic pathways in plants have been intensively analyzed, the regulation of their accumulation and flux through the pathway is not that well established. The aim of this study was to use a strawberry (Fragaria × ananassa) microarray to investigate gene expression patterns associated with the accumulation of phenylpropanoids, flavonoids, and anthocyanins in strawberry fruit. An examination of the transcriptome, coupled with metabolite profiling data from different commercial varieties, was undertaken to identify genes whose expression correlated with altered phenolics composition. Seventeen comparative microarray analyses revealed 15 genes that were differentially (more than 200-fold) expressed in phenolics-rich versus phenolics-poor varieties. The results were validated by heterologous expression of the peroxidase FaPRX27 gene, which showed the highest altered expression level (more than 900-fold). The encoded protein was functionally characterized and is assumed to be involved in lignin formation during strawberry fruit ripening. Quantitative trait locus analysis indicated that the genomic region of FaPRX27 is associated with the fruit color trait. Down-regulation of the CHALCONE SYNTHASE gene and concomitant induction of FaPRX27 expression diverted the flux from anthocyanins to lignin. The results highlight the competition of the different phenolics pathways for their common precursors. The list of the 15 candidates provides new genes that are likely to impact polyphenol accumulation in strawberry fruit and could be used to develop molecular markers to select phenolics-rich germplasm. PMID:23835409
Microarray Analysis of Iris Gene Expression in Mice with Mutations Influencing Pigmentation
Trantow, Colleen M.; Cuffy, Tryphena L.; Fingert, John H.; Kuehn, Markus H.
2011-01-01
Purpose. Several ocular diseases involve the iris, notably including oculocutaneous albinism, pigment dispersion syndrome, and exfoliation syndrome. To screen for candidate genes that may contribute to the pathogenesis of these diseases, genome-wide iris gene expression patterns were comparatively analyzed from mouse models of these conditions. Methods. Iris samples from albino mice with a Tyr mutation, pigment dispersion–prone mice with Tyrp1 and Gpnmb mutations, and mice resembling exfoliation syndrome with a Lyst mutation were compared with samples from wild-type mice. All mice were strain (C57BL/6J), age (60 days old), and sex (female) matched. Microarrays were used to compare transcriptional profiles, and differentially expressed transcripts were described by functional annotation clustering using DAVID Bioinformatics Resources. Quantitative real-time PCR was performed to validate a subset of identified changes. Results. Compared with wild-type C57BL/6J mice, each disease context exhibited a large number of statistically significant changes in gene expression, including 685 transcripts differentially expressed in albino irides, 403 in pigment dispersion–prone irides, and 460 in exfoliative-like irides. Conclusions. Functional annotation clusterings were particularly striking among the overrepresented genes, with albino and pigment dispersion–prone irides both exhibiting overall evidence of crystallin-mediated stress responses. Exfoliative-like irides from mice with a Lyst mutation showed overall evidence of involvement of genes that influence immune system processes, lytic vacuoles, and lysosomes. These findings have several biologically relevant implications, particularly with respect to secondary forms of glaucoma, and represent a useful resource as a hypothesis-generating dataset. PMID:20739468
Nguewa, Paul A; Agorreta, Jackeline; Blanco, David; Lozano, Maria Dolores; Gomez-Roman, Javier; Sanchez, Blas A; Valles, Iñaki; Pajares, Maria J; Pio, Ruben; Rodriguez, Maria Jose; Montuenga, Luis M; Calvo, Alfonso
2008-01-01
Background The accurate normalization of differentially expressed genes in lung cancer is essential for the identification of novel therapeutic targets and biomarkers by real time RT-PCR and microarrays. Although classical "housekeeping" genes, such as GAPDH, HPRT1, and beta-actin have been widely used in the past, their accuracy as reference genes for lung tissues has not been proven. Results We have conducted a thorough analysis of a panel of 16 candidate reference genes for lung specimens and lung cell lines. Gene expression was measured by quantitative real time RT-PCR, and expression stability was analyzed with the software packages GeNorm and NormFinder, the mean of |ΔCt| (= |Ct normal - Ct tumor|) ± SEM, and correlation coefficients among genes. Systematic comparison between candidates led us to the identification of a subset of suitable reference genes for clinical samples: IPO8, ACTB, POLR2A, 18S, and PPIA. Further analysis showed that IPO8 had a very low mean of |ΔCt| (0.70 ± 0.09), with no statistically significant differences between normal and malignant samples and with excellent expression stability. Conclusion Our data show that IPO8 is the most accurate reference gene for clinical lung specimens. In addition, we demonstrate that the commonly used genes GAPDH and HPRT1 are inappropriate to normalize data derived from lung biopsies, although they are suitable as reference genes for lung cell lines. We thus propose IPO8 as a novel reference gene for lung cancer samples. PMID:19014639
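The stability measure quoted in this abstract, the mean of |ΔCt| ± SEM with |ΔCt| = |Ct normal - Ct tumor|, can be sketched as follows. This is an illustration under stated assumptions rather than the authors' code: the function name is hypothetical, and the normal and tumor Ct values are assumed to be paired by patient, which the abstract does not state explicitly.

```python
import math
import statistics


def mean_abs_delta_ct(ct_normal, ct_tumor):
    """Return (mean, SEM) of |Ct_normal - Ct_tumor| over paired samples.

    ct_normal, ct_tumor: equal-length sequences of Ct values for one
    candidate reference gene, paired by sample (assumed pairing).
    """
    if len(ct_normal) != len(ct_tumor):
        raise ValueError("normal and tumor lists must have equal length")
    if len(ct_normal) < 2:
        raise ValueError("at least two pairs are needed for an SEM")

    diffs = [abs(n - t) for n, t in zip(ct_normal, ct_tumor)]
    mean = statistics.mean(diffs)
    # SEM = sample standard deviation / sqrt(number of pairs)
    sem = statistics.stdev(diffs) / math.sqrt(len(diffs))
    return mean, sem
```

Under this measure, a gene whose Ct barely shifts between normal and tumor tissue gives a mean |ΔCt| close to 0, which is the property reported for IPO8 (0.70 ± 0.09).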
Choi, Y; Lim, SY; Jeong, HS; Koo, KA; Sung, SH; Kim, YC
2009-01-01
Background and purpose: We conducted a genome wide gene expression analysis to explore the biological aspects of 15-methoxypinusolidic acid (15-MPA) isolated from Biota orientalis and tried to confirm the suitability of 15-MPA as a therapeutic candidate for CNS injuries focusing on microglia. Experimental approach: Murine microglial BV2 cells were treated with 15-MPA, and their transcriptome was analysed by using oligonucleotide microarrays. Genes differentially expressed upon 15-MPA treatment were selected for RT-PCR (reverse transcription-polymerase chain reaction) analysis to confirm the gene expression. Inhibition of cell proliferation and induction of apoptosis by 15-MPA were examined by bromodeoxyuridine assay, Western blot analysis of poly-ADP-ribose polymerase and flow cytometry. Key results: A total of 514 genes were differentially expressed by 15-MPA treatment. Biological pathway analysis revealed that 15-MPA induced significant changes in expression of genes in the cell cycle pathway. Genes involved in growth arrest and DNA damage [gadd45α, gadd45γ and ddit3 (DNA damage-inducible transcript 3)] and cyclin-dependent kinase inhibitor (cdkn2b) were up-regulated, whereas genes involved in cell cycle progression (ccnd1, ccnd3 and ccne1), DNA replication (mcm4, orc1l and cdc6) and cell proliferation (fos and jun) were down-regulated. RT-PCR analysis for representative genes confirmed the expression levels. 15-MPA significantly reduced bromodeoxyuridine incorporation, increased poly-ADP-ribose polymerase cleavage and the number of apoptotic cells, indicating that 15-MPA induces apoptosis in BV2 cells. Conclusion and implications: 15-MPA induced apoptosis in murine microglial cells, presumably via inhibition of the cell cycle progression. As microglial activation is detrimental in CNS injuries, these data suggest a strong therapeutic potential of 15-MPA. PMID:19466985
Lin, Huapeng; Zhang, Qian; Li, Xiaocheng; Wu, Yushen; Liu, Ye; Hu, Yingchun
2018-01-01
Hepatitis B virus-associated acute liver failure (HBV-ALF) is a rare but life-threatening syndrome that carries high morbidity and mortality. Our study aimed to explore the possible molecular mechanisms of HBV-ALF by means of bioinformatics analysis. Gene expression microarray datasets of HBV-ALF were collected from Gene Expression Omnibus, and differentially expressed genes (DEGs) were identified with the limma package in R. After functional enrichment analysis, we constructed a protein–protein interaction (PPI) network with the Search Tool for the Retrieval of Interacting Genes online database and a weighted gene coexpression network with the WGCNA package in R. Subsequently, we picked out the hub genes among the DEGs. A total of 423 DEGs, 198 upregulated and 225 downregulated, were identified between HBV-ALF and normal samples. The upregulated genes were mainly enriched in immune response, and the downregulated genes were mainly enriched in complement and coagulation cascades. Orosomucoid 1 (ORM1), orosomucoid 2 (ORM2), plasminogen (PLG), and aldehyde oxidase 1 (AOX1) were picked out as the hub genes, having a high degree in both the PPI network and the weighted gene coexpression network. The weighted gene coexpression network analysis found that 3 of the 5 modules in which upregulated genes were enriched were closely related to the immune system. The downregulated genes were enriched in only one module, and the genes in this module were mainly enriched in the complement and coagulation cascades pathway. In conclusion, 4 genes (ORM1, ORM2, PLG, and AOX1), together with the immune response and the complement and coagulation cascades pathway, may take part in the pathogenesis of HBV-ALF, and these candidate genes and pathways could be therapeutic targets for HBV-ALF. PMID:29384847
A microarray analysis of retinal transcripts that are controlled by image contrast in mice
Brand, Christine; Schaeffel, Frank
2007-01-01
Purpose The development of myopia is controlled by still largely unknown retinal signals. The aim of this study was to investigate the changes in retinal mRNA expression after different periods of visual deprivation in mice, while controlling for retinal illuminance. Methods Each group consisted of three male C57BL/6 mice. Treatment periods were 30 min, 4 h, and 6+6 h. High spatial frequencies were filtered from the retinal image by frosted diffusers over one eye while the fellow eyes were covered by clear neutral density (ND) filters that exhibited similar light attenuating properties (0.1 log units) as the diffusers. For the final 30 min of the respective treatment period mice were individually placed in a clear Perspex cylinder that was positioned in the center of a rotating (60 degrees) large drum. The inside of the drum was covered with a 0.1 cyc/degree vertical square wave grating. This visual environment was chosen to standardize illuminances and contrasts seen by the mice. Labeled cRNA was prepared and hybridized to Affymetrix GeneChip® Mouse Genome 430 2.0 arrays. Alterations in mRNA expression levels of candidate genes with potential biological relevance were confirmed by semi-quantitative real-time reverse transcription polymerase chain reaction (RT-PCR). Results In all groups, Egr-1 mRNA expression was reduced in diffuser-treated eyes. Furthermore, the degradation of the spatial frequency spectrum also changed the cFos mRNA level, with reduced expression after 4 h of diffuser treatment. Other interesting candidates were Akt2, which was up-regulated after 30 min of deprivation and Mapk8ip3, a neuron specific JNK binding and scaffolding protein that was temporally regulated in the diffuser-treated eyes only. Conclusions The microarray analysis demonstrated a pattern of differential transcriptional changes, even though differences in the retinal images were restricted to spatial features. The candidate genes may provide further insight into the biochemical short-term changes following retinal image degradation in mice. Because deprivation of spatial vision leads to increased eye growth and myopia in both animals and humans, it is believed some of the identified genes play a role in myopia development. PMID:17653032
Vadigepalli, Rajanikanth; Chakravarthula, Praveen; Zak, Daniel E; Schwaber, James S; Gonye, Gregory E
2003-01-01
We have developed a bioinformatics tool named PAINT that automates the promoter analysis of a given set of genes for the presence of transcription factor binding sites. Based on coincidence of regulatory sites, this tool produces an interaction matrix that represents a candidate transcriptional regulatory network. This tool currently consists of (1) a database of promoter sequences of known or predicted genes in the Ensembl annotated mouse genome database, (2) various modules that can retrieve and process the promoter sequences for binding sites of known transcription factors, and (3) modules for visualization and analysis of the resulting set of candidate network connections. This information provides a substantially pruned list of genes and transcription factors that can be examined in detail in further experimental studies on gene regulation. Also, the candidate network can be incorporated into network identification methods in the form of constraints on feasible structures in order to render the algorithms tractable for large-scale systems. The tool can also produce output in various formats suitable for use in external visualization and analysis software. In this manuscript, PAINT is demonstrated in two case studies involving analysis of differentially regulated genes chosen from two microarray data sets. The first set is from a neuroblastoma N1E-115 cell differentiation experiment, and the second set is from neuroblastoma N1E-115 cells at different time intervals following exposure to neuropeptide angiotensin II. PAINT is available for use as an agent in BioSPICE simulation and analysis framework (www.biospice.org), and can also be accessed via a WWW interface at www.dbi.tju.edu/dbi/tools/paint/.
Parameter estimation in tree graph metabolic networks.
Astola, Laura; Stigter, Hans; Gomez Roldan, Maria Victoria; van Eeuwijk, Fred; Hall, Robert D; Groenenboom, Marian; Molenaar, Jaap J
2016-01-01
We study the glycosylation processes that convert initially toxic substrates to nutritionally valuable metabolites in the flavonoid biosynthesis pathway of tomato (Solanum lycopersicum) seedlings. To estimate the reaction rates we use ordinary differential equations (ODEs) to model the enzyme kinetics. A popular choice is to use a system of linear ODEs with constant kinetic rates or to use Michaelis-Menten kinetics. In reality, the catalytic rates, which are affected by kinetic constants and enzyme concentrations among other factors, change over time, and the approaches just mentioned cannot describe this phenomenon. Another problem is that, in general, these kinetic coefficients are not always identifiable. A third problem is that it is not precisely known which enzymes are catalyzing the observed glycosylation processes. With several hundred potential gene candidates, experimental validation using purified target proteins is expensive and time consuming. We aim at reducing this task via mathematical modeling to allow for the pre-selection of the most promising gene candidates. In this article we discuss a fast and relatively simple approach to estimate time varying kinetic rates, with three favorable properties: firstly, it allows for identifiable estimation of time dependent parameters in networks with a tree-like structure. Secondly, it is relatively fast compared to usually applied methods that estimate the model derivatives together with the network parameters. Thirdly, by combining the metabolite concentration data with corresponding microarray data, it can help in detecting the genes related to the enzymatic processes. By comparing the estimated time dynamics of the catalytic rates with time series gene expression data we may assess potential candidate genes behind enzymatic reactions. As an example, we show how to apply this method to select prominent glycosyltransferase genes in tomato seedlings.
Kresse, Stine H; Berner, Jeanne-Marie; Meza-Zepeda, Leonardo A; Gregory, Simon G; Kuo, Wen-Lin; Gray, Joe W; Forus, Anne; Myklebost, Ola
2005-11-07
Amplification of the q21-q23 region on chromosome 1 is frequently found in sarcomas and a variety of other solid tumours. Previous analyses of sarcomas have indicated the presence of at least two separate amplicons within this region, one located in 1q21 and one located near the apolipoprotein A-II (APOA2) gene in 1q23. In this study we have mapped and characterized the amplicon in 1q23 in more detail. We have used fluorescence in situ hybridisation (FISH) and microarray-based comparative genomic hybridisation (array CGH) to map and define the borders of the amplicon in 10 sarcomas. A subregion of approximately 800 kb was identified as the core of the amplicon. The amplification patterns of nine possible candidate target genes located to this subregion were determined by Southern blot analysis. The genes activating transcription factor 6 (ATF6) and dual specificity phosphatase 12 (DUSP12) showed the highest level of amplification, and they were also shown to be over-expressed by quantitative real-time reverse transcription PCR (RT-PCR). In general, the level of expression reflected the level of amplification in the different tumours. DUSP12 was expressed significantly higher than ATF6 in a subset of the tumours. In addition, two genes known to be transcriptionally activated by ATF6, glucose-regulated protein 78 kDa and -94 kDa (GRP78 and GRP94), were shown to be over-expressed in the tumours that showed over-expression of ATF6. ATF6 and DUSP12 seem to be the most likely candidate target genes for the 1q23 amplification in sarcomas. Both genes have possible roles in promoting cell growth, which makes them interesting candidate targets.
Johnsson, Martin; Jonsson, Kenneth B; Andersson, Leif; Jensen, Per; Wright, Dominic
2015-05-01
Birds have a unique bone physiology, due to the demands placed on them through egg production. In particular their medullary bone serves as a source of calcium for eggshell production during lay and undergoes continuous and rapid remodelling. We take advantage of the fact that bone traits have diverged massively during chicken domestication to map the genetic basis of bone metabolism in the chicken. We performed a quantitative trait locus (QTL) and expression QTL (eQTL) mapping study in an advanced intercross based on Red Junglefowl (the wild progenitor of the modern domestic chicken) and White Leghorn chickens. We measured femoral bone traits in 456 chickens by peripheral computerised tomography and femoral gene expression in a subset of 125 females from the cross with microarrays. This resulted in 25 loci for female bone traits, 26 loci for male bone traits and 6318 local eQTL loci. We then overlapped bone and gene expression loci, before checking for an association between gene expression and trait values to identify candidate quantitative trait genes for bone traits. A handful of our candidates have been previously associated with bone traits in mice, but our results also implicate unexpected and largely unknown genes in bone metabolism. In summary, by utilising the unique bone metabolism of an avian species, we have identified a number of candidate genes affecting bone allocation and metabolism. These findings can have ramifications not only for the understanding of bone metabolism genetics in general, but could also be used as a potential model for osteoporosis as well as revealing new aspects of vertebrate bone regulation or features that distinguish avian and mammalian bone.
Alonso, Ana; Larraga, Vicente; Alcolea, Pedro J
2018-05-07
The first genome project of any living organism excluding viruses, that of the gammaproteobacterium Haemophilus influenzae, was completed in 1995. Until the last decade, genome sequencing was very tedious because genome survey sequences (GSS) and/or expressed sequence tags (ESTs) belonging to plasmid, cosmid and artificial chromosome genome libraries had to be sequenced and assembled in silico. Even now, no genome is completely assembled, because gaps and unassembled contigs always remain. However, most assemblies represent the whole genome of the organism of origin from a practical point of view. The first genome sequencing projects of trypanosomatid parasites, covering Leishmania major, Trypanosoma cruzi and T. brucei, were completed in 2005 following those strategies. The functional genomics era rapidly developed on the basis of the microarray technology and has been evolving. In the case of the genus Leishmania, substantial biological information about differentiation in the digenetic life cycle of the parasite has been obtained. Later on, next generation sequencing revolutionized genome sequencing and functional genomics, leading to more sensitive and accurate results while using far fewer resources. This new technology is more advantageous, but does not invalidate microarray results. In fact, promising vaccine candidates and drug targets have been found on the basis of microarray-based screening and preliminary proof-of-concept tests. Copyright © 2018. Published by Elsevier B.V.
Bessonov, Kyrylo; Walkey, Christopher J.; Shelp, Barry J.; van Vuuren, Hennie J. J.; Chiu, David; van der Merwe, George
2013-01-01
Analyzing time-course expression data captured in microarray datasets is a complex undertaking as the vast and complex data space is represented by a relatively low number of samples as compared to thousands of available genes. Here, we developed the Interdependent Correlation Clustering (ICC) method to analyze relationships that exist among genes conditioned on the expression of a specific target gene in microarray data. Based on Correlation Clustering, the ICC method analyzes a large set of correlation values related to gene expression profiles extracted from given microarray datasets. ICC can be applied to any microarray dataset and any target gene. We applied this method to microarray data generated from wine fermentations and selected NSF1, which encodes a C2H2 zinc finger-type transcription factor, as the target gene. The validity of the method was verified by accurate identifications of the previously known functional roles of NSF1. In addition, we identified and verified potential new functions for this gene; specifically, NSF1 is a negative regulator for the expression of sulfur metabolism genes, the nuclear localization of Nsf1 protein (Nsf1p) is controlled in a sulfur-dependent manner, and the transcription of NSF1 is regulated by Met4p, an important transcriptional activator of sulfur metabolism genes. The inter-disciplinary approach adopted here highlighted the accuracy and relevancy of the ICC method in mining for novel gene functions using complex microarray datasets with a limited number of samples. PMID:24130853
Perdiguero, Pedro; Collada, Carmen; Barbero, María Del Carmen; García Casado, Gloria; Cervera, María Teresa; Soto, Alvaro
2012-01-01
Climate change is a major challenge particularly for forest tree species, which will have to face the severe alterations of environmental conditions with their current genetic pool. Thus, an understanding of their adaptive responses is of the utmost interest. In this work we have selected Pinus pinaster as a model species. This pine is one of the most important conifers (for which molecular tools and knowledge are far more scarce than for angiosperms) in the Mediterranean Basin, which is characterised in all foreseen scenarios as one of the regions most drastically affected by climate change, mainly because of increasing temperature and, particularly, by increasing drought. We have induced a controlled, increasing water stress by adding PEG to a hydroponic culture. We have generated a subtractive library, with the aim of identifying the genes induced by this stress and have searched for the most reliable expressional candidate genes, based on their overexpression during water stress, as revealed by microarray analysis and confirmed by RT-PCR. We have selected a set of 67 candidate genes belonging to different functional groups that will be useful molecular tools for further studies on drought stress responses, adaptation, and population genomics in conifers, as well as in breeding programs. Copyright © 2011 Elsevier Masson SAS. All rights reserved.
Li, Xiang; Harwood, Valerie J.; Nayak, Bina
2016-01-01
Pathogen identification and microbial source tracking (MST) to identify sources of fecal pollution improve evaluation of water quality. They contribute to improved assessment of human health risks and remediation of pollution sources. An MST microarray was used to simultaneously detect genes for multiple pathogens and indicators of fecal pollution in freshwater, marine water, sewage-contaminated freshwater and marine water, and treated wastewater. Dead-end ultrafiltration (DEUF) was used to concentrate organisms from water samples, yielding a recovery efficiency of >95% for Escherichia coli and human polyomavirus. Whole-genome amplification (WGA) increased gene copies from ultrafiltered samples and increased the sensitivity of the microarray. Viruses (adenovirus, bocavirus, hepatitis A virus, and human polyomaviruses) were detected in sewage-contaminated samples. Pathogens such as Legionella pneumophila, Shigella flexneri, and Campylobacter fetus were detected along with genes conferring resistance to aminoglycosides, beta-lactams, and tetracycline. Nonmetric dimensional analysis of MST marker genes grouped sewage-spiked freshwater and marine samples with sewage and apart from other fecal sources. The sensitivity (percent true positives) of the microarray probes for gene targets anticipated in sewage was 51 to 57% and was lower than the specificity (percent true negatives; 79 to 81%). A linear relationship between gene copies determined by quantitative PCR and microarray fluorescence was found, indicating the semiquantitative nature of the MST microarray. These results indicate that ultrafiltration coupled with WGA provides sufficient nucleic acids for detection of viruses, bacteria, protozoa, and antibiotic resistance genes by the microarray in applications ranging from beach monitoring to risk assessment. PMID:26729716
Identifying gene networks underlying the neurobiology of ethanol and alcoholism.
Wolen, Aaron R; Miles, Michael F
2012-01-01
For complex disorders such as alcoholism, identifying the genes linked to these diseases and their specific roles is difficult. Traditional genetic approaches, such as genetic association studies (including genome-wide association studies) and analyses of quantitative trait loci (QTLs) in both humans and laboratory animals already have helped identify some candidate genes. However, because of technical obstacles, such as the small impact of any individual gene, these approaches only have limited effectiveness in identifying specific genes that contribute to complex diseases. The emerging field of systems biology, which allows for analyses of entire gene networks, may help researchers better elucidate the genetic basis of alcoholism, both in humans and in animal models. Such networks can be identified using approaches such as high-throughput molecular profiling (e.g., through microarray-based gene expression analyses) or strategies referred to as genetical genomics, such as the mapping of expression QTLs (eQTLs). Characterization of gene networks can shed light on the biological pathways underlying complex traits and provide the functional context for identifying those genes that contribute to disease development.
Abou Assi, Hala; Gómez-Pinto, Irene; González, Carlos
2017-01-01
In situ fabricated nucleic acids microarrays are versatile and very high-throughput platforms for aptamer optimization and discovery, but the chemical space that can be probed against a given target has largely been confined to DNA, while RNA and non-natural nucleic acid microarrays are still an essentially uncharted territory. 2′-Fluoroarabinonucleic acid (2′F-ANA) is a prime candidate for such use in microarrays. Indeed, 2′F-ANA chemistry is readily amenable to photolithographic microarray synthesis and its potential in high affinity aptamers has been recently discovered. We thus synthesized the first microarrays containing 2′F-ANA and 2′F-ANA/DNA chimeric sequences to fully map the binding affinity landscape of the TBA1 thrombin-binding G-quadruplex aptamer containing all 32 768 possible DNA-to-2′F-ANA mutations. The resulting microarray was screened against thrombin to identify a series of promising 2′F-ANA-modified aptamer candidates with Kds significantly lower than that of the unmodified control and which were found to adopt highly stable, antiparallel-folded G-quadruplex structures. The solution structure of the TBA1 aptamer modified with 2′F-ANA at position T3 shows that fluorine substitution preorganizes the dinucleotide loop into the proper conformation for interaction with thrombin. Overall, our work strengthens the potential of 2′F-ANA in aptamer research and further expands non-genomic applications of nucleic acids microarrays. PMID:28100695
Challenges of microarray applications for microbial detection and gene expression profiling in food
USDA-ARS's Scientific Manuscript database
Microarray technology represents one of the latest advances in molecular biology. The diverse types of microarrays have been applied to clinical and environmental microbiology, microbial ecology, and in human, veterinary, and plant diagnostics. Since multiple genes can be analyzed simultaneously, ...
Shaw, Joseph R; Colbourne, John K; Davey, Jennifer C; Glaholt, Stephen P; Hampton, Thomas H; Chen, Celia Y; Folt, Carol L; Hamilton, Joshua W
2007-12-21
Genomic research tools such as microarrays are proving to be important resources to study the complex regulation of genes that respond to environmental perturbations. A first generation cDNA microarray was developed for the environmental indicator species Daphnia pulex, to identify genes whose regulation is modulated following exposure to the metal stressor cadmium. Our experiments revealed interesting changes in gene transcription that suggest their biological roles and their potentially toxicological features in responding to this important environmental contaminant. Our microarray identified genes reported in the literature to be regulated in response to cadmium exposure, suggested functional attributes for genes that share no sequence similarity to proteins in the public databases, and pointed to genes that are likely members of expanded gene families in the Daphnia genome. Genes identified on the microarray also were associated with cadmium induced phenotypes and population-level outcomes that we experimentally determined. A subset of genes regulated in response to cadmium exposure was independently validated using quantitative-realtime (Q-RT)-PCR. These microarray studies led to the discovery of three genes coding for the metal detoxication protein metallothionein (MT). The gene structures and predicted translated sequences of D. pulex MTs clearly place them in this gene family. Yet, they share little homology with previously characterized MTs. The genomic information obtained from this study represents an important first step in characterizing microarray patterns that may be diagnostic to specific environmental contaminants and give insights into their toxicological mechanisms, while also providing a practical tool for evolutionary, ecological, and toxicological functional gene discovery studies. Advances in Daphnia genomics will enable the further development of this species as a model organism for the environmental sciences.
PMID:18154678
The use of open source bioinformatics tools to dissect transcriptomic data.
Nitsche, Benjamin M; Ram, Arthur F J; Meyer, Vera
2012-01-01
Microarrays are a valuable technology to study fungal physiology on a transcriptomic level. Various microarray platforms are available, comprising both single- and two-channel arrays. Despite different technologies, preprocessing of microarray data generally includes quality control, background correction, normalization, and summarization of probe-level data. Subsequently, depending on the experimental design, diverse statistical analyses can be performed, including the identification of differentially expressed genes and the construction of gene coexpression networks. We describe how Bioconductor, a collection of open source and open development packages for the statistical programming language R, can be used for dissecting microarray data. We provide fundamental details that facilitate the process of getting started with R and Bioconductor. Using two publicly available microarray datasets from Aspergillus niger, we give detailed protocols on how to identify differentially expressed genes and how to construct gene coexpression networks.
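The normalization step named in the abstract above can be illustrated with a minimal sketch. The abstract does not say which normalization method its protocols use, and the protocols themselves are written for R and Bioconductor; the Python function below is only an illustration of quantile normalization, one widely used choice, and the name quantile_normalize is hypothetical.

```python
def quantile_normalize(columns):
    """Quantile-normalize a list of arrays (one list of intensities per array).

    Assumption: quantile normalization is shown only as an example of the
    normalization step; the source abstract does not name a specific method.
    Ties within an array are broken by position rather than averaged.
    """
    n = len(columns[0])
    sorted_cols = [sorted(col) for col in columns]
    # Mean intensity at each rank across all arrays.
    rank_means = [sum(col[r] for col in sorted_cols) / len(columns) for r in range(n)]
    normalized = []
    for col in columns:
        # Indices of this array's probes, ordered from lowest to highest intensity.
        order = sorted(range(n), key=lambda i: col[i])
        out = [0.0] * n
        for rank, idx in enumerate(order):
            out[idx] = rank_means[rank]
        normalized.append(out)
    return normalized
```

After this step every array shares the same set of intensity values, so differences between arrays reflect only the ranking of probes within each array.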
Bikel, Shirley; Jacobo-Albavera, Leonor; Sánchez-Muñoz, Fausto; Cornejo-Granados, Fernanda; Canizales-Quinteros, Samuel; Soberón, Xavier; Sotelo-Mundo, Rogerio R.; del Río-Navarro, Blanca E.; Mendoza-Vargas, Alfredo; Sánchez, Filiberto
2017-01-01
Background In spite of the emergence of RNA sequencing (RNA-seq), microarrays remain in widespread use for gene expression analysis in the clinic. There are over 767,000 RNA microarrays from human samples in public repositories, which are an invaluable resource for biomedical research and personalized medicine. Absolute gene expression analysis allows the transcriptome profiling of all expressed genes under a specific biological condition without the need for a reference sample. However, background fluorescence makes it difficult to determine absolute gene expression in microarrays. Given that the Y chromosome is absent in female subjects, we used it as a new approach for absolute gene expression analysis in which the fluorescence of the Y chromosome genes of female subjects was used as the background fluorescence for all the probes in the microarray. This fluorescence was used to establish an absolute gene expression threshold, allowing the differentiation between expressed and non-expressed genes in microarrays. Methods We extracted RNA from leukocyte samples of 16 children (nine males and seven females, ages 6–10 years). An Affymetrix Gene Chip Human Gene 1.0 ST Array was run for each sample and the fluorescence of 124 genes of the Y chromosome was used to calculate the absolute gene expression threshold. After that, several expressed and non-expressed genes according to our absolute gene expression threshold were compared against the expression obtained using real-time quantitative polymerase chain reaction (RT-qPCR). Results From the 124 genes of the Y chromosome, three genes (DDX3Y, TXLNG2P and EIF1AY) that displayed significant differences between sexes were used to calculate the absolute gene expression threshold. Using this threshold, we selected 13 expressed and non-expressed genes and confirmed their expression level by RT-qPCR. 
Then, we selected the top 5% most expressed genes and found that several KEGG pathways were significantly enriched. Interestingly, these pathways were related to the typical functions of leukocytes, such as antigen processing and presentation and natural killer cell mediated cytotoxicity. We also applied this method to obtain the absolute gene expression threshold in already published microarray data of liver cells, where the top 5% expressed genes showed an enrichment of typical KEGG pathways for liver cells. Our results suggest that the three selected genes of the Y chromosome can be used to calculate an absolute gene expression threshold, allowing a transcriptome profiling of microarray data without the need for an additional reference experiment. Discussion Our approach, based on establishing a threshold for absolute gene expression analysis, offers a new way to analyze thousands of microarrays from public databases. It allows the study of different human diseases without the need for additional samples for relative expression experiments. PMID:29230367
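The thresholding idea in the abstract above can be sketched as follows. The abstract does not give the formula used to turn the female Y-chromosome probe fluorescence into a threshold, so the rule below (mean plus k standard deviations) is an assumption made purely for illustration, as are the function names and the parameter k.

```python
from statistics import mean, stdev

def absolute_expression_threshold(female_y_intensities, k=2.0):
    """Estimate a background threshold from Y-gene probes in female samples.

    female_y_intensities maps a Y-chromosome gene name to the intensities
    measured for it in female samples, where no true signal is expected.
    Assumption: threshold = mean + k * SD of those background values; the
    source abstract does not state the actual formula.
    """
    values = [v for readings in female_y_intensities.values() for v in readings]
    return mean(values) + k * stdev(values)

def call_expressed(sample_intensities, threshold):
    """Label each gene as expressed (True) if its intensity exceeds the threshold."""
    return {gene: intensity > threshold for gene, intensity in sample_intensities.items()}
```

Because the threshold comes from the arrays themselves, no separate reference sample is needed, which is the property the abstract emphasizes.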
Zebrafish knockout of Down syndrome gene, DYRK1A, shows social impairments relevant to autism.
Kim, Oc-Hee; Cho, Hyun-Ju; Han, Enna; Hong, Ted Inpyo; Ariyasiri, Krishan; Choi, Jung-Hwa; Hwang, Kyu-Seok; Jeong, Yun-Mi; Yang, Se-Yeol; Yu, Kweon; Park, Doo-Sang; Oh, Hyun-Woo; Davis, Erica E; Schwartz, Charles E; Lee, Jeong-Soo; Kim, Hyung-Goo; Kim, Cheol-Hee
2017-01-01
DYRK1A maps to the Down syndrome critical region at 21q22. Mutations in this kinase-encoding gene have been reported to cause microcephaly associated with either intellectual disability or autism in humans. Intellectual disability accompanied by microcephaly was recapitulated in a murine model by overexpressing Dyrk1a which mimicked Down syndrome phenotypes. However, given embryonic lethality in homozygous knockout (KO) mice, no murine model studies could present sufficient evidence to link Dyrk1a dysfunction with autism. To understand the molecular mechanisms underlying microcephaly and autism spectrum disorders (ASD), we established an in vivo dyrk1aa KO model using zebrafish. We identified a patient with a mutation in the DYRK1A gene using microarray analysis. Circumventing the barrier of murine model studies, we generated a dyrk1aa KO zebrafish using transcription activator-like effector nuclease (TALEN)-mediated genome editing. For social behavioral tests, we have established a social interaction test, shoaling assay, and group behavior assay. For molecular analysis, we examined the neuronal activity in specific brain regions of dyrk1aa KO zebrafish through in situ hybridization with various probes including c-fos and crh which are the molecular markers for stress response. Microarray detected an intragenic microdeletion of DYRK1A in an individual with microcephaly and autism. From behavioral tests of social interaction and group behavior, dyrk1aa KO zebrafish exhibited social impairments that reproduce human phenotypes of autism in a vertebrate animal model. Social impairment in dyrk1aa KO zebrafish was further confirmed by molecular analysis of c-fos and crh expression. Transcriptional expression of c-fos and crh was lower than that of wild type fish in specific hypothalamic regions, suggesting that KO fish brains are less activated by social context. 
In this study, we established a zebrafish model to validate a candidate gene for autism in a vertebrate animal. These results illustrate the functional deficiency of DYRK1A as an underlying disease mechanism for autism. We also propose simple social behavioral assays as a tool for the broader study of autism candidate genes.
Hybrid genetic algorithm-neural network: feature extraction for unpreprocessed microarray data.
Tong, Dong Ling; Schierz, Amanda C
2011-09-01
Suitable techniques for microarray analysis have been widely researched, particularly for the study of marker genes specific to a particular type of cancer. Most of the machine learning methods that have been applied to significant gene selection focus on the classification ability rather than the selection ability of the method. These methods also require the microarray data to be preprocessed before analysis takes place. The objective of this study is to develop a hybrid genetic algorithm-neural network (GANN) model that emphasises feature selection and can operate on unpreprocessed microarray data. The GANN is a hybrid model where the fitness value of the genetic algorithm (GA) is based upon the number of samples correctly labelled by a standard feedforward artificial neural network (ANN). The model is evaluated by using two benchmark microarray datasets with different array platforms and differing numbers of classes (a 2-class oligonucleotide microarray dataset for acute leukaemia and a 4-class complementary DNA (cDNA) microarray dataset for SRBCTs (small round blue cell tumours)). The underlying concept of the GANN algorithm is to select highly informative genes by co-evolving both the GA fitness function and the ANN weights at the same time. The novel GANN selected approximately 50% of the same genes as the original studies. This may indicate that these common genes are more biologically significant than other genes in the datasets. The remaining 50% of the significant genes identified were used to build predictive models and, for both datasets, the models based on the set of genes extracted by the GANN method produced more accurate results. The results also suggest that the GANN method not only can detect genes that are exclusively associated with a single cancer type but can also explore the genes that are differentially expressed in multiple cancer types. 
The results show that the GANN model has successfully extracted statistically significant genes from the unpreprocessed microarray data as well as extracting known biologically significant genes. We also show that assessing the biological significance of genes based on classification accuracy may be misleading and, though the GANN's set of extra genes proves to be more statistically significant than those selected by other methods, a biological assessment of these genes is highly recommended to confirm their functionality. Copyright © 2011 Elsevier B.V. All rights reserved.
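The central mechanism described above, a genetic algorithm whose fitness is the number of samples a classifier labels correctly, can be sketched in a few lines. This is not the authors' GANN: a leave-one-out nearest-centroid classifier is swapped in for the feedforward ANN to keep the example short, the ANN weights are therefore not co-evolved, and all names and parameter values are hypothetical.

```python
import random

def centroid_hits(X, y, genes):
    """Fitness: count samples labelled correctly using only the selected genes.

    Assumption: a leave-one-out nearest-centroid classifier stands in for the
    feedforward ANN described in the abstract.
    """
    hits = 0
    for i in range(len(X)):
        centroids = {}
        for label in sorted(set(y)):
            rows = [X[j] for j in range(len(X)) if j != i and y[j] == label]
            if rows:
                centroids[label] = [sum(r[g] for r in rows) / len(rows) for g in genes]
        point = [X[i][g] for g in genes]
        predicted = min(
            centroids,
            key=lambda lab: sum((p - c) ** 2 for p, c in zip(point, centroids[lab])),
        )
        hits += predicted == y[i]
    return hits

def select_genes(X, y, subset_size=2, population=20, generations=30, seed=0):
    """Evolve gene subsets; return the best subset and its fitness."""
    rng = random.Random(seed)
    n_genes = len(X[0])
    fitness = lambda genes: centroid_hits(X, y, genes)
    pop = [rng.sample(range(n_genes), subset_size) for _ in range(population)]
    for _ in range(generations):
        pop.sort(key=fitness, reverse=True)
        parents = pop[: population // 2]  # keep the fitter half unchanged
        children = []
        while len(parents) + len(children) < population:
            a, b = rng.sample(parents, 2)
            # Crossover: draw the child's genes from the union of both parents.
            child = rng.sample(sorted(set(a) | set(b)), subset_size)
            if rng.random() < 0.2:
                # Mutation: swap one gene for a gene not already in the child.
                unused = [g for g in range(n_genes) if g not in child]
                if unused:
                    child[rng.randrange(subset_size)] = rng.choice(unused)
            children.append(child)
        pop = parents + children
    best = max(pop, key=fitness)
    return sorted(best), fitness(best)
```

Scoring subsets by classification hits rewards classification ability, which is why the abstract cautions that biological significance still needs separate assessment.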
iSyTE 2.0: a database for expression-based gene discovery in the eye
Kakrana, Atul; Yang, Andrian; Anand, Deepti; Djordjevic, Djordje; Ramachandruni, Deepti; Singh, Abhyudai; Huang, Hongzhan
2018-01-01
Although successful in identifying new cataract-linked genes, the previous version of the database iSyTE (integrated Systems Tool for Eye gene discovery) was based on expression information on just three mouse lens stages and was functionally limited to visualization by only UCSC-Genome Browser tracks. To increase its efficacy, here we provide an enhanced iSyTE version 2.0 (URL: http://research.bioinformatics.udel.edu/iSyTE) based on well-curated, comprehensive genome-level lens expression data as a one-stop portal for the effective visualization and analysis of candidate genes in lens development and disease. iSyTE 2.0 includes all publicly available lens Affymetrix and Illumina microarray datasets representing a broad range of embryonic and postnatal stages from wild-type and specific gene-perturbation mouse mutants with eye defects. Further, we developed a new user-friendly web interface for direct access and cogent visualization of the curated expression data, which supports convenient searches and a range of downstream analyses. The utility of these new iSyTE 2.0 features is illustrated through examples of established genes associated with lens development and pathobiology, which serve as tutorials for its application by the end-user. iSyTE 2.0 will facilitate the prioritization of eye development and disease-linked candidate genes in studies involving transcriptomics or next-generation sequencing data, linkage analysis and GWAS approaches. PMID:29036527
Over the last decade, the introduction of microarray technology has had a profound impact on gene expression research. The publication of studies with dissimilar or altogether contradictory results, obtained using different microarray platforms to analyze identical RNA samples, ...
Wang, Yuwei; Yang, Chun; He, Yonglin; Zhan, Xingxing; Xu, Lei
2016-08-01
Tuberculosis is a major challenge to global public health. However, Bacille Calmette‑Guérin (BCG), the only vaccine available against tuberculosis, has been questioned for its low protective effect. The present study used the mouse intracellular pathogen resistance 1 (Ipr1) gene to modify the current BCG vaccine and evaluated its immune effect against tuberculosis. This study also investigated the intrinsic relationship between Ipr1 and innate immunity. A modified BCG (BCGi) carrying the Ipr1 gene was constructed. Mice were intranasally challenged with the M. tuberculosis H37Rv strain after vaccination with BCGi. Protective efficacy of the vaccine was assessed by the organ coefficient, bacterial load and pathological changes in the lung. The differential expression of 113 immune‑related genes between the BCGi and BCG groups was detected by an oligo microarray. According to the organ coefficient, bacterial load and pathological changes in the tissue, BCGi was shown to have stronger protective effects against M. tuberculosis than BCG. The oligo microarray and reverse transcription‑quantitative polymerase chain reaction further revealed that the Ipr1 gene could upregulate the expression of 13 genes, including a >3‑fold increase in Toll‑like receptor (TLR)4 and a 10‑fold increase in surfactant protein D (sftpd). The two genes not only participate in innate immunity against pathogens, but also are closely interrelated. Ipr1 could activate the TLR4 and sftpd signaling pathway and improve the innate immunity against tuberculosis; therefore, Ipr1‑modified BCG may be a candidate vaccine against M. tuberculosis.
Henry, Ellen C; Welle, Stephen L; Gasiewicz, Thomas A
2010-03-01
The aryl hydrocarbon receptor (AhR), a ligand-dependent transcription factor, mediates toxicity of several classes of xenobiotics and also has important physiological roles in differentiation, reproduction, and immunity, although the endogenous ligand(s) mediating these functions is/are as yet unidentified. One candidate endogenous ligand, 2-(1'H-indolo-3'-carbonyl)-thiazole-4-carboxylic acid methyl ester (ITE), is a potent AhR agonist in vitro and activates the murine AhR in vivo, but does not induce toxicity. We hypothesized that ITE and the toxic ligand, 2,3,7,8-tetrachlorodibenzo-p-dioxin (TCDD), may modify transcription of different sets of genes to account for their different toxicity. To test this hypothesis, primary mouse lung fibroblasts were exposed to 0.5 µM ITE, 0.2 nM TCDD, or vehicle for 4 h, and total gene expression was evaluated using microarrays. After this short-term and low-dose treatment, several hundred genes were changed significantly, and the response to ITE and TCDD was remarkably similar, both qualitatively and quantitatively. Induced gene sets included the expected battery of AhR-dependent xenobiotic-metabolizing enzymes, as well as several sets that reflect the inflammatory role of lung fibroblasts. Quantitative real-time RT-PCR assays of several selected genes confirmed these microarray data and further suggested that there may be kinetic differences in expression between ligands. These data suggest that ITE and TCDD elicit an analogous change in AhR conformation such that the initial transcription response is the same. Furthermore, if the difference in toxicity between TCDD and ITE is mediated by differences in gene expression, then it is likely that secondary changes enabled by the persistent TCDD, but not by the shorter lived ITE, are responsible.
Nodale, Cristina; Ceccarelli, Simona; Giuliano, Mariateresa; Cammarota, Marcella; D'Amici, Sirio; Vescarelli, Enrica; Maffucci, Diana; Bellati, Filippo; Panici, Pierluigi Benedetti; Romano, Ferdinando; Angeloni, Antonio; Marchese, Cinzia
2014-01-01
Mayer-Rokitansky-Küster-Hauser syndrome (MRKHS) is a rare disease characterized by congenital aplasia of uterus and vagina. Although many studies have investigated several candidate genes, up to now none of them seem to be responsible for the aetiology of the syndrome. In our study, we identified differences in the gene expression profile of in vitro cultured vaginal tissue of MRKHS patients using whole-genome microarray analysis. A group of eight out of sixteen MRKHS patients who underwent reconstruction of neovagina with an autologous in vitro cultured vaginal tissue were subjected to microarray analysis and compared with five healthy controls. Results obtained by array were confirmed by qRT-PCR and further extended to the other eight MRKHS patients. Gene profiling of MRKHS patients delineated 275 differentially expressed genes, of which 133 were downregulated and 142 upregulated. We selected six deregulated genes (MUC1, HOXC8, HOXB2, HOXB5, JAG1 and DLL1) on the basis of their fold change, their differential expression in most patients and their relevant role in embryological development. All patients showed upregulation of MUC1, while HOXB2 and HOXB5 were downregulated, as well as Notch ligands JAG1 and DLL1 in the majority of them. Interestingly, HOXC8 was significantly upregulated in 47% of patients, with a differential expression only in MRKHS type I patients. Taken together, our results highlighted the dysregulation of developmental genes, thus suggesting a potential alteration of networks involved in the formation of the female reproductive tract and providing a useful clue for understanding the pathophysiology of MRKHS.
PMID:24608967
Genes that characterize T3-predominant Graves' thyroid tissues.
Matsumoto, Chisa; Ito, Mitsuru; Yamada, Hiroya; Yamakawa, Noriko; Yoshida, Hiroshi; Date, Arisa; Watanabe, Mikio; Hidaka, Yoh; Iwatani, Yoshinori; Miyauchi, Akira; Takano, Toru
2013-02-01
3,5,3'-Triiodothyronine (T(3))-predominant Graves' disease is characterized by the increasing volume of thyroid goiter resulting in poor prognosis. Although type 1 and type 2 iodothyronine deiodinases (DIO1 and DIO2 respectively) are known to be overexpressed in the thyroid tissues of T(3)-predominant Graves' disease, the pathogenesis of this disease is still unclear. The aim of our study is to identify genes that characterize T(3)-predominant Graves' disease tissue in order to clarify the molecular mechanism of this disease. mRNAs from two thyroid tissues of both typical T(3)-predominant and common-type Graves' disease were analyzed with DNA microarrays with probes for 28 869 genes. Genes identified to be differentially expressed between the two groups were further analyzed in the second and third screenings using 70 Graves' thyroid tissues by real-time quantitative RT-PCR. Twenty-three candidate genes were selected as being differentially expressed in the first screening with microarrays. Among these, seven genes, leucine-rich repeat neuronal 1 (LRRN1), bone morphogenetic protein 8a (BMP8A), N-cadherin (CDH2), phosphodiesterase 1A (PDE1A), creatine kinase mitochondrial 2 (CKMT2), integrin beta-3 (ITGB3), and protein tyrosine phosphatase non-receptor type 4 (PTPN4), were confirmed to be differentially expressed in DIO1 or DIO2 over- and underexpressing Graves' tissues. These genes are related to the characteristics of T(3)-predominant Graves' disease, such as high titer level of serum anti-TSH receptor antibody, high free T(3) to free thyroxine ratio, and a large goiter size. They might play a role in the pathogenesis of T(3)-predominant Graves' disease.
Ingram, Jennifer L; Antao-Menezes, Aurita; Turpin, Elizabeth A; Wallace, Duncan G; Mangum, James B; Pluta, Linda J; Thomas, Russell S; Bonner, James C
2007-01-01
Background Exposure to vanadium pentoxide (V2O5) is a cause of occupational bronchitis. We evaluated gene expression profiles in cultured human lung fibroblasts exposed to V2O5 in vitro in order to identify candidate genes that could play a role in inflammation, fibrosis, and repair during the pathogenesis of V2O5-induced bronchitis. Methods Normal human lung fibroblasts were exposed to V2O5 in a time course experiment. Gene expression was measured at various time points over a 24 hr period using the Affymetrix Human Genome U133A 2.0 Array. Selected genes that were significantly changed in the microarray experiment were validated by RT-PCR. Results V2O5 altered more than 1,400 genes, of which ~300 were induced while >1,100 genes were suppressed. Gene ontology (GO) categories unique to induced genes included inflammatory response and immune response, while GO categories unique to suppressed genes included ubiquitin cycle and cell cycle. A dozen genes were validated by RT-PCR, including growth factors (HBEGF, VEGF, CTGF), chemokines (IL8, CXCL9, CXCL10), oxidative stress response genes (SOD2, PIPOX, OXR1), and DNA-binding proteins (GAS1, STAT1). Conclusion Our study identified a variety of genes that could play pivotal roles in inflammation, fibrosis and repair during V2O5-induced bronchitis. The induction of genes that mediate inflammation and immune responses, as well as suppression of genes involved in growth arrest, appear to be important to the lung fibrotic reaction to V2O5. PMID:17459161
Kirby, Ralph; Herron, Paul; Hoskisson, Paul
2011-02-01
Based on available genome sequences, Actinomycetales show significant gene synteny across a wide range of species and genera. In addition, many genera show varying degrees of complex morphological development. Using the presence of gene synteny as a basis, it is clear that an analysis of gene conservation across the Streptomyces and various other Actinomycetales will provide information on both the importance of genes and gene clusters and the evolution of morphogenesis in these bacteria. Genome sequencing, although becoming cheaper, is still relatively expensive for comparing large numbers of strains. Thus, a heterologous DNA/DNA microarray hybridization dataset based on a Streptomyces coelicolor microarray allows a cheaper and greater depth of analysis of gene conservation. This study, using both bioinformatical and microarray approaches, was able to classify genes previously identified as involved in morphogenesis in Streptomyces into various subgroups in terms of conservation across species and genera. This will allow the targeting of genes for further study based on their importance at the species level and at higher evolutionary levels.
Genes Involved in the Balance between Neuronal Survival and Death during Inflammation
Glezer, Isaias; Chernomoretz, Ariel; David, Samuel; Plante, Marie-Michèle; Rivest, Serge
2007-01-01
Glucocorticoids are potent regulators of the innate immune response, and alteration in this inhibitory feedback has detrimental consequences for the neural tissue. This study profiled and functionally investigated candidate genes mediating this switch between cell survival and death during an acute inflammatory reaction subsequent to the absence of glucocorticoid signaling. Oligonucleotide microarray analysis revealed that following lipopolysaccharide (LPS) intracerebral administration at striatum level, more modulated genes presented transcription impairment than exacerbation upon glucocorticoid receptor blockage. Among impaired genes we identified ceruloplasmin (Cp), which plays a key role in iron metabolism and is implicated in a neurodegenerative disease. Microglial and endothelial induction of Cp is a natural neuroprotective mechanism during inflammation, because Cp-deficient mice exhibited increased iron accumulation and demyelination when exposed to LPS and neurovascular reactivity to pneumococcal meningitis. This study has identified genes that can play a critical role in programming the innate immune response, helping to clarify the mechanisms leading to protection or damage during inflammatory conditions in the CNS. PMID:17375196
Downstream targets of HOXB4 in a cell line model of primitive hematopoietic progenitor cells.
Lee, Han M; Zhang, Hui; Schulz, Vincent; Tuck, David P; Forget, Bernard G
2010-08-05
Enforced expression of the homeobox transcription factor HOXB4 has been shown to enhance hematopoietic stem cell self-renewal and expansion ex vivo and in vivo. To investigate the downstream targets of HOXB4 in hematopoietic progenitor cells, HOXB4 was constitutively overexpressed in the primitive hematopoietic progenitor cell line EML. Two genome-wide analytical techniques were used: RNA expression profiling using microarrays and chromatin immunoprecipitation (ChIP)-chip. RNA expression profiling revealed that 465 gene transcripts were differentially expressed in KLS (c-Kit(+), Lin(-), Sca-1(+))-EML cells that overexpressed HOXB4 (KLS-EML-HOXB4) compared with control KLS-EML cells that were transduced with vector alone. In particular, erythroid-specific gene transcripts were observed to be highly down-regulated in KLS-EML-HOXB4 cells. ChIP-chip analysis revealed that the promoter regions of 1910 genes, such as CD34, Sox4, and B220, were occupied by HOXB4 in KLS-EML-HOXB4 cells. Side-by-side comparison of the ChIP-chip and RNA expression profiling datasets provided correlative information and identified Gp49a and Laptm4b as candidate "stemness-related" genes. Both genes were highly ranked in both dataset lists and have been previously shown to be preferentially expressed in hematopoietic stem cells and down-regulated in mature hematopoietic cells, thus making them attractive candidates for future functional studies in hematopoietic cells.
Lou, Qiaojun; Chen, Liang; Mei, Hanwei; Xu, Kai; Wei, Haibin; Feng, Fangjun; Li, Tiemei; Pang, Xiaomeng; Shi, Caiping; Luo, Lijun; Zhong, Yang
2017-01-01
Drought is the most serious abiotic stress limiting rice production, and deep rooting is the key contributor to drought avoidance. However, the genetic mechanism regulating the development of deep roots is largely unknown. In this study, the transcriptomes of 74 root samples from 37 rice varieties, representing the extreme genotypes of shallow or deep rooting, were surveyed by RNA-seq. The 13,242 differentially expressed genes (DEGs) between deep rooting and shallow rooting varieties (H vs. L) were enriched in the pathways of genetic information processing and metabolism, while the 1,052 DEGs between the deep roots and shallow roots from each of the plants (D vs. S) were significantly enriched in metabolic pathways, especially energy metabolism. Ten quantitative trait transcripts (QTTs) were identified and some were involved in energy metabolism. Forty-nine candidate DEGs were confirmed by qRT-PCR and microarray. Through weighted gene co-expression network analysis (WGCNA), we found 18 hub genes. Surprisingly, all these hub genes were expressed more highly in deep roots than in shallow roots; furthermore, half of them functioned in energy metabolism. We also estimated that ATP production in the deep roots was faster than in shallow roots. Our results provide many reliable candidate genes for improving deep rooting, and highlight for the first time the importance of energy metabolism to the development of deep roots.
PMID:28798764
2012-01-01
Background VDR may be considered a candidate gene potentially related to Idiopathic Scoliosis susceptibility and natural history. The transcriptional profile of VDR mRNA isoforms might be changed in the structural tissues of the scoliotic spine and potentially influence the expression of VDR responsive genes. The purpose of the study was to determine differences in mRNA abundance of VDR isoforms in bone, cartilage and paravertebral muscles between tissues from the curve concavity and convexity and between Juvenile (JIS) and Adolescent (AIS) Idiopathic Scoliosis, and to identify VDR responsive genes differentiating JIS and AIS in paravertebral muscles. Methods In a group of 29 patients with JIS and AIS, specimens of bone, cartilage and paravertebral muscles were harvested on both sides of the curve apex together with peripheral blood samples. Extracted total RNA served as a template for VDRs and VDRl mRNA quantification by QRT PCR. Subsequent microarray analysis of paravertebral muscular tissue samples was performed with HG U133A chips (Affymetrix). Quantitative data were compared by a nonparametric Mann Whitney U test. Microarray results were analyzed with the GeneSpring 11GX application. A matrix plot of normalized log-intensities visualized the degree of differentiation between muscular tissue transcriptomes of the JIS and AIS groups. Fold Change Analysis with a cutoff of Fold Change ≥2 identified differentially expressed VDR responsive genes in paravertebral muscles of JIS and AIS. Results No significant differences in transcript abundance of VDR isoforms between tissues of the curve concavity and convexity were found. A statistically significant difference between the JIS and AIS groups in mRNA abundance of the VDRl isoform was found in paravertebral muscles of the curve concavity. A higher degree of muscular transcriptome differentiation between curve concavity and convexity was visualized in the JIS group. In paravertebral muscles, Tob2 and Med13 were selected as genes differentially expressed in the JIS and AIS groups.
Conclusions In Idiopathic Scolioses, transcriptional activity and alternative splicing of VDR mRNA in osseous, cartilaginous, and paravertebral muscular tissues are tissue specific and equal on both sides of the curve. The number of mRNA copies of the VDRl isoform in concave paravertebral muscles might be one of the factors differentiating JIS and AIS. In paravertebral muscles, the Tob2 and Med13 genes differentiate the Adolescent and Juvenile types of Idiopathic Scoliosis. PMID:23259508
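The fold-change cutoff described in the abstract above can be illustrated with a minimal Python sketch. This is not the GeneSpring procedure used by the authors; it assumes normalized log2 intensities per gene and per group, and the function name, gene labels, and values below are hypothetical toy inputs, not study data.

```python
from math import log2

def fold_change_hits(group_a, group_b, cutoff=2.0):
    """Return genes whose mean expression differs by at least `cutoff`-fold.

    group_a, group_b: dict mapping gene name -> list of normalized log2
    intensities (assumed input layout). The returned dict maps each
    selected gene to its log2 fold change (group_a minus group_b).
    """
    hits = {}
    for gene in group_a.keys() & group_b.keys():
        mean_a = sum(group_a[gene]) / len(group_a[gene])
        mean_b = sum(group_b[gene]) / len(group_b[gene])
        log2_fc = mean_a - mean_b  # a difference of log2 means is a log2 ratio
        # A fold change >= cutoff in either direction means |log2 FC| >= log2(cutoff).
        if abs(log2_fc) >= log2(cutoff):
            hits[gene] = log2_fc
    return hits

# Toy values for illustration only (hypothetical genes).
jis = {"GENE_A": [8.0, 8.2], "GENE_B": [6.0, 6.0], "GENE_C": [10.0, 10.1]}
ais = {"GENE_A": [6.9, 7.1], "GENE_B": [7.5, 7.5], "GENE_C": [10.0, 10.0]}
selected = fold_change_hits(jis, ais)
```

Working on the log2 scale makes the cutoff symmetric: a twofold increase and a twofold decrease are both one unit from zero.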
USDA-ARS's Scientific Manuscript database
The amount of microarray gene expression data in public repositories has been increasing exponentially for the last couple of decades. High-throughput microarray data integration and analysis has become a critical step in exploring the large amount of expression data for biological discovery. Howeve...
Microarrays Made Simple: "DNA Chips" Paper Activity
ERIC Educational Resources Information Center
Barnard, Betsy
2006-01-01
DNA microarray technology is revolutionizing biological science. DNA microarrays (also called DNA chips) allow simultaneous screening of many genes for changes in expression between different cells. Now researchers can obtain information about genes in days or weeks that used to take months or years. The paper activity described in this article…
Over the last decade, the introduction of microarray technology has had a profound impact on gene expression research. The publication of studies with dissimilar or altogether contradictory results, obtained using different microarray platforms to analyze identical RNA samples, h...
Large-scale analysis of gene expression using cDNA microarrays promises the rapid detection of the mode of toxicity for drugs and other chemicals. cDNA microarrays were used to examine chemically-induced alterations of gene expression in HepG2 cells exposed to oxidative ...
An efficient method to identify differentially expressed genes in microarray experiments
Qin, Huaizhen; Feng, Tao; Harding, Scott A.; Tsai, Chung-Jui; Zhang, Shuanglin
2013-01-01
Motivation Microarray experiments typically analyze thousands to tens of thousands of genes from small numbers of biological replicates. The fact that genes are normally expressed in functionally relevant patterns suggests that gene-expression data can be stratified and clustered into relatively homogenous groups. Cluster-wise dimensionality reduction should make it feasible to improve screening power while minimizing information loss. Results We propose a powerful and computationally simple method for finding differentially expressed genes in small microarray experiments. The method incorporates a novel stratification-based tight clustering algorithm, principal component analysis and information pooling. Comprehensive simulations show that our method is substantially more powerful than the popular SAM and eBayes approaches. We applied the method to three real microarray datasets: one from a Populus nitrogen stress experiment with 3 biological replicates; and two from public microarray datasets of human cancers with 10 to 40 biological replicates. In all three analyses, our method proved more robust than the popular alternatives for identification of differentially expressed genes. Availability The C++ code to implement the proposed method is available upon request for academic use. PMID:18453554
Clustering approaches to identifying gene expression patterns from DNA microarray data.
Do, Jin Hwan; Choi, Dong-Kug
2008-04-30
The analysis of microarray data is essential for large amounts of gene expression data. In this review we focus on clustering techniques. The biological rationale for this approach is the fact that many co-expressed genes are co-regulated, and identifying co-expressed genes could aid in functional annotation of novel genes, de novo identification of transcription factor binding sites and elucidation of complex biological pathways. Co-expressed genes are usually identified in microarray experiments by clustering techniques. There are many such methods, and the results obtained even for the same datasets may vary considerably depending on the algorithms and metrics for dissimilarity measures used, as well as on user-selectable parameters such as desired number of clusters and initial values. Therefore, biologists who want to interpret microarray data should be aware of the weaknesses and strengths of the clustering methods used. In this review, we survey the basic principles of clustering of DNA microarray data from crisp clustering algorithms such as hierarchical clustering, K-means and self-organizing maps, to complex clustering algorithms like fuzzy clustering.
Zenoni, Sara; D'Agostino, Nunzio; Tornielli, Giovanni B; Quattrocchio, Francesca; Chiusano, Maria L; Koes, Ronald; Zethof, Jan; Guzzo, Flavia; Delledonne, Massimo; Frusciante, Luigi; Gerats, Tom; Pezzotti, Mario
2011-10-01
Petunia is an excellent model system, especially for genetic, physiological and molecular studies. Thus far, however, genome-wide expression analysis has been applied rarely because of the lack of sequence information. We applied next-generation sequencing to generate, through de novo read assembly, a large catalogue of transcripts for Petunia axillaris and Petunia inflata. On the basis of both transcriptomes, comprehensive microarray chips for gene expression analysis were established and used for the analysis of global- and organ-specific gene expression in Petunia axillaris and Petunia inflata and to explore the molecular basis of the seed coat defects in a Petunia hybrida mutant, anthocyanin 11 (an11), lacking a WD40-repeat (WDR) transcription regulator. Among the transcripts differentially expressed in an11 seeds compared with wild type, many expected targets of AN11 were found but also several interesting new candidates that might play a role in morphogenesis of the seed coat. Our results validate the combination of next-generation sequencing with microarray analyses strategies to identify the transcriptome of two petunia species without previous knowledge of their genome, and to develop comprehensive chips as useful tools for the analysis of gene expression in P. axillaris, P. inflata and P. hybrida. © 2011 The Authors. The Plant Journal © 2011 Blackwell Publishing Ltd.
Ogunnaike, Babatunde A; Gelmi, Claudio A; Edwards, Jeremy S
2010-05-21
Gene expression studies generate large quantities of data with the defining characteristic that the number of genes (whose expression profiles are to be determined) exceeds the number of available replicates by several orders of magnitude. Standard spot-by-spot analysis still seeks to extract useful information for each gene on the basis of the number of available replicates, and thus plays to the weakness of microarrays. On the other hand, because of the data volume, treating the entire data set as an ensemble, and developing theoretical distributions for these ensembles provides a framework that plays instead to the strength of microarrays. We present theoretical results that under reasonable assumptions, the distribution of microarray intensities follows the Gamma model, with the biological interpretations of the model parameters emerging naturally. We subsequently establish that for each microarray data set, the fractional intensities can be represented as a mixture of Beta densities, and develop a procedure for using these results to draw statistical inference regarding differential gene expression. We illustrate the results with experimental data from gene expression studies on Deinococcus radiodurans following DNA damage using cDNA microarrays. Copyright (c) 2010 Elsevier Ltd. All rights reserved.
Zeller, Tanja; Wild, Philipp S.; Truong, Vinh; Trégouët, David-Alexandre; Munzel, Thomas; Ziegler, Andreas; Cambien, François; Blankenberg, Stefan; Tiret, Laurence
2011-01-01
Background The hypothesis of dosage compensation of genes of the X chromosome, supported by previous microarray studies, was recently challenged by RNA-sequencing data. It was suggested that microarray studies were biased toward an over-estimation of X-linked expression levels as a consequence of the filtering of genes below the detection threshold of microarrays. Methodology/Principal Findings To investigate this hypothesis, we used microarray expression data from circulating monocytes in 1,467 individuals. In total, 25,349 and 1,156 probes were unambiguously assigned to autosomes and the X chromosome, respectively. Globally, there was a clear shift of X-linked expressions toward lower levels than autosomes. We compared the ratio of expression levels of X-linked to autosomal transcripts (X∶AA) using two different filtering methods: 1. gene expressions were filtered out using a detection threshold irrespective of gene chromosomal location (the standard method in microarrays); 2. equal proportions of genes were filtered out separately on the X and on autosomes. For a wide range of filtering proportions, the X∶AA ratio estimated with the first method was not significantly different from 1, the value expected if dosage compensation was achieved, whereas it was significantly lower than 1 with the second method, leading to the rejection of the hypothesis of dosage compensation. We further showed in simulated data that the choice of the most appropriate method was dependent on biological assumptions regarding the proportion of actively expressed genes on the X chromosome comparative to the autosomes and the extent of dosage compensation. Conclusion/Significance This study shows that the method used for filtering out lowly expressed genes in microarrays may have a major impact according to the hypothesis investigated. The hypothesis of dosage compensation of X-linked genes cannot be firmly accepted or rejected using microarray-based data. PMID:21912656
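The two filtering strategies compared in the abstract above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: expression values are on a linear scale, the X:AA ratio is summarized as a ratio of medians (the abstract does not say which summary statistic was used), and the function names and toy values are hypothetical.

```python
from statistics import median

def xaa_global_threshold(x_expr, auto_expr, threshold):
    """Method 1: drop every probe below one detection threshold,
    irrespective of chromosomal location, then compare medians."""
    x_kept = [v for v in x_expr if v >= threshold]
    a_kept = [v for v in auto_expr if v >= threshold]
    return median(x_kept) / median(a_kept)

def xaa_equal_proportion(x_expr, auto_expr, fraction):
    """Method 2: drop the same fraction of lowest-expressed probes
    separately on the X chromosome and on the autosomes."""
    def keep_top(values):
        ordered = sorted(values)
        n_drop = int(len(ordered) * fraction)
        return ordered[n_drop:]
    return median(keep_top(x_expr)) / median(keep_top(auto_expr))

# Toy linear-scale intensities (hypothetical): X-linked probes shifted lower.
x_probes = [1, 2, 3, 4, 8, 9, 10, 10]
autosomal_probes = [5, 6, 7, 8, 9, 10, 11, 12]

ratio_method_1 = xaa_global_threshold(x_probes, autosomal_probes, threshold=8)
ratio_method_2 = xaa_equal_proportion(x_probes, autosomal_probes, fraction=0.5)
```

In this toy case a single threshold removes proportionally more X-linked probes, so the surviving X-linked values look closer to the autosomal ones than they do under equal-proportion filtering, which is the bias the abstract discusses.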
Cosart, Ted; Beja-Pereira, Albano; Luikart, Gordon
2014-11-01
The computer program EXONSAMPLER automates the sampling of thousands of exon sequences from publicly available reference genome sequences and gene annotation databases. It was designed to provide exon sequences for the efficient, next-generation gene sequencing method called exon capture. The exon sequences can be sampled by a list of gene name abbreviations (e.g. IFNG, TLR1), or by sampling exons from genes spaced evenly across chromosomes. It provides a list of genomic coordinates (a bed file), as well as a set of sequences in fasta format. User-adjustable parameters for collecting exon sequences include a minimum and maximum acceptable exon length, maximum number of exonic base pairs (bp) to sample per gene, and maximum total bp for the entire collection. It allows for partial sampling of very large exons. It can preferentially sample upstream (5 prime) exons, downstream (3 prime) exons, both external exons, or all internal exons. It is written in the Python programming language using its free libraries. We describe the use of EXONSAMPLER to collect exon sequences from the domestic cow (Bos taurus) genome for the design of an exon-capture microarray to sequence exons from related species, including the zebu cow and wild bison. We collected ~10% of the exome (~3 million bp), including 155 candidate genes, and ~16,000 exons evenly spaced genomewide. We prioritized the collection of 5 prime exons to facilitate discovery and genotyping of SNPs near upstream gene regulatory DNA sequences, which control gene expression and are often under natural selection. © 2014 John Wiley & Sons Ltd.
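The length and per-gene limits described in the abstract above can be illustrated with a short Python sketch. This is not EXONSAMPLER's code or interface: the function name, the tuple layout, and the parameter names are hypothetical, and the sketch assumes plus-strand genes whose exons are already listed in 5 prime to 3 prime order with BED-style 0-based, half-open coordinates.

```python
def sample_exons(exons, min_len, max_len, max_bp_per_gene):
    """Select exon intervals subject to length and per-gene bp limits.

    exons: list of (chrom, start, end, gene) tuples, assumed ordered
    5' -> 3' within each gene (hypothetical input layout).
    """
    used_bp = {}   # gene -> base pairs already sampled
    selected = []
    for chrom, start, end, gene in exons:
        length = end - start
        if length < min_len:
            continue  # exon is too short to be useful
        length = min(length, max_len)  # partial sampling of very large exons
        remaining = max_bp_per_gene - used_bp.get(gene, 0)
        if remaining < min_len:
            continue  # per-gene budget is exhausted
        length = min(length, remaining)
        selected.append((chrom, start, start + length, gene))
        used_bp[gene] = used_bp.get(gene, 0) + length
    return selected

def to_bed_lines(intervals):
    """Format intervals as tab-separated BED lines (chrom, start, end, name)."""
    return [f"{c}\t{s}\t{e}\t{g}" for c, s, e, g in intervals]

# Toy coordinates for illustration only.
toy_exons = [
    ("chr1", 100, 150, "IFNG"),    # 50 bp: below the minimum length
    ("chr1", 200, 1200, "IFNG"),   # 1000 bp: truncated to the maximum length
    ("chr1", 2000, 2300, "IFNG"),  # truncated to the remaining per-gene budget
    ("chr2", 10, 400, "TLR1"),
]
picked = sample_exons(toy_exons, min_len=100, max_len=500, max_bp_per_gene=600)
bed = to_bed_lines(picked)
```

Because the exons are visited in 5 prime to 3 prime order, the per-gene budget is spent on upstream exons first, which mirrors the prioritization of 5 prime exons mentioned in the abstract.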
Characterization of candidate genes in inflammatory bowel disease–associated risk loci
Peloquin, Joanna M.; Sartor, R. Balfour; Newberry, Rodney D.; McGovern, Dermot P.; Yajnik, Vijay; Lira, Sergio A.
2016-01-01
GWAS have linked SNPs to risk of inflammatory bowel disease (IBD), but a systematic characterization of disease-associated genes has been lacking. Prior studies utilized microarrays that did not capture many genes encoded within risk loci or defined expression quantitative trait loci (eQTLs) using peripheral blood, which is not the target tissue in IBD. To address these gaps, we sought to characterize the expression of IBD-associated risk genes in disease-relevant tissues and in the setting of active IBD. Terminal ileal (TI) and colonic mucosal tissues were obtained from patients with Crohn’s disease or ulcerative colitis and from healthy controls. We developed a NanoString code set to profile 678 genes within IBD risk loci. A subset of patients and controls were genotyped for IBD-associated risk SNPs. Analyses included differential expression and variance analysis, weighted gene coexpression network analysis, and eQTL analysis. We identified 116 genes that discriminate between healthy TI and colon samples and uncovered patterns in variance of gene expression that highlight heterogeneity of disease. We identified 107 coexpressed gene pairs for which transcriptional regulation is either conserved or reversed in an inflammation-independent or -dependent manner. We demonstrate that on average approximately 60% of disease-associated genes are differentially expressed in inflamed tissue. Last, we identified eQTLs with either genotype-only effects on expression or an interaction effect between genotype and inflammation. Our data reinforce tissue specificity of expression in disease-associated candidate genes, highlight genes and gene pairs that are regulated in disease-relevant tissue and inflammation, and provide a foundation to advance the understanding of IBD pathogenesis. PMID:27668286
Li, Zhiguang; Kwekel, Joshua C; Chen, Tao
2012-01-01
Functional comparison across microarray platforms is used to assess the comparability or similarity of the biological relevance associated with the gene expression data generated by multiple microarray platforms. Comparisons at the functional level are very important considering that the ultimate purpose of microarray technology is to determine the biological meaning behind the gene expression changes under a specific condition, not just to generate a list of genes. Herein, we present a method named percentage of overlapping functions (POF) and illustrate how it is used to perform the functional comparison of microarray data generated across multiple platforms. This method facilitates the determination of functional differences or similarities in microarray data generated from multiple array platforms across all the functions that are presented on these platforms. This method can also be used to compare the functional differences or similarities between experiments, projects, or laboratories.
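The abstract above does not give the formula for the percentage of overlapping functions (POF), so the following Python sketch shows only one plausible reading: the functions shared by two platforms as a percentage of all functions found on either platform. The function name, the choice of denominator, and the toy function labels are assumptions for illustration.

```python
def percentage_of_overlapping_functions(functions_a, functions_b):
    """Percentage of functions shared by two platforms.

    Assumed definition: 100 * |A intersect B| / |A union B|.
    The published POF method may use a different denominator.
    """
    a, b = set(functions_a), set(functions_b)
    union = a | b
    if not union:
        return 0.0  # no functions on either platform
    return 100.0 * len(a & b) / len(union)

# Toy function lists (hypothetical enrichment results per platform).
platform_1 = ["apoptosis", "cell cycle", "DNA repair"]
platform_2 = ["apoptosis", "cell cycle", "lipid metabolism"]
pof = percentage_of_overlapping_functions(platform_1, platform_2)
```

Comparing sets of functions rather than gene lists is what lets the measure be applied across platforms whose probe content differs.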
Thermodynamically optimal whole-genome tiling microarray design and validation.
Cho, Hyejin; Chou, Hui-Hsien
2016-06-13
Microarrays are an efficient tool for interrogating the whole transcriptome of a species. A microarray can be designed according to annotated gene sets, but the resulting microarrays cannot be used to identify novel transcripts, and this design method is not applicable to unannotated species. Alternatively, a whole-genome tiling microarray can be designed using only genomic sequences without gene annotations, and it can be used to detect novel RNA transcripts as well as known genes. The difficulty with tiling microarray design lies in the tradeoff between probe specificity and coverage of the genome. Sequence comparison methods based on BLAST or similar software are commonly employed in microarray design, but they cannot precisely determine the subtle thermodynamic competition between probe targets and partially matched probe nontargets during hybridizations. Using the whole-genome thermodynamic analysis software PICKY to design tiling microarrays, we can achieve maximum whole-genome coverage allowable under the thermodynamic constraints of each target genome. The resulting tiling microarrays are thermodynamically optimal in the sense that all selected probes share the same melting temperature separation range between their targets and closest nontargets, and no additional probes can be added without violating the specificity of the microarray to the target genome. This new design method was used to create two whole-genome tiling microarrays for Escherichia coli MG1655 and Agrobacterium tumefaciens C58, and the experimental results validated the design.
NCAM2 deletion in a boy with macrocephaly and autism: Cause, association or predisposition?
Scholz, Caroline; Steinemann, Doris; Mälzer, Madeleine; Roy, Mandy; Arslan-Kirchner, Mine; Illig, Thomas; Schmidtke, Jörg; Stuhrmann, Manfred
2016-10-01
We report on an 8-year-old boy with autism spectrum disorder (ASD), speech delay, behavioural problems, disturbed sleep and macrosomia including macrocephaly carrying a microdeletion that contains the entire NCAM2 gene and no other functional genes. Other family members with the microdeletion show a large skull circumference but do not exhibit any symptoms of autism spectrum disorder. Among many ASD-candidate genes, NCAM2 has been assumed to play a pivotal role in the development of ASD because of its function in the outgrowth and bundling of neurites. Our reported case raises the questions whether the NCAM2-deletion is the true cause of the ASD or only a risk factor and whether there might be any connection of NCAM2 with skull size. Keywords: autism spectrum disorder, macrocephaly, neural cell adhesion molecule 2 protein (NCAM2), array comparative genomic hybridization (microarray). Copyright © 2016 Elsevier Masson SAS. All rights reserved.
Genetics of autism spectrum disorders.
Kumar, Ravinesh A; Christian, Susan L
2009-05-01
Autism spectrum disorders (ASDs) are a clinically complex group of childhood disorders that have firm evidence of an underlying genetic etiology. Many techniques have been used to characterize the genetic bases of ASDs. Linkage studies have identified several replicated susceptibility loci, including 2q24-2q31, 7q, and 17q11-17q21. Association studies and mutation analysis of candidate genes have implicated the synaptic genes NRXN1, NLGN3, NLGN4, SHANK3, and CNTNAP2 in ASDs. Traditional cytogenetic approaches highlight the high frequency of large chromosomal abnormalities (3%-7% of patients), including the most frequently observed maternal 15q11-13 duplications (1%-3% of patients). Newly developed techniques include high-resolution DNA microarray technologies, which have discovered formerly undetectable submicroscopic copy number variants, and genomewide association studies, which allow simultaneous detection of multiple genes associated with ASDs. Although great progress has been made in autism genetics, the molecular bases of most ASDs remain enigmatic.
Shekhar, M S; Gomathi, A; Gopikrishna, G; Ponniah, A G
2015-06-01
White spot syndrome virus (WSSV) continues to be the most devastating viral pathogen infecting penaeid shrimp the world over. The genome of WSSV has been deciphered and characterized from three geographical isolates and significant progress has been made in developing various molecular diagnostic methods to detect the virus. However, the information on host immune gene response to WSSV pathogenesis is limited. Microarray analysis was carried out as an approach to analyse the gene expression in black tiger shrimp Penaeus monodon in response to WSSV infection. Gill tissues collected from the WSSV-infected shrimp at 6, 24, 48 h and moribund stage were analysed for differential gene expression. Shrimp cDNAs of 40,059 unique sequences were considered for designing the microarray chip. The Cy3-labeled cRNA derived from healthy and WSSV-infected shrimp was subjected to hybridization with all the DNA spots in the microarray, which revealed 8,633 up-regulated and 11,147 down-regulated genes at different time intervals post infection. The altered expression of these numerous genes represented diverse functions such as immune response, osmoregulation, apoptosis, nucleic acid binding, energy and metabolism, signal transduction, stress response and molting. The changes in gene expression profiles observed by microarray analysis provide molecular insights and a framework of genes which are up- and down-regulated at different time intervals during WSSV infection in shrimp. The microarray data were validated by Real Time analysis of four differentially expressed genes involved in apoptosis (translationally controlled tumor protein, inhibitor of apoptosis protein, ubiquitin conjugated enzyme E2 and caspase) for gene expression levels. The role of apoptosis related genes in WSSV infected shrimp is discussed herein.
2014-01-01
Background Genome-wide microarrays have been useful for predicting chemical-genetic interactions at the gene level. However, interpreting genome-wide microarray results can be overwhelming due to the vast output of gene expression data combined with off-target transcriptional responses many times induced by a drug treatment. This study demonstrates how experimental and computational methods can interact with each other, to arrive at more accurate predictions of drug-induced perturbations. We present a two-stage strategy that links microarray experimental testing and network training conditions to predict gene perturbations for a drug with a known mechanism of action in a well-studied organism. Results S. cerevisiae cells were treated with the antifungal, fluconazole, and expression profiling was conducted under different biological conditions using Affymetrix genome-wide microarrays. Transcripts were filtered with a formal network-based method, sparse simultaneous equation models and Lasso regression (SSEM-Lasso), under different network training conditions. Gene expression results were evaluated using both gene set and single gene target analyses, and the drug’s transcriptional effects were narrowed first by pathway and then by individual genes. Variables included: (i) Testing conditions – exposure time and concentration and (ii) Network training conditions – training compendium modifications. Two analyses of SSEM-Lasso output – gene set and single gene – were conducted to gain a better understanding of how SSEM-Lasso predicts perturbation targets. Conclusions This study demonstrates that genome-wide microarrays can be optimized using a two-stage strategy for a more in-depth understanding of how a cell manifests biological reactions to a drug treatment at the transcription level. Additionally, a more detailed understanding of how the statistical model, SSEM-Lasso, propagates perturbations through a network of gene regulatory interactions is achieved. PMID:24444313
Bumm, Klaus; Zheng, Mingzhong; Bailey, Clyde; Zhan, Fenghuang; Chiriva-Internati, M; Eddlemon, Paul; Terry, Julian; Barlogie, Bart; Shaughnessy, John D
2002-02-01
Clinical GeneOrganizer (CGO) is a novel Windows-based archiving, organization and data mining software for the integration of gene expression profiling in clinical medicine. The program implements various user-friendly tools and extracts data for further statistical analysis. This software was written for Affymetrix GeneChip *.txt files, but can also be used for any other microarray-derived data. The MS-SQL server version acts as a data mart and links microarray data with clinical parameters of any other existing database and therefore represents a valuable tool for combining gene expression analysis and clinical disease characteristics.
RNAi targeting GPR4 influences HMEC-1 gene expression by microarray analysis
Ren, Juan; Zhang, Yuelang; Cai, Hui; Ma, Hongbing; Zhao, Dongli; Zhang, Xiaozhi; Li, Zongfang; Wang, Shufeng; Wang, Jiangsheng; Liu, Rui; Li, Yi; Qian, Jiansheng; Wei, Hongxia; Niu, Liying; Liu, Yan; Xiao, Lisha; Ding, Muyang; Jiang, Shiwen
2014-01-01
G-protein coupled receptor 4 (GPR4) belongs to a protein family comprising 3 closely related G protein-coupled receptors. Recent studies have shown that GPR4 plays important roles in angiogenesis, proton sensing, and regulating tumor cells as an oncogene. How GPR4 carries out its functions is largely unknown. To detect the genes related to GPR4, microarray technology was employed. GPR4 is highly expressed in the human vascular endothelial cell line HMEC-1. Small interfering RNA against GPR4 was used to knock down GPR4 expression in HMEC-1. RNA from the GPR4 knockdown cells and control cells was then analyzed by genome microarray. Microarray results showed that, among all genes and expressed sequence tags, 447 differentially expressed genes were identified, comprising 318 up-regulated genes and 129 down-regulated genes. These genes, whose expression changed dramatically, may be involved in GPR4 functions. They were related to cell apoptosis, cytoskeleton and signal transduction, cell proliferation, differentiation and cell-cycle regulation, gene transcription and translation, and cell material and energy metabolism. PMID:24753754
Bull, James C.; Ryabov, Eugene V.; Prince, Gill; Mead, Andrew; Zhang, Cunjin; Baxter, Laura A.; Pell, Judith K.; Osborne, Juliet L.; Chandler, Dave
2012-01-01
Honeybees, Apis mellifera, show age-related division of labor in which young adults perform maintenance (“housekeeping”) tasks inside the colony before switching to outside foraging at approximately 23 days old. Disease resistance is an important feature of honeybee biology, but little is known about the interaction of pathogens and age-related division of labor. We tested a hypothesis that older forager bees and younger “house” bees differ in susceptibility to infection. We coupled an infection bioassay with a functional analysis of gene expression in individual bees using a whole genome microarray. Forager bees treated with the entomopathogenic fungus Metarhizium anisopliae s.l. survived for significantly longer than house bees. This was concomitant with substantial differences in gene expression including genes associated with immune function. In house bees, infection was associated with differential expression of 35 candidate immune genes contrasted with differential expression of only two candidate immune genes in forager bees. For control bees (i.e. not treated with M. anisopliae) the development from the house to the forager stage was associated with differential expression of 49 candidate immune genes, including up-regulation of the antimicrobial peptide gene abaecin, plus major components of the Toll pathway, serine proteases, and serpins. We infer that reduced pathogen susceptibility in forager bees was associated with age-related activation of specific immune system pathways. Our findings contrast with the view that the immunocompetence in social insects declines with the onset of foraging as a result of a trade-off in the allocation of resources for foraging. The up-regulation of immune-related genes in young adult bees in response to M. anisopliae infection was an indicator of disease susceptibility; this also challenges previous research in social insects, in which an elevated immune status has been used as a marker of increased disease resistance and fitness without considering the effects of age-related development. PMID:23300441
Comparison of RNA-seq and microarray-based models for clinical endpoint prediction.
Zhang, Wenqian; Yu, Ying; Hertwig, Falk; Thierry-Mieg, Jean; Zhang, Wenwei; Thierry-Mieg, Danielle; Wang, Jian; Furlanello, Cesare; Devanarayan, Viswanath; Cheng, Jie; Deng, Youping; Hero, Barbara; Hong, Huixiao; Jia, Meiwen; Li, Li; Lin, Simon M; Nikolsky, Yuri; Oberthuer, André; Qing, Tao; Su, Zhenqiang; Volland, Ruth; Wang, Charles; Wang, May D; Ai, Junmei; Albanese, Davide; Asgharzadeh, Shahab; Avigad, Smadar; Bao, Wenjun; Bessarabova, Marina; Brilliant, Murray H; Brors, Benedikt; Chierici, Marco; Chu, Tzu-Ming; Zhang, Jibin; Grundy, Richard G; He, Min Max; Hebbring, Scott; Kaufman, Howard L; Lababidi, Samir; Lancashire, Lee J; Li, Yan; Lu, Xin X; Luo, Heng; Ma, Xiwen; Ning, Baitang; Noguera, Rosa; Peifer, Martin; Phan, John H; Roels, Frederik; Rosswog, Carolina; Shao, Susan; Shen, Jie; Theissen, Jessica; Tonini, Gian Paolo; Vandesompele, Jo; Wu, Po-Yen; Xiao, Wenzhong; Xu, Joshua; Xu, Weihong; Xuan, Jiekun; Yang, Yong; Ye, Zhan; Dong, Zirui; Zhang, Ke K; Yin, Ye; Zhao, Chen; Zheng, Yuanting; Wolfinger, Russell D; Shi, Tieliu; Malkas, Linda H; Berthold, Frank; Wang, Jun; Tong, Weida; Shi, Leming; Peng, Zhiyu; Fischer, Matthias
2015-06-25
Gene expression profiling is being widely applied in cancer research to identify biomarkers for clinical endpoint prediction. Since RNA-seq provides a powerful tool for transcriptome-based applications beyond the limitations of microarrays, we sought to systematically evaluate the performance of RNA-seq-based and microarray-based classifiers in this MAQC-III/SEQC study for clinical endpoint prediction using neuroblastoma as a model. We generate gene expression profiles from 498 primary neuroblastomas using both RNA-seq and 44 k microarrays. Characterization of the neuroblastoma transcriptome by RNA-seq reveals that more than 48,000 genes and 200,000 transcripts are being expressed in this malignancy. We also find that RNA-seq provides much more detailed information on specific transcript expression patterns in clinico-genetic neuroblastoma subgroups than microarrays. To systematically compare the power of RNA-seq and microarray-based models in predicting clinical endpoints, we divide the cohort randomly into training and validation sets and develop 360 predictive models on six clinical endpoints of varying predictability. Evaluation of factors potentially affecting model performances reveals that prediction accuracies are most strongly influenced by the nature of the clinical endpoint, whereas technological platforms (RNA-seq vs. microarrays), RNA-seq data analysis pipelines, and feature levels (gene vs. transcript vs. exon-junction level) do not significantly affect performances of the models. We demonstrate that RNA-seq outperforms microarrays in determining the transcriptomic characteristics of cancer, while RNA-seq and microarray-based models perform similarly in clinical endpoint prediction. Our findings may be valuable to guide future studies on the development of gene expression-based predictive models and their implementation in clinical practice.
Gene expression analysis in MCF-7 breast cancer cells treated with recombinant bromelain.
Fouz, Nour; Amid, Azura; Hashim, Yumi Zuhanis Has-Yun
2014-08-01
The contributing molecular pathways underlying the pathogenesis of breast cancer need to be better characterized. The aim of our study was to better understand the genetic mechanism of oncogenesis for human breast cancer and to discover new possible tumor markers for use in clinical practice. We used complementary DNA (cDNA) microarrays to compare the gene expression profiles of Michigan Cancer Foundation-7 (MCF-7) cells treated with recombinant bromelain and untreated MCF-7 cells. Differential expression was analyzed with SpringGene, followed by Ingenuity Pathway Analysis (IPA), to understand the underlying consequences for the development of disease and disorders. We identified 1,102 known genes that were differentially expressed to a significant degree (p<0.001) between treated and untreated cells. Within this gene set, 20 genes were significantly changed between treated cells and the control cells with a cutoff fold change of more than 1.5. These genes are RNA-binding motif, single-stranded interacting protein 1 (RBMS1), ribosomal protein L29 (RPL29), glutathione S-transferase mu 2 (GSTM2), C15orf32, Akt3, B cell translocation gene 1 (BTG1), C6orf62, C7orf60, kinesin-associated protein 3 (KIFAP3), FBXO11, AT-rich interactive domain 4A (ARID4A), COPS2, TBPL1|SLC2A12, TMEM59, SNORD46, glioma tumor suppressor candidate region gene 2 (GLTSCR2), and LRRFIP. Our observation on gene expression indicated that recombinant bromelain produces a unique signature affecting different pathways, specific for each congener. The microarray results give molecular mechanistic insight into the functional effects of recombinant bromelain treatment. The changed genes are significantly involved in gap junction signaling, amyloid processing, cell cycle regulation by BTG family proteins, and breast cancer regulation by stathmin1, in which they play major roles.
Temephos Resistance in Aedes aegypti in Colombia Compromises Dengue Vector Control
Grisales, Nelson; Poupardin, Rodolphe; Gomez, Santiago; Fonseca-Gonzalez, Idalyd; Ranson, Hilary; Lenhart, Audrey
2013-01-01
Background Control and prevention of dengue relies heavily on the application of insecticides to control dengue vector mosquitoes. In Colombia, application of the larvicide temephos to the aquatic breeding sites of Aedes aegypti is a key part of the dengue control strategy. Resistance to temephos was recently detected in the dengue-endemic city of Cucuta, leading to questions about its efficacy as a control tool. Here, we characterize the underlying mechanisms and estimate the operational impact of this resistance. Methodology/Principal Findings Larval bioassays of Ae. aegypti larvae from Cucuta determined the temephos LC50 to be 0.066 ppm (95% CI 0.06–0.074), approximately 15× higher than the value obtained from a susceptible laboratory colony. The efficacy of the field dose of temephos at killing this resistant Cucuta population was greatly reduced, with mortality rates <80% two weeks after application and <50% after 4 weeks. Neither biochemical assays nor partial sequencing of the ace-1 gene implicated target site resistance as the primary resistance mechanism. Synergism assays and microarray analysis suggested that metabolic mechanisms were most likely responsible for the temephos resistance. Interestingly, although the greatest synergism was observed with the carboxylesterase inhibitor, DEF, the primary candidate genes from the microarray analysis, and confirmed by quantitative PCR, were cytochrome P450 oxidases, notably CYP6N12, CYP6F3 and CYP6M11. Conclusions/Significance In Colombia, resistance to temephos in Ae. aegypti compromises the duration of its effect as a vector control tool. Several candidate genes potentially responsible for metabolic resistance to temephos were identified. 
Given the limited number of insecticides that are approved for vector control, future chemical-based control strategies should take into account the mechanisms underlying the resistance to discern which insecticides would likely lead to the greatest control efficacy while minimizing further selection of resistant phenotypes. PMID:24069492
Jovanović, Katarina K; Tanić, Miljana; Ivanović, Ivanka; Gligorijević, Nevenka; Dojčinović, Biljana P; Radulović, Siniša
2016-10-01
Ruthenium(II)-arene complexes are promising drug candidates for the therapy of solid tumors. In previous work, seven new compounds of the general formula [Ru(η⁶-p-cymene)(L1-7)Cl] were synthesized and characterized, of which the complex with L=isoquinoline-3-carboxylic acid (RuT7) was two times as active on HeLa cells compared to normal cell line MRC-5, as indicated by IC50 values determined after 48 h of incubation (45.4±3.0 vs. 84.2±5.7 μM, respectively). In the present study, cell cycle analysis of HeLa cells treated with RuT7 showed S phase arrest and an increase in sub-G1 population. The apoptotic potential of the title compound was confirmed with the Annexin V-FITC/PI assay together with a morphological evaluation of cells using fluorescent microscopy. Analysis of the intracellular accumulation of ruthenium showed 8.9 ng Ru/10⁶ cells after 6 h of incubation. To gain further insight in the molecular mechanism of action of RuT7 on HeLa cells, a whole-transcriptome microarray gene expression analysis was performed. Analysis of functional categories and signaling and biochemical pathways associated with the response of HeLa cells to treatment with RuT7 showed that it leads the cells through the intrinsic (mitochondrial) apoptotic pathway, via indirect DNA damage due to the action of reactive oxygen species, and through direct DNA binding of RuT7. Statistical analysis for enrichment of gene sets associated with known drug-induced toxicities identified fewer associated toxicity profiles in RuT7-treated cells compared to cisplatin treatment. Altogether these results provide the basis for further development of RuT7 in animal and pre-clinical studies as a potential drug candidate.
Genome-wide identification of WRKY family genes and their response to cold stress in Vitis vinifera
2014-01-01
Background WRKY transcription factors are one of the largest families of transcriptional regulators in plants. WRKY genes are not only found to play significant roles in biotic and abiotic stress response, but also regulate growth and development. Grapevine (Vitis vinifera) production is largely limited by stressful climate conditions such as cold stress and the role of WRKY genes in the survival of grapevine under these conditions remains unknown. Results We identified a total of 59 VvWRKYs from the V. vinifera genome, belonging to four subgroups according to conserved WRKY domains and zinc-finger structure. The majority of VvWRKYs were expressed in more than one tissue among the 7 tissues examined which included young leaves, mature leaves, tendril, stem apex, root, young fruits and ripe fruits. Publicly available microarray data suggested that a subset of VvWRKYs was activated in response to diverse stresses. Quantitative real-time PCR (qRT-PCR) results demonstrated that the expression levels of 36 VvWRKYs are changed following cold exposure. Comparative analysis was performed on data from publicly available microarray experiments, previous global transcriptome analysis studies, and qRT-PCR. We identified 15 VvWRKYs in at least two of these databases which may relate to cold stress. Among them, the transcription of three genes can be induced by exogenous ABA application, suggesting that they can be involved in an ABA-dependent signaling pathway in response to cold stress. Conclusions We identified 59 VvWRKYs from the V. vinifera genome and 15 of them showed cold stress-induced expression patterns. These genes represented candidate genes for future functional analysis of VvWRKYs involved in the low temperature-related signal pathways in grape. PMID:24755338
Gene-body hypermethylation of ATM in peripheral blood DNA of bilateral breast cancer patients
Flanagan, James M.; Munoz-Alegre, Marta; Henderson, Stephen; Tang, Thomas; Sun, Ping; Johnson, Nichola; Fletcher, Olivia; dos Santos Silva, Isabel; Peto, Julian; Boshoff, Chris; Narod, Steven; Petronis, Arturas
2009-01-01
Bilaterality of breast cancer is an indicator of constitutional cancer susceptibility; however, the molecular causes underlying this predisposition in the majority of cases is not known. We hypothesize that epigenetic misregulation of cancer-related genes could partially account for this predisposition. We have performed methylation microarray analysis of peripheral blood DNA from 14 women with bilateral breast cancer compared with 14 unaffected matched controls throughout 17 candidate breast cancer susceptibility genes including BRCA1, BRCA2, CHEK2, ATM, ESR1, SFN, CDKN2A, TP53, GSTP1, CDH1, CDH13, HIC1, PGR, SFRP1, MLH1, RARB and HSD17B4. We show that the majority of methylation variability is associated with intragenic repetitive elements. Detailed validation of the tiled region around ATM was performed by bisulphite modification and pyrosequencing of the same samples and in a second set of peripheral blood DNA from 190 bilateral breast cancer patients compared with 190 controls. We show significant hypermethylation of one intragenic repetitive element in breast cancer cases compared with controls (P = 0.0017), with the highest quartile of methylation associated with a 3-fold increased risk of breast cancer (OR 3.20, 95% CI 1.78–5.86, P = 0.000083). Increased methylation of this locus is associated with lower steady-state ATM mRNA level and correlates with age of cancer patients but not controls, suggesting a combined age–phenotype-related association. This research demonstrates the potential for gene-body epigenetic misregulation of ATM and other cancer-related genes in peripheral blood DNA that may be useful as a novel marker to estimate breast cancer risk. Accession numbers: The microarray data and associated .BED and .WIG files can be accessed through Gene Expression Omnibus accession number: GSE14603. PMID:19153073
DOE Office of Scientific and Technical Information (OSTI.GOV)
Labbe, Jessy L; Jorge, Veronique; Vion, Patrice
A Populus deltoides × Populus trichocarpa F1 pedigree was analyzed for quantitative trait loci (QTLs) affecting ectomycorrhizal development and for microarray characterization of gene networks involved in this symbiosis. A 300 genotype progeny set was evaluated for its ability to form ectomycorrhiza with the basidiomycete Laccaria bicolor. The percentage of mycorrhizal root tips was determined on the root systems of all 300 progeny and their two parents. QTL analysis identified four significant QTLs, one on the P. deltoides and three on the P. trichocarpa genetic maps. These QTLs were aligned to the P. trichocarpa genome and each contained several megabases and encompassed numerous genes. NimbleGen whole-genome microarray, using cDNA from RNA extracts of ectomycorrhizal root tips from the parental genotypes P. trichocarpa and P. deltoides, was used to narrow the candidate gene list. Among the 1,543 differentially expressed genes (p value 0.05; 5.0-fold change in transcript level) having different transcript levels in mycorrhiza of the two parents, 41 transcripts were located in the QTL intervals: 20 in Myc_d1, 14 in Myc_t1, and seven in Myc_t2, while no significant differences among transcripts were found in Myc_t3. Among these 41 transcripts, 25 were overrepresented in P. deltoides relative to P. trichocarpa; 16 were overrepresented in P. trichocarpa. The transcript showing the highest overrepresentation in P. trichocarpa mycorrhiza libraries compared to P. deltoides mycorrhiza codes for an ethylene-sensitive EREBP-4 protein which may repress defense mechanisms in P. trichocarpa, while the highest overrepresented transcripts in P. deltoides code for proteins/genes typically associated with pathogen resistance.
Deciphering the Function of New Gonococcal Vaccine Antigens Using Phenotypic Microarrays
Baarda, Benjamin I.; Emerson, Sarah; Proteau, Philip J.
2017-01-01
ABSTRACT The function and extracellular location of cell envelope proteins make them attractive candidates for developing vaccines against bacterial diseases, including challenging drug-resistant pathogens, such as Neisseria gonorrhoeae. A proteomics-driven reverse vaccinology approach has delivered multiple gonorrhea vaccine candidates; however, the biological functions of many of them remain to be elucidated. Herein, the functions of six gonorrhea vaccine candidates—NGO2121, NGO1985, NGO2054, NGO2111, NGO1205, and NGO1344—in cell envelope homeostasis were probed using phenotype microarrays under 1,056 conditions and a ΔbamE mutant (Δngo1780) as a reference of perturbed outer membrane integrity. Optimal growth conditions for an N. gonorrhoeae phenotype microarray assay in defined liquid medium were developed, which can be useful in other applications, including rapid and thorough antimicrobial susceptibility assessment. Our studies revealed 91 conditions having uniquely positive or negative effects on one of the examined mutants. A cluster analysis of 37 and 57 commonly beneficial and detrimental compounds, respectively, revealed three separate phenotype groups: NGO2121 and NGO1985; NGO1344 and BamE; and the trio of NGO1205, NGO2111, and NGO2054, with the last protein forming an independent branch of this cluster. Similar phenotypes were associated with loss of these vaccine candidates in the highly antibiotic-resistant WHO X strain. Based on their extensive sensitivity phenomes, NGO1985 and NGO2121 appear to be the most promising vaccine candidates. This study establishes the principle that phenotype microarrays can be successfully applied to a fastidious bacterial organism, such as N. gonorrhoeae. IMPORTANCE Innovative approaches are required to develop vaccines against prevalent and neglected sexually transmitted infections, such as gonorrhea. 
Herein, we have utilized phenotype microarrays in the first such investigation into Neisseria gonorrhoeae to probe the function of proteome-derived vaccine candidates in cell envelope homeostasis. Information gained from this screening can feed the vaccine candidate decision tree by providing insights into the roles these proteins play in membrane permeability, integrity, and overall N. gonorrhoeae physiology. The optimized screening protocol can be applied in investigations into the function of other hypothetical proteins of N. gonorrhoeae discovered in the expanding number of whole-genome sequences, in addition to revealing phenotypic differences between clinical and laboratory strains. PMID:28630127
2010-01-01
Background The development of DNA microarrays has facilitated the generation of hundreds of thousands of transcriptomic datasets. The use of a common reference microarray design allows existing transcriptomic data to be readily compared and re-analysed in the light of new data, and the combination of this design with large datasets is ideal for 'systems'-level analyses. One issue is that these datasets are typically collected over many years and may be heterogeneous in nature, containing different microarray file formats and gene array layouts, dye-swaps, and showing varying scales of log2- ratios of expression between microarrays. Excellent software exists for the normalisation and analysis of microarray data but many data have yet to be analysed as existing methods struggle with heterogeneous datasets; options include normalising microarrays on an individual or experimental group basis. Our solution was to develop the Batch Anti-Banana Algorithm in R (BABAR) algorithm and software package which uses cyclic loess to normalise across the complete dataset. We have already used BABAR to analyse the function of Salmonella genes involved in the process of infection of mammalian cells. Results The only input required by BABAR is unprocessed GenePix or BlueFuse microarray data files. BABAR provides a combination of 'within' and 'between' microarray normalisation steps and diagnostic boxplots. When applied to a real heterogeneous dataset, BABAR normalised the dataset to produce a comparable scaling between the microarrays, with the microarray data in excellent agreement with RT-PCR analysis. When applied to a real non-heterogeneous dataset and a simulated dataset, BABAR's performance in identifying differentially expressed genes showed some benefits over standard techniques. Conclusions BABAR is an easy-to-use software tool, simplifying the simultaneous normalisation of heterogeneous two-colour common reference design cDNA microarray-based transcriptomic datasets. 
We show BABAR transforms real and simulated datasets to allow for the correct interpretation of these data, and is the ideal tool to facilitate the identification of differentially expressed genes or network inference analysis from transcriptomic datasets. PMID:20128918
Mendrzyk, Frank; Radlwimmer, Bernhard; Joos, Stefan; Kokocinski, Felix; Benner, Axel; Stange, Daniel E; Neben, Kai; Fiegler, Heike; Carter, Nigel P; Reifenberger, Guido; Korshunov, Andrey; Lichter, Peter
2005-12-01
Medulloblastoma is the most common malignant brain tumor in children. Despite multimodal aggressive treatment, nearly half of the patients die as a result of this tumor. Identification of molecular markers for prognosis and development of novel pathogenesis-based therapies depends crucially on a better understanding of medulloblastoma pathomechanisms. We performed genome-wide analysis of DNA copy number imbalances in 47 medulloblastomas using comparative genomic hybridization to large insert DNA microarrays (matrix-CGH). The expression of selected candidate genes identified by matrix-CGH was analyzed immunohistochemically on tissue microarrays representing medulloblastomas from 189 clinically well-documented patients. To identify novel prognostic markers, genomic findings and protein expression data were correlated to patient survival. Matrix-CGH analysis revealed frequent DNA copy number alterations of several novel candidate regions. Among these, gains at 17q23.2-qter (P < .01) and losses at 17p13.1 to 17p13.3 (P = .04) were significantly correlated to poor prognosis. Within 17q23.2-qter and 7q21.2, two of the most frequently gained chromosomal regions, confined amplicons were identified that contained the PPM1D and CDK6 genes, respectively. Immunohistochemistry revealed strong expression of PPM1D in 148 (88%) of 168 and CDK6 in 50 (30%) of 169 medulloblastomas. Overexpression of CDK6 correlated significantly with poor prognosis (P < .01) and represented an independent prognostic marker of overall survival on multivariate analysis (P = .02). We identified CDK6 as a novel molecular marker that can be determined by immunohistochemistry on routinely processed tissue specimens and may facilitate the prognostic assessment of medulloblastoma patients. Furthermore, increased protein-levels of PPM1D and CDK6 may link the TP53 and RB1 tumor suppressor pathways to medulloblastoma pathomechanisms.
de Bruin, Christiaan; Mericq, Verónica; Andrew, Shayne F.; van Duyvenvoorde, Hermine A.; Verkaik, Nicole S.; Losekoot, Monique; Porollo, Aleksey; Garcia, Hernán; Kuang, Yi; Hanson, Dan; Clayton, Peter; van Gent, Dik C.; Wit, Jan M.; Hwa, Vivian
2015-01-01
Context: Severe short stature can be caused by defects in numerous biological processes including defects in IGF-1 signaling, centromere function, cell cycle control, and DNA damage repair. Many syndromic causes of short stature are associated with medical comorbidities including hypogonadism and microcephaly. Objective: To identify an underlying genetic etiology in two siblings with severe short stature and gonadal failure. Design: Clinical phenotyping, genetic analysis, complemented by in vitro functional studies of the candidate gene. Setting: An academic pediatric endocrinology clinic. Patients or Other Participants: Two adult siblings (male patient [P1] and female patient 2 [P2]) presented with a history of severe postnatal growth failure (adult heights: P1, −6.8 SD score; P2, −4 SD score), microcephaly, primary gonadal failure, and early-onset metabolic syndrome in late adolescence. In addition, P2 developed a malignant gastrointestinal stromal tumor at age 28. Intervention(s): Single nucleotide polymorphism microarray and exome sequencing. Results: Combined microarray analysis and whole exome sequencing of the two affected siblings and one unaffected sister identified a homozygous variant in XRCC4 as the probable candidate variant. Sanger sequencing and mRNA studies revealed a splice variant resulting in an in-frame deletion of 23 amino acids. Primary fibroblasts (P1) showed a DNA damage repair defect. Conclusions: In this study we have identified a novel pathogenic variant in XRCC4, a gene that plays a critical role in non-homologous end-joining DNA repair. This finding expands the spectrum of DNA damage repair syndromes to include XRCC4 deficiency causing severe postnatal growth failure, microcephaly, gonadal failure, metabolic syndrome, and possibly tumor predisposition. PMID:25742519
Microarray data from independent labs and studies can be compared to potentially identify toxicologically and biologically relevant genes. The Baseline Animal Database working group of HESI was formed to assess baseline gene expression from microarray data derived from control or...
Decreased triadin and increased calstabin2 expression in Great Danes with dilated cardiomyopathy.
Oyama, M A; Chittur, S V; Reynolds, C A
2009-01-01
Dilated cardiomyopathy (DCM) is a common cardiac disease of Great Dane dogs, yet very little is known about the underlying molecular abnormalities that contribute to disease. The objective was to discover a set of genes that are differentially expressed in Great Dane dogs with DCM as a way to identify candidate genes for further study as well as to better understand the molecular abnormalities that underlie the disease. This prospective study included three Great Dane dogs with end-stage DCM and 3 large breed control dogs. Transcriptional activity of 42,869 canine DNA sequences was determined with a canine-specific oligonucleotide microarray. Genome expression patterns of left ventricular tissue samples from affected Great Dane dogs were evaluated by measuring the relative amount of complementary RNA hybridization to the microarray probes and comparing it with expression from large breed dogs with noncardiac disease. Three hundred and twenty-three transcripts were differentially expressed (≥2-fold change). The transcript with the greatest degree of upregulation (+61.3-fold) was calstabin2 (FKBP12.6), whereas the transcript with the greatest degree of downregulation (-9.07-fold) was triadin. Calstabin2 and triadin are both regulatory components of the cardiac ryanodine receptor (RyR2) and are critical to normal intracellular Ca2+ release and excitation-contraction coupling. Great Dane dogs with DCM demonstrate abnormal calstabin2 and triadin expression. These changes likely affect Ca2+ flux within cardiac cells and may contribute to the pathophysiology of disease. Microarray-based analysis identifies calstabin2, triadin, and RyR2 function as targets of future study.
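The signed fold changes quoted in this record (+61.3-fold, -9.07-fold) and its 2-fold cutoff can be illustrated with a minimal sketch. It assumes the common convention that a disease/control expression ratio below 1 is reported as the negative reciprocal; the abstract does not state which convention was used, and the function names are hypothetical.

```python
def signed_fold_change(ratio: float) -> float:
    """Convert an expression ratio (disease / control) to a signed fold change.

    Assumed convention (not confirmed by the abstract): ratios >= 1 are
    reported as-is, ratios < 1 as the negative reciprocal, so 0.5 -> -2.0.
    """
    if ratio <= 0:
        raise ValueError("expression ratio must be positive")
    return ratio if ratio >= 1.0 else -1.0 / ratio


def is_differential(ratio: float, threshold: float = 2.0) -> bool:
    """True when the magnitude of the signed fold change meets the cutoff."""
    return abs(signed_fold_change(ratio)) >= threshold
```

Under this convention a transcript at half the control level and one at twice the control level both sit exactly on the 2-fold cutoff.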
Araripe, Luciana O; Montenegro, Horácio; Lemos, Bernardo; Hartl, Daniel L
2010-12-14
Hybrid male sterility (HMS) is a usual outcome of hybridization between closely related animal species. It arises because interactions between alleles that are functional within one species may be disrupted in hybrids. The identification of genes leading to hybrid sterility is of great interest for understanding the evolutionary process of speciation. In the current work we used marked P-element insertions as dominant markers to efficiently locate one genetic factor causing a severe reduction in fertility in hybrid males of Drosophila simulans and D. mauritiana. Our mapping effort identified a region of 9 kb on chromosome 3, containing three complete and one partial coding sequences. Within this region, two annotated genes are suggested as candidates for the HMS factor, based on the comparative molecular characterization and public-source information. Gene Taf1 is partially contained in the region, but yet shows high polymorphism with four fixed non-synonymous substitutions between the two species. Its molecular functions involve sequence-specific DNA binding and transcription factor activity. Gene agt is a small, intronless gene, whose molecular function is annotated as methylated-DNA-protein-cysteine S-methyltransferase activity. High polymorphism and one fixed non-synonymous substitution suggest this is a fast evolving gene. The gene trees of both genes perfectly separate D. simulans and D. mauritiana into monophyletic groups. Analysis of gene expression using microarray revealed trends that were similar to those previously found in comparisons between whole-genome hybrids and parental species. The identification following confirmation of the HMS candidate gene will add another case study leading to understanding the evolutionary process of hybrid incompatibility.
Strauss, Christian; Endimiani, Andrea; Perreten, Vincent
2015-01-01
A rapid and simple DNA labeling system has been developed for disposable microarrays and has been validated for the detection of 117 antibiotic resistance genes abundant in Gram-positive bacteria. The DNA was fragmented and amplified using phi-29 polymerase and random primers with linkers. Labeling and further amplification were then performed by classic PCR amplification using biotinylated primers specific for the linkers. The microarray developed by Perreten et al. (Perreten, V., Vorlet-Fawer, L., Slickers, P., Ehricht, R., Kuhnert, P., Frey, J., 2005. Microarray-based detection of 90 antibiotic resistance genes of gram-positive bacteria. J.Clin.Microbiol. 43, 2291-2302.) was improved by additional oligonucleotides. A total of 244 oligonucleotides (26 to 37 nucleotide length and with similar melting temperatures) were spotted on the microarray, including genes conferring resistance to clinically important antibiotic classes like β-lactams, macrolides, aminoglycosides, glycopeptides and tetracyclines. Each antibiotic resistance gene is represented by at least 2 oligonucleotides designed from consensus sequences of gene families. The specificity of the oligonucleotides and the quality of the amplification and labeling were verified by analysis of a collection of 65 strains belonging to 24 species. Association between genotype and phenotype was verified for 6 antibiotics using 77 Staphylococcus strains belonging to different species and revealed 95% test specificity and a 93% predictive value of a positive test. The DNA labeling and amplification is independent of the species and of the target genes and could be used for different types of microarrays. This system has also the advantage to detect several genes within one bacterium at once, like in Staphylococcus aureus strain BM3318, in which up to 15 genes were detected. 
This new microarray-based detection system offers a large potential for applications in clinical diagnostic, basic research, food safety and surveillance programs for antimicrobial resistance.
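The genotype-phenotype validation in this record reports a test specificity and a predictive value of a positive test. As a reminder of how these two figures are derived from a confusion table, here is a minimal sketch; the counts in the usage note are hypothetical and are not the study's data.

```python
def specificity(true_neg: int, false_pos: int) -> float:
    """Fraction of phenotypically susceptible isolates with no resistance gene detected."""
    return true_neg / (true_neg + false_pos)


def positive_predictive_value(true_pos: int, false_pos: int) -> float:
    """Fraction of gene-positive results that match a resistant phenotype."""
    return true_pos / (true_pos + false_pos)
```

For example, with hypothetical counts of 190 true negatives and 10 false positives, `specificity(190, 10)` gives 0.95.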
Microarray profiling of chemical-induced effects is being increasingly used in medium and high-throughput formats. In this study, we describe computational methods to identify molecular targets from whole-genome microarray data using as an example the estrogen receptor α (ERα), ...
Optimization of cDNA microarrays procedures using criteria that do not rely on external standards.
Bruland, Torunn; Anderssen, Endre; Doseth, Berit; Bergum, Hallgeir; Beisvag, Vidar; Laegreid, Astrid
2007-10-18
The measurement of gene expression using microarray technology is a complicated process in which a large number of factors can be varied. Due to the lack of standard calibration samples such as are used in traditional chemical analysis it may be a problem to evaluate whether changes made to the microarray procedure actually improve the identification of truly differentially expressed genes. The purpose of the present work is to report the optimization of several steps in the microarray process both in laboratory practices and in data processing using criteria that do not rely on external standards. We performed a cDNA microarray experiment including RNA from samples with high expected differential gene expression termed "high contrasts" (rat cell lines AR42J and NRK52E) compared to self-self hybridization, and optimized a pipeline to maximize the number of genes found to be differentially expressed in the "high contrasts" RNA samples by estimating the false discovery rate (FDR) using a null distribution obtained from the self-self experiment. The proposed high-contrast versus self-self method (HCSSM) requires only four microarrays per evaluation. The effects of blocking reagent dose, filtering, and background correction methodologies were investigated. In our experiments a dose of 250 ng LNA (locked nucleic acid) dT blocker, no background correction and weight based filtering gave the largest number of differentially expressed genes. The choice of background correction method had a stronger impact on the estimated number of differentially expressed genes than the choice of filtering method. Cross platform microarray (Illumina) analysis was used to validate that the increase in the number of differentially expressed genes found by HCSSM was real. The results show that HCSSM can be a useful and simple approach to optimize microarray procedures without including external standards. 
Our optimizing method is highly applicable to both long oligo-probe microarrays which have become commonly used for well characterized organisms such as man, mouse and rat, as well as to cDNA microarrays which are still of importance for organisms with incomplete genome sequence information such as many bacteria, plants and fish.
Optimization of cDNA microarrays procedures using criteria that do not rely on external standards
Bruland, Torunn; Anderssen, Endre; Doseth, Berit; Bergum, Hallgeir; Beisvag, Vidar; Lægreid, Astrid
2007-01-01
Background The measurement of gene expression using microarray technology is a complicated process in which a large number of factors can be varied. Due to the lack of standard calibration samples such as are used in traditional chemical analysis it may be a problem to evaluate whether changes made to the microarray procedure actually improve the identification of truly differentially expressed genes. The purpose of the present work is to report the optimization of several steps in the microarray process both in laboratory practices and in data processing using criteria that do not rely on external standards. Results We performed a cDNA microarray experiment including RNA from samples with high expected differential gene expression termed "high contrasts" (rat cell lines AR42J and NRK52E) compared to self-self hybridization, and optimized a pipeline to maximize the number of genes found to be differentially expressed in the "high contrasts" RNA samples by estimating the false discovery rate (FDR) using a null distribution obtained from the self-self experiment. The proposed high-contrast versus self-self method (HCSSM) requires only four microarrays per evaluation. The effects of blocking reagent dose, filtering, and background correction methodologies were investigated. In our experiments a dose of 250 ng LNA (locked nucleic acid) dT blocker, no background correction and weight based filtering gave the largest number of differentially expressed genes. The choice of background correction method had a stronger impact on the estimated number of differentially expressed genes than the choice of filtering method. Cross platform microarray (Illumina) analysis was used to validate that the increase in the number of differentially expressed genes found by HCSSM was real. Conclusion The results show that HCSSM can be a useful and simple approach to optimize microarray procedures without including external standards. 
Our optimizing method is highly applicable to both long oligo-probe microarrays which have become commonly used for well characterized organisms such as man, mouse and rat, as well as to cDNA microarrays which are still of importance for organisms with incomplete genome sequence information such as many bacteria, plants and fish. PMID:17949480
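The FDR idea in this abstract can be sketched in a few lines. The example below is only an illustration under stated assumptions: it takes one test statistic per gene from the high-contrast comparison and one per gene from the self-self comparison, treats the self-self values as the null distribution, and estimates the FDR at a threshold as the ratio of null calls to high-contrast calls. The function names and the thresholding scheme are hypothetical and are not taken from the paper.

```python
def estimate_fdr(high_contrast_stats, self_self_stats, threshold):
    """Estimate the FDR at a threshold, using self-self statistics as the null."""
    # Genes called differentially expressed in the high-contrast comparison.
    called = sum(1 for s in high_contrast_stats if abs(s) >= threshold)
    # Genes passing the same threshold in the self-self comparison, where no
    # true differential expression is expected (assumed false positives).
    false_calls = sum(1 for s in self_self_stats if abs(s) >= threshold)
    if called == 0:
        return 0.0
    # Rescale the null count if the two lists differ in length.
    scale = len(high_contrast_stats) / len(self_self_stats)
    return min(1.0, false_calls * scale / called)


def count_de_genes(high_contrast_stats, self_self_stats, max_fdr=0.05):
    """Largest number of genes callable while the estimated FDR stays <= max_fdr."""
    best = 0
    for threshold in sorted({abs(s) for s in high_contrast_stats}):
        called = sum(1 for s in high_contrast_stats if abs(s) >= threshold)
        if estimate_fdr(high_contrast_stats, self_self_stats, threshold) <= max_fdr:
            best = max(best, called)
    return best
```

Under this reading, two pipeline variants (for example with and without background correction) would be compared by the value `count_de_genes` returns for each at the same `max_fdr`.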
2010-01-01
Background Analysis of gene expression and gene mutation may add information beyond that provided by ordinary pathological tissue diagnosis. Because samples obtained endoscopically are very small, more sensitive technology for gene analysis is needed. We investigated whether gene expression and gene mutation analysis with a newly developed ultra-sensitive three-dimensional (3D) microarray is possible using small samples from endoscopic ultrasound-guided fine-needle aspiration (EUS-FNA) specimens and pancreatic juice. Methods Small samples from 17 EUS-FNA specimens and 16 pancreatic juices were obtained. After nucleic acid extraction, the samples were amplified with labeling and analyzed with the 3D microarray. Results The analyzable rate with the microarray was 46% (6/13) for EUS-FNA specimens stored in RNAlater®, and RNA degradation was observed in all frozen samples. For pancreatic juice, the analyzable rate was 67% (4/6) for frozen samples and 20% (2/10) for samples stored in RNAlater®. EUS-FNA specimens were classified as cancer or non-cancer by gene expression analysis, and K-ras codon 12 mutations were also detected using the 3D microarray. Conclusions Gene analysis of small samples obtained endoscopically was possible with the newly developed 3D microarray technology. High-quality RNA from EUS-FNA samples was obtained and remained in good condition only when an RNA stabilizer was used. In contrast, high-quality RNA from pancreatic juice samples was obtained only with frozen storage without an RNA stabilizer. PMID:20416107
Chockalingam, Sriram; Aluru, Maneesha; Aluru, Srinivas
2016-09-19
Pre-processing of microarray data is a well-studied problem. Furthermore, all popular platforms come with their own recommended best practices for differential analysis of genes. However, for genome-scale network inference using microarray data collected from large public repositories, these methods filter out a considerable number of genes. This is primarily due to the effects of aggregating a diverse array of experiments with different technical and biological scenarios. Here we introduce a pre-processing pipeline suitable for inferring genome-scale gene networks from large microarray datasets. We show that partitioning of the available microarray datasets according to biological relevance into tissue- and process-specific categories significantly extends the limits of downstream network construction. We demonstrate the effectiveness of our pre-processing pipeline by inferring genome-scale networks for the model plant Arabidopsis thaliana using two different construction methods and a collection of 11,760 Affymetrix ATH1 microarray chips. Our pre-processing pipeline and the datasets used in this paper are made available at http://alurulab.cc.gatech.edu/microarray-pp.
Ferreira, Ana M; Tuominen, Iina; Sousa, Sónia; Gerbens, Frans; van Dijk-Bos, Krista; Osinga, Jan; Kooi, Krista A; Sanjabi, Bahram; Esendam, Chris; Oliveira, Carla; Terpstra, Peter; Hardonk, Menno; van der Sluis, Tineke; Zazula, Monika; Stachura, Jerzy; van der Zee, Ate G; Hollema, Harry; Sijmons, Rolf H; Aaltonen, Lauri A; Seruca, Raquel; Hofstra, Robert M W; Westers, Helga
2014-12-01
Microsatellite instability (MSI) in tumors results in an accumulation of mutations in (target) genes. Previous studies suggest that the profile of target genes differs according to tumor type. This paper describes the first genome-wide search for target genes for mismatch repair-deficient endometrial cancers. Genes expressed in normal endometrium containing coding repeats were analyzed for mutations in tumors. We identified 44 possible genes of which seven are highly mutated (>15%). Some candidates were also found mutated in colorectal and gastric tumors. The most frequently mutated gene, NRIP1 encoding nuclear receptor-interacting protein 1, was silenced in an endometrial tumor cell line and expression microarray experiments were performed. Silencing of NRIP1 was associated with differences in the expression of several genes in the estrogen-receptor network. Furthermore, an enrichment of genes related to cell cycle (regulation) and replication was observed. We present a new profile of target genes, some of them tissue specific, whereas others seem to play a more general role in MSI tumors. The high-mutation frequency combined with the expression data suggest, for the first time, an involvement of NRIP1 in endometrial cancer development. © 2014 WILEY PERIODICALS, INC.
Mining microarray data at NCBI's Gene Expression Omnibus (GEO).
Barrett, Tanya; Edgar, Ron
2006-01-01
The Gene Expression Omnibus (GEO) at the National Center for Biotechnology Information (NCBI) has emerged as the leading fully public repository for gene expression data. This chapter describes how to use Web-based interfaces, applications, and graphics to effectively explore, visualize, and interpret the hundreds of microarray studies and millions of gene expression patterns stored in GEO. Data can be examined from both experiment-centric and gene-centric perspectives using user-friendly tools that do not require specialized expertise in microarray analysis or time-consuming download of massive data sets. The GEO database is publicly accessible through the World Wide Web at http://www.ncbi.nlm.nih.gov/geo.
Parallel human genome analysis: microarray-based expression monitoring of 1000 genes.
Schena, M; Shalon, D; Heller, R; Chai, A; Brown, P O; Davis, R W
1996-01-01
Microarrays containing 1046 human cDNAs of unknown sequence were printed on glass with high-speed robotics. These 1.0-cm2 DNA "chips" were used to quantitatively monitor differential expression of the cognate human genes using a highly sensitive two-color hybridization assay. Array elements that displayed differential expression patterns under given experimental conditions were characterized by sequencing. The identification of known and novel heat shock and phorbol ester-regulated genes in human T cells demonstrates the sensitivity of the assay. Parallel gene analysis with microarrays provides a rapid and efficient method for large-scale human gene discovery. PMID:8855227
Mining microarrays for metabolic meaning: nutritional regulation of hypothalamic gene expression.
Mobbs, Charles V; Yen, Kelvin; Mastaitis, Jason; Nguyen, Ha; Watson, Elizabeth; Wurmbach, Elisa; Sealfon, Stuart C; Brooks, Andrew; Salton, Stephen R J
2004-06-01
DNA microarray analysis has been used to investigate relative changes in the level of gene expression in the CNS, including changes that are associated with disease, injury, psychiatric disorders, drug exposure or withdrawal, and memory formation. We have used oligonucleotide microarrays to identify hypothalamic genes that respond to nutritional manipulation. In addition to commonly used microarray analysis based on criteria such as fold-regulation, we have also found that simply carrying out multiple t tests then sorting by P value constitutes a highly reliable method to detect true regulation, as assessed by real-time polymerase chain reaction (PCR), even for relatively low abundance genes or relatively low magnitude of regulation. Such analyses directly suggested novel mechanisms that mediate effects of nutritional state on neuroendocrine function and are being used to identify regulated gene products that may elucidate the metabolic pathology of obese ob/ob, lean Vgf-/Vgf-, and other models with profound metabolic impairments.
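The "multiple t tests, then sort by P value" approach described above can be sketched as follows. This is a minimal illustration, not the authors' code: it assumes every gene has the same number of control and treated replicates, in which case all tests share the same degrees of freedom and sorting by decreasing |t| gives the same order as sorting by increasing two-sided P value. The gene names and values in the usage note are hypothetical, and genes with zero variance in both groups would need separate handling.

```python
import math
from statistics import mean, variance


def t_statistic(group_a, group_b):
    """Student's two-sample t statistic (group_b minus group_a) with pooled variance."""
    n_a, n_b = len(group_a), len(group_b)
    pooled = ((n_a - 1) * variance(group_a) + (n_b - 1) * variance(group_b)) / (n_a + n_b - 2)
    return (mean(group_b) - mean(group_a)) / math.sqrt(pooled * (1 / n_a + 1 / n_b))


def rank_genes(expression):
    """Rank genes from strongest to weakest evidence of regulation.

    `expression` maps a gene name to a (control values, treated values) pair.
    With equal replicate numbers for all genes, this order matches ascending P value.
    """
    scores = {gene: abs(t_statistic(a, b)) for gene, (a, b) in expression.items()}
    return sorted(scores, key=scores.get, reverse=True)
```

For example, `rank_genes({"geneA": ([1.0, 1.1, 0.9], [2.0, 2.1, 1.9]), "geneB": ([1.0, 1.5, 0.5], [1.2, 1.6, 0.7])})` places `geneA` first, because its group difference is large relative to its replicate variance.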
Miao, Feng; Smith, David D.; Zhang, Lingxiao; Min, Andrew; Feng, Wei; Natarajan, Rama
2008-01-01
OBJECTIVE—The complexity of interactions between genes and the environment is a major challenge for type 1 diabetes studies. Nuclear chromatin is the interface between genetics and environment and the principal carrier of epigenetic information. Because histone tail modifications in chromatin are linked to gene transcription, we hypothesized that histone methylation patterns in cells from type 1 diabetic patients can provide novel epigenetic insights into type 1 diabetes and its complications. RESEARCH DESIGN AND METHODS—We used chromatin immunoprecipitation (ChIP) linked to microarray (ChIP-chip) approach to compare genome-wide histone H3 lysine 9 dimethylation (H3K9me2) patterns in blood lymphocytes and monocytes from type 1 diabetic patients versus healthy control subjects. Bioinformatics evaluation of methylated candidates was performed by Ingenuity Pathway Analysis (IPA) tools. RESULTS—A subset of genes in the type 1 diabetic cohort showed significant increase in H3K9me2 in lymphocytes but not in monocytes. CTLA4, a type 1 diabetes susceptibility gene, was one of the candidates displaying increased promoter H3K9me2 in type 1 diabetes. IPA identified two high-scoring networks that encompassed genes showing altered H3K9me2. Many of them were associated with autoimmune and inflammation-related pathways, such as transforming growth factor-β, nuclear factor-κB, p38 mitogen-activated protein kinase, toll-like receptor, and interleukin-6. IPA also revealed biological relationships between these networks and known type 1 diabetes candidate genes. CONCLUSIONS—The concerted and synergistic alteration of histone methylation within the identified network in lymphocytes might have an effect on the etiology of type 1 diabetes and its complications. 
These studies provide evidence of a novel association between type 1 diabetes and altered histone methylation of key genes that are components of type 1 diabetes–related biological pathways and also a new understanding of the pathology of type 1 diabetes. PMID:18776137
Multiclass classification of microarray data samples with a reduced number of genes
2011-01-01
Background Multiclass classification of microarray data samples with a reduced number of genes is a rich and challenging problem in Bioinformatics research. The problem gets harder as the number of classes is increased. In addition, the performance of most classifiers is tightly linked to the effectiveness of mandatory gene selection methods. Critical to gene selection is the availability of estimates about the maximum number of genes that can be handled by any classification algorithm. Lack of such estimates may lead to either computationally demanding explorations of a search space with thousands of dimensions or classification models based on gene sets of unrestricted size. In the former case, unbiased but possibly overfitted classification models may arise. In the latter case, biased classification models unable to support statistically significant findings may be obtained. Results A novel bound on the maximum number of genes that can be handled by binary classifiers in binary mediated multiclass classification algorithms of microarray data samples is presented. The bound suggests that high-dimensional binary output domains might favor the existence of accurate and sparse binary mediated multiclass classifiers for microarray data samples. Conclusions A comprehensive experimental work shows that the bound is indeed useful to induce accurate and sparse multiclass classifiers for microarray data samples. PMID:21342522
Microarray labeling extension values: laboratory signatures for Affymetrix GeneChips
Lee, Yun-Shien; Chen, Chun-Houh; Tsai, Chi-Neu; Tsai, Chia-Lung; Chao, Angel; Wang, Tzu-Hao
2009-01-01
Interlaboratory comparison of microarray data, even when using the same platform, poses several challenges to scientists. RNA quality, RNA labeling efficiency, hybridization procedures and data-mining tools can all contribute variations in each laboratory. In Affymetrix GeneChips, about 11–20 different 25-mer oligonucleotides are used to measure the level of each transcript. Here, we report that ‘labeling extension values (LEVs)’, which are correlation coefficients between probe intensities and probe positions, are highly correlated with the gene expression levels (GEVs) on eukaryotic Affymetrix microarray data. By analyzing LEVs and GEVs in the publicly available 2414 cel files of 20 Affymetrix microarray types covering 13 species, we found that correlations between LEVs and GEVs only exist in eukaryotic RNAs, but not in prokaryotic ones. Surprisingly, Affymetrix results of the same specimens that were analyzed in different laboratories could be clearly differentiated only by LEVs, leading to the identification of ‘laboratory signatures’. In the examined dataset, GSE10797, filtering out high-LEV genes did not compromise the discovery of biological processes that are constructed by differentially expressed genes. In conclusion, LEVs provide a new filtering parameter for microarray analysis of gene expression and may improve the inter- and intralaboratory comparability of Affymetrix GeneChips data. PMID:19295132
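Because the abstract defines an LEV as a correlation coefficient between probe intensities and probe positions, the calculation for a single probe set can be sketched as below. This is a hedged illustration: the abstract does not state which correlation coefficient is used or whether intensities are transformed first, so the Pearson coefficient on untransformed values is an assumption, and the function names are hypothetical.

```python
import math


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)


def labeling_extension_value(probe_positions, probe_intensities):
    """LEV for one probe set: correlation of probe position with probe intensity.

    Assumes one position per probe (for example, its offset along the
    transcript) and that neither input is constant.
    """
    return pearson(probe_positions, probe_intensities)
```

A probe set whose intensity changes steadily with probe position along the transcript yields an LEV near +1 or -1, while one with no positional trend yields a value near 0.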
Microarray-based identification of differentially expressed genes in extramammary Paget’s disease
Lin, Jin-Ran; Liang, Jun; Zhang, Qiao-An; Huang, Qiong; Wang, Shang-Shang; Qin, Hai-Hong; Chen, Lian-Jun; Xu, Jin-Hua
2015-01-01
Extramammary Paget’s disease (EMPD) is a rare cutaneous malignancy accounting for approximately 1-2% of vulvar cancers. The rarity of this disease has caused difficulties in characterization and the molecular mechanism underlying EMPD development remains largely unclear. Here we used microarray analysis to identify differentially expressed genes in EMPD of the scrotum compared with normal epithelium from healthy donors. An Agilent single-channel microarray was used to compare gene expression between 6 EMPD specimens and 6 normal scrotum epithelium samples. A total of 799 up-regulated genes and 723 down-regulated genes were identified in EMPD tissues. Real-time PCR was conducted to verify the differential expression of some representative genes, including ERBB4, TCF3, PAPSS2, PIK3R3, PRLR, SULT1A1, TCF7L1, and CREB3L4. Generally, the real-time PCR results were consistent with microarray data, and ERBB4, PRLR, TCF3, PIK3R3, SULT1A1, and TCF7L1 were significantly overexpressed in EMPD (P<0.05). Moreover, the overexpression in EMPD of PRLR, a receptor for the anterior pituitary hormone prolactin (PRL), was confirmed by immunohistochemistry. These data demonstrate that the differentially expressed genes from the microarray-based identification are tightly associated with EMPD occurrence. PMID:26221264
Gene expression signature of benign prostatic hyperplasia revealed by cDNA microarray analysis.
Luo, Jun; Dunn, Thomas; Ewing, Charles; Sauvageot, Jurga; Chen, Yidong; Trent, Jeffrey; Isaacs, William
2002-05-15
Despite the high prevalence of benign prostatic hyperplasia (BPH) in the aging male, little is known regarding the etiology of this disease. A better understanding of the molecular etiology of BPH would be facilitated by a comprehensive analysis of gene expression patterns that are characteristic of benign growth in the prostate gland. Since genes differentially expressed between BPH and normal prostate tissues are likely to reflect underlying pathogenic mechanisms involved in the development of BPH, we performed comparative gene expression analysis using cDNA microarray technology to identify candidate genes associated with BPH. Total RNA was extracted from a set of 9 BPH specimens from men with extensive hyperplasia and a set of 12 histologically normal prostate tissues excised from radical prostatectomy specimens. Each of these 21 RNA samples was labeled with Cy3 in a reverse transcription reaction and cohybridized with a Cy5 labeled common reference sample to a cDNA microarray containing 6,500 human genes. Normalized fluorescent intensity ratios from each hybridization experiment were extracted to represent the relative mRNA abundance for each gene in each sample. Weighted gene and random permutation analyses were performed to generate a subset of genes with statistically significant differences in expression between BPH and normal prostate tissues. Semi-quantitative PCR analysis was performed to validate differential expression. A subset of 76 genes involved in a wide range of cellular functions was identified to be differentially expressed between BPH and normal prostate tissues. Semi-quantitative PCR was performed on 10 genes and 8 were validated. Genes consistently upregulated in BPH when compared to normal prostate tissues included: a restricted set of growth factors and their binding proteins (e.g. IGF-1 and -2, TGF-beta3, BMP5, latent TGF-beta binding protein 1 and -2); hydrolases, proteases, and protease inhibitors (e.g. 
neuropathy target esterase, MMP2, alpha-2-macroglobulin); stress response enzymes (e.g. COX2, GSTM5); and extracellular matrix molecules (e.g. laminin alpha 4 and beta 1, chondroitin sulfate proteoglycan 2, lumican). Genes consistently expressing less mRNA in BPH than in normal prostate tissues were less commonly observed and included the transcription factor KLF4, thrombospondin 4, nitric oxide synthase 2A, transglutaminase 3, and gastrin releasing peptide. We identified a diverse set of genes that are potentially related to benign prostatic hyperplasia, including genes both previously implicated in BPH pathogenesis as well as others not previously linked to this disease. Further targeted validation and investigations of these genes at the DNA, mRNA, and protein levels are warranted to determine the clinical relevance and possible therapeutic utility of these genes. Copyright 2002 Wiley-Liss, Inc.
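The common-reference design in this abstract (each Cy3-labeled sample cohybridized with a Cy5-labeled reference) reduces each array to one normalized ratio per gene. A minimal sketch of that step is shown below. The abstract does not state which normalization was used, so centring the log2 ratios on the array median is an assumption made here for illustration, and the function name is hypothetical.

```python
import math
from statistics import median


def normalized_log_ratios(sample_intensity, reference_intensity):
    """Return median-centred log2(sample / reference) ratios for one array.

    Inputs are per-gene intensities from the sample channel (Cy3) and the
    common-reference channel (Cy5); all values are assumed to be positive.
    """
    raw = [math.log2(s / r) for s, r in zip(sample_intensity, reference_intensity)]
    centre = median(raw)  # assumes most genes are not differentially expressed
    return [value - centre for value in raw]
```

Because every sample is measured against the same reference, the resulting per-gene values from BPH and normal arrays can be compared directly with one another.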
Integrative Genome Comparison of Primary and Metastatic Melanomas
Feng, Bin; Nazarian, Rosalynn M.; Bosenberg, Marcus; Wu, Min; Scott, Kenneth L.; Kwong, Lawrence N.; Xiao, Yonghong; Cordon-Cardo, Carlos; Granter, Scott R.; Ramaswamy, Sridhar; Golub, Todd; Duncan, Lyn M.; Wagner, Stephan N.; Brennan, Cameron; Chin, Lynda
2010-01-01
A cardinal feature of malignant melanoma is its metastatic propensity. An incomplete view of the genetic events driving metastatic progression has been a major barrier to rational development of effective therapeutics and prognostic diagnostics for melanoma patients. In this study, we conducted global genomic characterization of primary and metastatic melanomas to examine the genomic landscape associated with metastatic progression. In addition to uncovering three genomic subclasses of metastatic melanomas, we delineated 39 focal and recurrent regions of amplification and deletions, many of which encompassed resident genes that have not been implicated in cancer or metastasis. To identify progression-associated metastasis gene candidates, we applied a statistical approach, Integrative Genome Comparison (IGC), to define 32 genomic regions of interest that were significantly altered in metastatic relative to primary melanomas, encompassing 30 resident genes with statistically significant expression deregulation. Functional assays on a subset of these candidates, including MET, ASPM, AKAP9, IMP3, PRKCA, RPA3, and SCAP2, validated their pro-invasion activities in human melanoma cells. Validity of the IGC approach was further reinforced by tissue microarray analysis of Survivin showing significantly increased protein expression in thick versus thin primary cutaneous melanomas, and a progression correlation with lymph node metastases. Together, these functional validation results and correlative analysis of human tissues support the thesis that integrated genomic and pathological analyses of staged melanomas provide a productive entry point for discovery of melanoma metastasis genes. PMID:20520718
Alvarez, Mariano; Ferreira de Carvalho, Julie; Salmon, Armel; Ainouche, Malika L; Cavé-Radet, Armand; El Amrani, Abdelhak; Foster, Tammy E; Moyer, Sydney; Richards, Christina L
2018-06-04
Despite the severe impacts of the Deepwater Horizon oil spill, the foundation plant species Spartina alterniflora proved resilient to heavy oiling, providing an opportunity to identify mechanisms of response to the anthropogenic stress of crude oil exposure. We assessed plants from oil-affected and unaffected populations using a custom DNA microarray to identify genomewide transcription patterns and gene expression networks that respond to crude oil exposure. In addition, we used T-DNA insertion lines of the model grass Brachypodium distachyon to assess the contribution of four novel candidate genes to crude oil response. Responses in S. alterniflora to hydrocarbon exposure across the transcriptome as well as xenobiotic specific response pathways had little overlap with those previously identified in the model plant Arabidopsis thaliana. Among T-DNA insertion lines of B. distachyon, we found additional support for two candidate genes, one (ATTPS21) involved in volatile production, and the other (SUVH5) involved in epigenetic regulation of gene expression, that may be important in the response to crude oil. The architecture of crude oil response in S. alterniflora is unique from that of the model species A. thaliana, suggesting that xenobiotic response may be highly variable across plant species. In addition, further investigations of regulatory networks may benefit from more information about epigenetic response pathways. © 2018 John Wiley & Sons Ltd.
Speake, Cate; Pichugin, Alexander; Sahu, Tejram; Malkov, Vlad; Morrison, Robert; Pei, Ying; Juompan, Laure; Milman, Neta; Zarling, Stasya; Anderson, Charles; Wong-Madden, Sharon; Wendler, Jason; Ishizuka, Andrew; MacMillen, Zachary W; Garcia, Valentino; Kappe, Stefan H I; Krzych, Urszula; Duffy, Patrick E
2016-01-01
Malaria vaccine development has been hampered by the limited availability of antigens identified through conventional discovery approaches, and improvements are needed to enhance the efficacy of the leading vaccine candidate RTS,S that targets the circumsporozoite protein (CSP) of the infective sporozoite. Here we report a transcriptome-based approach to identify novel pre-erythrocytic vaccine antigens that could potentially be used in combination with CSP. We hypothesized that stage-specific upregulated genes would enrich for protective vaccine targets, and used tiling microarray to identify P. falciparum genes transcribed at higher levels during liver stage versus sporozoite or blood stages of development. We prepared DNA vaccines for 21 genes using the predicted orthologues in P. yoelii and P. berghei and tested their efficacy using different delivery methods against pre-erythrocytic malaria in rodent models. In our primary screen using P. yoelii in BALB/c mice, we found that 16 antigens significantly reduced liver stage parasite burden. In our confirmatory screen using P. berghei in C57Bl/6 mice, we confirmed 6 antigens that were protective in both models. Two antigens, when combined with CSP, provided significantly greater protection than CSP alone in both models. Based on the observations reported here, transcriptional patterns of Plasmodium genes can be useful in identifying novel pre-erythrocytic antigens that induce protective immunity alone or in combination with CSP. PMID:27434123
Transcription Factor Binding Site Enrichment Analysis in Co-Expression Modules in Celiac Disease
Romero-Garmendia, Irati; Garcia-Etxebarria, Koldo; Hernandez-Vargas, Hector; Santin, Izortze; Jauregi-Miguel, Amaia; Plaza-Izurieta, Leticia; Cros, Marie-Pierre; Legarda, Maria; Irastorza, Iñaki; Herceg, Zdenko; Fernandez-Jimenez, Nora; Bilbao, Jose Ramon
2018-05-10
The aim of this study was to construct celiac co-expression patterns at a whole genome level and to identify transcription factors (TFs) that could drive the gliadin-related changes in coordination of gene expression observed in celiac disease (CD). Differential co-expression modules were identified in the acute and chronic responses to gliadin using expression data from a previous microarray study in duodenal biopsies. Transcription factor binding site (TFBS) and Gene Ontology (GO) annotation enrichment analyses were performed in differentially co-expressed genes (DCGs) and selection of candidate regulators was performed. Expression of candidates was measured in clinical samples and the activation of the TFs was further characterized in C2BBe1 cells upon gliadin challenge. Enrichment analyses of the DCGs identified 10 TFs and five were selected for further investigation. Expression changes related to active CD were detected in four TFs, as well as in several of their in silico predicted targets. The activation of TFs was further characterized in C2BBe1 cells upon gliadin challenge, and an increase in nuclear translocation of CAMP Responsive Element Binding Protein 1 (CREB1) and IFN regulatory factor-1 (IRF1) in response to gliadin was observed. Using transcriptome-wide co-expression analyses we are able to propose novel genes involved in CD pathogenesis that respond upon gliadin stimulation, also in non-celiac models. PMID:29748492
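Enrichment analyses such as the TFBS and GO analyses mentioned above are commonly scored with a one-sided hypergeometric test. The abstract does not say which test was used, so the sketch below is an assumption for illustration only, with hypothetical parameter names: it gives the probability of seeing at least `overlap` annotated genes when `sample` genes are drawn without replacement from a background of `population` genes, of which `annotated` carry the annotation.

```python
from math import comb


def hypergeom_enrichment_p(population, annotated, sample, overlap):
    """Upper-tail hypergeometric probability P(X >= overlap).

    population: number of genes in the background set
    annotated:  background genes carrying the annotation (e.g. a given TFBS)
    sample:     genes in the list being tested (e.g. one co-expression module)
    overlap:    annotated genes observed in that list
    """
    upper = min(annotated, sample)
    total = comb(population, sample)
    # math.comb returns 0 when k > n, so impossible terms drop out on their own.
    tail = sum(
        comb(annotated, k) * comb(population - annotated, sample - k)
        for k in range(overlap, upper + 1)
    )
    return tail / total
```

A small probability suggests the annotation is over-represented in the gene list. When many TFs or GO terms are tested, the resulting values would normally be corrected for multiple testing.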
Matowo, Johnson; Jones, Christopher M; Kabula, Bilali; Ranson, Hilary; Steen, Keith; Mosha, Franklin; Rowland, Mark; Weetman, David
2014-06-19
Pyrethroid resistance has been slower to emerge in Anopheles arabiensis than in An. gambiae s.s. and An. funestus and, consequently, studies are only just beginning to unravel the genes involved. Permethrin resistance in An. arabiensis in Lower Moshi, Tanzania has been linked to elevated levels of both P450 monooxygenases and β-esterases. We have conducted a gene expression study to identify specific genes linked with metabolic resistance in the Lower Moshi An. arabiensis population. Microarray experiments employing an An. gambiae whole genome expression chip were performed on An. arabiensis, using interwoven loop designs. Permethrin-exposed survivors were compared to three separate unexposed mosquitoes from the same or a nearby population. A subset of detoxification genes was chosen for subsequent quantitative real-time PCR (qRT-PCR). Microarray analysis revealed significant overexpression of 87 probes and underexpression of 85 probes in pairwise comparisons between permethrin survivors and unexposed sympatric and allopatric samples from Dar es Salaam (controls). For qRT-PCR we targeted overexpressed ABC transporter genes (ABC '2060'), a glutathione-S-transferase, P450s and esterases. Design of efficient, specific primers was successful for ABC '2060' and two P450s (CYP6P3, CYP6M2). For the CYP4G16 gene, we used the primers that were previously used in a microarray study of An. arabiensis from Zanzibar islands. Overexpression of CYP4G16 and ABC '2060' was detected, though with contrasting patterns in pairwise comparisons between survivors and controls. CYP4G16 was only upregulated in survivors, whereas ABC '2060' was similar in survivors and controls but overexpressed in Lower Moshi samples compared to the Dar es Salaam samples. Increased transcription of CYP4G16 and ABC '2060' is linked directly and indirectly, respectively, with permethrin resistance in Lower Moshi An. arabiensis. 
Increased transcription of a P450 (CYP4G16) and an ABC transporter (ABC 2060) are linked directly and indirectly respectively, with permethrin resistance in Lower Moshi An. arabiensis. Our study provides replication of CYP4G16 as a candidate gene for pyrethroid resistance in An. arabiensis, although its role may not be in detoxification, and requires further investigation.
Welker, Noah C; Habig, Jeffrey W; Bass, Brenda L
2007-07-01
We describe the first microarray analysis of a whole animal containing a mutation in the Dicer gene. We used adult Caenorhabditis elegans and, to distinguish among different roles of Dicer, we also performed microarray analyses of animals with mutations in rde-4 and rde-1, which are involved in silencing by siRNA, but not miRNA. Surprisingly, we find that the X chromosome is greatly enriched for genes regulated by Dicer. Comparison of all three microarray data sets indicates the majority of Dicer-regulated genes are not dependent on RDE-4 or RDE-1, including the X-linked genes. However, all three data sets are enriched in genes important for innate immunity and, specifically, show increased expression of innate immunity genes. PMID:17526642
2012-01-01
Over the last decade, the introduction of microarray technology has had a profound impact on gene expression research. The publication of studies with dissimilar or altogether contradictory results, obtained using different microarray platforms to analyze identical RNA samples, has raised concerns about the reliability of this technology. The MicroArray Quality Control (MAQC) project was initiated to address these concerns, as well as other performance and data analysis issues. Expression data on four titration pools from two distinct reference RNA samples were generated at multiple test sites using a variety of microarray-based and alternative technology platforms. Here we describe the experimental design and probe mapping efforts behind the MAQC project. We show intraplatform consistency across test sites as well as a high level of interplatform concordance in terms of genes identified as differentially expressed. This study provides a resource that represents an important first step toward establishing a framework for the use of microarrays in clinical and regulatory settings. PMID:16964229
Kulterer, Birgit; Friedl, Gerald; Jandrositz, Anita; Sanchez-Cabo, Fatima; Prokesch, Andreas; Paar, Christine; Scheideler, Marcel; Windhager, Reinhard; Preisegger, Karl-Heinz; Trajanoski, Zlatko
2007-03-12
Human mesenchymal stem cells (MSC) with the capacity to differentiate into osteoblasts provide potential for the development of novel treatment strategies, such as improved healing of large bone defects. However, their low frequency in bone marrow necessitates ex vivo expansion for further clinical application. In this study we asked whether MSC develop in an aberrant or unwanted way during long-term ex vivo cultivation and whether artificial cultivation conditions exert any influence on their stem cell maintenance. To address this question we first developed human oligonucleotide microarrays with 30,000 elements and then performed large-scale expression profiling of long-term expanded MSC and of MSC during differentiation into osteoblasts. The results showed that MSC did not alter their osteogenic differentiation capacity, surface marker profile, or expression profiles during expansion. Microarray analysis of MSC during osteogenic differentiation identified three candidate genes for further examination and functional analysis: ID4, CRYAB, and SORT1. Additionally, we were able to reconstruct the three developmental phases during osteoblast differentiation (proliferation, matrix maturation, and mineralization) and illustrate the activation of the SMAD signaling pathways by TGF-beta2 and BMPs. With a variety of assays we could show that MSC represent a cell population which can be expanded for therapeutic applications.
Galindo, Cristi L; Soslow, Jonathan H; Brinkmeyer-Langford, Candice L; Gupte, Manisha; Smith, Holly M; Sengsayadeth, Seng; Sawyer, Douglas B; Benson, D Woodrow; Kornegay, Joe N; Markham, Larry W
2016-04-01
In Duchenne muscular dystrophy (DMD), abnormal cardiac function is typically preceded by a decade of skeletal muscle disease. Molecular reasons for differences in disease onset and progression between these muscle groups are unknown. Human biomarkers are lacking. We analyzed cardiac and skeletal muscle microarrays from normal and golden retriever muscular dystrophy (GRMD) dogs (ages 6, 12, or 47+ mo) to gain insight into muscle dysfunction and to identify putative DMD biomarkers. These biomarkers were then measured using human DMD blood samples. We identified GRMD candidate genes that might contribute to the disparity between cardiac and skeletal muscle disease, focusing on brain-derived neurotrophic factor (BDNF) and osteopontin (OPN/SPP1, hereafter indicated as SPP1). BDNF was elevated in cardiac muscle of younger GRMD but was unaltered in skeletal muscle, while SPP1 was increased only in GRMD skeletal muscle. In human DMD, circulating levels of BDNF were inversely correlated with ventricular function and fibrosis, while SPP1 levels correlated with skeletal muscle function. These results highlight gene expression patterns that could account for differences in cardiac and skeletal disease in GRMD. Most notably, animal model-derived data were translated to DMD and support use of BDNF and SPP1 as biomarkers for cardiac and skeletal muscle involvement, respectively.
Franke, Lude; Bakel, Harm van; Fokkens, Like; de Jong, Edwin D.; Egmont-Petersen, Michael; Wijmenga, Cisca
2006-01-01
Most common genetic disorders have a complex inheritance and may result from variants in many genes, each contributing only weak effects to the disease. Pinpointing these disease genes within the myriad of susceptibility loci identified in linkage studies is difficult because these loci may contain hundreds of genes. However, in any disorder, most of the disease genes will be involved in only a few different molecular pathways. If we know something about the relationships between the genes, we can assess whether some genes (which may reside in different loci) functionally interact with each other, indicating a joint basis for the disease etiology. There are various repositories of information on pathway relationships. To consolidate this information, we developed a functional human gene network that integrates information on genes and the functional relationships between genes, based on data from the Kyoto Encyclopedia of Genes and Genomes, the Biomolecular Interaction Network Database, Reactome, the Human Protein Reference Database, the Gene Ontology database, predicted protein-protein interactions, human yeast two-hybrid interactions, and microarray coexpressions. We applied this network to interrelate positional candidate genes from different disease loci and then tested 96 heritable disorders for which the Online Mendelian Inheritance in Man database reported at least three disease genes. Artificial susceptibility loci, each containing 100 genes, were constructed around each disease gene, and we used the network to rank these genes on the basis of their functional interactions. By following up the top five genes per artificial locus, we were able to detect at least one known disease gene in 54% of the loci studied, representing a 2.8-fold increase over random selection. 
This suggests that our method can significantly reduce the cost and effort of pinpointing true disease genes in analyses of disorders for which numerous loci have been reported but for which most of the genes are unknown. PMID:16685651
Schönmann, Susan; Loy, Alexander; Wimmersberger, Céline; Sobek, Jens; Aquino, Catharine; Vandamme, Peter; Frey, Beat; Rehrauer, Hubert; Eberl, Leo
2009-04-01
For cultivation-independent and highly parallel analysis of members of the genus Burkholderia, an oligonucleotide microarray (phylochip) consisting of 131 hierarchically nested 16S rRNA gene-targeted oligonucleotide probes was developed. A novel primer pair was designed for selective amplification of a 1.3 kb 16S rRNA gene fragment of Burkholderia species prior to microarray analysis. The diagnostic performance of the microarray for identification and differentiation of Burkholderia species was tested with 44 reference strains of the genera Burkholderia, Pandoraea, Ralstonia and Limnobacter. Hybridization patterns based on presence/absence of probe signals were interpreted semi-automatically using the novel likelihood-based strategy of the web-tool Phylo-Detect. Eighty-eight per cent of the reference strains were correctly identified at the species level. The evaluated microarray was applied to investigate shifts in the Burkholderia community structure in acidic forest soil upon addition of cadmium, a condition that selected for Burkholderia species. The microarray results were in agreement with those obtained from phylogenetic analysis of Burkholderia 16S rRNA gene sequences recovered from the same cadmium-contaminated soil, demonstrating the value of the Burkholderia phylochip for determinative and environmental studies.
Transcriptional Regulatory Network Analysis of MYB Transcription Factor Family Genes in Rice.
Smita, Shuchi; Katiyar, Amit; Chinnusamy, Viswanathan; Pandey, Dev M; Bansal, Kailash C
2015-01-01
The MYB transcription factor (TF) family is one of the largest TF families and regulates defense responses to various stresses, hormone signaling as well as many metabolic and developmental processes in plants. Understanding the regulatory hierarchies of gene expression networks in response to developmental and environmental cues is a major challenge due to the complex interactions between the genetic elements. Correlation analyses are useful for unraveling co-regulated gene pairs governing biological processes as well as for identifying new candidate hub genes in response to these complex processes. High throughput expression profiling data are highly useful for construction of co-expression networks. In the present study, we utilized transcriptome data for comprehensive regulatory network studies of MYB TFs by "top-down" and "guide-gene" approaches. More than 50% of OsMYBs were strongly correlated under 50 experimental conditions with 51 hub genes via the "top-down" approach. Further, clusters were identified using Markov Clustering (MCL). To maximize the clustering performance, parameter evaluation of the MCL inflation score (I) was performed in terms of enriched GO categories by measuring F-score. Comparison of co-expressed clusters with clades from phylogenetic analysis indicates an evolutionarily conserved co-regulatory role. We used a compendium of known interactions and biological roles, together with Gene Ontology enrichment analysis, to hypothesize functions of coexpressed OsMYBs. In the second part, the transcriptional regulatory network analysis by the "guide-gene" approach revealed 40 putative targets of 26 OsMYB TF hubs with high correlation values utilizing 815 microarray data. Enrichment of MYB-binding cis-elements in the promoter regions of the putative targets, together with functional co-occurrence and nuclear localization, supports our findings.
Specifically, enrichment of MYB binding regions involved in drought inducibility implies a regulatory role in the drought response in rice. Thus, the co-regulatory network analysis facilitated the identification of complex OsMYB regulatory networks and of candidate target regulon genes of selected guide MYB genes. The results contribute to candidate gene screening and provide experimentally testable hypotheses for potential regulatory MYB TFs and their targets under stress conditions.
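The correlation-based "top-down" idea in the abstract above can be sketched as follows. This is a minimal illustration, not the authors' pipeline: the gene names, the expression values, the 0.9 correlation threshold and the degree cut-off for calling a hub are all hypothetical, and Pearson correlation is assumed because the abstract does not name the correlation measure.

```python
import math
from itertools import combinations

def pearson(x, y):
    """Pearson correlation coefficient of two equal-length expression profiles."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    dx = [a - mean_x for a in x]
    dy = [b - mean_y for b in y]
    cov = sum(a * b for a, b in zip(dx, dy))
    norm = math.sqrt(sum(a * a for a in dx) * sum(b * b for b in dy))
    return cov / norm if norm else 0.0

def coexpression_edges(profiles, threshold=0.9):
    """Return (gene1, gene2, r) for pairs whose |r| meets the threshold (assumed cut-off)."""
    edges = []
    for g1, g2 in combinations(sorted(profiles), 2):
        r = pearson(profiles[g1], profiles[g2])
        if abs(r) >= threshold:
            edges.append((g1, g2, r))
    return edges

def hub_genes(edges, min_degree=2):
    """Genes connected to at least min_degree partners (assumed hub definition)."""
    degree = {}
    for g1, g2, _ in edges:
        degree[g1] = degree.get(g1, 0) + 1
        degree[g2] = degree.get(g2, 0) + 1
    return sorted(g for g, d in degree.items() if d >= min_degree)

# Toy profiles across four hypothetical conditions (illustrative values only).
toy_profiles = {
    "geneA": [1.0, 2.0, 3.0, 4.0],
    "geneB": [2.0, 4.0, 6.0, 8.0],
    "geneC": [4.0, 3.0, 2.0, 1.0],
    "geneD": [1.0, 3.0, 2.0, 4.0],
}
toy_edges = coexpression_edges(toy_profiles)
toy_hubs = hub_genes(toy_edges)
```

Using the absolute value of the correlation keeps strongly anti-correlated pairs in the network; dropping `abs` would restrict it to positively co-expressed genes.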
Liu, Qing; Basu, Niladri; Goetz, Giles; Jiang, Nan; Hutz, Reinhold J.; Tonellato, Peter J.; Carvan, Michael J.
2013-01-01
The objective of this study was to identify and evaluate conserved biomarkers that could be used in most species of teleost fish at most life-stages. We investigated the effects of sublethal methylmercury (MeHg) exposure on developing rainbow trout and zebrafish. Juvenile rainbow trout and young adult zebrafish were fed food with MeHg added at 0, 0.5, 5 and 50 ppm. Atomic absorption spectrometry was applied to measure whole body total Hg levels, and pathologic analysis was performed to identify MeHg-induced toxicity. Fish at six weeks were sampled from each group for microarray analysis using RNA from whole fish. MeHg-exposed trout and zebrafish did not show overt signs of toxicity or pathology, nor were significant differences seen in mortality, length, mass, or condition factor. The accumulation of MeHg in trout and zebrafish exhibited dose- and time-dependent patterns during six weeks, and zebrafish exhibited greater assimilation of total Hg than rainbow trout. The dysregulated genes in MeHg-treated fish have multiple functional annotations, such as iron ion homeostasis, glutathione transferase activity, regulation of muscle contraction, troponin I binding and calcium-dependent protein binding. Genes were selected as biomarker candidates based on their microarray data and their expression was evaluated by QPCR. Unfortunately, QPCR evaluation using individual fish showed that these genes are not consistently good biomarkers for both rainbow trout and zebrafish. Our conclusion is that biomarker analysis for aquatic toxicant assessment using fish needs to be based on tissue-, sex- and species-specific considerations. PMID:23529582
Pathak, Bhakti R; Breed, Ananya A; Apte, Snehal; Acharya, Kshitish; Mahale, Smita D
2016-01-01
Cysteine-rich secretory protein 3 (CRISP-3) is upregulated in prostate cancer as compared to the normal prostate tissue. Higher expression of CRISP-3 has been linked to poor prognosis and hence it has been thought to act as a prognostic marker for prostate cancer. It is proposed to have a role in innate immunity but its role in prostate cancer is still unknown. In order to understand its function, its expression was stably knocked down in LNCaP cells. CRISP-3 knockdown did not affect cell viability but resulted in reduced invasiveness. Global gene expression changes upon CRISP-3 knockdown were identified by microarray analysis. Microarray data were quantitatively validated by evaluating the expression of seven candidate genes in three independent stable clones. Functional annotation of the differentially expressed genes identified cell adhesion, cell motility, and ion transport to be affected among other biological processes. Prostate-specific antigen (PSA, also known as Kallikrein 3) was the top most downregulated gene whose expression was also validated at protein level. Interestingly, expression of Annexin A1 (ANXA1), a known anti-inflammatory protein, was upregulated upon CRISP-3 knockdown. Re-introduction of CRISP-3 into the knockdown clone reversed the effect on invasiveness and also led to increased PSA expression. These results suggest that overexpression of CRISP-3 in prostate tumor may maintain higher PSA expression and lower ANXA1 expression. Our data also indicate that poor prognosis associated with higher CRISP-3 expression could be due to its role in cell invasion.
Genetical Genomics Identifies the Genetic Architecture for Growth and Weevil Resistance in Spruce
Porth, Ilga; White, Richard; Jaquish, Barry; Alfaro, René; Ritland, Carol; Ritland, Kermit
2012-01-01
In plants, relationships between resistance to herbivorous insect pests and growth are typically controlled by complex interactions between genetically correlated traits. These relationships often result in tradeoffs in phenotypic expression. In this study we used genetical genomics to elucidate genetic relationships between tree growth and resistance to white pine terminal weevil (Pissodes strobi Peck.) in a pedigree population of interior spruce (Picea glauca, P. engelmannii and their hybrids) that was growing at Vernon, B.C. and segregating for weevil resistance. Genetical genomics uses genetic perturbations caused by allelic segregation in pedigrees to co-locate quantitative trait loci (QTLs) for gene expression and quantitative traits. Bark tissue of apical leaders from 188 trees was assayed for gene expression using a 21.8K spruce EST-spotted microarray; the same individuals were genotyped for 384 SNP markers for the genetic map. Many of the expression QTLs (eQTL) co-localized with resistance trait QTLs. For a composite resistance phenotype of six attack and oviposition traits, 149 positional candidate genes were identified. Resistance and growth QTLs also overlapped with eQTL hotspots along the genome suggesting that: 1) genetic pleiotropy of resistance and growth traits in interior spruce was substantial, and 2) master regulatory genes were important for weevil resistance in spruce. These results will enable future work on functional genetic studies of insect resistance in spruce, and provide valuable information about candidate genes for genetic improvement of spruce. PMID:22973444
Zapata, Juan Carlos; Carrion, Ricardo; Patterson, Jean L.; Crasta, Oswald; Zhang, Yan; Mani, Sachin; Jett, Marti; Poonia, Bhawna; Djavani, Mahmoud; White, David M.; Lukashevich, Igor S.; Salvato, Maria S.
2013-01-01
Lassa virus (LASV) is the causative agent of Lassa Fever and is responsible for several hundred thousand infections and thousands of deaths annually in West Africa. LASV and the non-pathogenic Mopeia virus (MOPV) are both rodent-borne African arenaviruses. A live attenuated reassortant of MOPV and LASV, designated ML29, protects rodents and primates from LASV challenge and appears to be more attenuated than MOPV. To gain better insight into LASV-induced pathology and mechanism of attenuation we performed gene expression profiling in human peripheral blood mononuclear cells (PBMC) exposed to LASV and the vaccine candidate ML29. PBMC from healthy human subjects were exposed to either LASV or ML29. Although most PBMC are non-permissive for virus replication, they remain susceptible to signal transduction by virus particles. Total RNA was extracted and global gene expression was evaluated during the first 24 hours using high-density microarrays. Results were validated using RT-PCR, flow cytometry and ELISA. LASV and ML29 elicited differential expression of interferon-stimulated genes (ISG), as well as genes involved in apoptosis, NF-kB signaling and the coagulation pathways. These genes could eventually serve as biomarkers to predict disease outcomes. The remarkable differential expression of thrombomodulin, a key regulator of inflammation and coagulation, suggests its involvement with vascular abnormalities and mortality in Lassa fever disease. PMID:24069471
Jani, Saurin D; Argraves, Gary L; Barth, Jeremy L; Argraves, W Scott
2010-04-01
An important objective of DNA microarray-based gene expression experimentation is determining inter-relationships that exist between differentially expressed genes and biological processes, molecular functions, cellular components, signaling pathways, physiologic processes and diseases. Here we describe GeneMesh, a web-based program that facilitates analysis of DNA microarray gene expression data. GeneMesh relates genes in a query set to categories available in the Medical Subject Headings (MeSH) hierarchical index. The interface enables hypothesis driven relational analysis to a specific MeSH subcategory (e.g., Cardiovascular System, Genetic Processes, Immune System Diseases etc.) or unbiased relational analysis to broader MeSH categories (e.g., Anatomy, Biological Sciences, Disease etc.). Genes found associated with a given MeSH category are dynamically linked to facilitate tabular and graphical depiction of Entrez Gene information, Gene Ontology information, KEGG metabolic pathway diagrams and intermolecular interaction information. Expression intensity values of groups of genes that cluster in relation to a given MeSH category, gene ontology or pathway can be displayed as heat maps of Z score-normalized values. GeneMesh operates on gene expression data derived from a number of commercial microarray platforms including Affymetrix, Agilent and Illumina. GeneMesh is a versatile web-based tool for testing and developing new hypotheses through relating genes in a query set (e.g., differentially expressed genes from a DNA microarray experiment) to descriptors making up the hierarchical structure of the National Library of Medicine controlled vocabulary thesaurus, MeSH. 
The system further enhances the discovery process by providing links between sets of genes associated with a given MeSH category to a rich set of html linked tabular and graphic information including Entrez Gene summaries, gene ontologies, intermolecular interactions, overlays of genes onto KEGG pathway diagrams and heatmaps of expression intensity values. GeneMesh is freely available online at http://proteogenomics.musc.edu/genemesh/.
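The Z score-normalized heat maps mentioned in the abstract above rely on a simple per-gene transformation, sketched here. The function name and the choice of the sample standard deviation are assumptions; the abstract does not say how GeneMesh computes its Z scores.

```python
from statistics import mean, stdev

def z_score_rows(matrix):
    """Z-score each row (one gene across samples): (value - row mean) / row SD.

    Rows with zero variance are mapped to all zeros to avoid division by zero.
    """
    normalized = []
    for row in matrix:
        mu = mean(row)
        sd = stdev(row)  # sample standard deviation (assumed choice)
        normalized.append([(v - mu) / sd if sd else 0.0 for v in row])
    return normalized

# Two hypothetical genes measured in three samples.
toy_z = z_score_rows([[1.0, 2.0, 3.0], [5.0, 5.0, 5.0]])
```

After this transformation every gene has mean 0 and unit variance, so genes with very different absolute intensities can share one color scale.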
Microarray profiling of human white adipose tissue after exogenous leptin injection.
Taleb, S; Van Haaften, R; Henegar, C; Hukshorn, C; Cancello, R; Pelloux, V; Hanczar, B; Viguerie, N; Langin, D; Evelo, C; Zucker, J; Clément, K; Saris, W H M
2006-03-01
Leptin is a secreted adipocyte hormone that plays a key role in the regulation of body weight homeostasis. The leptin effect on human white adipose tissue (WAT) is still debated. The aim of this study was to assess whether the administration of polyethylene glycol-leptin (PEG-OB) in a single supraphysiological dose has transcriptional effects on genes of WAT and to identify its target genes and functional pathways in WAT. Blood samples and WAT biopsies were obtained from 10 healthy nonobese men before treatment and 72 h after the PEG-OB injection, leading to an approximate 809-fold increase in circulating leptin. The WAT gene expression profile before and after the PEG-OB injection was compared using pangenomic microarrays. Functional gene annotations based on the gene ontology of the PEG-OB regulated genes were performed using both an 'in house' automated procedure and GenMAPP (Gene Microarray Pathway Profiler), designed for viewing and analyzing gene expression data in the context of biological pathways. Statistical analysis of microarray data revealed that PEG-OB had a predominantly down-regulating effect on WAT gene expression, as we obtained 1,822 and 100 down- and up-regulated genes, respectively. Microarray data were validated using reverse transcription quantitative PCR. Functional gene annotations of PEG-OB regulated genes revealed that the functional class related to immunity and inflammation was among the pathways most mobilized by PEG-OB in WAT. These genes are mainly expressed in the cells of the stromal vascular fraction in comparison with adipocytes. Our observations support the hypothesis that leptin could act on WAT, particularly on genes related to inflammation and immunity, which may suggest a novel leptin target pathway in human WAT.
Chavan, Shweta S; Bauer, Michael A; Peterson, Erich A; Heuck, Christoph J; Johann, Donald J
2013-01-01
Transcriptome analysis by microarrays has produced important advances in biomedicine. For instance in multiple myeloma (MM), microarray approaches led to the development of an effective disease subtyping via cluster assignment, and a 70 gene risk score. Both enabled an improved molecular understanding of MM, and have provided prognostic information for the purposes of clinical management. Many researchers are now transitioning to Next Generation Sequencing (NGS) approaches and RNA-seq in particular, due to its discovery-based nature, improved sensitivity, and dynamic range. Additionally, RNA-seq allows for the analysis of gene isoforms, splice variants, and novel gene fusions. Given the voluminous amounts of historical microarray data, there is now a need to associate and integrate microarray and RNA-seq data via advanced bioinformatic approaches. Custom software was developed following a model-view-controller (MVC) approach to integrate Affymetrix probe set-IDs, and gene annotation information from a variety of sources. The tool/approach employs an assortment of strategies to integrate, cross reference, and associate microarray and RNA-seq datasets. Output from a variety of transcriptome reconstruction and quantitation tools (e.g., Cufflinks) can be directly integrated, and/or associated with Affymetrix probe set data, as well as necessary gene identifiers and/or symbols from a diversity of sources. Strategies are employed to maximize the annotation and cross referencing process. Custom gene sets (e.g., MM 70 risk score (GEP-70)) can be specified, and the tool can be directly assimilated into an RNA-seq pipeline. A novel bioinformatic approach to aid in the facilitation of both annotation and association of historic microarray data, in conjunction with richer RNA-seq data, is now assisting with the study of MM cancer biology.
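The association step described in the abstract above can be illustrated with a small join keyed on gene symbol. This is only a sketch under stated assumptions: the probe-set IDs, gene symbols and values are placeholders, the function name is hypothetical, and averaging multiple probe sets per gene is one possible choice that the abstract does not specify.

```python
def associate(probe_to_symbol, array_values, rnaseq_values):
    """Join microarray probe-set values and RNA-seq values on gene symbol.

    Returns {symbol: (mean microarray value, RNA-seq value)} for genes present
    in both data sets. Probe sets without an annotation are skipped.
    """
    by_symbol = {}
    for probe_id, value in array_values.items():
        symbol = probe_to_symbol.get(probe_id)
        if symbol is None:
            continue  # unannotated probe set
        by_symbol.setdefault(symbol, []).append(value)
    merged = {}
    for symbol, values in by_symbol.items():
        if symbol in rnaseq_values:
            merged[symbol] = (sum(values) / len(values), rnaseq_values[symbol])
    return merged

# Placeholder identifiers and values, for illustration only.
toy_annotation = {"p1_at": "GENE1", "p2_at": "GENE1", "p3_at": "GENE2"}
toy_array = {"p1_at": 2.0, "p2_at": 4.0, "p3_at": 1.0, "p4_at": 9.0}
toy_rnaseq = {"GENE1": 10.5, "GENE3": 0.2}
toy_merged = associate(toy_annotation, toy_array, toy_rnaseq)
```

Restricting `merged` to a custom gene set (for example, a risk-score signature) would then be a simple dictionary filter on the symbols of interest.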
Porto, Diogo Denardi; Bruneau, Maryline; Perini, Pâmela; Anzanello, Rafael; Renou, Jean-Pierre; dos Santos, Henrique Pessoa; Fialho, Flávio Bello; Revers, Luís Fernando
2015-05-01
Apple production depends on the fulfilment of a chilling requirement for bud dormancy release. Insufficient winter chilling results in irregular and suboptimal bud break in the spring, with negative impacts on apple yield. Trees from apple cultivars with contrasting chilling requirements for bud break were used to investigate the expression of the entire set of apple genes in response to chilling accumulation in the field and controlled conditions. Total RNA was analysed on the AryANE v.1.0 oligonucleotide microarray chip representing 57,000 apple genes. The data were tested for functional enrichment, and differential expression was confirmed by real-time PCR. The largest number of differentially expressed genes was found in samples treated with cold temperatures. Cold exposure mostly repressed expression of transcripts related to photosynthesis, and long-term cold exposure repressed flavonoid biosynthesis genes. Among the differentially expressed selected candidates, we identified genes whose annotations were related to the circadian clock, hormonal signalling, regulation of growth, and flower development. Two genes, annotated as FLOWERING LOCUS C-like and MADS AFFECTING FLOWERING, showed strong differential expression in several comparisons. One of these two genes was upregulated in most comparisons involving dormancy release, and this gene's chromosomal position co-localized with the confidence interval of a major quantitative trait locus for the timing of bud break. These results indicate that photosynthesis and auxin transport are major regulatory nodes of apple dormancy and unveil strong candidates for the control of bud dormancy. © The Author 2015. Published by Oxford University Press on behalf of the Society for Experimental Biology.
Liu, Haiying; Murthi, Padma; Qin, Sharon; Kusuma, Gina D.; Borg, Anthony J.; Knöfler, Martin; Haslinger, Peter; Manuelpillai, Ursula; Pertile, Mark D.; Abumaree, Mohamed
2014-01-01
Human chorionic mesenchymal stem/stromal cells (CMSCs) derived from the placenta are similar to adult tissue-derived MSCs. The aim of this study was to investigate the role of these cells in normal placental development. Transcription factors, particularly members of the homeobox gene family, play crucial roles in maintaining stem cell proliferation and lineage specification in embryonic tissues. In adult tissues and organs, stem cells proliferate at low levels in their niche until they receive cues from the microenvironment to differentiate. The homeobox genes that are expressed in the CMSC niche in placental tissues have not been identified. We used the novel strategy of laser capture microdissection to isolate the stromal component of first trimester villi and excluded the cytotrophoblast and syncytiotrophoblast layers that comprise the outer layer of the chorionic villi. Microarray analysis was then used to screen for homeobox genes in the microdissected tissue. Candidate homeobox genes were selected for further RNA analysis. Immunohistochemistry of candidate genes in first trimester placental villous stromal tissue revealed that the homeobox genes Meis1, myeloid ecotropic viral integration site 1 homolog 2 (MEIS2), H2.0-like Drosophila (HLX), transforming growth factor β-induced factor (TGIF), and distal-less homeobox 5 (DLX5) were expressed in the vascular niche where CMSCs have been shown to reside. Expression of MEIS2, HLX, TGIF, and DLX5 was also detected in scattered stromal cells. Real-time polymerase chain reaction and immunocytochemistry verified expression of MEIS2, HLX, TGIF, and DLX5 homeobox genes in first trimester and term CMSCs. These data suggest a combination of regulatory homeobox genes is expressed in CMSCs from early placental development to term, which may be required for stem cell proliferation and differentiation. PMID:24692208
Estimating gene function with least squares nonnegative matrix factorization.
Wang, Guoli; Ochs, Michael F
2007-01-01
Nonnegative matrix factorization is a machine learning algorithm that has extracted information from data in a number of fields, including imaging and spectral analysis, text mining, and microarray data analysis. One limitation with the method for linking genes through microarray data in order to estimate gene function is the high variance observed in transcription levels between different genes. Least squares nonnegative matrix factorization uses estimates of the uncertainties on the mRNA levels for each gene in each condition, to guide the algorithm to a local minimum in normalized chi2, rather than a Euclidean distance or divergence between the reconstructed data and the data itself. Herein, application of this method to microarray data is demonstrated in order to predict gene function.
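The idea of minimizing a normalized chi-squared instead of a plain Euclidean distance can be sketched with uncertainty-weighted multiplicative updates. This is a minimal illustration and an assumption about the form of the algorithm: the abstract does not give the update rules, so the code below uses Lee-Seung style updates with every term weighted by 1/sigma². The toy data, uncertainties, rank and iteration count are invented for the example.

```python
import random

def matmul(a, b):
    """Multiply two matrices stored as lists of rows."""
    cols = list(zip(*b))
    return [[sum(x * y for x, y in zip(row, col)) for col in cols] for row in a]

def transpose(m):
    return [list(col) for col in zip(*m)]

def weight(m, inv_var):
    """Element-wise product of a matrix with the inverse variances."""
    return [[x * v for x, v in zip(mr, vr)] for mr, vr in zip(m, inv_var)]

def chi_squared(data, sigma, w, h):
    """Sum over all entries of ((data - W H) / sigma)^2."""
    model = matmul(w, h)
    return sum(((d - m) / s) ** 2
               for dr, mr, sr in zip(data, model, sigma)
               for d, m, s in zip(dr, mr, sr))

def ls_nmf(data, sigma, rank, iterations=100, seed=0, eps=1e-12):
    """Factor data ~ W H with W, H >= 0, weighting residuals by 1/sigma^2."""
    rng = random.Random(seed)
    w = [[rng.random() + 0.1 for _ in range(rank)] for _ in data]
    h = [[rng.random() + 0.1 for _ in data[0]] for _ in range(rank)]
    inv_var = [[1.0 / (s * s) for s in row] for row in sigma]
    weighted_data = weight(data, inv_var)
    history = [chi_squared(data, sigma, w, h)]
    for _ in range(iterations):
        # Update H: H <- H * (W^T (D/s^2)) / (W^T ((W H)/s^2))
        wt = transpose(w)
        num = matmul(wt, weighted_data)
        den = matmul(wt, weight(matmul(w, h), inv_var))
        h = [[hv * n / (d + eps) for hv, n, d in zip(hr, nr, dr)]
             for hr, nr, dr in zip(h, num, den)]
        # Update W: W <- W * ((D/s^2) H^T) / (((W H)/s^2) H^T)
        ht = transpose(h)
        num = matmul(weighted_data, ht)
        den = matmul(weight(matmul(w, h), inv_var), ht)
        w = [[wv * n / (d + eps) for wv, n, d in zip(wr, nr, dr)]
             for wr, nr, dr in zip(w, num, den)]
        history.append(chi_squared(data, sigma, w, h))
    return w, h, history

# Toy expression matrix (4 genes x 3 conditions) with per-entry uncertainties.
toy_data = [[5.0, 3.0, 1.0], [4.0, 2.0, 1.0], [1.0, 1.0, 5.0], [1.0, 2.0, 4.0]]
toy_sigma = [[0.5, 0.5, 0.2], [0.5, 0.3, 0.2], [0.2, 0.2, 0.5], [0.2, 0.3, 0.5]]
toy_w, toy_h, toy_history = ls_nmf(toy_data, toy_sigma, rank=2)
```

Entries with small sigma pull the fit harder than noisy entries, which is the point of replacing the Euclidean objective; as with other NMF variants, the result is a local minimum that depends on the random start.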
Marco Antonio, David S; Hartfelder, Klaus
2017-01-01
Eye development in insects is best understood in Drosophila melanogaster, but little is known for other holometabolous insects. Combining a morphological with a gene expression analysis, we investigated eye development in the honeybee, putting emphasis on the sex-specific differences in eye size. Optic lobe development starts from an optic lobe anlage in the larval brain, which sequentially gives rise to the lobula, medulla, and lamina. The lamina differentiates in the last larval instar, when it receives optic nerve projections from the developing retina. The expression analysis focused on seven genes important for Drosophila eye development: eyes absent, sine oculis, embryonic lethal abnormal vision, minibrain, small optic lobes, epidermal growth factor receptor, and roughest. All except small optic lobes were more highly expressed in third-instar drone larvae, but then, in the fourth and fifth instar, their expression was sex-specifically modulated, showing shifts in temporal dynamics. The clearest differences were seen for small optic lobes, which is highly expressed in the developing eye of workers, and minibrain and roughest, which showed a strong expression peak coinciding with retina differentiation. A microarray analysis for optic lobe/retina complexes revealed the differential expression of several metabolism-related genes, as well as of two micro-RNAs. While we could not see major morphological differences in the developing eye structures before the pupal stage, the expression differences observed for the seven candidate genes and in the transcriptional microarray profiles indicate that molecular signatures underlying sex-specific optic lobe and retina development become established throughout the larval stages. © 2016 Wiley Periodicals, Inc.
Microarray data mining using Bioconductor packages.
Nie, Haisheng; Neerincx, Pieter B T; van der Poel, Jan; Ferrari, Francesco; Bicciato, Silvio; Leunissen, Jack A M; Groenen, Martien A M
2009-07-16
This paper describes the results of a Gene Ontology (GO) term enrichment analysis of chicken microarray data using Bioconductor packages. By checking the enriched GO terms in three contrasts (MM8-PM8, MM8-MA8, and MM8-MM24) of the microarray data provided for this workshop, the analysis aimed to investigate the host reactions in chickens occurring shortly after a secondary challenge with either a homologous or heterologous species of Eimeria. The results of GO enrichment analysis using GO terms annotated to chicken genes and GO terms annotated to chicken-human orthologous genes were also compared. Furthermore, a locally adaptive statistical procedure (LAP) was performed to test for differentially expressed chromosomal regions, rather than individual genes, in the chicken genome after Eimeria challenge. GO enrichment analysis identified significant (raw p-value < 0.05) GO terms for all three contrasts included in the analysis. Some of the GO terms were linked, in general, to primary or secondary immune responses, indicating that GO enrichment analysis is a useful approach for analyzing microarray data. Comparison of the GO enrichment results obtained with chicken gene information and with chicken-human orthologous gene information showed more refined GO terms related to immune responses when the orthologous gene information was used, which suggests that using chicken-human orthologous gene information gives higher power to detect significant GO terms with more refined functionality. Furthermore, three chromosome regions were identified as significantly up-regulated in contrast MM8-PM8 (q-value < 0.01). Overall, this paper describes a practical approach to analyzing microarray data in farm animals where the genome information is still incomplete.
For farm animals, such as chicken, with currently limited gene annotation, borrowing gene annotation information from orthologous genes in well-annotated species, such as human, will help improve the pathway analysis results substantially. Furthermore, LAP analysis approach is a relatively new and very useful way to be applied in microarray analysis.
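The abstract above does not say which statistic the Bioconductor packages used for GO term enrichment. As an illustration only, the sketch below uses the common one-sided hypergeometric test, which asks whether a GO term is over-represented among the selected genes relative to the gene universe. The function names, the `term_to_genes` mapping and the 0.05 cut-off on raw p-values are assumptions made for this example, not the authors' code.

```python
from math import comb


def hypergeom_enrichment_p(n_universe, n_annotated, n_selected, n_overlap):
    """Upper-tail hypergeometric p-value P(X >= n_overlap).

    n_universe  : genes on the array with any annotation
    n_annotated : genes in the universe annotated to the GO term
    n_selected  : differentially expressed genes in the universe
    n_overlap   : selected genes that carry the GO term
    """
    upper = min(n_annotated, n_selected)
    total = comb(n_universe, n_selected)
    tail = sum(
        comb(n_annotated, k) * comb(n_universe - n_annotated, n_selected - k)
        for k in range(n_overlap, upper + 1)
    )
    return tail / total


def enriched_terms(term_to_genes, selected, universe, alpha=0.05):
    """Return (term, overlap size, raw p-value) for terms with p < alpha."""
    universe = set(universe)
    selected = set(selected) & universe
    results = []
    for term, genes in term_to_genes.items():
        annotated = set(genes) & universe
        overlap = annotated & selected
        if not overlap:
            continue  # a term with no selected genes cannot be enriched
        p = hypergeom_enrichment_p(
            len(universe), len(annotated), len(selected), len(overlap)
        )
        if p < alpha:
            results.append((term, len(overlap), p))
    return sorted(results, key=lambda r: r[2])
```

Borrowing annotation from orthologous genes, as the abstract describes, would correspond here to swapping in a different `term_to_genes` mapping while keeping the same test.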
Mining Microarray Data at NCBI’s Gene Expression Omnibus (GEO)
Barrett, Tanya; Edgar, Ron
2006-01-01
Summary The Gene Expression Omnibus (GEO) at the National Center for Biotechnology Information (NCBI) has emerged as the leading fully public repository for gene expression data. This chapter describes how to use Web-based interfaces, applications, and graphics to effectively explore, visualize, and interpret the hundreds of microarray studies and millions of gene expression patterns stored in GEO. Data can be examined from both experiment-centric and gene-centric perspectives using user-friendly tools that do not require specialized expertise in microarray analysis or time-consuming download of massive data sets. The GEO database is publicly accessible through the World Wide Web at http://www.ncbi.nlm.nih.gov/geo. PMID:16888359
Chondrocyte channel transcriptomics
Lewis, Rebecca; May, Hannah; Mobasheri, Ali; Barrett-Jolley, Richard
2013-01-01
To date, a range of ion channels have been identified in chondrocytes using a number of different techniques, predominantly electrophysiological and/or biomolecular; each of these has its advantages and disadvantages. Here we aim to compare and contrast the data available from biophysical and microarray experiments. This letter analyses recent transcriptomics datasets from chondrocytes, accessible from the European Bioinformatics Institute (EBI). We discuss whether such bioinformatic analysis of microarray datasets can potentially accelerate identification and discovery of ion channels in chondrocytes. The ion channels which appear most frequently across these microarray datasets are discussed, along with their possible functions. We discuss whether functional or protein data exist which support the microarray data. A microarray experiment comparing gene expression in osteoarthritis and healthy cartilage is also discussed and we verify the differential expression of 2 of these genes, namely the genes encoding large calcium-activated potassium (BK) and aquaporin channels. PMID:23995703
Gene amplification of the transcription factor DP1 and CTNND1 in human lung cancer.
Castillo, Sandra D; Angulo, Barbara; Suarez-Gauthier, Ana; Melchor, Lorenzo; Medina, Pedro P; Sanchez-Verde, Lydia; Torres-Lanzas, Juan; Pita, Guillermo; Benitez, Javier; Sanchez-Cespedes, Montse
2010-09-01
The search for novel oncogenes is important because they could be the target of future specific anticancer therapies. In the present paper we report the identification of novel amplified genes in lung cancer by means of global gene expression analysis. To screen for amplicons, we aligned the gene expression data according to the position of transcripts in the human genome and searched for clusters of over-expressed genes. We found several clusters with gene over-expression, suggesting an underlying genomic amplification. FISH and microarray analysis for DNA copy number in two clusters, at chromosomes 11q12 and 13q34, confirmed the presence of amplifications spanning about 0.4 and 1 Mb for 11q12 and 13q34, respectively. Amplification at these regions each occurred at a frequency of 3%. Moreover, quantitative RT-PCR of each individual transcript within the amplicons allowed us to verify the increase in gene expression of several genes. The p120ctn and DP1 proteins, encoded by two candidate oncogenes, CTNND1 and TFDP1, at the 11q12 and 13q34 amplicons, respectively, showed very strong immunostaining in lung tumours with gene amplification. We then focused on the 13q34 amplicon and on the TFDP1 candidate oncogene. To further determine the oncogenic properties of DP1, we searched for lung cancer cell lines carrying TFDP1 amplification. Depletion of TFDP1 expression by small interfering RNA in a lung cancer cell line (HCC33) with TFDP1 amplification and protein over-expression reduced cell viability by 50%. In conclusion, we report the identification of two novel amplicons, at 13q34 and 11q12, each occurring at a frequency of 3% of non-small cell lung cancers. TFDP1, which encodes the E2F-associated transcription factor DP1, is a candidate oncogene at 13q34. The data discussed in this publication have been deposited in NCBI's Gene Expression Omnibus (GEO; http://www.ncbi.nlm.nih.gov/geo/) and are accessible through GEO Series Accession No. GSE21168.
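The amplicon screen described above (ordering transcripts by genomic position and looking for clusters of over-expressed genes) can be sketched as a scan for runs of consecutive over-expressed genes on each chromosome. This is a minimal illustration: the tuple layout, the log2-ratio threshold and the minimum run length are assumptions for the example, and the abstract does not state the criteria the authors actually used.

```python
from itertools import groupby


def overexpressed_clusters(genes, threshold=1.0, min_run=3):
    """Find runs of positionally adjacent over-expressed genes.

    genes: iterable of (chromosome, start_bp, name, log2_ratio) tuples.
    Returns a list of (chromosome, [gene names]) for every run of at
    least `min_run` consecutive genes with log2_ratio >= threshold.
    """
    clusters = []
    # Order genes along the genome: by chromosome, then by start position.
    ordered = sorted(genes, key=lambda g: (g[0], g[1]))
    for chrom, chrom_genes in groupby(ordered, key=lambda g: g[0]):
        run = []
        for gene in chrom_genes:
            if gene[3] >= threshold:
                run.append(gene)
                continue
            # A gene below threshold ends the current run.
            if len(run) >= min_run:
                clusters.append((chrom, [g[2] for g in run]))
            run = []
        if len(run) >= min_run:  # run that reaches the chromosome end
            clusters.append((chrom, [g[2] for g in run]))
    return clusters
```

A candidate cluster found this way only suggests an underlying amplification; as in the abstract, copy number would still need to be confirmed by an independent assay such as FISH.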
Wimmer, Isabella; Tröscher, Anna R; Brunner, Florian; Rubino, Stephen J; Bien, Christian G; Weiner, Howard L; Lassmann, Hans; Bauer, Jan
2018-04-20
Formalin-fixed paraffin-embedded (FFPE) tissues are valuable resources commonly used in pathology. However, formalin fixation modifies nucleic acids, challenging the isolation of high-quality RNA for genetic profiling. Here, we assessed feasibility and reliability of microarray studies analysing transcriptome data from fresh, fresh-frozen (FF) and FFPE tissues. We show that reproducible microarray data can be generated from only 2 ng FFPE-derived RNA. For RNA quality assessment, fragment size distribution (DV200) and qPCR proved most suitable. During RNA isolation, extending tissue lysis time to 10 hours reduced high-molecular-weight species, while additional incubation at 70 °C markedly increased RNA yields. Since FF- and FFPE-derived microarrays constitute different data entities, we used indirect measures to investigate gene signal variation and relative gene expression. Whole-genome analyses revealed high concordance rates, while review on a single-gene basis showed higher data variation in FFPE than FF arrays. Using an experimental model, gene set enrichment analysis (GSEA) of FFPE-derived microarrays and fresh tissue-derived RNA-Seq datasets yielded similarly affected pathways, confirming the applicability of FFPE tissue in global gene expression analysis. Our study provides a workflow comprising RNA isolation, quality assessment and microarray profiling using minimal RNA input, thus enabling hypothesis-generating pathway analyses from limited amounts of precious, pathologically significant FFPE tissues.
Lee, Joseph C; Stiles, David; Lu, Jun; Cam, Margaret C
2007-01-01
Background Microarrays are a popular tool used in experiments to measure gene expression levels. Improving the reproducibility of microarray results produced by different chips from various manufacturers is important to create comparable and combinable experimental results. Alternative splicing has been cited as a possible cause of differences in expression measurements across platforms, though no study to this point has been conducted to show its influence in cross-platform differences. Results Using probe sequence data, a new microarray probe/transcript annotation was created based on the AceView Aug05 release that allowed for the categorization of genes based on their expression measurements' susceptibility to alternative splicing differences across microarray platforms. Examining gene expression data from multiple platforms in light of the new categorization, genes unsusceptible to alternative splicing differences showed higher signal agreement than those genes most susceptible to alternative splicing differences. The analysis gave rise to a different probe-level visualization method that can highlight probe differences according to transcript specificity. Conclusion The results highlight the need for detailed probe annotation at the transcriptome level. The presence of alternative splicing within a given sample can affect gene expression measurements and is a contributing factor to overall technical differences across platforms. PMID:17708771
Brodsky, Leonid; Leontovich, Andrei; Shtutman, Michael; Feinstein, Elena
2004-01-01
Mathematical methods of analysis of microarray hybridizations deal with gene expression profiles as elementary units. However, some of these profiles do not reflect a biologically relevant transcriptional response, but rather stem from technical artifacts. Here, we describe two technically independent but rationally interconnected methods for identification of such artifactual profiles. Our diagnostics are based on detection of deviations from uniformity, which is assumed as the main underlying principle of microarray design. Method 1 is based on detection of non-uniformity of microarray distribution of printed genes that are clustered based on the similarity of their expression profiles. Method 2 is based on evaluation of the presence of gene-specific microarray spots within the slides’ areas characterized by an abnormal concentration of low/high differential expression values, which we define as ‘patterns of differentials’. Applying two novel algorithms, for nested clustering (method 1) and for pattern detection (method 2), we can make a dual estimation of the profile’s quality for almost every printed gene. Genes with artifactual profiles detected by method 1 may then be removed from further analysis. Suspicious differential expression values detected by method 2 may be either removed or weighted according to the probabilities of patterns that cover them, thus diminishing their input in any further data analysis. PMID:14999086
APPLICATION OF DNA MICROARRAYS TO REPRODUCTIVE TOXICOLOGY AND THE DEVELOPMENT OF A TESTIS ARRAY
With the advent of sequence information for entire mammalian genomes, it is now possible to analyze gene expression and gene polymorphisms on a genomic scale. The primary tool for analysis of gene expression is the DNA microarray. We have used commercially available cDNA micro...
With the advent of sequence information for entire eukaryotic genomes, it is now possible to analyze gene expression on a genomic scale. The primary tool for genomic analysis of gene expression is the gene microarray. We have used commercially available and custom cDNA microarray...
Okaty, Benjamin W; Miller, Mark N; Sugino, Ken; Hempel, Chris M; Nelson, Sacha B
2009-01-01
Fast-spiking (FS) interneurons are important elements of neocortical circuitry that constitute the primary source of synaptic inhibition in adult cortex and impart temporal organization on ongoing cortical activity. The highly specialized intrinsic membrane and firing properties that allow cortical FS interneurons to perform these functions are due to equally specialized gene expression, which is ultimately coordinated by cell-type-specific transcriptional regulation. While embryonic transcriptional events govern the initial steps of cell-type specification in most cortical interneurons, including FS cells, the electrophysiological properties that distinguish adult cortical cell types emerge relatively late in postnatal development, and the transcriptional events that drive this maturational process are not known. To address this, we used mouse whole-genome microarrays and whole-cell patch clamp to characterize the transcriptional and electrophysiological maturation of cortical FS interneurons between postnatal day 7 (P7) and P40. We found that the intrinsic and synaptic physiology of FS cells undergoes profound regulation over the first four postnatal weeks, and that these changes are correlated with largely monotonic but bidirectional transcriptional regulation of thousands of genes belonging to multiple functional classes. Using our microarray screen as a guide, we discovered that upregulation of 2-pore K+ leak channels between P10 and P25 contributes to one of the major differences between the intrinsic membrane properties of immature and adult FS cells, and found a number of other candidate genes that likely confer cell-type specificity on mature FS cells. PMID:19474331
Recent molecular genetic studies and methodological issues in suicide research.
Tsai, Shih-Jen; Hong, Chen-Jee; Liou, Ying-Jay
2011-06-01
Suicide behavior (SB) spans a spectrum ranging from suicidal ideation to suicide attempts and completed suicide. Strong evidence suggests a genetic susceptibility to SB, including familial heritability and common occurrence in twins. This review addresses recent molecular genetic studies in SB that include case-control association, genome gene-expression microarray, and genome-wide association (GWA). This work also reviews epigenetics in SB and pharmacogenetic studies of antidepressant-induced suicide. SB fulfills criteria for a complex genetic phenotype in which environmental factors interact with multiple genes to influence susceptibility. So far, case-control association approaches are still the mainstream in SB genetic studies, although whole genome gene-expression microarray and GWA studies have begun to emerge in recent years. Genetic association studies have suggested several genes (e.g., serotonin transporter, tryptophan hydroxylase 2, and brain-derived neurotrophic factor) related to SB, but not all reports support these findings. The case-control approach while useful is limited by present knowledge of disease pathophysiology. Genome-wide studies of gene expression and genetic variation are not constrained by our limited knowledge. However, the explanatory power and path to clinical translation of risk estimates for common variants reported in genome-wide association studies remain unclear because of the presence of rare and structural genetic variation. As whole genome sequencing becomes increasingly widespread, available genomic information will no longer be the limiting factor in applying genetics to clinical medicine. These approaches provide exciting new avenues to identify new candidate genes for SB genetic studies. The other limitation of genetic association is the lack of a consistent definition of the SB phenotype among studies, an inconsistency that hampers the comparability of the studies and data pooling. 
In summary, SB involves multiple genes interacting with non-genetic factors. A better understanding of the SB genes by combining whole genome approaches with case-control association studies, may potentially lead to developing effective screening, prevention, and management of SB. Copyright © 2010 Elsevier Inc. All rights reserved.
Hierarchical Gene Selection and Genetic Fuzzy System for Cancer Microarray Data Classification
Nguyen, Thanh; Khosravi, Abbas; Creighton, Douglas; Nahavandi, Saeid
2015-01-01
This paper introduces a novel approach to gene selection based on a substantial modification of analytic hierarchy process (AHP). The modified AHP systematically integrates outcomes of individual filter methods to select the most informative genes for microarray classification. Five individual ranking methods including t-test, entropy, receiver operating characteristic (ROC) curve, Wilcoxon and signal to noise ratio are employed to rank genes. These ranked genes are then considered as inputs for the modified AHP. Additionally, a method that uses fuzzy standard additive model (FSAM) for cancer classification based on genes selected by AHP is also proposed in this paper. Traditional FSAM learning is a hybrid process comprising unsupervised structure learning and supervised parameter tuning. Genetic algorithm (GA) is incorporated in-between unsupervised and supervised training to optimize the number of fuzzy rules. The integration of GA enables FSAM to deal with the high-dimensional-low-sample nature of microarray data and thus enhance the efficiency of the classification. Experiments are carried out on numerous microarray datasets. Results demonstrate the performance dominance of the AHP-based gene selection against the single ranking methods. Furthermore, the combination of AHP-FSAM shows a great accuracy in microarray data classification compared to various competing classifiers. The proposed approach therefore is useful for medical practitioners and clinicians as a decision support system that can be implemented in the real medical practice. PMID:25823003
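Two of the five filter statistics named above, the signal-to-noise ratio and the t-test, can be sketched as follows. The combination step is a plain mean of ranks, which is a deliberately simpler stand-in and not the modified AHP the paper proposes. All function names and the data layout are assumptions for this example, and the Welch form of the t-statistic is one possible choice because the abstract does not specify the variant.

```python
from statistics import mean, stdev


def signal_to_noise(a, b):
    """Signal-to-noise ratio: difference of means over sum of std devs."""
    # Assumes each class has >= 2 samples and non-zero spread.
    return (mean(a) - mean(b)) / (stdev(a) + stdev(b))


def welch_t(a, b):
    """Welch two-sample t-statistic (unequal variances)."""
    va = stdev(a) ** 2 / len(a)
    vb = stdev(b) ** 2 / len(b)
    return (mean(a) - mean(b)) / (va + vb) ** 0.5


def rank_genes(scores):
    """Map gene -> rank, where rank 1 has the largest absolute score."""
    ordered = sorted(scores, key=lambda g: abs(scores[g]), reverse=True)
    return {gene: i + 1 for i, gene in enumerate(ordered)}


def aggregate_by_mean_rank(expr, labels):
    """Order genes by their mean rank across the two filter statistics.

    expr   : dict gene -> list of expression values, one per sample
    labels : list of 0/1 class labels aligned with the samples
    """
    per_method = []
    for score_fn in (signal_to_noise, welch_t):
        scores = {}
        for gene, values in expr.items():
            a = [v for v, lab in zip(values, labels) if lab == 0]
            b = [v for v, lab in zip(values, labels) if lab == 1]
            scores[gene] = score_fn(a, b)
        per_method.append(rank_genes(scores))
    mean_rank = {g: mean(r[g] for r in per_method) for g in expr}
    return sorted(expr, key=lambda g: mean_rank[g])
```

Averaging ranks treats every filter as equally trustworthy; the paper's modified AHP integrates the same kind of per-method outcomes in a different way that the abstract does not detail.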
NASA Technical Reports Server (NTRS)
Khaoustov, V. I.; Risin, D.; Pellis, N. R.; Yoffe, B.; McIntire, L. V. (Principal Investigator)
2001-01-01
Developed at NASA, the rotary cell culture system (RCCS) allows the creation of a unique microgravity environment of low shear force and high mass transfer, and enables three-dimensional (3D) cell culture of dissimilar cell types. Recently we demonstrated that simulated microgravity is conducive to maintaining long-term cultures of functional hepatocytes and promotes 3D cell assembly. Using deoxyribonucleic acid (DNA) microarray technology, it is now possible to measure the levels of thousands of different messenger ribonucleic acids (mRNAs) in a single hybridization step. This technique is particularly powerful for comparing gene expression in the same tissue under different environmental conditions. The aim of this research was to analyze gene expression of a hepatoblastoma cell line (HepG2) during the early stage of 3D cell assembly in simulated microgravity. For this, mRNA from HepG2 cultured in the RCCS was analyzed by DNA microarray. Analysis of HepG2 mRNA using a 6K glass DNA microarray revealed changes in expression of 95 genes (overexpression of 85 genes and downregulation of 10 genes). Our preliminary results indicated that simulated microgravity modifies the expression of several genes and that microarray technology may provide new understanding of the fundamental biological questions of how gravity affects the development and function of individual cells.
Agarwal, Parul; Garg, Varsha; Gautam, Taru; Pillai, Beena; Kanoria, Shaveta; Burma, Pradeep Kumar
2014-04-01
Several promoters of plant, viral and artificial origin that confer high constitutive expression have been reported. Among these, the CaMV 35S promoter is used extensively for transgene expression in plants. We identified candidate promoters from Arabidopsis based on their transcript levels (meta-analysis of available microarray control datasets) to test their activity in comparison to the CaMV 35S promoter. A set of 11 candidate genes was identified which showed high transcript levels in the aerial tissue (i.e. leaf, shoot, flower and stem). In the initial part of the study, binary vectors were developed wherein the promoter and 5'UTR region of these candidate genes (Upstream Regulatory Module, URM) were cloned upstream of the reporter gene β glucuronidase (gus). The promoter strengths were tested in transformed callus of Nicotiana tabacum and Gossypium hirsutum. On the basis of the results obtained from the callus, the influence of the URM cassettes on transgene expression was tested in transgenic tobacco. The URM regions of the genes encoding a subunit of photosystem I (PHOTO) and geranyl geranyl reductase (GGR) in the A. thaliana genome showed significantly high levels of GUS activity in comparison to the CaMV 35S promoter. Further, placing the 5'UTRs of both genes downstream of the CaMV 35S promoter led to a substantial increase in GUS activity in transgenic tobacco lines and cotton callus. The enhancement observed was even higher than that observed with viral leader sequences such as Ω and AMV, which are known translational enhancers. Our results indicate that the two URM cassettes, or the 5'UTR regions of PHOTO and GGR when placed downstream of the CaMV 35S promoter, can be used to drive high levels of transgene expression in dicotyledons.
Kresse, Stine H; Berner, Jeanne-Marie; Meza-Zepeda, Leonardo A; Gregory, Simon G; Kuo, Wen-Lin; Gray, Joe W; Forus, Anne; Myklebost, Ola
2005-01-01
Background Amplification of the q21-q23 region on chromosome 1 is frequently found in sarcomas and a variety of other solid tumours. Previous analyses of sarcomas have indicated the presence of at least two separate amplicons within this region, one located in 1q21 and one located near the apolipoprotein A-II (APOA2) gene in 1q23. In this study we have mapped and characterized the amplicon in 1q23 in more detail. Results We have used fluorescence in situ hybridisation (FISH) and microarray-based comparative genomic hybridisation (array CGH) to map and define the borders of the amplicon in 10 sarcomas. A subregion of approximately 800 kb was identified as the core of the amplicon. The amplification patterns of nine possible candidate target genes located to this subregion were determined by Southern blot analysis. The genes activating transcription factor 6 (ATF6) and dual specificity phosphatase 12 (DUSP12) showed the highest level of amplification, and they were also shown to be over-expressed by quantitative real-time reverse transcription PCR (RT-PCR). In general, the level of expression reflected the level of amplification in the different tumours. DUSP12 was expressed significantly higher than ATF6 in a subset of the tumours. In addition, two genes known to be transcriptionally activated by ATF6, glucose-regulated protein 78 kDa and -94 kDa (GRP78 and GRP94), were shown to be over-expressed in the tumours that showed over-expression of ATF6. Conclusion ATF6 and DUSP12 seem to be the most likely candidate target genes for the 1q23 amplification in sarcomas. Both genes have possible roles in promoting cell growth, which makes them interesting candidate targets. PMID:16274472
2013-01-01
Background As high-throughput genomic technologies become accurate and affordable, an increasing number of data sets have been accumulated in the public domain and genomic information integration and meta-analysis have become routine in biomedical research. In this paper, we focus on microarray meta-analysis, where multiple microarray studies with relevant biological hypotheses are combined in order to improve candidate marker detection. Many methods have been developed and applied in the literature, but their performance and properties have only been minimally investigated. There is currently no clear conclusion or guideline as to the proper choice of a meta-analysis method given an application; the decision essentially requires both statistical and biological considerations. Results We performed 12 microarray meta-analysis methods for combining multiple simulated expression profiles, and such methods can be categorized for different hypothesis setting purposes: (1) HS(A): DE genes with non-zero effect sizes in all studies, (2) HS(B): DE genes with non-zero effect sizes in one or more studies and (3) HS(r): DE genes with non-zero effect in the "majority" of studies. We then performed a comprehensive comparative analysis through six large-scale real applications using four quantitative statistical evaluation criteria: detection capability, biological association, stability and robustness. We elucidated hypothesis settings behind the methods and further applied multi-dimensional scaling (MDS) and an entropy measure to characterize the meta-analysis methods and data structure, respectively. Conclusions The aggregated results from the simulation study categorized the 12 methods into three hypothesis settings (HS(A), HS(B), and HS(r)). Evaluation in real data and results from MDS and entropy analyses provided an insightful and practical guideline to the choice of the most suitable method in a given application.
All source files for simulation and real data are available on the author’s publication website. PMID:24359104
Chang, Lun-Ching; Lin, Hui-Min; Sibille, Etienne; Tseng, George C
2013-12-21
As high-throughput genomic technologies become accurate and affordable, an increasing number of data sets have been accumulated in the public domain and genomic information integration and meta-analysis have become routine in biomedical research. In this paper, we focus on microarray meta-analysis, where multiple microarray studies with relevant biological hypotheses are combined in order to improve candidate marker detection. Many methods have been developed and applied in the literature, but their performance and properties have only been minimally investigated. There is currently no clear conclusion or guideline as to the proper choice of a meta-analysis method given an application; the decision essentially requires both statistical and biological considerations. We performed 12 microarray meta-analysis methods for combining multiple simulated expression profiles, and such methods can be categorized for different hypothesis setting purposes: (1) HS(A): DE genes with non-zero effect sizes in all studies, (2) HS(B): DE genes with non-zero effect sizes in one or more studies and (3) HS(r): DE gene with non-zero effect in "majority" of studies. We then performed a comprehensive comparative analysis through six large-scale real applications using four quantitative statistical evaluation criteria: detection capability, biological association, stability and robustness. We elucidated hypothesis settings behind the methods and further apply multi-dimensional scaling (MDS) and an entropy measure to characterize the meta-analysis methods and data structure, respectively. The aggregated results from the simulation study categorized the 12 methods into three hypothesis settings (HS(A), HS(B), and HS(r)). Evaluation in real data and results from MDS and entropy analyses provided an insightful and practical guideline to the choice of the most suitable method in a given application. All source files for simulation and real data are available on the author's publication website.
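The contrast between the HS(A) and HS(B) settings can be illustrated with two textbook p-value combination rules. The abstract does not name the 12 methods compared, so Fisher's method and the maximum p-value rule are used here purely as examples. Fisher's statistic can become large when a single study is strongly significant, which matches the "one or more studies" flavour of HS(B), while the maximum p-value rule is small only when every study is significant, which matches HS(A). Function names are assumptions, and independence of the studies is assumed throughout.

```python
from math import exp, log


def chi2_sf_even_df(x, df):
    """Chi-square survival function for even df (closed-form series)."""
    if df <= 0 or df % 2:
        raise ValueError("df must be a positive even integer")
    half = x / 2.0
    term = total = 1.0
    for i in range(1, df // 2):
        term *= half / i  # builds (x/2)^i / i!
        total += term
    return exp(-half) * total


def fisher_combined_p(pvalues):
    """Fisher's method: -2 * sum(ln p) ~ chi-square with 2k df under H0."""
    stat = -2.0 * sum(log(p) for p in pvalues)  # requires every p > 0
    return chi2_sf_even_df(stat, 2 * len(pvalues))


def max_p_combined(pvalues):
    """Maximum p-value rule: P(max of k uniforms <= t) = t**k under H0."""
    return max(pvalues) ** len(pvalues)
```

For a gene with p-values 0.01 and 0.5 in two studies, Fisher's method gives a combined p-value of roughly 0.03 while the maximum p-value rule gives 0.25, so the choice of rule changes whether the gene is called differentially expressed.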
Exploring the key genes and pathways in enchondromas using a gene expression microarray.
Shi, Zhongju; Zhou, Hengxing; Pan, Bin; Lu, Lu; Kang, Yi; Liu, Lu; Wei, Zhijian; Feng, Shiqing
2017-07-04
Enchondromas are the most common primary benign osseous neoplasms that occur in the medullary bone; they can undergo malignant transformation into chondrosarcoma. However, enchondromas are always undetected in patients, and the molecular mechanism is unclear. To identify key genes and pathways associated with the occurrence and development of enchondromas, we downloaded the gene expression dataset GSE22855 and obtained the differentially expressed genes (DEGs) by analyzing high-throughput gene expression in enchondromas. In total, 635 genes were identified as DEGs. Of these, 225 genes (35.43%) were up-regulated, and the remaining 410 genes (64.57%) were down-regulated. We identified the predominant gene ontology (GO) categories and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways that were significantly over-represented in the enchondroma samples compared with the control samples. Subsequently, the top 10 core genes were identified from the protein-protein interaction (PPI) network. The enrichment analyses of the genes mainly involved in two significant modules showed that the DEGs were principally related to ribosomes, protein digestion and absorption, ECM-receptor interaction, focal adhesion, amoebiasis and the PI3K-Akt signaling pathway. Together, these data elucidate the molecular mechanisms underlying the occurrence and development of enchondromas and provide promising candidates for therapeutic intervention and prognostic evaluation. However, further experimental studies are needed to confirm these results.
Biomarkers of the Hedgehog/Smoothened pathway in healthy volunteers
Kadam, Sunil K; Patel, Bharvin K R; Jones, Emma; Nguyen, Tuan S; Verma, Lalit K; Landschulz, Katherine T; Stepaniants, Sergey; Li, Bin; Brandt, John T; Brail, Leslie H
2012-01-01
The Hedgehog (Hh) pathway is involved in oncogenic transformation and tumor maintenance. The primary objective of this study was to select surrogate tissue to measure messenger ribonucleic acid (mRNA) levels of Hh pathway genes for measurement of pharmacodynamic effect. Expression of Hh pathway specific genes was measured by quantitative real time polymerase chain reaction (qRT-PCR) and global gene expression using Affymetrix U133 microarrays. Correlations were made between the expression of specific genes determined by qRT-PCR and normalized microarray data. Gene ontology analysis using microarray data for a broader set of Hh pathway genes was performed to identify additional Hh pathway-related markers in the surrogate tissue. RNA extracted from blood, hair follicle, and skin obtained from healthy subjects was analyzed by qRT-PCR for 31 genes, whereas 8 samples were analyzed for a 7-gene subset. Twelve sample sets, each with ≤500 ng total RNA derived from hair, skin, and blood, were analyzed using Affymetrix U133 microarrays. Transcripts for several Hh pathway genes were undetectable in blood using qRT-PCR. Skin was the most desirable matrix, followed by hair follicle. Whether processed by robust multiarray average or microarray suite 5 (MAS5), expression patterns of individual samples showed co-clustered signals; both normalization methods were equally effective for unsupervised analysis. The MAS5-normalized probe sets appeared better suited for supervised analysis. This work provides the basis for selection of a surrogate tissue and an expression analysis-based approach to evaluate pathway-related genes as markers of pharmacodynamic effect with novel inhibitors of the Hh pathway. PMID:22611475
DFP: a Bioconductor package for fuzzy profile identification and gene reduction of microarray data
Glez-Peña, Daniel; Álvarez, Rodrigo; Díaz, Fernando; Fdez-Riverola, Florentino
2009-01-01
Background Expression profiling assays done by using DNA microarray technology generate enormous data sets that are not amenable to simple analysis. The greatest challenge in maximizing the use of this huge amount of data is to develop algorithms to interpret and interconnect results from different genes under different conditions. In this context, fuzzy logic can provide a systematic and unbiased way to both (i) find biologically significant insights relating to meaningful genes, thereby removing the need for expert knowledge in preliminary steps of microarray data analyses and (ii) reduce the cost and complexity of later applied machine learning techniques being able to achieve interpretable models. Results DFP is a new Bioconductor R package that implements a method for discretizing and selecting differentially expressed genes based on the application of fuzzy logic. DFP takes advantage of fuzzy membership functions to assign linguistic labels to gene expression levels. The technique builds a reduced set of relevant genes (FP, Fuzzy Pattern) able to summarize and represent each underlying class (pathology). A last step constructs a biased set of genes (DFP, Discriminant Fuzzy Pattern) by intersecting existing fuzzy patterns in order to detect discriminative elements. In addition, the software provides new functions and visualisation tools that summarize achieved results and aid in the interpretation of differentially expressed genes from multiple microarray experiments. Conclusion DFP integrates with other packages of the Bioconductor project, uses common data structures and is accompanied by ample documentation. It has the advantage that its parameters are highly configurable, facilitating the discovery of biologically relevant connections between sets of genes belonging to different pathologies. This information makes it possible to automatically filter irrelevant genes thereby reducing the large volume of data supplied by microarray experiments. 
Based on these contributions GENECBR, a successful tool for cancer diagnosis using microarray datasets, has recently been released. PMID:19178723
Kilicoglu, Halil; Shin, Dongwook; Rindflesch, Thomas C.
2014-01-01
Gene regulatory networks are a crucial aspect of systems biology in describing molecular mechanisms of the cell. Various computational models rely on random gene selection to infer such networks from microarray data. While incorporation of prior knowledge into data analysis has been deemed important, in practice, it has generally been limited to referencing genes in probe sets and using curated knowledge bases. We investigate the impact of augmenting microarray data with semantic relations automatically extracted from the literature, with the view that relations encoding gene/protein interactions eliminate the need for random selection of components in non-exhaustive approaches, producing a more accurate model of cellular behavior. A genetic algorithm is then used to optimize the strength of interactions using microarray data and an artificial neural network fitness function. The result is a directed and weighted network providing the individual contribution of each gene to its target. For testing, we used invasive ductal carcinoma of the breast to query the literature and a microarray set containing gene expression changes in these cells over several time points. Our model demonstrates significantly better fitness than the state-of-the-art model, which relies on an initial random selection of genes. Comparison to the component pathways of the KEGG Pathways in Cancer map reveals that the resulting networks contain both known and novel relationships. The p53 pathway results were manually validated in the literature. 60% of non-KEGG relationships were supported (74% for highly weighted interactions). The method was then applied to yeast data and our model again outperformed the comparison model. Our results demonstrate the advantage of combining gene interactions extracted from the literature in the form of semantic relations with microarray analysis in generating contribution-weighted gene regulatory networks. 
This methodology can make a significant contribution to understanding the complex interactions involved in cellular behavior and molecular physiology. PMID:24921649
Chen, Guocai; Cairelli, Michael J; Kilicoglu, Halil; Shin, Dongwook; Rindflesch, Thomas C
2014-06-01
Gene regulatory networks are a crucial aspect of systems biology in describing molecular mechanisms of the cell. Various computational models rely on random gene selection to infer such networks from microarray data. While incorporation of prior knowledge into data analysis has been deemed important, in practice, it has generally been limited to referencing genes in probe sets and using curated knowledge bases. We investigate the impact of augmenting microarray data with semantic relations automatically extracted from the literature, with the view that relations encoding gene/protein interactions eliminate the need for random selection of components in non-exhaustive approaches, producing a more accurate model of cellular behavior. A genetic algorithm is then used to optimize the strength of interactions using microarray data and an artificial neural network fitness function. The result is a directed and weighted network providing the individual contribution of each gene to its target. For testing, we used invasive ductal carcinoma of the breast to query the literature and a microarray set containing gene expression changes in these cells over several time points. Our model demonstrates significantly better fitness than the state-of-the-art model, which relies on an initial random selection of genes. Comparison to the component pathways of the KEGG Pathways in Cancer map reveals that the resulting networks contain both known and novel relationships. The p53 pathway results were manually validated in the literature. 60% of non-KEGG relationships were supported (74% for highly weighted interactions). The method was then applied to yeast data and our model again outperformed the comparison model. Our results demonstrate the advantage of combining gene interactions extracted from the literature in the form of semantic relations with microarray analysis in generating contribution-weighted gene regulatory networks. 
This methodology can make a significant contribution to understanding the complex interactions involved in cellular behavior and molecular physiology.
DFP: a Bioconductor package for fuzzy profile identification and gene reduction of microarray data.
Glez-Peña, Daniel; Alvarez, Rodrigo; Díaz, Fernando; Fdez-Riverola, Florentino
2009-01-29
Expression profiling assays done by using DNA microarray technology generate enormous data sets that are not amenable to simple analysis. The greatest challenge in maximizing the use of this huge amount of data is to develop algorithms to interpret and interconnect results from different genes under different conditions. In this context, fuzzy logic can provide a systematic and unbiased way to both (i) find biologically significant insights relating to meaningful genes, thereby removing the need for expert knowledge in preliminary steps of microarray data analyses and (ii) reduce the cost and complexity of later applied machine learning techniques being able to achieve interpretable models. DFP is a new Bioconductor R package that implements a method for discretizing and selecting differentially expressed genes based on the application of fuzzy logic. DFP takes advantage of fuzzy membership functions to assign linguistic labels to gene expression levels. The technique builds a reduced set of relevant genes (FP, Fuzzy Pattern) able to summarize and represent each underlying class (pathology). A last step constructs a biased set of genes (DFP, Discriminant Fuzzy Pattern) by intersecting existing fuzzy patterns in order to detect discriminative elements. In addition, the software provides new functions and visualisation tools that summarize achieved results and aid in the interpretation of differentially expressed genes from multiple microarray experiments. DFP integrates with other packages of the Bioconductor project, uses common data structures and is accompanied by ample documentation. It has the advantage that its parameters are highly configurable, facilitating the discovery of biologically relevant connections between sets of genes belonging to different pathologies. This information makes it possible to automatically filter irrelevant genes thereby reducing the large volume of data supplied by microarray experiments. 
Based on these contributions GENECBR, a successful tool for cancer diagnosis using microarray datasets, has recently been released.
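The discretization idea in this abstract can be illustrated with a minimal Python sketch. It is not the DFP package's API (DFP is an R package): the function names, the triangular membership shapes, the choice of minimum, median and maximum as membership centres, and the 0.8 agreement threshold are all assumptions made for illustration.

```python
import numpy as np

LABELS = np.array(["Low", "Medium", "High"])

def fuzzy_labels(values):
    """Assign a linguistic label to each expression value of one gene.

    Hypothetical helper: membership centres (min, median, max) and the
    triangular shapes are illustrative assumptions, not DFP defaults.
    """
    v = np.asarray(values, dtype=float)
    lo, mid, hi = v.min(), np.median(v), v.max()
    eps = 1e-12  # avoids division by zero when values coincide
    low = np.clip((mid - v) / max(mid - lo, eps), 0.0, 1.0)
    high = np.clip((v - mid) / max(hi - mid, eps), 0.0, 1.0)
    medium = 1.0 - np.maximum(low, high)
    # Each value receives the label with the largest membership degree.
    return LABELS[np.vstack([low, medium, high]).argmax(axis=0)]

def fuzzy_pattern(labels, min_fraction=0.8):
    """Return the dominant label of a class if enough samples share it."""
    names, counts = np.unique(labels, return_counts=True)
    best = counts.argmax()
    if counts[best] / len(labels) >= min_fraction:
        return str(names[best])
    return None

def is_discriminant(values, classes, min_fraction=0.8):
    """True if every class has a fuzzy pattern and the patterns differ."""
    labels = fuzzy_labels(values)
    classes = np.asarray(classes)
    patterns = {c: fuzzy_pattern(labels[classes == c], min_fraction)
                for c in np.unique(classes)}
    found = [p for p in patterns.values() if p is not None]
    return len(found) == len(patterns) and len(set(found)) > 1
```

A gene whose values are labeled "Low" in nearly all samples of one class and "High" in nearly all samples of another would be kept as discriminant under these assumptions; a gene with the same dominant label in every class would be filtered out.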
Klein, Hans-Ulrich; Ruckert, Christian; Kohlmann, Alexander; Bullinger, Lars; Thiede, Christian; Haferlach, Torsten; Dugas, Martin
2009-12-15
Multiple gene expression signatures derived from microarray experiments have been published in the field of leukemia research. A comparison of these signatures with results from new experiments is useful for verification as well as for interpretation of the results obtained. Currently, the percentage of overlapping genes is frequently used to compare published gene signatures against a signature derived from a new experiment. However, it has been shown that the percentage of overlapping genes is of limited use for comparing two experiments due to the variability of gene signatures caused by different array platforms or assay-specific influencing parameters. Here, we present a robust approach for a systematic and quantitative comparison of published gene expression signatures with an exemplary query dataset. A database storing 138 leukemia-related published gene signatures was designed. Each gene signature was manually annotated with terms according to a leukemia-specific taxonomy. Two analysis steps are implemented to compare a new microarray dataset with the results from previous experiments stored and curated in the database. First, the global test method is applied to assess gene signatures and to constitute a ranking among them. In a subsequent analysis step, the focus is shifted from single gene signatures to chromosomal aberrations or molecular mutations as modeled in the taxonomy. Potentially interesting disease characteristics are detected based on the ranking of gene signatures associated with these aberrations stored in the database. Two example analyses are presented. An implementation of the approach is freely available as web-based application. The presented approach helps researchers to systematically integrate the knowledge derived from numerous microarray experiments into the analysis of a new dataset. 
By means of example leukemia datasets we demonstrate that this approach detects related experiments as well as related molecular mutations and may help to interpret new microarray data.
Rise, Matthew L; Nash, Gordon W; Hall, Jennifer R; Booman, Marije; Hori, Tiago S; Trippel, Edward A; Gamperl, A Kurt
2014-12-01
Early life stage mortality is an important issue for Atlantic cod aquaculture, yet the impact of the cod maternal (egg) transcriptome on egg quality and mortality during embryonic development is poorly understood. In the present work, we studied embryonic mortality and maternal transcript expression using eggs from 15 females. Total mortality at 7 days post-fertilization (7 dpf, segmentation stage) was used as an index of egg quality. A 20,000 probe (20K) microarray experiment compared the 7 hours post-fertilization (7 hpf, ~2-cell stage) egg transcriptome of the two lowest quality females (>90% mortality at 7 dpf) to that of the highest quality female (~16% mortality at 7 dpf). Forty-three microarray probes were consistently differentially expressed in both low versus high quality egg comparisons (25 higher expressed in low quality eggs, and 18 higher expressed in high quality eggs). The microarray experiment also identified many immune-relevant genes [e.g. interferon (IFN) pathway genes ifngr1 and ifrd1] that were highly expressed in eggs of all 3 females regardless of quality. Twelve of the 43 candidate egg quality-associated genes, and ifngr1, ifrd1 and irf7, were included in a qPCR study with 7 hpf eggs from all 15 females. Then, the genes that were confirmed by qPCR to be greater than 2-fold differentially expressed between 7 hpf eggs from the lowest and highest quality females (dcbld1, ddc, and acy3 more highly expressed in the 2 lowest quality females; kpna7 and hacd1 more highly expressed in the highest quality female), and the 3 IFN pathway genes, were included in a second qPCR study with unfertilized eggs. While some maternal transcripts included in these qPCR studies were associated with extremes in egg quality, there was little correlation between egg quality and gene expression when all females were considered. 
Both dcbld1 and ddc showed greater than 100-fold differences in transcript expression between females and were potentially influenced by family. The Atlantic cod ddc (dopa decarboxylase) complete cDNA was characterized, and has a 1461 bp open reading frame encoding a 486 amino acid protein that contains all eight residues of the conserved pyridoxal 5'-phosphate binding site including the catalytic lysine. This study provides valuable new information and resources related to the Atlantic cod egg transcriptome. Some of these microarray-identified, qPCR-confirmed, Atlantic cod egg transcripts (e.g. ddc, kpna7) play important roles during embryonic development of other vertebrate species, and may have similar functions in Atlantic cod.
An, Yu; Duan, Wenyuan; Huang, Guoying; Chen, Xiaoli; Li, Li; Nie, Chenxia; Hou, Jia; Gui, Yonghao; Wu, Yiming; Zhang, Feng; Shen, Yiping; Wu, Bailin; Wang, Hongyan
2016-01-08
Ventricular septal defects (VSDs) constitute the most prevalent congenital heart disease (CHD) and occur either in isolation (isolated VSD) or in combination with other cardiac defects (complex VSD). Copy number variation (CNV) has been highlighted as a possible contributing factor to the etiology of many congenital diseases. However, little is known concerning the involvement of CNVs in either isolated or complex VSDs. We analyzed 154 unrelated Chinese individuals with VSD by chromosomal microarray analysis. The subjects were recruited from four hospitals across China. Each case underwent clinical assessment to define the type of VSD, either isolated or complex VSD. CNVs detected were categorized into syndrome-related CNVs, recurrent CNVs and rare CNVs. Genes encompassed by the CNVs were analyzed using enrichment and pathway analysis. Among 154 probands, we identified 29 rare CNVs in 26 VSD patients (16.9%, 26/154) and 8 syndrome-related CNVs in 8 VSD patients (5.2%, 8/154). 12 of the detected 29 rare CNVs (41.3%) were recurrently reported in the DECIPHER or ISCA database as associated with either VSD or general heart disease. Fifteen genes (5%, 15/285) within CNVs were associated with a broad spectrum of complicated CHD. Among these 15 genes, 7 genes were in "abnormal interventricular septum morphology" derived from the MGI (mouse genome informatics) database, and nine genes were associated with cardiovascular system development (GO:0072538). We also found that these VSD-related candidate genes are enriched in chromatin binding and transcription regulation, which are the biological processes underlying heart development. Our study demonstrates the potential clinical diagnostic utility of genomic imbalance profiling in VSD patients. Additionally, gene enrichment and pathway analysis helped us to implicate VSD-related candidate genes.
Casel, Pierrot; Moreews, François; Lagarrigue, Sandrine; Klopp, Christophe
2009-07-16
Microarrays are a powerful technology for monitoring tens of thousands of genes in a single experiment. Most microarrays now use oligo-sets. The design of the oligonucleotides is time-consuming and error-prone. Genome-wide microarray oligo-sets are designed using as large a set of transcripts as possible in order to monitor as many genes as possible. Depending on the state of genome sequencing and assembly, the knowledge of the existing transcripts can be very different. This knowledge evolves with the different genome builds and gene builds. Once the design is done, the microarrays are often used for several years. The biologists working in EADGENE expressed the need for up-to-date annotation files for the oligo-sets they share, including information about the orthologous genes of model species, the Gene Ontology, the corresponding pathways and the chromosomal location. The results of SigReannot on a chicken microarray used in the EADGENE project, compared to the initial annotations, show that 23% of the oligonucleotide gene annotations were not confirmed, 2% were modified and 1% were added. The value of this up-to-date annotation procedure is demonstrated through the analysis of previously published real data. SigReannot uses the oligonucleotide design procedure criteria to validate the probe-gene link and the Ensembl transcripts as reference for annotation. It therefore produces a high quality annotation based on reference gene sets.
Study of hepatitis B virus gene mutations with enzymatic colorimetry-based DNA microarray.
Mao, Hailei; Wang, Huimin; Zhang, Donglei; Mao, Hongju; Zhao, Jianlong; Shi, Jian; Cui, Zhichu
2006-01-01
To establish a modified microarray method for detecting HBV gene mutations in the clinic. Site-specific oligonucleotide probes were immobilized to microarray slides and hybridized to biotin-labeled HBV gene fragments amplified from two-step PCR. Hybridized targets were transferred to nitrocellulose membranes, followed by intensity measurement using BCIP/NBT colorimetry. HBV genes from 99 Hepatitis B patients and 40 healthy blood donors were analyzed. Mutation frequencies of HBV pre-core/core and basic core promoter (BCP) regions were found to be significantly higher in the patient group (42%, 40% versus 2.5%, 5%, P < 0.01). Compared with a traditional fluorescence method, the colorimetry method exhibited the same level of sensitivity and reproducibility. An enzymatic colorimetry-based DNA microarray assay was successfully established to monitor HBV mutations. Pre-core/core and BCP mutations of HBV genes could be major causes of HBV infection in HBeAg-negative patients and could also be relevant to chronicity and aggravation of hepatitis B.
Clustering gene expression data based on predicted differential effects of GV interaction.
Pan, Hai-Yan; Zhu, Jun; Han, Dan-Fu
2005-02-01
Microarray has become a popular biotechnology in biological and medical research. However, systematic and stochastic variabilities in microarray data are expected and unavoidable, resulting in the problem that the raw measurements have inherent "noise" within microarray experiments. Currently, logarithmic ratios are usually analyzed by various clustering methods directly, which may introduce biased interpretation in identifying groups of genes or samples. In this paper, a statistical method based on mixed model approaches is proposed for microarray data cluster analysis. The underlying rationale of this method is to partition the observed total gene expression level into various variations caused by different factors using an ANOVA model, and to predict the differential effects of GV (gene by variety) interaction using the adjusted unbiased prediction (AUP) method. The predicted GV interaction effects can then be used as the inputs of cluster analysis. We illustrated the application of our method with a gene expression dataset and elucidated the utility of our approach using an external validation.
The objective of this study is to develop a microarray to test for cyanobacteria and cyanotoxin genes in drinking water reservoirs as an aid to risk assessment and management of water supplies. The microarray will include probes recognizing important freshwater cyanobacterial tax...
Controlling false-negative errors in microarray differential expression analysis: a PRIM approach.
Cole, Steve W; Galic, Zoran; Zack, Jerome A
2003-09-22
Theoretical considerations suggest that current microarray screening algorithms may fail to detect many true differences in gene expression (Type II analytic errors). We assessed 'false negative' error rates in differential expression analyses by conventional linear statistical models (e.g. t-test), microarray-adapted variants (e.g. SAM, Cyber-T), and a novel strategy based on hold-out cross-validation. The latter approach employs the machine-learning algorithm Patient Rule Induction Method (PRIM) to infer minimum thresholds for reliable change in gene expression from Boolean conjunctions of fold-induction and raw fluorescence measurements. Monte Carlo analyses based on four empirical data sets show that conventional statistical models and their microarray-adapted variants overlook more than 50% of genes showing significant up-regulation. Conjoint PRIM prediction rules recover approximately twice as many differentially expressed transcripts while maintaining strong control over false-positive (Type I) errors. As a result, experimental replication rates increase and total analytic error rates decline. RT-PCR studies confirm that gene inductions detected by PRIM but overlooked by other methods represent true changes in mRNA levels. PRIM-based conjoint inference rules thus represent an improved strategy for high-sensitivity screening of DNA microarrays. Freestanding JAVA application at http://microarray.crump.ucla.edu/focus
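The "Boolean conjunction of fold-induction and raw fluorescence" mentioned above can be pictured with a short Python sketch. It does not implement PRIM itself: in the abstract the thresholds are inferred by hold-out cross-validation, whereas here `min_fold` and `min_signal` are hypothetical fixed values, and the function name is an assumption.

```python
import numpy as np

def conjoint_call(control, treated, min_fold=2.0, min_signal=200.0):
    """Flag genes as up-regulated only if BOTH conditions hold.

    Hypothetical thresholds: a PRIM-style analysis would learn them from
    the data; fixed values are used here purely for illustration.
    Assumes strictly positive control intensities.
    """
    control = np.asarray(control, dtype=float)
    treated = np.asarray(treated, dtype=float)
    fold = treated / control
    # Conjunction: large enough fold-induction AND bright enough raw signal.
    return (fold >= min_fold) & (treated >= min_signal)

calls = conjoint_call(control=[50, 100, 1000, 400],
                      treated=[150, 300, 1500, 1200])
```

In this toy input the first gene has a 3-fold induction but a dim raw signal, and the third is bright but only 1.5-fold induced, so neither is called; the conjunction is what separates a rule of this kind from a fold-change cutoff alone.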
[Typing and subtyping avian influenza virus using DNA microarrays].
Yang, Zhongping; Wang, Xiurong; Tian, Lina; Wang, Yu; Chen, Hualan
2008-07-01
Outbreaks of highly pathogenic avian influenza (HPAI) virus have caused great economic loss to the poultry industry and resulted in human deaths in Thailand and Vietnam since 2004. Rapid typing and subtyping of viruses, especially HPAI from clinical specimens, are desirable for taking prompt control measures to prevent spreading of the disease. We describe a microarray-based approach to simultaneously detect and subtype avian influenza virus (AIV). We designed primers for the probe genes and used reverse transcriptase PCR to prepare cDNAs of the AIV M gene, the H5, H7 and H9 subtype haemagglutinin genes and the N1 and N2 subtype neuraminidase genes. They were cloned, sequenced, reamplified and spotted to form a glass-bound microarray. We labeled samples with Cy3-dUTP by RT-PCR, then hybridized and scanned the microarrays to type and subtype AIV. The hybridization pattern agreed perfectly with the known grid location of each probe, and no cross-hybridization could be detected. Examination of HA subtypes 1 through 15, 30 infected samples and 21 field samples revealed that the DNA microarray assay was more sensitive and specific than the RT-PCR test and chicken embryo inoculation. It can simultaneously detect and differentiate the main epidemic AIV. The results show that DNA microarray technology is a useful diagnostic method.
Adipose Genes Down-Regulated During Experimental Endotoxemia Are Also Suppressed in Obesity
Hinkle, Christine C.; Haris, Lalarukh; Shah, Rhia; Mehta, Nehal N.; Putt, Mary E.; Reilly, Muredach P.
2012-01-01
Context: Adipose inflammation is a crucial link between obesity and its metabolic complications. Human experimental endotoxemia is a controlled model for the study of inflammatory cardiometabolic responses in vivo. Objective: We hypothesized that adipose genes down-regulated during endotoxemia would approximate changes observed with obesity-related inflammation and reveal novel candidates in cardiometabolic disease. Design, Subjects, and Intervention: Healthy volunteers (n = 14) underwent a 3 ng/kg endotoxin challenge; adipose biopsies were taken at 0, 4, 12, and 24 h for mRNA microarray. A priority list of highly down-regulated and biologically relevant genes was validated by RT-PCR in an independent sample of adipose from healthy subjects (n = 7) undergoing a subclinical 0.6 ng/kg endotoxemia protocol. Expression of validated genes was screened in adipose of lean and severely obese individuals (n = 11 per group), and cellular source was probed in cultured adipocytes and macrophages. Results: Endotoxemia (3 ng/kg) suppressed expression of 353 genes (to <67% of baseline; P < 1 × 10⁻⁵) of which 68 candidates were prioritized for validation. In low-dose (0.6 ng/kg) endotoxin validation, 22 (32%) of these 68 genes were confirmed. Functional classification revealed that many of these genes are involved in cell development and differentiation. Of validated genes, 59% (13 of 22) were down-regulated more than 1.5-fold in primary human adipocytes after treatment with endotoxin. In human macrophages, 59% (13 of 22) were up-regulated during differentiation to inflammatory M1 macrophages whereas 64% (14 of 22) were down-regulated during transition to homeostatic M2 macrophages. Finally, in obese vs. lean adipose, 91% (20 of 22) tended to have reduced expression (χ² = 10.72, P < 0.01) with 50% (11 of 22) reaching P < 0.05 (χ² = 9.28, P < 0.01). 
Conclusions: Exploration of down-regulated mRNA in adipose during human endotoxemia revealed suppression of genes involved in cell development and differentiation. A majority of candidates were also suppressed in endogenous human obesity, suggesting a potential pathophysiological role in human obesity-related adipose inflammation. PMID:22893715
Adipose genes down-regulated during experimental endotoxemia are also suppressed in obesity.
Shah, Rachana; Hinkle, Christine C; Haris, Lalarukh; Shah, Rhia; Mehta, Nehal N; Putt, Mary E; Reilly, Muredach P
2012-11-01
Adipose inflammation is a crucial link between obesity and its metabolic complications. Human experimental endotoxemia is a controlled model for the study of inflammatory cardiometabolic responses in vivo. We hypothesized that adipose genes down-regulated during endotoxemia would approximate changes observed with obesity-related inflammation and reveal novel candidates in cardiometabolic disease. Healthy volunteers (n = 14) underwent a 3 ng/kg endotoxin challenge; adipose biopsies were taken at 0, 4, 12, and 24 h for mRNA microarray. A priority list of highly down-regulated and biologically relevant genes was validated by RT-PCR in an independent sample of adipose from healthy subjects (n = 7) undergoing a subclinical 0.6 ng/kg endotoxemia protocol. Expression of validated genes was screened in adipose of lean and severely obese individuals (n = 11 per group), and cellular source was probed in cultured adipocytes and macrophages. Endotoxemia (3 ng/kg) suppressed expression of 353 genes (to <67% of baseline; P < 1 × 10⁻⁵) of which 68 candidates were prioritized for validation. In low-dose (0.6 ng/kg) endotoxin validation, 22 (32%) of these 68 genes were confirmed. Functional classification revealed that many of these genes are involved in cell development and differentiation. Of validated genes, 59% (13 of 22) were down-regulated more than 1.5-fold in primary human adipocytes after treatment with endotoxin. In human macrophages, 59% (13 of 22) were up-regulated during differentiation to inflammatory M1 macrophages whereas 64% (14 of 22) were down-regulated during transition to homeostatic M2 macrophages. Finally, in obese vs. lean adipose, 91% (20 of 22) tended to have reduced expression (χ² = 10.72, P < 0.01) with 50% (11 of 22) reaching P < 0.05 (χ² = 9.28, P < 0.01). Exploration of down-regulated mRNA in adipose during human endotoxemia revealed suppression of genes involved in cell development and differentiation. 
A majority of candidates were also suppressed in endogenous human obesity, suggesting a potential pathophysiological role in human obesity-related adipose inflammation.
Johnson, A J; Shukle, R H; Chen, M-S; Srivastava, S; Subramanyam, S; Schemerhorn, B J; Weintraub, P G; Abdel Moniem, H E M; Flanders, K L; Buntin, G D; Williams, C E
2015-01-01
Evidence is emerging that some proteins secreted by gall-forming parasites of plants act as effectors responsible for systemic changes in the host plant, such as galling and nutrient tissue formation. A large number of secreted salivary gland proteins (SSGPs) that are the putative effectors responsible for the physiological changes elicited in susceptible seedling wheat by Hessian fly, Mayetiola destructor (Say), larvae have been documented. However, how the genes encoding these candidate effectors might respond under field conditions is unknown. The goal of this study was to use microarray analysis to investigate variation in SSGP transcript abundance amongst field collections from different geographical regions (southeastern USA, central USA, and the Middle East). Results revealed significant variation in SSGP transcript abundance amongst the field collections studied. The field collections separated into three distinct groups that corresponded to the wheat classes grown in the different geographical regions as well as to recently described Hessian fly populations. These data support previous reports correlating Hessian fly population structure with micropopulation differences owing to agro-ecosystem parameters such as cultivation of regionally adapted wheat varieties, deployment of resistance genes and variation in climatic conditions. PMID:25528896
Rapid Characterization of Candidate Biomarkers for Pancreatic Cancer Using Cell Microarrays (CMAs)
Kim, Min-Sik; Kuppireddy, Sarada V.; Sakamuri, Sruthi; Singal, Mukul; Getnet, Derese; Harsha, H. C.; Goel, Renu; Balakrishnan, Lavanya; Jacob, Harrys K. C.; Kashyap, Manoj K.; Tankala, Shantal G.; Maitra, Anirban; Iacobuzio-Donahue, Christine A.; Jaffee, Elizabeth; Goggins, Michael G.; Velculescu, Victor E.; Hruban, Ralph H.; Pandey, Akhilesh
2013-01-01
Tissue microarrays have become a valuable tool for high-throughput analysis using immunohistochemical labeling. However, the large majority of biochemical studies are carried out in cell lines to further characterize candidate biomarkers or therapeutic targets with subsequent studies in animals or using primary tissues. Thus, cell line-based microarrays could be a useful screening tool in some situations. Here, we constructed a cell microarray (CMA) containing a panel of 40 pancreatic cancer cell lines available from American Type Culture Collection in addition to those locally available at Johns Hopkins. As proof of principle, we performed immunocytochemical labeling of an epithelial cell adhesion molecule (Ep-CAM), a molecule generally expressed in the epithelium, on this pancreatic cancer CMA. In addition, selected molecules that have been previously shown to be differentially expressed in pancreatic cancer in the literature were validated. For example, we observed strong labeling of CA19-9 antigen, a prognostic and predictive marker for pancreatic cancer. We also carried out a bioinformatics analysis of a literature curated catalog of pancreatic cancer biomarkers developed previously by our group and identified two candidate biomarkers, HLA class I and transmembrane protease, serine 4 (TMPRSS4), and examined their expression in the cell lines represented on the pancreatic cancer CMAs. Our results demonstrate the utility of CMAs as a useful resource for rapid screening of molecules of interest and suggest that CMAs can become a universal standard platform in cancer research. PMID:22985314
USDA-ARS's Scientific Manuscript database
Puccinia striiformis f. sp. tritici (Pst) causes stripe rust, one of the most important diseases of wheat worldwide. To identify Pst genes involved in infection and sporulation, a custom oligonucleotide Genechip was made using sequences of 442 genes selected from Pst cDNA libraries. Microarray analy...
Lee, Sang Kil; Kim, Hyo Jong; Chi, Sung Gil
2010-01-01
Saccharomyces boulardii has been reported to be beneficial in the treatment of inflammatory bowel disease. The aim of this work was to evaluate the effect of S. boulardii in a mouse model of 2,4,6-trinitrobenzene sulfonic acid (TNBS)-induced colitis and to analyze gene expression in S. boulardii-treated mice by microarray. BALB/c mice received TNBS or TNBS and S. boulardii treatment for 4 days. Microarray analysis was performed on total mRNA from colon, and histologic evaluation was also performed. In mice treated with S. boulardii, the histological appearance and mortality rate were significantly restored compared with mice receiving only TNBS. Among 330 genes that were altered by both S. boulardii and TNBS (>2-fold), 193 genes were down-regulated by S. boulardii in the microarray. Most of the genes that were down-regulated by S. boulardii were functionally classified as inflammatory and immune response related genes. S. boulardii may reduce colonic inflammation along with regulation of inflammatory and immune responsive genes in TNBS-induced colitis.
NASA Astrophysics Data System (ADS)
Ardaneswari, Gianinna; Bustamam, Alhadi; Sarwinda, Devvi
2017-10-01
A tumor is an abnormal growth of cells that serves no purpose. Carcinoma is a tumor that grows from the top of the cell membrane and the organ, whereas adenoma is a benign tumor of gland-like cells or epithelial tissue. In molecular biology, microarray technology is used to store gene expression data for diseases. For each gene on a microarray, information is stored for each trait or condition. Gene expression data can be clustered with a biclustering algorithm, a method that clusters not only the objects but also their properties or conditions. This research proposes Plaid Model Biclustering as one such biclustering method. In this study, we discuss the implementation of the Plaid Model Biclustering method on microarray gene expression data of carcinoma and adenoma tumors. From the experimental results, we found that three biclusters are formed from the carcinoma gene expression data and four biclusters from the adenoma gene expression data.
Approximate geodesic distances reveal biologically relevant structures in microarray data.
Nilsson, Jens; Fioretos, Thoas; Höglund, Mattias; Fontes, Magnus
2004-04-12
Genome-wide gene expression measurements, as currently determined by the microarray technology, can be represented mathematically as points in a high-dimensional gene expression space. Genes interact with each other in regulatory networks, restricting the cellular gene expression profiles to a certain manifold, or surface, in gene expression space. To obtain knowledge about this manifold, various dimensionality reduction methods and distance metrics are used. For data points distributed on curved manifolds, a sensible distance measure would be the geodesic distance along the manifold. In this work, we examine whether an approximate geodesic distance measure captures biological similarities better than the traditionally used Euclidean distance. We computed approximate geodesic distances, determined by the Isomap algorithm, for one set of lymphoma and one set of lung cancer microarray samples. Compared with the ordinary Euclidean distance metric, this distance measure produced more instructive, biologically relevant, visualizations when applying multidimensional scaling. This suggests the Isomap algorithm as a promising tool for the interpretation of microarray data. Furthermore, the results demonstrate the benefit and importance of taking nonlinearities in gene expression data into account.
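The contrast between Euclidean and approximate geodesic distance described above can be illustrated with a minimal sketch. It assumes the usual Isomap recipe (a k-nearest-neighbour graph followed by shortest paths) and uses toy points on a half circle rather than microarray samples; the function names and the choice of k are hypothetical, and the multidimensional scaling step is omitted.

```python
import math

def euclidean(a, b):
    """Straight-line distance between two points."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def geodesic_distances(points, k):
    """Approximate geodesic distances: k-nearest-neighbour graph + Floyd-Warshall."""
    n = len(points)
    d = [[euclidean(p, q) for q in points] for p in points]
    g = [[math.inf] * n for _ in range(n)]
    for i in range(n):
        g[i][i] = 0.0
        # Skip index 0 of the sorted list, which is the point itself.
        neighbours = sorted(range(n), key=lambda j: d[i][j])[1:k + 1]
        for j in neighbours:
            g[i][j] = d[i][j]
            g[j][i] = d[i][j]  # keep the graph symmetric
    # All-pairs shortest paths along the neighbourhood graph.
    for m in range(n):
        for i in range(n):
            for j in range(n):
                if g[i][m] + g[m][j] < g[i][j]:
                    g[i][j] = g[i][m] + g[m][j]
    return g

# Toy data (an assumption): 21 points on a half circle of radius 1.
points = [(math.cos(math.pi * t / 20), math.sin(math.pi * t / 20)) for t in range(21)]
geo = geodesic_distances(points, k=2)
straight = euclidean(points[0], points[-1])  # cuts across the manifold
along = geo[0][-1]                           # follows the curve
print(straight, along)
```

On this toy manifold the Euclidean distance between the two end points is the diameter, while the graph distance approaches the arc length, which is the kind of difference the abstract attributes to curved structure in expression space.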
Copper homeostasis gene discovery in Drosophila melanogaster.
Norgate, Melanie; Southon, Adam; Zou, Sige; Zhan, Ming; Sun, Yu; Batterham, Phil; Camakaris, James
2007-06-01
Recent studies have shown a high level of conservation between Drosophila melanogaster and mammalian copper homeostasis mechanisms. These studies have also demonstrated the efficiency with which this species can be used to characterize novel genes, at both the cellular and whole organism level. As a versatile and inexpensive model organism, Drosophila is also particularly useful for gene discovery applications and thus has the potential to be extremely useful in identifying novel copper homeostasis genes and putative disease genes. In order to assess the suitability of Drosophila for this purpose, three screening approaches have been investigated. These include an analysis of the global transcriptional response to copper in both adult flies and an embryonic cell line using DNA microarray analysis. Two mutagenesis-based screens were also utilized. Several candidate copper homeostasis genes have been identified through this work. In addition, the results of each screen were carefully analyzed to identify any factors influencing efficiency and sensitivity. These are discussed here with the aim of maximizing the efficiency of future screens and the most suitable approaches are outlined. Building on this information, there is great potential for the further use of Drosophila for copper homeostasis gene discovery.
Gene expression in the liver of rainbow trout, Oncorhynchus mykiss, during the stress response
Momoda, T.S.; Schwindt, A.R.; Feist, G.W.; Gerwick, L.; Bayne, C.J.; Schreck, C.B.
2007-01-01
To better appreciate the mechanisms underlying the physiology of the stress response, an oligonucleotide microarray and real-time RT-PCR (QRT-PCR) were used to study gene expression in the livers of rainbow trout (Oncorhynchus mykiss). For increased confidence in the discovery of candidate genes responding to stress, we conducted two separate experiments using fish from different year classes. In both experiments, fish exposed to a 3 h stressor were compared to control (unstressed) fish. In the second experiment some additional fish were exposed to only 0.5 h of stress and others were sampled 21 h after experiencing a 3 h stressor. This 21 h post-stress treatment was a means to study gene expression during recovery from stress. The genes we report as differentially expressed are those that responded similarly in both experiments, suggesting that they are robust indicators of stress. Those genes are a major histocompatibility complex class 1 molecule (MHC1), JunB, glucose 6-phosphatase (G6Pase), and nuclear protein 1 (Nupr1). Interestingly, Nupr1 gene expression was still elevated 21 h after stress, which indicates that recovery was incomplete at that time.
Radiation Gene-expression Signatures in Primary Breast Cancer Cells.
Minafra, Luigi; Bravatà, Valentina; Cammarata, Francesco P; Russo, Giorgio; Gilardi, Maria C; Forte, Giusi I
2018-05-01
In breast cancer (BC) care, radiation therapy (RT) is an efficient treatment to control localized tumor. Radiobiological research is needed to understand molecular differences that affect radiosensitivity of different tumor subtypes and the response variability. The aim of this study was to analyze gene expression profiling (GEP) in primary BC cells following irradiation with doses of 9 Gy and 23 Gy delivered by intraoperative electron radiation therapy (IOERT) in order to define gene signatures of response to high doses of ionizing radiation. We performed GEP by cDNA microarrays and evaluated cell survival after IOERT treatment in primary BC cell cultures. Real-time quantitative reverse transcription polymerase chain reaction (qRT-PCR) was performed to validate candidate genes. We showed, for the first time, a 4-gene and a 6-gene signature, as new molecular biomarkers, in two primary BC cell cultures after exposure at 9 Gy and 23 Gy respectively, for which we observed a significantly high survival rate. Gene signatures activated by different doses of ionizing radiation may predict response to RT and contribute to defining a personalized biological-driven treatment plan. Copyright© 2018, International Institute of Anticancer Research (Dr. George J. Delinasios), All rights reserved.
Robust gene selection methods using weighting schemes for microarray data analysis.
Kang, Suyeon; Song, Jongwoo
2017-09-02
A common task in microarray data analysis is to identify informative genes that are differentially expressed between two different states. Owing to the high-dimensional nature of microarray data, identification of significant genes has been essential in analyzing the data. However, the performances of many gene selection techniques are highly dependent on the experimental conditions, such as the presence of measurement error or a limited number of sample replicates. We have proposed new filter-based gene selection techniques, by applying a simple modification to significance analysis of microarrays (SAM). To prove the effectiveness of the proposed method, we considered a series of synthetic datasets with different noise levels and sample sizes along with two real datasets. The following findings were made. First, our proposed methods outperform conventional methods for all simulation set-ups. In particular, our methods are much better when the given data are noisy and sample size is small. They showed relatively robust performance regardless of noise level and sample size, whereas the performance of SAM became significantly worse as the noise level became high or sample size decreased. When sufficient sample replicates were available, SAM and our methods showed similar performance. Finally, our proposed methods are competitive with traditional methods in classification tasks for microarrays. The results of simulation study and real data analysis have demonstrated that our proposed methods are effective for detecting significant genes and classification tasks, especially when the given data are noisy or have few sample replicates. By employing weighting schemes, we can obtain robust and reliable results for microarray data analysis.
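As a point of reference for the comparison above, the standard SAM relative difference can be sketched as follows. This shows only the baseline statistic, not the weighted modification proposed in the abstract, whose details are not given here. The fudge factor `s0` is passed in as a fixed constant, which is a simplifying assumption (SAM normally estimates it from the data), and the function name is hypothetical.

```python
import math

def sam_statistic(group_a, group_b, s0):
    """SAM-style relative difference d = (mean_a - mean_b) / (s + s0) for one gene."""
    n1, n2 = len(group_a), len(group_b)
    m1 = sum(group_a) / n1
    m2 = sum(group_b) / n2
    # Pooled sum of squared deviations across both groups.
    ss = sum((x - m1) ** 2 for x in group_a) + sum((x - m2) ** 2 for x in group_b)
    s = math.sqrt((1 / n1 + 1 / n2) * ss / (n1 + n2 - 2))
    return (m1 - m2) / (s + s0)

# Illustrative values (assumptions): a gene with a tiny difference and tiny variance.
a = [1.00, 1.01, 0.99]
b = [0.98, 0.99, 0.97]
d_plain = sam_statistic(a, b, s0=0.0)   # behaves like an ordinary t-statistic
d_fudged = sam_statistic(a, b, s0=0.1)  # the fudge factor damps low-variance genes
print(d_plain, d_fudged)
```

With few replicates, the variance estimate in the denominator is itself noisy, which is one plausible reading of why the abstract reports SAM degrading at small sample sizes.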
Honoré, Paul; Granjeaud, Samuel; Tagett, Rebecca; Deraco, Stéphane; Beaudoing, Emmanuel; Rougemont, Jacques; Debono, Stéphane; Hingamp, Pascal
2006-09-20
High throughput gene expression profiling (GEP) is becoming a routine technique in life science laboratories. With experimental designs that repeatedly span thousands of genes and hundreds of samples, relying on a dedicated database infrastructure is no longer an option. GEP technology is a fast moving target, with new approaches constantly broadening the field diversity. This technology heterogeneity, compounded by the informatics complexity of GEP databases, means that software developments have so far focused on mainstream techniques, leaving less typical yet established techniques such as Nylon microarrays at best partially supported. MAF (MicroArray Facility) is the laboratory database system we have developed for managing the design, production and hybridization of spotted microarrays. Although it can support the widely used glass microarrays and oligo-chips, MAF was designed with the specific idiosyncrasies of Nylon based microarrays in mind. Notably single channel radioactive probes, microarray stripping and reuse, vector control hybridizations and spike-in controls are all natively supported by the software suite. MicroArray Facility is MIAME supportive and dynamically provides feedback on missing annotations to help users estimate effective MIAME compliance. Genomic data such as clone identifiers and gene symbols are also directly annotated by MAF software using standard public resources. The MAGE-ML data format is implemented for full data export. Journalized database operations (audit tracking), data anonymization, material traceability and user/project level confidentiality policies are also managed by MAF. MicroArray Facility is a complete data management system for microarray producers and end-users. Particular care has been devoted to adequately model Nylon based microarrays. The MAF system, developed and implemented in both private and academic environments, has proved a robust solution for shared facilities and industry service providers alike.
Honoré, Paul; Granjeaud, Samuel; Tagett, Rebecca; Deraco, Stéphane; Beaudoing, Emmanuel; Rougemont, Jacques; Debono, Stéphane; Hingamp, Pascal
2006-01-01
Background: High throughput gene expression profiling (GEP) is becoming a routine technique in life science laboratories. With experimental designs that repeatedly span thousands of genes and hundreds of samples, relying on a dedicated database infrastructure is no longer an option. GEP technology is a fast moving target, with new approaches constantly broadening the field diversity. This technology heterogeneity, compounded by the informatics complexity of GEP databases, means that software developments have so far focused on mainstream techniques, leaving less typical yet established techniques such as Nylon microarrays at best partially supported. Results: MAF (MicroArray Facility) is the laboratory database system we have developed for managing the design, production and hybridization of spotted microarrays. Although it can support the widely used glass microarrays and oligo-chips, MAF was designed with the specific idiosyncrasies of Nylon based microarrays in mind. Notably single channel radioactive probes, microarray stripping and reuse, vector control hybridizations and spike-in controls are all natively supported by the software suite. MicroArray Facility is MIAME supportive and dynamically provides feedback on missing annotations to help users estimate effective MIAME compliance. Genomic data such as clone identifiers and gene symbols are also directly annotated by MAF software using standard public resources. The MAGE-ML data format is implemented for full data export. Journalized database operations (audit tracking), data anonymization, material traceability and user/project level confidentiality policies are also managed by MAF. Conclusion: MicroArray Facility is a complete data management system for microarray producers and end-users. Particular care has been devoted to adequately model Nylon based microarrays. The MAF system, developed and implemented in both private and academic environments, has proved a robust solution for shared facilities and industry service providers alike. PMID:16987406
The Microarray Revolution: Perspectives from Educators
ERIC Educational Resources Information Center
Brewster, Jay L.; Beason, K. Beth; Eckdahl, Todd T.; Evans, Irene M.
2004-01-01
In recent years, microarray analysis has become a key experimental tool, enabling the analysis of genome-wide patterns of gene expression. This review approaches the microarray revolution with a focus upon four topics: 1) the early development of this technology and its application to cancer diagnostics; 2) a primer of microarray research,…
DOE Office of Scientific and Technical Information (OSTI.GOV)
Andersen, G.L.; He, Z.; DeSantis, T.Z.
Microarrays have proven to be a useful and high-throughput method to provide targeted DNA sequence information for up to many thousands of specific genetic regions in a single test. A microarray consists of multiple DNA oligonucleotide probes that, under high stringency conditions, hybridize only to specific complementary nucleic acid sequences (targets). A fluorescent signal indicates the presence and, in many cases, the abundance of genetic regions of interest. In this chapter we will look at how microarrays are used in microbial ecology, especially with the recent increase in microbial community DNA sequence data. Of particular interest to microbial ecologists, phylogenetic microarrays are used for the analysis of phylotypes in a community and functional gene arrays are used for the analysis of functional genes, and, by inference, phylotypes in environmental samples. A phylogenetic microarray that has been developed by the Andersen laboratory, the PhyloChip, will be discussed as an example of a microarray that targets the known diversity within the 16S rRNA gene to determine microbial community composition. Using multiple, confirmatory probes to increase the confidence of detection and a mismatch probe for every perfect match probe to minimize the effect of cross-hybridization by non-target regions, the PhyloChip is able to simultaneously identify any of thousands of taxa present in an environmental sample. The PhyloChip is shown to reveal greater diversity within a community than rRNA gene sequencing due to the placement of the entire gene product on the microarray compared with the analysis of up to thousands of individual molecules by traditional sequencing methods. A functional gene array that has been developed by the Zhou laboratory, the GeoChip, will be discussed as an example of a microarray that dynamically identifies functional activities of multiple members within a community. The recent version of GeoChip contains more than 24,000 50mer oligonucleotide probes and covers more than 10,000 gene sequences in 150 gene categories involved in carbon, nitrogen, sulfur, and phosphorus cycling, metal resistance and reduction, and organic contaminant degradation. GeoChip can be used as a generic tool for microbial community analysis, and also link microbial community structure to ecosystem functioning. Examples of the application of both arrays in different environmental samples will be described in the two subsequent sections.
Issues in the analysis of oligonucleotide tiling microarrays for transcript mapping
NASA Technical Reports Server (NTRS)
Royce, Thomas E.; Rozowsky, Joel S.; Bertone, Paul; Samanta, Manoj; Stolc, Viktor; Weissman, Sherman; Snyder, Michael; Gerstein, Mark
2005-01-01
Traditional microarrays use probes complementary to known genes to quantitate the differential gene expression between two or more conditions. Genomic tiling microarray experiments differ in that probes that span a genomic region at regular intervals are used to detect the presence or absence of transcription. This difference means the same sets of biases and the methods for addressing them are unlikely to be relevant to both types of experiment. We introduce the informatics challenges arising in the analysis of tiling microarray experiments as open problems to the scientific community and present initial approaches for the analysis of this nascent technology.
Wolff, Alexander; Bayerlová, Michaela; Gaedcke, Jochen; Kube, Dieter; Beißbarth, Tim
2018-01-01
Pipeline comparisons for gene expression data are highly valuable for applied real data analyses, as they enable the selection of suitable analysis strategies for the dataset at hand. Such pipelines for RNA-Seq data should include mapping of reads, counting and differential gene expression analysis or preprocessing, normalization and differential gene expression in case of microarray analysis, in order to give a global insight into pipeline performances. Four commonly used RNA-Seq pipelines (STAR/HTSeq-Count/edgeR, STAR/RSEM/edgeR, Sailfish/edgeR, TopHat2/Cufflinks/CuffDiff) were investigated on multiple levels (alignment and counting) and cross-compared with the microarray counterpart on the level of gene expression and gene ontology enrichment. For these comparisons we generated two matched microarray and RNA-Seq datasets: Burkitt Lymphoma cell line data and rectal cancer patient data. The overall mapping rate of STAR was 98.98% for the cell line dataset and 98.49% for the patient dataset. TopHat2's overall mapping rate was 97.02% and 96.73%, respectively, while Sailfish had only an overall mapping rate of 84.81% and 54.44%. The correlation of gene expression in microarray and RNA-Seq data was moderately worse for the patient dataset (ρ = 0.67-0.69) than for the cell line dataset (ρ = 0.87-0.88). The exception was Cufflinks, whose correlation results were substantially lower (ρ = 0.21-0.29 and 0.34-0.53). For both datasets we identified very low numbers of differentially expressed genes using the microarray platform. For RNA-Seq we checked the agreement of differentially expressed genes identified in the different pipelines and of GO-term enrichment results. In conclusion, the combination of STAR aligner with HTSeq-Count followed by STAR aligner with RSEM and Sailfish generated differentially expressed genes best suited for the dataset at hand and in agreement with most of the other transcriptomics pipelines.
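The cross-platform correlations reported above can be illustrated with a small sketch. The abstract does not say which correlation coefficient ρ denotes, so the use of Spearman rank correlation here is an assumption; the function names and the toy expression values are hypothetical as well.

```python
import math

def average_ranks(values):
    """Rank values from 1..n, giving tied values the average of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks

def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    vx = sum((a - mx) ** 2 for a in x)
    vy = sum((b - my) ** 2 for b in y)
    return cov / math.sqrt(vx * vy)

def spearman(x, y):
    """Spearman rho: Pearson correlation computed on the ranks."""
    return pearson(average_ranks(x), average_ranks(y))

# Toy per-gene expression values (assumptions) on two platforms.
microarray = [1, 2, 3, 4, 5]
rnaseq = [2, 1, 4, 3, 5]
rho = spearman(microarray, rnaseq)
print(rho)
```

A rank-based coefficient is a common choice for this comparison because microarray intensities and RNA-Seq counts live on different scales.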
Henry, Ellen C.; Welle, Stephen L.; Gasiewicz, Thomas A.
2010-01-01
The aryl hydrocarbon receptor (AhR), a ligand-dependent transcription factor, mediates toxicity of several classes of xenobiotics and also has important physiological roles in differentiation, reproduction, and immunity, although the endogenous ligand(s) mediating these functions is/are as yet unidentified. One candidate endogenous ligand, 2-(1′H-indolo-3′-carbonyl)-thiazole-4-carboxylic acid methyl ester (ITE), is a potent AhR agonist in vitro, activates the murine AhR in vivo, but does not induce toxicity. We hypothesized that ITE and the toxic ligand, 2,3,7,8-tetrachlorodibenzo-p-dioxin (TCDD), may modify transcription of different sets of genes to account for their different toxicity. To test this hypothesis, primary mouse lung fibroblasts were exposed to 0.5μM ITE, 0.2nM TCDD, or vehicle for 4 h, and total gene expression was evaluated using microarrays. After this short-term and low-dose treatment, several hundred genes were changed significantly, and the response to ITE and TCDD was remarkably similar, both qualitatively and quantitatively. Induced gene sets included the expected battery of AhR-dependent xenobiotic-metabolizing enzymes, as well as several sets that reflect the inflammatory role of lung fibroblasts. Real time quantitative RT-qPCR assay of several selected genes confirmed these microarray data and further suggested that there may be kinetic differences in expression between ligands. These data suggest that ITE and TCDD elicit an analogous change in AhR conformation such that the initial transcription response is the same. Furthermore, if the difference in toxicity between TCDD and ITE is mediated by differences in gene expression, then it is likely that secondary changes enabled by the persistent TCDD, but not by the shorter lived ITE, are responsible. PMID:19933214
2011-01-01
Background: Skeletal muscle growth and development from embryo to adult consists of a series of carefully regulated changes in gene expression. Understanding these developmental changes in agriculturally important species is essential to the production of high quality meat products. For example, consumer demand for lean, inexpensive meat products has driven the turkey industry to unprecedented production through intensive genetic selection. However, achievements of increased body weight and muscle mass have been countered by an increased incidence of myopathies and meat quality defects. In a previous study, we developed and validated a turkey skeletal muscle-specific microarray as a tool for functional genomics studies. The goals of the current study were to utilize this microarray to elucidate functional pathways of genes responsible for key events in turkey skeletal muscle development and to compare differences in gene expression between two genetic lines of turkeys. To achieve these goals, skeletal muscle samples were collected at three critical stages in muscle development: 18d embryo (hyperplasia), 1d post-hatch (shift from myoblast-mediated growth to satellite cell-modulated growth by hypertrophy), and 16wk (market age) from two genetic lines: a randombred control line (RBC2) maintained without selection pressure, and a line (F) selected from the RBC2 line for increased 16wk body weight. Array hybridizations were performed in two experiments: Experiment 1 directly compared the developmental stages within genetic line, while Experiment 2 directly compared the two lines within each developmental stage. Results: A total of 3474 genes were differentially expressed (false discovery rate; FDR < 0.001) by overall effect of development, while 16 genes were differentially expressed (FDR < 0.10) by overall effect of genetic line. Ingenuity Pathways Analysis was used to group annotated genes into networks, functions, and canonical pathways. The expression of 28 genes involved in extracellular matrix regulation, cell death/apoptosis, and calcium signaling/muscle function, as well as genes with miscellaneous function was confirmed by qPCR. Conclusions: The current study identified gene pathways and uncovered novel genes important in turkey muscle growth and development. Future experiments will focus further on several of these candidate genes and the expression and mechanism of action of their protein products. PMID:21385442
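The false discovery rate thresholds quoted above can be made concrete with a short sketch. The abstract does not name the FDR procedure used, so the Benjamini-Hochberg adjustment below is an assumption, and the function name and p-values are hypothetical.

```python
def benjamini_hochberg(pvalues):
    """Return Benjamini-Hochberg adjusted p-values in the original order."""
    m = len(pvalues)
    order = sorted(range(m), key=lambda i: pvalues[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest p-value down, enforcing monotonicity.
    for rank in range(m, 0, -1):
        idx = order[rank - 1]
        running_min = min(running_min, pvalues[idx] * m / rank)
        adjusted[idx] = running_min
    return adjusted

# Toy per-gene p-values (assumptions).
pvals = [0.01, 0.04, 0.03, 0.005]
adj = benjamini_hochberg(pvals)
significant = [i for i, q in enumerate(adj) if q < 0.03]
print(adj, significant)
```

Genes whose adjusted value falls below the chosen cutoff (0.001 or 0.10 in the abstract) would be reported as differentially expressed under this procedure.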
Khan, Haseeb Ahmad
2004-01-01
The massive surge in the production of microarray data poses a great challenge for proper analysis and interpretation. In recent years numerous computational tools have been developed to extract meaningful interpretation of microarray gene expression data. However, a convenient tool for two-groups comparison of microarray data is still lacking and users have to rely on commercial statistical packages that might be costly and require special skills, in addition to extra time and effort for transferring data from one platform to another. Various statistical methods, including the t-test, analysis of variance, Pearson test and Mann-Whitney U test, have been reported for comparing microarray data, whereas the utilization of the Wilcoxon signed-rank test, which is an appropriate test for two-groups comparison of gene expression data, has largely been neglected in microarray studies. The aim of this investigation was to build an integrated tool, ArraySolver, for colour-coded graphical display and comparison of gene expression data using the Wilcoxon signed-rank test. The results of software validation showed similar outputs with ArraySolver and SPSS for large datasets, whereas the former program appeared to be more accurate for 25 or fewer pairs (n ≤ 25), suggesting its potential application in analysing molecular signatures that usually contain small numbers of genes. The main advantages of ArraySolver are easy data selection, convenient report format, accurate statistics and the familiar Excel platform.
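The Wilcoxon signed-rank statistic mentioned above can be sketched in a few lines. This is not ArraySolver's implementation, which is not described here; it only computes the rank sums for paired measurements, without a p-value, and the function names and toy values are hypothetical.

```python
def average_ranks(values):
    """Rank values from 1..n, giving tied values the average of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks

def wilcoxon_signed_rank(x, y):
    """Return (W+, W-): rank sums of positive and negative paired differences."""
    diffs = [a - b for a, b in zip(x, y) if a != b]  # zero differences are dropped
    ranks = average_ranks([abs(d) for d in diffs])
    w_plus = sum(r for r, d in zip(ranks, diffs) if d > 0)
    w_minus = sum(r for r, d in zip(ranks, diffs) if d < 0)
    return w_plus, w_minus

# Toy paired expression values for five genes (assumptions).
condition_a = [12, 15, 9, 20, 11]
condition_b = [10, 10, 11, 14, 10]
w_plus, w_minus = wilcoxon_signed_rank(condition_a, condition_b)
print(w_plus, w_minus)
```

The smaller of the two rank sums is the usual test statistic; for small numbers of pairs it is compared against an exact distribution rather than a normal approximation, which may relate to the accuracy difference the abstract reports for n ≤ 25.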
2004-01-01
The massive surge in the production of microarray data poses a great challenge for proper analysis and interpretation. In recent years numerous computational tools have been developed to extract meaningful interpretation of microarray gene expression data. However, a convenient tool for two-groups comparison of microarray data is still lacking and users have to rely on commercial statistical packages that might be costly and require special skills, in addition to extra time and effort for transferring data from one platform to another. Various statistical methods, including the t-test, analysis of variance, Pearson test and Mann–Whitney U test, have been reported for comparing microarray data, whereas the utilization of the Wilcoxon signed-rank test, which is an appropriate test for two-groups comparison of gene expression data, has largely been neglected in microarray studies. The aim of this investigation was to build an integrated tool, ArraySolver, for colour-coded graphical display and comparison of gene expression data using the Wilcoxon signed-rank test. The results of software validation showed similar outputs with ArraySolver and SPSS for large datasets, whereas the former program appeared to be more accurate for 25 or fewer pairs (n ≤ 25), suggesting its potential application in analysing molecular signatures that usually contain small numbers of genes. The main advantages of ArraySolver are easy data selection, convenient report format, accurate statistics and the familiar Excel platform. PMID:18629036
Wu, Baolin
2006-02-15
Differential gene expression detection and sample classification using microarray data have received much research interest recently. Owing to the large number of genes p and small number of samples n (p > n), microarray data analysis poses big challenges for statistical analysis. An obvious problem owing to the 'large p small n' is over-fitting. Just by chance, we are likely to find some non-differentially expressed genes that can classify the samples very well. The idea of shrinkage is to regularize the model parameters to reduce the effects of noise and produce reliable inferences. Shrinkage has been successfully applied in microarray data analysis. The SAM statistics proposed by Tusher et al. and the 'nearest shrunken centroid' proposed by Tibshirani et al. are ad hoc shrinkage methods. Both methods are simple, intuitive and prove to be useful in empirical studies. Recently Wu proposed the penalized t/F-statistics with shrinkage by formally using L1-penalized linear regression models for two-class microarray data, showing good performance. In this paper we systematically discuss the use of penalized regression models for analyzing microarray data. We generalize the two-class penalized t/F-statistics proposed by Wu to multi-class microarray data. We formally derive the ad hoc shrunken centroid used by Tibshirani et al. using L1-penalized regression models. We also show that the penalized linear regression models provide a rigorous and unified statistical framework for sample classification and differential gene expression detection.
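The link between L1 penalties and shrinkage can be seen in the simplest one-parameter case. The sketch below is not the penalized t/F-statistic from the paper; it only shows the soft-thresholding operator, which is the minimizer of ½(z − β)² + λ|β| and is the kind of shrinkage applied in nearest shrunken centroids. The function name and the numbers are illustrative assumptions.

```python
def soft_threshold(z, lam):
    """Minimizer of 0.5 * (z - beta)**2 + lam * abs(beta) over beta."""
    if z > lam:
        return z - lam
    if z < -lam:
        return z + lam
    return 0.0  # small effects are shrunk exactly to zero

# Toy per-gene mean differences (assumptions); lam controls the shrinkage.
differences = [3.0, -0.5, -2.5, 0.2]
shrunken = [soft_threshold(z, 1.0) for z in differences]
print(shrunken)
```

Genes whose shrunken value is exactly zero drop out of the model, which is how an L1 penalty performs gene selection and guards against over-fitting when p exceeds n.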
A study of metaheuristic algorithms for high dimensional feature selection on microarray data
NASA Astrophysics Data System (ADS)
Dankolo, Muhammad Nasiru; Radzi, Nor Haizan Mohamed; Sallehuddin, Roselina; Mustaffa, Noorfa Haszlinna
2017-11-01
Microarray systems enable experts to examine gene profiles at the molecular level using machine learning algorithms. This increases the potential for classification and diagnosis of many diseases at the gene expression level. However, numerous difficulties may affect the efficiency of machine learning algorithms, including the vast number of gene features in the original data. Many of these features may be unrelated to the intended analysis. Therefore, feature selection needs to be performed during data pre-processing. Many feature selection algorithms have been developed and applied to microarray data, including metaheuristic optimization algorithms. This paper discusses the application of metaheuristic algorithms for feature selection in microarray datasets. The study reveals that these algorithms yield promising results with limited resources, thereby reducing the computational expense of machine learning algorithms.
Cytokine-related genes and oxidation-related genes detected in preeclamptic placentas.
Lee, Gui Se Ra; Joe, Yoon Seong; Kim, Sa Jin; Shin, Jong Chul
2010-10-01
To investigate cytokine- and oxidation-related genes for preeclampsia using DNA microarray analysis. Placentas were collected from 13 normal pregnancies and 13 patients with preeclampsia. Gene expression was studied using DNA microarray. Among significantly expressed genes, we focused on genes associated with cytokines and oxidation, and the results were confirmed using quantitative real-time polymerase chain reaction (QRT-PCR). In the microarray analysis, 415 of 30,940 genes were altered by ≥2-fold: 121 genes were up-regulated and 294 were down-regulated in preeclamptic placenta. Six cytokine-related genes and 5 oxidation-related genes were found among the 121 up-regulated genes. The cytokine-related genes studied included oncostatin M (OSM), fms-related tyrosine kinase (FLT1) and vascular endothelial growth factor A (VEGFA), and the oxidation-related genes studied included spermine oxidase (SMOX), cytochrome P450, family 26, subfamily A, polypeptide 1 (CYP26A1), and lactate dehydrogenase A (LDHA). By QRT-PCR, placental tissue from patients with preeclampsia showed significantly higher mRNA expression of these six genes than tissue from women with normal pregnancies. DNA microarray analysis is a useful method for simultaneously detecting functionally associated genes in preeclampsia. The cytokine-related genes such as OSM, FLT1 and VEGFA, and the oxidation-related genes such as LDHA, CYP26A1 and SMOX might prove to be the starting point in the elucidation of the pathogenesis of preeclampsia.
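The two-fold cutoff used above can be written as a simple filter. This is a minimal sketch assuming one mean expression value per group and per gene; the function name, gene labels and numbers are hypothetical, and no significance test is included.

```python
import math

def classify_fold_change(case, control, threshold=2.0):
    """Label a gene 'up', 'down' or 'unchanged' by its case/control fold change."""
    log_ratio = math.log2(case / control)
    cutoff = math.log2(threshold)
    if log_ratio >= cutoff:
        return "up"
    if log_ratio <= -cutoff:
        return "down"
    return "unchanged"

# Toy mean intensities (assumptions): (preeclamptic, normal) per gene.
genes = {"gene_a": (400.0, 100.0), "gene_b": (100.0, 400.0), "gene_c": (150.0, 100.0)}
labels = {name: classify_fold_change(case, ctrl) for name, (case, ctrl) in genes.items()}
print(labels)
```

Working on the log2 scale makes up- and down-regulation symmetric: a two-fold increase and a two-fold decrease sit at +1 and −1 respectively.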
Moschen, Sebastian; Bengoa Luoni, Sofia; Paniego, Norma B.; Hopp, H. Esteban; Dosio, Guillermo A. A.
2014-01-01
Cultivated sunflower (Helianthus annuus L.), an important source of edible vegetable oil, shows rapid onset of senescence, which limits production by reducing photosynthetic capacity under specific growing conditions. Carbon for grain filling depends strongly on light interception by green leaf area, which diminishes during grain filling due to leaf senescence. Transcription factors (TFs) regulate the progression of leaf senescence in plants and have been well explored in model systems, but information for many agronomic crops remains limited. Here, we characterize the expression profiles of a set of putative senescence associated genes (SAGs) identified by a candidate gene approach and sunflower microarray expression studies. We examined a time course of sunflower leaves undergoing natural senescence and used quantitative PCR (qPCR) to measure the expression of 11 candidate genes representing the NAC, WRKY, MYB and NF-Y TF families. In addition, we measured physiological parameters such as chlorophyll, total soluble sugars and nitrogen content. The expression of Ha-NAC01, Ha-NAC03, Ha-NAC04, Ha-NAC05 and Ha-MYB01 TFs increased before the remobilization rate increased and therefore, before the appearance of the first physiological symptoms of senescence, whereas Ha-NAC02 expression decreased. In addition, we also examined the trifurcate feed-forward pathway (involving ORE1, miR164, and ETHYLENE INSENSITIVE 2) previously reported for Arabidopsis. We measured transcription of Ha-NAC01 (the sunflower homolog of ORE1) and Ha-EIN2, along with the levels of miR164, in two leaves from different stem positions, and identified differences in transcription between basal and upper leaves. Interestingly, Ha-NAC01 and Ha-EIN2 transcription profiles showed an earlier up-regulation in upper leaves of plants close to maturity, compared with basal leaves of plants at pre-anthesis stages. These results suggest that the H. annuus TFs characterized in this work could play important roles as potential triggers of leaf senescence and thus can be considered putative candidate genes for senescence in sunflower. PMID:25110882
Moschen, Sebastian; Bengoa Luoni, Sofia; Paniego, Norma B; Hopp, H Esteban; Dosio, Guillermo A A; Fernandez, Paula; Heinz, Ruth A
2014-01-01
Cultivated sunflower (Helianthus annuus L.), an important source of edible vegetable oil, shows rapid onset of senescence, which limits production by reducing photosynthetic capacity under specific growing conditions. Carbon for grain filling depends strongly on light interception by green leaf area, which diminishes during grain filling due to leaf senescence. Transcription factors (TFs) regulate the progression of leaf senescence in plants and have been well explored in model systems, but information for many agronomic crops remains limited. Here, we characterize the expression profiles of a set of putative senescence associated genes (SAGs) identified by a candidate gene approach and sunflower microarray expression studies. We examined a time course of sunflower leaves undergoing natural senescence and used quantitative PCR (qPCR) to measure the expression of 11 candidate genes representing the NAC, WRKY, MYB and NF-Y TF families. In addition, we measured physiological parameters such as chlorophyll, total soluble sugars and nitrogen content. The expression of Ha-NAC01, Ha-NAC03, Ha-NAC04, Ha-NAC05 and Ha-MYB01 TFs increased before the remobilization rate increased and therefore, before the appearance of the first physiological symptoms of senescence, whereas Ha-NAC02 expression decreased. In addition, we also examined the trifurcate feed-forward pathway (involving ORE1, miR164, and ethylene insensitive 2) previously reported for Arabidopsis. We measured transcription of Ha-NAC01 (the sunflower homolog of ORE1) and Ha-EIN2, along with the levels of miR164, in two leaves from different stem positions, and identified differences in transcription between basal and upper leaves. Interestingly, Ha-NAC01 and Ha-EIN2 transcription profiles showed an earlier up-regulation in upper leaves of plants close to maturity, compared with basal leaves of plants at pre-anthesis stages. These results suggest that the H. annuus TFs characterized in this work could play important roles as potential triggers of leaf senescence and thus can be considered putative candidate genes for senescence in sunflower.
Best practices for hybridization design in two-colour microarray analysis.
Knapen, Dries; Vergauwen, Lucia; Laukens, Kris; Blust, Ronny
2009-07-01
Two-colour microarrays are a popular platform of choice in gene expression studies. Because two different samples are hybridized on a single microarray, and several microarrays are usually needed in a given experiment, there are many possible ways to combine samples on different microarrays. The actual combination employed is commonly referred to as the 'hybridization design'. Different types of hybridization designs have been developed, all aimed at optimizing the experimental setup for the detection of differentially expressed genes while coping with technical noise. Here, we first provide an overview of the different classes of hybridization designs, discussing their advantages and limitations, and then we illustrate the current trends in the use of different hybridization design types in contemporary research.
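As a rough illustration of what a hybridization design is, the sketch below enumerates sample pairings for three design types that are commonly named in the microarray literature (reference, loop, and dye-swap). The abstract itself does not name these classes, so the function names, the `REF` label, and the convention that each pair is (Cy5 sample, Cy3 sample) are all assumptions made for illustration.

```python
def reference_design(samples, reference="REF"):
    """Every sample is co-hybridized with one common reference sample."""
    return [(sample, reference) for sample in samples]


def loop_design(samples):
    """Each sample is paired with the next one; the last pairs with the first."""
    n = len(samples)
    return [(samples[i], samples[(i + 1) % n]) for i in range(n)]


def dye_swap(design):
    """Repeat every array with the two dye labels exchanged."""
    return design + [(cy3, cy5) for cy5, cy3 in design]


# Each tuple is one array: (sample labelled Cy5, sample labelled Cy3).
samples = ["A", "B", "C", "D"]
ref = reference_design(samples)
loop = loop_design(samples)
swapped = dye_swap(loop)

for name, design in [("reference", ref), ("loop", loop), ("loop + dye swap", swapped)]:
    print(name, len(design), "arrays:", design)
```

With the same number of arrays, the loop design measures every experimental sample twice, whereas the reference design spends half of its measurements on the reference sample. That trade-off is one reason several design classes exist.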
Malinowski, Douglas P
2007-05-01
In recent years, the application of genomic and proteomic technologies to the problem of breast cancer prognosis and the prediction of therapy response have begun to yield encouraging results. Independent studies employing transcriptional profiling of primary breast cancer specimens using DNA microarrays have identified gene expression profiles that correlate with clinical outcome in primary breast biopsy specimens. Recent advances in microarray technology have demonstrated reproducibility, making clinical applications more achievable. In this regard, one such DNA microarray device based upon a 70-gene expression signature was recently cleared by the US FDA for application to breast cancer prognosis. These DNA microarrays often employ at least 70 gene targets for transcriptional profiling and prognostic assessment in breast cancer. The use of PCR-based methods utilizing a small subset of genes has recently demonstrated the ability to predict the clinical outcome in early-stage breast cancer. Furthermore, protein-based immunohistochemistry methods have progressed from using gene clusters and gene expression profiling to smaller subsets of expressed proteins to predict prognosis in early-stage breast cancer. Beyond prognostic applications, DNA microarray-based transcriptional profiling has demonstrated the ability to predict response to chemotherapy in early-stage breast cancer patients. In this review, recent advances in the use of multiple markers for prognosis of disease recurrence in early-stage breast cancer and the prediction of therapy response will be discussed.
Bălăcescu, Loredana; Bălăcescu, O; Crişan, N; Fetica, B; Petruţ, B; Bungărdean, Cătălina; Rus, Meda; Tudoran, Oana; Meurice, G; Irimie, Al; Dragoş, N; Berindan-Neagoe, Ioana
2011-01-01
Prostate cancer is the most common cancer among men in Western populations, with clinical behavior ranging from indolent to metastatic disease. Although many molecules and deregulated pathways are known, the molecular mechanisms involved in the development of prostate cancer are not fully understood. The aim of this study was to explore the molecular variation underlying prostate cancer, based on microarray analysis and bioinformatics approaches. Normal and prostate cancer tissues were collected by macrodissection from prostatectomy specimens. All prostate cancer specimens used in our study were Gleason score 7. Gene expression microarrays (Agilent Technologies) were used for Whole Human Genome evaluation. The bioinformatics and functional analyses were based on Limma and Ingenuity software. The microarray analysis identified 1119 differentially expressed genes between prostate cancer and normal prostate, which were up- or down-regulated at least 2-fold. P-values were adjusted for multiple testing using the Benjamini-Hochberg method with a false discovery rate of 0.01. These genes were analyzed with Ingenuity Pathway Analysis software, and 23 genetic networks were established. Our microarray results provide new information regarding the molecular networks in prostate cancer stratified as Gleason 7. These data highlight gene expression profiles for better understanding of prostate cancer progression.
Stekel, Dov J.; Sarti, Donatella; Trevino, Victor; Zhang, Lihong; Salmon, Mike; Buckley, Chris D.; Stevens, Mark; Pallen, Mark J.; Penn, Charles; Falciani, Francesco
2005-01-01
A key step in the analysis of microarray data is the selection of genes that are differentially expressed. Ideally, such experiments should be properly replicated in order to infer both technical and biological variability, and the data should be subjected to rigorous hypothesis tests to identify the differentially expressed genes. However, in microarray experiments involving the analysis of very large numbers of biological samples, replication is not always practical. Therefore, there is a need for a method to select differentially expressed genes in a rational way from insufficiently replicated data. In this paper, we describe a simple method that uses bootstrapping to generate an error model from a replicated pilot study that can be used to identify differentially expressed genes in subsequent large-scale studies on the same platform, but in which there may be no replicated arrays. The method builds a stratified error model that includes array-to-array variability, feature-to-feature variability and the dependence of error on signal intensity. We apply this model to the characterization of the host response in a model of bacterial infection of human intestinal epithelial cells. We demonstrate the effectiveness of error model based microarray experiments and propose this as a general strategy for a microarray-based screening of large collections of biological samples. PMID:15800204
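The idea of an intensity-stratified error model built by bootstrapping a replicated pilot study can be sketched as follows. This is a simplified illustration and not the authors' published model: the synthetic pilot data, the three equal-sized intensity strata, the 95% threshold, and all function names are assumptions.

```python
import random


def quantile(values, q):
    """Simple empirical quantile (no interpolation)."""
    ordered = sorted(values)
    return ordered[int(q * (len(ordered) - 1))]


def build_error_model(pilot, n_bins=3, n_boot=200, alpha=0.05, seed=0):
    """pilot: list of (mean log intensity, log ratio between replicate arrays).

    Returns one (upper intensity edge, threshold) pair per intensity stratum.
    The threshold is the bootstrap mean of the (1 - alpha) quantile of the
    absolute replicate-to-replicate log ratios in that stratum.
    """
    rng = random.Random(seed)
    ordered = sorted(pilot)
    size = len(ordered) // n_bins
    model = []
    for b in range(n_bins):
        stop = (b + 1) * size if b < n_bins - 1 else len(ordered)
        stratum = ordered[b * size:stop]
        diffs = [abs(ratio) for _, ratio in stratum]
        # Resample with replacement and re-estimate the quantile each time.
        estimates = [
            quantile(rng.choices(diffs, k=len(diffs)), 1 - alpha)
            for _ in range(n_boot)
        ]
        model.append((stratum[-1][0], sum(estimates) / n_boot))
    return model


def is_differential(intensity, log_ratio, model):
    """Flag a feature on an unreplicated array using the pilot error model."""
    for upper_edge, threshold in model:
        if intensity <= upper_edge:
            return abs(log_ratio) > threshold
    return abs(log_ratio) > model[-1][1]  # above the highest pilot intensity


# Synthetic pilot study: replicate noise is assumed larger at low intensity.
data_rng = random.Random(1)
pilot = []
for _ in range(300):
    intensity = data_rng.uniform(6, 14)
    pilot.append((intensity, data_rng.gauss(0, 1.0 / (intensity - 5))))

model = build_error_model(pilot)
for upper_edge, threshold in model:
    print(f"intensity <= {upper_edge:.2f}: |log ratio| threshold {threshold:.3f}")
```

Stratifying by intensity lets the same observed log ratio be judged against a stricter or looser threshold depending on how noisy features of that brightness were in the pilot study.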
2014-01-01
Background Triple negative breast cancer (TNBC) and often basal-like cancers are defined as negative for estrogen receptor, progesterone receptor and Her2 gene expression. Over the past few years an incredible amount of data has been generated defining the molecular characteristics of both cancers. The aim of these studies is to better understand the cancers and identify genes and molecular pathways that might be useful as targeted therapies. In an attempt to contribute to the understanding of basal-like/TNBC, we examined the Gene Expression Omnibus (GEO) public datasets in search of genes that might define basal-like/TNBC. The IL32 gene was identified as a candidate. Findings Analysis of several GEO datasets showed differential expression of IL32 in patient samples previously designated as basal and/or TNBC compared to normal and luminal breast samples. As validation of the GEO results, RNA and protein expression levels were examined using MCF7 and MDA MB231 cell lines and tissue microarrays (TMAs). IL32 gene expression levels were higher in MDA MB231 compared to MCF7. Analysis of TMAs showed 42% of TNBC tissues and 25% of the non-TNBC were positive for IL32, while non-malignant patient samples and all but one hyperplastic tissue sample demonstrated lower levels of IL32 protein expression. Conclusion Data obtained from several publicly available GEO datasets showed overexpression of the IL32 gene in basal-like/TNBC samples compared to normal and luminal samples. In support of these data, analysis of TMA clinical samples demonstrated a particular pattern of IL32 differential expression. Considered together, these data suggest IL32 is a candidate suitable for further study. PMID:25100201
Sobkowiak, Alicja; Jończyk, Maciej; Jarochowska, Emilia; Biecek, Przemysław; Trzcinska-Danielewicz, Joanna; Leipner, Jörg; Fronk, Jan; Sowiński, Paweł
2014-06-01
Maize, despite being thermophilic due to its tropical origin, demonstrates high intraspecific diversity in cold-tolerance. To search for molecular mechanisms of this diversity, transcriptomic response to cold was studied in two inbred lines of contrasting cold-tolerance. Microarray analysis was followed by extensive statistical elaboration of data, literature data mining, and gene ontology-based classification. The lines used had been bred earlier specifically for determination of QTLs for cold-performance of photosynthesis. This allowed direct comparison of present transcriptomic data with the earlier QTL mapping results. Cold-treated (14 h at 8/6 °C) maize seedlings of cold-tolerant ETH-DH7 and cold-sensitive ETH-DL3 lines at V3 stage showed strong, consistent response of the third leaf transcriptome: several thousand probes showed similar, statistically significant change in both lines, while only tens responded differently in the two lines. The most striking difference between the responses of the two lines to cold was the induction of expression of ca. twenty genes encoding membrane/cell wall proteins exclusively in the cold-tolerant ETH-DH7 line. The common response comprised mainly repression of numerous genes related to photosynthesis and induction of genes related to basic biological activity: transcription, regulation of gene expression, protein phosphorylation, cell wall organization. Among the genes showing differential response, several were close to the QTL regions identified in earlier studies with the same inbred lines and associated with biometrical, physiological or biochemical parameters. These transcripts, including two apparently non-protein-coding ones, are particularly attractive candidates for future studies on mechanisms determining divergent cold-tolerance of inbred maize lines.
Yan, Bo; Neilson, Karen M.; Ranganathan, Ramya; Maynard, Thomas; Streit, Andrea; Moody, Sally A.
2014-01-01
Background Six1 plays an important role in the development of several vertebrate organs, including cranial sensory placodes, somites and kidney. Although Six1 mutations cause one form of Branchio-Otic Syndrome (BOS), the responsible gene in many patients has not been identified; genes that act downstream of Six1 are potential BOS candidates. Results We sought to identify novel genes expressed during placode, somite and kidney development by comparing gene expression between control and Six1-expressing ectodermal explants. The expression patterns of 19 of the significantly up-regulated and 11 of the significantly down-regulated genes were assayed from cleavage to larval stages. 28/30 genes are expressed in the otocyst, a structure that is functionally disrupted in BOS, and 26/30 genes are expressed in the nephric mesoderm, a structure that is functionally disrupted in the related Branchio-Otic-Renal (BOR) syndrome. We also identified the chick homologues of 5 genes and show that they have conserved expression patterns. Conclusions Of the 30 genes selected for expression analyses, all are expressed at many of the developmental times and appropriate tissues to be regulated by Six1. Many have the potential to play a role in the disruption of hearing and kidney function seen in BOS/BOR patients. PMID:25403746
A Universal Genome Array and Transcriptome Atlas for Brachypodium Distachyon
DOE Office of Scientific and Technical Information (OSTI.GOV)
Mockler, Todd
Brachypodium distachyon is the premier experimental model grass platform and is related to candidate feedstock crops for bioethanol production. Based on the DOE-JGI Brachypodium Bd21 genome sequence and annotation we designed a whole genome DNA microarray platform. The quality of this array platform is unprecedented due to the exceptional quality of the Brachypodium genome assembly and annotation and the stringent probe selection criteria employed in the design. We worked with members of the international community and the bioinformatics/design team at Affymetrix at all stages in the development of the array. We used the Brachypodium arrays to interrogate the transcriptomes of plants grown in a variety of environmental conditions, including diurnal and circadian light/temperature conditions. We examined the transcriptional responses of Brachypodium seedlings subjected to various abiotic stresses including heat, cold, salt, and high intensity light. We generated a gene expression atlas representing various organs and developmental stages. The results of these efforts including all microarray datasets are published and available at online public databases.
Identification of differentially expressed genes and false discovery rate in microarray studies.
Gusnanto, Arief; Calza, Stefano; Pawitan, Yudi
2007-04-01
To highlight the development in microarray data analysis for the identification of differentially expressed genes, particularly via control of false discovery rate. The emergence of high-throughput technology such as microarrays raises two fundamental statistical issues: multiplicity and sensitivity. We focus on the biological problem of identifying differentially expressed genes. First, multiplicity arises due to testing tens of thousands of hypotheses, rendering the standard P value meaningless. Second, known optimal single-test procedures such as the t-test perform poorly in the context of highly multiple tests. The standard approach of dealing with multiplicity is too conservative in the microarray context. The false discovery rate concept is fast becoming the key statistical assessment tool replacing the P value. We review the false discovery rate approach and argue that it is more sensible for microarray data. We also discuss some methods to take into account additional information from the microarrays to improve the false discovery rate. There is growing consensus on how to analyse microarray data using the false discovery rate framework in place of the classical P value. Further research is needed on the preprocessing of the raw data, such as the normalization step and filtering, and on finding the most sensitive test procedure.
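One widely used way to control the false discovery rate is the Benjamini-Hochberg step-up procedure. The abstract does not say which procedure the review favours, so the sketch below is offered only as a common example; the p-values are made up.

```python
def benjamini_hochberg(p_values):
    """Return Benjamini-Hochberg adjusted p-values in the original order."""
    m = len(p_values)
    order = sorted(range(m), key=lambda i: p_values[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest p-value down so adjusted values stay monotone.
    for rank in range(m, 0, -1):
        index = order[rank - 1]
        running_min = min(running_min, p_values[index] * m / rank)
        adjusted[index] = running_min
    return adjusted


# Illustrative p-values for eight genes (not from any real experiment).
p = [0.001, 0.008, 0.039, 0.041, 0.042, 0.06, 0.074, 0.205]
q = benjamini_hochberg(p)
significant = [i for i, value in enumerate(q) if value <= 0.05]
print(significant)  # indices of genes called at a 5% false discovery rate
```

Five of these genes have a raw p-value below 0.05, but only the first two survive at a 5% false discovery rate, which shows how the unadjusted p-value overstates the evidence when many hypotheses are tested.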
Kumar, Mukesh; Rath, Nitish Kumar; Rath, Santanu Kumar
2016-04-01
Microarray-based gene expression profiling has emerged as an efficient technique for classification, prognosis, diagnosis, and treatment of cancer. Frequent changes in the behavior of this disease generate an enormous volume of data. Microarray data satisfies both the veracity and velocity properties of big data, as it keeps changing with time. Therefore, the analysis of microarray datasets in a small amount of time is essential. These datasets often contain a large number of expression measurements, but only a fraction of them correspond to genes that are significantly expressed. The precise identification of genes of interest that are responsible for causing cancer is imperative in microarray data analysis. Most existing schemes employ a two-phase process such as feature selection/extraction followed by classification. In this paper, various statistical methods (tests) based on MapReduce are proposed for selecting relevant features. After feature selection, a MapReduce-based K-nearest neighbor (mrKNN) classifier is also employed to classify microarray data. These algorithms are successfully implemented in a Hadoop framework. A comparative analysis is done on these MapReduce-based models using microarray datasets of various dimensions. From the obtained results, it is observed that these models consume much less execution time than conventional models in processing big data. Copyright © 2016 Elsevier Inc. All rights reserved.
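The map and reduce phases of a K-nearest-neighbour classifier can be sketched in plain Python. This only mimics the pattern on in-memory lists. It does not use Hadoop, and it is not the paper's mrKNN implementation; the function names and the toy two-feature samples are assumptions.

```python
import heapq
from collections import Counter


def distance(a, b):
    """Euclidean distance between two expression vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b)) ** 0.5


def map_phase(split, query, k):
    """Mapper: emit the k nearest (distance, label) pairs within one data split."""
    return heapq.nsmallest(k, ((distance(x, query), label) for x, label in split))


def reduce_phase(partials, k):
    """Reducer: merge local candidates, keep the global k nearest, majority vote."""
    nearest = heapq.nsmallest(k, (pair for part in partials for pair in part))
    return Counter(label for _, label in nearest).most_common(1)[0][0]


def classify(splits, query, k=3):
    return reduce_phase([map_phase(split, query, k) for split in splits], k)


# Three data splits, as if stored on three nodes (toy values).
splits = [
    [((1.0, 1.2), "normal"), ((0.8, 1.0), "normal")],
    [((3.1, 2.9), "tumor"), ((3.0, 3.2), "tumor")],
    [((2.9, 3.0), "tumor"), ((1.1, 0.9), "normal")],
]
prediction = classify(splits, (2.8, 3.1))
print(prediction)
```

Each mapper only needs its own split and returns at most k candidates, so the reducer merges a small list regardless of how large the full dataset is.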
Because of its ability to provide a "snap-shot" view of expression of large number of genes simultaneously, the microarray technology may be a useful tool to uncover new mechanisms of toxicity. This proposal will use the state-of-the-art gene microarrays and a new bioinformatic t...
Uddin, Raihan; Singh, Shiva M.
2017-01-01
As humans age many suffer from a decrease in normal brain functions including spatial learning impairments. This study aimed to better understand the molecular mechanisms in age-associated spatial learning impairment (ASLI). We used a mathematical modeling approach implemented in Weighted Gene Co-expression Network Analysis (WGCNA) to create and compare gene network models of young (learning unimpaired) and aged (predominantly learning impaired) brains from a set of exploratory datasets in rats in the context of ASLI. The major goal was to overcome some of the limitations previously observed in the traditional meta- and pathway analysis using these data, and identify novel ASLI related genes and their networks based on co-expression relationship of genes. This analysis identified a set of network modules in the young, each of which is highly enriched with genes functioning in broad but distinct GO functional categories or biological pathways. Interestingly, the analysis pointed to a single module that was highly enriched with genes functioning in “learning and memory” related functions and pathways. Subsequent differential network analysis of this “learning and memory” module in the aged (predominantly learning impaired) rats compared to the young learning unimpaired rats allowed us to identify a set of novel ASLI candidate hub genes. Some of these genes show significant repeatability in networks generated from independent young and aged validation datasets. These hub genes are highly co-expressed with other genes in the network, which not only show differential expression but also differential co-expression and differential connectivity across age and learning impairment. The known function of these hub genes indicate that they play key roles in critical pathways, including kinase and phosphatase signaling, in functions related to various ion channels, and in maintaining neuronal integrity relating to synaptic plasticity and memory formation. 
Taken together, they provide a new insight and generate new hypotheses into the molecular mechanisms responsible for age associated learning impairment, including spatial learning. PMID:29066959
Uddin, Raihan; Singh, Shiva M
2017-01-01
As humans age many suffer from a decrease in normal brain functions including spatial learning impairments. This study aimed to better understand the molecular mechanisms in age-associated spatial learning impairment (ASLI). We used a mathematical modeling approach implemented in Weighted Gene Co-expression Network Analysis (WGCNA) to create and compare gene network models of young (learning unimpaired) and aged (predominantly learning impaired) brains from a set of exploratory datasets in rats in the context of ASLI. The major goal was to overcome some of the limitations previously observed in the traditional meta- and pathway analysis using these data, and identify novel ASLI related genes and their networks based on co-expression relationship of genes. This analysis identified a set of network modules in the young, each of which is highly enriched with genes functioning in broad but distinct GO functional categories or biological pathways. Interestingly, the analysis pointed to a single module that was highly enriched with genes functioning in "learning and memory" related functions and pathways. Subsequent differential network analysis of this "learning and memory" module in the aged (predominantly learning impaired) rats compared to the young learning unimpaired rats allowed us to identify a set of novel ASLI candidate hub genes. Some of these genes show significant repeatability in networks generated from independent young and aged validation datasets. These hub genes are highly co-expressed with other genes in the network, which not only show differential expression but also differential co-expression and differential connectivity across age and learning impairment. The known function of these hub genes indicate that they play key roles in critical pathways, including kinase and phosphatase signaling, in functions related to various ion channels, and in maintaining neuronal integrity relating to synaptic plasticity and memory formation. 
Taken together, they provide a new insight and generate new hypotheses into the molecular mechanisms responsible for age associated learning impairment, including spatial learning.
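The core of a weighted co-expression network is a soft-thresholded correlation matrix, from which per-gene connectivity (and hence candidate hub genes) can be read off. The sketch below shows that step for an unsigned network in plain Python. It is not the WGCNA R package, and the gene names, expression values, and the exponent `beta=6` are illustrative assumptions.

```python
from math import sqrt


def pearson(x, y):
    """Pearson correlation of two equally long expression profiles."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / sqrt(var_x * var_y)


def adjacency(expr, beta=6):
    """Unsigned soft-threshold adjacency: a_ij = |cor(i, j)| ** beta."""
    genes = list(expr)
    return {
        g: {h: abs(pearson(expr[g], expr[h])) ** beta for h in genes if h != g}
        for g in genes
    }


def connectivity(adj):
    """Connectivity of a gene: sum of its adjacencies to all other genes."""
    return {gene: sum(row.values()) for gene, row in adj.items()}


# Toy expression profiles across five samples (hypothetical values).
expr = {
    "geneA": [1.0, 2.0, 3.0, 4.0, 5.0],
    "geneB": [1.1, 2.1, 2.9, 4.2, 4.8],
    "geneC": [2.0, 4.1, 6.0, 8.1, 9.9],
    "geneD": [3.0, 1.0, 4.0, 1.0, 5.0],
}
k = connectivity(adjacency(expr))
hub = max(k, key=k.get)
print(hub, {gene: round(value, 3) for gene, value in k.items()})
```

Computing connectivity separately for young and aged samples and comparing the two values per gene gives a simple form of the differential connectivity mentioned in the abstract.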
Stochastic models for inferring genetic regulation from microarray gene expression data.
Tian, Tianhai
2010-03-01
Microarray expression profiles are inherently noisy and many different sources of variation exist in microarray experiments. It is still a significant challenge to develop stochastic models to realize noise in microarray expression profiles, which has profound influence on the reverse engineering of genetic regulation. Using the target genes of the tumour suppressor gene p53 as the test problem, we developed stochastic differential equation models and established the relationship between the noise strength of stochastic models and parameters of an error model for describing the distribution of the microarray measurements. Numerical results indicate that the simulated variance from stochastic models with a stochastic degradation process can be represented by a monomial in terms of the hybridization intensity and the order of the monomial depends on the type of stochastic process. The developed stochastic models with multiple stochastic processes generated simulations whose variance is consistent with the prediction of the error model. This work also established a general method to develop stochastic models from experimental information. 2009 Elsevier Ireland Ltd. All rights reserved.
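To make the notion of a stochastic degradation process concrete, the sketch below integrates a one-gene stochastic differential equation with the Euler-Maruyama scheme and estimates the ensemble mean and variance. The equation form (constant synthesis, linear degradation, noise proportional to the expression level) and every parameter value are assumptions chosen for illustration; they are not the models or p53 target-gene parameters from the paper.

```python
import random
from math import sqrt


def simulate(x0, synthesis, degradation, sigma, dt, steps, rng):
    """Euler-Maruyama for dx = (synthesis - degradation*x) dt - sigma*x dW."""
    x = x0
    for _ in range(steps):
        dw = rng.gauss(0.0, sqrt(dt))  # Wiener increment over one time step
        x += (synthesis - degradation * x) * dt - sigma * x * dw
        x = max(x, 0.0)  # expression levels cannot be negative
    return x


rng = random.Random(42)
finals = [
    simulate(x0=10.0, synthesis=5.0, degradation=0.5, sigma=0.1,
             dt=0.01, steps=1000, rng=rng)
    for _ in range(500)
]
mean = sum(finals) / len(finals)
variance = sum((v - mean) ** 2 for v in finals) / (len(finals) - 1)
print(f"mean {mean:.2f}, variance {variance:.2f}")
```

Because the noise term scales with x, the simulated variance grows with the expression level. Relating that growth to an error model for measured hybridization intensities is the kind of comparison the abstract describes.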
Strakova, Eva; Zikova, Alice; Vohradsky, Jiri
2014-01-01
A computational model of gene expression was applied to a novel test set of microarray time series measurements to reveal regulatory interactions between transcriptional regulators represented by 45 sigma factors and the genes expressed during germination of a prokaryote Streptomyces coelicolor. Using microarrays, the first 5.5 h of the process was recorded in 13 time points, which provided a database of gene expression time series on genome-wide scale. The computational modeling of the kinetic relations between the sigma factors, individual genes and genes clustered according to the similarity of their expression kinetics identified kinetically plausible sigma factor-controlled networks. Using genome sequence annotations, functional groups of genes that were predominantly controlled by specific sigma factors were identified. Using external binding data complementing the modeling approach, specific genes involved in the control of the studied process were identified and their function suggested.
Zinke, Ingo; Schütz, Christina S.; Katzenberger, Jörg D.; Bauer, Matthias; Pankratz, Michael J.
2002-01-01
We have identified genes regulated by starvation and sugar signals in Drosophila larvae using whole-genome microarrays. Based on expression profiles in the two nutrient conditions, they were organized into different categories that reflect distinct physiological pathways mediating sugar and fat metabolism, and cell growth. In the category of genes regulated in sugar-fed, but not in starved, animals, there is an upregulation of genes encoding key enzymes of the fat biosynthesis pathway and a downregulation of genes encoding lipases. The highest and earliest activated gene upon sugar ingestion is sugarbabe, a zinc finger protein that is induced in the gut and the fat body. Identification of potential targets using microarrays suggests that sugarbabe functions to repress genes involved in dietary fat breakdown and absorption. The current analysis provides a basis for studying the genetic mechanisms underlying nutrient signalling. PMID:12426388
The Zur regulon of Corynebacterium glutamicum ATCC 13032
2010-01-01
Background Zinc is considered as an essential element for all living organisms, but it can be toxic at large concentrations. Bacteria therefore tightly regulate zinc metabolism. The Cg2502 protein of Corynebacterium glutamicum was a candidate to control zinc metabolism in this species, since it was classified as metalloregulator of the zinc uptake regulator (Zur) subgroup of the ferric uptake regulator (Fur) family of DNA-binding transcription regulators. Results The cg2502 (zur) gene was deleted in the chromosome of C. glutamicum ATCC 13032 by an allelic exchange procedure to generate the zur-deficient mutant C. glutamicum JS2502. Whole-genome DNA microarray hybridizations and real-time RT-PCR assays comparing the gene expression in C. glutamicum JS2502 with that of the wild-type strain detected 18 genes with enhanced expression in the zur mutant. The expression data were combined with results from cross-genome comparisons of shared regulatory sites, revealing the presence of candidate Zur-binding sites in the mapped promoter regions of five transcription units encoding components of potential zinc ABC-type transporters (cg0041-cg0042/cg0043; cg2911-cg2912-cg2913), a putative secreted protein (cg0040), a putative oxidoreductase (cg0795), and a putative P-loop GTPase of the COG0523 protein family (cg0794). Enhanced transcript levels of the respective genes in C. glutamicum JS2502 were verified by real-time RT-PCR, and complementation of the mutant with a wild-type zur gene reversed the effect of differential gene expression. The zinc-dependent expression of the putative cg0042 and cg2911 operons was detected in vivo with a gfp reporter system. Moreover, the zinc-dependent binding of purified Zur protein to double-stranded 40-mer oligonucleotides containing candidate Zur-binding sites was demonstrated in vitro by DNA band shift assays. 
Conclusion Whole-genome expression profiling and DNA band shift assays demonstrated that Zur directly represses in a zinc-dependent manner the expression of nine genes organized in five transcription units. Accordingly, the Zur (Cg2502) protein is the key transcription regulator for genes involved in zinc homeostasis in C. glutamicum. PMID:20055984
de Abreu Neto, Joao B.; Frei, Michael
2016-01-01
Plants are exposed to a wide range of abiotic stresses (AS), which often occur in combination. Because physiological investigations typically focus on one stress, our understanding of unspecific stress responses remains limited. The plant redox homeostasis, i.e., the production and removal of reactive oxygen species (ROS), may be involved in many environmental stress conditions. Therefore, this study intended to identify genes, which are activated in diverse AS, focusing on ROS-related pathways. We conducted a meta-analysis (MA) of microarray experiments, focusing on rice. Transcriptome data were mined from public databases and fellow researchers, which represented 36 different experiments and investigated diverse AS, including ozone stress, drought, heat, cold, salinity, and mineral deficiencies/toxicities. To overcome the inherent artifacts of different MA methods, data were processed using Fisher, rOP, REM, and product of rank (GeneSelector), and genes identified by most approaches were considered as shared differentially expressed genes (DEGs). Two MA strategies were adopted: first, datasets were separated into shoot, root, and seedling experiments, and these tissues were analyzed separately to identify shared DEGs. Second, shoot and seedling experiments were classed into oxidative stress (OS), i.e., ozone and hydrogen peroxide treatments directly producing ROS in plant tissue, and other AS, in which ROS production is indirect. In all tissues and stress conditions, genes a priori considered as ROS-related were overrepresented among the DEGs, as they represented 4% of all expressed genes but 7–10% of the DEGs. The combined MA approach was substantially more conservative than individual MA methods and identified 1001 shared DEGs in shoots, 837 shared DEGs in root, and 1172 shared DEGs in seedlings. Within the OS and AS groups, 990 and 1727 shared DEGs were identified, respectively. 
In total, 311 genes were shared between OS and AS, including many regulatory genes. Combined co-expression analysis identified among those a cluster of 42 genes, many involved in the photosynthetic apparatus and responsive to drought, iron deficiency, arsenic toxicity, and ozone. Our data demonstrate the importance of redox homeostasis in plant stress responses and the power of MA to identify candidate genes underlying unspecific signaling pathways. PMID:26793229
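Of the meta-analysis methods named above, Fisher's method is the simplest to show: it combines the per-experiment p-values of one gene into a single p-value. The sketch below is a stdlib-only illustration with made-up p-values. It is not the pipeline used in the study, which also combined rOP, REM, and rank-product results.

```python
from math import exp, log


def fisher_combined_p(p_values):
    """Fisher's method: X = -2 * sum(ln p) follows chi-square with 2k d.o.f.

    For an even number of degrees of freedom the chi-square survival
    function has a closed form, so no statistics library is needed.
    """
    k = len(p_values)
    half_x = -sum(log(p) for p in p_values)  # this is X / 2
    term, total = 1.0, 1.0
    for i in range(1, k):
        term *= half_x / i
        total += term
    return exp(-half_x) * total


# One gene measured in three independent experiments (hypothetical p-values).
per_experiment = [0.04, 0.10, 0.20]
combined = fisher_combined_p(per_experiment)
print(f"combined p-value: {combined:.4f}")
```

A gene with moderate evidence in several experiments can reach a small combined p-value even if no single experiment is decisive, which is why results from different combination methods are often cross-checked.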
Analysis and modelling of septic shock microarray data using Singular Value Decomposition.
Allanki, Srinivas; Dixit, Madhulika; Thangaraj, Paul; Sinha, Nandan Kumar
2017-06-01
Because microarrays are a high-throughput technique, enormous amounts of microarray data have been generated, and there is a need for more efficient techniques of analysis in terms of speed and accuracy. Finding the differentially expressed genes based on just fold change and p-value might not extract all the vital biological signals that occur at a lower gene expression level. Besides this, numerous mathematical models have been generated to predict the clinical outcome from microarray data, while very few, if any, aim at predicting the vital genes that are important in disease progression. Such models help a basic researcher narrow down and concentrate on a promising set of genes, which can lead to the discovery of gene-based therapies. In this article, as a first objective, we have used the lesser-known Singular Value Decomposition (SVD) technique to build a microarray data analysis tool that works with gene expression patterns and the intrinsic structure of the data in an unsupervised manner. We have re-analysed microarray data over the clinical course of septic shock from Cazalis et al. (2014) and have shown that our proposed analysis provides additional information compared to the conventional method. As a second objective, we developed a novel mathematical model that predicts a set of vital genes in the disease progression and works by generating samples in the continuum between health and disease, using a simple normal-distribution-based random number generator. We also verify that most of the predicted genes are indeed related to septic shock. Copyright © 2017 Elsevier Inc. All rights reserved.
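As a small illustration of how SVD exposes the dominant expression pattern in a genes-by-time-points matrix, the sketch below finds the leading singular triplet by power iteration in plain Python. It is not the authors' tool. The rank-one toy matrix, the function name, and the naive starting vector are assumptions, and a real analysis would use a full SVD routine from a numerical library.

```python
from math import sqrt


def leading_singular_triplet(matrix, iterations=200):
    """Power iteration on A^T A; returns (sigma, u, v) for the largest singular value."""
    n_rows, n_cols = len(matrix), len(matrix[0])
    # Naive start; assumed not to be orthogonal to the leading right singular vector.
    v = [1.0 / sqrt(n_cols)] * n_cols
    for _ in range(iterations):
        u = [sum(matrix[i][j] * v[j] for j in range(n_cols)) for i in range(n_rows)]
        w = [sum(matrix[i][j] * u[i] for i in range(n_rows)) for j in range(n_cols)]
        norm = sqrt(sum(x * x for x in w))
        v = [x / norm for x in w]
    u = [sum(matrix[i][j] * v[j] for j in range(n_cols)) for i in range(n_rows)]
    sigma = sqrt(sum(x * x for x in u))
    return sigma, [x / sigma for x in u], v


# Rows are genes, columns are time points; every gene follows the same
# temporal pattern (1, 2, 4) scaled by a gene-specific factor.
expression = [
    [1.0, 2.0, 4.0],
    [2.0, 4.0, 8.0],
    [3.0, 6.0, 12.0],
]
sigma, gene_loadings, time_pattern = leading_singular_triplet(expression)
print(round(sigma, 3), [round(x, 3) for x in gene_loadings],
      [round(x, 3) for x in time_pattern])
```

Here `time_pattern` recovers the shared temporal profile and `gene_loadings` ranks genes by how strongly they follow it, without using any class labels.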
Evaluation of artificial time series microarray data for dynamic gene regulatory network inference.
Xenitidis, P; Seimenis, I; Kakolyris, S; Adamopoulos, A
2017-08-07
High-throughput technology like microarrays is widely used in the inference of gene regulatory networks (GRNs). We focused on time series data since we are interested in the dynamics of GRNs and the identification of dynamic networks. We evaluated the amount of information that exists in artificial time series microarray data and the ability of an inference process to produce accurate models based on them. We used dynamic artificial gene regulatory networks in order to create artificial microarray data. Key features that characterize microarray data such as the time separation of directly triggered genes, the percentage of directly triggered genes and the triggering function type were altered in order to reveal the limits that are imposed by the nature of microarray data on the inference process. We examined the effect of various factors on the inference performance such as the network size, the presence of noise in microarray data, and the network sparseness. We used a system theory approach and examined the relationship between the pole placement of the inferred system and the inference performance. We examined the relationship between the inference performance in the time domain and the true system parameter identification. Simulation results indicated that time separation and the percentage of directly triggered genes are crucial factors. Also, network sparseness, the triggering function type and noise in input data affect the inference performance. When two factors were simultaneously varied, it was found that variation of one parameter significantly affects the dynamic response of the other. Crucial factors were also examined using a real GRN and acquired results confirmed simulation findings with artificial data. Different initial conditions were also used as an alternative triggering approach. Relevant results confirmed that the number of datasets constitutes the most significant parameter with regard to the inference performance. 
Copyright © 2017 Elsevier Ltd. All rights reserved.
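The link between pole placement and inference quality can be made concrete with a minimal sketch, assuming a two-gene, noise-free, discrete-time linear model x[t+1] = A x[t]. The model form, the function names, and the example matrix are illustrative assumptions, not the inference process evaluated in the study.

```python
import cmath

def simulate(a, x0, steps):
    """Generate a noise-free trajectory of x[t+1] = A x[t] for a 2-gene system."""
    traj = [list(x0)]
    for _ in range(steps):
        x = traj[-1]
        traj.append([a[0][0] * x[0] + a[0][1] * x[1],
                     a[1][0] * x[0] + a[1][1] * x[1]])
    return traj

def fit_transition_matrix(traj):
    """Least-squares estimate A = (X1 X0^T)(X0 X0^T)^-1 from consecutive time points."""
    s = [[0.0, 0.0], [0.0, 0.0]]  # accumulates X0 X0^T
    c = [[0.0, 0.0], [0.0, 0.0]]  # accumulates X1 X0^T
    for x, y in zip(traj[:-1], traj[1:]):
        for i in range(2):
            for j in range(2):
                s[i][j] += x[i] * x[j]
                c[i][j] += y[i] * x[j]
    det = s[0][0] * s[1][1] - s[0][1] * s[1][0]
    inv = [[s[1][1] / det, -s[0][1] / det],
           [-s[1][0] / det, s[0][0] / det]]
    return [[sum(c[i][k] * inv[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def poles(a):
    """Eigenvalues of a 2x2 matrix; magnitudes below 1 indicate a stable system."""
    tr = a[0][0] + a[1][1]
    det = a[0][0] * a[1][1] - a[0][1] * a[1][0]
    root = cmath.sqrt(tr * tr - 4 * det)
    return [(tr + root) / 2, (tr - root) / 2]

true_a = [[0.9, -0.2], [0.1, 0.8]]            # hypothetical regulatory matrix
series = simulate(true_a, [1.0, 0.5], steps=12)
estimated_a = fit_transition_matrix(series)
estimated_poles = poles(estimated_a)
```

With noise-free data the estimate matches `true_a` to numerical precision; adding noise or shortening the series is a simple way to see how quickly the estimated poles drift.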
Sperm RNA Amplification for Gene Expression Profiling by DNA Microarray Technology
Hongzu Ren, Kary E. Thompson, Judith E. Schmid and David J. Dix, Reproductive Toxicology Division, NHEERL, Office of Research and Development, US Environmental Protection Agency, Research Triang...
MICROARRAY ANALYSIS OF DICHLOROACETIC ACID-INDUCED CHANGES IN GENE EXPRESSION
Dichloroacetic acid (DCA) is a major by-product of water disinfection by chlorination. Several studies have demonstrated the hepatocarcinogenicity of DCA in rodents when administered in dri...
Baines, John F.; Roller, Julia; Saminadin-Peter, Sarah S.; Parsch, John; Jiggins, Francis M.
2009-01-01
Background Bacterial and fungal infections induce a potent immune response in Drosophila melanogaster, but it is unclear whether viral infections induce an antiviral immune response. Using microarrays, we examined the changes in gene expression in Drosophila that occur in response to infection with the sigma virus, a negative-stranded RNA virus (Rhabdoviridae) that occurs in wild populations of D. melanogaster. Principal Findings We detected many changes in gene expression in infected flies, but found no evidence for the activation of the Toll, IMD or Jak-STAT pathways, which control immune responses against bacteria and fungi. We identified a number of functional categories of genes, including serine proteases, ribosomal proteins and chorion proteins that were overrepresented among the differentially expressed genes. We also found that the sigma virus alters the expression of many more genes in males than in females. Conclusions These data suggest that either Drosophila do not mount an immune response against the sigma virus, or that the immune response is not controlled by known immune pathways. If the latter is true, the genes that we identified as differentially expressed after infection are promising candidates for controlling the host's response to the sigma virus. PMID:19718442
Carpenter, Jennifer; Hutter, Stephan; Baines, John F; Roller, Julia; Saminadin-Peter, Sarah S; Parsch, John; Jiggins, Francis M
2009-08-31
Bacterial and fungal infections induce a potent immune response in Drosophila melanogaster, but it is unclear whether viral infections induce an antiviral immune response. Using microarrays, we examined the changes in gene expression in Drosophila that occur in response to infection with the sigma virus, a negative-stranded RNA virus (Rhabdoviridae) that occurs in wild populations of D. melanogaster. We detected many changes in gene expression in infected flies, but found no evidence for the activation of the Toll, IMD or Jak-STAT pathways, which control immune responses against bacteria and fungi. We identified a number of functional categories of genes, including serine proteases, ribosomal proteins and chorion proteins that were overrepresented among the differentially expressed genes. We also found that the sigma virus alters the expression of many more genes in males than in females. These data suggest that either Drosophila do not mount an immune response against the sigma virus, or that the immune response is not controlled by known immune pathways. If the latter is true, the genes that we identified as differentially expressed after infection are promising candidates for controlling the host's response to the sigma virus.
Gerber, Simon D.; Amann, Ruth; Wyder, Stefan; Trueb, Beat
2012-01-01
Fgfrl1 (fibroblast growth factor receptor-like 1) is a transmembrane receptor that is essential for the development of the metanephric kidney. It is expressed in all nascent nephrogenic structures and in the ureteric bud. Fgfrl1 null mice fail to develop the metanephric kidneys. Mutant kidney rudiments show a dramatic reduction of ureteric branching and a lack of mesenchymal-to-epithelial transition. Here, we compared the expression profiles of wildtype and Fgfrl1 mutant kidneys to identify genes that act downstream of Fgfrl1 signaling during the early steps of nephron formation. We detected 56 differentially expressed transcripts with 2-fold or greater reduction, among them many genes involved in Fgf, Wnt, Bmp, Notch, and Six/Eya/Dach signaling. We validated the microarray data by qPCR and whole-mount in situ hybridization and showed the expression pattern of candidate genes in normal kidneys. Some of these genes might play an important role during early nephron formation. Our study should help to define the minimal set of genes that is required to form a functional nephron. PMID:22432025
Ethanol modulation of gene networks: implications for alcoholism.
Farris, Sean P; Miles, Michael F
2012-01-01
Alcoholism is a complex disease caused by a confluence of environmental and genetic factors influencing multiple brain pathways to produce a variety of behavioral sequelae, including addiction. Genetic factors contribute to over 50% of the risk for alcoholism and recent evidence points to a large number of genes with small effect sizes as the likely molecular basis for this disease. Recent progress in genomics (microarrays or RNA-Seq) and genetics has led to the identification of a large number of potential candidate genes influencing ethanol behaviors or alcoholism itself. To organize this complex information, investigators have begun to focus on the contribution of gene networks, rather than individual genes, for various ethanol-induced behaviors in animal models or behavioral endophenotypes comprising alcoholism. This chapter reviews some of the methods used for constructing gene networks from genomic data and some of the recent progress made in applying such approaches to the study of the neurobiology of ethanol. We show that rapid technology development in gathering genomic data, together with sophisticated experimental design and a growing collection of analysis tools are producing novel insights for understanding the molecular basis of alcoholism and that such approaches promise new opportunities for therapeutic development. Copyright © 2011 Elsevier Inc. All rights reserved.
Gene ARMADA: an integrated multi-analysis platform for microarray data implemented in MATLAB.
Chatziioannou, Aristotelis; Moulos, Panagiotis; Kolisis, Fragiskos N
2009-10-27
The microarray data analysis realm is ever growing through the development of various tools, open source and commercial. However, there is an absence of predefined rational algorithmic analysis workflows or batch standardized processing to incorporate all steps, from raw data import up to the derivation of significantly differentially expressed gene lists. This absence obfuscates the analytical procedure and obstructs the massive comparative processing of genomic microarray datasets. Moreover, the solutions provided depend heavily on the programming skills of the user, whereas GUI-embedded solutions do not provide direct support of various raw image analysis formats or a versatile and simultaneously flexible combination of signal processing methods. We describe here Gene ARMADA (Automated Robust MicroArray Data Analysis), a MATLAB-implemented platform with a Graphical User Interface. This suite integrates all steps of microarray data analysis including automated data import, noise correction and filtering, normalization, statistical selection of differentially expressed genes, clustering, classification and annotation. In its current version, Gene ARMADA fully supports two-colour cDNA and Affymetrix oligonucleotide arrays, plus custom arrays for which experimental details are given in tabular form (Excel spreadsheet, comma separated values, tab-delimited text formats). It also supports the analysis of already processed results through its versatile import editor. Besides being fully automated, Gene ARMADA incorporates numerous functionalities of the Statistics and Bioinformatics Toolboxes of MATLAB. In addition, it provides numerous visualization and exploration tools plus customizable export data formats for seamless integration by other analysis tools or MATLAB, for further processing. Gene ARMADA requires MATLAB 7.4 (R2007a) or higher and is also distributed as a stand-alone application with MATLAB Component Runtime.
Gene ARMADA provides a highly adaptable, integrative, yet flexible tool which can be used for automated quality control, analysis, annotation and visualization of microarray data, constituting a starting point for further data interpretation and integration with numerous other tools.
2014-01-01
Background Induced resistance (IR) can be part of a sustainable plant protection strategy against important plant diseases. β-aminobutyric acid (BABA) can induce resistance in a wide range of plants against several types of pathogens, including potato infected with Phytophthora infestans. However, the molecular mechanisms behind this are unclear and seem to be dependent on the system studied. To elucidate the defence responses activated by BABA in potato, a genome-wide transcript microarray analysis in combination with label-free quantitative proteomics analysis of the apoplast secretome were performed two days after treatment of the leaf canopy with BABA at two concentrations, 1 and 10 mM. Results Over 5000 transcripts were differentially expressed and over 90 secretome proteins changed in abundance indicating a massive activation of defence mechanisms with 10 mM BABA, the concentration effective against late blight disease. To aid analysis, we present a more comprehensive functional annotation of the microarray probes and gene models by retrieving information from orthologous gene families across 26 sequenced plant genomes. The new annotation provided GO terms to 8616 previously un-annotated probes. Conclusions BABA at 10 mM affected several processes related to plant hormones and amino acid metabolism. A major accumulation of PR proteins was also evident, and in the mevalonate pathway, genes involved in sterol biosynthesis were down-regulated, whereas several enzymes involved in the sesquiterpene phytoalexin biosynthesis were up-regulated. Interestingly, abscisic acid (ABA) responsive genes were not as clearly regulated by BABA in potato as previously reported in Arabidopsis. Together these findings provide candidates and markers for improved resistance in potato, one of the most important crops in the world. PMID:24773703
Biomarkers of acute respiratory allergen exposure: Screening for sensitization potential
DOE Office of Scientific and Technical Information (OSTI.GOV)
Pucheu-Haston, Cherie M., E-mail: Pucheu-Haston.Cherie@epa.go; Copeland, Lisa B.; Vallanat, Beena
2010-04-15
Effective hazard screening will require the development of high-throughput or in vitro assays for the identification of potential sensitizers. The goal of this preliminary study was to identify potential biomarkers that differentiate the response to allergens vs non-allergens following an acute exposure in naive individuals. Female BALB/c mice received a single intratracheal aspiration exposure to Metarhizium anisopliae crude antigen (MACA) or bovine serum albumin (BSA) in Hank's Balanced Salt Solution (HBSS) or HBSS alone. Mice were terminated after 1, 3, 6, 12, 18 and 24 h. Bronchoalveolar lavage fluid (BALF) was evaluated to determine total and differential cellularity, total protein concentration and LDH activity. RNA was isolated from lung tissue for microarray analysis and qRT-PCR. MACA administration induced a rapid increase in BALF neutrophils, lymphocytes, eosinophils and total protein compared to BSA or HBSS. Microarray analysis demonstrated differential expression of genes involved in cytokine production, signaling, inflammatory cell recruitment, adhesion and activation in 3 and 12 h MACA-treated samples compared to BSA or HBSS. Further analyses allowed identification of approx 100 candidate biomarker genes. Eleven genes were selected for further assessment by qRT-PCR. Of these, 6 demonstrated persistently increased expression (Ccl17, Ccl22, Ccl7, Cxcl10, Cxcl2, Saa1), while C3ar1 increased from 6-24 h. In conclusion, a single respiratory exposure of mice to an allergenic mold extract induces an inflammatory response which is distinct in phenotype and gene transcription from the response to a control protein. Further validation of these biomarkers with additional allergens and irritants is needed. These biomarkers may facilitate improvements in screening methods.
Tilton, Susan C.; Menachery, Vineet D.; Gralinski, Lisa E.; Schäfer, Alexandra; Matzke, Melissa M.; Webb-Robertson, Bobbie-Jo M.; Chang, Jean; Luna, Maria L.; Long, Casey E.; Shukla, Anil K.; Bankhead, Armand R.; Burkett, Susan E.; Zornetzer, Gregory; Tseng, Chien-Te Kent; Metz, Thomas O.; Pickles, Raymond; McWeeney, Shannon; Smith, Richard D.; Katze, Michael G.; Waters, Katrina M.; Baric, Ralph S.
2013-01-01
The severe acute respiratory syndrome coronavirus accessory protein ORF6 antagonizes interferon signaling by blocking karyopherin-mediated nuclear import processes. Viral nuclear import antagonists, expressed by several highly pathogenic RNA viruses, likely mediate pleiotropic effects on host gene expression, presumably interfering with transcription factors, cytokines, hormones, and/or signaling cascades that occur in response to infection. By bioinformatic and systems biology approaches, we evaluated the impact of nuclear import antagonism on host expression networks by using human lung epithelial cells infected with either wild-type virus or a mutant that does not express ORF6 protein. Microarray analysis revealed significant changes in differential gene expression, with approximately twice as many upregulated genes in the mutant virus samples by 48 h postinfection, despite identical viral titers. Our data demonstrated that ORF6 protein expression attenuates the activity of numerous karyopherin-dependent host transcription factors (VDR, CREB1, SMAD4, p53, EpasI, and Oct3/4) that are critical for establishing antiviral responses and regulating key host responses during virus infection. Results were confirmed by proteomic and chromatin immunoprecipitation assay analyses and in parallel microarray studies using infected primary human airway epithelial cell cultures. The data strongly support the hypothesis that viral antagonists of nuclear import actively manipulate host responses in specific hierarchical patterns, contributing to the viral pathogenic potential in vivo. Importantly, these studies and modeling approaches not only provide templates for evaluating virus antagonism of nuclear import processes but also can reveal candidate cellular genes and pathways that may significantly influence disease outcomes following severe acute respiratory syndrome coronavirus infection in vivo. PMID:23365422
Kim, Seungjin; Krajmalnik-Brown, Rosa; Kim, Jong-Oh; Chung, Jinwook
2014-11-01
The application of effective remediation technologies can benefit from adequate preliminary testing, such as in lab-scale and pilot-scale systems. Bioremediation technologies have demonstrated tremendous potential with regard to cost, but they cannot be used for all contaminated sites due to limitations in biological activity. The purpose of this study was to develop a DNA diagnostic method that reduces the time to select contaminated sites that are good candidates for bioremediation. We applied an oligonucleotide microarray method to detect and monitor genes that lead to aliphatic and aromatic degradation. Further, the bioremediation of a contaminated site, selected based on the results of the genetic diagnostic method, was achieved successfully by applying bioslurping in field tests. This gene-based diagnostic technique is a powerful tool to evaluate the potential for bioremediation in petroleum hydrocarbon contaminated soil. Copyright © 2014 Elsevier B.V. All rights reserved.
Yuan, Haiming; Meng, Zhe; Zhang, Lina; Luo, Xiangyang; Liu, Liping; Chen, Mengfan; Li, Xinwei; Zhao, Weiwei; Liang, Liyang
2016-01-01
Interstitial duplications distal to 15q13 are very rare. Here, we report a 14-year-old boy with severe short stature, delayed bone age, hypogonadism, global developmental delay and intellectual disability. He had distinctive facial features including macrocephaly, broad forehead, deep-set and widely spaced eyes, broad nose bridge, shallow philtrum and thick lips. A de novo 6.4 Mb interstitial duplication of 15q15.3q21.2 was detected by chromosomal microarray analysis. We compared our patient's clinical phenotypes with those of several individuals with overlapping duplications, and several candidate genes responsible for the phenotypes were identified as well. The results suggest a novel contiguous gene duplication syndrome characterized by shared features including short stature, hypogonadism, global developmental delay and other congenital anomalies.
Derivation of an artificial gene to improve classification accuracy upon gene selection.
Seo, Minseok; Oh, Sejong
2012-02-01
Classification analysis has been developed continuously since 1936. This research field has advanced as a result of development of classifiers such as KNN, ANN, and SVM, as well as through data preprocessing areas. Feature (gene) selection is required for very high dimensional data such as microarray before classification work. The goal of feature selection is to choose a subset of informative features that reduces processing time and provides higher classification accuracy. In this study, we devised a method of artificial gene making (AGM) for microarray data to improve classification accuracy. Our artificial gene was derived from a whole microarray dataset, and combined with a result of gene selection for classification analysis. We experimentally confirmed a clear improvement of classification accuracy after inserting artificial gene. Our artificial gene worked well for popular feature (gene) selection algorithms and classifiers. The proposed approach can be applied to any type of high dimensional dataset. Copyright © 2011 Elsevier Ltd. All rights reserved.
FISH Oracle: a web server for flexible visualization of DNA copy number data in a genomic context.
Mader, Malte; Simon, Ronald; Steinbiss, Sascha; Kurtz, Stefan
2011-07-28
The rapidly growing amount of array CGH data requires improved visualization software supporting the process of identifying candidate cancer genes. Optimally, such software should work across multiple microarray platforms, should be able to cope with data from different sources and should be easy to operate. We have developed a web-based software FISH Oracle to visualize data from multiple array CGH experiments in a genomic context. Its fast visualization engine and advanced web and database technology supports highly interactive use. FISH Oracle comes with a convenient data import mechanism, powerful search options for genomic elements (e.g. gene names or karyobands), quick navigation and zooming into interesting regions, and mechanisms to export the visualization into different high quality formats. These features make the software especially suitable for the needs of life scientists. FISH Oracle offers a fast and easy to use visualization tool for array CGH and SNP array data. It allows for the identification of genomic regions representing minimal common changes based on data from one or more experiments. FISH Oracle will be instrumental to identify candidate onco and tumor suppressor genes based on the frequency and genomic position of DNA copy number changes. The FISH Oracle application and an installed demo web server are available at http://www.zbh.uni-hamburg.de/fishoracle.
FISH Oracle: a web server for flexible visualization of DNA copy number data in a genomic context
2011-01-01
Background The rapidly growing amount of array CGH data requires improved visualization software supporting the process of identifying candidate cancer genes. Optimally, such software should work across multiple microarray platforms, should be able to cope with data from different sources and should be easy to operate. Results We have developed a web-based software FISH Oracle to visualize data from multiple array CGH experiments in a genomic context. Its fast visualization engine and advanced web and database technology supports highly interactive use. FISH Oracle comes with a convenient data import mechanism, powerful search options for genomic elements (e.g. gene names or karyobands), quick navigation and zooming into interesting regions, and mechanisms to export the visualization into different high quality formats. These features make the software especially suitable for the needs of life scientists. Conclusions FISH Oracle offers a fast and easy to use visualization tool for array CGH and SNP array data. It allows for the identification of genomic regions representing minimal common changes based on data from one or more experiments. FISH Oracle will be instrumental to identify candidate onco and tumor suppressor genes based on the frequency and genomic position of DNA copy number changes. The FISH Oracle application and an installed demo web server are available at http://www.zbh.uni-hamburg.de/fishoracle. PMID:21884636
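One simple reading of "minimal common changes" is the interval shared by the altered segments of every experiment on a given chromosome. The sketch below implements only that reading; the function name `minimal_common_region` and the example coordinates are hypothetical, and FISH Oracle's own procedure, which also takes frequency into account, may differ.

```python
def minimal_common_region(segments):
    """Return the (start, end) interval shared by all segments, or None.

    `segments` holds one (start, end) copy number change per experiment,
    all on the same chromosome and with the same direction (gain or loss).
    """
    start = max(s for s, _ in segments)   # latest start among experiments
    end = min(e for _, e in segments)     # earliest end among experiments
    return (start, end) if start < end else None

# Hypothetical gains reported by three array CGH experiments (base pairs).
gains = [(100, 500), (200, 600), (150, 450)]
shared = minimal_common_region(gains)
```

A frequency-based variant would instead report regions altered in at least a chosen fraction of experiments rather than in all of them.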
A fisheye viewer for microarray-based gene expression data
Wu, Min; Thao, Cheng; Mu, Xiangming; Munson, Ethan V
2006-01-01
Background Microarrays have been widely used to measure the relative amounts of every mRNA transcript from the genome in a single scan. Biologists have been accustomed to reading their experimental data directly from tables. However, microarray data are quite large and are stored in a series of files in a machine-readable format, so direct reading of the full data set is not feasible. The challenge is to design a user interface that allows biologists to usefully view large tables of raw microarray-based gene expression data. This paper presents one such interface – an electronic table (E-table) that uses fisheye distortion technology. Results The Fisheye Viewer for microarray-based gene expression data has been successfully developed to view MIAME data stored in the MAGE-ML format. The viewer can be downloaded from the project web site. The fisheye viewer was implemented in Java so that it could run on multiple platforms. We implemented the E-table by adapting JTable, a default table implementation in the Java Swing user interface library. Fisheye views use variable magnification to balance magnification for easy viewing and compression for maximizing the amount of data on the screen. Conclusion This Fisheye Viewer is a lightweight but useful tool for biologists to quickly overview the raw microarray-based gene expression data in an E-table. PMID:17038193
A proposed metric for assessing the measurement quality of individual microarrays
Kim, Kyoungmi; Page, Grier P; Beasley, T Mark; Barnes, Stephen; Scheirer, Katherine E; Allison, David B
2006-01-01
Background High-density microarray technology is increasingly applied to study gene expression levels on a large scale. Microarray experiments rely on several critical steps that may introduce error and uncertainty in analyses. These steps include mRNA sample extraction, amplification and labeling, hybridization, and scanning. In some cases this may be manifested as systematic spatial variation on the surface of microarray in which expression measurements within an individual array may vary as a function of geographic position on the array surface. Results We hypothesized that an index of the degree of spatiality of gene expression measurements associated with their physical geographic locations on an array could indicate the summary of the physical reliability of the microarray. We introduced a novel way to formulate this index using a statistical analysis tool. Our approach regressed gene expression intensity measurements on a polynomial response surface of the microarray's Cartesian coordinates. We demonstrated this method using a fixed model and presented results from real and simulated datasets. Conclusion We demonstrated the potential of such a quantitative metric for assessing the reliability of individual arrays. Moreover, we showed that this procedure can be incorporated into laboratory practice as a means to set quality control specifications and as a tool to determine whether an array has sufficient quality to be retained in terms of spatial correlation of gene expression measurements. PMID:16430768
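The idea of regressing intensities on a polynomial surface of the spot coordinates can be sketched as follows, using the R² of a quadratic surface as the spatial index. The polynomial degree, the use of R² as the summary, and the names `solve` and `spatial_r_squared` are assumptions for illustration; the published metric and its fixed model may be defined differently.

```python
def solve(a, b):
    """Solve the linear system a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for k in range(col, n + 1):
                m[r][k] -= factor * m[col][k]
    x = [0.0] * n
    for i in reversed(range(n)):
        x[i] = (m[i][n] - sum(m[i][k] * x[k] for k in range(i + 1, n))) / m[i][i]
    return x

def spatial_r_squared(spots):
    """Fit intensity ~ 1 + x + y + x^2 + xy + y^2 and return R^2.

    `spots` is a list of (x, y, intensity) tuples. Values near 1 mean the
    intensities are largely explained by position on the array, which
    suggests a spatial artifact; values near 0 suggest little spatial trend.
    """
    rows = [[1.0, x, y, x * x, x * y, y * y] for x, y, _ in spots]
    z = [s[2] for s in spots]
    p = 6
    # Normal equations (X^T X) beta = X^T z
    xtx = [[sum(r[i] * r[j] for r in rows) for j in range(p)] for i in range(p)]
    xtz = [sum(r[i] * zi for r, zi in zip(rows, z)) for i in range(p)]
    beta = solve(xtx, xtz)
    fitted = [sum(b * v for b, v in zip(beta, r)) for r in rows]
    mean = sum(z) / len(z)
    ss_tot = sum((zi - mean) ** 2 for zi in z)
    ss_res = sum((zi - fi) ** 2 for zi, fi in zip(z, fitted))
    return 1.0 - ss_res / ss_tot

# Hypothetical 6x6 array with a smooth intensity gradient across its surface.
gradient_array = [(float(x), float(y), 2.0 + 0.5 * x + 0.1 * y * y)
                  for x in range(6) for y in range(6)]
index = spatial_r_squared(gradient_array)
```

A laboratory could, for example, flag arrays whose index exceeds a threshold chosen from historical data; any such threshold would need to be calibrated and is not taken from the study.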
Ronza, P; Cao, A; Robledo, D; Gómez-Tato, A; Álvarez-Dios, J A; Hasanuzzaman, A F M; Quiroga, M I; Villalba, A; Pardo, B G; Martínez, P
2018-04-18
European flat oyster (Ostrea edulis) production has suffered a severe decline due to bonamiosis. The responsible parasite enters oyster haemocytes, causing an acute inflammatory response frequently leading to death. We used an immune-enriched oligo-microarray to understand the haemocyte response to Bonamia ostreae by comparing expression profiles between naïve (NS) and long-term affected (AS) populations along a time series (1 d, 30 d, 90 d). AS showed a much higher response just after challenge, which might be indicative of selection for resistance. No regulated genes were detected at 30 d in either population, while a notable reactivation was observed at 90 d, suggesting parasite latency during infection. Genes related to extracellular matrix and protease inhibitors, up-regulated in AS, and those related to histones, down-regulated in NS, might play an important role along the infection. Twenty-four candidate genes related to resistance should be further validated for selection programs aimed to control bonamiosis. Copyright © 2018 Elsevier Inc. All rights reserved.
Larson, Amy; Fair, Benjamin Jung; Pleiss, Jeffrey A
2016-06-01
Pre-mRNA splicing is an essential component of eukaryotic gene expression and is highly conserved from unicellular yeasts to humans. Here, we present the development and implementation of a sequencing-based reverse genetic screen designed to identify nonessential genes that impact pre-mRNA splicing in the fission yeast Schizosaccharomyces pombe, an organism that shares many of the complex features of splicing in higher eukaryotes. Using a custom-designed barcoding scheme, we simultaneously queried ∼3000 mutant strains for their impact on the splicing efficiency of two endogenous pre-mRNAs. A total of 61 nonessential genes were identified whose deletions resulted in defects in pre-mRNA splicing; enriched among these were factors encoding known or predicted components of the spliceosome. Included among the candidates identified here are genes with well-characterized roles in other RNA-processing pathways, including heterochromatic silencing and 3' end processing. Splicing-sensitive microarrays confirm broad splicing defects for many of these factors, revealing novel functional connections between these pathways. Copyright © 2016 Larson et al.
Hendry, William J; Hariri, Hussam Y; Alwis, Imala D; Gunewardena, Sumedha S; Hendry, Isabel R
2014-12-01
Neonatal treatment of hamsters with diethylstilbestrol (DES) induces uterine hyperplasia/dysplasia/neoplasia (endometrial adenocarcinoma) in adult animals. We subsequently determined that the neonatal DES exposure event directly and permanently disrupts the developing hamster uterus (initiation stage) so that it responds abnormally when it is stimulated with estrogen in adulthood (promotion stage). To identify candidate molecular elements involved in progression of the disruption/neoplastic process, we performed: (1) immunoblot analyses and (2) microarray profiling (Affymetrix Gene Chip System) on sets of uterine protein and RNA extracts, respectively, and (3) immunohistochemical analysis on uterine sections; all from both initiation stage and promotion stage groups of animals. Here we report that: (1) progression of the neonatal DES-induced hyperplasia/dysplasia/neoplasia phenomenon in the hamster uterus involves a wide spectrum of specific gene expression alterations and (2) the gene products involved and their manner of altered expression differ dramatically during the initiation vs. promotion stages of the phenomenon. Copyright © 2014 Elsevier Inc. All rights reserved.
Larson, Amy; Fair, Benjamin Jung; Pleiss, Jeffrey A.
2016-01-01
Pre-mRNA splicing is an essential component of eukaryotic gene expression and is highly conserved from unicellular yeasts to humans. Here, we present the development and implementation of a sequencing-based reverse genetic screen designed to identify nonessential genes that impact pre-mRNA splicing in the fission yeast Schizosaccharomyces pombe, an organism that shares many of the complex features of splicing in higher eukaryotes. Using a custom-designed barcoding scheme, we simultaneously queried ∼3000 mutant strains for their impact on the splicing efficiency of two endogenous pre-mRNAs. A total of 61 nonessential genes were identified whose deletions resulted in defects in pre-mRNA splicing; enriched among these were factors encoding known or predicted components of the spliceosome. Included among the candidates identified here are genes with well-characterized roles in other RNA-processing pathways, including heterochromatic silencing and 3ʹ end processing. Splicing-sensitive microarrays confirm broad splicing defects for many of these factors, revealing novel functional connections between these pathways. PMID:27172183
Vallée, Maud; Gravel, Catherine; Palin, Marie-France; Reghenas, Hélène; Stothard, Paul; Wishart, David S; Sirard, Marc-André
2005-07-01
The main objective of the present study was to identify novel oocyte-specific genes in three different species: bovine, mouse, and Xenopus laevis. To achieve this goal, two powerful technologies were combined: a polymerase chain reaction (PCR)-based cDNA subtraction, and cDNA microarrays. Three subtractive libraries consisting of 3456 clones were established and enriched for oocyte-specific transcripts. Sequencing analysis of the positive insert-containing clones resulted in the following classification: 53% of the clones corresponded to known cDNAs, 26% were classified as uncharacterized cDNAs, and a final 9% were classified as novel sequences. All these clones were used for cDNA microarray preparation. Results from these microarray analyses revealed that in addition to already known oocyte-specific genes, such as GDF9, BMP15, and ZP, known genes with unknown function in the oocyte were identified, such as a MLF1-interacting protein (MLF1IP), B-cell translocation gene 4 (BTG4), and phosphotyrosine-binding protein (xPTB). Furthermore, 15 novel oocyte-specific genes were validated by reverse transcription-PCR to confirm their preferential expression in the oocyte compared to somatic tissues. The results obtained in the present study confirmed that microarray analysis is a robust technique to identify true positives from the suppressive subtractive hybridization experiment. Furthermore, obtaining oocyte-specific genes from three species simultaneously allowed us to look at important genes that are conserved across species. Further characterization of these novel oocyte-specific genes will lead to a better understanding of the molecular mechanisms related to the unique functions found in the oocyte.
A database for the analysis of immunity genes in Drosophila: PADMA database.
Lee, Mark J; Mondal, Ariful; Small, Chiyedza; Paddibhatla, Indira; Kawaguchi, Akira; Govind, Shubha
2011-01-01
While microarray experiments generate voluminous data, discerning trends that support an existing or alternative paradigm is challenging. To synergize hypothesis building and testing, we designed the Pathogen Associated Drosophila MicroArray (PADMA) database for easy retrieval and comparison of microarray results from immunity-related experiments (www.padmadatabase.org). PADMA also allows biologists to upload their microarray results and compare them with datasets housed within PADMA. We tested PADMA using a preliminary dataset from Ganaspis xanthopoda-infected fly larvae, and uncovered unexpected trends in gene expression, reshaping our hypothesis. Thus, the PADMA database will be a useful resource for fly researchers to evaluate, revise, and refine hypotheses.
Development and validation of a flax (Linum usitatissimum L.) gene expression oligo microarray
2010-01-01
Background Flax (Linum usitatissimum L.) has been cultivated for around 9,000 years and is therefore one of the oldest cultivated species. Today, flax is still grown for its oil (oil-flax or linseed cultivars) and its cellulose-rich fibres (fibre-flax cultivars) used for high-value linen garments and composite materials. Despite the wide industrial use of flax-derived products and our current understanding of the regulation of both wood fibre production and oil biosynthesis, more information must be acquired in both domains. Recent advances in genomics are now providing opportunities to improve our fundamental knowledge of these complex processes. In this paper we report the development and validation of a high-density oligo microarray platform dedicated to gene expression analyses in flax. Results Nine different RNA samples obtained from flax inner- and outer-stems, seeds, leaves and roots were used to generate a collection of 1,066,481 ESTs by massive parallel pyrosequencing. Sequences were assembled into 59,626 unigenes and 48,021 sequences were selected for oligo design and high-density microarray (Nimblegen 385K) fabrication with eight non-overlapping 25-mer oligos per unigene. Eighteen independent experiments were used to evaluate the hybridization quality, precision, specificity and accuracy, and all results confirmed the high technical quality of our microarray platform. Cross-validation of microarray data was carried out using quantitative RT-PCR (qRT-PCR). Nine target genes were selected on the basis of microarray results and reflected the whole range of fold change (both up-regulated and down-regulated genes in different samples). A statistically significant positive correlation was obtained comparing expression levels for each target gene across all biological replicates both in qRT-PCR and microarray results. 
Further experiments illustrated the capacity of our arrays to detect differential gene expression in a variety of flax tissues as well as between two contrasted flax varieties. Conclusion All results suggest that our high-density flax oligo-microarray platform can be used as a very sensitive tool for analyzing gene expression in a large variety of tissues as well as in different cultivars. Moreover, this highly reliable platform can also be used for the quantification of mRNA transcriptional profiling in different flax tissues. PMID:20964859
Development and validation of a flax (Linum usitatissimum L.) gene expression oligo microarray.
Fenart, Stéphane; Ndong, Yves-Placide Assoumou; Duarte, Jorge; Rivière, Nathalie; Wilmer, Jeroen; van Wuytswinkel, Olivier; Lucau, Anca; Cariou, Emmanuelle; Neutelings, Godfrey; Gutierrez, Laurent; Chabbert, Brigitte; Guillot, Xavier; Tavernier, Reynald; Hawkins, Simon; Thomasset, Brigitte
2010-10-21
Flax (Linum usitatissimum L.) has been cultivated for around 9,000 years and is therefore one of the oldest cultivated species. Today, flax is still grown for its oil (oil-flax or linseed cultivars) and its cellulose-rich fibres (fibre-flax cultivars) used for high-value linen garments and composite materials. Despite the wide industrial use of flax-derived products and our current understanding of the regulation of both wood fibre production and oil biosynthesis, more information must be acquired in both domains. Recent advances in genomics are now providing opportunities to improve our fundamental knowledge of these complex processes. In this paper we report the development and validation of a high-density oligo microarray platform dedicated to gene expression analyses in flax. Nine different RNA samples obtained from flax inner- and outer-stems, seeds, leaves and roots were used to generate a collection of 1,066,481 ESTs by massive parallel pyrosequencing. Sequences were assembled into 59,626 unigenes and 48,021 sequences were selected for oligo design and high-density microarray (Nimblegen 385K) fabrication with eight non-overlapping 25-mer oligos per unigene. Eighteen independent experiments were used to evaluate the hybridization quality, precision, specificity and accuracy, and all results confirmed the high technical quality of our microarray platform. Cross-validation of microarray data was carried out using quantitative RT-PCR (qRT-PCR). Nine target genes were selected on the basis of microarray results and reflected the whole range of fold change (both up-regulated and down-regulated genes in different samples). A statistically significant positive correlation was obtained comparing expression levels for each target gene across all biological replicates both in qRT-PCR and microarray results. Further experiments illustrated the capacity of our arrays to detect differential gene expression in a variety of flax tissues as well as between two contrasted flax varieties. 
All results suggest that our high-density flax oligo-microarray platform can be used as a very sensitive tool for analyzing gene expression in a large variety of tissues as well as in different cultivars. Moreover, this highly reliable platform can also be used for the quantification of mRNA transcriptional profiling in different flax tissues.
Microarray expression profiling in adhesion and normal peritoneal tissues.
Ambler, Dana R; Golden, Alicia M; Gell, Jennifer S; Saed, Ghassan M; Carey, David J; Diamond, Michael P
2012-05-01
To identify molecular markers associated with adhesion and normal peritoneal tissue using microarray expression profiling. Comparative study. University hospital. Five premenopausal women. Adhesion and normal peritoneal tissue samples were obtained from premenopausal women. Ribonucleic acid was extracted using standard protocols and processed for hybridization to Affymetrix Whole Transcript Human Gene Expression Chips. Microarray data were obtained from five different patients, each with adhesion tissue and normal peritoneal samples. Real-time polymerase chain reaction was performed for confirmation using standard protocols. Gene expression in postoperative adhesion and normal peritoneal tissues. A total of 1,263 genes were differentially expressed between adhesion and normal tissues. One hundred seventy-three genes were found to be up-regulated and 56 genes were down-regulated in the adhesion tissues compared with normal peritoneal tissues. The genes were sorted into functional categories according to Gene Ontology annotations. Twenty-six up-regulated genes and 11 down-regulated genes were identified with functions potentially relevant to the pathophysiology of postoperative adhesions. We evaluated and confirmed expression of 12 of these specific genes via polymerase chain reaction. The pathogenesis, natural history, and optimal treatment of postoperative adhesive disease remains unanswered. Microarray analysis of adhesions identified specific genes with increased and decreased expression when compared with normal peritoneum. Knowledge of these genes and ontologic pathways with altered expression provide targets for new therapies to treat patients who have or are at risk for postoperative adhesions. Copyright © 2012 American Society for Reproductive Medicine. Published by Elsevier Inc. All rights reserved.
Multi-membership gene regulation in pathway based microarray analysis
2011-01-01
Background Gene expression analysis has been intensively researched for more than a decade. Recently, there has been elevated interest in the integration of microarray data analysis with other types of biological knowledge in a holistic analytical approach. We propose a methodology that can be facilitated for pathway based microarray data analysis, based on the observation that a substantial proportion of genes present in biochemical pathway databases are members of a number of distinct pathways. Our methodology aims towards establishing the state of individual pathways, by identifying those truly affected by the experimental conditions based on the behaviour of such genes. For that purpose it considers all the pathways in which a gene participates and the general census of gene expression per pathway. Results We utilise hill climbing, simulated annealing and a genetic algorithm to analyse the consistency of the produced results, through the application of fuzzy adjusted rand indexes and hamming distance. All algorithms produce highly consistent genes to pathways allocations, revealing the contribution of genes to pathway functionality, in agreement with current pathway state visualisation techniques, with the simulated annealing search proving slightly superior in terms of efficiency. Conclusions We show that the expression values of genes, which are members of a number of biochemical pathways or modules, are the net effect of the contribution of each gene to these biochemical processes. We show that by manipulating the pathway and module contribution of such genes to follow underlying trends we can interpret microarray results centred on the behaviour of these genes. PMID:21939531
Multi-membership gene regulation in pathway based microarray analysis.
Pavlidis, Stelios P; Payne, Annette M; Swift, Stephen M
2011-09-22
Gene expression analysis has been intensively researched for more than a decade. Recently, there has been elevated interest in the integration of microarray data analysis with other types of biological knowledge in a holistic analytical approach. We propose a methodology that can be facilitated for pathway based microarray data analysis, based on the observation that a substantial proportion of genes present in biochemical pathway databases are members of a number of distinct pathways. Our methodology aims towards establishing the state of individual pathways, by identifying those truly affected by the experimental conditions based on the behaviour of such genes. For that purpose it considers all the pathways in which a gene participates and the general census of gene expression per pathway. We utilise hill climbing, simulated annealing and a genetic algorithm to analyse the consistency of the produced results, through the application of fuzzy adjusted rand indexes and hamming distance. All algorithms produce highly consistent genes to pathways allocations, revealing the contribution of genes to pathway functionality, in agreement with current pathway state visualisation techniques, with the simulated annealing search proving slightly superior in terms of efficiency. We show that the expression values of genes, which are members of a number of biochemical pathways or modules, are the net effect of the contribution of each gene to these biochemical processes. We show that by manipulating the pathway and module contribution of such genes to follow underlying trends we can interpret microarray results centred on the behaviour of these genes.
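The abstract above compares the gene-to-pathway allocations produced by different search algorithms using, among other measures, Hamming distance. A minimal sketch of that comparison follows. It assumes, purely for illustration, that each allocation is simplified to a mapping from a gene to a single pathway; the function name, gene names and pathway names are hypothetical and are not taken from the paper.

```python
def hamming_distance(alloc_a, alloc_b):
    """Count the genes that two allocations assign to different pathways.

    Each allocation is a dict mapping gene -> pathway (a simplifying
    assumption; the paper's own encoding is not given in the abstract).
    """
    if alloc_a.keys() != alloc_b.keys():
        raise ValueError("allocations must cover the same genes")
    return sum(alloc_a[gene] != alloc_b[gene] for gene in alloc_a)


# Hypothetical allocations from two search runs.
run_hill_climbing = {"geneA": "glycolysis", "geneB": "TCA cycle", "geneC": "glycolysis"}
run_annealing = {"geneA": "glycolysis", "geneB": "glycolysis", "geneC": "glycolysis"}

distance = hamming_distance(run_hill_climbing, run_annealing)
print(distance)
```

A small distance relative to the number of genes indicates that the two runs reached nearly the same allocation, which is the sense of consistency the abstract refers to.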
IMPROVING THE RELIABILITY OF MICROARRAYS FOR TOXICOLOGY RESEARCH: A COLLABORATIVE APPROACH
Microarray-based gene expression profiling is a critical tool to identify molecular biomarkers of specific chemical stressors. Although current microarray technologies have progressed from their infancy, biological and technical repeatability and reliability are often still limit...
Estimation of gene induction enables a relevance-based ranking of gene sets.
Bartholomé, Kilian; Kreutz, Clemens; Timmer, Jens
2009-07-01
In order to handle and interpret the vast amounts of data produced by microarray experiments, the analysis of sets of genes with a common biological functionality has been shown to be advantageous compared to single gene analyses. Some statistical methods have been proposed to analyse the differential gene expression of gene sets in microarray experiments. However, most of these methods either require threshold values to be chosen for the analysis, or they need some reference set for the determination of significance. We present a method that estimates the number of differentially expressed genes in a gene set without requiring a threshold value for significance of genes. The method is self-contained (i.e., it does not require a reference set for comparison). In contrast to other methods which are focused on significance, our approach emphasizes the relevance of the regulation of gene sets. The presented method measures the degree of regulation of a gene set and is a useful tool to compare the induction of different gene sets and place the results of microarray experiments into the biological context. An R-package is available.
A Model-Based Joint Identification of Differentially Expressed Genes and Phenotype-Associated Genes
Seo, Minseok; Shin, Su-kyung; Kwon, Eun-Young; Kim, Sung-Eun; Bae, Yun-Jung; Lee, Seungyeoun; Sung, Mi-Kyung; Choi, Myung-Sook; Park, Taesung
2016-01-01
Over the last decade, many analytical methods and tools have been developed for microarray data. The detection of differentially expressed genes (DEGs) among different treatment groups is often a primary purpose of microarray data analysis. In addition, association studies investigating the relationship between genes and a phenotype of interest such as survival time are also popular in microarray data analysis. Phenotype association analysis provides a list of phenotype-associated genes (PAGs). However, it is sometimes necessary to identify genes that are both DEGs and PAGs. We consider the joint identification of DEGs and PAGs in microarray data analyses. The first approach we used was a naïve approach that detects DEGs and PAGs separately and then identifies the genes in an intersection of the list of PAGs and DEGs. The second approach we considered was a hierarchical approach that detects DEGs first and then chooses PAGs from among the DEGs or vice versa. In this study, we propose a new model-based approach for the joint identification of DEGs and PAGs. Unlike the previous two-step approaches, the proposed method identifies genes simultaneously that are DEGs and PAGs. This method uses standard regression models but adopts different null hypothesis from ordinary regression models, which allows us to perform joint identification in one-step. The proposed model-based methods were evaluated using experimental data and simulation studies. The proposed methods were used to analyze a microarray experiment in which the main interest lies in detecting genes that are both DEGs and PAGs, where DEGs are identified between two diet groups and PAGs are associated with four phenotypes reflecting the expression of leptin, adiponectin, insulin-like growth factor 1, and insulin. Model-based approaches provided a larger number of genes, which are both DEGs and PAGs, than other methods. Simulation studies showed that they have more power than other methods. 
Through analysis of data from experimental microarrays and simulation studies, the proposed model-based approach was shown to provide a more powerful result than the naïve approach and the hierarchical approach. Since our approach is model-based, it is very flexible and can easily handle different types of covariates. PMID:26964035
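The two baseline strategies described in the abstract above, the naïve intersection and the hierarchical two-step selection, can be sketched as follows. This is an illustration only: the Bonferroni correction, the p-values and all function and gene names are assumptions, since the abstract does not say how multiplicity was handled. The proposed model-based method itself is not reproduced here.

```python
def naive_joint(deg_p, pag_p, alpha=0.05):
    """Detect DEGs and PAGs separately, then intersect the two lists.

    Each list is Bonferroni-corrected over all genes (an assumption).
    """
    threshold = alpha / len(deg_p)
    degs = {gene for gene, p in deg_p.items() if p < threshold}
    pags = {gene for gene, p in pag_p.items() if p < threshold}
    return degs & pags


def hierarchical_joint(deg_p, pag_p, alpha=0.05):
    """Detect DEGs first, then look for PAGs only among those DEGs.

    The second stage is corrected over the DEG subset only (an assumption).
    """
    degs = [gene for gene, p in deg_p.items() if p < alpha / len(deg_p)]
    if not degs:
        return set()
    return {gene for gene in degs if pag_p[gene] < alpha / len(degs)}


# Hypothetical per-gene p-values for the two analyses.
deg_p = {"g1": 0.001, "g2": 0.004, "g3": 0.2, "g4": 0.6}
pag_p = {"g1": 0.002, "g2": 0.02, "g3": 0.001, "g4": 0.9}

print(sorted(naive_joint(deg_p, pag_p)))
print(sorted(hierarchical_joint(deg_p, pag_p)))
```

In this toy input the hierarchical approach retains g2 because the second-stage correction is applied over two genes rather than four, which shows how the two-step approaches can disagree on the same data.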
MADGE: scalable distributed data management software for cDNA microarrays.
McIndoe, Richard A; Lanzen, Aaron; Hurtz, Kimberly
2003-01-01
The human genome project and the development of new high-throughput technologies have created unparalleled opportunities to study the mechanism of diseases, monitor the disease progression and evaluate effective therapies. Gene expression profiling is a critical tool to accomplish these goals. The use of nucleic acid microarrays to assess the gene expression of thousands of genes simultaneously has seen phenomenal growth over the past five years. Although commercial sources of microarrays exist, investigators wanting more flexibility in the genes represented on the array will turn to in-house production. The creation and use of cDNA microarrays is a complicated process that generates an enormous amount of information. Effective data management of this information is essential to efficiently access, analyze, troubleshoot and evaluate the microarray experiments. We have developed a distributable software package designed to track and store the various pieces of data generated by a cDNA microarray facility. This includes the clone collection storage data, annotation data, workflow queues, microarray data, data repositories, sample submission information, and project/investigator information. This application was designed using a 3-tier client server model. The data access layer (1st tier) contains the relational database system tuned to support a large number of transactions. The data services layer (2nd tier) is a distributed COM server with full database transaction support. The application layer (3rd tier) is an internet based user interface that contains both client and server side code for dynamic interactions with the user. This software is freely available to academic institutions and non-profit organizations at http://www.genomics.mcg.edu/niddkbtc.
Microarray gene expression profiling analysis combined with bioinformatics in multiple sclerosis.
Liu, Mingyuan; Hou, Xiaojun; Zhang, Ping; Hao, Yong; Yang, Yiting; Wu, Xiongfeng; Zhu, Desheng; Guan, Yangtai
2013-05-01
Multiple sclerosis (MS) is the most prevalent demyelinating disease and the principal cause of neurological disability in young adults. Recent microarray gene expression profiling studies have identified several genetic variants contributing to the complex pathogenesis of MS; however, expression and functional studies are still required to further understand its molecular mechanism. The present study aimed to analyze the molecular mechanism of MS using microarray analysis combined with bioinformatics techniques. We downloaded the gene expression profile of MS from Gene Expression Omnibus (GEO) and analysed the microarray data using the differentially coexpressed genes (DCGs) and links package in R and the Database for Annotation, Visualization and Integrated Discovery. The regulatory impact factor (RIF) algorithm was used to measure the impact factor of transcription factors. A total of 1,297 DCGs between MS patients and healthy controls were identified. Functional annotation indicated that these DCGs were associated with immune and neurological functions. Furthermore, the RIF result suggested that IKZF1, BACH1, CEBPB, EGR1, and FOS may play central regulatory roles in controlling gene expression in the pathogenesis of MS. Our findings confirm the presence of multiple molecular alterations in MS and indicate the possibility of identifying prognostic factors associated with MS pathogenesis.
Homogeneous versus heterogeneous probes for microbial ecological microarrays.
Bae, Jin-Woo; Park, Yong-Ha
2006-07-01
Microbial ecological microarrays have been developed for investigating the composition and functions of microorganism communities in environmental niches. These arrays include microbial identification microarrays, which use oligonucleotides, gene fragments or microbial genomes as probes. In this article, the advantages and disadvantages of each type of probe are reviewed. Oligonucleotide probes are currently useful for probing uncultivated bacteria that are not amenable to gene fragment probing, whereas the functional gene fragments amplified randomly from microbial genomes require phylogenetic and hierarchical categorization before use as microbial identification probes, despite their high resolution for both specificity and sensitivity. Until more bacteria are sequenced and gene fragment probes are thoroughly validated, heterogeneous bacterial genome probes will provide a simple, sensitive and quantitative tool for exploring the ecosystem structure.
Bian, Zhong-Rui; Yin, Juan; Sun, Wen; Lin, Dian-Jie
2017-04-01
Diagnosis of active tuberculosis (TB) is challenging, and treatment response is also difficult to monitor efficiently. The aim of this study was to apply an integrated microarray and network-based analysis to samples from publicly available datasets to obtain a diagnostic module set and pathways in active TB. Towards this goal, a background protein-protein interaction (PPI) network was generated based on global PPI information and gene expression data, followed by identification of a differential expression network (DEN) from the background PPI network. Then, ego genes were extracted according to the degree features in the DEN. Next, module collection was conducted by ego gene expansion based on the EgoNet algorithm. After that, differential expression of modules between active TB and controls was evaluated using a random permutation test. Finally, the biological significance of differential modules was assessed by pathway enrichment analysis based on the Reactome database, and Fisher's exact test was implemented to extract differential pathways for active TB. In total, 47 ego genes and 47 candidate modules were identified from the DEN. By setting the cutoff criteria of gene size >5 and classification accuracy ≥0.9, 7 ego modules (Module 4, Module 7, Module 9, Module 19, Module 25, Module 38 and Module 43) were extracted, and all of them differed significantly between active TB and controls. Then, Fisher's exact test was conducted to capture differential pathways for active TB. Interestingly, genes in Module 4, Module 25, Module 38, and Module 43 were enriched in the same pathway, formation of a pool of free 40S subunits. The significant pathway for Module 7 and Module 9 was eukaryotic translation termination, and for Module 19 it was nonsense mediated decay enhanced by the exon junction complex (EJC). 
Accordingly, differential modules and pathways might be potential biomarkers for treating active TB, and provide valuable clues for better understanding of molecular mechanism of active TB. Copyright © 2017 Elsevier Ltd. All rights reserved.
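The pathway enrichment step in the abstract above relies on Fisher's exact test. A minimal sketch of the one-sided version of that test, computed as a hypergeometric tail probability, is given below. The function name and all counts are hypothetical; the abstract does not report the contingency tables or whether a one- or two-sided test was used.

```python
from math import comb


def enrichment_pvalue(overlap, module_size, pathway_size, background_size):
    """One-sided Fisher's exact test for over-representation.

    Returns P(X >= overlap), where X is the number of pathway genes in a
    module of `module_size` genes drawn without replacement from a
    background of `background_size` genes containing `pathway_size`
    pathway genes.
    """
    max_overlap = min(module_size, pathway_size)
    total = comb(background_size, module_size)
    tail = sum(
        comb(pathway_size, k) * comb(background_size - pathway_size, module_size - k)
        for k in range(overlap, max_overlap + 1)
    )
    return tail / total


# Hypothetical counts: 4 of the 6 module genes fall in a 5-gene pathway,
# out of a background of 20 genes.
p_value = enrichment_pvalue(overlap=4, module_size=6, pathway_size=5, background_size=20)
print(p_value)
```

Integer binomial coefficients keep the tail sum exact until the final division, which avoids rounding problems with small probabilities.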
Wistow, Graeme; Bernstein, Steven L; Wyatt, M Keith; Behal, Amita; Touchman, Jeffrey W; Bouffard, Gerald; Smith, Don; Peterson, Katherine
2002-06-15
To explore the expression profile of the human lens and to provide a resource for microarray studies, expressed sequence tag (EST) analysis has been performed on cDNA libraries from adult lenses. A cDNA library was constructed from two adult (40 year old) human lenses. Over two thousand clones were sequenced from the unamplified, un-normalized library. The library was then normalized and a further 2200 sequences were obtained. All the data were analyzed using GRIST (GRouping and Identification of Sequence Tags), a procedure for gene identification and clustering. The lens library (by) contains a low percentage of non-mRNA contaminants and a high fraction (over 75%) of apparently full length cDNA clones. Approximately 2000 reads from the unamplified library yields 810 clusters, potentially representing individual genes expressed in the lens. After normalization, the content of crystallins and other abundant cDNAs is markedly reduced and a similar number of reads from this library (fs) yields 1455 unique groups of which only two thirds correspond to named genes in GenBank. Among the most abundant cDNAs is one for a novel gene related to glutamine synthetase, which was designated "lengsin" (LGS). Analyses of ESTs also reveal examples of alternative transcripts, including a major alternative splice form for the lens specific membrane protein MP19. Variant forms for other transcripts, including those encoding the apoptosis inhibitor Livin and the armadillo repeat protein ARVCF, are also described. The lens cDNA libraries are a resource for gene discovery, full length cDNAs for functional studies and microarrays. The discovery of an abundant, novel transcript, lengsin, and a major novel splice form of MP19 reflect the utility of unamplified libraries constructed from dissected tissue. Many novel transcripts and splice forms are represented, some of which may be candidates for genetic diseases.
Haitsma, Jack J.; Furmli, Suleiman; Masoom, Hussain; Liu, Mingyao; Imai, Yumiko; Slutsky, Arthur S.; Beyene, Joseph; Greenwood, Celia M. T.; dos Santos, Claudia
2012-01-01
Objectives To perform a meta-analysis of gene expression microarray data from animal studies of lung injury, and to identify an injury-specific gene expression signature capable of predicting the development of lung injury in humans. Methods We performed a microarray meta-analysis using 77 microarray chips across six platforms, two species and different animal lung injury models exposed to lung injury with or/and without mechanical ventilation. Individual gene chips were classified and grouped based on the strategy used to induce lung injury. Effect size (change in gene expression) was calculated between non-injurious and injurious conditions comparing two main strategies to pool chips: (1) one-hit and (2) two-hit lung injury models. A random effects model was used to integrate individual effect sizes calculated from each experiment. Classification models were built using the gene expression signatures generated by the meta-analysis to predict the development of lung injury in human lung transplant recipients. Results Two injury-specific lists of differentially expressed genes generated from our meta-analysis of lung injury models were validated using external data sets and prospective data from animal models of ventilator-induced lung injury (VILI). Pathway analysis of gene sets revealed that both new and previously implicated VILI-related pathways are enriched with differentially regulated genes. Classification model based on gene expression signatures identified in animal models of lung injury predicted development of primary graft failure (PGF) in lung transplant recipients with larger than 80% accuracy based upon injury profiles from transplant donors. We also found that better classifier performance can be achieved by using meta-analysis to identify differentially-expressed genes than using single study-based differential analysis. 
Conclusion Taken together, our data suggest that microarray analysis of gene expression data allows for the detection of “injury” gene predictors that can classify lung injury samples and identify patients at risk for clinically relevant lung injury complications. PMID:23071521
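The abstract above integrates per-experiment effect sizes with a random effects model. A minimal sketch of one common way to do this, the DerSimonian-Laird estimator, is shown below. The choice of estimator is an assumption, as the abstract does not name one, and the function name and input values are hypothetical.

```python
def random_effects_pool(effects, variances):
    """Pool per-study effect sizes with a DerSimonian-Laird random effects model.

    Returns (pooled effect, standard error, between-study variance tau^2).
    """
    k = len(effects)
    weights = [1.0 / v for v in variances]
    sum_w = sum(weights)
    # Fixed-effect estimate, needed for the heterogeneity statistic Q.
    fixed = sum(w * y for w, y in zip(weights, effects)) / sum_w
    q = sum(w * (y - fixed) ** 2 for w, y in zip(weights, effects))
    c = sum_w - sum(w * w for w in weights) / sum_w
    # Method-of-moments estimate of tau^2, truncated at zero.
    tau2 = max(0.0, (q - (k - 1)) / c) if c > 0 else 0.0
    # Re-weight each study by its total (within + between) variance.
    star = [1.0 / (v + tau2) for v in variances]
    pooled = sum(w * y for w, y in zip(star, effects)) / sum(star)
    se = (1.0 / sum(star)) ** 0.5
    return pooled, se, tau2


# Hypothetical effect sizes for one gene from two experiments.
pooled, se, tau2 = random_effects_pool([1.0, 3.0], [1.0, 1.0])
print(pooled, se, tau2)
```

When the studies agree closely, tau^2 is estimated as zero and the result reduces to the ordinary inverse-variance (fixed-effect) average.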
Role of the Chemokine MCP-1 in Sensitization of PKC-Mediated Apoptosis in Prostate Cancer Cells
2010-02-01
component. As phorbol esters are strong inducers of gene expression, we analyzed changes in gene expression using Affymetrix microarrays. These studies...were carried out at the UPenn Microarray Facility. We studied the dynamics of changes in gene expression by PMA at different times between 0 and 24 h...after PMA treatment. We identified ~ 5,000 PMA- genes up- or down-regulated by PMA (> 2-fold change), identified early and late genes , and classified
Peñagaricano, Francisco; Zorrilla, Pilar; Naya, Hugo; Robello, Carlos; Urioste, Jorge I
2012-02-01
The white coat colour of sheep is an important economic trait. For unknown reasons, some animals are born with, and others develop with time, black skin spots that can also produce pigmented fibres. The presence of pigmented fibres in the white wool significantly decreases the fibre quality. The aim of this work was to study gene expression in black spots (with and without pigmented fibres) and white skin by microarray techniques, in order to identify the possible genes involved in the development of this trait. Five unrelated Corriedale sheep were used and, for each animal, the three possible comparisons (three different hybridisations) between the three samples of interest were performed. Differential gene expression patterns were analysed using different t-test approaches. Most of the major genes with well-known roles in skin pigmentation, e.g. ASIP, MC1R and C-KIT, showed no significant difference in the gene expression between white skin and black spots. On the other hand, many of the differentially expressed genes (raw P-value < 0.005) detected in this study, e.g. C-FOS, KLF4 and UFC1, fulfil biological functions that are plausible to be involved in the formation of black spots. The gene expression of C-FOS and KLF4, transcription factors involved in the cellular response to external factors such as ultraviolet light, was validated by quantitative polymerase chain reaction (PCR). This exploratory study provides a list of candidate genes that could be associated with the development of black skin spots that should be studied in more detail. Characterisation of these genes will enable us to discern the molecular mechanisms involved in the development of this feature and, hence, increase our understanding of melanocyte biology and skin pigmentation. In sheep, understanding this phenomenon is a first step towards developing molecular tools to assist in the selection against the presence of pigmented fibres in white wool.
Characterization of microRNA profile in mammary tissue of dairy and beef breed heifers.
Wicik, Z; Gajewska, M; Majewska, A; Walkiewicz, D; Osińska, E; Motyl, T
2016-02-01
MicroRNAs (miRNAs) are small non-coding RNAs that participate in the regulation of gene expression. Their role during mammary gland development is still largely unknown. In this study, we performed a microarray analysis to identify miRNAs associated with high mammogenic potential of the bovine mammary gland. We identified 54 significantly differentially expressed miRNAs between the mammary tissue of dairy (Holstein-Friesian, HF) and beef (Limousin, LM) postpubertal heifers. Fifty-two miRNAs had higher expression in the mammary tissue of LM heifers. The expression of the top candidate miRNAs (bta-miR-10b, bta-miR-29b, bta-miR-101, bta-miR-375, bta-miR-2285t, bta-miR-146b, bta-let7b, bta-miR-107, bta-miR-1434-3p) identified in the microarray experiment was additionally evaluated by qPCR. Enrichment analyses for targeted genes revealed that the major differences between miRNA expression in the mammary gland of HF versus LM were associated with the regulation of signalling pathways that are crucial for mammary gland development, such as TGF-beta, insulin, WNT and inflammatory pathways. Moreover, a number of genes potentially targeted by significantly differentially expressed miRNAs were associated with the activity of mammary stem cells. These data indicate that the high developmental potential of the mammary gland in dairy cattle, leading to high milk productivity, depends also on a specific miRNA expression pattern. © 2015 Blackwell Verlag GmbH.
Laassri, Majid; Bidzhieva, Bella; Speicher, James; Pletnev, Alexander G; Chumakov, Konstantin
2011-05-01
Genetic stability is an important characteristic of live viral vaccines because an accumulation of mutants can cause reversion to a virulent phenotype as well as a loss of immunogenic properties. This study was aimed at evaluating the genetic stability of a live attenuated West Nile (WN) virus vaccine candidate that was generated by replacing the pre-membrane and envelope protein genes of dengue 4 virus with those from WN. Chimeric virus was serially propagated in Vero, SH-SY5Y human neuroblastoma and HeLa cells and screened for point mutations using hybridization with microarrays of overlapping oligonucleotide probes covering the entire genome. The analysis revealed several spontaneous mutations that led to amino acid changes, most of which were located in the envelope (E) and non-structural NS4A, NS4B, and NS5 proteins. Viruses passaged in Vero and SH-SY5Y cells shared two common mutations: G2337C (Met457Ile) in the E gene and A6751G (Lys125Arg) in the NS4A gene. Quantitative assessment of the contents of these mutants in viral stocks indicated that they accumulated independently with different kinetics during propagation in cell cultures. Mutant viruses grew better in Vero cells compared to the parental virus, suggesting that they have a higher fitness. When tested in newborn mice, the cell culture-passaged viruses did not exhibit increased neurovirulence. The approach described in this article could be useful for monitoring the molecular consistency and quality control of vaccine strains. Copyright © 2011 Wiley-Liss, Inc.
Laassri, Majid; Bidzhieva, Bella; Speicher, James; Pletnev, Alexander G.; Chumakov, Konstantin
2012-01-01
Genetic stability is an important characteristic of live viral vaccines because an accumulation of mutants can cause reversion to a virulent phenotype as well as a loss of immunogenic properties. This study was aimed at evaluating the genetic stability of a live attenuated West Nile (WN) virus vaccine candidate that was generated by replacing the pre-membrane and envelope protein genes of dengue 4 virus with those from WN. Chimeric virus was serially propagated in Vero, SH-SY5Y human neuroblastoma and HeLa cells and screened for point mutations using hybridization with microarrays of overlapping oligonucleotide probes covering the entire genome. The analysis revealed several spontaneous mutations that led to amino acid changes, most of which were located in the envelope (E) and non-structural NS4A, NS4B, and NS5 proteins. Viruses passaged in Vero and SH-SY5Y cells shared two common mutations: G2337C (Met457Ile) in the E gene and A6751G (Lys125Arg) in the NS4A gene. Quantitative assessment of the contents of these mutants in viral stocks indicated that they accumulated independently with different kinetics during propagation in cell cultures. Mutant viruses grew better in Vero cells compared to the parental virus, suggesting that they have a higher fitness. When tested in newborn mice, the cell culture-passaged viruses did not exhibit increased neurovirulence. The approach described in this paper could be useful for monitoring the molecular consistency and quality control of vaccine strains. PMID:21360544
Gene set analysis approaches for RNA-seq data: performance evaluation and application guideline
Rahmatallah, Yasir; Emmert-Streib, Frank
2016-01-01
Transcriptome sequencing (RNA-seq) is gradually replacing microarrays for high-throughput studies of gene expression. The main challenge of analyzing microarray data is not in finding differentially expressed genes, but in gaining insights into the biological processes underlying phenotypic differences. To interpret experimental results from microarrays, gene set analysis (GSA) has become the method of choice, in particular because it incorporates pre-existing biological knowledge (in the form of functionally related gene sets) into the analysis. Here we provide a brief review of several statistically different GSA approaches (competitive and self-contained) that can be adapted from microarray practice as well as those specifically designed for RNA-seq. We evaluate their performance (in terms of Type I error rate, power, robustness to the sample size and heterogeneity, as well as the sensitivity to different types of selection biases) on simulated and real RNA-seq data. Not surprisingly, the performance of various GSA approaches depends only on the statistical hypothesis they test and does not depend on whether the test was developed for microarrays or RNA-seq data. Interestingly, we found that competitive methods have lower power and lower robustness to sample heterogeneity than self-contained methods, leading to poor reproducibility of results. We also found that the power of unsupervised competitive methods depends on the balance between up- and down-regulated genes in tested gene sets. These properties of competitive methods have been overlooked before. Our evaluation provides a concise guideline for selecting the GSA approaches that perform best under particular experimental settings in the context of RNA-seq. PMID:26342128
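The distinction between self-contained and competitive gene set tests can be illustrated with a minimal sketch. It is not any of the methods evaluated in the abstract above. The gene-level statistic (absolute difference in group means), the set-level statistic (mean over genes), the exhaustive enumeration of null configurations, and all function names are assumptions chosen for illustration. A self-contained test asks whether the set shows any association with the phenotype, so its null distribution comes from relabeling samples. A competitive test asks whether the set is more associated than other genes, so its null distribution comes from other gene sets of the same size.

```python
from itertools import combinations
from statistics import mean


def gene_stat(values, group1):
    """Absolute difference in mean expression between the two sample groups."""
    in_group = [v for i, v in enumerate(values) if i in group1]
    out_group = [v for i, v in enumerate(values) if i not in group1]
    return abs(mean(in_group) - mean(out_group))


def set_stat(expr, genes, group1):
    """Set-level statistic: mean of the gene-level statistics."""
    return mean([gene_stat(expr[g], group1) for g in genes])


def self_contained_p(expr, gene_set, labels):
    """Null: no gene in the set is associated with the phenotype.
    The null distribution enumerates every relabeling of the samples."""
    n = len(labels)
    group1 = frozenset(i for i, lab in enumerate(labels) if lab == 1)
    observed = set_stat(expr, gene_set, group1)
    null = [set_stat(expr, gene_set, frozenset(c))
            for c in combinations(range(n), len(group1))]
    return sum(s >= observed for s in null) / len(null)


def competitive_p(expr, gene_set, labels):
    """Null: genes in the set are no more associated than other genes.
    The null distribution enumerates every gene set of the same size."""
    group1 = frozenset(i for i, lab in enumerate(labels) if lab == 1)
    per_gene = {g: gene_stat(v, group1) for g, v in expr.items()}
    observed = mean([per_gene[g] for g in gene_set])
    null = [mean([per_gene[g] for g in c])
            for c in combinations(sorted(expr), len(gene_set))]
    return sum(s >= observed for s in null) / len(null)


# Toy data (hypothetical): every gene is equally and strongly differential.
expr = {g: [0, 0, 0, 10, 10, 10] for g in ("g1", "g2", "g3", "g4")}
labels = [0, 0, 0, 1, 1, 1]
print(self_contained_p(expr, ["g1", "g2"], labels))  # 0.1, the minimum for 20 relabelings
print(competitive_p(expr, ["g1", "g2"], labels))     # 1.0, the set is no better than the rest
```

In this toy case the same gene set reaches the smallest attainable p-value under the self-contained null and is entirely unremarkable under the competitive null, which is why the two families of tests should not be compared as if they answered the same question.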
In vitro study of the effects of ELF electric fields on gene expression in human epidermal cells.
Collard, Jean-Francois; Mertens, Benjamin; Hinsenkamp, Maurice
2011-01-01
An acceleration of differentiation, at the expense of proliferation, is observed after exposure of various biological models to low frequency and low amplitude electric and electromagnetic fields. Following these results showing significant modifications, we try to identify the biological mechanism involved at the cell level through microarray screening. For this study, we use epidermis cultures harvested from human abdominoplasty. Two platinum electrodes are used to apply the electric signal. The expression of 38,500 well-characterized human genes is analyzed using Affymetrix® U133 Plus 2.0 microarray chips. The protocol is repeated on three different patients. After three periods of exposure, a total of 24 chips have been processed. After the application of ELF electric fields, the microarray analysis confirms a modification of the gene expression of epidermis cells. In particular, four up-regulated genes (DKK1, TXNRD1, ATF3, and MME) and one down-regulated gene (MACF1) are involved in the regulation of proliferation and differentiation. Expression of these five genes was also confirmed by real-time RT-PCR in all samples used for microarray analysis. These results corroborate an acceleration of cell differentiation at the expense of cell proliferation. © 2010 Wiley-Liss, Inc.
Yıldırım, Kubilay; Uylaş, Senem
2016-12-01
Boron (B) is an essential nutrient for normal plant growth. Despite its low abundance in soils, it can be highly toxic to plants, especially in arid and semi-arid environments. Poplars are known to tolerate B toxicity and accumulation. However, the physiological and gene regulation responses of these trees to B toxicity have not yet been investigated. Here, the B accumulation and tolerance levels of black poplar clones were first tested. Rooted cuttings of these clones were treated with elevated B concentrations to select the genotype with the highest B accumulation and tolerance. We then carried out a microarray-based transcriptome experiment on the leaves and roots of this genotype to identify the transcriptional networks, genes and molecular mechanisms behind B toxicity tolerance. The results indicated that black poplar is well suited for phytoremediation of B pollution. It could resist 15 ppm soil B content and >1500 ppm B accumulation in leaves, which are highly toxic concentrations for almost all agricultural plants. Transcriptomic results revealed a total of 1625 and 1419 altered probe sets under 15 ppm B toxicity in leaf and root tissues, respectively. The highest induction was recorded for the probe sets annotated to tyrosine aminotransferase, ATP binding cassette transporters, glutathione S transferases and metallochaperone proteins. Strong up-regulation of these genes was attributed to internal excretion of B into the cell vacuole and the existence of B detoxification processes in black poplar. Many other candidate genes functioning in signalling, gene regulation, antioxidation, B uptake and transport processes were also identified in this B hyperaccumulator plant for the first time in the current study. Copyright © 2016 Elsevier Masson SAS. All rights reserved.
Zheng, Qi; Zhang, Yong; Chen, Ying; Yang, Ning; Wang, Xiu-Jie; Zhu, Dahai
2009-02-22
The genetic closeness and divergent muscle growth rates of broilers and layers make them excellent models for the study of myogenesis. To discover the molecular mechanisms determining the divergent muscle growth rates and muscle mass control in different chicken lines, we systematically identified differentially expressed genes between broiler and layer skeletal muscle cells at different developmental stages using microarray hybridization experiments. In total, 543 differentially expressed genes were identified between broilers and layers across different developmental stages. We found that differential regulation of slow-type muscle gene expression, satellite cell proliferation and differentiation, protein degradation rate and genes in some metabolic pathways could contribute substantially to the divergent muscle growth rates of the two chicken lines. Interestingly, the expression profiles of a few differentially expressed genes were positively or negatively correlated with the growth rates of broilers and layers, indicating that those genes may function in regulating muscle growth during development. The multiple muscle cell growth regulatory processes identified by our study imply that complicated molecular networks are involved in the regulation of chicken muscle growth. These findings will not only offer genetic information for identifying candidate genes for chicken breeding, but also provide new clues for deciphering the mechanisms underlying muscle development in vertebrates.
Liu, Bao-Hong; Cai, Jian-Ping
2017-01-01
Salmonella enterica Pullorum is one of the leading causes of mortality in poultry. Understanding the molecular response in chickens in response to the infection by S. enterica is important in revealing the mechanisms of pathogenesis and disease progress. There have been studies on identifying genes associated with Salmonella infection by differential expression analysis, but the relationships among regulated genes have not been investigated. In this study, we employed weighted gene coexpression network analysis (WGCNA) and differential coexpression analysis (DCEA) to identify coexpression modules by exploring microarray data derived from chicken splenic tissues in response to the S. enterica infection. A total of 19 modules from 13,538 genes were associated with the Jak-STAT signaling pathway, the extracellular matrix, cytoskeleton organization, the regulation of the actin cytoskeleton, G-protein coupled receptor activity, Toll-like receptor signaling pathways, and immune system processes; among them, 14 differentially coexpressed modules (DCMs) and 2,856 differentially coexpressed genes (DCGs) were identified. The global expression of module genes between infected and uninfected chickens showed slight differences but considerable changes for global coexpression. Furthermore, DCGs were consistently linked to the hubs of the modules. These results will help prioritize candidate genes for future studies of Salmonella infection. PMID:28529955
Zhao, Zhongming; Guo, An-Yuan; van den Oord, Edwin J C G; Aliev, Fazil; Jia, Peilin; Edenberg, Howard J; Riley, Brien P; Dick, Danielle M; Bettinger, Jill C; Davies, Andrew G; Grotewiel, Michael S; Schuckit, Marc A; Agrawal, Arpana; Kramer, John; Nurnberger, John I; Kendler, Kenneth S; Webb, Bradley T; Miles, Michael F
2012-01-01
A variety of species and experimental designs have been used to study genetic influences on alcohol dependence, ethanol response, and related traits. Integration of these heterogeneous data can be used to produce a ranked target gene list for additional investigation. In this study, we performed a unique multi-species evidence-based data integration using three microarray experiments in mice or humans that generated an initial alcohol dependence (AD) related genes list, human linkage and association results, and gene sets implicated in C. elegans and Drosophila. We then used permutation and false discovery rate (FDR) analyses on the genome-wide association studies (GWAS) dataset from the Collaborative Study on the Genetics of Alcoholism (COGA) to evaluate the ranking results and weighting matrices. We found one weighting score matrix could increase FDR based q-values for a list of 47 genes with a score greater than 2. Our follow up functional enrichment tests revealed these genes were primarily involved in brain responses to ethanol and neural adaptations occurring with alcoholism. These results, along with our experimental validation of specific genes in mice, C. elegans and Drosophila, suggest that a cross-species evidence-based approach is useful to identify candidate genes contributing to alcoholism.
[Expression of cell adhesion molecules in acute leukemia cell].
Ju, Xiaoping; Peng, Min; Xu, Xiaoping; Lu, Shuqing; Li, Yao; Ying, Kang; Xie, Yi; Mao, Yumin; Xia, Fang
2002-11-01
To investigate the role of cell adhesion molecules in the development and extramedullary infiltration (EI) of acute leukemia. The expression of the neural cell adhesion molecule (NCAM), intercellular adhesion molecule-1 (ICAM-1) and vascular cell adhesion molecule-1 (VCAM-1) genes in bone marrow cells from 25 acute leukemia patients was detected by microarray and reverse transcriptase-polymerase chain reaction (RT-PCR). The expression of the NCAM, ICAM-1 and VCAM-1 genes was significantly higher in acute leukemia cells and leukemia cells with EI than in normal tissues and leukemia cells without EI, respectively, both by cDNA microarray and by RT-PCR. The cDNA microarray is a powerful technique for the analysis of genes associated with acute leukemia cells. High expression of cell adhesion molecule genes might be correlated with leukemia pathogenesis and infiltration of acute leukemia cells.
Shrinkage regression-based methods for microarray missing value imputation.
Wang, Hsiuying; Chiu, Chia-Chun; Wu, Yi-Ching; Wu, Wei-Sheng
2013-01-01
Missing values commonly occur in microarray data, which usually contain more than 5% missing values with up to 90% of genes affected. Inaccurate missing value estimation reduces the power of downstream microarray data analyses. Many types of methods have been developed to estimate missing values. Among them, the regression-based methods are very popular and have been shown to perform better than the other types of methods on many test microarray datasets. To further improve the performance of the regression-based methods, we propose shrinkage regression-based methods. Our methods take advantage of the correlation structure in the microarray data and select similar genes for the target gene by Pearson correlation coefficients. In addition, our methods incorporate the least squares principle, utilize a shrinkage estimation approach to adjust the coefficients of the regression model, and then use the new coefficients to estimate missing values. Simulation results show that the proposed methods provide more accurate missing value estimation in six test microarray datasets than the existing regression-based methods do. Imputation of missing values is a very important aspect of microarray data analyses because most of the downstream analyses require a complete dataset. Therefore, exploring accurate and efficient methods for estimating missing values has become an essential issue. Since our proposed shrinkage regression-based methods can provide accurate missing value estimation, they are competitive alternatives to the existing regression-based methods.
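The general recipe described above (select similar genes by Pearson correlation, fit least-squares regressions, shrink the coefficients, predict the missing entry) can be sketched as follows. This is only an illustration under stated assumptions: the abstract does not give the actual shrinkage estimator, so a fixed multiplicative factor `shrink` on the slope stands in for it, each similar gene is used in its own single-predictor regression, and the function names are hypothetical.

```python
from statistics import mean


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    mx, my = mean(x), mean(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        return 0.0
    return sxy / (sxx * syy) ** 0.5


def impute(target, candidates, missing_idx, k=2, shrink=0.9):
    """Estimate target[missing_idx] from the k most correlated complete genes.

    target     -- expression values of the target gene, None where missing
    candidates -- list of complete expression profiles (same sample order)
    shrink     -- assumed shrinkage factor applied to each fitted slope
    """
    observed = [i for i, v in enumerate(target) if v is not None]
    y = [target[i] for i in observed]
    my = mean(y)

    # Rank candidate genes by absolute correlation with the target gene.
    ranked = sorted(
        candidates,
        key=lambda g: abs(pearson([g[i] for i in observed], y)),
        reverse=True,
    )[:k]

    predictions = []
    for gene in ranked:
        x = [gene[i] for i in observed]
        mx = mean(x)
        sxx = sum((a - mx) ** 2 for a in x)
        # Ordinary least-squares slope, then shrunk toward zero.
        slope = 0.0 if sxx == 0 else sum(
            (a - mx) * (b - my) for a, b in zip(x, y)) / sxx
        slope *= shrink
        predictions.append(my + slope * (gene[missing_idx] - mx))
    return mean(predictions)


# Hypothetical toy data: the last sample of the target gene is missing.
target = [1.0, 2.0, 3.0, None]
candidates = [[2.0, 4.0, 6.0, 8.0], [5.0, 1.0, 4.0, 2.0]]
print(impute(target, candidates, missing_idx=3, k=1, shrink=1.0))  # 4.0 (no shrinkage)
print(impute(target, candidates, missing_idx=3, k=1, shrink=0.5))  # 3.0 (pulled toward the mean)
```

Shrinking the slope pulls the estimate toward the target gene's observed mean, which trades a little bias for lower variance when the regression is fitted on few samples.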
Nguyen, Doan H.; Toshida, Hiroshi; Schurr, Jill; Beuerman, Roger W.
2010-01-01
Previous studies showed that loss of muscarinic parasympathetic input to the lacrimal gland (LG) leads to a dramatic reduction in tear secretion and profound changes to LG structure. In this study, we used DNA microarrays to examine the regulation of the gene expression of the genes for secretory function and organization of the LG. Long-Evans rats anesthetized with a mixture of ketamine/xylazine (80:10 mg/kg) underwent unilateral sectioning of the greater superficial petrosal nerve, the input to the pterygopalatine ganglion. After 7 days, tear secretion was measured, the animals were killed, and structural changes in the LG were examined by light microscopy. Total RNA from control and experimental LGs (n = 5) was used for DNA microarray analysis employing the U34A GeneChip. Three statistical algorithms (detection, change call, and signal log ratio) were used to determine differential gene expression using the Microarray Suite (5.0) and Data Mining Tools (3.0). Tear secretion was significantly reduced and corneal ulcers developed in all experimental eyes. Light microscopy showed breakdown of the acinar structure of the LG. DNA microarray analysis showed downregulation of genes associated with the endoplasmic reticulum and Golgi, including genes involved in protein folding and processing. Conversely, transcripts for cytoskeleton and extracellular matrix components, inflammation, and apoptosis were upregulated. The number of significantly upregulated genes (116) was substantially greater than the number of downregulated genes (49). Removal of the main secretory input to the rat LG resulted in clinical symptoms associated with severe dry eye. Components of the secretory pathway were negatively affected, and the increase in cell proliferation and inflammation may lead to loss of organization in the parasympathectomized lacrimal gland. PMID:15084711
Computational Predictions Provide Insights into the Biology of TAL Effector Target Sites
Grau, Jan; Wolf, Annett; Reschke, Maik; Bonas, Ulla; Posch, Stefan; Boch, Jens
2013-01-01
Transcription activator-like (TAL) effectors are injected into host plant cells by Xanthomonas bacteria to function as transcriptional activators for the benefit of the pathogen. The DNA binding domain of TAL effectors is composed of conserved amino acid repeat structures containing repeat-variable diresidues (RVDs) that determine DNA binding specificity. In this paper, we present TALgetter, a new approach for predicting TAL effector target sites based on a statistical model. In contrast to previous approaches, the parameters of TALgetter are estimated from training data computationally. We demonstrate that TALgetter successfully predicts known TAL effector target sites and often yields a greater number of predictions that are consistent with up-regulation in gene expression microarrays than an existing approach, Target Finder of the TALE-NT suite. We study the binding specificities estimated by TALgetter and confirm that different RVDs differ in their importance for transcriptional activation. In subsequent studies, the predictions of TALgetter indicate a previously unreported positional preference of TAL effector target sites relative to the transcription start site. In addition, several TAL effectors are predicted to bind to the TATA-box, which might constitute one general mode of transcriptional activation by TAL effectors. Scrutinizing the predicted target sites of TALgetter, we propose several novel TAL effector virulence targets in rice and sweet orange. TAL-mediated induction of the candidates is supported by gene expression microarrays. Validity of these targets is also supported by functional analogy to known TAL effector targets, by an over-representation of TAL effector targets with similar function, or by a biological function related to pathogen infection. Hence, these predicted TAL effector virulence targets are promising candidates for studying the virulence function of TAL effectors.
TALgetter is implemented as part of the open-source Java library Jstacs, and is freely available as a web-application and a command line program. PMID:23526890
MicroRNA-integrated and network-embedded gene selection with diffusion distance.
Huang, Di; Zhou, Xiaobo; Lyon, Christopher J; Hsueh, Willa A; Wong, Stephen T C
2010-10-29
Gene network information has been used to improve gene selection in microarray-based studies by selecting marker genes based both on their expression and the coordinate expression of genes within their gene network under a given condition. Here we propose a new network-embedded gene selection model. In this model, we first address the limitations of microarray data. Microarray data, although widely used for gene selection, measures only mRNA abundance, which does not always reflect the ultimate gene phenotype, since it does not account for post-transcriptional effects. To overcome this important (in certain cases critical) limitation, which almost all existing studies ignore, we design a new strategy that integrates microarray data with information on microRNA, the major post-transcriptional regulatory factor. We also address the challenges posed by gene collaboration mechanisms. To incorporate the biological facts that genes without direct interactions may work closely together due to signal transduction and that two genes may be functionally connected through multiple paths, we adopt the concept of diffusion distance. This concept permits us to simulate biological signal propagation and therefore to estimate the collaboration probability for all gene pairs, directly or indirectly connected, according to the multiple paths connecting them. We demonstrate, using type 2 diabetes (DM2) as an example, that the proposed strategies can enhance the identification of functional gene partners, which is the key issue in a network-embedded gene selection model. More importantly, we show that our gene selection model outperforms related ones. Genes selected by our model 1) have improved classification capability; 2) agree with biological evidence of DM2-association; and 3) are involved in many well-known DM2-associated pathways.
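To make the idea of diffusion distance concrete, the sketch below uses one standard random-walk formulation on an unweighted, undirected, connected network: two nodes are close when t-step random walks started from them end up distributed similarly over the network. Whether the model summarized above uses exactly this definition is not stated in the abstract, so the formula, the choice of t, and all names here are assumptions.

```python
from math import sqrt


def transition_matrix(n, edges):
    """Row-normalized adjacency matrix and stationary distribution.
    Assumes an undirected, connected graph with no isolated nodes."""
    adj = [[0.0] * n for _ in range(n)]
    for a, b in edges:
        adj[a][b] = adj[b][a] = 1.0
    degree = [sum(row) for row in adj]
    total = sum(degree)
    P = [[adj[i][j] / degree[i] for j in range(n)] for i in range(n)]
    stationary = [d / total for d in degree]
    return P, stationary


def matmul(A, B):
    """Plain square matrix product."""
    n = len(A)
    return [[sum(A[i][k] * B[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def diffusion_distance(n, edges, i, j, t=2):
    """Distance between the t-step random-walk distributions started at
    nodes i and j, weighted by the inverse stationary probability."""
    P, pi = transition_matrix(n, edges)
    Pt = P
    for _ in range(t - 1):
        Pt = matmul(Pt, P)
    return sqrt(sum((Pt[i][k] - Pt[j][k]) ** 2 / pi[k] for k in range(n)))


# Hypothetical network: two triangles (0,1,2) and (3,4,5) joined by edge 2-3.
edges = [(0, 1), (1, 2), (0, 2), (2, 3), (3, 4), (4, 5), (3, 5)]
print(diffusion_distance(6, edges, 0, 1))  # small: same module
print(diffusion_distance(6, edges, 0, 5))  # larger: different modules

# Nodes 0 and 2 on the path 0-1-2 share no edge but have identical
# one-step walks, so their diffusion distance at t=1 is zero.
print(diffusion_distance(3, [(0, 1), (1, 2)], 0, 2, t=1))
```

The last example shows the property the abstract relies on: two genes with no direct interaction can still be treated as close when all paths through the network are taken into account.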
2013-01-01
Background The Grooved Carpet shell clam Ruditapes decussatus is the autochthonous European clam and the most appreciated from a gastronomic and economic point of view. The production is in decline due to several factors such as Perkinsiosis and habitat invasion and competition by the introduced exotic species, the manila clam Ruditapes philippinarum. After sequencing the R. decussatus transcriptome, we designed an oligo microarray to provide some clues on the molecular response of the clam to Perkinsiosis. Results A database consisting of 41,119 unique transcripts was constructed, of which 12,479 (30.3%) were annotated by similarity. An oligo-DNA microarray platform was then designed and applied to profile gene expression in R. decussatus heavily infected by Perkinsus olseni. Functional annotation of differentially expressed genes between those two conditions was performed by gene set enrichment analysis. As expected, microarrays unveiled genes related to stress and infectious agents, such as hydrolases, proteases and others. The extensive role of the innate immune system was also analyzed and the effect of parasitosis upon expression of important molecules such as lectins reviewed. Conclusions This study represents a first attempt to characterize the Ruditapes decussatus transcriptome, an important marine resource for European aquaculture. The transcriptome sequencing and consequent annotation will increase the available tools and resources for this species, introducing the possibility of high throughput experiments such as microarray analysis. In this specific case, the microarray approach was used to unveil some important aspects of host-parasite interaction between the Carpet shell clam and Perkinsus, two non-model species, highlighting some genes associated with this interaction. Ample information was obtained to identify biological processes significantly enriched among differentially expressed genes in Perkinsus infected versus non-infected gills.
An overview on the genes related with the immune system on R. decussatus transcriptome is also reported. PMID:24168212
Leite, Ricardo B; Milan, Massimo; Coppe, Alessandro; Bortoluzzi, Stefania; dos Anjos, António; Reinhardt, Richard; Saavedra, Carlos; Patarnello, Tomaso; Cancela, M Leonor; Bargelloni, Luca
2013-10-29
The Grooved Carpet shell clam Ruditapes decussatus is the autochthonous European clam and the most appreciated from a gastronomic and economic point of view. The production is in decline due to several factors such as Perkinsiosis and habitat invasion and competition by the introduced exotic species, the manila clam Ruditapes philippinarum. After sequencing the R. decussatus transcriptome, we designed an oligo microarray to provide some clues on the molecular response of the clam to Perkinsiosis. A database consisting of 41,119 unique transcripts was constructed, of which 12,479 (30.3%) were annotated by similarity. An oligo-DNA microarray platform was then designed and applied to profile gene expression in R. decussatus heavily infected by Perkinsus olseni. Functional annotation of differentially expressed genes between those two conditions was performed by gene set enrichment analysis. As expected, microarrays unveiled genes related to stress and infectious agents, such as hydrolases, proteases and others. The extensive role of the innate immune system was also analyzed and the effect of parasitosis upon expression of important molecules such as lectins reviewed. This study represents a first attempt to characterize the Ruditapes decussatus transcriptome, an important marine resource for European aquaculture. The transcriptome sequencing and consequent annotation will increase the available tools and resources for this species, introducing the possibility of high throughput experiments such as microarray analysis. In this specific case, the microarray approach was used to unveil some important aspects of host-parasite interaction between the Carpet shell clam and Perkinsus, two non-model species, highlighting some genes associated with this interaction. Ample information was obtained to identify biological processes significantly enriched among differentially expressed genes in Perkinsus infected versus non-infected gills.
An overview on the genes related with the immune system on R. decussatus transcriptome is also reported.
Gene Expression Analyses of Subchondral Bone in Early Experimental Osteoarthritis by Microarray
Chen, YuXian; Shen, Jun; Lu, HuaDing; Zeng, Chun; Ren, JianHua; Zeng, Hua; Li, ZhiFu; Chen, ShaoMing; Cai, DaoZhang; Zhao, Qing
2012-01-01
Osteoarthritis (OA) is a degenerative joint disease that affects both cartilage and bone. A better understanding of the early molecular changes in subchondral bone may help elucidate the pathogenesis of OA. We used microarray technology to investigate the time course of molecular changes in the subchondral bone in the early stages of experimental osteoarthritis in a rat model. We identified 2,234 differentially expressed (DE) genes at 1 week, 1,944 at 2 weeks and 1,517 at 4 weeks post-surgery. Further analyses of the dysregulated genes indicated that the events underlying subchondral bone remodeling occurred sequentially and in a time-dependent manner at the gene expression level. Some of the dysregulated genes identified have suspected roles in bone development or remodeling; these genes include Alp, Igf1, Tgf β1, Postn, Mmp3, Tnfsf11, Acp5, Bmp5, Aspn and Ihh. The differences in the expression of these genes were confirmed by real-time PCR, and the results indicated that our microarray data accurately reflected gene expression patterns characteristic of early OA. To validate the results of our microarray analysis at the protein level, immunohistochemistry staining was used to investigate the expression of Mmp3 and Aspn protein in tissue sections. These analyses indicate that Mmp3 protein expression completely matched the results of both the microarray and real-time PCR analyses; however, Aspn protein expression was not observed to differ at any time. In summary, our study demonstrated a simple method for separating subchondral bone samples from the rat knee joint, which can effectively avoid bone RNA degradation. These findings also revealed the gene expression profiles of subchondral bone in the rat OA model at multiple time points post-surgery and identified important DE genes with known or suspected roles in bone development or remodeling. These genes may be novel diagnostic markers or therapeutic targets for OA. PMID:22384228
Genome Wide Methylome Alterations in Lung Cancer.
Mullapudi, Nandita; Ye, Bin; Suzuki, Masako; Fazzari, Melissa; Han, Weiguo; Shi, Miao K; Marquardt, Gaby; Lin, Juan; Wang, Tao; Keller, Steven; Zhu, Changcheng; Locker, Joseph D; Spivack, Simon D
2015-01-01
Aberrant cytosine 5-methylation underlies many deregulated elements of cancer. Among paired non-small cell lung cancers (NSCLC), we sought to profile DNA 5-methyl-cytosine features which may underlie genome-wide deregulation. In one of the more dense interrogations of the methylome, we sampled 1.2 million CpG sites from twenty-four NSCLC tumor (T)-non-tumor (NT) pairs using a methylation-sensitive restriction enzyme-based HELP-microarray assay. We found 225,350 differentially methylated (DM) sites in adenocarcinomas versus adjacent non-tumor tissue that vary in frequency across genomic compartments, particularly notable in gene bodies (GB; p<2.2E-16). Further, when DM was coupled to differential transcriptome (DE) in the same samples, 37,056 differential loci in adenocarcinoma emerged. Approximately 90% of the DM-DE relationships were non-canonical; for example, promoter DM associated with DE in the same direction. Of the canonical changes noted, promoter (PR) DM loci with reciprocal changes in expression in adenocarcinomas included HBEGF, AGER, PTPRM, DPT, CST1, MELK; DM GB loci with concordant changes in expression included FOXM1, FERMT1, SLC7A5, and FAP genes. IPA analyses showed that the adenocarcinoma-specific promoter DMxDE overlay identified familiar lung cancer nodes [tP53, Akt] as well as less familiar nodes [HBEGF, NQO1, GRK5, VWF, HPGD, CDH5, CTNNAL1, PTPN13, DACH1, SMAD6, LAMA3, AR]. The unique findings from this study include the discovery of numerous candidate methylation sites in both PR and GB regions not previously identified in NSCLC, and many non-canonical relationships to gene expression. These DNA methylation features could potentially be developed as risk or diagnostic biomarkers, or as candidate targets for newer methylation locus-targeted preventive or therapeutic agents.
ERIC Educational Resources Information Center
Rowland-Goldsmith, Melissa
2009-01-01
DNA microarray is an ordered grid containing known sequences of DNA, which represent many of the genes in a particular organism. Each DNA sequence is unique to a specific gene. This technology enables the researcher to screen many genes from cells or tissue grown in different conditions. We developed an undergraduate lecture and laboratory…
SORBS2 and TLR3 induce premature senescence in primary human fibroblasts and keratinocytes
2013-01-01
Background Genetic aberrations are required for the progression of HPV-induced cervical precancers. A prerequisite for clonal expansion of cancer cells is unlimited proliferative capacity. In a cell culture model for cervical carcinogenesis loss of genes located on chromosome 4q35→qter and chromosome 10p14-p15 were found to be associated with escape from senescence. Moreover, by LOH and I-FISH analyses a higher frequency of allele loss of these regions was also observed in cervical carcinomas as compared to CIN3. The aim of this study was to identify candidate senescence-related genes located on chromosome 4q35→qter and chromosome 10p14-p15 which may contribute to clonal expansion at the transition of CIN3 to cancer. Methods Microarray expression analyses were used to identify candidate genes down-regulated in cervical carcinomas as compared to CIN3. In order to relate these genes with the process of senescence their respective cDNAs were overexpressed in HPV16-immortalized keratinocytes as well as in primary human fibroblasts and keratinocytes using lentivirus mediated gene transduction. Results Overall fifteen genes located on chromosome 4q35→qter and chromosome 10p14-p15 were identified. Ten of these genes could be validated in biopsies by RT-PCR. Of interest is the novel finding that SORBS2 and TLR3 can induce senescence in primary human fibroblasts and keratinocytes but not in HPV-immortalized cell lines. Intriguingly, the endogenous expression of both genes increases during finite passaging of primary keratinocytes in vitro. Conclusions The relevance of the genes SORBS2 and TLR3 in the process of cellular senescence warrants further investigation. In ongoing experiments we are investigating whether this increase in gene expression is also characteristic of replicative senescence. PMID:24165198
Alshamlan, Hala; Badr, Ghada; Alohali, Yousef
2015-01-01
An artificial bee colony (ABC) is a relatively recent swarm intelligence optimization approach. In this paper, we propose the first attempt at applying the ABC algorithm to the analysis of a microarray gene expression profile. In addition, we propose an innovative feature selection algorithm, minimum redundancy maximum relevance (mRMR), and combine it with an ABC algorithm, mRMR-ABC, to select informative genes from microarray profiles. The new approach uses a support vector machine (SVM) algorithm to measure the classification accuracy of the selected genes. We evaluate the performance of the proposed mRMR-ABC algorithm by conducting extensive experiments on six binary and multiclass gene expression microarray datasets. Furthermore, we compare our proposed mRMR-ABC algorithm with previously known techniques. For the sake of a fair comparison, we reimplemented two of these techniques using the same parameters: mRMR combined with a genetic algorithm (mRMR-GA) and mRMR combined with a particle swarm optimization algorithm (mRMR-PSO). The experimental results show that the proposed mRMR-ABC algorithm achieves accurate classification performance using a small number of predictive genes when tested on both the binary and multiclass datasets and compared to previously suggested methods. This shows that mRMR-ABC is a promising approach for solving gene selection and cancer classification problems. PMID:25961028
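The mRMR filtering step named in this abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes expression values have already been discretized (here to 0/1), uses the common "relevance minus mean redundancy" greedy criterion, and leaves out the ABC search and the SVM evaluation entirely. The function names and the toy gene names (`geneA`, `geneA_copy`, `geneC`) are hypothetical.

```python
import math
from collections import Counter


def mutual_information(xs, ys):
    """Mutual information (in nats) between two equal-length discrete sequences."""
    n = len(xs)
    px, py, pxy = Counter(xs), Counter(ys), Counter(zip(xs, ys))
    mi = 0.0
    for (x, y), c in pxy.items():
        # p(x, y) * log( p(x, y) / (p(x) * p(y)) ), written with raw counts
        mi += (c / n) * math.log(c * n / (px[x] * py[y]))
    return mi


def mrmr_select(features, labels, k):
    """Greedily pick k features by relevance minus mean redundancy.

    features: dict mapping a gene name to its list of discretized values
    labels:   list of class labels, one per sample
    """
    relevance = {g: mutual_information(v, labels) for g, v in features.items()}
    selected = []
    candidates = set(features)

    def score(gene):
        if not selected:
            return relevance[gene]
        redundancy = sum(
            mutual_information(features[gene], features[s]) for s in selected
        ) / len(selected)
        return relevance[gene] - redundancy

    while candidates and len(selected) < k:
        # sorted() makes tie-breaking deterministic
        best = max(sorted(candidates), key=score)
        selected.append(best)
        candidates.remove(best)
    return selected


# Toy data (assumed, for illustration only): 8 samples, 2 classes.
labels = [0, 0, 0, 0, 1, 1, 1, 1]
features = {
    "geneA": [0, 0, 0, 1, 1, 1, 1, 1],       # informative
    "geneA_copy": [0, 0, 0, 1, 1, 1, 1, 1],  # fully redundant with geneA
    "geneC": [1, 0, 0, 0, 1, 1, 1, 0],       # weaker, but not redundant
}
selected = mrmr_select(features, labels, 2)
print(selected)
```

On this toy input, the redundant copy is penalized by its high mutual information with the gene already chosen, so the second pick goes to the weaker but complementary gene. In the approach described above, a subset like this would then be refined by the ABC search and scored by SVM classification accuracy.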
Bourguignon, Natalia; Bargiela, Rafael; Rojo, David; Chernikova, Tatyana N; de Rodas, Sara A López; García-Cantalejo, Jesús; Näther, Daniela J; Golyshin, Peter N; Barbas, Coral; Ferrero, Marcela; Ferrer, Manuel
2016-12-01
The analysis of catabolic capacities of microorganisms is currently often achieved by cultivation approaches and by the analysis of genomic or metagenomic datasets. Recently, a microarray system based on curated key aromatic catabolic gene families and key alkane degradation genes was developed. The collection of genes in the microarray can be exploited to indicate whether a given microbe or microbial community is likely to be functionally connected with certain degradative phenotypes, without previous knowledge of genome data. Herein, this microarray was applied to capture new insights into the catabolic capacities of the copper-resistant actinomycete Amycolatopsis tucumanensis DSM 45259. The array data support the presumptive ability of the DSM 45259 strain to utilize single alkanes (n-decane and n-tetradecane) and aromatics such as benzoate, phthalate and phenol as sole carbon sources, which was experimentally validated by cultivation and mass spectrometry. Interestingly, while in strain DSM 45259 the alkB gene encoding an alkane hydroxylase is most likely highly similar to that found in other actinomycetes, the genes encoding benzoate 1,2-dioxygenase, phthalate 4,5-dioxygenase and phenol hydroxylase were homologous to proteobacterial genes. This suggests that strain DSM 45259 contains catabolic genes distantly related to those found in other actinomycetes. Together, this study not only provides new insight into the catabolic abilities of strain DSM 45259, but also suggests that this strain contains genes uncommon within actinomycetes.