DNA microarrays: a powerful genomic tool for biomedical and clinical research
Trevino, Victor; Falciani, Francesco; Barrera-Saldaña, Hugo A.
2007-01-01
Among the many benefits of the Human Genome Project are new and powerful tools such as the genome-wide hybridization devices referred to as microarrays. Initially designed to measure gene transcriptional levels, microarray technologies are now used for comparing other genome features among individuals and their tissues and cells. Results provide valuable information on disease subcategories, disease prognosis, and treatment outcome. Likewise, they reveal differences in genetic makeup, regulatory mechanisms, and subtle variations, bringing the era of personalized medicine closer. To understand this powerful tool, its versatility, and how it is dramatically changing the molecular approach to biomedical and clinical research, this review describes the technology and its applications, gives a didactic step-by-step review of a typical microarray protocol, and presents a real experiment. Finally, it calls on the medical community to form multidisciplinary teams to take advantage of this technology and its expanding applications, which on a single slide reveal our genetic inheritance and destiny. PMID:17660860
Importing MAGE-ML format microarray data into BioConductor.
Durinck, Steffen; Allemeersch, Joke; Carey, Vincent J; Moreau, Yves; De Moor, Bart
2004-12-12
The microarray gene expression markup language (MAGE-ML) is a widely used XML (eXtensible Markup Language) standard for describing and exchanging information about microarray experiments. It can describe microarray designs, microarray experiment designs, gene expression data and data analysis results. We describe RMAGEML, a new Bioconductor package that provides a link between cDNA microarray data stored in MAGE-ML format and the Bioconductor framework for preprocessing, visualization and analysis of microarray experiments. http://www.bioconductor.org. Open Source.
cDNA microarray analysis of esophageal cancer: discoveries and prospects.
Shimada, Yutaka; Sato, Fumiaki; Shimizu, Kazuharu; Tsujimoto, Gozoh; Tsukada, Kazuhiro
2009-07-01
Recent progress in molecular biology has revealed many genetic and epigenetic alterations that are involved in the development and progression of esophageal cancer. Microarray analysis has also revealed several genetic networks that are involved in esophageal cancer. However, clinical application of microarray techniques and use of microarray data have not yet occurred. In this review, we focus on the recent developments and problems with microarray analysis of esophageal cancer.
[DNA microarray reveals changes in gene expression of endothelial cells under shear stress].
Cheng, Min; Zhang, Wensheng; Chen, Huaiqing; Wu, Wenchao; Huang, Hua
2004-04-01
cDNA microarray technology was used as a powerful tool for rapid, comprehensive, and quantitative analysis of gene expression profiles of cultured human umbilical vein endothelial cells (HUVECs) in the normal static group and the shear-stressed (4.20 dyne/cm², 2 h) group. The total RNA from normal static cultured HUVECs was labeled with Cy3-dCTP, and total RNA of HUVECs from the paired shear-stressed experiment was labeled with Cy5-dCTP. The expression ratios reported are the average from the two separate experiments. After bioinformatics analysis, we identified a total of 108 genes (approximately 0.026%) showing differential expression. Of these, 53 genes were up-regulated, the most enhanced being human homolog of yeast IPP isomerase, human low density lipoprotein receptor gene, squalene epoxidase gene, and 7-dehydrocholesterol reductase, and 55 were down-regulated, the most decreased being heat shock 70 kD protein 1 and the TCB gene encoding cytosolic thyroid hormone-binding protein, in HUVECs exposed to low shear stress. These results indicate that the cDNA microarray technique is effective in screening for differentially expressed genes in endothelial cells under various experimental conditions, and the data may stimulate further research.
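The ratio averaging described in this abstract can be illustrated with a small sketch. It assumes hypothetical background-corrected intensities and averages the two experiments in log2 space (a geometric mean); the abstract does not state how the averaging was done or which cutoff defined differential expression, so both choices here are assumptions.

```python
import math

def log2_ratio(cy5, cy3):
    """log2(Cy5/Cy3): positive means higher signal in the shear-stressed sample."""
    return math.log2(cy5 / cy3)

def mean_log2_ratio(replicates):
    """Average the log2 ratios over replicate experiments.

    replicates: list of (cy5, cy3) intensity pairs for one gene.
    """
    ratios = [log2_ratio(cy5, cy3) for cy5, cy3 in replicates]
    return sum(ratios) / len(ratios)

# Hypothetical intensities for one gene in two separate experiments
# (illustrative values, not data from the study).
gene_replicates = [(2000.0, 500.0), (1600.0, 400.0)]

m = mean_log2_ratio(gene_replicates)
fold_change = 2 ** m
print(f"mean log2 ratio = {m:.2f}, fold change = {fold_change:.1f}")
```

Averaging in log space treats a twofold increase and a twofold decrease symmetrically, which is why it is a common choice for two-color ratio data.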
Burgarella, Sarah; Cattaneo, Dario; Masseroli, Marco
2006-01-01
We developed MicroGen, a multi-database Web based system for managing all the information characterizing spotted microarray experiments. It supports information gathering and storing according to the Minimum Information About Microarray Experiments (MIAME) standard. It also allows easy sharing of information and data among all multidisciplinary actors involved in spotted microarray experiments. PMID:17238488
Richard, Arianne C; Lyons, Paul A; Peters, James E; Biasci, Daniele; Flint, Shaun M; Lee, James C; McKinney, Eoin F; Siegel, Richard M; Smith, Kenneth G C
2014-08-04
Although numerous investigations have compared gene expression microarray platforms, preprocessing methods and batch correction algorithms using constructed spike-in or dilution datasets, there remains a paucity of studies examining the properties of microarray data using diverse biological samples. Most microarray experiments seek to identify subtle differences between samples with variable background noise, a scenario poorly represented by constructed datasets. Thus, microarray users lack important information regarding the complexities introduced in real-world experimental settings. The recent development of a multiplexed, digital technology for nucleic acid measurement enables counting of individual RNA molecules without amplification and, for the first time, permits such a study. Using a set of human leukocyte subset RNA samples, we compared previously acquired microarray expression values with RNA molecule counts determined by the nCounter Analysis System (NanoString Technologies) in selected genes. We found that gene measurements across samples correlated well between the two platforms, particularly for high-variance genes, while genes deemed unexpressed by the nCounter generally had both low expression and low variance on the microarray. Confirming previous findings from spike-in and dilution datasets, this "gold-standard" comparison demonstrated signal compression that varied dramatically by expression level and, to a lesser extent, by dataset. Most importantly, examination of three different cell types revealed that noise levels differed across tissues. Microarray measurements generally correlate with relative RNA molecule counts within optimal ranges but suffer from expression-dependent accuracy bias and precision that varies across datasets. We urge microarray users to consider expression-level effects in signal interpretation and to evaluate noise properties in each dataset independently.
Computational synchronization of microarray data with application to Plasmodium falciparum.
Zhao, Wei; Dauwels, Justin; Niles, Jacquin C; Cao, Jianshu
2012-06-21
Microarrays are widely used to investigate the blood stage of Plasmodium falciparum infection. Starting with synchronized cells, gene expression levels are continually measured over the 48-hour intra-erythrocytic cycle (IDC). However, the cell population gradually loses synchrony during the experiment. As a result, the microarray measurements are blurred. In this paper, we propose a generalized deconvolution approach to reconstruct the intrinsic expression pattern, and apply it to P. falciparum IDC microarray data. We develop a statistical model for the decay of synchrony among cells, and reconstruct the expression pattern through statistical inference. The proposed method can handle microarray measurements with noise and missing data. The original gene expression patterns become more apparent in the reconstructed profiles, making it easier to analyze and interpret the data. We hypothesize that reconstructed gene expression patterns represent better temporally resolved expression profiles that can be probabilistically modeled to match changes in expression level to IDC transitions. In particular, we identify transcriptionally regulated protein kinases putatively involved in regulating the P. falciparum IDC. By analyzing publicly available microarray data sets for the P. falciparum IDC, protein kinases are ranked in terms of their likelihood to be involved in regulating transitions between the ring, trophozoite and schizont developmental stages of the P. falciparum IDC. In our theoretical framework, a few protein kinases have high probability rankings, and could potentially be involved in regulating these developmental transitions. This study proposes a new methodology for extracting intrinsic expression patterns from microarray data. By applying this method to P. falciparum microarray data, several protein kinases are predicted to play a significant role in the P. falciparum IDC. 
Earlier experiments have indeed confirmed that several of these kinases are involved in this process. Overall, these results indicate that further functional analysis of these additional putative protein kinases may reveal new insights into how the P. falciparum IDC is regulated.
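To make the blurring problem in this abstract concrete, the sketch below simulates only the forward direction: how a population-averaged measurement flattens when cells drift out of phase. It is not the authors' deconvolution method. The wrapped Gaussian phase spread, its linear growth with time, and the sinusoidal intrinsic profile are all assumptions chosen for illustration.

```python
import math

CYCLE_HOURS = 48  # length of the intra-erythrocytic cycle given in the abstract

def blur_weights(sigma, cycle=CYCLE_HOURS):
    """Normalized wrapped-Gaussian weights over phase offsets (in hours)."""
    if sigma == 0:
        # Perfect synchrony: every cell is at the nominal phase.
        return [1.0] + [0.0] * (cycle - 1)
    weights = []
    for d in range(cycle):
        offset = min(d, cycle - d)  # distance around the circular cycle
        weights.append(math.exp(-0.5 * (offset / sigma) ** 2))
    total = sum(weights)
    return [w / total for w in weights]

def observed_profile(intrinsic, sigma_per_hour):
    """Population average when the phase spread grows linearly with time."""
    cycle = len(intrinsic)
    observed = []
    for t in range(cycle):
        w = blur_weights(sigma_per_hour * t, cycle)
        observed.append(sum(w[d] * intrinsic[(t - d) % cycle] for d in range(cycle)))
    return observed

# Hypothetical intrinsic profile: a single sinusoidal peak per cycle.
intrinsic = [1.0 + math.sin(2 * math.pi * t / CYCLE_HOURS) for t in range(CYCLE_HOURS)]
observed = observed_profile(intrinsic, sigma_per_hour=0.2)

print(f"peak at 12 h:   intrinsic {intrinsic[12]:.2f}, observed {observed[12]:.2f}")
print(f"trough at 36 h: intrinsic {intrinsic[36]:.2f}, observed {observed[36]:.2f}")
```

In this toy model the observed peak is lower and the observed trough higher than the intrinsic values, and the distortion grows later in the time course. A deconvolution approach aims to invert that kind of mapping in the presence of noise and missing data.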
Shaw, Joseph R; Colbourne, John K; Davey, Jennifer C; Glaholt, Stephen P; Hampton, Thomas H; Chen, Celia Y; Folt, Carol L; Hamilton, Joshua W
2007-12-21
Genomic research tools such as microarrays are proving to be important resources to study the complex regulation of genes that respond to environmental perturbations. A first generation cDNA microarray was developed for the environmental indicator species Daphnia pulex, to identify genes whose regulation is modulated following exposure to the metal stressor cadmium. Our experiments revealed interesting changes in gene transcription that suggest their biological roles and their potentially toxicological features in responding to this important environmental contaminant. Our microarray identified genes reported in the literature to be regulated in response to cadmium exposure, suggested functional attributes for genes that share no sequence similarity to proteins in the public databases, and pointed to genes that are likely members of expanded gene families in the Daphnia genome. Genes identified on the microarray also were associated with cadmium induced phenotypes and population-level outcomes that we experimentally determined. A subset of genes regulated in response to cadmium exposure was independently validated using quantitative-realtime (Q-RT)-PCR. These microarray studies led to the discovery of three genes coding for the metal detoxication protein metallothionein (MT). The gene structures and predicted translated sequences of D. pulex MTs clearly place them in this gene family. Yet, they share little homology with previously characterized MTs. The genomic information obtained from this study represents an important first step in characterizing microarray patterns that may be diagnostic to specific environmental contaminants and give insights into their toxicological mechanisms, while also providing a practical tool for evolutionary, ecological, and toxicological functional gene discovery studies. Advances in Daphnia genomics will enable the further development of this species as a model organism for the environmental sciences.
Shaw, Joseph R; Colbourne, John K; Davey, Jennifer C; Glaholt, Stephen P; Hampton, Thomas H; Chen, Celia Y; Folt, Carol L; Hamilton, Joshua W
2007-01-01
Background: Genomic research tools such as microarrays are proving to be important resources to study the complex regulation of genes that respond to environmental perturbations. A first generation cDNA microarray was developed for the environmental indicator species Daphnia pulex, to identify genes whose regulation is modulated following exposure to the metal stressor cadmium. Our experiments revealed interesting changes in gene transcription that suggest their biological roles and their potentially toxicological features in responding to this important environmental contaminant. Results: Our microarray identified genes reported in the literature to be regulated in response to cadmium exposure, suggested functional attributes for genes that share no sequence similarity to proteins in the public databases, and pointed to genes that are likely members of expanded gene families in the Daphnia genome. Genes identified on the microarray also were associated with cadmium induced phenotypes and population-level outcomes that we experimentally determined. A subset of genes regulated in response to cadmium exposure was independently validated using quantitative-realtime (Q-RT)-PCR. These microarray studies led to the discovery of three genes coding for the metal detoxication protein metallothionein (MT). The gene structures and predicted translated sequences of D. pulex MTs clearly place them in this gene family. Yet, they share little homology with previously characterized MTs. Conclusion: The genomic information obtained from this study represents an important first step in characterizing microarray patterns that may be diagnostic to specific environmental contaminants and give insights into their toxicological mechanisms, while also providing a practical tool for evolutionary, ecological, and toxicological functional gene discovery studies. Advances in Daphnia genomics will enable the further development of this species as a model organism for the environmental sciences.
PMID:18154678
Jiang, Ming-Ming; Mai, Zhi-Tao; Wan, Shan-Zhi; Chi, Yu-Min; Zhang, Xin; Sun, Bao-Hua; Di, Qing-Guo
2018-04-01
Circular RNAs (circRNAs) are a novel class of non-protein-coding RNA. Emerging evidence indicates that circRNAs participate in the regulation of many pathophysiological processes. This study aims to explore the expression profiles and pathological effects of circRNAs in non-small cell lung cancer (NSCLC). Human circRNA microarray analysis was performed to screen the expression profile of circRNAs in NSCLC tissue. Expression of circRNA and miRNA in NSCLC tissues and cells was quantified by qRT-PCR. Functional experiments were performed to investigate the biological functions of circRNA, including CCK-8 assay, colony formation assay, transwell assay and in vivo xenograft assay. The human circRNA microarray revealed a total of 957 abnormally expressed circRNAs (> twofold, P < 0.05) in NSCLC tissue compared with adjacent normal tissue. In further studies, hsa_circ_0007385 was significantly upregulated in NSCLC tissue and cells. In vitro, hsa_circ_0007385 knockdown resulted in significant suppression of the proliferation, migration and invasion of NSCLC cells. In the in vivo xenograft assay, hsa_circ_0007385 knockdown significantly reduced tumor growth. Bioinformatics analysis and luciferase reporter assay verified the potential target miR-181, suggesting a possible regulatory pathway for hsa_circ_0007385. In summary, the results suggest that hsa_circ_0007385 plays a role in NSCLC tumorigenesis, providing a potential therapeutic target for NSCLC.
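The screening criterion quoted in this abstract (> twofold, P < 0.05) amounts to a two-condition filter. The sketch below is one plausible reading of it, assuming the fold-change cutoff applies in both directions; the record names and values are hypothetical and are not data from the study.

```python
def is_differentially_expressed(fold_change, p_value, fc_cutoff=2.0, p_cutoff=0.05):
    """True if the change exceeds fc_cutoff in either direction and p < p_cutoff.

    fold_change is tumor/normal, so 0.4 counts as a 2.5-fold decrease.
    """
    magnitude = max(fold_change, 1.0 / fold_change)
    return magnitude > fc_cutoff and p_value < p_cutoff

# Hypothetical records: (identifier, tumor/normal fold change, p-value).
records = [
    ("circ_A", 3.1, 0.01),   # up, significant
    ("circ_B", 0.4, 0.03),   # down, significant
    ("circ_C", 1.5, 0.001),  # significant but below the fold-change cutoff
    ("circ_D", 2.5, 0.20),   # large change but not significant
]

hits = [name for name, fc, p in records if is_differentially_expressed(fc, p)]
print(hits)
```

With thousands of probes tested at once, a raw P < 0.05 cutoff admits many false positives, so multiple-testing correction is usually considered alongside a filter like this one; the abstract does not say whether one was applied.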
Draghici, Sorin; Tarca, Adi L; Yu, Longfei; Ethier, Stephen; Romero, Roberto
2008-03-01
The BioArray Software Environment (BASE) is a very popular MIAME-compliant, web-based microarray data repository. However, in BASE, as in most other microarray data repositories, experiment annotation and raw data uploading can be very time-consuming, especially for large microarray experiments. We developed KUTE (Karmanos Universal daTabase for microarray Experiments) as a plug-in for BASE 2.0 that addresses these issues. KUTE provides an automatic experiment annotation feature and a completely redesigned data workflow that dramatically reduce the human-computer interaction time. For instance, in BASE 2.0 a typical Affymetrix experiment involving 100 arrays required 4 h 30 min of user interaction time for experiment annotation, and 45 min for data upload/download. In contrast, for the same experiment, KUTE required only 28 min of user interaction time for experiment annotation, and 3.3 min for data upload/download. http://vortex.cs.wayne.edu/kute/index.html.
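The size of the reduction follows directly from the times quoted in this abstract. The short calculation below uses only those four figures; the helper name `speedup` is introduced here for illustration.

```python
def speedup(before_min, after_min):
    """Ratio of interaction time before vs. after (values > 1 mean less time)."""
    return before_min / after_min

# Times quoted in the abstract for a 100-array Affymetrix experiment, in minutes.
base_annotation = 4 * 60 + 30  # 4 h 30 min in BASE 2.0
base_transfer = 45.0           # data upload/download in BASE 2.0
kute_annotation = 28.0
kute_transfer = 3.3

annotation_speedup = speedup(base_annotation, kute_annotation)
transfer_speedup = speedup(base_transfer, kute_transfer)
total_speedup = speedup(base_annotation + base_transfer,
                        kute_annotation + kute_transfer)

print(f"annotation: {annotation_speedup:.1f}x, "
      f"transfer: {transfer_speedup:.1f}x, total: {total_speedup:.1f}x")
```

On these figures, annotation time drops by a factor of roughly 9.6 and transfer time by roughly 13.6, for about a tenfold reduction overall.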
Burgarella, Sarah; Cattaneo, Dario; Pinciroli, Francesco; Masseroli, Marco
2005-12-01
Improvements in bio-nano-technologies and biomolecular techniques have led to increasing production of high-throughput experimental data. Spotted cDNA microarray is one of the most widespread technologies, used in single research laboratories and in biotechnology service facilities. Although they are routinely performed, spotted microarray experiments are complex procedures entailing several experimental steps and actors with different technical skills and roles. During an experiment, the actors involved, who may also be located at a distance, need to access and share specific experiment information according to their roles. Furthermore, complete information describing all experimental steps must be collected in an orderly way to allow subsequent correct interpretation of experimental results. We developed MicroGen, a web system for managing information and workflow in the production pipeline of spotted microarray experiments. It consists of a core multi-database system able to store all data completely characterizing different spotted microarray experiments according to the Minimum Information About Microarray Experiments (MIAME) standard, and of an intuitive and user-friendly web interface able to support the collaborative work required among multidisciplinary actors and roles involved in spotted microarray experiment production. MicroGen supports six types of user roles: the researcher who designs and requests the experiment, the spotting operator, the hybridisation operator, the image processing operator, the system administrator, and the generic public user who can access the unrestricted part of the system to get information about MicroGen services. MicroGen represents a MIAME compliant information system that enables managing workflow and supporting collaborative work in spotted microarray experiment production.
Split-plot microarray experiments: issues of design, power and sample size.
Tsai, Pi-Wen; Lee, Mei-Ling Ting
2005-01-01
This article focuses on microarray experiments with two or more factors in which treatment combinations of the factors corresponding to the samples paired together onto arrays are not completely random. A main effect of one (or more) factor(s) is confounded with arrays (the experimental blocks). This is called a split-plot microarray experiment. We utilise an analysis of variance (ANOVA) model to assess differentially expressed genes for between-array and within-array comparisons that are generic under a split-plot microarray experiment. Instead of standard t- or F-test statistics that rely on mean square errors of the ANOVA model, we use a robust method, referred to as 'a pooled percentile estimator', to identify genes that are differentially expressed across different treatment conditions. We illustrate the design and analysis of split-plot microarray experiments based on a case application described by Jin et al. A brief discussion of power and sample size for split-plot microarray experiments is also presented.
Comparative transcriptional profiling of human Merkel cells and Merkel cell carcinoma.
Mouchet, Nicolas; Coquart, Nolwenn; Lebonvallet, Nicolas; Le Gall-Ianotto, Christelle; Mogha, Ariane; Fautrel, Alain; Boulais, Nicholas; Dréno, Brigitte; Martin, Ludovic; Hu, Weiguo; Galibert, Marie-Dominique; Misery, Laurent
2014-12-01
Merkel cell carcinoma is believed to be derived from Merkel cells after infection by Merkel cell polyomavirus (MCPyV) and other poorly understood events. Transcriptional profiling using cDNA microarrays was performed on cells from MCPyV-negative and MCPyV-positive Merkel cell carcinomas and isolated normal Merkel cells. This microarray revealed numerous significantly upregulated genes and some downregulated genes. The extensive list of genes that were identified in these experiments provides a large body of potentially valuable information on Merkel cell carcinoma carcinogenesis and could represent a source of potential targets for cancer therapy. © 2014 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.
ArrayNinja: An Open Source Platform for Unified Planning and Analysis of Microarray Experiments.
Dickson, B M; Cornett, E M; Ramjan, Z; Rothbart, S B
2016-01-01
Microarray-based proteomic platforms have emerged as valuable tools for studying various aspects of protein function, particularly in the field of chromatin biochemistry. Microarray technology itself is largely unrestricted in regard to printable material and platform design, and efficient multidimensional optimization of assay parameters requires fluidity in the design and analysis of custom print layouts. This motivates the need for streamlined software infrastructure that facilitates the combined planning and analysis of custom microarray experiments. To this end, we have developed ArrayNinja as a portable, open source, and interactive application that unifies the planning and visualization of microarray experiments and provides maximum flexibility to end users. Array experiments can be planned, stored to a private database, and merged with the imaged results for a level of data interaction and centralization that is not currently attainable with available microarray informatics tools. © 2016 Elsevier Inc. All rights reserved.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Thomassen, Mads; Skov, Vibe; Eiriksdottir, Freyja
2006-06-16
The quality of DNA microarray based gene expression data relies on the reproducibility of several steps in a microarray experiment. We have developed a spotted genome wide microarray chip with oligonucleotides printed in duplicate in order to minimise undesirable biases, thereby optimising detection of true differential expression. The validation study design consisted of an assessment of the microarray chip performance using the MessageAmp and FairPlay labelling kits. Intraclass correlation coefficient (ICC) was used to demonstrate that MessageAmp was significantly more reproducible than FairPlay. Further examinations with MessageAmp revealed the applicability of the system. The linear range of the chips was three orders of magnitude, the precision was high, as 95% of measurements deviated less than 1.24-fold from the expected value, and the coefficient of variation for relative expression was 13.6%. Relative quantitation was more reproducible than absolute quantitation and substantial reduction of variance was attained with duplicate spotting. An analysis of variance (ANOVA) demonstrated no significant day-to-day variation.
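Two of the reproducibility measures named in this abstract, the coefficient of variation and the intraclass correlation coefficient, can be computed as in the sketch below. The abstract does not say which ICC form was used; this sketch assumes the one-way random-effects form, ICC = (MSB - MSW) / (MSB + (k - 1) MSW), and all measurement values are hypothetical.

```python
import statistics

def coefficient_of_variation(values):
    """Sample standard deviation as a percentage of the mean."""
    return 100.0 * statistics.stdev(values) / statistics.mean(values)

def icc_oneway(groups):
    """One-way random-effects ICC for n targets with k replicates each.

    groups: list of equal-length lists, one list of replicates per target.
    """
    n = len(groups)
    k = len(groups[0])
    grand_mean = statistics.mean([v for g in groups for v in g])
    group_means = [statistics.mean(g) for g in groups]
    # Between-target and within-target mean squares.
    msb = k * sum((m - grand_mean) ** 2 for m in group_means) / (n - 1)
    msw = sum((v - m) ** 2
              for g, m in zip(groups, group_means)
              for v in g) / (n * (k - 1))
    return (msb - msw) / (msb + (k - 1) * msw)

# Hypothetical relative-expression values for one spot across four arrays.
cv = coefficient_of_variation([10.0, 12.0, 11.0, 9.0])

# Hypothetical log ratios for three genes, each spotted in duplicate.
icc = icc_oneway([[1.0, 1.2], [2.0, 2.2], [3.0, 2.8]])

print(f"CV = {cv:.1f}%, ICC = {icc:.3f}")
```

An ICC near 1 means that almost all of the variance lies between genes rather than between replicate spots, which is the sense in which one labelling kit can be called more reproducible than another.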
Specific roles for the Ccr4-Not complex subunits in expression of the genome
Azzouz, Nowel; Panasenko, Olesya O.; Deluen, Cécile; Hsieh, Julien; Theiler, Grégory; Collart, Martine A.
2009-01-01
In this work we used micro-array experiments to determine the role of each nonessential subunit of the conserved Ccr4-Not complex in the control of gene expression in the yeast Saccharomyces cerevisiae. The study was performed with cells growing exponentially in high glucose and with cells grown to glucose depletion. Specific patterns of gene deregulation were observed upon deletion of any given subunit, revealing the specificity of each subunit's function. Consistently, the purification of the Ccr4-Not complex through Caf40p by tandem affinity purification from wild-type cells or cells lacking individual subunits of the Ccr4-Not complex revealed that each subunit had a particular impact on complex integrity. Furthermore, the micro-arrays revealed that the role of each subunit was specific to the growth conditions. From the study of only two different growth conditions, revealing an impact of the Ccr4-Not complex on more than 85% of all studied genes, we can infer that the Ccr4-Not complex is important for expression of most of the yeast genome. PMID:19155328
DNA microarray unravels rapid changes in transcriptome of MK-801 treated rat brain
Kobayashi, Yuka; Kulikova, Sofya P; Shibato, Junko; Rakwal, Randeep; Satoh, Hiroyuki; Pinault, Didier; Masuo, Yoshinori
2015-01-01
AIM: To investigate the impact of MK-801 on genome-wide gene expression patterns in rat brain regions. METHODS: Rats were treated with an intraperitoneal injection of MK-801 [0.08 (low-dose) and 0.16 (high-dose) mg/kg] or NaCl (vehicle control). In a first series of experiments, the frontoparietal electrocorticogram was recorded 15 min before and 60 min after injection. In a second series of experiments, the whole brain of each animal was rapidly removed at 40 min post-injection, and different regions were separated on ice: amygdala, cerebral cortex, hippocampus, hypothalamus, midbrain and ventral striatum, followed by DNA microarray (4 × 44 K whole rat genome chip) analysis. RESULTS: Spectral analysis revealed that a single systemic injection of MK-801 significantly and selectively augmented the power of baseline gamma frequency (30-80 Hz) oscillations in the frontoparietal electroencephalogram. Under low-dose (0.08 mg/kg) MK-801, DNA microarray analysis showed the largest number of gene expression changes (up- and down-regulation) in the cerebral cortex (378), followed by midbrain (376), hippocampus (375), ventral striatum (353), amygdala (301), and hypothalamus (201). Under high-dose (0.16 mg/kg), ventral striatum (811) showed the largest number of gene expression changes. Functional categorization of the gene expression changes revealed that the affected genes and functions vary with each brain region. CONCLUSION: Acute MK-801 treatment increases synchrony of baseline gamma oscillations, and causes very early changes in gene expression in six individual rat brain regions, a first report. PMID:26629322
Where statistics and molecular microarray experiments biology meet.
Kelmansky, Diana M
2013-01-01
This review chapter presents a statistical point of view to microarray experiments with the purpose of understanding the apparent contradictions that often appear in relation to their results. We give a brief introduction of molecular biology for nonspecialists. We describe microarray experiments from their construction and the biological principles the experiments rely on, to data acquisition and analysis. The role of epidemiological approaches and sample size considerations are also discussed.
Hinchliffe, Doug J; Meredith, William R; Yeater, Kathleen M; Kim, Hee Jin; Woodward, Andrew W; Chen, Z Jeffrey; Triplett, Barbara A
2010-05-01
Gene expression profiles of developing cotton (Gossypium hirsutum L.) fibers from two near-isogenic lines (NILs) that differ in fiber-bundle strength, short-fiber content, and in fewer than two genetic loci were compared using an oligonucleotide microarray. Fiber gene expression was compared at five time points spanning fiber elongation and secondary cell wall (SCW) biosynthesis. Fiber samples were collected from field plots in a randomized, complete block design, with three spatially distinct biological replications for each NIL at each time point. Microarray hybridizations were performed in a loop experimental design that allowed comparisons of fiber gene expression profiles as a function of time between the two NILs. Overall, developmental expression patterns revealed by the microarray experiment agreed with previously reported cotton fiber gene expression patterns for specific genes. Additionally, genes expressed coordinately with the onset of SCW biosynthesis in cotton fiber correlated with gene expression patterns of other SCW-producing plant tissues. Functional classification and enrichment analysis of differentially expressed genes between the two NILs revealed that genes associated with SCW biosynthesis were significantly up-regulated in fibers of the high-fiber quality line at the transition stage of cotton fiber development. For independent corroboration of the microarray results, 15 genes were selected for quantitative reverse transcription PCR analysis of fiber gene expression. These analyses, conducted over multiple field years, confirmed the temporal difference in fiber gene expression between the two NILs. We hypothesize that the loci conferring temporal differences in fiber gene expression between the NILs are important regulatory sequences that offer the potential for more targeted manipulation of cotton fiber quality.
Lee, Joseph C; Stiles, David; Lu, Jun; Cam, Margaret C
2007-01-01
Background: Microarrays are a popular tool used in experiments to measure gene expression levels. Improving the reproducibility of microarray results produced by different chips from various manufacturers is important to create comparable and combinable experimental results. Alternative splicing has been cited as a possible cause of differences in expression measurements across platforms, though no study to this point has been conducted to show its influence in cross-platform differences. Results: Using probe sequence data, a new microarray probe/transcript annotation was created based on the AceView Aug05 release that allowed for the categorization of genes based on their expression measurements' susceptibility to alternative splicing differences across microarray platforms. Examining gene expression data from multiple platforms in light of the new categorization, genes unsusceptible to alternative splicing differences showed higher signal agreement than those genes most susceptible to alternative splicing differences. The analysis gave rise to a different probe-level visualization method that can highlight probe differences according to transcript specificity. Conclusion: The results highlight the need for detailed probe annotation at the transcriptome level. The presence of alternative splicing within a given sample can affect gene expression measurements and is a contributing factor to overall technical differences across platforms. PMID:17708771
Design of microarray experiments for genetical genomics studies.
Bueno Filho, Júlio S S; Gilmour, Steven G; Rosa, Guilherme J M
2006-10-01
Microarray experiments have been used recently in genetical genomics studies, as an additional tool to understand the genetic mechanisms governing variation in complex traits, such as for estimating heritabilities of mRNA transcript abundances, for mapping expression quantitative trait loci, and for inferring regulatory networks controlling gene expression. Several articles on the design of microarray experiments discuss situations in which treatment effects are assumed fixed and without any structure. In the case of two-color microarray platforms, several authors have studied reference and circular designs. Here, we discuss the optimal design of microarray experiments whose goals refer to specific genetic questions. Some examples are used to illustrate the choice of a design for comparing fixed, structured treatments, such as genotypic groups. Experiments targeting single genes or chromosomic regions (such as with transgene research) or multiple epistatic loci (such as within a selective phenotyping context) are discussed. In addition, microarray experiments in which treatments refer to families or to subjects (within family structures or complex pedigrees) are presented. In these cases treatments are more appropriately considered to be random effects, with specific covariance structures, in which the genetic goals relate to the estimation of genetic variances and the heritability of transcriptional abundances.
Issues in the analysis of oligonucleotide tiling microarrays for transcript mapping
NASA Technical Reports Server (NTRS)
Royce, Thomas E.; Rozowsky, Joel S.; Bertone, Paul; Samanta, Manoj; Stolc, Viktor; Weissman, Sherman; Snyder, Michael; Gerstein, Mark
2005-01-01
Traditional microarrays use probes complementary to known genes to quantitate the differential gene expression between two or more conditions. Genomic tiling microarray experiments differ in that probes that span a genomic region at regular intervals are used to detect the presence or absence of transcription. This difference means the same sets of biases and the methods for addressing them are unlikely to be relevant to both types of experiment. We introduce the informatics challenges arising in the analysis of tiling microarray experiments as open problems to the scientific community and present initial approaches for the analysis of this nascent technology.
The MGED Ontology: a resource for semantics-based description of microarray experiments.
Whetzel, Patricia L; Parkinson, Helen; Causton, Helen C; Fan, Liju; Fostel, Jennifer; Fragoso, Gilberto; Game, Laurence; Heiskanen, Mervi; Morrison, Norman; Rocca-Serra, Philippe; Sansone, Susanna-Assunta; Taylor, Chris; White, Joseph; Stoeckert, Christian J
2006-04-01
The generation of large amounts of microarray data and the need to share these data bring challenges for both data management and annotation and highlight the need for standards. MIAME specifies the minimum information needed to describe a microarray experiment, and the Microarray Gene Expression Object Model (MAGE-OM) and resulting MAGE-ML provide a mechanism to standardize data representation for data exchange; however, a common terminology for data annotation is needed to support these standards. Here we describe the MGED Ontology (MO) developed by the Ontology Working Group of the Microarray Gene Expression Data (MGED) Society. The MO provides terms for annotating all aspects of a microarray experiment from the design of the experiment and array layout, through to the preparation of the biological sample and the protocols used to hybridize the RNA and analyze the data. The MO was developed to provide terms for annotating experiments in line with the MIAME guidelines, i.e. to provide the semantics to describe a microarray experiment according to the concepts specified in MIAME. The MO does not attempt to incorporate terms from existing ontologies, e.g. those that deal with anatomical parts or developmental stage terms, but provides a framework to reference terms in other ontologies and therefore facilitates the use of ontologies in microarray data annotation. The MGED Ontology version 1.2.0 is available as a file in both DAML and OWL formats at http://mged.sourceforge.net/ontologies/index.php. Release notes and annotation examples are provided. The MO is also provided via the NCICB's Enterprise Vocabulary System (http://nciterms.nci.nih.gov/NCIBrowser/Dictionary.do). Contact: Stoeckrt@pcbi.upenn.edu. Supplementary data are available at Bioinformatics online.
Methods to study legionella transcriptome in vitro and in vivo.
Faucher, Sebastien P; Shuman, Howard A
2013-01-01
The study of transcriptome responses can provide insight into the regulatory pathways and genetic factors that contribute to a specific phenotype. For bacterial pathogens, it can identify putative new virulence systems and shed light on the mechanisms underlying the regulation of virulence factors. Microarrays have been previously used to study gene regulation in Legionella pneumophila. In the past few years a sharp reduction of the costs associated with microarray experiments together with the availability of relatively inexpensive custom-designed commercial microarrays has made microarray technology an accessible tool for the majority of researchers. Here we describe the methodologies to conduct microarray experiments from in vitro and in vivo samples.
Women's experiences receiving abnormal prenatal chromosomal microarray testing results.
Bernhardt, Barbara A; Soucier, Danielle; Hanson, Karen; Savage, Melissa S; Jackson, Laird; Wapner, Ronald J
2013-02-01
Genomic microarrays can detect copy-number variants not detectable by conventional cytogenetics. This technology is diffusing rapidly into prenatal settings even though the clinical implications of many copy-number variants are currently unknown. We conducted a qualitative pilot study to explore the experiences of women receiving abnormal results from prenatal microarray testing performed in a research setting. Participants were a subset of women participating in a multicenter prospective study "Prenatal Cytogenetic Diagnosis by Array-based Copy Number Analysis." Telephone interviews were conducted with 23 women receiving abnormal prenatal microarray results. We found that five key elements dominated the experiences of women who had received abnormal prenatal microarray results: an offer too good to pass up, blindsided by the results, uncertainty and unquantifiable risks, need for support, and toxic knowledge. As prenatal microarray testing is increasingly used, uncertain findings will be common, resulting in greater need for careful pre- and posttest counseling, and more education of and resources for providers so they can adequately support the women who are undergoing testing.
Yamamoto, F; Yamamoto, M
2004-07-01
We previously developed a PCR-based DNA fingerprinting technique named the Methylation Sensitive (MS)-AFLP method, which permits comparative genome-wide scanning of methylation status with a manageable number of fingerprinting experiments. The technique uses the methylation sensitive restriction enzyme NotI in the context of the existing Amplified Fragment Length Polymorphism (AFLP) method. Here we report the successful conversion of this gel electrophoresis-based DNA fingerprinting technique into a DNA microarray hybridization technique (DNA Microarray MS-AFLP). By performing a total of 30 (15 x 2 reciprocal labeling) DNA Microarray MS-AFLP hybridization experiments on genomic DNA from two breast and three prostate cancer cell lines in all pairwise combinations, and Southern hybridization experiments using more than 100 different probes, we have demonstrated that the DNA Microarray MS-AFLP is a reliable method for genetic and epigenetic analyses. No statistically significant differences were observed in the number of differences between the breast-prostate hybridization experiments and the breast-breast or prostate-prostate comparisons.
ArrayWiki: an enabling technology for sharing public microarray data repositories and meta-analyses
Stokes, Todd H; Torrance, JT; Li, Henry; Wang, May D
2008-01-01
Background A survey of microarray databases reveals that most of the repository contents and data models are heterogeneous (i.e., data obtained from different chip manufacturers), and that the repositories provide only basic biological keywords linking to PubMed. As a result, it is difficult to find datasets using research context or analysis parameters information beyond a few keywords. For example, to reduce the "curse-of-dimension" problem in microarray analysis, the number of samples is often increased by merging array data from different datasets. Knowing chip data parameters such as pre-processing steps (e.g., normalization, artefact removal, etc.), and knowing any previous biological validation of the dataset is essential due to the heterogeneity of the data. However, most of the microarray repositories do not have meta-data information in the first place, and do not have a mechanism to add or insert this information. Thus, there is a critical need to create "intelligent" microarray repositories that (1) enable update of meta-data with the raw array data, and (2) provide standardized archiving protocols to minimize bias from the raw data sources. Results To address the problems discussed, we have developed a community maintained system called ArrayWiki that unites disparate meta-data of microarray meta-experiments from multiple primary sources with four key features. First, ArrayWiki provides a user-friendly knowledge management interface in addition to a programmable interface using standards developed by Wikipedia. Second, ArrayWiki includes automated quality control processes (caCORRECT) and novel visualization methods (BioPNG, Gel Plots), which provide extra information about data quality unavailable in other microarray repositories. Third, it provides a user-curation capability through the familiar Wiki interface.
Fourth, ArrayWiki provides users with simple text-based searches across all experiment meta-data, and exposes data to search engine crawlers (Semantic Agents) such as Google to further enhance data discovery. Conclusions Microarray data and meta information in ArrayWiki are distributed and visualized using a novel and compact data storage format, BioPNG. Also, they are open to the research community for curation, modification, and contribution. By making a small investment of time to learn the syntax and structure common to all sites running MediaWiki software, domain scientists and practitioners can all contribute to make better use of microarray technologies in research and medical practices. ArrayWiki is available at . PMID:18541053
Li, Chen-Ye; Ma, Lan; Yu, Bo
2017-11-01
Circular RNAs (circRNAs) are a novel class of RNAs generated from back-splicing and characterized by covalently closed continuous loops. Recently, circRNAs have been shown to play substantial regulatory roles in the cardiovascular system, including in atherosclerosis. The present study aims to investigate the circRNA expression profile and identify the roles of circRNAs in vascular endothelial cells induced by oxLDL. Human circRNA microarray analysis identified a total of 943 differentially expressed circRNAs at a 2-fold change threshold. Hsa_circ_0003575 was validated to be significantly up-regulated in oxLDL-induced HUVECs. Loss-of-function experiments indicated that hsa_circ_0003575 silencing promoted the proliferation and angiogenesis ability of HUVECs. Online bioinformatics programs predicted the potential circRNA-miRNA-mRNA network for hsa_circ_0003575. In summary, circRNA microarray analysis reveals the expression profiles of HUVECs and verifies the role of hsa_circ_0003575 in HUVECs, providing a therapeutic strategy for vascular endothelial cell injury in atherosclerosis. Copyright © 2017. Published by Elsevier Masson SAS.
Schwartz, S; Kohan, M; Pasion, R; Papenhausen, P R; Platt, L D
2018-02-01
Screening via noninvasive prenatal testing (NIPT) involving the analysis of cell-free DNA (cfDNA) from plasma has become readily available to screen for chromosomal and DNA aberrations through maternal blood. This report reviews a laboratory's experience with follow-up of positive NIPT screens for microdeletions. Patients that were screened positive by NIPT for a microdeletion involving 1p, 4p, 5p, 15q, or 22q who underwent diagnostic studies by either chorionic villus sampling or amniocentesis were evaluated. The overall positive predictive value for 349 patients was 9.2%. When a microdeletion was confirmed, 39.3% of the cases had additional abnormal microarray findings. Unrelated abnormal microarray findings were detected in 11.8% of the patients in whom the screen positive microdeletion was not confirmed. Stretches of homozygosity in the microdeletion were frequently associated with a false positive cfDNA microdeletion result. Overall, this report reveals that while cfDNA analysis will screen for microdeletions, the positive predictive value is low; in our series it is 9.2%. Therefore, the patient should be counseled accordingly. Confirmatory diagnostic microarray studies are imperative because of the high percentage of false positives and the frequent additional abnormalities not delineated by cfDNA analysis. © 2018 John Wiley & Sons, Ltd.
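The positive predictive value quoted in this abstract follows from the usual definition PPV = TP / (TP + FP). A minimal Python sketch of that arithmetic is given below. The abstract reports only the percentage and the cohort size of 349, so the count of 32 confirmed cases is an assumption back-calculated for illustration, not a figure from the study.

```python
def positive_predictive_value(true_positives: int, false_positives: int) -> float:
    """Return PPV = TP / (TP + FP) for a set of screen-positive cases."""
    total = true_positives + false_positives
    if total == 0:
        raise ValueError("no screen-positive cases")
    return true_positives / total


# Assumed counts: 349 screen-positive patients (from the abstract) and a
# hypothetical 32 confirmed microdeletions (not stated in the abstract).
confirmed = 32
unconfirmed = 349 - confirmed
ppv = positive_predictive_value(confirmed, unconfirmed)
print(f"PPV = {ppv:.1%}")
```

With these assumed counts, roughly nine in ten screen-positive results would not be confirmed on diagnostic testing, which is the basis for the counseling recommendation in the abstract.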
Overcoming confounded controls in the analysis of gene expression data from microarray experiments.
Bhattacharya, Soumyaroop; Long, Dang; Lyons-Weiler, James
2003-01-01
A potential limitation of data from microarray experiments exists when improper control samples are used. In cancer research, comparison of tumour expression profiles to those from normal samples is challenging due to tissue heterogeneity (mixed cell populations). A specific example exists in a published colon cancer dataset, in which tissue heterogeneity was reported among the normal samples. In this paper, we show how to overcome or avoid the problem of using normal samples that do not derive from the same tissue of origin as the tumour. We advocate an exploratory unsupervised bootstrap analysis that can reveal unexpected and undesired, but strongly supported, clusters of samples that reflect tissue differences instead of tumour versus normal differences. All of the algorithms used in the analysis, including the maximum difference subset algorithm, unsupervised bootstrap analysis, pooled variance t-test for finding differentially expressed genes and the jackknife to reduce false positives, are incorporated into our online Gene Expression Data Analyzer (http://bioinformatics.upmc.edu/GE2/GEDA.html).
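Two of the methods named in this abstract, the pooled variance t-test and the jackknife, can be sketched in a few lines of Python. The t statistic below is the standard equal-variance two-sample form. The jackknife rule (taking the smallest |t| over all leave-one-out subsets so that a gene driven by a single sample scores low) is an assumption about one plausible use; the abstract does not describe how the Gene Expression Data Analyzer applies it. All expression values are invented.

```python
import math
from statistics import mean, variance


def pooled_t(a, b):
    """Two-sample t statistic using the pooled (equal-variance) estimate."""
    na, nb = len(a), len(b)
    pooled_var = ((na - 1) * variance(a) + (nb - 1) * variance(b)) / (na + nb - 2)
    return (mean(a) - mean(b)) / math.sqrt(pooled_var * (1 / na + 1 / nb))


def jackknife_min_abs_t(a, b):
    """Smallest |t| obtained when each sample is left out in turn (assumed rule)."""
    stats = [abs(pooled_t(a[:i] + a[i + 1:], b)) for i in range(len(a))]
    stats += [abs(pooled_t(a, b[:j] + b[j + 1:])) for j in range(len(b))]
    return min(stats)


# Hypothetical log-expression values for two genes, four samples per group.
normal = [6.0, 6.2, 5.9, 6.1]
tumour_consistent = [8.1, 7.9, 8.4, 8.0]  # shifted in every tumour sample
tumour_outlier = [6.0, 6.1, 5.9, 12.0]    # difference driven by one sample

print(jackknife_min_abs_t(tumour_consistent, normal))
print(jackknife_min_abs_t(tumour_outlier, normal))
```

The consistent gene keeps a large statistic under every deletion, while the outlier-driven gene collapses once the extreme sample is left out, which is the kind of false positive a jackknife step is meant to catch.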
Microarray Meta-Analysis of RNA-Binding Protein Functions in Alternative Polyadenylation
Hu, Wenchao; Liu, Yuting; Yan, Jun
2014-01-01
Alternative polyadenylation (APA) is a post-transcriptional mechanism to generate diverse mRNA transcripts with different 3′UTRs from the same gene. In this study, we systematically searched for the APA events with differential expression in public mouse microarray data. Hundreds of genes with over-represented differential APA events and the corresponding experiments were identified. We further revealed that global APA differential expression occurred prevalently in tissues such as brain compared with peripheral tissues, and in biological processes such as development, differentiation and immune responses. Interestingly, we also observed widespread differential APA events in RNA-binding protein (RBP) genes such as Rbm3, Eif4e2 and Elavl1. Given that RBPs are considered the main regulators of differential APA expression, we constructed a co-expression network between APAs and RBPs using the microarray data. Further incorporation of CLIP-seq data of selected RBPs showed that Nova2 represses and Mbnl1 promotes the polyadenylation of the closest poly(A) sites, respectively. Altogether, our study is the first microarray meta-analysis in a mammal on the regulation of APA by RBPs that integrated massive mRNA expression data under a wide range of biological conditions. Finally, we present our results as a comprehensive resource in an online website for the research community. PMID:24622240
An efficient method to identify differentially expressed genes in microarray experiments
Qin, Huaizhen; Feng, Tao; Harding, Scott A.; Tsai, Chung-Jui; Zhang, Shuanglin
2013-01-01
Motivation Microarray experiments typically analyze thousands to tens of thousands of genes from small numbers of biological replicates. The fact that genes are normally expressed in functionally relevant patterns suggests that gene-expression data can be stratified and clustered into relatively homogenous groups. Cluster-wise dimensionality reduction should make it feasible to improve screening power while minimizing information loss. Results We propose a powerful and computationally simple method for finding differentially expressed genes in small microarray experiments. The method incorporates a novel stratification-based tight clustering algorithm, principal component analysis and information pooling. Comprehensive simulations show that our method is substantially more powerful than the popular SAM and eBayes approaches. We applied the method to three real microarray datasets: one from a Populus nitrogen stress experiment with 3 biological replicates; and two from public microarray datasets of human cancers with 10 to 40 biological replicates. In all three analyses, our method proved more robust than the popular alternatives for identification of differentially expressed genes. Availability The C++ code to implement the proposed method is available upon request for academic use. PMID:18453554
Transcriptome Analysis of Early Responsive Genes in Rice during Magnaporthe oryzae Infection.
Wang, Yiming; Kwon, Soon Jae; Wu, Jingni; Choi, Jaeyoung; Lee, Yong-Hwan; Agrawal, Ganesh Kumar; Tamogami, Shigeru; Rakwal, Randeep; Park, Sang-Ryeol; Kim, Beom-Gi; Jung, Ki-Hong; Kang, Kyu Young; Kim, Sang Gon; Kim, Sun Tae
2014-12-01
Rice blast disease caused by Magnaporthe oryzae is one of the most serious diseases of cultivated rice (Oryza sativa L.) in most rice-growing regions of the world. In order to investigate early response genes in rice, we applied a transcriptome analysis approach using a 300 K tiling microarray to rice leaves infected with compatible and incompatible M. oryzae strains. Prior to the microarray experiment, total RNA was validated by measuring the differential expression of rice defense-related marker genes (chitinase 2, barwin, PBZ1, and PR-10) by RT-PCR, and phytoalexins (sakuranetin and momilactone A) with HPLC. Microarray analysis revealed that 231 genes were up-regulated (>2 fold change, p < 0.05) in the incompatible interaction compared to the compatible one. Highly expressed genes were functionally characterized into metabolic processes and oxidation-reduction categories. The oxidative stress response was induced in both early and later infection stages. Biotic stress overview from MapMan analysis revealed that the phytohormone ethylene as well as the signaling molecules jasmonic acid and salicylic acid are important for defense gene regulation. WRKY and Myb transcription factors were also involved in signal transduction processes. Additionally, receptor-like kinases were more likely associated with the defense response, and their expression patterns were validated by RT-PCR. Our results suggest that candidate genes, including receptor-like protein kinases, may play a key role in disease resistance against M. oryzae attack.
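The selection criterion quoted in this abstract (>2 fold change, p < 0.05) amounts to a two-condition filter over per-gene results. The Python sketch below shows that filter only; the record layout, gene identifiers and numbers are hypothetical, and the abstract does not say how the p-values were computed or whether they were corrected for multiple testing.

```python
def select_upregulated(rows, min_fold=2.0, alpha=0.05):
    """Return ids of genes with fold change > min_fold and p-value < alpha."""
    return [
        row["id"]
        for row in rows
        if row["fold_change"] > min_fold and row["p_value"] < alpha
    ]


# Hypothetical per-gene results (incompatible vs. compatible interaction).
results = [
    {"id": "gene_A", "fold_change": 3.1, "p_value": 0.004},
    {"id": "gene_B", "fold_change": 2.4, "p_value": 0.20},   # fails the p-value cut
    {"id": "gene_C", "fold_change": 1.3, "p_value": 0.001},  # fails the fold-change cut
    {"id": "gene_D", "fold_change": 2.0, "p_value": 0.01},   # not strictly above 2-fold
]

selected = select_upregulated(results)
print(selected)
```

Both thresholds are strict inequalities here, matching the ">" and "<" signs used in the abstract.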
Kimura, Shinzo; Ishidou, Emi; Kurita, Sakiko; Suzuki, Yoshiteru; Shibato, Junko; Rakwal, Randeep; Iwahashi, Hitoshi
2006-07-21
Ionizing radiation (IR) is the most enigmatic of the genotoxic stress inducers in our environment and has been around for eons. IR is generally considered harmful, and has been the subject of numerous studies, mostly looking at the DNA damaging effects in cells and the repair mechanisms therein. However, few studies have focused on large-scale identification of cellular responses to IR, and to this end, we describe here an initial study on the transcriptional responses of the unicellular genome model, yeast (Saccharomyces cerevisiae strain S288C), by cDNA microarray. The effect of two different types of IR, X-rays and gamma (γ)-rays, was investigated by irradiating the yeast cells cultured in YPD medium with 50 Gy doses of X- and gamma-rays, followed by resuspension of the cells in YPD for time-course experiments. The samples were collected for microarray analysis at 20, 40, and 80 min after irradiation. Microarray analysis revealed a time-course transcriptional profile of changed gene expressions. Up-regulated genes belonged to the functional categories mainly related to cell cycle and DNA processing, cell rescue defense and virulence, protein and cell fate, and metabolism (X- and gamma-rays). Similarly, for X- and gamma-rays, the down-regulated genes belonged to mostly transcription and protein synthesis, cell cycle and DNA processing, control of cellular organization, cell fate, and C-compound and carbohydrate metabolism categories, respectively. This study provides for the first time a snapshot of the genome-wide mRNA expression profiles in X- and gamma-ray post-irradiated yeast cells and comparatively interprets/discusses the changed gene functional categories as effects of these two radiations vis-à-vis their energy levels.
ELISA-BASE: An Integrated Bioinformatics Tool for Analyzing and Tracking ELISA Microarray Data
DOE Office of Scientific and Technical Information (OSTI.GOV)
White, Amanda M.; Collett, James L.; Seurynck-Servoss, Shannon L.
ELISA-BASE is an open-source database for capturing, organizing and analyzing protein enzyme-linked immunosorbent assay (ELISA) microarray data. ELISA-BASE is an extension of the BioArray Software Environment (BASE) database system, which was developed for DNA microarrays. In order to make BASE suitable for protein microarray experiments, we developed several plugins for importing and analyzing quantitative ELISA microarray data. Most notably, our Protein Microarray Analysis Tool (ProMAT) for processing quantitative ELISA data is now available as a plugin to the database.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Hatazawa, Yukino; Research Fellow of Japan Society for the Promotion of Science, Tokyo; Minami, Kimiko
The expression of the transcriptional coactivator PGC1α is increased in skeletal muscles during exercise. Previously, we showed that increased PGC1α leads to prolonged exercise performance (the duration for which running can be continued) and, at the same time, increases the expression of branched-chain amino acid (BCAA) metabolism-related enzymes and genes that are involved in supplying substrates for the TCA cycle. We recently created mice with PGC1α knockout specifically in the skeletal muscles (PGC1α KO mice), which show decreased mitochondrial content. In this study, global gene expression (microarray) analysis was performed in the skeletal muscles of PGC1α KO mice compared with that of wild-type control mice. As a result, decreased expression of genes involved in the TCA cycle, oxidative phosphorylation, and BCAA metabolism were observed. Compared with previously obtained microarray data on PGC1α-overexpressing transgenic mice, each gene showed the completely opposite direction of expression change. Bioinformatic analysis of the promoter region of genes with decreased expression in PGC1α KO mice predicted the involvement of several transcription factors, including a nuclear receptor, ERR, in their regulation. As PGC1α KO microarray data in this study show opposing findings to the PGC1α transgenic data, a loss-of-function experiment, as well as a gain-of-function experiment, revealed PGC1α’s function in the oxidative energy metabolism of skeletal muscles. - Highlights: • Microarray analysis was performed in the skeletal muscle of PGC1α KO mice. • Expression of genes in the oxidative energy metabolism was decreased. • Bioinformatic analysis of promoter region of the genes predicted involvement of ERR. • PGC1α KO microarray data in this study show the mirror image of transgenic data.
Usadel, Björn; Nagel, Axel; Steinhauser, Dirk; Gibon, Yves; Bläsing, Oliver E; Redestig, Henning; Sreenivasulu, Nese; Krall, Leonard; Hannah, Matthew A; Poree, Fabien; Fernie, Alisdair R; Stitt, Mark
2006-12-18
Microarray technology has become a widely accepted and standardized tool in biology. The first microarray data analysis programs were developed to support pair-wise comparison. However, as microarray experiments have become more routine, large scale experiments have become more common, which investigate multiple time points or sets of mutants or transgenics. To extract biological information from such high-throughput expression data, it is necessary to develop efficient analytical platforms, which combine manually curated gene ontologies with efficient visualization and navigation tools. Currently, most tools focus on a few limited biological aspects, rather than offering a holistic, integrated analysis. Here we introduce PageMan, a multiplatform, user-friendly, and stand-alone software tool that annotates, investigates, and condenses high-throughput microarray data in the context of functional ontologies. It includes a GUI tool to transform different ontologies into a suitable format, enabling the user to compare and choose between different ontologies. It is equipped with several statistical modules for data analysis, including over-representation analysis and Wilcoxon statistical testing. Results are exported in a graphical format for direct use, or for further editing in graphics programs. PageMan provides a fast overview of single treatments, allows genome-level responses to be compared across several microarray experiments covering, for example, stress responses at multiple time points. This aids in searching for trait-specific changes in pathways using mutants or transgenics, analyzing development time-courses, and comparison between species.
In a case study, we analyze the results of publicly available microarrays of multiple cold stress experiments using PageMan, and compare the results to a previously published meta-analysis. PageMan offers a complete user's guide, a web-based over-representation analysis as well as a tutorial, and is freely available at http://mapman.mpimp-golm.mpg.de/pageman/. PageMan allows multiple microarray experiments to be efficiently condensed into a single page graphical display. The flexible interface allows data to be quickly and easily visualized, facilitating comparisons within experiments and to published experiments, thus enabling researchers to gain a rapid overview of the biological responses in the experiments.
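Over-representation analysis, one of the statistical modules mentioned in this abstract, is commonly framed as a hypergeometric tail probability: given N genes on the array, K of them in a functional category, and n genes selected as responsive, how likely is it to see at least k category members among the selected genes by chance? The Python sketch below implements that generic test. Whether PageMan uses exactly this formulation is not stated in the abstract, so treat it as an assumption; the example numbers are invented.

```python
from math import comb


def hypergeom_upper_tail(N: int, K: int, n: int, k: int) -> float:
    """P(X >= k) when drawing n genes without replacement from N, K in category."""
    total = comb(N, n)
    upper = min(K, n)
    favourable = sum(comb(K, i) * comb(N - K, n - i) for i in range(k, upper + 1))
    return favourable / total


# Hypothetical example: 20000 genes on the array, 150 in a category,
# 400 responsive genes, 12 of which fall in the category.
p = hypergeom_upper_tail(N=20000, K=150, n=400, k=12)
print(f"over-representation p-value: {p:.3g}")
```

Under the null, about 3 of the 400 selected genes would be expected in the category (400 × 150 / 20000), so observing 12 yields a small tail probability. In practice such p-values are computed for many categories and would need a multiple-testing correction.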
Fully Automated Complementary DNA Microarray Segmentation using a Novel Fuzzy-based Algorithm.
Saberkari, Hamidreza; Bahrami, Sheyda; Shamsi, Mousa; Amoshahy, Mohammad Javad; Ghavifekr, Habib Badri; Sedaaghi, Mohammad Hossein
2015-01-01
DNA microarray is a powerful approach to study simultaneously the expression of thousands of genes in a single experiment. The average value of the fluorescent intensity can be calculated in a microarray experiment. The calculated intensity values are very close in amount to the levels of expression of a particular gene. However, determining the appropriate position of every spot in microarray images is a major challenge on which the accurate classification of normal and abnormal (cancer) cells depends. In this paper, first a preprocessing approach is performed to eliminate the noise and artifacts present in microarray cells using the nonlinear anisotropic diffusion filtering method. Then, the coordinate center of each spot is positioned utilizing mathematical morphology operations. Finally, the position of each spot is exactly determined by applying a novel hybrid model based on principal component analysis and the spatial fuzzy c-means clustering (SFCM) algorithm. Using a Gaussian kernel in the SFCM algorithm improves the quality of complementary DNA microarray segmentation. The performance of the proposed algorithm has been evaluated on real microarray images, which are available in the Stanford Microarray Database. Results illustrate that the accuracy of microarray cell segmentation in the proposed algorithm reaches 100% and 98% for noiseless and noisy cells, respectively.
A database for the analysis of immunity genes in Drosophila: PADMA database.
Lee, Mark J; Mondal, Ariful; Small, Chiyedza; Paddibhatla, Indira; Kawaguchi, Akira; Govind, Shubha
2011-01-01
While microarray experiments generate voluminous data, discerning trends that support an existing or alternative paradigm is challenging. To synergize hypothesis building and testing, we designed the Pathogen Associated Drosophila MicroArray (PADMA) database for easy retrieval and comparison of microarray results from immunity-related experiments (www.padmadatabase.org). PADMA also allows biologists to upload their microarray results and compare them with datasets housed within PADMA. We tested PADMA using a preliminary dataset from Ganaspis xanthopoda-infected fly larvae, and uncovered unexpected trends in gene expression, reshaping our hypothesis. Thus, the PADMA database will be a useful resource for fly researchers to evaluate, revise, and refine hypotheses.
Hirakawa, Ikumi; Miyagawa, Shinichi; Katsu, Yoshinao; Kagami, Yoshihiro; Tatarazako, Norihisa; Kobayashi, Tohru; Kusano, Teruhiko; Mizutani, Takeshi; Ogino, Yukiko; Takeuchi, Takashi; Ohta, Yasuhiko; Iguchi, Taisen
2012-05-01
The occurrence of oocytes in the testis (testis-ova) of several fish species is often associated with exposure of estrogenic chemicals. However, induction mechanisms of the testis-ova remain to be elucidated. To develop marker genes for detecting testis-ova in the testis, adult male medaka were exposed to nominal concentration of 100 ng L(-1) of 17α-ethinylestradiol (EE2) for 3-5 weeks, and 800 ng estradiol benzoate (EB) for 3 weeks (experiment I), and a measured concentration of 20 ng L(-1) EE2 for 1-6 weeks (experiment II). Histological analysis was performed for the testis, and microarray analyses were performed for the testis, liver and brain. Microarray analysis in the estrogen-exposed medaka liver showed vitellogenin and choriogenin as estrogen responsive genes. Testis-ova were induced in the testis after 4 weeks of exposure to 100 ng L(-1) EE2, 3 weeks of exposure to 800 ng EB, and 6 weeks of exposure to 20 ng L(-1) EE2. Microarray analysis of estrogen-exposed testes revealed up-regulation of genes related to zona pellucida (ZP) and the oocytes marker gene, 42Sp50. Using quantitative RT-PCR we confirmed that Zpc5 gene can be used as a marker for the detection of testis-ova in male medaka. Copyright © 2011 Elsevier Ltd. All rights reserved.
Experimental design for three-color and four-color gene expression microarrays.
Woo, Yong; Krueger, Winfried; Kaur, Anupinder; Churchill, Gary
2005-06-01
Three-color microarrays, compared with two-color microarrays, can increase design efficiency and power to detect differential expression without additional samples and arrays. Furthermore, three-color microarray technology is currently available at a reasonable cost. Despite the potential advantages, clear guidelines for designing and analyzing three-color experiments do not exist. We propose a three- and a four-color cyclic design (loop) and a complementary graphical representation to help design experiments that are balanced, efficient and robust to hybridization failures. In theory, three-color loop designs are more efficient than two-color loop designs. Experiments using both two- and three-color platforms were performed in parallel and their outputs were analyzed using linear mixed model analysis in R/MAANOVA. These results demonstrate that three-color experiments using the same number of samples (and fewer arrays) will perform as efficiently as two-color experiments. The improved efficiency of the design is somewhat offset by a reduced dynamic range and increased variability in the three-color experimental system. This result suggests that, with minor technological improvements, three-color microarrays using loop designs could detect differential expression more efficiently than two-color loop designs. http://www.jax.org/staff/churchill/labsite/software Multicolor cyclic design construction methods and examples along with additional results of the experiment are provided at http://www.jax.org/staff/churchill/labsite/pubs/yong.
Caryoscope: An Open Source Java application for viewing microarray data in a genomic context
Awad, Ihab AB; Rees, Christian A; Hernandez-Boussard, Tina; Ball, Catherine A; Sherlock, Gavin
2004-01-01
Background Microarray-based comparative genome hybridization experiments generate data that can be mapped onto the genome. These data are interpreted more easily when represented graphically in a genomic context. Results We have developed Caryoscope, which is an open source Java application for visualizing microarray data from array comparative genome hybridization experiments in a genomic context. Caryoscope can read General Feature Format files (GFF files), as well as comma- and tab-delimited files, that define the genomic positions of the microarray reporters for which data are obtained. The microarray data can be browsed using an interactive, zoomable interface, which helps users identify regions of chromosomal deletion or amplification. The graphical representation of the data can be exported in a number of graphic formats, including publication-quality formats such as PostScript. Conclusion Caryoscope is a useful tool that can aid in the visualization, exploration and interpretation of microarray data in a genomic context. PMID:15488149
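The General Feature Format mentioned in this abstract is a tab-delimited text format whose first eight columns are sequence name, source, feature type, start, end, score, strand and phase, followed by an attributes column. A minimal Python sketch of reading one such line follows. The example record is invented, and the idea that the score column carries the array measurement is an assumption for illustration; the abstract does not specify how Caryoscope maps data onto GFF fields.

```python
from typing import NamedTuple, Optional


class GffFeature(NamedTuple):
    seqid: str
    source: str
    type: str
    start: int
    end: int
    score: Optional[float]
    strand: str


def parse_gff_line(line: str) -> Optional[GffFeature]:
    """Parse one GFF record; return None for blank lines and '#' comments."""
    line = line.rstrip("\n")
    if not line or line.startswith("#"):
        return None
    cols = line.split("\t")
    if len(cols) < 8:
        raise ValueError(f"expected at least 8 tab-separated columns, got {len(cols)}")
    score = None if cols[5] == "." else float(cols[5])  # '.' means no score
    return GffFeature(cols[0], cols[1], cols[2], int(cols[3]), int(cols[4]), score, cols[6])


# Hypothetical reporter record; names and values are illustrative only.
example = "chr1\tarrayCGH\treporter\t1000\t1500\t-0.82\t+\t.\tID=reporter_001"
feature = parse_gff_line(example)
print(feature)
```

A viewer could then place each reporter at its start and end coordinates on the named chromosome and plot the score to highlight candidate deletions or amplifications.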
Dynamic, electronically switchable surfaces for membrane protein microarrays.
Tang, C S; Dusseiller, M; Makohliso, S; Heuschkel, M; Sharma, S; Keller, B; Vörös, J
2006-02-01
Microarray technology is a powerful tool that provides a high throughput of bioanalytical information within a single experiment. These miniaturized and parallelized binding assays are highly sensitive and have found widespread popularity especially during the genomic era. However, as drug diagnostics studies are often targeted at membrane proteins, the current arraying technologies are ill-equipped to handle the fragile nature of the protein molecules. In addition, to understand the complex structure and functions of proteins, different strategies to immobilize the probe molecules selectively onto a platform for protein microarray are required. We propose a novel approach to create a (membrane) protein microarray by using an indium tin oxide (ITO) microelectrode array with an electronic multiplexing capability. A polycationic, protein- and vesicle-resistant copolymer, poly(l-lysine)-grafted-poly(ethylene glycol) (PLL-g-PEG), is exposed to and adsorbed uniformly onto the microelectrode array, as a passivating adlayer. An electronic stimulation is then applied onto the individual ITO microelectrodes resulting in the localized release of the polymer thus revealing a bare ITO surface. Different polymer and biological moieties are specifically immobilized onto the activated ITO microelectrodes while the other regions remain protein-resistant as they are unaffected by the induced electrical potential. The desorption process of the PLL-g-PEG is observed to be highly selective, rapid, and reversible without compromising on the integrity and performance of the conductive ITO microelectrodes. As such, we have successfully created a stable and heterogeneous microarray of biomolecules by using selective electronic addressing on ITO microelectrodes. Both pharmaceutical diagnostics and biomedical technology are expected to benefit directly from this unique method.
Comparison of RNA-seq and microarray-based models for clinical endpoint prediction.
Zhang, Wenqian; Yu, Ying; Hertwig, Falk; Thierry-Mieg, Jean; Zhang, Wenwei; Thierry-Mieg, Danielle; Wang, Jian; Furlanello, Cesare; Devanarayan, Viswanath; Cheng, Jie; Deng, Youping; Hero, Barbara; Hong, Huixiao; Jia, Meiwen; Li, Li; Lin, Simon M; Nikolsky, Yuri; Oberthuer, André; Qing, Tao; Su, Zhenqiang; Volland, Ruth; Wang, Charles; Wang, May D; Ai, Junmei; Albanese, Davide; Asgharzadeh, Shahab; Avigad, Smadar; Bao, Wenjun; Bessarabova, Marina; Brilliant, Murray H; Brors, Benedikt; Chierici, Marco; Chu, Tzu-Ming; Zhang, Jibin; Grundy, Richard G; He, Min Max; Hebbring, Scott; Kaufman, Howard L; Lababidi, Samir; Lancashire, Lee J; Li, Yan; Lu, Xin X; Luo, Heng; Ma, Xiwen; Ning, Baitang; Noguera, Rosa; Peifer, Martin; Phan, John H; Roels, Frederik; Rosswog, Carolina; Shao, Susan; Shen, Jie; Theissen, Jessica; Tonini, Gian Paolo; Vandesompele, Jo; Wu, Po-Yen; Xiao, Wenzhong; Xu, Joshua; Xu, Weihong; Xuan, Jiekun; Yang, Yong; Ye, Zhan; Dong, Zirui; Zhang, Ke K; Yin, Ye; Zhao, Chen; Zheng, Yuanting; Wolfinger, Russell D; Shi, Tieliu; Malkas, Linda H; Berthold, Frank; Wang, Jun; Tong, Weida; Shi, Leming; Peng, Zhiyu; Fischer, Matthias
2015-06-25
Gene expression profiling is being widely applied in cancer research to identify biomarkers for clinical endpoint prediction. Since RNA-seq provides a powerful tool for transcriptome-based applications beyond the limitations of microarrays, we sought to systematically evaluate the performance of RNA-seq-based and microarray-based classifiers in this MAQC-III/SEQC study for clinical endpoint prediction using neuroblastoma as a model. We generate gene expression profiles from 498 primary neuroblastomas using both RNA-seq and 44 k microarrays. Characterization of the neuroblastoma transcriptome by RNA-seq reveals that more than 48,000 genes and 200,000 transcripts are being expressed in this malignancy. We also find that RNA-seq provides much more detailed information on specific transcript expression patterns in clinico-genetic neuroblastoma subgroups than microarrays. To systematically compare the power of RNA-seq and microarray-based models in predicting clinical endpoints, we divide the cohort randomly into training and validation sets and develop 360 predictive models on six clinical endpoints of varying predictability. Evaluation of factors potentially affecting model performances reveals that prediction accuracies are most strongly influenced by the nature of the clinical endpoint, whereas technological platforms (RNA-seq vs. microarrays), RNA-seq data analysis pipelines, and feature levels (gene vs. transcript vs. exon-junction level) do not significantly affect performances of the models. We demonstrate that RNA-seq outperforms microarrays in determining the transcriptomic characteristics of cancer, while RNA-seq and microarray-based models perform similarly in clinical endpoint prediction. Our findings may be valuable to guide future studies on the development of gene expression-based predictive models and their implementation in clinical practice.
Clustering-based spot segmentation of cDNA microarray images.
Uslan, Volkan; Bucak, Ihsan Ömür
2010-01-01
Microarrays are widely used because they provide information on the expression of thousands of genes simultaneously. In this study, the segmentation step of microarray image processing was implemented. Two clustering-based methods, fuzzy c-means and k-means, were applied to the segmentation step, which separates the spots from the background. The experiments show that fuzzy c-means segmented the spots of the microarray image more accurately than k-means.
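The two clustering approaches named above can be sketched on a one-dimensional list of pixel intensities. This is a minimal illustration, not the authors' implementation: the function names, the two-cluster setup (background versus spot), the fuzzifier m = 2 and the toy pixel values are all assumptions made for the example.

```python
def kmeans_1d(pixels, iters=20):
    """Two-cluster k-means on pixel intensities (0 = background, 1 = spot)."""
    lo, hi = float(min(pixels)), float(max(pixels))  # initial centers
    for _ in range(iters):
        labels = [1 if abs(p - hi) < abs(p - lo) else 0 for p in pixels]
        fg = [p for p, lab in zip(pixels, labels) if lab == 1]
        bg = [p for p, lab in zip(pixels, labels) if lab == 0]
        if fg:
            hi = sum(fg) / len(fg)
        if bg:
            lo = sum(bg) / len(bg)
    # Final hard assignment against the converged centers.
    labels = [1 if abs(p - hi) < abs(p - lo) else 0 for p in pixels]
    return (lo, hi), labels


def fuzzy_cmeans_1d(pixels, m=2.0, iters=50, eps=1e-9):
    """Two-cluster fuzzy c-means; returns centers and soft memberships."""
    centers = [float(min(pixels)), float(max(pixels))]
    memberships = []
    for _ in range(iters):
        memberships = []
        for p in pixels:
            # Membership is proportional to distance^(-2/(m-1)), normalized.
            inv = [(abs(p - c) + eps) ** (-2.0 / (m - 1.0)) for c in centers]
            total = sum(inv)
            memberships.append([v / total for v in inv])
        # Each center is the mean of the pixels weighted by membership^m.
        centers = [
            sum((u[i] ** m) * p for u, p in zip(memberships, pixels))
            / sum(u[i] ** m for u in memberships)
            for i in range(2)
        ]
    return centers, memberships


# Hypothetical intensities: dark background with a few bright spot pixels.
pixels = [10, 12, 11, 13, 200, 210, 205, 12, 198]
km_centers, km_labels = kmeans_1d(pixels)
fcm_centers, fcm_u = fuzzy_cmeans_1d(pixels)
fcm_labels = [0 if u[0] >= u[1] else 1 for u in fcm_u]
```

The practical difference is that fuzzy c-means keeps a graded membership for each pixel, which can be thresholded later, whereas k-means commits every pixel to one cluster at each iteration.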
A perspective on microarrays: current applications, pitfalls, and potential uses
Jaluria, Pratik; Konstantopoulos, Konstantinos; Betenbaugh, Michael; Shiloach, Joseph
2007-01-01
With advances in robotics, computational capabilities, and the fabrication of high quality glass slides coinciding with increased genomic information being available on public databases, microarray technology is increasingly being used in laboratories around the world. In fact, fields as varied as toxicology, evolutionary biology, drug development and production, disease characterization, diagnostics development, cellular physiology and stress responses, and forensics have benefited from its use. However, for many researchers not familiar with microarrays, current articles and reviews often address neither the fundamental principles behind the technology nor the proper design of experiments. Although microarray technology is relatively simple conceptually, its practice requires careful planning and a detailed understanding of its inherent limitations. Without these considerations, it can be exceedingly difficult to extract valuable information from microarray data. Therefore, this text aims to outline key features of microarray technology, paying particular attention to current applications as outlined in recent publications, experimental design, statistical methods, and potential uses. Furthermore, this review is not meant to be comprehensive, but rather substantive, highlighting important concepts and detailing the steps necessary to conduct and interpret microarray experiments. Collectively, the information included in this text will highlight the versatility of microarray technology and provide a glimpse of what the future may hold. PMID:17254338
Yang, Yunfeng; Zhu, Mengxia; Wu, Liyou; Zhou, Jizhong
2008-09-16
Using genomic DNA as common reference in microarray experiments has recently been tested by different laboratories. Conflicting results have been reported with regard to the reliability of microarray results using this method. To explain it, we hypothesize that data processing is a critical element that impacts the data quality. Microarray experiments were performed in a gamma-proteobacterium Shewanella oneidensis. Pair-wise comparison of three experimental conditions was obtained either with two labeled cDNA samples co-hybridized to the same array, or by employing Shewanella genomic DNA as a standard reference. Various data processing techniques were exploited to reduce the amount of inconsistency between both methods and the results were assessed. We discovered that data quality was significantly improved by imposing the constraint of minimal number of replicates, logarithmic transformation and random error analyses. These findings demonstrate that data processing significantly influences data quality, which provides an explanation for the conflicting evaluation in the literature. This work could serve as a guideline for microarray data analysis using genomic DNA as a standard reference.
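As a rough illustration of how a genomic DNA reference yields a pair-wise comparison, the sketch below subtracts log ratios, since log2(A/gDNA) - log2(B/gDNA) = log2(A/B). The function name, the data layout and the replicate threshold of three are assumptions for the example; the abstract does not state the values the authors used, and their random error analysis is not reproduced here.

```python
import math
from statistics import mean


def indirect_log_ratio(sample_a, sample_b, min_replicates=3):
    """Estimate log2(A/B) for one gene through a shared genomic DNA reference.

    sample_a, sample_b: lists of (cDNA_signal, gDNA_signal) replicate pairs.
    Returns None when either condition has too few usable replicates.
    """
    def log_ratios(replicates):
        # Keep only replicates where both channels carry a positive signal.
        return [math.log2(c / g) for c, g in replicates if c > 0 and g > 0]

    a, b = log_ratios(sample_a), log_ratios(sample_b)
    if len(a) < min_replicates or len(b) < min_replicates:
        return None  # hypothetical minimal-replicate constraint
    return mean(a) - mean(b)


# Hypothetical signals: condition A is fourfold above the reference,
# condition B is equal to it, so the indirect estimate is log2(4) = 2.
cond_a = [(400, 100), (800, 200), (200, 50)]
cond_b = [(100, 100), (50, 50), (30, 30)]
estimate = indirect_log_ratio(cond_a, cond_b)
```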
USDA-ARS's Scientific Manuscript database
The amount of microarray gene expression data in public repositories has been increasing exponentially for the last couple of decades. High-throughput microarray data integration and analysis has become a critical step in exploring the large amount of expression data for biological discovery. Howeve...
Questioning the utility of pooling samples in microarray experiments with cell lines.
Lusa, L; Cappelletti, V; Gariboldi, M; Ferrario, C; De Cecco, L; Reid, J F; Toffanin, S; Gallus, G; McShane, L M; Daidone, M G; Pierotti, M A
2006-01-01
We describe a microarray experiment using the MCF-7 breast cancer cell line in two different experimental conditions for which the same number of independent pools as the number of individual samples was hybridized on Affymetrix GeneChips. Unexpectedly, when using individual samples, the number of probe sets found to be differentially expressed between treated and untreated cells was about three times greater than that found using pools. These findings indicate that pooling samples in microarray experiments where the biological variability is expected to be small might not be helpful and could even decrease one's ability to identify differentially expressed genes.
Chondrocyte channel transcriptomics
Lewis, Rebecca; May, Hannah; Mobasheri, Ali; Barrett-Jolley, Richard
2013-01-01
To date, a range of ion channels have been identified in chondrocytes using a number of different techniques, predominantly electrophysiological and/or biomolecular; each of these has its advantages and disadvantages. Here we aim to compare and contrast the data available from biophysical and microarray experiments. This letter analyses recent transcriptomics datasets from chondrocytes, accessible from the European Bioinformatics Institute (EBI). We discuss whether such bioinformatic analysis of microarray datasets can potentially accelerate identification and discovery of ion channels in chondrocytes. The ion channels which appear most frequently across these microarray datasets are discussed, along with their possible functions. We discuss whether functional or protein data exist which support the microarray data. A microarray experiment comparing gene expression in osteoarthritis and healthy cartilage is also discussed and we verify the differential expression of 2 of these genes, namely the genes encoding large calcium-activated potassium (BK) and aquaporin channels. PMID:23995703
Microarray expression technology: from start to finish.
Elvidge, Gareth
2006-01-01
The recent introduction of new microarray expression technologies and the further development of established platforms ensure that the researcher is presented with a range of options for performing an experiment. Whilst this has opened up the possibilities for future applications, such as exon-specific arrays, increased sample throughput and 'chromatin immunoprecipitation (ChIP) on chip' experiments, the initial decision processes and experiment planning are made more difficult. This review will give an overview of the various technologies that are available to perform a microarray expression experiment, from the initial planning stages through to the final data analysis. Both practical aspects and data analysis options will be considered. The relative advantages and disadvantages will be discussed with insights provided for future directions of the technology.
Genome image programs: visualization and interpretation of Escherichia coli microarray experiments.
Zimmer, Daniel P; Paliy, Oleg; Thomas, Brian; Gyaneshwar, Prasad; Kustu, Sydney
2004-08-01
We have developed programs to facilitate analysis of microarray data in Escherichia coli. They fall into two categories: manipulation of microarray images and identification of known biological relationships among lists of genes. A program in the first category arranges spots from glass-slide DNA microarrays according to their position in the E. coli genome and displays them compactly in genome order. The resulting genome image is presented in a web browser with an image map that allows the user to identify genes in the reordered image. Another program in the first category aligns genome images from two or more experiments. These images assist in visualizing regions of the genome with common transcriptional control. Such regions include multigene operons and clusters of operons, which are easily identified as strings of adjacent, similarly colored spots. The images are also useful for assessing the overall quality of experiments. The second category of programs includes a database and a number of tools for displaying biological information about many E. coli genes simultaneously rather than one gene at a time, which facilitates identifying relationships among them. These programs have accelerated and enhanced our interpretation of results from E. coli DNA microarray experiments. Examples are given. Copyright 2004 Genetics Society of America
Stekel, Dov J.; Sarti, Donatella; Trevino, Victor; Zhang, Lihong; Salmon, Mike; Buckley, Chris D.; Stevens, Mark; Pallen, Mark J.; Penn, Charles; Falciani, Francesco
2005-01-01
A key step in the analysis of microarray data is the selection of genes that are differentially expressed. Ideally, such experiments should be properly replicated in order to infer both technical and biological variability, and the data should be subjected to rigorous hypothesis tests to identify the differentially expressed genes. However, in microarray experiments involving the analysis of very large numbers of biological samples, replication is not always practical. Therefore, there is a need for a method to select differentially expressed genes in a rational way from insufficiently replicated data. In this paper, we describe a simple method that uses bootstrapping to generate an error model from a replicated pilot study that can be used to identify differentially expressed genes in subsequent large-scale studies on the same platform, but in which there may be no replicated arrays. The method builds a stratified error model that includes array-to-array variability, feature-to-feature variability and the dependence of error on signal intensity. We apply this model to the characterization of the host response in a model of bacterial infection of human intestinal epithelial cells. We demonstrate the effectiveness of error model based microarray experiments and propose this as a general strategy for a microarray-based screening of large collections of biological samples. PMID:15800204
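The idea of learning an intensity-dependent error model from a replicated pilot study and reusing it on unreplicated arrays can be sketched as follows. This is a simplified stand-in, not the published model: the equal-size intensity bins, the bootstrap of a 95% quantile of absolute replicate differences, and all function names and toy values are assumptions made for illustration.

```python
import random


def quantile(values, q):
    """Simple empirical quantile (value at index floor(q * n) after sorting)."""
    s = sorted(values)
    return s[min(len(s) - 1, int(q * len(s)))]


def bootstrap_error_model(pilot, n_bins=2, q=0.95, n_boot=200, seed=0):
    """pilot: list of (log_intensity_rep1, log_intensity_rep2), one per feature.

    Returns [(upper_intensity_edge, threshold), ...] ordered by intensity.
    Assumes len(pilot) >= n_bins so that no bin is empty.
    """
    rng = random.Random(seed)
    # Sort features by mean intensity so that bins stratify by signal level.
    records = sorted(((a + b) / 2, abs(a - b)) for a, b in pilot)
    size = len(records) // n_bins
    model = []
    for i in range(n_bins):
        end = (i + 1) * size if i < n_bins - 1 else len(records)
        chunk = records[i * size:end]
        diffs = [d for _, d in chunk]
        # Resample replicate differences with replacement, average the quantile.
        boots = [quantile([rng.choice(diffs) for _ in diffs], q)
                 for _ in range(n_boot)]
        model.append((chunk[-1][0], sum(boots) / n_boot))
    return model


def is_differential(model, log_a, log_b):
    """Flag a feature whose difference exceeds the threshold of its bin."""
    intensity = (log_a + log_b) / 2
    for upper, threshold in model:
        if intensity <= upper:
            return abs(log_a - log_b) > threshold
    return abs(log_a - log_b) > model[-1][1]


# Hypothetical pilot data: noisy at low intensity, tight at high intensity.
pilot = [(5.0, 6.0), (5.2, 4.4), (4.8, 5.6), (5.5, 4.5),
         (12.0, 12.1), (12.5, 12.4), (13.0, 13.1), (12.8, 12.7)]
model = bootstrap_error_model(pilot)
```

With this toy model, the same 0.6 log-unit difference is flagged at high intensity, where replicates agree closely, but not at low intensity, where replicate noise alone reaches that size.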
Kang, Seung-Hui; Park, Chan Hee; Jeung, Hei Cheul; Kim, Ki-Yeol; Rha, Sun Young; Chung, Hyun Cheol
2007-06-01
In array-CGH, various factors may act as variables influencing the result of experiments. Among them, Cot-1 DNA, which has been used as a repetitive sequence-blocking agent, may become an artifact-inducing factor in BAC array-CGH. To identify the effect of Cot-1 DNA on Microarray-CGH experiments, Cot-1 DNA was labeled directly and Microarray-CGH experiments were performed. The results confirmed that probes which hybridized more completely with Cot-1 DNA had a higher sequence similarity to the Alu element. Further, in the sex-mismatched Microarray-CGH experiments, the variation and intensity in the fluorescent signal were reduced in the high intensity probe group, in which probes hybridized better with Cot-1 DNA. By contrast, the low intensity probe group showed no alterations regardless of Cot-1 DNA. These results confirmed in silico that Cot-1 DNA could block repetitive sequences in gDNA and probes. In addition, it was confirmed biologically that the blocking effect of Cot-1 DNA could be exerted via its repetitive sequences, especially Alu elements. Thus, in contrast to BAC-array CGH, the use of Cot-1 DNA is advantageous in controlling experimental variation in Microarray-CGH.
2014-01-01
Background Long noncoding RNAs (lncRNAs) constitute a major but poorly characterized part of the human transcriptome. Recent evidence indicates that many lncRNAs are involved in cancer and can be used as predictive and prognostic biomarkers. A significant fraction of lncRNAs is represented on widely used microarray platforms; however, they have usually been ignored in cancer studies. Results We developed a computational pipeline to annotate lncRNAs on popular Affymetrix U133 microarrays, creating a resource allowing measurement of expression of 1581 lncRNAs. This resource can be utilized to interrogate existing microarray datasets for various lncRNA studies. We found that these lncRNAs fall into three distinct classes according to their statistical distribution by length. Remarkably, these three classes of lncRNAs were co-localized with protein coding genes exhibiting distinct gene ontology groups. This annotation was applied to microarray analysis, which identified a 159 lncRNA signature that discriminates between localized and metastatic stages of neuroblastoma. Analysis of an independent patient cohort revealed that this signature also differentiates relapsing from non-relapsing primary tumors. This is the first example of a signature developed solely via the analysis of lncRNA expression. One of these lncRNAs, termed HOXD-AS1, is encoded in the HOXD cluster. HOXD-AS1 is evolutionarily conserved among hominids and has all bona fide features of a gene. Studying the retinoic acid (RA) response of the SH-SY5Y cell line, a model of human metastatic neuroblastoma, we found that HOXD-AS1 is subject to morphogenic regulation, is activated by the PI3K/Akt pathway and is itself involved in the control of RA-induced cell differentiation. Knock-down experiments revealed that HOXD-AS1 controls expression levels of clinically significant protein-coding genes involved in angiogenesis and inflammation, the hallmarks of metastatic cancer.
Conclusions Our findings greatly extend the number of noncoding RNAs functionally implicated in tumor development and patient treatment and highlight their role as potential prognostic biomarkers of neuroblastomas. PMID:25522241
A meta-data based method for DNA microarray imputation.
Jörnsten, Rebecka; Ouyang, Ming; Wang, Hui-Yu
2007-03-29
DNA microarray experiments are conducted in logical sets, such as time course profiling after a treatment is applied to the samples, or comparisons of the samples under two or more conditions. Due to cost and design constraints of spotted cDNA microarray experiments, each logical set commonly includes only a small number of replicates per condition. Despite the vast improvement of the microarray technology in recent years, missing values are prevalent. Intuitively, imputation of missing values is best done using many replicates within the same logical set. In practice, there are few replicates and thus reliable imputation within logical sets is difficult. However, it is in the case of few replicates that the presence of missing values, and how they are imputed, can have the most profound impact on the outcome of downstream analyses (e.g. significance analysis and clustering). This study explores the feasibility of imputation across logical sets, using the vast amount of publicly available microarray data to improve imputation reliability in the small sample size setting. We download all cDNA microarray data of Saccharomyces cerevisiae, Arabidopsis thaliana, and Caenorhabditis elegans from the Stanford Microarray Database. Through cross-validation and simulation, we find that, for all three species, our proposed imputation using data from public databases is far superior to imputation within a logical set, sometimes to an astonishing degree. Furthermore, the imputation root mean square error for significant genes is generally a lot less than that of non-significant ones. Since downstream analysis of significant genes, such as clustering and network analysis, can be very sensitive to small perturbations of estimated gene effects, it is highly recommended that researchers apply reliable data imputation prior to further analysis. Our method can also be applied to cDNA microarray experiments from other species, provided good reference data are available.
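One way to picture imputation that borrows strength from public data is a nearest-neighbour scheme in which the neighbours of a gene are chosen from an external reference compendium rather than from the few arrays of the logical set. The sketch below is a swapped-in simplification and not the method proposed in the study; the function names, the choice of Pearson correlation, k = 2 and the toy data are all assumptions.

```python
import math


def pearson(x, y):
    """Pearson correlation of two equal-length profiles (0.0 if constant)."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        return 0.0
    return sxy / math.sqrt(sxx * syy)


def impute_with_reference(target, reference, k=2):
    """Fill None entries in `target` using neighbours found in `reference`.

    target:    gene -> values in the small logical set (None = missing).
    reference: gene -> complete profile across public arrays.
    """
    imputed = {gene: list(values) for gene, values in target.items()}
    for gene, values in target.items():
        if gene not in reference:
            continue  # no external information for this gene
        for j, value in enumerate(values):
            if value is not None:
                continue
            # Rank the other genes by similarity in the reference compendium.
            candidates = sorted(
                ((pearson(reference[gene], reference[other]), other)
                 for other in target
                 if other != gene and other in reference
                 and target[other][j] is not None),
                reverse=True,
            )
            top = [other for r, other in candidates[:k] if r > 0]
            if top:
                # Average the neighbours' observed values on the same array.
                imputed[gene][j] = sum(target[o][j] for o in top) / len(top)
    return imputed


# Hypothetical data: g2 and g4 track g1 in the reference, g3 is anticorrelated.
reference = {"g1": [1, 2, 3, 4], "g2": [1.1, 2.0, 3.2, 3.9],
             "g3": [4, 3, 2, 1], "g4": [2, 4, 6, 8]}
target = {"g1": [0.5, None], "g2": [0.6, 1.0],
          "g3": [-0.5, -1.0], "g4": [0.4, 1.2]}
filled = impute_with_reference(target, reference)
```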
Klein, Hans-Ulrich; Ruckert, Christian; Kohlmann, Alexander; Bullinger, Lars; Thiede, Christian; Haferlach, Torsten; Dugas, Martin
2009-12-15
Multiple gene expression signatures derived from microarray experiments have been published in the field of leukemia research. A comparison of these signatures with results from new experiments is useful for verification as well as for interpretation of the results obtained. Currently, the percentage of overlapping genes is frequently used to compare published gene signatures against a signature derived from a new experiment. However, it has been shown that the percentage of overlapping genes is of limited use for comparing two experiments due to the variability of gene signatures caused by different array platforms or assay-specific influencing parameters. Here, we present a robust approach for a systematic and quantitative comparison of published gene expression signatures with an exemplary query dataset. A database storing 138 leukemia-related published gene signatures was designed. Each gene signature was manually annotated with terms according to a leukemia-specific taxonomy. Two analysis steps are implemented to compare a new microarray dataset with the results from previous experiments stored and curated in the database. First, the global test method is applied to assess gene signatures and to constitute a ranking among them. In a subsequent analysis step, the focus is shifted from single gene signatures to chromosomal aberrations or molecular mutations as modeled in the taxonomy. Potentially interesting disease characteristics are detected based on the ranking of gene signatures associated with these aberrations stored in the database. Two example analyses are presented. An implementation of the approach is freely available as web-based application. The presented approach helps researchers to systematically integrate the knowledge derived from numerous microarray experiments into the analysis of a new dataset. 
By means of example leukemia datasets we demonstrate that this approach detects related experiments as well as related molecular mutations and may help to interpret new microarray data.
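For reference, the baseline measure that the abstract describes as being of limited use, the percentage of overlapping genes, can be computed as below. The sketch also shows one reason for that limitation: a gene that the query platform cannot measure lowers the overlap even though it says nothing about agreement. The function names and gene lists are hypothetical, and the global test used by the authors is not implemented here.

```python
def overlap_percentage(published, query):
    """Percentage of the published signature that also appears in the query."""
    published, query = set(published), set(query)
    if not published:
        return 0.0
    return 100.0 * len(published & query) / len(published)


def overlap_on_platform(published, query, platform_genes):
    """Same measure, restricted to genes the query platform can measure."""
    measurable = set(published) & set(platform_genes)
    return overlap_percentage(measurable, query)


# Hypothetical signatures; CEBPA is assumed to be absent from the platform.
published_signature = ["FLT3", "NPM1", "CEBPA", "KIT"]
query_signature = ["NPM1", "KIT", "RUNX1"]
platform = ["FLT3", "NPM1", "KIT", "RUNX1"]

raw = overlap_percentage(published_signature, query_signature)
adjusted = overlap_on_platform(published_signature, query_signature, platform)
```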
Sule, Preeti; Horne, Shelley M.; Logue, Catherine M.; Prüß, Birgit M.
2011-01-01
To understand the continuous problems that Escherichia coli O157:H7 causes as a food pathogen, this study assessed global gene regulation in bacteria growing on meat. Since FlhD/FlhC of E. coli K-12 laboratory strains was previously established as a major control point in transducing signals from the environment to several cellular processes, this study compared the expression pattern of an E. coli O157:H7 parent strain to that of its isogenic flhC mutant. This was done with bacteria that had been grown on meat. Microarray experiments revealed 287 putative targets of FlhC. Real-time PCR was performed as an alternative estimate of transcription and confirmed microarray data for 13 out of 15 genes tested (87%). The confirmed genes are representative of cellular functions, such as central metabolism, cell division, biofilm formation, and pathogenicity. An additional 13 genes from the same cellular functions that the microarray experiment had not identified as regulated by FlhC were tested with real-time PCR and also exhibited higher expression levels in the flhC mutant than in the parent strain. Physiological experiments were performed and confirmed that FlhC reduced the cell division rate, the amount of biofilm biomass, and pathogenicity in a chicken embryo lethality model. Altogether, this study provides valuable insight into the complex regulatory network of the pathogen that enables its survival under various environmental conditions. This information may be used to develop strategies to reduce the number of cells or the pathogenicity of E. coli O157:H7 on meat by interfering with the signal transduction pathways. PMID:21498760
Transcriptomic responses to wounding: meta-analysis of gene expression microarray data.
Sass, Piotr Andrzej; Dąbrowski, Michał; Charzyńska, Agata; Sachadyn, Paweł
2017-11-07
A vast amount of microarray data on the transcriptomic response to injury has been collected so far. We designed the analysis to identify the genes displaying significant changes in expression after wounding in different organisms and tissues. This meta-analysis is the first study to compare gene expression profiles in response to wounding in tissues as different as heart, liver, skin, bone, and spinal cord, and in species including rat, mouse and human. We collected available microarray transcriptomic profiles obtained from different tissue injury experiments and selected the genes showing a minimum twofold change in expression in response to wounding in the prevailing number of experiments for each of the five wound healing stages we distinguished: haemostasis & early inflammation, inflammation, early repair, late repair and remodelling. During the initial phases after wounding, haemostasis & early inflammation and inflammation, the transcriptomic responses showed little consistency between different tissues and experiments. For the later phases, wound repair and remodelling, we identified a number of genes displaying similar transcriptional responses in all examined tissues. As revealed by ontological analyses, activation of certain pathways was rather specific to selected phases of wound healing, such as responses to vitamin D, which were pronounced during inflammation. Conversely, we observed induction of genes encoding inflammatory agents and extracellular matrix proteins in all wound healing phases. Further, we selected several genes differentially upregulated throughout different stages of wound response, including established factors of wound healing in addition to those previously unreported in this context, such as PTPRC and AQP4. We found that transcriptomic responses to wounding showed similar traits in a diverse selection of tissues including skin, muscles, internal organs and nervous system.
Notably, we distinguished transcriptional induction of inflammatory genes not only in the initial response to wounding, but also later, during wound repair and tissue remodelling.
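The selection rule described in this abstract, a minimum twofold change in the prevailing number of experiments, can be sketched as a simple vote across experiments. Reading "prevailing number" as a strict majority is an assumption, as are the function name, the data layout and the toy fold changes.

```python
import math


def consistent_genes(fold_changes, min_fold=2.0):
    """Select genes changed at least `min_fold` in most experiments.

    fold_changes: gene -> list of wounded/control expression ratios,
    one per experiment within a single wound healing stage.
    Returns gene -> "up" or "down" for the selected genes only.
    """
    threshold = math.log2(min_fold)
    selected = {}
    for gene, ratios in fold_changes.items():
        logs = [math.log2(r) for r in ratios]
        up = sum(1 for v in logs if v >= threshold)
        down = sum(1 for v in logs if v <= -threshold)
        # Assumed rule: strictly more than half of the experiments must agree.
        if up > len(logs) / 2:
            selected[gene] = "up"
        elif down > len(logs) / 2:
            selected[gene] = "down"
    return selected


# Hypothetical ratios from three experiments of the same healing stage.
stage_data = {
    "geneA": [2.5, 3.0, 1.1],   # at least twofold up in two of three
    "geneB": [0.4, 0.5, 0.3],   # at least twofold down in all three
    "geneC": [1.2, 2.1, 0.9],   # twofold up in only one of three
}
selected = consistent_genes(stage_data)
```

Working on the log2 scale treats induction and repression symmetrically, so a ratio of 0.5 counts as a twofold decrease in the same way that 2.0 counts as a twofold increase.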
Vallée, Maud; Gravel, Catherine; Palin, Marie-France; Reghenas, Hélène; Stothard, Paul; Wishart, David S; Sirard, Marc-André
2005-07-01
The main objective of the present study was to identify novel oocyte-specific genes in three different species: bovine, mouse, and Xenopus laevis. To achieve this goal, two powerful technologies were combined: a polymerase chain reaction (PCR)-based cDNA subtraction, and cDNA microarrays. Three subtractive libraries consisting of 3456 clones were established and enriched for oocyte-specific transcripts. Sequencing analysis of the positive insert-containing clones resulted in the following classification: 53% of the clones corresponded to known cDNAs, 26% were classified as uncharacterized cDNAs, and a final 9% were classified as novel sequences. All these clones were used for cDNA microarray preparation. Results from these microarray analyses revealed that in addition to already known oocyte-specific genes, such as GDF9, BMP15, and ZP, known genes with unknown function in the oocyte were identified, such as a MLF1-interacting protein (MLF1IP), B-cell translocation gene 4 (BTG4), and phosphotyrosine-binding protein (xPTB). Furthermore, 15 novel oocyte-specific genes were validated by reverse transcription-PCR to confirm their preferential expression in the oocyte compared to somatic tissues. The results obtained in the present study confirmed that microarray analysis is a robust technique to identify true positives from the suppressive subtractive hybridization experiment. Furthermore, obtaining oocyte-specific genes from three species simultaneously allowed us to look at important genes that are conserved across species. Further characterization of these novel oocyte-specific genes will lead to a better understanding of the molecular mechanisms related to the unique functions found in the oocyte.
Characterization and simulation of cDNA microarray spots using a novel mathematical model
Kim, Hye Young; Lee, Seo Eun; Kim, Min Jung; Han, Jin Il; Kim, Bo Kyung; Lee, Yong Sung; Lee, Young Seek; Kim, Jin Hyuk
2007-01-01
Background The quality of cDNA microarray data is crucial for expanding its application to other research areas, such as the study of gene regulatory networks. Despite the fact that a number of algorithms have been suggested to increase the accuracy of microarray gene expression data, it is necessary to obtain reliable microarray images by improving wet-lab experiments. As the first step of a cDNA microarray experiment, spotting cDNA probes is critical to determining the quality of spot images. Results We developed a governing equation of cDNA deposition during evaporation of a drop in the microarray spotting process. The governing equation included four parameters: the surface site density on the support, the extrapolated equilibrium constant for the binding of cDNA molecules with surface sites on glass slides, the macromolecular interaction factor, and the volume constant of a drop of cDNA solution. We simulated cDNA deposition from the single model equation by varying the value of the parameters. The morphology of the resulting cDNA deposit can be classified into three types: a doughnut shape, a peak shape, and a volcano shape. The spot morphology can be changed into a flat shape by varying the experimental conditions while considering the parameters of the governing equation of cDNA deposition. The four parameters were estimated by fitting the governing equation to the real microarray images. With the results of the simulation and the parameter estimation, the phenomenon of the formation of cDNA deposits in each type was investigated. Conclusion This study explains how various spot shapes can exist and suggests which parameters are to be adjusted for obtaining a good spot. This system is able to explore the cDNA microarray spotting process in a predictable, manageable and descriptive manner. 
We hope it can provide a way to predict the incidents that can occur during a real cDNA microarray experiment, and produce useful data for several research applications involving cDNA microarrays. PMID:18096047
The MGED ontology: a framework for describing functional genomics experiments.
Stoeckert, Christian J; Parkinson, Helen
2003-01-01
The Microarray Gene Expression Data (MGED) society was formed with an initial focus on experiments involving microarray technology. Despite the diversity of applications, there are common concepts used and a common need to capture experimental information in a standardized manner. In building the MGED ontology, it was recognized that it would be impractical to cover all the different types of experiments on all the different types of organisms by listing and defining all the types of organisms and their properties. Our solution was to create a framework for describing microarray experiments with an initial focus on the biological sample and its manipulation. For concepts that are common for many species, we could provide a manageable listing of controlled terms. For concepts that are species-specific or whose values cannot be readily listed, we created an 'OntologyEntry' concept that referenced an external resource. The MGED ontology is a work in progress that needs additional instances and particularly needs constraints to be added. The ontology currently covers the experimental sample and design, and we have begun capturing aspects of the microarrays themselves as well. The primary application of the ontology will be to develop forms for entering information into databases, and consequently allowing queries, taking advantage of the structure provided by the ontology. The application of an ontology of experimental conditions extends beyond microarray experiments and, as the scope of MGED includes other aspects of functional genomics, so too will the MGED ontology.
Profiling In Situ Microbial Community Structure with an Amplification Microarray
Knickerbocker, Christopher; Bryant, Lexi; Golova, Julia; Wiles, Cory; Williams, Kenneth H.; Peacock, Aaron D.; Long, Philip E.
2013-01-01
The objectives of this study were to unify amplification, labeling, and microarray hybridization chemistries within a single, closed microfluidic chamber (an amplification microarray) and verify technology performance on a series of groundwater samples from an in situ field experiment designed to compare U(VI) mobility under conditions of various alkalinities (as HCO3−) during stimulated microbial activity accompanying acetate amendment. Analytical limits of detection were between 2 and 200 cell equivalents of purified DNA. Amplification microarray signatures were well correlated with 16S rRNA-targeted quantitative PCR results and hybridization microarray signatures. The succession of the microbial community was evident with and consistent between the two microarray platforms. Amplification microarray analysis of acetate-treated groundwater showed elevated levels of iron-reducing bacteria (Flexibacter, Geobacter, Rhodoferax, and Shewanella) relative to the average background profile, as expected. Identical molecular signatures were evident in the transect treated with acetate plus NaHCO3, but at much lower signal intensities and with a much more rapid decline (to nondetection). Azoarcus, Thauera, and Methylobacterium were responsive in the acetate-only transect but not in the presence of bicarbonate. Observed differences in microbial community composition or response to bicarbonate amendment likely had an effect on measured rates of U reduction, with higher rates probable in the part of the field experiment that was amended with bicarbonate. The simplification in microarray-based work flow is a significant technological advance toward entirely closed-amplicon microarray-based tests and is generally extensible to any number of environmental monitoring applications. PMID:23160129
Helm, Benjamin M; Langley, Katherine; Spangler, Brooke; Vergano, Samantha
2014-08-01
Single nucleotide polymorphism microarrays have the ability to reveal parental consanguinity which may or may not be known to healthcare providers. Consanguinity can have significant implications for the health of patients and for individual and family psychosocial well-being. These results often present ethical and legal dilemmas that can have important ramifications. Unexpected consanguinity can be confounding to healthcare professionals who may be unprepared to handle these results or to communicate them to families or other appropriate representatives. There are few published accounts of experiences with consanguinity and SNP arrays. In this paper we discuss three cases where molecular evidence of parental incest was identified by SNP microarray. We hope to further highlight consanguinity as a potential incidental finding, how the cases were handled by the clinical team, and what resources were found to be most helpful. This paper aims to contribute further to professional discourse on incidental findings with genomic technology and how they were addressed clinically. These experiences may provide some guidance on how others can prepare for these findings and help improve practice. As genetic and genomic testing is utilized more by non-genetics providers, we also hope to inform about the importance of engaging with geneticists and genetic counselors when addressing these findings.
An Introduction to MAMA (Meta-Analysis of MicroArray data) System.
Zhang, Zhe; Fenstermacher, David
2005-01-01
Analyzing microarray data across multiple experiments has been proven advantageous. To support this kind of analysis, we are developing a software system called MAMA (Meta-Analysis of MicroArray data). MAMA utilizes a client-server architecture with a relational database on the server-side for the storage of microarray datasets collected from various resources. The client-side is an application running on the end user's computer that allows the user to manipulate microarray data and analytical results locally. MAMA implementation will integrate several analytical methods, including meta-analysis within an open-source framework offering other developers the flexibility to plug in additional statistical algorithms.
Baumann, Antoine; Devaux, Yvan; Audibert, Gérard; Zhang, Lu; Bracard, Serge; Colnat-Coulbois, Sophie; Klein, Olivier; Zannad, Faiez; Charpentier, Claire; Longrois, Dan; Mertes, Paul-Michel
2013-01-01
Delayed cerebral ischemia (DCI) is a potentially devastating complication after intracranial aneurysm rupture and its mechanisms remain poorly elucidated. Early identification of the patients prone to developing DCI after rupture may represent a major breakthrough in its prevention and treatment. The single gene approach of DCI has demonstrated interest in humans. We hypothesized that whole genome expression profile of blood cells may be useful for better comprehension and prediction of aneurysmal DCI. Over a 35-month period, 218 patients with aneurysm rupture were included in this study. DCI was defined as the occurrence of a new delayed neurological deficit occurring within 2 weeks after aneurysm rupture with evidence of ischemia either on perfusion-diffusion MRI, CT angiography or CT perfusion imaging, or with cerebral angiography. DCI patients were matched against controls based on 4 out of 5 criteria (age, sex, Fisher grade, aneurysm location and smoking status). Genome-wide expression analysis of blood cells obtained at admission was performed by microarrays. Transcriptomic analysis was performed using long oligonucleotide microarrays representing 25,000 genes. Quantitative PCR: 1 µg of total RNA extracted was reverse-transcribed, and the resulting cDNA was diluted 10-fold before performing quantitative PCR. Microarray data were first analyzed by 'Significance Analysis of Microarrays' software which includes the Benjamini correction for multiple testing. In a second step, microarray data fold change was compared using a two-tailed, paired t test. Analysis of receiver-operating characteristic (ROC) curves and the area under the ROC curves were used for prediction analysis. Logistic regression models were used to investigate the additive value of multiple biomarkers. A total of 16 patients demonstrated DCI. 
Significance Analysis of Microarrays software failed to retrieve significant genes, most probably because of the heterogeneity of the patients included in the microarray experiments and the small size of the DCI population sample. Standard two-tailed paired t test and C-statistic revealed significant associations between gene expression and the occurrence of DCI: in particular, the expression of neuroregulin 1 was 1.6-fold upregulated in patients with DCI (p = 0.01) and predicted DCI with an area under the ROC curve of 0.96. Logistic regression analyses revealed a significant association between neuroregulin 1 and DCI (odds ratio 1.46, 95% confidence interval 1.02-2.09, p = 0.02). This pilot study suggests that blood cells may be a reservoir of prognostic biomarkers of DCI in patients with intracranial aneurysm rupture. Despite an evident lack of power, this study elicited neuroregulin 1, a vasoreactivity-, inflammation- and angiogenesis-related gene, as a possible candidate predictor of DCI. Larger cohort studies are needed but genome-wide microarray-based studies are promising research tools for the understanding of DCI after intracranial aneurysm rupture. © 2013 S. Karger AG, Basel.
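The area under the ROC curve reported above can be illustrated with a minimal sketch. It uses the rank-based (Mann-Whitney) formulation of the AUC, which is one standard way to compute it and not necessarily the procedure the authors used. The function name `roc_auc` and the expression values are hypothetical and are not data from the study.

```python
def roc_auc(scores_pos, scores_neg):
    """AUC as the probability that a randomly chosen positive case scores
    higher than a randomly chosen negative case (ties count as 0.5)."""
    wins = 0.0
    for p in scores_pos:
        for n in scores_neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(scores_pos) * len(scores_neg))


# Invented expression values for one gene (illustration only).
dci_patients = [2.1, 1.8, 2.5, 1.9]
matched_controls = [1.2, 1.9, 1.0, 1.5]

auc = roc_auc(dci_patients, matched_controls)
print(f"AUC = {auc:.3f}")
```

An AUC of 0.5 corresponds to a marker with no discriminating ability and 1.0 to perfect separation of the two groups.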
RDFBuilder: a tool to automatically build RDF-based interfaces for MAGE-OM microarray data sources.
Anguita, Alberto; Martin, Luis; Garcia-Remesal, Miguel; Maojo, Victor
2013-07-01
This paper presents RDFBuilder, a tool that enables RDF-based access to MAGE-ML-compliant microarray databases. We have developed a system that automatically transforms the MAGE-OM model and microarray data stored in the ArrayExpress database into RDF format. Additionally, the system automatically enables a SPARQL endpoint. This allows users to execute SPARQL queries for retrieving microarray data, either from specific experiments or from more than one experiment at a time. Our system optimizes response times by caching and reusing information from previous queries. In this paper, we describe our methods for achieving this transformation. We show that our approach is complementary to other existing initiatives, such as Bio2RDF, for accessing and retrieving data from the ArrayExpress database. Copyright © 2013 Elsevier Ireland Ltd. All rights reserved.
Implementation of mutual information and bayes theorem for classification microarray data
NASA Astrophysics Data System (ADS)
Dwifebri Purbolaksono, Mahendra; Widiastuti, Kurnia C.; Syahrul Mubarok, Mohamad; Adiwijaya; Aminy Ma’ruf, Firda
2018-03-01
Microarray technology can read the structure of genes, and analysis is essential to it because it determines which attributes matter more than others. Microarray data can provide cancer-related information for diagnosing a person from their genes. Preparing microarray data is a major problem and takes a long time, because the data contain a large number of insignificant and irrelevant attributes. A method is therefore needed to reduce the dimension of microarray data without discarding the important information carried by the attributes. This research uses mutual information for dimension reduction. The system is built with a machine learning approach based on Bayes' theorem, which takes a statistical and probabilistic approach. Combining the two methods gives a powerful approach to microarray data classification. The experimental results show that the system classifies microarray data well, with the highest F1-score of 91.06% using a Bayesian network and 88.85% using Naïve Bayes.
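The combination described above, mutual information for attribute selection followed by a Bayes classifier, can be sketched as follows. This is a minimal illustration under stated assumptions: expression values are already discretized to 0/1, only Naïve Bayes is shown (not the Bayesian network), and all function names and the toy data are hypothetical and not taken from the paper.

```python
import math
from collections import Counter


def mutual_information(xs, ys):
    """Mutual information (in bits) between two discrete sequences."""
    n = len(xs)
    px, py, pxy = Counter(xs), Counter(ys), Counter(zip(xs, ys))
    return sum((c / n) * math.log2(c * n / (px[x] * py[y]))
               for (x, y), c in pxy.items())


def select_top_k(X, y, k):
    """Indices of the k features with the highest mutual information with y."""
    scores = [mutual_information([row[j] for row in X], y)
              for j in range(len(X[0]))]
    return sorted(range(len(scores)), key=lambda j: scores[j], reverse=True)[:k]


def train_naive_bayes(X, y, features):
    """Count class frequencies and per-class feature-value frequencies."""
    class_counts = Counter(y)
    value_counts = Counter()
    for row, label in zip(X, y):
        for j in features:
            value_counts[(label, j, row[j])] += 1
    return class_counts, value_counts


def predict(model, row, features, n_values=2):
    """Most probable class under Naive Bayes with Laplace smoothing.
    n_values is the number of distinct values a feature can take (assumed 2)."""
    class_counts, value_counts = model
    total = sum(class_counts.values())
    best_label, best_logp = None, -math.inf
    for label, count in class_counts.items():
        logp = math.log(count / total)
        for j in features:
            logp += math.log((value_counts[(label, j, row[j])] + 1)
                             / (count + n_values))
        if logp > best_logp:
            best_label, best_logp = label, logp
    return best_label


# Toy data: feature 0 tracks the label, feature 1 is noisy, feature 2 is constant.
X = [[1, 0, 1], [1, 1, 1], [1, 0, 1], [0, 1, 1], [0, 0, 1], [0, 1, 1]]
y = ["tumor", "tumor", "tumor", "normal", "normal", "normal"]

selected = select_top_k(X, y, k=1)
model = train_naive_bayes(X, y, selected)
print(selected, predict(model, [1, 1, 1], selected))
```

Selecting features before training keeps the classifier from estimating probabilities for thousands of uninformative attributes, which is the motivation the abstract gives for the dimension-reduction step.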
Wang, Wenyu; Liu, Yang; Hao, Jingcan; Zheng, Shuyu; Wen, Yan; Xiao, Xiao; He, Awen; Fan, Qianrui; Zhang, Feng; Liu, Ruiyu
2016-10-10
Hip cartilage destruction is consistently observed in the non-traumatic osteonecrosis of femoral head (NOFH) and accelerates its bone necrosis. The molecular mechanism underlying the cartilage damage of NOFH remains elusive. In this study, we conducted a systematically comparative study of gene expression profiles between NOFH and osteoarthritis (OA). Hip articular cartilage specimens were collected from 12 NOFH patients and 12 controls with traumatic femoral neck fracture for microarray (n=4) and quantitative real-time PCR validation experiments (n=8). Gene expression profiling of articular cartilage was performed using Agilent Human 4×44K Microarray chip. The accuracy of microarray experiment was further validated by qRT-PCR. Gene expression results of OA hip cartilage were derived from previously published study. Significance Analysis of Microarrays (SAM) software was applied for identifying differently expressed genes. Gene ontology (GO) and pathway enrichment analysis were conducted by Gene Set Enrichment Analysis software and DAVID tool, respectively. Totally, 27 differently expressed genes were identified for NOFH. Comparing the gene expression profiles of NOFH cartilage and OA cartilage detected 8 common differently expressed genes, including COL5A1, OGN, ANGPTL4, CRIP1, NFIL3, METRNL, ID2 and STEAP1. GO comparative analysis identified 10 common significant GO terms, mainly implicated in apoptosis and development process. Pathway comparative analysis observed that ECM-receptor interaction pathway and focal adhesion pathway were enriched in the differently expressed genes of both NOFH and hip OA. In conclusion, we identified a set of differently expressed genes, GO and pathways for NOFH articular destruction, some of which were also involved in the hip OA. Our study results may help to reveal the pathogenetic similarities and differences of cartilage damage of NOFH and hip OA. Copyright © 2016 Elsevier B.V. All rights reserved.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Liu, Yu-Ching; Department of Veterinary Medicine, National Chung Hsing University, Taichung 402, Taiwan; Ho, Heng-Chien
2012-07-15
The purpose of this study was to identify the genes induced early in murine oral carcinogenesis. Murine tongue tumors induced by the carcinogen, 4-nitroquinoline 1-oxide (4-NQO), and paired non-tumor tissues were subjected to microarray analysis. Hierarchical clustering of upregulated genes in the tumor tissues revealed an association of induced genes with inflammation. Cytokines/cytokine receptors induced early were subsequently identified, clearly indicating their involvement in oral carcinogenesis. Hierarchical clustering also showed that cytokine-mediated inflammation was possibly linked with Mapk6. Cox2 exhibited the greatest extent (9–18 fold) of induction in the microarray data, and its early induction was observed in a 2 h painting experiment by RT-PCR. MetaCore analysis showed that overexpressed Cox2 may interact with p53 and transcriptionally inhibit expression of several downstream genes. A painting experiment in transgenic mice also demonstrated that NF-κB activates early independently of Cox2 induction. MetaCore analysis revealed the most striking metabolic alterations in tumor tissues, especially in lipid metabolism resulting from the reduction of Pparα and Rxrg. Reduced expression of Mapk12 was noted, and MetaCore analysis established its relationship with decreased efficiency of Pparα phosphorylation. In conclusion, in addition to cytokines/cytokine receptors, the early induction of Cox2 and NF-κB activation is involved in murine oral carcinogenesis.
Protein microarray analysis reveals BAFF-binding autoantibodies in systemic lupus erythematosus
Price, Jordan V.; Haddon, David J.; Kemmer, Dodge; Delepine, Guillaume; Mandelbaum, Gil; Jarrell, Justin A.; Gupta, Rohit; Balboni, Imelda; Chakravarty, Eliza F.; Sokolove, Jeremy; Shum, Anthony K.; Anderson, Mark S.; Cheng, Mickie H.; Robinson, William H.; Browne, Sarah K.; Holland, Steven M.; Baechler, Emily C.; Utz, Paul J.
2013-01-01
Autoantibodies against cytokines, chemokines, and growth factors inhibit normal immunity and are implicated in inflammatory autoimmune disease and diseases of immune deficiency. In an effort to evaluate serum from autoimmune and immunodeficient patients for Abs against cytokines, chemokines, and growth factors in a high-throughput and unbiased manner, we constructed a multiplex protein microarray for detection of serum factor–binding Abs and used the microarray to detect autoantibody targets in SLE. We designed a nitrocellulose-surface microarray containing human cytokines, chemokines, and other circulating proteins and demonstrated that the array permitted specific detection of serum factor–binding probes. We used the arrays to detect previously described autoantibodies against cytokines in samples from individuals with autoimmune polyendocrine syndrome type 1 and chronic mycobacterial infection. Serum profiling from individuals with SLE revealed that among several targets, elevated IgG autoantibody reactivity to B cell–activating factor (BAFF) was associated with SLE compared with control samples. BAFF reactivity correlated with the severity of disease-associated features, including IFN-α–driven SLE pathology. Our results showed that serum factor protein microarrays facilitate detection of autoantibody reactivity to serum factors in human samples and that BAFF-reactive autoantibodies may be associated with an elevated inflammatory disease state within the spectrum of SLE. PMID:24270423
Optimization of cDNA microarrays procedures using criteria that do not rely on external standards.
Bruland, Torunn; Anderssen, Endre; Doseth, Berit; Bergum, Hallgeir; Beisvag, Vidar; Lægreid, Astrid
2007-10-18
The measurement of gene expression using microarray technology is a complicated process in which a large number of factors can be varied. Due to the lack of standard calibration samples such as are used in traditional chemical analysis it may be a problem to evaluate whether changes made to the microarray procedure actually improve the identification of truly differentially expressed genes. The purpose of the present work is to report the optimization of several steps in the microarray process both in laboratory practices and in data processing using criteria that do not rely on external standards. We performed a cDNA microarray experiment including RNA from samples with high expected differential gene expression termed "high contrasts" (rat cell lines AR42J and NRK52E) compared to self-self hybridization, and optimized a pipeline to maximize the number of genes found to be differentially expressed in the "high contrasts" RNA samples by estimating the false discovery rate (FDR) using a null distribution obtained from the self-self experiment. The proposed high-contrast versus self-self method (HCSSM) requires only four microarrays per evaluation. The effects of blocking reagent dose, filtering, and background correction methodologies were investigated. In our experiments a dose of 250 ng LNA (locked nucleic acid) dT blocker, no background correction and weight based filtering gave the largest number of differentially expressed genes. The choice of background correction method had a stronger impact on the estimated number of differentially expressed genes than the choice of filtering method. Cross platform microarray (Illumina) analysis was used to validate that the increase in the number of differentially expressed genes found by HCSSM was real. The results show that HCSSM can be a useful and simple approach to optimize microarray procedures without including external standards. Our optimizing method is highly applicable to both long oligo-probe microarrays which have become commonly used for well characterized organisms such as man, mouse and rat, as well as to cDNA microarrays which are still of importance for organisms with incomplete genome sequence information such as many bacteria, plants and fish. PMID:17949480
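The idea of estimating the false discovery rate from a self-self null distribution can be sketched as follows. This is a simplified empirical estimator chosen for illustration; the abstract does not give the authors' exact formula, so the function names, the thresholding on absolute log ratios, and the toy values are all assumptions.

```python
def estimate_fdr(observed, null, threshold):
    """Empirical FDR at a threshold: expected false calls (scaled from the
    null sample) divided by the number of genes actually called."""
    called = sum(1 for v in observed if abs(v) >= threshold)
    if called == 0:
        return 0.0
    null_hits = sum(1 for v in null if abs(v) >= threshold)
    expected_false = null_hits * len(observed) / len(null)
    return min(1.0, expected_false / called)


def genes_called_at_fdr(observed, null, target_fdr):
    """Largest number of genes that can be called while keeping the
    estimated FDR at or below target_fdr."""
    best = 0
    for t in sorted({abs(v) for v in observed}):
        if estimate_fdr(observed, null, t) <= target_fdr:
            called = sum(1 for v in observed if abs(v) >= t)
            best = max(best, called)
    return best


# Invented log ratios: a "high contrast" comparison and a self-self comparison.
high_contrast = [2.5, -1.8, 0.2, 0.1, -3.0, 0.4, 1.2, -0.3]
self_self = [0.1, -0.2, 0.3, -0.1, 0.5, 0.0, -0.4, 0.2]

n_called = genes_called_at_fdr(high_contrast, self_self, target_fdr=0.05)
print(n_called)
```

Under this reading, two candidate protocols can be compared by running the same function on each and preferring the protocol that yields more called genes at the same target FDR.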
Hatazawa, Yukino; Minami, Kimiko; Yoshimura, Ryoji; Onishi, Takumi; Manio, Mark Christian; Inoue, Kazuo; Sawada, Naoki; Suzuki, Osamu; Miura, Shinji; Kamei, Yasutomi
2016-12-09
The expression of the transcriptional coactivator PGC1α is increased in skeletal muscles during exercise. Previously, we showed that increased PGC1α leads to prolonged exercise performance (the duration for which running can be continued) and, at the same time, increases the expression of branched-chain amino acid (BCAA) metabolism-related enzymes and genes that are involved in supplying substrates for the TCA cycle. We recently created mice with PGC1α knockout specifically in the skeletal muscles (PGC1α KO mice), which show decreased mitochondrial content. In this study, global gene expression (microarray) analysis was performed in the skeletal muscles of PGC1α KO mice compared with that of wild-type control mice. As a result, decreased expression of genes involved in the TCA cycle, oxidative phosphorylation, and BCAA metabolism were observed. Compared with previously obtained microarray data on PGC1α-overexpressing transgenic mice, each gene showed the completely opposite direction of expression change. Bioinformatic analysis of the promoter region of genes with decreased expression in PGC1α KO mice predicted the involvement of several transcription factors, including a nuclear receptor, ERR, in their regulation. As PGC1α KO microarray data in this study show opposing findings to the PGC1α transgenic data, a loss-of-function experiment, as well as a gain-of-function experiment, revealed PGC1α's function in the oxidative energy metabolism of skeletal muscles. Copyright © 2016 Elsevier Inc. All rights reserved.
Zhang, Xiaomeng; Shao, Bin; Wu, Yangle; Qi, Ouyang
2013-01-01
One of the major objectives in systems biology is to understand the relation between the topological structures and the dynamics of biological regulatory networks. In this context, various mathematical tools have been developed to deduce structures of regulatory networks from microarray expression data. In general, from a single data set, one cannot deduce the whole network structure; additional expression data are usually needed. Thus how to design a microarray expression experiment in order to get the most information is a practical problem in systems biology. Here we propose three methods, namely, maximum distance method, trajectory entropy method, and sampling method, to derive the optimal initial conditions for experiments. The performance of these methods is tested and evaluated in three well-known regulatory networks (budding yeast cell cycle, fission yeast cell cycle, and E. coli SOS network). Based on the evaluation, we propose an efficient strategy for the design of microarray expression experiments.
Alexiev, Borislav A; Zou, Ying S
2014-12-01
Chromosomal microarray analysis using novel Molecular Inversion Probe (MIP) technology demonstrated 2,570 kb copy neutral LOH of 10q11.22 in two clear cell papillary renal cell carcinomas. In addition, one of the tumors had a large 29,784 kb deletion of 13q11-q14.2. There were two variants of unknown significance, a 2,509 kb gain of Xp22.33 and a 257 kb homozygous deletion of 8p11.22. The somatic mutation panel containing 74 mutations in nine genes did not reveal any mutations. Besides identification of submicroscopic duplications or deletions, SNP microarrays can reveal abnormal allelic imbalances including LOH and copy neutral LOH, which cannot be recognized by chromosome analysis, FISH, and non-SNP microarrays. To the best of our knowledge, this is the first study demonstrating copy neutral LOH of 10q11.22 in clear cell papillary renal cell carcinomas using the new MIP SNP OncoScan FFPE Assay Kit on formalin-fixed paraffin-embedded tumor samples. Copyright © 2014 Elsevier GmbH. All rights reserved.
Bian, Zehua; Zhang, Jiwei; Li, Min; Feng, Yuyang; Wang, Xue; Zhang, Jia; Yao, Surui; Jin, Guoying; Du, Jun; Han, Weifeng; Yin, Yuan; Huang, Shenglin; Fei, Bojian; Zou, Jian; Huang, Zhaohui
2018-06-18
Long non-coding RNAs (lncRNAs) play key roles in human cancers. Here, FEZF1-AS1, a highly overexpressed lncRNA in colorectal cancer (CRC), was identified by lncRNA microarrays. We aimed to explore the roles and possible molecular mechanisms of FEZF1-AS1 in CRC. LncRNA expression in CRC tissues was measured by lncRNA microarray and qRT-PCR. The functional roles of FEZF1-AS1 in CRC were demonstrated by a series of in vitro and in vivo experiments. RNA pull-down, RNA immunoprecipitation and luciferase analyses were used to demonstrate the potential mechanisms of FEZF1-AS1. We identified a series of differentially expressed lncRNAs in CRC using lncRNA microarrays, and revealed that FEZF1-AS1 is one of the most overexpressed. Further validation in two expanded CRC cohorts confirmed the upregulation of FEZF1-AS1 in CRC, and revealed that increased FEZF1-AS1 expression is associated with poor survival. Functional assays revealed that FEZF1-AS1 promotes CRC cell proliferation and metastasis. Mechanistically, FEZF1-AS1 could bind and increase the stability of the pyruvate kinase 2 (PKM2) protein, resulting in increased cytoplasmic and nuclear PKM2 levels. Increased cytoplasmic PKM2 promoted pyruvate kinase activity and lactate production (aerobic glycolysis), whereas FEZF1-AS1-induced nuclear PKM2 upregulation further activated STAT3 signaling. In addition, PKM2 was upregulated in CRC tissues and correlated with FEZF1-AS1 expression and patient survival. Together, these data provide mechanistic insights into the regulation of FEZF1-AS1 on both STAT3 signaling and glycolysis by binding PKM2 and increasing its stability. Copyright ©2018, American Association for Cancer Research.
Severgnini, Marco; Bicciato, Silvio; Mangano, Eleonora; Scarlatti, Francesca; Mezzelani, Alessandra; Mattioli, Michela; Ghidoni, Riccardo; Peano, Clelia; Bonnal, Raoul; Viti, Federica; Milanesi, Luciano; De Bellis, Gianluca; Battaglia, Cristina
2006-06-01
Meta-analysis of microarray data is increasingly important, considering both the availability of multiple platforms using disparate technologies and the accumulation in public repositories of data sets from different laboratories. We addressed the issue of comparing gene expression profiles from two microarray platforms by devising a standardized investigative strategy. We tested this procedure by studying MDA-MB-231 cells, which undergo apoptosis on treatment with resveratrol. Gene expression profiles were obtained using high-density, short-oligonucleotide, single-color microarray platforms: GeneChip (Affymetrix) and CodeLink (Amersham). Interplatform analyses were carried out on 8414 common transcripts represented on both platforms, as identified by LocusLink ID, representing 70.8% and 88.6% of annotated GeneChip and CodeLink features, respectively. We identified 105 differentially expressed genes (DEGs) on CodeLink and 42 DEGs on GeneChip. Among them, only 9 DEGs were commonly identified by both platforms. Multiple analyses (BLAST alignment of probes with target sequences, gene ontology, literature mining, and quantitative real-time PCR) permitted us to investigate the factors contributing to the generation of platform-dependent results in single-color microarray experiments. An effective approach to cross-platform comparison involves microarrays of similar technologies, samples prepared by identical methods, and a standardized battery of bioinformatic and statistical analyses.
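The figures in this abstract can be put in proportion with a short worked calculation. The sketch below uses only the numbers quoted above; the derived totals of annotated features are back-calculated from the stated percentages and are therefore approximate, and the Jaccard index is an added summary measure that the abstract does not report.

```python
# Figures quoted in the abstract.
common_transcripts = 8414
genechip_fraction = 0.708   # share of annotated GeneChip features
codelink_fraction = 0.886   # share of annotated CodeLink features
degs_codelink, degs_genechip, degs_shared = 105, 42, 9

# Approximate number of annotated features per platform (back-calculated).
genechip_annotated = common_transcripts / genechip_fraction
codelink_annotated = common_transcripts / codelink_fraction

# Agreement between the two DEG lists.
degs_union = degs_codelink + degs_genechip - degs_shared
jaccard = degs_shared / degs_union
shared_of_genechip = degs_shared / degs_genechip
shared_of_codelink = degs_shared / degs_codelink

print(f"GeneChip annotated features ~ {genechip_annotated:.0f}")
print(f"CodeLink annotated features ~ {codelink_annotated:.0f}")
print(f"DEG union = {degs_union}, Jaccard = {jaccard:.3f}")
print(f"Shared DEGs: {shared_of_genechip:.1%} of GeneChip, "
      f"{shared_of_codelink:.1%} of CodeLink")
```

Although most transcripts are represented on both platforms, the shared DEGs are a small fraction of either list, which is the platform-dependence the authors set out to investigate.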
Huerta, Mario; Munyi, Marc; Expósito, David; Querol, Enric; Cedano, Juan
2014-06-15
The number of microarrays performed by scientific teams grows exponentially. These microarray data could be useful for researchers around the world, but unfortunately they are underused. To fully exploit these data, it is necessary (i) to extract these data from a repository of the high-throughput gene expression data like Gene Expression Omnibus (GEO) and (ii) to make the data from different microarrays comparable with tools that are easy for scientists to use. We have developed these two solutions in our server, implementing a database of microarray marker genes (Marker Genes Data Base). This database contains the marker genes of all GEO microarray datasets and it is updated monthly with the new microarrays from GEO. Thus, researchers can see whether the marker genes of their microarray are marker genes in other microarrays in the database, expanding the analysis of their microarray to the rest of the public microarrays. This solution helps not only to corroborate the conclusions regarding a researcher's microarray but also to identify the phenotype of different subsets of individuals under investigation, to frame the results with microarray experiments from other species, pathologies or tissues, to search for drugs that promote the transition between the studied phenotypes, to detect undesirable side effects of the treatment applied, etc. Thus, the researcher can quickly add relevant information to his/her studies from all of the previous analyses performed in other studies as long as they have been deposited in public repositories. Marker-gene database tool: http://ibb.uab.es/mgdb © The Author 2014. Published by Oxford University Press.
Gong, Wei; He, Kun; Covington, Mike; Dinesh-Kumar, S. P.; Snyder, Michael; Harmer, Stacey L.; Zhu, Yu-Xian; Deng, Xing Wang
2009-01-01
We used our collection of Arabidopsis transcription factor (TF) ORFeome clones to construct protein microarrays containing as many as 802 TF proteins. These protein microarrays were used for both protein-DNA and protein-protein interaction analyses. For protein-DNA interaction studies, we examined AP2/ERF family TFs and their cognate cis-elements. By careful comparison of the DNA-binding specificity of 13 TFs on the protein microarray with previous non-microarray data, we showed that protein microarrays provide an efficient and high throughput tool for genome-wide analysis of TF-DNA interactions. This microarray protein-DNA interaction analysis allowed us to derive a comprehensive view of DNA-binding profiles of AP2/ERF family proteins in Arabidopsis. It also revealed four TFs that bound the EE (evening element) and had the expected phased gene expression under clock-regulation, thus providing a basis for further functional analysis of their roles in clock regulation of gene expression. We also developed procedures for detecting protein interactions using this TF protein microarray and discovered four novel partners that interact with HY5, which can be validated by yeast two-hybrid assays. Thus, plant TF protein microarrays offer an attractive high-throughput alternative to traditional techniques for TF functional characterization on a global scale. PMID:19802365
Best practices for hybridization design in two-colour microarray analysis.
Knapen, Dries; Vergauwen, Lucia; Laukens, Kris; Blust, Ronny
2009-07-01
Two-colour microarrays are a popular platform of choice in gene expression studies. Because two different samples are hybridized on a single microarray, and several microarrays are usually needed in a given experiment, there are many possible ways to combine samples on different microarrays. The actual combination employed is commonly referred to as the 'hybridization design'. Different types of hybridization designs have been developed, all aimed at optimizing the experimental setup for the detection of differentially expressed genes while coping with technical noise. Here, we first provide an overview of the different classes of hybridization designs, discussing their advantages and limitations, and then we illustrate the current trends in the use of different hybridization design types in contemporary research.
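The notion of a hybridization design can be made concrete with a small sketch. The abstract does not name specific design classes, so the reference design, loop design, and dye swap shown here are assumptions drawn from common usage in the two-colour microarray literature. The function names and sample labels are hypothetical.

```python
from collections import Counter


def reference_design(samples, reference="REF"):
    """Each sample is co-hybridized with a common reference: (Cy3, Cy5) pairs."""
    return [(reference, s) for s in samples]


def loop_design(samples):
    """Each sample is co-hybridized with its neighbour, closing into a loop."""
    n = len(samples)
    return [(samples[i], samples[(i + 1) % n]) for i in range(n)]


def with_dye_swaps(arrays):
    """Add a dye-swapped replicate for every array to balance dye bias."""
    swapped = []
    for cy3, cy5 in arrays:
        swapped.append((cy3, cy5))
        swapped.append((cy5, cy3))
    return swapped


def measurements_per_sample(arrays):
    """How many times each sample is labelled and measured across all arrays."""
    counts = Counter()
    for cy3, cy5 in arrays:
        counts[cy3] += 1
        counts[cy5] += 1
    return counts


samples = ["A", "B", "C", "D"]
ref = reference_design(samples)
loop = loop_design(samples)

print(len(ref), measurements_per_sample(ref))
print(len(loop), measurements_per_sample(loop))
print(len(with_dye_swaps(ref)))
```

With four samples both designs use four arrays, but the loop measures every experimental sample twice while the reference design spends half of its measurements on the reference, which is one of the trade-offs such design comparisons weigh.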
Thermodynamically optimal whole-genome tiling microarray design and validation.
Cho, Hyejin; Chou, Hui-Hsien
2016-06-13
Microarrays are an efficient tool for interrogating the whole transcriptome of a species. Microarrays can be designed according to annotated gene sets, but the resulting microarrays cannot be used to identify novel transcripts and this design method is not applicable to unannotated species. Alternatively, a whole-genome tiling microarray can be designed using only genomic sequences without gene annotations, and it can be used to detect novel RNA transcripts as well as known genes. The difficulty with tiling microarray design lies in the tradeoff between probe-specificity and coverage of the genome. Sequence comparison methods based on BLAST or similar software are commonly employed in microarray design, but they cannot precisely determine the subtle thermodynamic competition between probe targets and partially matched probe nontargets during hybridizations. Using the whole-genome thermodynamic analysis software PICKY to design tiling microarrays, we can achieve maximum whole-genome coverage allowable under the thermodynamic constraints of each target genome. The resulting tiling microarrays are thermodynamically optimal in the sense that all selected probes share the same melting temperature separation range between their targets and closest nontargets, and no additional probes can be added without violating the specificity of the microarray to the target genome. This new design method was used to create two whole-genome tiling microarrays for Escherichia coli MG1655 and Agrobacterium tumefaciens C58 and the experimental results validated the design.
The observation of transcriptional changes following embryonic ethanol exposure may provide significant insights into the biological response to ethanol exposure. In this study, we used microarray analysis to examine the transcriptional response of the developing limb to a dose ...
Li, Zhiguang; Kwekel, Joshua C; Chen, Tao
2012-01-01
Functional comparison across microarray platforms is used to assess the comparability or similarity of the biological relevance associated with the gene expression data generated by multiple microarray platforms. Comparisons at the functional level are very important considering that the ultimate purpose of microarray technology is to determine the biological meaning behind the gene expression changes under a specific condition, not just to generate a list of genes. Herein, we present a method named percentage of overlapping functions (POF) and illustrate how it is used to perform the functional comparison of microarray data generated across multiple platforms. This method facilitates the determination of functional differences or similarities in microarray data generated from multiple array platforms across all the functions that are presented on these platforms. This method can also be used to compare the functional differences or similarities between experiments, projects, or laboratories.
BATS: a Bayesian user-friendly software for analyzing time series microarray experiments.
Angelini, Claudia; Cutillo, Luisa; De Canditiis, Daniela; Mutarelli, Margherita; Pensky, Marianna
2008-10-06
Gene expression levels in a given cell can be influenced by different factors, such as pharmacological or medical treatments. The response to a given stimulus is usually different for different genes and may depend on time. One of the goals of modern molecular biology is the high-throughput identification of genes associated with a particular treatment or a biological process of interest. From a methodological and computational point of view, analyzing high-dimensional time-course microarray data requires a very specific set of tools which are usually not included in standard software packages. Recently, the authors of this paper developed a fully Bayesian approach which allows one to identify differentially expressed genes in a 'one-sample' time-course microarray experiment, to rank them and to estimate their expression profiles. The method is based on explicit expressions for calculations and is hence very computationally efficient. The software package BATS (Bayesian Analysis of Time Series) presented here implements the methodology described above. It allows a user to automatically identify and rank differentially expressed genes and to estimate their expression profiles when at least 5-6 time points are available. The package has a user-friendly interface. BATS successfully manages various technical difficulties which arise in time-course microarray experiments, such as a small number of observations, non-uniform sampling intervals and replicated or missing data. BATS is free, user-friendly software for the analysis of both simulated and real microarray time-course experiments. The software, the user manual and a brief illustrative example are freely available online at the BATS website: http://www.na.iac.cnr.it/bats.
2010-01-01
Background The zebra mussel (Dreissena polymorpha) is well known for its ability to attach to substances under water. Studies in past decades on this underwater adhesion focused on the adhesive protein isolated from the byssogenesis apparatus of the zebra mussel. However, the mechanism of the initiation, maintenance, and determination of the attachment process remains largely unknown. Results In this study, we used a zebra mussel cDNA microarray previously developed in our lab and a factorial analysis to identify the genes that were involved in the response to changes in four factors: temperature (Factor A), current velocity (Factor B), dissolved oxygen (Factor C), and byssogenesis status (Factor D). Twenty probes in the microarray were found to be modified by one of the factors. The transcription products of four selected genes, DPFP-BG20_A01, EGP-BG97/192_B06, EGP-BG13_G05, and NH-BG17_C09, were unique to the zebra mussel foot based on the results of quantitative reverse transcription PCR (qRT-PCR). The expression profiles of these four genes under attachment and non-attachment were also confirmed by qRT-PCR, and the results were consistent with those from the microarray assay. In situ hybridization with the RNA probes of two identified genes, DPFP-BG20_A01 and EGP-BG97/192_B06, indicated that both were expressed by a type of exocrine gland cell located in the middle part of the zebra mussel foot. Conclusions The results of this study suggest that changes in D. polymorpha byssogenesis status and in the environmental factors can dramatically affect the expression profiles of the genes unique to the foot. The factorial design and analysis of the microarray experiment proved to be a reliable method for identifying the influence of multiple factors on the expression profiles of the probesets in the microarray; it thereby provides a powerful tool to reveal the mechanism of zebra mussel underwater attachment. PMID:20509938
Khan, Rishi L; Gonye, Gregory E; Gao, Guang; Schwaber, James S
2006-01-01
Background Using microarrays by co-hybridizing two samples labeled with different dyes enables differential gene expression measurements and comparisons across slides while controlling for within-slide variability. Typically one dye produces weaker signal intensities than the other often causing signals to be undetectable. In addition, undetectable spots represent a large problem for two-color microarray designs and most arrays contain at least 40% undetectable spots even when labeled with reference samples such as Stratagene's Universal Reference RNAs™. Results We introduce a novel universal reference sample that produces strong signal for all spots on the array, increasing the average fraction of detectable spots to 97%. Maximizing detectable spots on the reference image channel also decreases the variability of microarray data allowing for reliable detection of smaller differential gene expression changes. The reference sample is derived from sequence contained in the parental EST clone vector pT7T3D-Pac and is called vector RNA (vRNA). We show that vRNA can also be used for quality control of microarray printing and PCR product quality, detection of hybridization anomalies, and simplification of spot finding and segmentation tasks. This reference sample can be made inexpensively in large quantities as a renewable resource that is consistent across experiments. Conclusion Results of this study show that vRNA provides a useful universal reference that yields high signal for almost all spots on a microarray, reduces variation and allows for comparisons between experiments and laboratories. Further, it can be used for quality control of microarray printing and PCR product quality, detection of hybridization anomalies, and simplification of spot finding and segmentation tasks. This type of reference allows for detection of small changes in differential expression while reference designs in general allow for large-scale multivariate experimental designs. 
vRNA in combination with reference designs enables systems biology microarray experiments of small physiologically relevant changes. PMID:16677381
Equalizer reduces SNP bias in Affymetrix microarrays.
Quigley, David
2015-07-30
Gene expression microarrays measure the levels of messenger ribonucleic acid (mRNA) in a sample using probe sequences that hybridize with transcribed regions. These probe sequences are designed using a reference genome for the relevant species. However, most model organisms and all humans have genomes that deviate from their reference. These variations, which include single nucleotide polymorphisms, insertions of additional nucleotides, and nucleotide deletions, can affect the microarray's performance. Genetic experiments comparing individuals bearing different population-associated single nucleotide polymorphisms that intersect microarray probes are therefore subject to systemic bias, as the reduction in binding efficiency due to a technical artifact is confounded with genetic differences between parental strains. This problem has been recognized for some time, and earlier methods of compensation have attempted to identify probes affected by genome variants using statistical models. These methods may require replicate microarray measurement of gene expression in the relevant tissue in inbred parental samples, which are not always available in model organisms and are never available in humans. By using sequence information for the genomes of organisms under investigation, potentially problematic probes can now be identified a priori. However, there is no published software tool that makes it easy to eliminate these probes from an annotation. I present equalizer, a software package that uses genome variant data to modify annotation files for the commonly used Affymetrix IVT and Gene/Exon platforms. These files can be used by any microarray normalization method for subsequent analysis. I demonstrate how use of equalizer on experiments mapping germline influence on gene expression in a genetic cross between two divergent mouse species and in human samples significantly reduces probe hybridization-induced bias, reducing false positive and false negative findings. 
The equalizer package reduces probe hybridization bias from experiments performed on the Affymetrix microarray platform, allowing accurate assessment of germline influence on gene expression.
Development of DNA Microarrays for Metabolic Pathway and Bioprocess Monitoring
DOE Office of Scientific and Technical Information (OSTI.GOV)
Gregory Stephanopoulos
Transcriptional profiling experiments utilizing DNA microarrays to study the intracellular accumulation of PHB in Synechocystis have proved difficult, in large part because strains that show differences in PHB significant enough to justify global analysis of gene expression have not been isolated.
Haitsma, Jack J.; Furmli, Suleiman; Masoom, Hussain; Liu, Mingyao; Imai, Yumiko; Slutsky, Arthur S.; Beyene, Joseph; Greenwood, Celia M. T.; dos Santos, Claudia
2012-01-01
Objectives To perform a meta-analysis of gene expression microarray data from animal studies of lung injury, and to identify an injury-specific gene expression signature capable of predicting the development of lung injury in humans. Methods We performed a microarray meta-analysis using 77 microarray chips across six platforms, two species and different animal lung injury models exposed to lung injury with and/or without mechanical ventilation. Individual gene chips were classified and grouped based on the strategy used to induce lung injury. Effect size (change in gene expression) was calculated between non-injurious and injurious conditions comparing two main strategies to pool chips: (1) one-hit and (2) two-hit lung injury models. A random effects model was used to integrate individual effect sizes calculated from each experiment. Classification models were built using the gene expression signatures generated by the meta-analysis to predict the development of lung injury in human lung transplant recipients. Results Two injury-specific lists of differentially expressed genes generated from our meta-analysis of lung injury models were validated using external data sets and prospective data from animal models of ventilator-induced lung injury (VILI). Pathway analysis of gene sets revealed that both new and previously implicated VILI-related pathways are enriched with differentially regulated genes. A classification model based on gene expression signatures identified in animal models of lung injury predicted development of primary graft failure (PGF) in lung transplant recipients with greater than 80% accuracy based upon injury profiles from transplant donors. We also found that better classifier performance can be achieved by using meta-analysis to identify differentially expressed genes than by using single study-based differential analysis.
Conclusion Taken together, our data suggest that microarray analysis of gene expression data allows for the detection of “injury” gene predictors that can classify lung injury samples and identify patients at risk for clinically relevant lung injury complications. PMID:23071521
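The step in which a random effects model integrates per-experiment effect sizes can be sketched for a single gene as follows. The abstract does not say which estimator was used; the DerSimonian-Laird estimator shown here is an assumption, chosen only because it is a common default, and the function and variable names are hypothetical.

```python
import math


def random_effects_pool(effects, variances):
    """Pool per-study effect sizes for one gene (DerSimonian-Laird, assumed).

    effects   -- effect size from each experiment
    variances -- within-study variance of each effect size
    Returns (pooled effect, standard error, between-study variance tau^2).
    """
    weights = [1.0 / v for v in variances]
    total_w = sum(weights)
    # Fixed-effect estimate, needed for Cochran's Q.
    fixed = sum(w * y for w, y in zip(weights, effects)) / total_w
    q = sum(w * (y - fixed) ** 2 for w, y in zip(weights, effects))
    k = len(effects)
    c = total_w - sum(w * w for w in weights) / total_w
    # Between-study variance, truncated at zero.
    tau2 = max(0.0, (q - (k - 1)) / c) if c > 0 else 0.0
    # Random-effects weights add tau^2 to every within-study variance.
    re_weights = [1.0 / (v + tau2) for v in variances]
    pooled = sum(w * y for w, y in zip(re_weights, effects)) / sum(re_weights)
    se = math.sqrt(1.0 / sum(re_weights))
    return pooled, se, tau2


if __name__ == "__main__":
    # Hypothetical effect sizes for one gene in two experiments.
    print(random_effects_pool([0.0, 2.0], [1.0, 1.0]))
```

When the studies agree closely, tau^2 is truncated to zero and the result reduces to the fixed-effect estimate; heterogeneous studies widen the standard error.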
A study of metaheuristic algorithms for high dimensional feature selection on microarray data
NASA Astrophysics Data System (ADS)
Dankolo, Muhammad Nasiru; Radzi, Nor Haizan Mohamed; Sallehuddin, Roselina; Mustaffa, Noorfa Haszlinna
2017-11-01
Microarray systems enable experts to examine gene profiles at the molecular level using machine learning algorithms. This increases the potential for classification and diagnosis of many diseases at the gene expression level. However, numerous difficulties may affect the efficiency of machine learning algorithms, including the vast number of gene features contained in the original data. Many of these features may be unrelated to the intended analysis. Therefore, feature selection needs to be performed during data pre-processing. Many feature selection algorithms have been developed and applied to microarray data, including metaheuristic optimization algorithms. This paper discusses the application of metaheuristic algorithms for feature selection in microarray datasets. The study reveals that these algorithms have yielded interesting results with limited resources, thereby saving the computational expense of machine learning algorithms.
Wu, Chengjiang; Zhao, Yangjing; Lin, Yu; Yang, Xinxin; Yan, Meina; Min, Yujiao; Pan, Zihui; Xia, Sheng; Shao, Qixiang
2018-01-01
DNA microarray and high-throughput sequencing have been widely used to identify the differentially expressed genes (DEGs) in systemic lupus erythematosus (SLE). However, the big data from gene microarrays are also challenging to work with in terms of analysis and processing. The present study combined data from the microarray expression profile (GSE65391) and bioinformatics analysis to identify the key genes and cellular pathways in SLE. Gene ontology (GO) and cellular pathway enrichment analyses of DEGs were performed to investigate significantly enriched pathways. A protein-protein interaction network was constructed to determine the key genes in the occurrence and development of SLE. A total of 310 DEGs were identified in SLE, including 193 upregulated genes and 117 downregulated genes. GO analysis revealed that the most significant biological process of DEGs was immune system process. Kyoto Encyclopedia of Genes and Genomes pathway analysis showed that these DEGs were enriched in signaling pathways associated with the immune system, including the RIG-I-like receptor signaling pathway, intestinal immune network for IgA production, antigen processing and presentation and the toll-like receptor signaling pathway. The current study screened the top 10 genes with higher degrees as hub genes, which included 2′-5′-oligoadenylate synthetase 1, MX dynamin like GTPase 2, interferon induced protein with tetratricopeptide repeats 1, interferon regulatory factor 7, interferon induced with helicase C domain 1, signal transducer and activator of transcription 1, ISG15 ubiquitin-like modifier, DExD/H-box helicase 58, interferon induced protein with tetratricopeptide repeats 3 and 2′-5′-oligoadenylate synthetase 2. Module analysis revealed that these hub genes were also involved in the RIG-I-like receptor signaling, cytosolic DNA-sensing, toll-like receptor signaling and ribosome biogenesis pathways. 
In addition, these hub genes, from different probe sets, exhibited a significant co-expression tendency in multi-experiment microarray datasets (P<0.01). In conclusion, these key genes and cellular pathways may improve the current understanding of the underlying mechanism of development of SLE. These key genes may be potential biomarkers of diagnosis, therapy and prognosis for SLE. PMID:29257335
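Selecting hub genes as the nodes with the highest degree in a protein-protein interaction network can be sketched as below. This is a minimal illustration, not the study's pipeline: the edge list, gene labels, and function name are hypothetical, and ties are broken alphabetically as an arbitrary choice.

```python
from collections import defaultdict


def top_hub_genes(edges, n=10):
    """Rank genes by degree (number of distinct interaction partners)."""
    neighbours = defaultdict(set)
    for a, b in edges:
        if a == b:
            continue  # ignore self-interactions
        neighbours[a].add(b)
        neighbours[b].add(a)
    # Highest degree first; alphabetical order breaks ties deterministically.
    ranked = sorted(neighbours, key=lambda g: (-len(neighbours[g]), g))
    return [(gene, len(neighbours[gene])) for gene in ranked[:n]]


if __name__ == "__main__":
    # Toy interaction list with placeholder gene labels.
    toy_edges = [("A", "B"), ("A", "C"), ("A", "D"), ("B", "C"), ("D", "E")]
    print(top_hub_genes(toy_edges, n=2))
```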
MeV+R: using MeV as a graphical user interface for Bioconductor applications in microarray analysis
Chu, Vu T; Gottardo, Raphael; Raftery, Adrian E; Bumgarner, Roger E; Yeung, Ka Yee
2008-01-01
We present MeV+R, an integration of the JAVA MultiExperiment Viewer program with Bioconductor packages. This integration of MultiExperiment Viewer and R is easily extensible to other R packages and provides users with point and click access to traditionally command line driven tools written in R. We demonstrate the ability to use MultiExperiment Viewer as a graphical user interface for Bioconductor applications in microarray data analysis by incorporating three Bioconductor packages, RAMA, BRIDGE and iterativeBMA. PMID:18652698
Mandaokar, Ajin; Kumar, V Dinesh; Amway, Matt; Browse, John
2003-07-01
Jasmonate (JA) is a signaling compound essential for anther development and pollen fertility in Arabidopsis. Mutations that block the pathway of JA synthesis result in male sterility. To understand the processes of anther and pollen maturation, we used microarray and differential display approaches to compare gene expression patterns in anthers of wild-type Arabidopsis and the male-sterile mutant, opr3. The microarray experiment revealed 25 genes that were up-regulated more than 1.8-fold in wild-type anthers as compared to mutant anthers. Experiments based on differential display identified 13 additional genes up-regulated in wild-type anthers compared to opr3, for a total of 38 differentially expressed genes. Searches of the Arabidopsis and non-redundant databases disclosed known or likely functions for 28 of the 38 genes identified, while 10 genes encode proteins of unknown function. Northern blot analysis using eight representative clones as probes confirmed low expression in opr3 anthers compared with wild-type anthers. JA responsiveness of these same genes was also investigated by northern blot analysis of anther RNA isolated from wild-type and opr3 plants. In these experiments, four genes were induced in opr3 anthers within 0.5-1 h of JA treatment while the remaining genes were up-regulated only 1-8 h after JA application. None of these genes was induced by JA in anthers of the coi1 mutant that is deficient in JA responsiveness. The four early-induced genes in opr3 encode lipoxygenase, a putative bHLH transcription factor, epithiospecifier protein and an unknown protein. We propose that these and other early components may be involved in JA signaling and in the initiation of developmental processes. The four late genes encode an extensin-like protein, a peptide transporter and two unknown proteins, which may represent components required later in anther and pollen maturation. 
Transcript profiling has provided a successful approach to identify genes involved in anther and pollen maturation in Arabidopsis.
Advances in cell-free protein array methods.
Yu, Xiaobo; Petritis, Brianne; Duan, Hu; Xu, Danke; LaBaer, Joshua
2018-01-01
Cell-free protein microarrays represent a special form of protein microarray which display proteins made fresh at the time of the experiment, avoiding storage and denaturation. They have been used increasingly in basic and translational research over the past decade to study protein-protein interactions, the pathogen-host relationship, post-translational modifications, and antibody biomarkers of different human diseases. Their role in the first blood-based diagnostic test for early stage breast cancer highlights their value in managing human health. Cell-free protein microarrays will continue to evolve to become widespread tools for research and clinical management. Areas covered: We review the advantages and disadvantages of different cell-free protein arrays, with an emphasis on the methods that have been studied in the last five years. We also discuss the applications of each microarray method. Expert commentary: Given the growing roles and impact of cell-free protein microarrays in research and medicine, we discuss: 1) the current technical and practical limitations of cell-free protein microarrays; 2) the biomarker discovery and verification pipeline using protein microarrays; and 3) how cell-free protein microarrays will advance over the next five years, both in their technology and applications.
Chockalingam, Sriram; Aluru, Maneesha; Aluru, Srinivas
2016-09-19
Pre-processing of microarray data is a well-studied problem. Furthermore, all popular platforms come with their own recommended best practices for differential analysis of genes. However, for genome-scale network inference using microarray data collected from large public repositories, these methods filter out a considerable number of genes. This is primarily due to the effects of aggregating a diverse array of experiments with different technical and biological scenarios. Here we introduce a pre-processing pipeline suitable for inferring genome-scale gene networks from large microarray datasets. We show that partitioning of the available microarray datasets according to biological relevance into tissue- and process-specific categories significantly extends the limits of downstream network construction. We demonstrate the effectiveness of our pre-processing pipeline by inferring genome-scale networks for the model plant Arabidopsis thaliana using two different construction methods and a collection of 11,760 Affymetrix ATH1 microarray chips. Our pre-processing pipeline and the datasets used in this paper are made available at http://alurulab.cc.gatech.edu/microarray-pp.
Analysis of ripening-related gene expression in papaya using an Arabidopsis-based microarray
2012-01-01
Background Papaya (Carica papaya L.) is a commercially important crop that produces climacteric fruits with a soft and sweet pulp that contain a wide range of health promoting phytochemicals. Despite its importance, little is known about transcriptional modifications during papaya fruit ripening and their control. In this study we report the analysis of ripe papaya transcriptome by using a cross-species (XSpecies) microarray technique based on the phylogenetic proximity between papaya and Arabidopsis thaliana. Results Papaya transcriptome analyses resulted in the identification of 414 ripening-related genes with some having their expression validated by qPCR. The transcription profile was compared with that from ripening tomato and grape. There were many similarities between papaya and tomato especially with respect to the expression of genes encoding proteins involved in primary metabolism, regulation of transcription, biotic and abiotic stress and cell wall metabolism. XSpecies microarray data indicated that transcription factors (TFs) of the MADS-box, NAC and AP2/ERF gene families were involved in the control of papaya ripening and revealed that cell wall-related gene expression in papaya had similarities to the expression profiles seen in Arabidopsis during hypocotyl development. Conclusion The cross-species array experiment identified a ripening-related set of genes in papaya allowing the comparison of transcription control between papaya and other fruit bearing taxa during the ripening process. PMID:23256600
Steger, Doris; Berry, David; Haider, Susanne; Horn, Matthias; Wagner, Michael; Stocker, Roman; Loy, Alexander
2011-01-01
The hybridization of nucleic acid targets with surface-immobilized probes is a widely used assay for the parallel detection of multiple targets in medical and biological research. Despite its widespread application, DNA microarray technology still suffers from several biases and lack of reproducibility, stemming in part from an incomplete understanding of the processes governing surface hybridization. In particular, non-random spatial variations within individual microarray hybridizations are often observed, but the mechanisms underpinning this positional bias remain incompletely explained. This study identifies and rationalizes a systematic spatial bias in the intensity of surface hybridization, characterized by markedly increased signal intensity of spots located at the boundaries of the spotted areas of the microarray slide. Combining observations from a simplified single-probe block array format with predictions from a mathematical model, the mechanism responsible for this bias is found to be a position-dependent variation in lateral diffusion of target molecules. Numerical simulations reveal a strong influence of microarray well geometry on the spatial bias. Reciprocal adjustment of the size of the microarray hybridization chamber to the area of surface-bound probes is a simple and effective measure to minimize or eliminate the diffusion-based bias, resulting in increased uniformity and accuracy of quantitative DNA microarray hybridization.
Haider, Susanne; Horn, Matthias; Wagner, Michael; Stocker, Roman; Loy, Alexander
2011-01-01
Background The hybridization of nucleic acid targets with surface-immobilized probes is a widely used assay for the parallel detection of multiple targets in medical and biological research. Despite its widespread application, DNA microarray technology still suffers from several biases and lack of reproducibility, stemming in part from an incomplete understanding of the processes governing surface hybridization. In particular, non-random spatial variations within individual microarray hybridizations are often observed, but the mechanisms underpinning this positional bias remain incompletely explained. Methodology/Principal Findings This study identifies and rationalizes a systematic spatial bias in the intensity of surface hybridization, characterized by markedly increased signal intensity of spots located at the boundaries of the spotted areas of the microarray slide. Combining observations from a simplified single-probe block array format with predictions from a mathematical model, the mechanism responsible for this bias is found to be a position-dependent variation in lateral diffusion of target molecules. Numerical simulations reveal a strong influence of microarray well geometry on the spatial bias. Conclusions Reciprocal adjustment of the size of the microarray hybridization chamber to the area of surface-bound probes is a simple and effective measure to minimize or eliminate the diffusion-based bias, resulting in increased uniformity and accuracy of quantitative DNA microarray hybridization. PMID:21858215
Spot detection and image segmentation in DNA microarray data.
Qin, Li; Rueda, Luis; Ali, Adnan; Ngom, Alioune
2005-01-01
Following the invention of microarrays in 1994, the development and applications of this technology have grown exponentially. The numerous applications of microarray technology include clinical diagnosis and treatment, drug design and discovery, tumour detection, and environmental health research. One of the key issues in the experimental approaches utilising microarrays is to extract quantitative information from the spots, which represent genes in a given experiment. For this process, the initial stages are important and they influence future steps in the analysis. Identifying the spots and separating the background from the foreground is a fundamental problem in DNA microarray data analysis. In this review, we present an overview of state-of-the-art methods for microarray image segmentation. We discuss the foundations of the circle-shaped approach, adaptive shape segmentation, histogram-based methods and the recently introduced clustering-based techniques. We analytically show that clustering-based techniques are equivalent to the one-dimensional, standard k-means clustering algorithm that utilises the Euclidean distance.
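The one-dimensional k-means view of spot segmentation mentioned above can be sketched as follows. This is a minimal illustration under stated assumptions: two clusters (background and foreground), centres initialised at the minimum and maximum pixel intensity, and a hypothetical function name; it is not any of the reviewed methods.

```python
def kmeans_1d_two_clusters(intensities, max_iter=100):
    """Split pixel intensities into background (0) and foreground (1).

    In one dimension the Euclidean distance is the absolute difference.
    Returns (labels, (background centre, foreground centre)).
    """
    lo, hi = float(min(intensities)), float(max(intensities))
    if lo == hi:
        # Flat patch: nothing to separate, call everything background.
        return [0] * len(intensities), (lo, hi)
    labels = []
    for _ in range(max_iter):
        # Assignment step: each pixel goes to the nearer centre.
        labels = [1 if abs(x - hi) < abs(x - lo) else 0 for x in intensities]
        background = [x for x, lab in zip(intensities, labels) if lab == 0]
        foreground = [x for x, lab in zip(intensities, labels) if lab == 1]
        # Update step: centres move to the mean of their cluster.
        new_lo = sum(background) / len(background)
        new_hi = sum(foreground) / len(foreground)
        if new_lo == lo and new_hi == hi:
            break  # converged
        lo, hi = new_lo, new_hi
    return labels, (lo, hi)


if __name__ == "__main__":
    # Hypothetical pixel intensities around one spot.
    print(kmeans_1d_two_clusters([10, 12, 11, 200, 210, 13, 205]))
```

Starting the centres at the extremes guarantees that neither cluster is ever empty, because the darkest pixel always stays with the background centre and the brightest with the foreground centre.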
A Prototype System for Retrieval of Gene Functional Information
Folk, Lillian C.; Patrick, Timothy B.; Pattison, James S.; Wolfinger, Russell D.; Mitchell, Joyce A.
2003-01-01
Microarrays allow researchers to gather data about the expression patterns of thousands of genes simultaneously. Statistical analysis can reveal which genes show statistically significant results. Making biological sense of those results requires the retrieval of functional information about the genes thus identified, typically a manual gene-by-gene retrieval of information from various on-line databases. For experiments generating thousands of genes of interest, retrieval of functional information can become a significant bottleneck. To address this issue, we are currently developing a prototype system to automate the process of retrieval of functional information from multiple on-line sources. PMID:14728346
Microarrays for Undergraduate Classes
ERIC Educational Resources Information Center
Hancock, Dale; Nguyen, Lisa L.; Denyer, Gareth S.; Johnston, Jill M.
2006-01-01
A microarray experiment is presented that, in six laboratory sessions, takes undergraduate students from the tissue sample right through to data analysis. The model chosen, the murine erythroleukemia cell line, can be easily cultured in sufficient quantities for class use. Large changes in gene expression can be induced in these cells by…
Zhu, Yuerong; Zhu, Yuelin; Xu, Wei
2008-01-01
Background Though microarray experiments are very popular in life science research, managing and analyzing microarray data are still challenging tasks for many biologists. Most microarray programs require users to have sophisticated knowledge of mathematics, statistics and computer skills for usage. With accumulating microarray data deposited in public databases, easy-to-use programs to re-analyze previously published microarray data are in high demand. Results EzArray is a web-based Affymetrix expression array data management and analysis system for researchers who need to organize microarray data efficiently and get data analyzed instantly. EzArray organizes microarray data into projects that can be analyzed online with predefined or custom procedures. EzArray performs data preprocessing and detection of differentially expressed genes with statistical methods. All analysis procedures are optimized and highly automated so that even novice users with limited pre-knowledge of microarray data analysis can complete initial analysis quickly. Since all input files, analysis parameters, and executed scripts can be downloaded, EzArray provides maximum reproducibility for each analysis. In addition, EzArray integrates with Gene Expression Omnibus (GEO) and allows instantaneous re-analysis of published array data. Conclusion EzArray is a novel Affymetrix expression array data analysis and sharing system. EzArray provides easy-to-use tools for re-analyzing published microarray data and will help both novice and experienced users perform initial analysis of their microarray data from the location of data storage. We believe EzArray will be a useful system for facilities with microarray services and laboratories with multiple members involved in microarray data analysis. EzArray is freely available from . PMID:18218103
Development and application of a microarray meter tool to optimize microarray experiments
Rouse, Richard JD; Field, Katrine; Lapira, Jennifer; Lee, Allen; Wick, Ivan; Eckhardt, Colleen; Bhasker, C Ramana; Soverchia, Laura; Hardiman, Gary
2008-01-01
Background Successful microarray experimentation requires a complex interplay between the slide chemistry, the printing pins, the nucleic acid probes and targets, and the hybridization milieu. Optimization of these parameters and a careful evaluation of emerging slide chemistries are a prerequisite to any large scale array fabrication effort. We have developed a 'microarray meter' tool which assesses the inherent variations associated with microarray measurement prior to embarking on large scale projects. Findings The microarray meter consists of nucleic acid targets (reference and dynamic range control) and probe components. Different plate designs containing identical probe material were formulated to accommodate different robotic and pin designs. We examined the variability in probe quality and quantity (as judged by the amount of DNA printed and remaining post-hybridization) using three robots equipped with capillary printing pins. Discussion The generation of microarray data with minimal variation requires consistent quality control of the (DNA microarray) manufacturing and experimental processes. Spot reproducibility is a measure primarily of the variations associated with printing. The microarray meter assesses array quality by measuring the DNA content for every feature. It provides a post-hybridization analysis of array quality by scoring probe performance using three metrics, a) a measure of variability in the signal intensities, b) a measure of the signal dynamic range and c) a measure of variability of the spot morphologies. PMID:18710498
Ling, Zhi-Qiang; Wang, Yi; Mukaisho, Kenichi; Hattori, Takanori; Tatsuta, Takeshi; Ge, Ming-Hua; Jin, Li; Mao, Wei-Min; Sugihara, Hiroyuki
2010-06-01
Tests of differentially expressed genes (DEGs) from microarray experiments are based on the null hypothesis that genes that are irrelevant to the phenotype/stimulus are expressed equally in the target and control samples. However, this strict hypothesis is not always true, as there can be several transcriptomic background differences between target and control samples, including different cell/tissue types, different cell cycle stages and different biological donors. These differences lead to increased false positives, which have little biological/medical significance. In this article, we propose a statistical framework to identify DEGs between target and control samples from expression microarray data that allows for transcriptomic background differences between these samples by introducing a modified null hypothesis that the gene expression background difference is normally distributed. We use an iterative procedure to perform robust estimation of the null hypothesis and identify DEGs as outliers. We evaluated our method using our own triplicate microarray experiment, followed by validations with reverse transcription-polymerase chain reaction (RT-PCR) and on the MicroArray Quality Control dataset. The evaluations suggest that our technique (i) yields fewer false positive and false negative results, as measured by the degree of agreement with RT-PCR of the same samples, (ii) can be applied to different microarray platforms and results in better reproducibility as measured by the degree of DEG identification concordance both intra- and inter-platform and (iii) can be applied efficiently with only a few microarray replicates. Based on these evaluations, we propose that this method not only identifies more reliable and biologically/medically significant DEGs, but also reduces the power-cost tradeoff problem in the microarray field. Source code and binaries freely available for download at http://comonca.org.cn/fdca/resources/softwares/deg.zip.
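The idea in the abstract above (a normally distributed background difference, estimated robustly by iteration, with DEGs read off as outliers) can be illustrated with a minimal Python sketch. This is not the authors' released code. The function name `find_deg_outliers`, the median/MAD starting point, the 3-sigma cutoff and the use of per-gene log-ratios are all assumptions made for illustration.

```python
import statistics


def find_deg_outliers(log_ratios, z_cutoff=3.0, max_iter=50):
    """Fit a normal null to per-gene log-ratios and flag outliers.

    Hypothetical sketch: returns (mean, sd, outlier_indices), where mean and
    sd describe the fitted background difference and outlier_indices are the
    candidate DEGs.
    """
    # Robust starting point: median and scaled MAD, so that a few extreme
    # genes do not inflate the initial spread (assumed choice, see lead-in).
    med = statistics.median(log_ratios)
    mad = statistics.median([abs(x - med) for x in log_ratios])
    mu, sd = med, 1.4826 * mad

    inliers = None
    for _ in range(max_iter):
        # Genes consistent with the current null estimate.
        new_inliers = [
            i for i, x in enumerate(log_ratios)
            if abs(x - mu) <= z_cutoff * sd
        ]
        # Stop when the inlier set is stable or too small to refit.
        if new_inliers == inliers or len(new_inliers) < 2:
            break
        inliers = new_inliers
        values = [log_ratios[i] for i in inliers]
        # Re-estimate the null from inliers only.
        mu = statistics.fmean(values)
        sd = statistics.stdev(values)

    kept = set(inliers or [])
    outliers = [i for i in range(len(log_ratios)) if i not in kept]
    return mu, sd, outliers
```

With a background shifted away from zero (for example, log-ratios clustered near 0.5), the fitted mean absorbs the shift, so only genes far from the bulk are reported rather than every gene that differs from zero.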
Mining meiosis and gametogenesis with DNA microarrays.
Schlecht, Ulrich; Primig, Michael
2003-04-01
Gametogenesis is a key developmental process that involves complex transcriptional regulation of numerous genes including many that are conserved between unicellular eukaryotes and mammals. Recent expression-profiling experiments using microarrays have provided insight into the co-ordinated transcription of several hundred genes during mitotic growth and meiotic development in budding and fission yeast. Furthermore, microarray-based studies have identified numerous loci that are regulated during the cell cycle or expressed in a germ-cell specific manner in eukaryotic model systems like Caenorhabditis elegans, Mus musculus as well as Homo sapiens. The unprecedented amount of information produced by post-genome biology has spawned novel approaches to organizing biological knowledge using currently available information technology. This review outlines experiments that contribute to an emerging comprehensive picture of the molecular machinery governing sexual reproduction in eukaryotes.
MiMiR – an integrated platform for microarray data sharing, mining and analysis
Tomlinson, Chris; Thimma, Manjula; Alexandrakis, Stelios; Castillo, Tito; Dennis, Jayne L; Brooks, Anthony; Bradley, Thomas; Turnbull, Carly; Blaveri, Ekaterini; Barton, Geraint; Chiba, Norie; Maratou, Klio; Soutter, Pat; Aitman, Tim; Game, Laurence
2008-01-01
Background Despite considerable efforts within the microarray community for standardising data format, content and description, microarray technologies present major challenges in managing, sharing, analysing and re-using the large amount of data generated locally or internationally. Additionally, it is recognised that inconsistent and low quality experimental annotation in public data repositories significantly compromises the re-use of microarray data for meta-analysis. MiMiR, the Microarray data Mining Resource was designed to tackle some of these limitations and challenges. Here we present new software components and enhancements to the original infrastructure that increase accessibility, utility and opportunities for large scale mining of experimental and clinical data. Results A user friendly Online Annotation Tool allows researchers to submit detailed experimental information via the web at the time of data generation rather than at the time of publication. This ensures the easy access and high accuracy of meta-data collected. Experiments are programmatically built in the MiMiR database from the submitted information and details are systematically curated and further annotated by a team of trained annotators using a new Curation and Annotation Tool. Clinical information can be annotated and coded with a clinical Data Mapping Tool within an appropriate ethical framework. Users can visualise experimental annotation, assess data quality, download and share data via a web-based experiment browser called MiMiR Online. All requests to access data in MiMiR are routed through a sophisticated middleware security layer thereby allowing secure data access and sharing amongst MiMiR registered users prior to publication. Data in MiMiR can be mined and analysed using the integrated EMAAS open source analysis web portal or via export of data and meta-data into Rosetta Resolver data analysis package. 
Conclusion The new MiMiR suite of software enables systematic and effective capture of extensive experimental and clinical information with the highest MIAME score, and secure data sharing prior to publication. MiMiR currently contains more than 150 experiments corresponding to over 3000 hybridisations and supports the Microarray Centre's large microarray user community and two international consortia. The MiMiR flexible and scalable hardware and software architecture enables secure warehousing of thousands of datasets, including clinical studies, from microarray and potentially other -omics technologies. PMID:18801157
MiMiR--an integrated platform for microarray data sharing, mining and analysis.
Tomlinson, Chris; Thimma, Manjula; Alexandrakis, Stelios; Castillo, Tito; Dennis, Jayne L; Brooks, Anthony; Bradley, Thomas; Turnbull, Carly; Blaveri, Ekaterini; Barton, Geraint; Chiba, Norie; Maratou, Klio; Soutter, Pat; Aitman, Tim; Game, Laurence
2008-09-18
Despite considerable efforts within the microarray community for standardising data format, content and description, microarray technologies present major challenges in managing, sharing, analysing and re-using the large amount of data generated locally or internationally. Additionally, it is recognised that inconsistent and low quality experimental annotation in public data repositories significantly compromises the re-use of microarray data for meta-analysis. MiMiR, the Microarray data Mining Resource was designed to tackle some of these limitations and challenges. Here we present new software components and enhancements to the original infrastructure that increase accessibility, utility and opportunities for large scale mining of experimental and clinical data. A user friendly Online Annotation Tool allows researchers to submit detailed experimental information via the web at the time of data generation rather than at the time of publication. This ensures the easy access and high accuracy of meta-data collected. Experiments are programmatically built in the MiMiR database from the submitted information and details are systematically curated and further annotated by a team of trained annotators using a new Curation and Annotation Tool. Clinical information can be annotated and coded with a clinical Data Mapping Tool within an appropriate ethical framework. Users can visualise experimental annotation, assess data quality, download and share data via a web-based experiment browser called MiMiR Online. All requests to access data in MiMiR are routed through a sophisticated middleware security layer thereby allowing secure data access and sharing amongst MiMiR registered users prior to publication. Data in MiMiR can be mined and analysed using the integrated EMAAS open source analysis web portal or via export of data and meta-data into Rosetta Resolver data analysis package. 
The new MiMiR suite of software enables systematic and effective capture of extensive experimental and clinical information with the highest MIAME score, and secure data sharing prior to publication. MiMiR currently contains more than 150 experiments corresponding to over 3000 hybridisations and supports the Microarray Centre's large microarray user community and two international consortia. The MiMiR flexible and scalable hardware and software architecture enables secure warehousing of thousands of datasets, including clinical studies, from microarray and potentially other -omics technologies.
Construction of a cDNA microarray derived from the ascidian Ciona intestinalis.
Azumi, Kaoru; Takahashi, Hiroki; Miki, Yasufumi; Fujie, Manabu; Usami, Takeshi; Ishikawa, Hisayoshi; Kitayama, Atsusi; Satou, Yutaka; Ueno, Naoto; Satoh, Nori
2003-10-01
A cDNA microarray was constructed from a basal chordate, the ascidian Ciona intestinalis. The draft genome of Ciona has been read and inferred to contain approximately 16,000 protein-coding genes, and cDNAs for transcripts of 13,464 genes have been characterized and compiled as the "Ciona intestinalis Gene Collection Release I". In the present study, we constructed a cDNA microarray of these 13,464 Ciona genes. A preliminary experiment with Cy3- and Cy5-labeled probes showed extensive differential gene expression between fertilized eggs and larvae. In addition, there was a good correlation between results obtained by the present microarray analysis and those from previous EST analyses. This first microarray of a large collection of Ciona intestinalis cDNA clones should facilitate the analysis of global gene expression and gene networks during the embryogenesis of basal chordates.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Gentry, T.; Schadt, C.; Zhou, J.
Microarray technology has the unparalleled potential to simultaneously determine the dynamics and/or activities of most, if not all, of the microbial populations in complex environments such as soils and sediments. Researchers have developed several types of arrays that characterize the microbial populations in these samples based on their phylogenetic relatedness or functional genomic content. Several recent studies have used these microarrays to investigate ecological issues; however, most have only analyzed a limited number of samples with relatively few experiments utilizing the full high-throughput potential of microarray analysis. This is due in part to the unique analytical challenges that these samples present with regard to sensitivity, specificity, quantitation, and data analysis. This review discusses specific applications of microarrays to microbial ecology research along with some of the latest studies addressing the difficulties encountered during analysis of complex microbial communities within environmental samples. With continued development, microarray technology may ultimately achieve its potential for comprehensive, high-throughput characterization of microbial populations in near real-time.
Evaluating concentration estimation errors in ELISA microarray experiments
DOE Office of Scientific and Technical Information (OSTI.GOV)
Daly, Don S.; White, Amanda M.; Varnum, Susan M.
Enzyme-linked immunosorbent assay (ELISA) is a standard immunoassay to predict a protein concentration in a sample. Deploying ELISA in a microarray format permits simultaneous prediction of the concentrations of numerous proteins in a small sample. These predictions, however, are uncertain due to processing error and biological variability. Evaluating prediction error is critical to interpreting biological significance and improving the ELISA microarray process. Evaluating prediction error must be automated to realize a reliable high-throughput ELISA microarray system. Methods: In this paper, we present a statistical method based on propagation of error to evaluate prediction errors in the ELISA microarray process. Although propagation of error is central to this method, it is effective only when comparable data are available. Therefore, we briefly discuss the roles of experimental design, data screening, normalization and statistical diagnostics when evaluating ELISA microarray prediction errors. We use an ELISA microarray investigation of breast cancer biomarkers to illustrate the evaluation of prediction errors. The illustration begins with a description of the design and resulting data, followed by a brief discussion of data screening and normalization. In our illustration, we fit a standard curve to the screened and normalized data, review the modeling diagnostics, and apply propagation of error.
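As a rough illustration of the propagation-of-error step named in the abstract above, the Python sketch below pushes a signal standard deviation through an inverted standard curve using the first-order rule sd_f = |f'(x)| * sd_x. The abstract does not state the curve form, so the four-parameter logistic curve, the parameter values and the function names here are assumptions. The sketch also propagates measurement noise in the signal only, not uncertainty in the fitted curve parameters.

```python
def four_pl(conc, a, b, c, d):
    """Hypothetical four-parameter logistic standard curve (signal vs. concentration).

    a: signal at zero concentration, d: signal at saturation,
    c: inflection concentration, b: slope factor.
    """
    return d + (a - d) / (1.0 + (conc / c) ** b)


def inverse_four_pl(signal, a, b, c, d):
    """Concentration predicted from a signal by inverting the curve above."""
    return c * ((a - d) / (signal - d) - 1.0) ** (1.0 / b)


def propagate_sd(func, x, sd_x, rel_step=1e-6):
    """First-order propagation of error: sd of func(x) given sd of x.

    The derivative is approximated with a central finite difference.
    """
    h = rel_step * max(abs(x), 1.0)
    slope = (func(x + h) - func(x - h)) / (2.0 * h)
    return abs(slope) * sd_x


# Illustrative (made-up) curve parameters and measurement noise.
params = {"a": 0.1, "b": 1.2, "c": 50.0, "d": 2.0}
signal = four_pl(10.0, **params)
sd_conc = propagate_sd(lambda s: inverse_four_pl(s, **params), signal, sd_x=0.01)
```

Because the curve flattens near its asymptotes, the same signal noise maps to a much larger concentration error there, which is one reason a per-prediction error estimate is more informative than a single assay-wide figure.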
Zhou, Jinxu; Wang, Hongxiang; Chu, Junsheng; Huang, Qilin; Li, Guangxu; Yan, Yong; Xu, Tao; Chen, Juxiang; Wang, Yuhai
2018-04-24
Recent studies have found circular RNAs (circRNAs) involved in the biological process of cancers. However, little is known about their functional roles in glioblastoma. Human circRNA microarray analysis was performed to screen the expression profile of circRNAs in IDH1 wild-type glioblastoma tissue. The expression of hsa_circ_0008344 in glioblastoma and normal brain samples was quantified by qRT-PCR. Functional experiments were performed to investigate the biological functions of hsa_circ_0008344, including MTT assay, colony formation assay, transwell assay, and cell apoptosis assay. CircRNA microarray revealed a total of 417 abnormally expressed circRNAs (>1.5-fold, P < .05) in glioblastoma tissue compared with the adjacent normal brain. Hsa_circ_0008344, among the top differentially expressed circRNAs, was significantly upregulated in IDH1 wild-type glioblastoma. Further in vitro studies showed that knockdown of hsa_circ_0008344 suppressed glioblastoma cell proliferation, colony formation, migration, and invasion, but increased cell apoptotic rate. Hsa_circ_0008344 is upregulated in glioblastoma and may contribute to the progression of this malignancy. © 2018 Wiley Periodicals, Inc.
Satapathy, Lopamudra; Singh, Dharmendra; Ranjan, Prashant; Kumar, Dhananjay; Kumar, Manish; Prabhu, Kumble Vinod; Mukhopadhyay, Kunal
2014-12-01
WRKY, a plant-specific transcription factor family, has important roles in pathogen defense, abiotic cues and phytohormone signaling, yet little is known about their roles and molecular mechanism of function in response to rust diseases in wheat. We identified 100 TaWRKY sequences using the wheat Expressed Sequence Tag database, of which 22 WRKY sequences were novel. Identified proteins were characterized based on their zinc finger motifs, and phylogenetic analysis clustered them into six clades consisting of class IIc and class III WRKY proteins. Functional annotation revealed major functions in metabolic and cellular processes in control plants, and in response to stimuli, signaling and defense in pathogen-inoculated plants, their major molecular function being binding to DNA. Tag-based expression analysis of the identified genes revealed differential expression between mock and Puccinia triticina inoculated wheat near isogenic lines. Gene expression analysis was also performed with six rust-related microarray experiments from the Gene Expression Omnibus database. TaWRKY10, 15, 17 and 56 were common to both tag-based and microarray-based differential expression analyses and could represent rust-specific WRKY genes. The results provide insight into the functional characterization of WRKY transcription factors responsive to leaf rust pathogenesis, which can be used as candidate genes in molecular breeding programs to improve biotic stress tolerance in wheat.
Jeong, Joo Yeon; Lee, Dong Hoon; Kang, Sang Soo
2013-12-01
Stress affects body weight and food intake, but the underlying mechanisms are not well understood. We evaluated the changes in body weight and food intake of ICR male mice subjected to daily 2 hours restraint stress for 15 days. Hypothalamic gene expression profiling was analyzed by cDNA microarray. Daily body weight and food intake measurements revealed that both parameters decreased rapidly after initiating daily restraint stress. Body weights of stressed mice then remained significantly lower than the control body weights, even though food intake slowly recovered to 90% of the control intake at the end of the experiment. cDNA microarray analysis revealed that chronic restraint stress affects the expression of hypothalamic genes possibly related to body weight control. Since decreases of daily food intake and body weight were remarkable in days 1 to 4 of restraint, we examined the expression of food intake-related genes in the hypothalamus. During these periods, the expressions of ghrelin and pro-opiomelanocortin mRNA were significantly changed in mice undergoing restraint stress. Moreover, daily serum corticosterone levels gradually increased, while leptin levels significantly decreased. The present study demonstrates that restraint stress affects body weight and food intake by initially modifying canonical food intake-related genes and then later modifying other genes involved in energy metabolism. These genetic changes appear to be mediated, at least in part, by corticosterone.
A Java-based tool for the design of classification microarrays.
Meng, Da; Broschat, Shira L; Call, Douglas R
2008-08-04
Classification microarrays are used for purposes such as identifying strains of bacteria and determining genetic relationships to understand the epidemiology of an infectious disease. For these cases, mixed microarrays, which are composed of DNA from more than one organism, are more effective than conventional microarrays composed of DNA from a single organism. Selection of probes is a key factor in designing successful mixed microarrays because redundant sequences are inefficient and limited representation of diversity can restrict application of the microarray. We have developed a Java-based software tool, called PLASMID, for use in selecting the minimum set of probe sequences needed to classify different groups of plasmids or bacteria. The software program was successfully applied to several different sets of data. The utility of PLASMID was illustrated using existing mixed-plasmid microarray data as well as data from a virtual mixed-genome microarray constructed from different strains of Streptococcus. Moreover, use of data from expression microarray experiments demonstrated the generality of PLASMID. In this paper we describe a new software tool for selecting a set of probes for a classification microarray. While the tool was developed for the design of mixed microarrays-and mixed-plasmid microarrays in particular-it can also be used to design expression arrays. The user can choose from several clustering methods (including hierarchical, non-hierarchical, and a model-based genetic algorithm), several probe ranking methods, and several different display methods. A novel approach is used for probe redundancy reduction, and probe selection is accomplished via stepwise discriminant analysis. Data can be entered in different formats (including Excel and comma-delimited text), and dendrogram, heat map, and scatter plot images can be saved in several different formats (including jpeg and tiff). 
Weights generated using stepwise discriminant analysis can be stored for analysis of subsequent experimental data. Additionally, PLASMID can be used to construct virtual microarrays with genomes from public databases, which can then be used to identify an optimal set of probes.
EDGE3: A web-based solution for management and analysis of Agilent two color microarray experiments
Vollrath, Aaron L; Smith, Adam A; Craven, Mark; Bradfield, Christopher A
2009-01-01
Background The ability to generate transcriptional data on the scale of entire genomes has been a boon both in the improvement of biological understanding and in the amount of data generated. The latter, the amount of data generated, has implications when it comes to effective storage, analysis and sharing of these data. A number of software tools have been developed to store, analyze, and share microarray data. However, a majority of these tools do not offer all of these features nor do they specifically target the commonly used two color Agilent DNA microarray platform. Thus, the motivating factor for the development of EDGE3 was to incorporate the storage, analysis and sharing of microarray data in a manner that would provide a means for research groups to collaborate on Agilent-based microarray experiments without a large investment in software-related expenditures or extensive training of end-users. Results EDGE3 has been developed with two major functions in mind. The first function is to provide a workflow process for the generation of microarray data by a research laboratory or a microarray facility. The second is to store, analyze, and share microarray data in a manner that doesn't require complicated software. To satisfy the first function, EDGE3 has been developed as a means to establish a well defined experimental workflow and information system for microarray generation. To satisfy the second function, the software application utilized as the user interface of EDGE3 is a web browser. Within the web browser, a user is able to access the entire functionality, including, but not limited to, the ability to perform a number of bioinformatics based analyses, collaborate between research groups through a user-based security model, and access to the raw data files and quality control files generated by the software used to extract the signals from an array image. 
Conclusion Here, we present EDGE3, an open-source, web-based application that allows for the storage, analysis, and controlled sharing of transcription-based microarray data generated on the Agilent DNA platform. In addition, EDGE3 provides a means for managing RNA samples and arrays during the hybridization process. EDGE3 is freely available for download at . PMID:19732451
Vollrath, Aaron L; Smith, Adam A; Craven, Mark; Bradfield, Christopher A
2009-09-04
The ability to generate transcriptional data on the scale of entire genomes has been a boon both in the improvement of biological understanding and in the amount of data generated. The latter, the amount of data generated, has implications when it comes to effective storage, analysis and sharing of these data. A number of software tools have been developed to store, analyze, and share microarray data. However, a majority of these tools do not offer all of these features nor do they specifically target the commonly used two color Agilent DNA microarray platform. Thus, the motivating factor for the development of EDGE(3) was to incorporate the storage, analysis and sharing of microarray data in a manner that would provide a means for research groups to collaborate on Agilent-based microarray experiments without a large investment in software-related expenditures or extensive training of end-users. EDGE(3) has been developed with two major functions in mind. The first function is to provide a workflow process for the generation of microarray data by a research laboratory or a microarray facility. The second is to store, analyze, and share microarray data in a manner that doesn't require complicated software. To satisfy the first function, EDGE3 has been developed as a means to establish a well defined experimental workflow and information system for microarray generation. To satisfy the second function, the software application utilized as the user interface of EDGE(3) is a web browser. Within the web browser, a user is able to access the entire functionality, including, but not limited to, the ability to perform a number of bioinformatics based analyses, collaborate between research groups through a user-based security model, and access to the raw data files and quality control files generated by the software used to extract the signals from an array image. 
Here, we present EDGE(3), an open-source, web-based application that allows for the storage, analysis, and controlled sharing of transcription-based microarray data generated on the Agilent DNA platform. In addition, EDGE(3) provides a means for managing RNA samples and arrays during the hybridization process. EDGE(3) is freely available for download at http://edge.oncology.wisc.edu/.
Palma, Angelina S.; Liu, Yan; Zhang, Hongtao; Zhang, Yibing; McCleary, Barry V.; Yu, Guangli; Huang, Qilin; Guidolin, Leticia S.; Ciocchini, Andres E.; Torosantucci, Antonella; Wang, Denong; Carvalho, Ana Luísa; Fontes, Carlos M. G. A.; Mulloy, Barbara; Childs, Robert A.; Feizi, Ten; Chai, Wengang
2015-01-01
Glucans are polymers of d-glucose with differing linkages in linear or branched sequences. They are constituents of microbial and plant cell-walls and involved in important bio-recognition processes, including immunomodulation, anticancer activities, pathogen virulence, and plant cell-wall biodegradation. Translational possibilities for these activities in medicine and biotechnology are considerable. High-throughput micro-methods are needed to screen proteins for recognition of specific glucan sequences as a lead to structure–function studies and their exploitation. We describe construction of a “glucome” microarray, the first sequence-defined glycome-scale microarray, using a “designer” approach from targeted ligand-bearing glucans in conjunction with a novel high-sensitivity mass spectrometric sequencing method, as a screening tool to assign glucan recognition motifs. The glucome microarray comprises 153 oligosaccharide probes with high purity, representing major sequences in glucans. Negative-ion electrospray tandem mass spectrometry with collision-induced dissociation was used for complete linkage analysis of gluco-oligosaccharides in linear “homo” and “hetero” and branched sequences. The system is validated using antibodies and carbohydrate-binding modules known to target α- or β-glucans in different biological contexts, extending knowledge on their specificities, and applied to reveal new information on glucan recognition by two signaling molecules of the immune system against pathogens: Dectin-1 and DC-SIGN. The sequencing of the glucan oligosaccharides by the MS method and their interrogation on the microarrays provides detailed information on linkage, sequence and chain length requirements of glucan-recognizing proteins, and are a sensitive means of revealing unsuspected sequences in the polysaccharides. PMID:25670804
NASA Astrophysics Data System (ADS)
Kittang, Ann-Iren; Kvaløy, Brita; Winge, Per; Iversen, Tor-Henning
2010-11-01
Gene expression analysis using microarrays has proved to be an important method in life science. The opportunity to grow higher plants on the International Space Station (ISS) opens up the possibility for gene expression profiling of plants grown in microgravity. The work presented focuses on how to meet the scientific requirements of plant growth and the sample preservation, given the technical and operational constraints associated with space research. The growth chamber (Multigen-2 Science Testing Unit) and a protocol suggested to be used in the European Modular Cultivation System (EMCS) Multigen-2 experiment on the ISS to grow and later preserve Arabidopsis seedlings, were tested on ground. The results showed that most of the plants developed normally. In order to avoid high population stress the number of seedlings per growth area should be reduced. The RNAlater preservation method to be used in the space experiment was compared with a quick freeze in Liquid Nitrogen (LN2). The RNA from samples preserved in RNAlater at room temperature for 24 h was slightly more degraded than the RNA from the LN2 preserved samples (RNA integrity number, RIN: 7.7 and 8.6, respectively). However, the RNA quality and quantity was satisfactory for microarray analysis. Of the genes analysed, 74 genes (0.28%) were significantly differentially expressed, most of them showing moderate to low regulation. Among the genes induced in the RNAlater preserved samples, three salt inducible transcription factors (ZAT10, SZF1 and SZF2) were identified, suggesting that the high salt concentration in RNAlater causes salt stress before the transcription stopped. In conclusion, the Multigen-2 preservation protocol tested here will allow for the genes regulated by microgravity in the space experiment to be revealed. The results do indicate that not all the biological processes are stopped instantly by the RNAlater. 
The limited diffusion indirectly caused by the microgravity may potentially result in a different degree of salt stress in the test compared to the 1 × g control during the space experiment. This has to be accounted for during the evaluation of the results. Since slightly degraded RNA was observed, further optimisation of the preservation protocol will be performed.
Plant-pathogen interactions: what microarray tells about it?
Lodha, T D; Basak, J
2012-01-01
Plant defense responses are mediated by elementary regulatory proteins that affect expression of thousands of genes. Over the last decade, microarray technology has played a key role in deciphering the underlying networks of gene regulation in plants that lead to a wide variety of defence responses. Microarray is an important tool to quantify and profile the expression of thousands of genes simultaneously, with two main aims: (1) gene discovery and (2) global expression profiling. Several microarray technologies are currently in use; most include a glass slide platform with spotted cDNA or oligonucleotides. To date, microarray technology has been used to identify regulatory genes and end-point defence genes, and to understand the signal transduction processes underlying disease resistance and its intimate links to other physiological pathways. Microarray technology can be used for in-depth, simultaneous profiling of host/pathogen genes as the disease progresses from infection to resistance/susceptibility at different developmental stages of the host, which can be done in different environments, for clearer understanding of the processes involved. A thorough knowledge of plant disease resistance, gained by successfully combining microarray and other high-throughput techniques with biochemical, genetic, and cell biological experiments, is needed for practical application to secure and stabilize the yield of many crop plants. This review starts with a brief introduction to microarray technology, followed by the basics of plant-pathogen interaction, the use of DNA microarrays over the last decade to unravel the mysteries of plant-pathogen interaction, and ends with the future prospects of this technology.
NRF2-regulated metabolic gene signature as a prognostic biomarker in non-small cell lung cancer
Namani, Akhileshwar; Cui, Qin Qin; Wu, Yihe; Wang, Hongyan; Wang, Xiu Jun; Tang, Xiuwen
2017-01-01
Mutations in Kelch-like ECH-associated protein 1 (KEAP1) cause the aberrant activation of nuclear factor erythroid-derived 2-like 2 (NRF2), which leads to oncogenesis and drug resistance in lung cancer cells. Our study was designed to identify the genes involved in lung cancer progression targeted by NRF2. A series of microarray experiments in normal and cancer cells, as well as in animal models, have revealed regulatory genes downstream of NRF2 that are involved in wide variety of pathways. Specifically, we carried out individual and combinatorial microarray analysis of KEAP1 overexpression and NRF2 siRNA-knockdown in a KEAP1 mutant-A549 non-small cell lung cancer (NSCLC) cell line. As a result, we identified a list of genes which were mainly involved in metabolic functions in NSCLC by using functional annotation analysis. In addition, we carried out in silico analysis to characterize the antioxidant responsive element sequences in the promoter regions of known and putative NRF2-regulated metabolic genes. We further identified an NRF2-regulated metabolic gene signature (NRMGS) by correlating the microarray data with lung adenocarcinoma RNA-Seq gene expression data from The Cancer Genome Atlas followed by qRT-PCR validation, and finally showed that higher expression of the signature conferred a poor prognosis in 8 independent NSCLC cohorts. Our findings provide novel prognostic biomarkers for NSCLC. PMID:29050246
A microarray for assessing transcription from pelagic marine microbial taxa
Shilova, Irina N; Robidart, Julie C; James Tripp, H; Turk-Kubo, Kendra; Wawrik, Boris; Post, Anton F; Thompson, Anne W; Ward, Bess; Hollibaugh, James T; Millard, Andy; Ostrowski, Martin; J Scanlan, David; Paerl, Ryan W; Stuart, Rhona; Zehr, Jonathan P
2014-01-01
Metagenomic approaches have revealed unprecedented genetic diversity within microbial communities across vast expanses of the world's oceans. Linking this genetic diversity with key metabolic and cellular activities of microbial assemblages is a fundamental challenge. Here we report on a collaborative effort to design MicroTOOLs (Microbiological Targets for Ocean Observing Laboratories), a high-density oligonucleotide microarray that targets functional genes of diverse taxa in pelagic and coastal marine microbial communities. MicroTOOLs integrates nucleotide sequence information from disparate data types: genomes, PCR-amplicons, metagenomes, and metatranscriptomes. It targets 19 400 unique sequences over 145 different genes that are relevant to stress responses and microbial metabolism across the three domains of life and viruses. MicroTOOLs was used in a proof-of-concept experiment that compared the functional responses of microbial communities following Fe and P enrichments of surface water samples from the North Pacific Subtropical Gyre. We detected transcription of 68% of the gene targets across major taxonomic groups, and the pattern of transcription indicated relief from Fe limitation and transition to N limitation in some taxa. Prochlorococcus (eHLI), Synechococcus (sub-cluster 5.3) and Alphaproteobacteria SAR11 clade (HIMB59) showed the strongest responses to the Fe enrichment. In addition, members of uncharacterized lineages also responded. The MicroTOOLs microarray provides a robust tool for comprehensive characterization of major functional groups of microbes in the open ocean, and the design can be easily amended for specific environments and research questions. PMID:24477198
Timmerman, Peter; Barderas, Rodrigo; Desmet, Johan; Altschuh, Danièle; Shochat, Susana; Hollestelle, Martine J.; Höppener, Jo W. M.; Monasterio, Alberto; Casal, J. Ignacio; Meloen, Rob H.
2009-12-04
The great success of therapeutic monoclonal antibodies has fueled research toward mimicry of their binding sites and the development of new strategies for peptide-based mimetics production. Here, we describe a new combinatorial approach for the production of peptidomimetics using the complementarity-determining regions (CDRs) from gastrin17 (pyroEGPWLEEEEEAYGWMDF-NH2) antibodies as starting material for cyclic peptide synthesis in a microarray format. Gastrin17 is a trophic factor in gastrointestinal tumors, including pancreatic cancer, which makes it an interesting target for development of therapeutic antibodies. Screening of microarrays containing bicyclic peptidomimetics identified a high number of gastrin binders. A strong correlation was observed between gastrin binding and overall charge of the peptidomimetic. Most of the best gastrin binders proceeded from CDRs containing charged residues. In contrast, CDRs from high affinity antibodies containing mostly neutral residues failed to yield good binders. Our experiments revealed essential differences in the mode of antigen binding between CDR-derived peptidomimetics (Kd values in micromolar range) and the parental monoclonal antibodies (Kd values in nanomolar range). However, chemically derived peptidomimetics from gastrin binders were very effective in gastrin neutralization studies using cell-based assays, yielding a neutralizing activity in pancreatic tumoral cell lines comparable with that of gastrin-specific monoclonal antibodies. These data support the use of combinatorial CDR-peptide microarrays as a tool for the development of a new generation of chemically synthesized cyclic peptidomimetics with functional activity. PMID:19808684
Akkiprik, Mustafa; Peker, İrem; Özmen, Tolga; Amuran, Gökçe Güllü; Güllüoğlu, Bahadır M; Kaya, Handan; Özer, Ayşe
2015-11-10
IGFBP5 is an important regulatory protein in breast cancer progression. We sought to identify differentially expressed genes (DEGs) between breast tumor tissues with IGFBP5 overexpression and their adjacent normal tissues. In this study, thirty-eight breast cancer and adjacent normal breast tissue samples were used to determine IGFBP5 expression by qPCR. cDNA microarrays were applied to the tumor samples with the highest IGFBP5 overexpression and compared to their adjacent normal breast tissue. Microarray analysis revealed that a total of 186 genes were differentially expressed in breast cancer compared with normal breast tissues. Of the 186 genes, 169 genes were downregulated and 17 genes were upregulated in the tumor samples. KEGG pathway analyses showed that protein digestion and absorption, focal adhesion, salivary secretion, drug metabolism-cytochrome P450, and phenylalanine metabolism pathways are involved. Among these DEGs, the top two genes (MMP11 and COL1A1) that potentially correlated with IGFBP5 were selected for validation using real time RT-qPCR. Only COL1A1 expression showed a consistent upregulation with IGFBP5 expression, and COL1A1 and MMP11 were significantly positively correlated. We concluded that the discovery of coordinately expressed genes related to IGFBP5 might contribute to understanding of the molecular mechanism of the function of IGFBP5 in breast cancer. Further functional studies on DEGs and their association with IGFBP5 may identify novel biomarkers for clinical applications in breast cancer.
[Typing and subtyping avian influenza virus using DNA microarrays].
Yang, Zhongping; Wang, Xiurong; Tian, Lina; Wang, Yu; Chen, Hualan
2008-07-01
Outbreaks of highly pathogenic avian influenza (HPAI) virus have caused great economic loss to the poultry industry and have resulted in human deaths in Thailand and Vietnam since 2004. Rapid typing and subtyping of viruses, especially HPAI from clinical specimens, are desirable for taking prompt control measures to prevent the disease from spreading. We describe a microarray-based approach to simultaneously detect and subtype avian influenza virus (AIV). We designed primers for the probe genes and used reverse transcriptase PCR to prepare cDNAs of the AIV M gene, the haemagglutinin genes of subtypes H5, H7 and H9, and the neuraminidase genes of subtypes N1 and N2. These were cloned, sequenced, reamplified and spotted onto glass slides to form microarrays. We labeled samples with Cy3-dUTP by RT-PCR, then hybridized and scanned the microarrays to type and subtype AIV. The hybridization pattern agreed perfectly with the known grid location of each probe, and no cross-hybridization could be detected. Examination of HA subtypes 1 through 15, 30 infected samples and 21 field samples revealed that the DNA microarray assay was more sensitive and specific than the RT-PCR test and chicken embryo inoculation. It can simultaneously detect and differentiate the main epidemic AIVs. The results show that DNA microarray technology is a useful diagnostic method.
ERIC Educational Resources Information Center
Bradford, William D.; Cahoon, Laty; Freel, Sara R.; Hoopes, Laura L. Mays; Eckdahl, Todd T.
2005-01-01
In order to engage their students in a core methodology of the new genomics era, an ever-increasing number of faculty at primarily undergraduate institutions are gaining access to microarray technology. Their students are conducting successful microarray experiments designed to address a variety of interesting questions. A next step in these…
Gupta, Surya; De Puysseleyr, Veronic; Van der Heyden, José; Maddelein, Davy; Lemmens, Irma; Lievens, Sam; Degroeve, Sven; Tavernier, Jan; Martens, Lennart
2017-05-01
Protein-protein interaction (PPI) studies have dramatically expanded our knowledge about cellular behaviour and development in different conditions. A multitude of high-throughput PPI techniques have been developed to achieve proteome-scale coverage for PPI studies, including the microarray based Mammalian Protein-Protein Interaction Trap (MAPPIT) system. Because such high-throughput techniques typically report thousands of interactions, managing and analysing the large amounts of acquired data is a challenge. We have therefore built the MAPPIT cell microArray Protein Protein Interaction-Data management & Analysis Tool (MAPPI-DAT) as an automated data management and analysis tool for MAPPIT cell microarray experiments. MAPPI-DAT stores the experimental data and metadata in a systematic and structured way, automates data analysis and interpretation, and enables the meta-analysis of MAPPIT cell microarray data across all stored experiments. MAPPI-DAT is developed in Python, using R for data analysis and MySQL as data management system. MAPPI-DAT is cross-platform and can be run on Microsoft Windows, Linux and OS X/macOS. The source code and a Microsoft Windows executable are freely available under the permissive Apache2 open source license at https://github.com/compomics/MAPPI-DAT. jan.tavernier@vib-ugent.be or lennart.martens@vib-ugent.be. Supplementary data are available at Bioinformatics online. © The Author 2017. Published by Oxford University Press.
Thormar, Hans G; Gudmundsson, Bjarki; Eiriksdottir, Freyja; Kil, Siyoen; Gunnarsson, Gudmundur H; Magnusson, Magnus Karl; Hsu, Jason C; Jonsson, Jon J
2013-04-01
The causes of imprecision in microarray expression analysis are poorly understood, limiting the use of this technology in molecular diagnostics. Two-dimensional strandness-dependent electrophoresis (2D-SDE) separates nucleic acid molecules on the basis of length and strandness, i.e., double-stranded DNA (dsDNA), single-stranded DNA (ssDNA), and RNA·DNA hybrids. We used 2D-SDE to measure the efficiency of cDNA synthesis and its importance for the imprecision of an in vitro transcription-based microarray expression analysis. The relative amount of double-stranded cDNA formed in replicate experiments that used the same RNA sample template was highly variable, ranging between 0% and 72% of the total DNA. Microarray experiments showed an inverse relationship between the difference between sample pairs in probe variance and the relative amount of dsDNA. Approximately 15% of probes showed between-sample variation (P < 0.05) when the dsDNA percentage was between 12% and 35%. In contrast, only 3% of probes showed between-sample variation when the dsDNA percentage was 69% and 72%. Replication experiments of the 35% dsDNA and 72% dsDNA samples were used to separate sample variation from probe replication variation. The estimated SD of the sample-to-sample variation and of the probe replicates was lower in 72% dsDNA samples than in 35% dsDNA samples. Variation in the relative amount of double-stranded cDNA synthesized can be an important component of the imprecision in T7 RNA polymerase-based microarray expression analysis. © 2013 American Association for Clinical Chemistry
An object model and database for functional genomics.
Jones, Andrew; Hunt, Ela; Wastling, Jonathan M; Pizarro, Angel; Stoeckert, Christian J
2004-07-10
Large-scale functional genomics analysis is now feasible and presents significant challenges in data analysis, storage and querying. Data standards are required to enable the development of public data repositories and to improve data sharing. There is an established data format for microarrays (microarray gene expression markup language, MAGE-ML) and a draft standard for proteomics (PEDRo). We believe that all types of functional genomics experiments should be annotated in a consistent manner, and we hope to open up new ways of comparing multiple datasets used in functional genomics. We have created a functional genomics experiment object model (FGE-OM), developed from the microarray model, MAGE-OM and two models for proteomics, PEDRo and our own model (Gla-PSI-Glasgow Proposal for the Proteomics Standards Initiative). FGE-OM comprises three namespaces representing (i) the parts of the model common to all functional genomics experiments; (ii) microarray-specific components; and (iii) proteomics-specific components. We believe that FGE-OM should initiate discussion about the contents and structure of the next version of MAGE and the future of proteomics standards. A prototype database called RNA And Protein Abundance Database (RAPAD), based on FGE-OM, has been implemented and populated with data from microbial pathogenesis. FGE-OM and the RAPAD schema are available from http://www.gusdb.org/fge.html, along with a set of more detailed diagrams. RAPAD can be accessed by registration at the site.
Deutsch, Eric W; Ball, Catherine A; Berman, Jules J; Bova, G Steven; Brazma, Alvis; Bumgarner, Roger E; Campbell, David; Causton, Helen C; Christiansen, Jeffrey H; Daian, Fabrice; Dauga, Delphine; Davidson, Duncan R; Gimenez, Gregory; Goo, Young Ah; Grimmond, Sean; Henrich, Thorsten; Herrmann, Bernhard G; Johnson, Michael H; Korb, Martin; Mills, Jason C; Oudes, Asa J; Parkinson, Helen E; Pascal, Laura E; Pollet, Nicolas; Quackenbush, John; Ramialison, Mirana; Ringwald, Martin; Salgado, David; Sansone, Susanna-Assunta; Sherlock, Gavin; Stoeckert, Christian J; Swedlow, Jason; Taylor, Ronald C; Walashek, Laura; Warford, Anthony; Wilkinson, David G; Zhou, Yi; Zon, Leonard I; Liu, Alvin Y; True, Lawrence D
2008-03-01
One purpose of the biomedical literature is to report results in sufficient detail that the methods of data collection and analysis can be independently replicated and verified. Here we present reporting guidelines for gene expression localization experiments: the minimum information specification for in situ hybridization and immunohistochemistry experiments (MISFISHIE). MISFISHIE is modeled after the Minimum Information About a Microarray Experiment (MIAME) specification for microarray experiments. Both guidelines define what information should be reported without dictating a format for encoding that information. MISFISHIE describes six types of information to be provided for each experiment: experimental design, biomaterials and treatments, reporters, staining, imaging data and image characterizations. This specification has benefited the consortium within which it was developed and is expected to benefit the wider research community. We welcome feedback from the scientific community to help improve our proposal.
Meade, Kieran G; Gormley, Eamonn; Park, Stephen D E; Fitzsimons, Tara; Rosa, Guilherme J M; Costello, Eamon; Keane, Joseph; Coussens, Paul M; MacHugh, David E
2006-09-15
Microarray analysis of messenger RNA (mRNA) abundance was used to investigate the gene expression program of peripheral blood mononuclear cells (PBMC) from cattle infected with Mycobacterium bovis, the causative agent of bovine tuberculosis. An immunospecific bovine microarray platform (BOTL-4) with spot features representing 1336 genes was used for transcriptional profiling of PBMC from six M. bovis-infected cattle stimulated in vitro with bovine purified protein derivative of tuberculin (PPD-bovine). Cells were harvested at four time points (3 h, 6 h, 12 h and 24 h post-stimulation) and a split-plot design with pooled samples was used for the microarray experiment to compare gene expression between PPD-bovine stimulated PBMC and unstimulated controls for each time point. Statistical analyses of these data revealed 224 genes (approximately 17% of transcripts on the array) differentially expressed between stimulated and unstimulated PBMC across the 24 h time course (P<0.05). Of the 224 genes, 87 genes were significantly upregulated and 137 genes were significantly downregulated in M. bovis-infected PBMC stimulated with PPD-bovine across the 24 h time course. However, perturbation of the PBMC transcriptome was most apparent at time points 3 h and 12 h post-stimulation, with 81 and 84 genes differentially expressed, respectively. In addition, a more stringent statistical threshold (P<0.01) revealed 35 genes (approximately 3%) that were differentially expressed across the time course. Real-time quantitative reverse transcription PCR (qRT-PCR) of selected genes validated the microarray results and demonstrated a wide range of differentially expressed genes in PPD-bovine-, PPD-avian- and Concanavalin A (ConA) stimulated PBMC, including the interferon-gamma gene (IFNG), which was upregulated in PBMC stimulated with PPD-bovine (40-fold), PPD-avian (10-fold) and ConA (8-fold) after in vitro culture for 12 h. 
The pattern of expression of these genes in PPD-bovine stimulated PBMC provides the first description of an M. bovis-specific signature of infection that may provide insights into the molecular basis of the host response to infection. Although the present study was carried out with mixed PBMC cell populations, it will guide future studies to dissect immune cell-specific gene expression patterns in response to M. bovis infection.
Stochastic models for inferring genetic regulation from microarray gene expression data.
Tian, Tianhai
2010-03-01
Microarray expression profiles are inherently noisy and many different sources of variation exist in microarray experiments. It is still a significant challenge to develop stochastic models to realize noise in microarray expression profiles, which has profound influence on the reverse engineering of genetic regulation. Using the target genes of the tumour suppressor gene p53 as the test problem, we developed stochastic differential equation models and established the relationship between the noise strength of stochastic models and parameters of an error model for describing the distribution of the microarray measurements. Numerical results indicate that the simulated variance from stochastic models with a stochastic degradation process can be represented by a monomial in terms of the hybridization intensity and the order of the monomial depends on the type of stochastic process. The developed stochastic models with multiple stochastic processes generated simulations whose variance is consistent with the prediction of the error model. This work also established a general method to develop stochastic models from experimental information. 2009 Elsevier Ireland Ltd. All rights reserved.
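The claim that a stochastic degradation term yields a variance that is a monomial in the intensity can be illustrated with a textbook toy model. The sketch below is not the p53 model from the paper: it assumes constant production k, first-order degradation d, and multiplicative noise of strength sigma (all hypothetical names), i.e. dx = (k - d*x) dt + sigma*x dW in Ito form. For that assumed model the stationary variance works out to sigma^2 * mean^2 / (2d - sigma^2), a monomial of order 2 in the mean level.

```python
def stationary_moments(k, d, sigma, dt=0.01, steps=5000):
    """Toy model (an assumption, not the paper's):
        dx = (k - d*x) dt + sigma * x dW     (Ito form)
    Integrate the equations for E[x] and E[x^2] with forward Euler
    until they settle, then return (mean, variance)."""
    if sigma ** 2 >= 2 * d:
        raise ValueError("second moment diverges unless sigma^2 < 2*d")
    m = 0.0  # first moment  E[x]
    s = 0.0  # second moment E[x^2]
    for _ in range(steps):
        dm = k - d * m                               # d E[x] / dt
        ds = 2 * k * m - (2 * d - sigma ** 2) * s    # d E[x^2] / dt (Ito's lemma)
        m += dt * dm
        s += dt * ds
    return m, s - m * m


# Raising the production rate raises the mean; the variance scales with mean^2.
for k in (10.0, 100.0):
    mean, var = stationary_moments(k, d=1.0, sigma=0.5)
    print(f"mean={mean:.3f}  variance/mean^2={var / mean ** 2:.6f}")
```

Under these assumptions the ratio variance/mean^2 stays the same across production rates, which is what "variance is a monomial of order 2 in the intensity" means here. A different noise type (for example additive noise) would change the order, in line with the abstract's remark that the order depends on the type of stochastic process.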
Emerging Use of Gene Expression Microarrays in Plant Physiology
Wullschleger, Stan D.; Difazio, Stephen P.
2003-01-01
Microarrays have become an important technology for the global analysis of gene expression in humans, animals, plants, and microbes. Implemented in the context of a well-designed experiment, cDNA and oligonucleotide arrays can provide high-throughput, simultaneous analysis of transcript abundance for hundreds, if not thousands, of genes. However, despite widespread acceptance, the use of microarrays as a tool to better understand processes of interest to the plant physiologist is still being explored. To help illustrate current uses of microarrays in the plant sciences, several case studies that we believe demonstrate the emerging application of gene expression arrays in plant physiology were selected from among the many posters and presentations at the 2003 Plant and Animal Genome XI Conference. Based on this survey, microarrays are being used to assess gene expression in plants exposed to the experimental manipulation of air temperature, soil water content and aluminium concentration in the root zone. Analysis often includes characterizing transcript profiles for multiple post-treatment sampling periods and categorizing genes with common patterns of response using hierarchical clustering techniques. In addition, microarrays are also providing insights into developmental changes in gene expression associated with fibre and root elongation in cotton and maize, respectively. Technical and analytical limitations of microarrays are discussed and projects attempting to advance areas of microarray design and data analysis are highlighted. Finally, although much work remains, we conclude that microarrays are a valuable tool for the plant physiologist interested in the characterization and identification of individual genes and gene families with potential application in the fields of agriculture, horticulture and forestry.
Immunological Targeting of Tumor Initiating Prostate Cancer Cells
2014-10-01
clinically using well-accepted immuno-competent animal models. 2) Keywords: Prostate Cancer, Lymphocyte, Vaccine, Antibody 3) Overall Project Summary...castrate animals . Task 1: Identify and verify antigenic targets from CAstrate Resistant Luminal Epithelial Cells (CRLEC) (months 1-16... animals per group will be processed to derive sufficient RNA for microarray analysis; the experiment will be repeated x 3. Microarray analysis will
Circular RNA Expression Profile of Pancreatic Ductal Adenocarcinoma Revealed by Microarray.
Li, Haimin; Hao, Xiaokun; Wang, Huimin; Liu, Zhengcai; He, Yong; Pu, Meng; Zhang, Hongtao; Yu, Hengchao; Duan, Juanli; Qu, Shibin
2016-01-01
Circular RNAs (circRNAs) are a novel type of stable, diverse and conserved noncoding RNA in mammalian cells. CircRNAs have been reported to be widely involved in physiological and pathological processes, particularly in cancer. However, it is unclear whether circRNAs are specifically involved in pancreatic ductal adenocarcinoma (PDAC). We investigated the expression profile of circRNAs in six PDAC cancer samples and paired adjacent normal tissues using microarray. A high-throughput circRNA microarray was used to identify dysregulated circular RNAs in six PDAC patients. Bioinformatic analyses were applied to study these differentially expressed circRNAs. Furthermore, quantitative reverse transcription polymerase chain reaction (qRT-PCR) was performed to confirm these results. We revealed and confirmed that a number of circRNAs were dysregulated, which suggests a potential role in pancreatic cancer. This study demonstrates that clusters of circRNAs are aberrantly expressed in PDAC compared with normal samples and provides new potential targets for the future treatment of PDAC and novel insights into PDAC biology. © 2016 The Author(s) Published by S. Karger AG, Basel.
Di Giannatale, Elisabetta; Di Serafino, Gabriella; Zilli, Katiuscia; Alessiani, Alessandra; Sacchini, Lorena; Garofolo, Giuliano; Aprea, Giuseppe; Marotta, Francesca
2014-02-19
Campylobacter has developed resistance to several antimicrobial agents over the years, including macrolides, quinolones and fluoroquinolones, becoming a significant public health hazard. A total of 145 strains derived from raw milk, chicken faeces, chicken carcasses, cattle faeces and human faeces collected from various Italian regions, were screened for antimicrobial susceptibility, molecular characterization (SmaI pulsed-field gel electrophoresis) and detection of virulence genes (sequencing and DNA microarray analysis). The prevalence of C. jejuni and C. coli was 62.75% and 37.24% respectively. Antimicrobial susceptibility revealed a high level of resistance for ciprofloxacin (62.76%), tetracycline (55.86%) and nalidixic acid (55.17%). Genotyping of Campylobacter isolates using PFGE revealed a total of 86 unique SmaI patterns. Virulence gene profiles were determined using a new microbial diagnostic microarray composed of 70-mer oligonucleotide probes targeting genes implicated in Campylobacter pathogenicity. Correspondence between PFGE and microarray clusters was observed. Comparisons of PFGE and virulence profiles reflected the high genetic diversity of the strains examined, leading us to speculate different degrees of pathogenicity inside Campylobacter populations. PMID:24556669
Sayanjali, Behnam; Christensen, Gitte J M; Al-Zeer, Munir A; Mollenkopf, Hans-Joachim; Meyer, Thomas F; Brüggemann, Holger
2016-11-01
Propionibacterium acnes has been detected in diseased human prostate tissue, and cell culture experiments suggest that the bacterium can establish a low-grade inflammation. Here, we investigated its impact on human primary prostate epithelial cells. Microarray analysis confirmed the inflammation-inducing capability of P. acnes but also showed deregulation of genes involved in the cell cycle. qPCR experiments showed that viable P. acnes downregulates a master regulator of cell cycle progression, FOXM1. Flow cytometry experiments revealed that P. acnes increases the number of cells in S-phase. We tested the hypothesis that a P. acnes-produced berninamycin-like thiopeptide is responsible for this effect, since it is related to the FOXM1 inhibitor siomycin. The thiopeptide biosynthesis gene cluster was strongly expressed; it is present in subtype IB of P. acnes, but absent from type IA, which is most abundant on human skin. A knock-out mutant lacking the gene encoding the berninamycin-like peptide precursor was unable to downregulate FOXM1 and to halt the cell cycle. Our study reveals a novel host cell-interacting activity of P. acnes. Copyright © 2016 The Authors. Published by Elsevier GmbH. All rights reserved.
Li, Lingyun; Li, Qingbo; Rohlin, Lars; Kim, UnMi; Salmon, Kirsty; Rejtar, Tomas; Gunsalus, Robert P.; Karger, Barry L.; Ferry, James G.
2008-01-01
Summary: Methanosarcina acetivorans strain C2A is an acetate- and methanol-utilizing methane-producing organism for which the genome, the largest yet sequenced among the Archaea, reveals extensive physiological diversity. LC linear ion trap-FTICR mass spectrometry was employed to analyze acetate- vs. methanol-grown cells metabolically labeled with 14N vs. 15N, respectively, to obtain quantitative protein abundance ratios. DNA microarray analyses of acetate- vs. methanol-grown cells were also performed to determine gene expression ratios. The combined approaches were highly complementary, extending the physiological understanding of growth and methanogenesis. Of the 1081 proteins detected, 255 were ≥ 3-fold differentially abundant. DNA microarray analysis revealed 410 genes that were ≥ 2.5-fold differentially expressed of 1972 genes with detected expression. The ratios of differentially abundant proteins were in good agreement with expression ratios of the encoding genes. Taken together, the results suggest several novel roles for electron transport components specific to acetate-grown cells, including two flavodoxins each specific for growth on acetate or methanol. Protein abundance ratios indicated that duplicate CO dehydrogenase/acetyl-CoA complexes function in the conversion of acetate to methane. Surprisingly, the protein abundance and gene expression ratios indicated a general stress response in acetate- vs. methanol-grown cells that included enzymes specific for polyphosphate accumulation and oxidative stress. The microarray analysis identified transcripts of several genes encoding regulatory proteins with identity to the PhoU, MarR, GlnK, and TetR families commonly found in the Bacteria domain. An analysis of neighboring genes suggested roles in controlling phosphate metabolism (PhoU), ammonia assimilation (GlnK), and molybdopterin cofactor biosynthesis (TetR).
Finally, the proteomic and microarray results suggested roles for two-component regulatory systems specific for each growth substrate. PMID:17269732
Microarray profiling of human white adipose tissue after exogenous leptin injection.
Taleb, S; Van Haaften, R; Henegar, C; Hukshorn, C; Cancello, R; Pelloux, V; Hanczar, B; Viguerie, N; Langin, D; Evelo, C; Zucker, J; Clément, K; Saris, W H M
2006-03-01
Leptin is a secreted adipocyte hormone that plays a key role in the regulation of body weight homeostasis. The leptin effect on human white adipose tissue (WAT) is still debated. The aim of this study was to assess whether the administration of polyethylene glycol-leptin (PEG-OB) in a single supraphysiological dose has transcriptional effects on genes of WAT and to identify its target genes and functional pathways in WAT. Blood samples and WAT biopsies were obtained from 10 healthy nonobese men before treatment and 72 h after the PEG-OB injection, leading to an approximate 809-fold increase in circulating leptin. The WAT gene expression profile before and after the PEG-OB injection was compared using pangenomic microarrays. Functional gene annotations based on the gene ontology of the PEG-OB regulated genes were performed using both an 'in house' automated procedure and GenMAPP (Gene Microarray Pathway Profiler), designed for viewing and analyzing gene expression data in the context of biological pathways. Statistical analysis of microarray data revealed that PEG-OB had a predominantly down-regulating effect on WAT gene expression, as we obtained 1,822 down-regulated and 100 up-regulated genes. Microarray data were validated using reverse transcription quantitative PCR. Functional gene annotations of PEG-OB regulated genes revealed that the functional class related to immunity and inflammation was among the most mobilized PEG-OB pathways in WAT. These genes are mainly expressed in the cells of the stroma vascular fraction rather than in adipocytes. Our observations support the hypothesis that leptin could act on WAT, particularly on genes related to inflammation and immunity, which may suggest a novel leptin target pathway in human WAT.
Reuse of imputed data in microarray analysis increases imputation efficiency
Kim, Ki-Yeol; Kim, Byoung-Jin; Yi, Gwan-Su
2004-01-01
Background The imputation of missing values is necessary for the efficient use of DNA microarray data, because many clustering algorithms and some statistical analyses require a complete data set. A few imputation methods for DNA microarray data have been introduced, but the efficiency of the methods was low and the validity of imputed values in these methods had not been fully checked. Results We developed a new cluster-based imputation method called the sequential K-nearest neighbor (SKNN) method. This imputes the missing values sequentially, starting from the gene with the fewest missing values, and reuses the imputed values in later imputations. Although it uses the imputed values, the efficiency of this new method is greatly improved in its accuracy and computational complexity over the conventional KNN-based method and other methods based on maximum likelihood estimation. The performance of SKNN was in particular higher than other imputation methods for data with high missing rates and a large number of experiments. Application of Expectation Maximization (EM) to the SKNN method improved the accuracy, but increased computational time proportional to the number of iterations. The Multiple Imputation (MI) method, which is well known but not applied previously to microarray data, showed accuracy similar to that of the SKNN method, with slightly higher dependency on the types of data sets. Conclusions Sequential reuse of imputed data in KNN-based imputation greatly increases the efficiency of imputation. The SKNN method should be practically useful for rescuing the data of microarray experiments that have many missing entries. The SKNN method generates reliable imputed values which can be used for further cluster-based analysis of microarray data. PMID:15504240
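The sequential idea described in this abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function name `sknn_impute`, the Euclidean distance over observed columns, and the inverse-distance weighting of neighbors are assumptions made for illustration only. The sketch orders genes by their number of missing values, imputes each from its k nearest genes in the complete set, and then adds the imputed gene to that set so later imputations can reuse it.

```python
import numpy as np


def sknn_impute(data, k=3):
    """Sequential KNN imputation sketch (rows = genes, columns = experiments).

    Hypothetical helper: distance metric and weighting are assumptions,
    not details taken from the abstract.
    """
    x = np.array(data, dtype=float)
    missing = np.isnan(x)
    n_missing = missing.sum(axis=1)

    # Reference set: genes that have no missing values at the start.
    complete = [i for i in range(x.shape[0]) if n_missing[i] == 0]
    if not complete:
        raise ValueError("at least one gene without missing values is required")

    # Genes with missing values, fewest missing first (stable for ties).
    order = np.argsort(n_missing, kind="stable")
    incomplete = [int(i) for i in order if n_missing[i] > 0]

    for i in incomplete:
        obs = ~missing[i]
        ref = x[complete]
        # Euclidean distance to every reference gene over the observed columns.
        dist = np.sqrt(((ref[:, obs] - x[i, obs]) ** 2).sum(axis=1))
        nearest = np.argsort(dist, kind="stable")[:k]
        # Inverse-distance weights (assumed); epsilon avoids division by zero.
        weights = 1.0 / (dist[nearest] + 1e-12)
        weights /= weights.sum()
        x[i, ~obs] = weights @ ref[nearest][:, ~obs]
        # Sequential step: the imputed gene joins the reference set.
        complete.append(i)
    return x
```

Appending each imputed gene to the reference set is the only difference from a conventional KNN imputer in this sketch; removing that line would restrict neighbors to genes that were complete from the start.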
Chromosomal Microarray versus Karyotyping for Prenatal Diagnosis
Wapner, Ronald J.; Martin, Christa Lese; Levy, Brynn; Ballif, Blake C.; Eng, Christine M.; Zachary, Julia M.; Savage, Melissa; Platt, Lawrence D.; Saltzman, Daniel; Grobman, William A.; Klugman, Susan; Scholl, Thomas; Simpson, Joe Leigh; McCall, Kimberly; Aggarwal, Vimla S.; Bunke, Brian; Nahum, Odelia; Patel, Ankita; Lamb, Allen N.; Thom, Elizabeth A.; Beaudet, Arthur L.; Ledbetter, David H.; Shaffer, Lisa G.; Jackson, Laird
2013-01-01
Background Chromosomal microarray analysis has emerged as a primary diagnostic tool for the evaluation of developmental delay and structural malformations in children. We aimed to evaluate the accuracy, efficacy, and incremental yield of chromosomal microarray analysis as compared with karyotyping for routine prenatal diagnosis. Methods Samples from women undergoing prenatal diagnosis at 29 centers were sent to a central karyotyping laboratory. Each sample was split in two; standard karyotyping was performed on one portion and the other was sent to one of four laboratories for chromosomal microarray. Results We enrolled a total of 4406 women. Indications for prenatal diagnosis were advanced maternal age (46.6%), abnormal result on Down’s syndrome screening (18.8%), structural anomalies on ultrasonography (25.2%), and other indications (9.4%). In 4340 (98.8%) of the fetal samples, microarray analysis was successful; 87.9% of samples could be used without tissue culture. Microarray analysis of the 4282 nonmosaic samples identified all the aneuploidies and unbalanced rearrangements identified on karyotyping but did not identify balanced translocations and fetal triploidy. In samples with a normal karyotype, microarray analysis revealed clinically relevant deletions or duplications in 6.0% with a structural anomaly and in 1.7% of those whose indications were advanced maternal age or positive screening results. Conclusions In the context of prenatal diagnostic testing, chromosomal microarray analysis identified additional, clinically significant cytogenetic information as compared with karyotyping and was equally efficacious in identifying aneuploidies and unbalanced rearrangements but did not identify balanced translocations and triploidies. (Funded by the Eunice Kennedy Shriver National Institute of Child Health and Human Development and others; ClinicalTrials.gov number, NCT01279733.) PMID:23215555
Geue, Lutz; Stieber, Bettina; Monecke, Stefan; Engelmann, Ines; Gunzer, Florian; Slickers, Peter; Braun, Sascha D; Ehricht, Ralf
2014-08-01
In this study, we developed a new rapid, economic, and automated microarray-based genotyping test for the standardized subtyping of Shiga toxins 1 and 2 of Escherichia coli. The microarrays from Alere Technologies can be used in two different formats, the ArrayTube and the ArrayStrip (which enables high-throughput testing in a 96-well format). One microarray chip harbors all the gene sequences necessary to distinguish between all Stx subtypes, facilitating the identification of single and multiple subtypes within a single isolate in one experiment. Specific software was developed to automatically analyze all data obtained from the microarray. The assay was validated with 21 Shiga toxin-producing E. coli (STEC) reference strains that were previously tested by the complete set of conventional subtyping PCRs. The microarray results showed 100% concordance with the PCR results. Essentially identical results were detected when the standard DNA extraction method was replaced by a time-saving heat lysis protocol. For further validation of the microarray, we identified the Stx subtypes or combinations of the subtypes in 446 STEC field isolates of human and animal origin. In summary, this oligonucleotide array represents an excellent diagnostic tool that provides some advantages over standard PCR-based subtyping. The number of the spotted probes on the microarrays can be increased by additional probes, such as for novel alleles, species markers, or resistance genes, should the need arise. Copyright © 2014, American Society for Microbiology. All Rights Reserved.
Estimation of gene induction enables a relevance-based ranking of gene sets.
Bartholomé, Kilian; Kreutz, Clemens; Timmer, Jens
2009-07-01
In order to handle and interpret the vast amounts of data produced by microarray experiments, the analysis of sets of genes with a common biological functionality has been shown to be advantageous compared to single gene analyses. Some statistical methods have been proposed to analyse the differential gene expression of gene sets in microarray experiments. However, most of these methods either require threshold values to be chosen for the analysis, or they need some reference set for the determination of significance. We present a method that estimates the number of differentially expressed genes in a gene set without requiring a threshold value for significance of genes. The method is self-contained (i.e., it does not require a reference set for comparison). In contrast to other methods which are focused on significance, our approach emphasizes the relevance of the regulation of gene sets. The presented method measures the degree of regulation of a gene set and is a useful tool to compare the induction of different gene sets and place the results of microarray experiments into the biological context. An R-package is available.
Tra, Yolande V; Evans, Irene M
2010-01-01
BIO2010 put forth the goal of improving the mathematical educational background of biology students. The analysis and interpretation of microarray high-dimensional data can be very challenging and is best done by a statistician and a biologist working and teaching in a collaborative manner. We set up such a collaboration and designed a course on microarray data analysis. We started using Genome Consortium for Active Teaching (GCAT) materials and Microarray Genome and Clustering Tool software and added R statistical software along with Bioconductor packages. In response to student feedback, one microarray data set was fully analyzed in class, starting from preprocessing to gene discovery to pathway analysis using the latter software. A class project was to conduct a similar analysis where students analyzed their own data or data from a published journal paper. This exercise showed the impact that filtering, preprocessing, and different normalization methods had on gene inclusion in the final data set. We conclude that this course achieved its goals to equip students with skills to analyze data from a microarray experiment. We offer our insight about collaborative teaching as well as how other faculty might design and implement a similar interdisciplinary course.
WebArray: an online platform for microarray data analysis
Xia, Xiaoqin; McClelland, Michael; Wang, Yipeng
2005-01-01
Background Many cutting-edge microarray analysis tools and algorithms, including the commonly used limma and affy packages in Bioconductor, need sophisticated knowledge of mathematics, statistics and computer skills for implementation. Commercially available software can provide a user-friendly interface at considerable cost. To facilitate the use of these tools for microarray data analysis on an open platform we developed an online microarray data analysis platform, WebArray, for bench biologists to utilize these tools to explore data from single/dual color microarray experiments. Results The currently implemented functions were based on the limma and affy packages from Bioconductor, the spacings LOESS histogram (SPLOSH) method, a PCA-assisted normalization method and a genome mapping method. WebArray incorporates these packages and provides a user-friendly interface for accessing a wide range of key functions of limma and others, such as spot quality weight, background correction, graphical plotting, normalization, linear modeling, empirical Bayes statistical analysis, false discovery rate (FDR) estimation, and chromosomal mapping for genome comparison. Conclusion WebArray offers a convenient platform for bench biologists to access several cutting-edge microarray data analysis tools. The website is freely available. It runs on a Linux server with Apache and MySQL. PMID:16371165
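As an illustration of the false discovery rate step listed among these functions, the following Python sketch computes Benjamini-Hochberg adjusted p-values. It is a generic version of that standard procedure and is not taken from WebArray or limma; the function name `bh_adjust` is hypothetical.

```python
import numpy as np


def bh_adjust(pvalues):
    """Benjamini-Hochberg adjusted p-values (generic sketch, hypothetical name)."""
    p = np.asarray(pvalues, dtype=float)
    n = p.size
    order = np.argsort(p)
    # Scale each sorted p-value by n / rank.
    scaled = p[order] * n / np.arange(1, n + 1)
    # Enforce monotonicity, working down from the largest p-value.
    scaled = np.minimum.accumulate(scaled[::-1])[::-1]
    adjusted = np.empty(n)
    adjusted[order] = np.minimum(scaled, 1.0)
    return adjusted
```

Genes whose adjusted value falls below a chosen level (0.05 is a common choice) would then be reported as differentially expressed at that FDR.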
Evans, Irene M.
2010-01-01
BIO2010 put forth the goal of improving the mathematical educational background of biology students. The analysis and interpretation of microarray high-dimensional data can be very challenging and is best done by a statistician and a biologist working and teaching in a collaborative manner. We set up such a collaboration and designed a course on microarray data analysis. We started using Genome Consortium for Active Teaching (GCAT) materials and Microarray Genome and Clustering Tool software and added R statistical software along with Bioconductor packages. In response to student feedback, one microarray data set was fully analyzed in class, starting from preprocessing to gene discovery to pathway analysis using the latter software. A class project was to conduct a similar analysis where students analyzed their own data or data from a published journal paper. This exercise showed the impact that filtering, preprocessing, and different normalization methods had on gene inclusion in the final data set. We conclude that this course achieved its goals to equip students with skills to analyze data from a microarray experiment. We offer our insight about collaborative teaching as well as how other faculty might design and implement a similar interdisciplinary course. PMID:20810954
Autoregressive-model-based missing value estimation for DNA microarray time series data.
Choong, Miew Keen; Charbit, Maurice; Yan, Hong
2009-01-01
Missing value estimation is important in DNA microarray data analysis. A number of algorithms have been developed to solve this problem, but they have several limitations. Most existing algorithms are not able to deal with the situation where a particular time point (column) of the data is missing entirely. In this paper, we present an autoregressive-model-based missing value estimation method (ARLSimpute) that takes into account the dynamic property of microarray temporal data and the local similarity structures in the data. ARLSimpute is especially effective for the situation where a particular time point contains many missing values or where the entire time point is missing. Experiment results suggest that our proposed algorithm is an accurate missing value estimator in comparison with other imputation methods on simulated as well as real microarray time series datasets.
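The case of an entirely missing time point can be illustrated with a simplified Python sketch. It is not ARLSimpute: it fits a single autoregressive model of fixed order, pooled across all genes, by ordinary least squares, and uses it to predict missing values from the preceding time points. The function name `ar_fill`, the pooled fit, and the absence of an intercept term are all assumptions made for illustration.

```python
import numpy as np


def ar_fill(series, order=2):
    """Fill missing values with a pooled AR(order) forward prediction.

    Simplified, hypothetical sketch; rows = genes, columns = time points.
    """
    x = np.array(series, dtype=float)
    n_genes, n_time = x.shape

    # Collect every fully observed window of length order + 1.
    rows, targets = [], []
    for g in range(n_genes):
        for t in range(order, n_time):
            window = x[g, t - order:t + 1]
            if not np.isnan(window).any():
                rows.append(window[:-1])
                targets.append(window[-1])
    if len(rows) < order:
        raise ValueError("not enough complete windows to fit the model")

    # Least-squares AR coefficients (no intercept, by assumption).
    coef, *_ = np.linalg.lstsq(np.array(rows), np.array(targets), rcond=None)

    # Predict each missing value whose preceding values are available.
    for g in range(n_genes):
        for t in range(order, n_time):
            past = x[g, t - order:t]
            if np.isnan(x[g, t]) and not np.isnan(past).any():
                x[g, t] = past @ coef
    return x, coef
```

Because the coefficients are estimated from other windows across all genes, a column in which every gene is missing can still be predicted, which a purely neighbor-based imputer cannot do for that column. Values in the first `order` time points are left untouched by this sketch.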
Deutsch, Eric W; Ball, Catherine A; Berman, Jules J; Bova, G Steven; Brazma, Alvis; Bumgarner, Roger E; Campbell, David; Causton, Helen C; Christiansen, Jeffrey H; Daian, Fabrice; Dauga, Delphine; Davidson, Duncan R; Gimenez, Gregory; Goo, Young Ah; Grimmond, Sean; Henrich, Thorsten; Herrmann, Bernhard G; Johnson, Michael H; Korb, Martin; Mills, Jason C; Oudes, Asa J; Parkinson, Helen E; Pascal, Laura E; Pollet, Nicolas; Quackenbush, John; Ramialison, Mirana; Ringwald, Martin; Salgado, David; Sansone, Susanna-Assunta; Sherlock, Gavin; Stoeckert, Christian J; Swedlow, Jason; Taylor, Ronald C; Walashek, Laura; Warford, Anthony; Wilkinson, David G; Zhou, Yi; Zon, Leonard I; Liu, Alvin Y; True, Lawrence D
2015-01-01
One purpose of the biomedical literature is to report results in sufficient detail so that the methods of data collection and analysis can be independently replicated and verified. Here we present for consideration a minimum information specification for gene expression localization experiments, called the “Minimum Information Specification For In Situ Hybridization and Immunohistochemistry Experiments (MISFISHIE)”. It is modelled after the MIAME (Minimum Information About a Microarray Experiment) specification for microarray experiments. Data specifications like MIAME and MISFISHIE specify the information content without dictating a format for encoding that information. The MISFISHIE specification describes six types of information that should be provided for each experiment: Experimental Design, Biomaterials and Treatments, Reporters, Staining, Imaging Data, and Image Characterizations. This specification has benefited the consortium within which it was initially developed and is expected to benefit the wider research community. We welcome feedback from the scientific community to help improve our proposal. PMID:18327244
Wimmer, Isabella; Tröscher, Anna R; Brunner, Florian; Rubino, Stephen J; Bien, Christian G; Weiner, Howard L; Lassmann, Hans; Bauer, Jan
2018-04-20
Formalin-fixed paraffin-embedded (FFPE) tissues are valuable resources commonly used in pathology. However, formalin fixation modifies nucleic acids, which makes the isolation of high-quality RNA for genetic profiling challenging. Here, we assessed feasibility and reliability of microarray studies analysing transcriptome data from fresh, fresh-frozen (FF) and FFPE tissues. We show that reproducible microarray data can be generated from only 2 ng FFPE-derived RNA. For RNA quality assessment, fragment size distribution (DV200) and qPCR proved most suitable. During RNA isolation, extending tissue lysis time to 10 hours reduced high-molecular-weight species, while additional incubation at 70 °C markedly increased RNA yields. Since FF- and FFPE-derived microarrays constitute different data entities, we used indirect measures to investigate gene signal variation and relative gene expression. Whole-genome analyses revealed high concordance rates, while review on a single-gene basis showed higher data variation in FFPE than FF arrays. Using an experimental model, gene set enrichment analysis (GSEA) of FFPE-derived microarrays and fresh tissue-derived RNA-Seq datasets yielded similarly affected pathways, confirming the applicability of FFPE tissue in global gene expression analysis. Our study provides a workflow comprising RNA isolation, quality assessment and microarray profiling using minimal RNA input, thus enabling hypothesis-generating pathway analyses from limited amounts of precious, pathologically significant FFPE tissues.
Catto, James W F; Abbod, Maysam F; Wild, Peter J; Linkens, Derek A; Pilarsky, Christian; Rehman, Ishtiaq; Rosario, Derek J; Denzinger, Stefan; Burger, Maximilian; Stoehr, Robert; Knuechel, Ruth; Hartmann, Arndt; Hamdy, Freddie C
2010-03-01
New methods for identifying bladder cancer (BCa) progression are required. Gene expression microarrays can reveal insights into disease biology and identify novel biomarkers. However, these experiments produce large datasets that are difficult to interpret. Our objective was to develop a novel method of microarray analysis combining two forms of artificial intelligence (AI), neurofuzzy modelling (NFM) and artificial neural networks (ANN), and to validate it in a BCa cohort. We used AI and statistical analyses to identify progression-related genes in a microarray dataset (n=66 tumours, n=2800 genes). The AI-selected genes were then investigated in a second cohort (n=262 tumours) using immunohistochemistry. We compared the accuracy of AI and statistical approaches to identify tumour progression. AI identified 11 progression-associated genes (odds ratio [OR]: 0.70; 95% confidence interval [CI], 0.56-0.87; p=0.0004), and these were more discriminating than genes chosen using statistical analyses (OR: 1.24; 95% CI, 0.96-1.60; p=0.09). The expression of six AI-selected genes (LIG3, FAS, KRT18, ICAM1, DSG2, and BRCA2) was determined using commercial antibodies and successfully identified tumour progression (concordance index: 0.66; log-rank test: p=0.01). AI-selected genes were more discriminating than pathologic criteria at determining progression (Cox multivariate analysis: p=0.01). Limitations include the use of statistical correlation to identify 200 genes for AI analysis and that we did not compare regression-identified genes with immunohistochemistry. AI and statistical analyses use different techniques of inference to determine gene-phenotype associations and identify distinct prognostic gene signatures that are equally valid. We have identified a prognostic gene signature whose members reflect a variety of carcinogenic pathways that could identify progression in non-muscle-invasive BCa. 2009 European Association of Urology. Published by Elsevier B.V. All rights reserved.
Mining microarray data at NCBI's Gene Expression Omnibus (GEO).
Barrett, Tanya; Edgar, Ron
2006-01-01
The Gene Expression Omnibus (GEO) at the National Center for Biotechnology Information (NCBI) has emerged as the leading fully public repository for gene expression data. This chapter describes how to use Web-based interfaces, applications, and graphics to effectively explore, visualize, and interpret the hundreds of microarray studies and millions of gene expression patterns stored in GEO. Data can be examined from both experiment-centric and gene-centric perspectives using user-friendly tools that do not require specialized expertise in microarray analysis or time-consuming download of massive data sets. The GEO database is publicly accessible through the World Wide Web at http://www.ncbi.nlm.nih.gov/geo.
Generalized Correlation Coefficient for Non-Parametric Analysis of Microarray Time-Course Data.
Tan, Qihua; Thomassen, Mads; Burton, Mark; Mose, Kristian Fredløv; Andersen, Klaus Ejner; Hjelmborg, Jacob; Kruse, Torben
2017-06-06
Modeling complex time-course patterns is a challenging issue in microarray studies because gene expression responds in complex ways over the course of a time-course experiment. We introduce the generalized correlation coefficient and propose a combinatory approach for detecting, testing and clustering the heterogeneous time-course gene expression patterns. Application of the method identified nonlinear time-course patterns in high agreement with parametric analysis. We conclude that the non-parametric nature of the generalized correlation analysis could make it a useful and efficient tool for analyzing microarray time-course data and for exploring the complex relationships in the omics data for studying their association with disease and health.
Liang, Shan; Fang, Lu; Zhou, Renchao; Tang, Tian; Deng, Shulin; Dong, Suisui; Huang, Yelin; Zhong, Cairong; Shi, Suhua
2012-01-01
Differential responses to environmental stresses at the level of transcription play a critical role in adaptation. Mangrove species compose a dominant community in intertidal zones and form dense forests at the sea-land interface, and although the anatomical and physiological features associated with their salt-tolerant lifestyles have been well characterized, little is known about the impact of transcriptional phenotypes on their adaptation to these saline environments. We report the time-course transcript profiles in the roots of a true mangrove species, Ceriops tagal, as revealed by a series of microarray experiments. The expression of a total of 432 transcripts changed significantly in the roots of C. tagal under salt shock, of which 83 had a more than 2-fold change and were further assembled into 59 unigenes. Global transcription was stable at the early stage of salt stress and then was gradually dysregulated with the increased duration of the stress. Importantly, a pair-wise comparison of predicted homologous gene pairs revealed that the transcriptional regulation of most of the differentially expressed genes in C. tagal was highly divergent from that in the salt-sensitive species Arabidopsis thaliana. This work suggests that transcriptional homeostasis and specific transcriptional regulation are major events in the roots of C. tagal when subjected to salt shock, which could contribute to the establishment of adaptation to saline environments and, thus, facilitate the salt-tolerant lifestyle of this mangrove species. Furthermore, the candidate genes underlying the adaptation were identified through comparative analyses. This study provides a foundation for dissecting the genetic basis of the adaptation of mangroves to intertidal environments.
Yang, Chuanping; Wei, Hairong
2015-02-01
Microarray and RNA-seq experiments have become an important part of modern genomics and systems biology. Obtaining meaningful biological data from these experiments is an arduous task that demands close attention to many details. Negligence at any step can lead to gene expression data containing inadequate or composite information that is recalcitrant for pattern extraction. Therefore, it is imperative to carefully consider experimental design before launching a time-consuming and costly experiment. Currently, most genomics experiments have two objectives: (1) to generate two or more groups of comparable data for identifying differentially expressed genes, gene families, biological processes, or metabolic pathways under experimental conditions; (2) to build local gene regulatory networks and identify hierarchically important regulators governing biological processes and pathways of interest. Since the first objective aims to identify the active molecular identities and the second provides a basis for understanding the underlying molecular mechanisms through inferring causality relationships mediated by treatment, an optimal experiment should produce biologically relevant and extractable data that meet both objectives without substantially increasing the cost. This review discusses the major issues that researchers commonly face when embarking on microarray or RNA-seq experiments and summarizes important aspects of experimental design, which aim to help researchers deliberate how to generate gene expression profiles with low background noise but with more interaction to facilitate novel biological discoveries in modern plant genomics. Copyright © 2015 The Author. Published by Elsevier Inc. All rights reserved.
Kim, Eun Ju; Lee, Dong Hun; Kim, Yeon Kyung; Kim, Min-Kyoung; Kim, Jung Yun; Lee, Min Jung; Choi, Won Woo; Eun, Hee Chul; Chung, Jin Ho
2014-12-01
Sensitive skin represents hyperactive sensory symptoms showing exaggerated reactions in response to internal stimulants or external irritants. Although sensitive skin is a very common condition affecting an estimated 50% of the population, its pathophysiology remains largely elusive, particularly with regard to its metabolic aspects. The objective of our study was to investigate the pathogenesis of sensitive skin. We recruited healthy participants with 'sensitive' or 'non-sensitive' skin based on standardized questionnaires and a 10% lactic acid stinging test, and obtained skin samples for microarray analysis and subsequent experiments. Microarray transcriptome profiling revealed that the expression of genes involved in muscle contraction, carbohydrate and lipid metabolism, and ion transport and balance was significantly decreased in sensitive skin. These altered genes could account for the abnormal muscle contraction and the decreased amount of ATP in sensitive skin. In addition, pain-related transcripts such as TRPV1, ASIC3 and CGRP were significantly up-regulated in sensitive skin, compared with non-sensitive skin. Our findings suggest that sensitive skin is closely associated with the dysfunction of muscle contraction and metabolic homeostasis. Copyright © 2014 Japanese Society for Investigative Dermatology. Published by Elsevier Ireland Ltd. All rights reserved.
The FDA's Experience with Emerging Genomics Technologies-Past, Present, and Future.
Xu, Joshua; Thakkar, Shraddha; Gong, Binsheng; Tong, Weida
2016-07-01
The rapid advancement of emerging genomics technologies and their application for assessing safety and efficacy of FDA-regulated products require a high standard of reliability and robustness supporting regulatory decision-making in the FDA. To facilitate regulatory application and to engage stakeholders, the FDA implemented a novel data submission program, Voluntary Genomics Data Submission (VGDS). As part of the endeavor, for the past 10 years, the FDA has led an international consortium of regulatory agencies, academia, pharmaceutical companies, and genomics platform providers, which was named MicroArray Quality Control Consortium (MAQC), to address issues such as reproducibility, precision, specificity/sensitivity, and data interpretation. Three projects have been completed so far assessing these genomics technologies: gene expression microarrays, whole genome genotyping arrays, and whole transcriptome sequencing (i.e., RNA-seq). The resultant studies provide the basic parameters for fit-for-purpose application of these new data streams in regulatory environments, and the solutions have been made available to the public through peer-reviewed publications. The latest MAQC project, also called the SEquencing Quality Control (SEQC) project, focused on next-generation sequencing. Using reference samples with built-in controls, SEQC studies have demonstrated that relative gene expression can be measured accurately and reliably across laboratories and RNA-seq platforms. Besides prediction performance comparable to microarrays in clinical settings and safety assessments, RNA-seq is shown to have better sensitivity for low expression and to reveal novel transcriptomic features. Future effort of MAQC will be focused on quality control of whole genome sequencing and targeted sequencing.
Data submission and quality in microarray-based microRNA profiling
Witwer, Kenneth W.
2014-01-01
Background Public sharing of scientific data has assumed greater importance in the ‘omics’ era. Transparency is necessary for confirmation and validation, and multiple examiners aid in extracting maximal value from large datasets. Accordingly, database submission and provision of the Minimum Information About a Microarray Experiment (MIAME) are required by most journals as a prerequisite for review or acceptance. Methods In this study, the level of data submission and MIAME compliance was reviewed for 127 articles that included microarray-based microRNA profiling and that were published from July, 2011 through April, 2012 in the journals that published the largest number of such articles—PLOS ONE, the Journal of Biological Chemistry, Blood, and Oncogene—along with articles from nine other journals, including Clinical Chemistry, that published smaller numbers of array-based articles. Results Overall, data submission was reported at publication for less than 40% of all articles, and almost 75% of articles were MIAME-noncompliant. On average, articles that included full data submission scored significantly higher on a quality metric than articles with limited or no data submission, and studies with adequate description of methods disproportionately included larger numbers of experimental repeats. Finally, for several articles that were not MIAME-compliant, data re-analysis revealed less than complete support for the published conclusions, in one case leading to retraction. Conclusions These findings buttress the hypothesis that reluctance to share data is associated with low study quality and suggest that most miRNA array investigations are underpowered and/or potentially compromised by a lack of appropriate reporting and data submission. PMID:23358751
Data submission and quality in microarray-based microRNA profiling.
Witwer, Kenneth W
2013-02-01
Public sharing of scientific data has assumed greater importance in the omics era. Transparency is necessary for confirmation and validation, and multiple examiners aid in extracting maximal value from large data sets. Accordingly, database submission and provision of the Minimum Information About a Microarray Experiment (MIAME) are required by most journals as a prerequisite for review or acceptance. In this study, the level of data submission and MIAME compliance was reviewed for 127 articles that included microarray-based microRNA (miRNA) profiling and were published from July 2011 through April 2012 in the journals that published the largest number of such articles--PLOS ONE, the Journal of Biological Chemistry, Blood, and Oncogene--along with articles from 9 other journals, including Clinical Chemistry, that published smaller numbers of array-based articles. Overall, data submission was reported at publication for <40% of all articles, and almost 75% of articles were MIAME noncompliant. On average, articles that included full data submission scored significantly higher on a quality metric than articles with limited or no data submission, and studies with adequate description of methods disproportionately included larger numbers of experimental repeats. Finally, for several articles that were not MIAME compliant, data reanalysis revealed less than complete support for the published conclusions, in 1 case leading to retraction. These findings buttress the hypothesis that reluctance to share data is associated with low study quality and suggest that most miRNA array investigations are underpowered and/or potentially compromised by a lack of appropriate reporting and data submission. © 2012 American Association for Clinical Chemistry
Design and verification of a pangenome microarray oligonucleotide probe set for Dehalococcoides spp.
Hug, Laura A; Salehi, Maryam; Nuin, Paulo; Tillier, Elisabeth R; Edwards, Elizabeth A
2011-08-01
Dehalococcoides spp. are an industrially relevant group of Chloroflexi bacteria capable of reductively dechlorinating contaminants in groundwater environments. Existing Dehalococcoides genomes revealed a high level of sequence identity within this group, including 98 to 100% 16S rRNA sequence identity between strains with diverse substrate specificities. Common molecular techniques for identification of microbial populations are often not applicable for distinguishing Dehalococcoides strains. Here we describe an oligonucleotide microarray probe set designed based on clustered Dehalococcoides genes from five different sources (strain DET195, CBDB1, BAV1, and VS genomes and the KB-1 metagenome). This "pangenome" probe set provides coverage of core Dehalococcoides genes as well as strain-specific genes while optimizing the potential for hybridization to closely related, previously unknown Dehalococcoides strains. The pangenome probe set was compared to probe sets designed independently for each of the five Dehalococcoides strains. The pangenome probe set demonstrated better predictability and higher detection of Dehalococcoides genes than strain-specific probe sets on nontarget strains with <99% average nucleotide identity. An in silico analysis of the expected probe hybridization against the recently released Dehalococcoides strain GT genome and additional KB-1 metagenome sequence data indicated that the pangenome probe set performs more robustly than the combined strain-specific probe sets in the detection of genes not included in the original design. The pangenome probe set represents a highly specific, universal tool for the detection and characterization of Dehalococcoides from contaminated sites. It has the potential to become a common platform for Dehalococcoides-focused research, allowing meaningful comparisons between microarray experiments regardless of the strain examined.
The FDA’s Experience with Emerging Genomics Technologies—Past, Present, and Future
Xu, Joshua; Thakkar, Shraddha; Gong, Binsheng; Tong, Weida
2016-01-01
The rapid advancement of emerging genomics technologies and their application for assessing safety and efficacy of FDA-regulated products require a high standard of reliability and robustness supporting regulatory decision-making in the FDA. To facilitate regulatory application and to engage stakeholders, the FDA implemented a novel data submission program, Voluntary Genomics Data Submission (VGDS). As part of the endeavor, for the past 10 years, the FDA has led an international consortium of regulatory agencies, academia, pharmaceutical companies, and genomics platform providers, named the MicroArray Quality Control Consortium (MAQC), to address issues such as reproducibility, precision, specificity/sensitivity, and data interpretation. Three projects have been completed so far, assessing these genomics technologies: gene expression microarrays, whole genome genotyping arrays, and whole transcriptome sequencing (i.e., RNA-seq). The resultant studies provide the basic parameters for fit-for-purpose application of these new data streams in regulatory environments, and the solutions have been made available to the public through peer-reviewed publications. The latest MAQC project, also called the SEquencing Quality Control (SEQC) project, focused on next-generation sequencing. Using reference samples with built-in controls, SEQC studies have demonstrated that relative gene expression can be measured accurately and reliably across laboratories and RNA-seq platforms. Besides prediction performance comparable to microarrays in clinical settings and safety assessments, RNA-seq is shown to have better sensitivity for low expression and to reveal novel transcriptomic features. Future efforts of MAQC will focus on quality control of whole genome sequencing and targeted sequencing. PMID:27116022
Denou, Emmanuel; Pridmore, Raymond David; Berger, Bernard; Panoff, Jean-Michel; Arigoni, Fabrizio; Brüssow, Harald
2008-05-01
Lactobacillus johnsonii strains NCC533 and ATCC 33200 (the type strain of this species) differed significantly in gut residence time (12 versus 5 days) after oral feeding to mice. Genes affecting the long gut residence time of the probiotic strain NCC533 were targeted for analysis. We hypothesized that genes specific for this strain, which are expressed during passage of the bacterium through the gut, affect the phenotype. When the DNA of the type strain was hybridized against a microarray of the sequenced NCC533 strain, we identified 233 genes that were specific for the long-gut-persistence isolate. Whole-genome transcription analysis of the NCC533 strain using the microarray format identified 174 genes that were strongly and consistently expressed in the jejunum of mice monocolonized with this strain. Fusion of the two microarray data sets identified three gene loci that were both expressed in vivo and specific to the long-gut-persistence isolate. The identified genes included LJ1027 and LJ1028, two glycosyltransferase genes in the exopolysaccharide synthesis operon; LJ1654 to LJ1656, encoding a sugar phosphotransferase system (PTS) transporter annotated as mannose PTS; and LJ1680, whose product shares 30% amino acid identity with immunoglobulin A proteases from pathogenic bacteria. Knockout mutants were tested in vivo. The experiments revealed that deletion of LJ1654 to LJ1656 and LJ1680 decreased the gut residence time, while a mutant with a deleted exopolysaccharide biosynthesis cluster had a slightly increased residence time.
Feng, Yinling; Wang, Xuefeng
2017-03-01
In order to investigate commonly disturbed genes and pathways in various brain regions of patients with Parkinson's disease (PD), microarray datasets from previous studies were collected and systematically analyzed. Different normalization methods were applied to microarray datasets from different platforms. A strategy combining gene co‑expression networks and clinical information was adopted, using weighted gene co‑expression network analysis (WGCNA) to screen for commonly disturbed genes in different brain regions of patients with PD. Functional enrichment analysis of commonly disturbed genes was performed using the Database for Annotation, Visualization, and Integrated Discovery (DAVID). Co‑pathway relationships were identified with Pearson's correlation coefficient tests and a hypergeometric distribution‑based test. Common genes in pathway pairs were selected out and regarded as risk genes. A total of 17 microarray datasets from 7 platforms were retained for further analysis. Five gene coexpression modules were identified, containing 9,745, 736, 233, 101 and 93 genes, respectively. One module was significantly correlated with PD samples and thus the 736 genes it contained were considered to be candidate PD‑associated genes. Functional enrichment analysis demonstrated that these genes were implicated in oxidative phosphorylation and PD. A total of 44 pathway pairs and 52 risk genes were revealed, and a risk gene pathway relationship network was constructed. Eight modules were identified and were revealed to be associated with PD, cancers and metabolism. A number of disturbed pathways and risk genes were unveiled in PD, and these findings may help advance understanding of PD pathogenesis.
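The hypergeometric distribution-based test mentioned in this abstract can be illustrated with a minimal sketch. It asks how surprising the overlap between two pathways is, given their sizes and a background gene count. The function name `hypergeom_sf` and all numbers below are hypothetical and are not taken from the study.

```python
from math import comb

def hypergeom_sf(k, N, K, n):
    """P(X >= k) for X ~ Hypergeometric(population N, K marked genes, n draws)."""
    total = comb(N, n)
    upper = min(K, n)
    # Sum the upper tail exactly with integer arithmetic, then divide once.
    tail = sum(comb(K, i) * comb(N - K, n - i) for i in range(k, upper + 1))
    return tail / total

# Hypothetical example: 20,000 background genes, pathway A has 100 genes,
# pathway B has 80 genes, and the two pathways share 10 genes.
p_value = hypergeom_sf(k=10, N=20000, K=100, n=80)
print(f"overlap p-value: {p_value:.3e}")
```

A small p-value here only says the overlap is larger than expected by chance under this simple model; how the study combined it with the correlation test is not specified in the abstract.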
Goodman, Corey W.; Major, Heather J.; Walls, William D.; Sheffield, Val C.; Casavant, Thomas L.; Darbro, Benjamin W.
2016-01-01
Chromosomal microarrays (CMAs) are routinely used in both research and clinical laboratories; yet, little attention has been given to the estimation of genome-wide true and false negatives during the assessment of these assays and how such information could be used to calibrate various algorithmic metrics to improve performance. Low-throughput, locus-specific methods such as fluorescence in situ hybridization (FISH), quantitative PCR (qPCR), or multiplex ligation-dependent probe amplification (MLPA) preclude rigorous calibration of various metrics used by copy number variant (CNV) detection algorithms. To aid this task, we have established a comparative methodology, CNV-ROC, which is capable of performing a high throughput, low cost, analysis of CMAs that takes into consideration genome-wide true and false negatives. CNV-ROC uses a higher resolution microarray to confirm calls from a lower resolution microarray and provides for a true measure of genome-wide performance metrics at the resolution offered by microarray testing. CNV-ROC also provides for a very precise comparison of CNV calls between two microarray platforms without the need to establish an arbitrary degree of overlap. Comparison of CNVs across microarrays is done on a per-probe basis and receiver operator characteristic (ROC) analysis is used to calibrate algorithmic metrics, such as log2 ratio threshold, to enhance CNV calling performance. CNV-ROC addresses a critical and consistently overlooked aspect of analytical assessments of genome-wide techniques like CMAs which is the measurement and use of genome-wide true and false negative data for the calculation of performance metrics and comparison of CNV profiles between different microarray experiments. PMID:25595567
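The per-probe ROC calibration described in this abstract can be sketched as follows, assuming each probe on the lower-resolution array has an absolute log2 ratio and a true/false label derived from the higher-resolution array. The data, the candidate thresholds, and the use of Youden's J to pick a threshold are illustrative assumptions, not the CNV-ROC implementation.

```python
def roc_points(abs_log2, truth, thresholds):
    """Return (threshold, FPR, TPR) when a probe is called if |log2 ratio| >= threshold."""
    pos = sum(truth)
    neg = len(truth) - pos
    points = []
    for t in thresholds:
        calls = [v >= t for v in abs_log2]
        tp = sum(c and y for c, y in zip(calls, truth))
        fp = sum(c and not y for c, y in zip(calls, truth))
        points.append((t, fp / neg, tp / pos))
    return points

def best_by_youden(points):
    """Pick the threshold maximizing TPR - FPR (an assumed selection rule)."""
    return max(points, key=lambda p: p[2] - p[1])

# Hypothetical per-probe data: |log2 ratio| on the lower-resolution array,
# and whether the higher-resolution array confirms a CNV at that probe.
abs_log2 = [0.05, 0.10, 0.45, 0.60, 0.08, 0.30, 0.55, 0.12]
truth = [False, False, True, True, False, False, True, False]

points = roc_points(abs_log2, truth, thresholds=[0.1, 0.2, 0.4, 0.5])
best = best_by_youden(points)
print(best)
```

Because true negatives are counted per probe, the false positive rate is defined genome-wide rather than only at loci that were called.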
Leavey, Katherine; Bainbridge, Shannon A; Cox, Brian J
2015-01-01
Preeclampsia (PE) is a life-threatening hypertensive pathology of pregnancy affecting 3-5% of all pregnancies. To date, PE has no cure, early detection markers, or effective treatments short of the removal of what is thought to be the causative organ, the placenta, which may necessitate a preterm delivery. Additionally, numerous small placental microarray studies attempting to identify "PE-specific" genes have yielded inconsistent results. We therefore hypothesize that preeclampsia is a multifactorial disease encompassing several pathology subclasses, and that large cohort placental gene expression analysis will reveal these groups. To address our hypothesis, we utilized known bioinformatic methods to aggregate 7 microarray data sets across multiple platforms in order to generate a large data set of 173 patient samples, including 77 with preeclampsia. Unsupervised clustering of these patient samples revealed three distinct molecular subclasses of PE. This included a "canonical" PE subclass demonstrating elevated expression of known PE markers and genes associated with poor oxygenation and increased secretion, as well as two other subclasses potentially representing a poor maternal response to pregnancy and an immunological presentation of preeclampsia. Our analysis sheds new light on the heterogeneity of PE patients, and offers up additional avenues for future investigation. Hopefully, our subclassification of preeclampsia based on molecular diversity will finally lead to the development of robust diagnostics and patient-based treatments for this disorder.
Improved microarray methods for profiling the yeast knockout strain collection
Yuan, Daniel S.; Pan, Xuewen; Ooi, Siew Loon; Peyser, Brian D.; Spencer, Forrest A.; Irizarry, Rafael A.; Boeke, Jef D.
2005-01-01
A remarkable feature of the Yeast Knockout strain collection is the presence of two unique 20mer TAG sequences in almost every strain. In principle, the relative abundances of strains in a complex mixture can be profiled swiftly and quantitatively by amplifying these sequences and hybridizing them to microarrays, but TAG microarrays have not been widely used. Here, we introduce a TAG microarray design with sophisticated controls and describe a robust method for hybridizing high concentrations of dye-labeled TAGs in single-stranded form. We also highlight the importance of avoiding PCR contamination and provide procedures for detection and eradication. Validation experiments using these methods yielded false positive (FP) and false negative (FN) rates for individual TAG detection of 3–6% and 15–18%, respectively. Analysis demonstrated that cross-hybridization was the chief source of FPs, while TAG amplification defects were the main cause of FNs. The materials, protocols, data and associated software described here comprise a suite of experimental resources that should facilitate the use of TAG microarrays for a wide variety of genetic screens. PMID:15994458
Clustering approaches to identifying gene expression patterns from DNA microarray data.
Do, Jin Hwan; Choi, Dong-Kug
2008-04-30
Careful analysis is essential for extracting meaning from the large amounts of gene expression data that microarrays generate. In this review we focus on clustering techniques. The biological rationale for this approach is the fact that many co-expressed genes are co-regulated, and identifying co-expressed genes could aid in functional annotation of novel genes, de novo identification of transcription factor binding sites and elucidation of complex biological pathways. Co-expressed genes are usually identified in microarray experiments by clustering techniques. There are many such methods, and the results obtained even for the same datasets may vary considerably depending on the algorithms and metrics for dissimilarity measures used, as well as on user-selectable parameters such as desired number of clusters and initial values. Therefore, biologists who want to interpret microarray data should be aware of the weaknesses and strengths of the clustering methods used. In this review, we survey the basic principles of clustering of DNA microarray data from crisp clustering algorithms such as hierarchical clustering, K-means and self-organizing maps, to complex clustering algorithms like fuzzy clustering.
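As a concrete illustration of one crisp method named above, here is a minimal K-means sketch with squared Euclidean distance. The expression profiles and the choice of initial centers are hypothetical; as the abstract notes, both the number of clusters and the initial values are user-selectable and can change the result.

```python
def assign(profiles, centers):
    """Label each profile with the index of its nearest center."""
    labels = []
    for p in profiles:
        dists = [sum((a - b) ** 2 for a, b in zip(p, c)) for c in centers]
        labels.append(dists.index(min(dists)))
    return labels

def kmeans(profiles, centers, max_iter=50):
    """Plain K-means: alternate assignment and center update until stable."""
    centers = [list(c) for c in centers]
    for _ in range(max_iter):
        labels = assign(profiles, centers)
        new_centers = []
        for k, old in enumerate(centers):
            members = [p for p, lab in zip(profiles, labels) if lab == k]
            if members:
                new_centers.append([sum(col) / len(members) for col in zip(*members)])
            else:
                new_centers.append(old)  # keep an empty cluster's center unchanged
        if new_centers == centers:
            break
        centers = new_centers
    return assign(profiles, centers), centers

# Hypothetical log-expression profiles of four genes across three conditions.
profiles = [[1.0, 2.0, 3.0], [1.1, 2.1, 2.9], [3.0, 2.0, 1.0], [2.9, 1.9, 1.2]]
labels, centers = kmeans(profiles, centers=[profiles[0], profiles[2]])
print(labels)
```

Here the two rising profiles and the two falling profiles end up in separate clusters; rerunning with different initial centers is a simple way to check how stable a clustering is.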
Hayeems, R Z; Babul-Hirji, R; Hoang, N; Weksberg, R; Shuman, C
2016-04-01
Advances in genome-based microarray and sequencing technologies hold tremendous promise for understanding, better-managing and/or preventing disease and disease-related risk. Chromosome microarray technology (array based comparative genomic hybridization [aCGH]) is widely utilized in pediatric care to inform diagnostic etiology and medical management. Less clear is how parents experience and perceive the value of this technology. This study explored parents' experiences with aCGH in the pediatric setting, focusing on how they make meaning of various types of test results. We conducted in-person or telephone-based semi-structured interviews with parents of 21 children who underwent aCGH testing in 2010. Transcripts were coded and analyzed thematically according to the principles of interpretive description. We learned that parents expect genomic tests to be of personal use; their experiences with aCGH results characterize this use as intrinsic in the test's ability to provide a much sought-after answer for their child's condition, and instrumental in its ability to guide care, access to services, and family planning. In addition, parents experience uncertainty regardless of whether aCGH results are of pathogenic, uncertain, or benign significance; this triggers frustration, fear, and hope. Findings reported herein better characterize the notion of personal utility and highlight the pervasive nature of uncertainty in the context of genomic testing. Empiric research that links pre-test counseling content and psychosocial outcomes is warranted to optimize patient care.
Grenville-Briggs, Laura J; Stansfield, Ian
2011-01-01
This report describes a linked series of Masters-level computer practical workshops. They comprise an advanced functional genomics investigation, based upon analysis of a microarray dataset probing yeast DNA damage responses. The workshops require the students to analyse highly complex transcriptomics datasets, and were designed to stimulate active learning through experience of current research methods in bioinformatics and functional genomics. They seek to closely mimic a realistic research environment, and require the students first to propose research hypotheses, then test those hypotheses using specific sections of the microarray dataset. The complexity of the microarray data provides students with the freedom to propose their own unique hypotheses, tested using appropriate sections of the microarray data. This research latitude was highly regarded by students and is a strength of this practical. In addition, the focus on DNA damage by radiation and mutagenic chemicals allows them to place their results in a human medical context, and successfully sparks broad interest in the subject material. In evaluation, 79% of students scored the practical workshops on a five-point scale as 4 or 5 (totally effective) for student learning. More broadly, the general use of microarray data as a "student research playground" is also discussed. Copyright © 2011 Wiley Periodicals, Inc.
Thomas, E. V.; Phillippy, K. H.; Brahamsha, B.; Haaland, D. M.; Timlin, J. A.; Elbourne, L. D. H.; Palenik, B.; Paulsen, I. T.
2009-01-01
Until recently microarray experiments often involved relatively few arrays with only a single representation of each gene on each array. A complete genome microarray with multiple spots per gene (spread out spatially across the array) was developed in order to compare the gene expression of a marine cyanobacterium and a knockout mutant strain in a defined artificial seawater medium. Statistical methods were developed for analysis in the special situation of this case study where there is gene replication within an array and where relatively few arrays are used, which can be the case with current array technology. Due in part to the replication within an array, it was possible to detect very small changes in the levels of expression between the wild type and mutant strains. One interesting biological outcome of this experiment is the indication of the extent to which the phosphorus regulatory system of this cyanobacterium affects the expression of multiple genes beyond those strictly involved in phosphorus acquisition. PMID:19404483
Thomas, E. V.; Phillippy, K. H.; Brahamsha, B.; ...
2009-01-01
Until recently microarray experiments often involved relatively few arrays with only a single representation of each gene on each array. A complete genome microarray with multiple spots per gene (spread out spatially across the array) was developed in order to compare the gene expression of a marine cyanobacterium and a knockout mutant strain in a defined artificial seawater medium. Statistical methods were developed for analysis in the special situation of this case study where there is gene replication within an array and where relatively few arrays are used, which can be the case with current array technology. Due in part to the replication within an array, it was possible to detect very small changes in the levels of expression between the wild type and mutant strains. One interesting biological outcome of this experiment is the indication of the extent to which the phosphorus regulatory system of this cyanobacterium affects the expression of multiple genes beyond those strictly involved in phosphorus acquisition.
Development and validation of a flax (Linum usitatissimum L.) gene expression oligo microarray
2010-01-01
Background Flax (Linum usitatissimum L.) has been cultivated for around 9,000 years and is therefore one of the oldest cultivated species. Today, flax is still grown for its oil (oil-flax or linseed cultivars) and its cellulose-rich fibres (fibre-flax cultivars) used for high-value linen garments and composite materials. Despite the wide industrial use of flax-derived products, and our current understanding of the regulation of both wood fibre production and oil biosynthesis, more information must be acquired in both domains. Recent advances in genomics are now providing opportunities to improve our fundamental knowledge of these complex processes. In this paper we report the development and validation of a high-density oligo microarray platform dedicated to gene expression analyses in flax. Results Nine different RNA samples obtained from flax inner- and outer-stems, seeds, leaves and roots were used to generate a collection of 1,066,481 ESTs by massive parallel pyrosequencing. Sequences were assembled into 59,626 unigenes and 48,021 sequences were selected for oligo design and high-density microarray (Nimblegen 385K) fabrication with eight non-overlapping 25-mer oligos per unigene. Eighteen independent experiments were used to evaluate the hybridization quality, precision, specificity and accuracy, and all results confirmed the high technical quality of our microarray platform. Cross-validation of microarray data was carried out using quantitative RT-PCR (qRT-PCR). Nine target genes were selected on the basis of microarray results and reflected the whole range of fold change (both up-regulated and down-regulated genes in different samples). A statistically significant positive correlation was obtained comparing expression levels for each target gene across all biological replicates both in qRT-PCR and microarray results. 
Further experiments illustrated the capacity of our arrays to detect differential gene expression in a variety of flax tissues as well as between two contrasted flax varieties. Conclusion All results suggest that our high-density flax oligo-microarray platform can be used as a very sensitive tool for analyzing gene expression in a large variety of tissues as well as in different cultivars. Moreover, this highly reliable platform can also be used for the quantification of mRNA transcriptional profiling in different flax tissues. PMID:20964859
Development and validation of a flax (Linum usitatissimum L.) gene expression oligo microarray.
Fenart, Stéphane; Ndong, Yves-Placide Assoumou; Duarte, Jorge; Rivière, Nathalie; Wilmer, Jeroen; van Wuytswinkel, Olivier; Lucau, Anca; Cariou, Emmanuelle; Neutelings, Godfrey; Gutierrez, Laurent; Chabbert, Brigitte; Guillot, Xavier; Tavernier, Reynald; Hawkins, Simon; Thomasset, Brigitte
2010-10-21
Flax (Linum usitatissimum L.) has been cultivated for around 9,000 years and is therefore one of the oldest cultivated species. Today, flax is still grown for its oil (oil-flax or linseed cultivars) and its cellulose-rich fibres (fibre-flax cultivars) used for high-value linen garments and composite materials. Despite the wide industrial use of flax-derived products, and our current understanding of the regulation of both wood fibre production and oil biosynthesis, more information must be acquired in both domains. Recent advances in genomics are now providing opportunities to improve our fundamental knowledge of these complex processes. In this paper we report the development and validation of a high-density oligo microarray platform dedicated to gene expression analyses in flax. Nine different RNA samples obtained from flax inner- and outer-stems, seeds, leaves and roots were used to generate a collection of 1,066,481 ESTs by massive parallel pyrosequencing. Sequences were assembled into 59,626 unigenes and 48,021 sequences were selected for oligo design and high-density microarray (Nimblegen 385K) fabrication with eight non-overlapping 25-mer oligos per unigene. Eighteen independent experiments were used to evaluate the hybridization quality, precision, specificity and accuracy, and all results confirmed the high technical quality of our microarray platform. Cross-validation of microarray data was carried out using quantitative RT-PCR (qRT-PCR). Nine target genes were selected on the basis of microarray results and reflected the whole range of fold change (both up-regulated and down-regulated genes in different samples). A statistically significant positive correlation was obtained comparing expression levels for each target gene across all biological replicates both in qRT-PCR and microarray results. Further experiments illustrated the capacity of our arrays to detect differential gene expression in a variety of flax tissues as well as between two contrasted flax varieties. 
All results suggest that our high-density flax oligo-microarray platform can be used as a very sensitive tool for analyzing gene expression in a large variety of tissues as well as in different cultivars. Moreover, this highly reliable platform can also be used for the quantification of mRNA transcriptional profiling in different flax tissues.
Mining Microarray Data at NCBI’s Gene Expression Omnibus (GEO)
Barrett, Tanya; Edgar, Ron
2006-01-01
Summary The Gene Expression Omnibus (GEO) at the National Center for Biotechnology Information (NCBI) has emerged as the leading fully public repository for gene expression data. This chapter describes how to use Web-based interfaces, applications, and graphics to effectively explore, visualize, and interpret the hundreds of microarray studies and millions of gene expression patterns stored in GEO. Data can be examined from both experiment-centric and gene-centric perspectives using user-friendly tools that do not require specialized expertise in microarray analysis or time-consuming download of massive data sets. The GEO database is publicly accessible through the World Wide Web at http://www.ncbi.nlm.nih.gov/geo. PMID:16888359
Dotsey, Emmanuel Y.; Gorlani, Andrea; Ingale, Sampat; Achenbach, Chad J.; Forthal, Donald N.; Felgner, Philip L.; Gach, Johannes S.
2015-01-01
In recent years, high throughput discovery of human recombinant monoclonal antibodies (mAbs) has been applied to greatly advance our understanding of the specificity, and functional activity of antibodies against HIV. Thousands of antibodies have been generated and screened in functional neutralization assays, and antibodies associated with cross-strain neutralization and passive protection in primates, have been identified. To facilitate this type of discovery, a high throughput-screening tool is needed to accurately classify mAbs, and their antigen targets. In this study, we analyzed and evaluated a prototype microarray chip comprised of the HIV-1 recombinant proteins gp140, gp120, gp41, and several membrane proximal external region peptides. The protein microarray analysis of 11 HIV-1 envelope-specific mAbs revealed diverse binding affinities and specificities across clades. Half maximal effective concentrations, generated by our chip analysis, correlated significantly (P<0.0001) with concentrations from ELISA binding measurements. Polyclonal immune responses in plasma samples from HIV-1 infected subjects exhibited different binding patterns, and reactivity against printed proteins. Examining the totality of the specificity of the humoral response in this way reveals the exquisite diversity, and specificity of the humoral response to HIV. PMID:25938510
New Statistics for Testing Differential Expression of Pathways from Microarray Data
NASA Astrophysics Data System (ADS)
Siu, Hoicheong; Dong, Hua; Jin, Li; Xiong, Momiao
Extracting biological meaning from microarray data is very important but remains a great challenge. Here, we developed three new statistics, the linear combination test, the quadratic test and the de-correlation test, to identify differentially expressed pathways from gene expression profiles. We applied our statistics to two rheumatoid arthritis datasets. Notably, our results reveal three significant pathways and 275 genes common to the two datasets. The pathways we found help to uncover the disease mechanisms of rheumatoid arthritis, which suggests that our statistics are a powerful tool for functional analysis of gene expression data.
Kudo, Toru; Sasaki, Yohei; Terashima, Shin; Matsuda-Imai, Noriko; Takano, Tomoyuki; Saito, Misa; Kanno, Maasa; Ozaki, Soichi; Suwabe, Keita; Suzuki, Go; Watanabe, Masao; Matsuoka, Makoto; Takayama, Seiji; Yano, Kentaro
2016-10-13
In quantitative gene expression analysis, normalization using a reference gene as an internal control is frequently performed for appropriate interpretation of the results. Efforts have been devoted to exploring superior novel reference genes using microarray transcriptomic data and to evaluating commonly used reference genes by targeting analysis. However, because the number of specifically detectable genes is totally dependent on probe design in the microarray analysis, exploration using microarray data may miss some of the best choices for the reference genes. Recently emerging RNA sequencing (RNA-seq) provides an ideal resource for comprehensive exploration of reference genes since this method is capable of detecting all expressed genes, in principle including even unknown genes. We report the results of a comprehensive exploration of reference genes using public RNA-seq data from plants such as Arabidopsis thaliana (Arabidopsis), Glycine max (soybean), Solanum lycopersicum (tomato) and Oryza sativa (rice). To select reference genes suitable for the broadest experimental conditions possible, candidates were surveyed by the following four steps: (1) evaluation of the basal expression level of each gene in each experiment; (2) evaluation of the expression stability of each gene in each experiment; (3) evaluation of the expression stability of each gene across the experiments; and (4) selection of top-ranked genes, after ranking according to the number of experiments in which the gene was expressed stably. Employing this procedure, 13, 10, 12 and 21 top candidates for reference genes were proposed in Arabidopsis, soybean, tomato and rice, respectively. Microarray expression data confirmed that the expression of the proposed reference genes under broad experimental conditions was more stable than that of commonly used reference genes. 
These novel reference genes will be useful for analyzing gene expression profiles across experiments carried out under various experimental conditions.
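The four-step survey described in this abstract can be sketched roughly as below. The abstract does not say which stability measure or cut-offs were used, so the coefficient of variation (CV), the thresholds `min_level` and `max_cv`, the tie-breaking rule, and the toy data are all assumptions made for illustration.

```python
from statistics import mean, pstdev

def cv(values):
    """Coefficient of variation; infinite when the mean is not positive."""
    m = mean(values)
    return pstdev(values) / m if m > 0 else float("inf")

def rank_reference_genes(data, min_level=10.0, max_cv=0.2):
    """data maps gene -> {experiment: [expression values]}."""
    ranked = []
    for gene, experiments in data.items():
        # Steps 1 and 2: adequate basal level and stability within each experiment.
        stable_in = sum(
            1 for vals in experiments.values()
            if mean(vals) >= min_level and cv(vals) <= max_cv
        )
        # Step 3: stability of the per-experiment means across experiments.
        across_cv = cv([mean(vals) for vals in experiments.values()])
        ranked.append((gene, stable_in, across_cv))
    # Step 4: rank by number of stable experiments, then by across-experiment CV.
    ranked.sort(key=lambda r: (-r[1], r[2]))
    return ranked

# Hypothetical expression values for three genes in two experiments.
data = {
    "geneA": {"exp1": [100, 105, 98], "exp2": [102, 99, 101]},
    "geneB": {"exp1": [100, 300, 50], "exp2": [120, 118, 121]},
    "geneC": {"exp1": [2, 2, 2], "exp2": [3, 3, 3]},
}
ranked = rank_reference_genes(data)
print([gene for gene, _, _ in ranked])
```

In this toy input, geneA is stable in both experiments, geneB in only one, and geneC is excluded everywhere for low expression, so they rank in that order.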
Evaluation of microarray data normalization procedures using spike-in experiments
Rydén, Patrik; Andersson, Henrik; Landfors, Mattias; Näslund, Linda; Hartmanová, Blanka; Noppa, Laila; Sjöstedt, Anders
2006-01-01
Background Recently, a large number of methods for the analysis of microarray data have been proposed but there are few comparisons of their relative performances. By using so-called spike-in experiments, it is possible to characterize the analyzed data and thereby enable comparisons of different analysis methods. Results A spike-in experiment using eight in-house produced arrays was used to evaluate established and novel methods for filtration, background adjustment, scanning, channel adjustment, and censoring. The S-plus package EDMA, a stand-alone tool providing characterization of analyzed cDNA-microarray data obtained from spike-in experiments, was developed and used to evaluate 252 normalization methods. For all analyses, the sensitivities at low false positive rates were observed together with estimates of the overall bias and the standard deviation. In general, there was a trade-off between the ability of the analyses to identify differentially expressed genes (i.e. the analyses' sensitivities) and their ability to provide unbiased estimators of the desired ratios. Virtually all analyses underestimated the magnitude of the regulations; often less than 50% of the true regulations were observed. Moreover, the bias depended on the underlying mRNA concentration; low concentration resulted in high bias. Many of the analyses had relatively low sensitivities, but analyses that used either the constrained model (i.e. a procedure that combines data from several scans) or partial filtration (a novel method for treating data from so-called not-found spots) had, with few exceptions, high sensitivities. These methods gave considerably higher sensitivities than some commonly used analysis methods. Conclusion The use of spike-in experiments is a powerful approach for evaluating microarray preprocessing procedures. Analyzed data are characterized by properties of the observed log-ratios and the analysis' ability to detect differentially expressed genes. 
If bias is not a major problem, we recommend the use of either the CM-procedure or partial filtration. PMID:16774679
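The underestimation reported in this abstract can be made concrete with a small sketch that compares observed log-ratios with the known spike-in log-ratios. The magnitude-based definition of bias and the numbers used here are illustrative assumptions; they are not the definitions or data used by the EDMA package.

```python
from statistics import mean, stdev

def summarize_spike_in(true_log2, observed_log2):
    """Summarize how well observed log2 ratios recover known spike-in log2 ratios."""
    # Bias on the magnitude scale, so up- and down-regulation do not cancel out.
    errors = [abs(o) - abs(t) for o, t in zip(observed_log2, true_log2)]
    # Fraction of the true regulation that was observed, per spiked gene.
    recovered = [o / t for o, t in zip(observed_log2, true_log2) if t != 0]
    return {
        "bias": mean(errors),
        "sd": stdev(errors),
        "mean_fraction_recovered": mean(recovered),
    }

# Hypothetical spiked genes with known and measured log2 ratios.
true_log2 = [1.0, 2.0, -1.0, -2.0]
observed_log2 = [0.5, 1.0, -0.4, -1.2]

summary = summarize_spike_in(true_log2, observed_log2)
print(summary)
```

A negative bias together with a recovered fraction near 0.5 corresponds to the compression toward zero that the abstract describes.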
Inoue, Daisuke; Hinoura, Takuji; Suzuki, Noriko; Pang, Junqin; Malla, Rabin; Shrestha, Sadhana; Chapagain, Saroj Kumar; Matsuzawa, Hiroaki; Nakamura, Takashi; Tanaka, Yasuhiro; Ike, Michihiko; Nishida, Kei; Sei, Kazunari
2015-01-01
Because of heavy dependence on groundwater for drinking water and other domestic use, microbial contamination of groundwater is a serious problem in the Kathmandu Valley, Nepal. This study investigated comprehensively the occurrence of pathogenic bacteria in shallow well groundwater in the Kathmandu Valley by applying DNA microarray analysis targeting 941 pathogenic bacterial species/groups. Water quality measurements found significant coliform (fecal) contamination in 10 of the 11 investigated groundwater samples and significant nitrogen contamination in some samples. The results of DNA microarray analysis revealed the presence of 1-37 pathogen species/groups, including 1-27 biosafety level 2 ones, in 9 of the 11 groundwater samples. While the detected pathogens included several feces- and animal-related ones, those belonging to Legionella and Arthrobacter, which were considered not to be directly associated with feces, were detected prevalently. This study could provide a rough picture of overall pathogenic bacterial contamination in the Kathmandu Valley, and demonstrated the usefulness of DNA microarray analysis as a comprehensive screening tool of a wide variety of pathogenic bacteria.
Microarray Analysis of Long Noncoding RNAs in Female Diabetic Peripheral Neuropathy Patients.
Luo, Lin; Ji, Lin-Dan; Cai, Jiang-Jia; Feng, Mei; Zhou, Mi; Hu, Su-Pei; Xu, Jin; Zhou, Wen-Hua
2018-01-01
Diabetic peripheral neuropathy (DPN) is the most common complication of diabetes mellitus (DM). Because of its controversial pathogenesis, DPN is still not diagnosed or managed properly in most patients. In this study, human lncRNA microarrays were used to identify the differentially expressed lncRNAs in DM and DPN patients, and some of the discovered lncRNAs were further validated in additional 78 samples by quantitative realtime PCR (qRT-PCR). The microarray analysis identified 446 and 1327 differentially expressed lncRNAs in DM and DPN, respectively. The KEGG pathway analysis further revealed that the differentially expressed lncRNA-coexpressed mRNAs between DPN and DM groups were significantly enriched in the MAPK signaling pathway. The lncRNA/mRNA coexpression network indicated that BDNF and TRAF2 correlated with 6 lncRNAs. The qRT-PCR confirmed the initial microarray results. These findings demonstrated that the interplay between lncRNAs and mRNA may be involved in the pathogenesis of DPN, especially the neurotrophin-MAPK signaling pathway, thus providing relevant information for future studies. © 2018 The Author(s). Published by S. Karger AG, Basel.
Cell cycle arrest and gene expression profiling of testis in mice exposed to fluoride.
Su, Kai; Sun, Zilong; Niu, Ruiyan; Lei, Ying; Cheng, Jing; Wang, Jundong
2017-05-01
Exposure to fluoride results in low reproductive capacity; however, the mechanism underlying the impact of fluoride on the male reproductive system still remains obscure. To assess the potential toxicity in the testis of mice administered fluoride, global genome microarray and real-time PCR were performed to detect and identify the altered transcription. In total, 763 differentially expressed genes were identified, including 330 up-regulated and 433 down-regulated genes, which were involved in spermatogenesis, apoptosis, DNA damage, DNA replication, and cell differentiation. Twelve differentially expressed genes were selected to confirm the microarray results using real-time PCR, and the results showed the same trend as the microarray. Furthermore, compared with the control group, more apoptotic spermatogenic cells were observed in the fluoride group, and the proportion of spermatogonia was markedly increased in S phase and decreased in G2/M phase by fluoride. Our findings suggest that global genome microarray analysis provides insight into the reproductive toxicity induced by fluoride, and several important biological clues for further investigation. © 2016 Wiley Periodicals, Inc. Environ Toxicol 32: 1558-1565, 2017.
Approximate geodesic distances reveal biologically relevant structures in microarray data.
Nilsson, Jens; Fioretos, Thoas; Höglund, Mattias; Fontes, Magnus
2004-04-12
Genome-wide gene expression measurements, as currently determined by the microarray technology, can be represented mathematically as points in a high-dimensional gene expression space. Genes interact with each other in regulatory networks, restricting the cellular gene expression profiles to a certain manifold, or surface, in gene expression space. To obtain knowledge about this manifold, various dimensionality reduction methods and distance metrics are used. For data points distributed on curved manifolds, a sensible distance measure would be the geodesic distance along the manifold. In this work, we examine whether an approximate geodesic distance measure captures biological similarities better than the traditionally used Euclidean distance. We computed approximate geodesic distances, determined by the Isomap algorithm, for one set of lymphoma and one set of lung cancer microarray samples. Compared with the ordinary Euclidean distance metric, this distance measure produced more instructive, biologically relevant, visualizations when applying multidimensional scaling. This suggests the Isomap algorithm as a promising tool for the interpretation of microarray data. Furthermore, the results demonstrate the benefit and importance of taking nonlinearities in gene expression data into account.
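The idea of an approximate geodesic distance can be illustrated with a short sketch. The code below is a minimal, assumed reading of the first two Isomap steps (build a k-nearest-neighbour graph, then take shortest paths through it); the function names, the toy half-circle data and the choice k=2 are all hypothetical and are not taken from the paper. The final multidimensional scaling step is omitted.

```python
import heapq
import math


def euclidean(a, b):
    """Ordinary straight-line distance between two points."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def geodesic_distances(points, k):
    """Approximate geodesic distances as shortest paths on a k-NN graph."""
    n = len(points)
    # Symmetric k-nearest-neighbour graph weighted by Euclidean distance.
    graph = [dict() for _ in range(n)]
    for i in range(n):
        nearest = sorted(
            (euclidean(points[i], points[j]), j) for j in range(n) if j != i
        )[:k]
        for d, j in nearest:
            graph[i][j] = d
            graph[j][i] = d
    # Dijkstra's algorithm from every node.
    all_dist = []
    for src in range(n):
        dist = [math.inf] * n
        dist[src] = 0.0
        heap = [(0.0, src)]
        while heap:
            d, u = heapq.heappop(heap)
            if d > dist[u]:
                continue  # stale heap entry
            for v, w in graph[u].items():
                if d + w < dist[v]:
                    dist[v] = d + w
                    heapq.heappush(heap, (d + w, v))
        all_dist.append(dist)
    return all_dist


# Toy "samples" on a half circle, i.e. a curved one-dimensional manifold.
points = [(math.cos(i * math.pi / 10), math.sin(i * math.pi / 10)) for i in range(11)]
geo = geodesic_distances(points, k=2)
straight = euclidean(points[0], points[10])  # cuts across the circle
along = geo[0][10]                           # follows the curve
```

For the two end points the straight-line distance cuts across the circle, while the graph distance follows the arc and comes out close to, but slightly below, the true arc length of pi. This is the kind of discrepancy that motivates replacing the Euclidean metric on curved data.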
Novel genetic tools for studying food-borne Salmonella.
Andrews-Polymenis, Helene L; Santiviago, Carlos A; McClelland, Michael
2009-04-01
Nontyphoidal Salmonellae are highly prevalent food-borne pathogens. High-throughput sequencing of Salmonella genomes is expanding our knowledge of the evolution of serovars and epidemic isolates. Genome sequences have also allowed the creation of complete microarrays. Microarrays have improved the throughput of in vivo expression technology (IVET) used to uncover promoters active during infection. In another method, signature tagged mutagenesis (STM), pools of mutants are subjected to selection. Changes in the population are monitored on a microarray, revealing genes under selection. Complete genome sequences permit the construction of pools of targeted in-frame deletions that have improved STM by minimizing the number of clones and the polarity of each mutant. Together, genome sequences and the continuing development of new tools for functional genomics will drive a revolution in the understanding of Salmonellae in many different niches that are critical for food safety.
Ishiwata, Ryosuke R; Morioka, Masaki S; Ogishima, Soichi; Tanaka, Hiroshi
2009-02-15
BioCichlid is a 3D visualization system of time-course microarray data on molecular networks, aiming at interpretation of gene expression data by transcriptional relationships based on the central dogma with physical and genetic interactions. BioCichlid visualizes both physical (protein) and genetic (regulatory) network layers, and provides animation of time-course gene expression data on the genetic network layer. Transcriptional regulations are represented to bridge the physical network (transcription factors) and genetic network (regulated genes) layers, thus integrating promoter analysis into the pathway mapping. BioCichlid enhances the interpretation of microarray data and allows for revealing the underlying mechanisms causing differential gene expressions. BioCichlid is freely available and can be accessed at http://newton.tmd.ac.jp/. Source codes for both biocichlid server and client are also available.
Grote, Lauren; Myers, Melanie; Lovell, Anne; Saal, Howard; Sund, Kristen Lipscomb
2014-01-01
SNP microarrays are capable of detecting regions of homozygosity (ROH) which can suggest parental relatedness. This study was designed to describe pre- and post-test counseling practices of genetics professionals regarding ROH, explore perceived comfort and ethical concerns in the follow-up of such results, demonstrate awareness of laws surrounding duty to report consanguinity and incest, and allow respondents to share their personal experiences with results suggesting a parental relationship. A 35 question survey was administered to 240 genetic counselors and geneticists who had ordered or counseled for SNP microarray. The results are presented using descriptive statistics. There was variation in both pre- and post-test counseling practices of genetics professionals. Twenty-five percent of respondents reported pre-test counseling that ROH can indicate parental relatedness. The most commonly reported ethical concern was disclosure of findings suggesting parental relatedness to parents of the patient; only 48.4% reported disclosing parental relatedness when indicated. Fifty-seven percent felt comfortable receiving results suggesting parental consanguinity while 17% felt comfortable receiving results suggesting parental incest. Twenty percent of respondents were extremely/moderately familiar with the laws about duty to report incest. Personal experiences in post-test counseling included both parental acknowledgement and denial of relatedness. This study highlights the differences in genetics professionals' pre- and post-test counseling practices, comfort, and experiences surrounding parental relatedness suggested by SNP microarray results. It identifies a need for professional organizations to offer guidance to genetics professionals about how to respond to and counsel for molecular results suggesting parental consanguinity or incest. © 2013 Wiley Periodicals, Inc.
Genome-scale cluster analysis of replicated microarrays using shrinkage correlation coefficient.
Yao, Jianchao; Chang, Chunqi; Salmi, Mari L; Hung, Yeung Sam; Loraine, Ann; Roux, Stanley J
2008-06-18
Currently, clustering with some form of correlation coefficient as the gene similarity metric has become a popular method for profiling genomic data. The Pearson correlation coefficient and the standard deviation (SD)-weighted correlation coefficient are the two most widely-used correlations as the similarity metrics in clustering microarray data. However, these two correlations are not optimal for analyzing replicated microarray data generated by most laboratories. An effective correlation coefficient is needed to provide statistically sufficient analysis of replicated microarray data. In this study, we describe a novel correlation coefficient, shrinkage correlation coefficient (SCC), that fully exploits the similarity between the replicated microarray experimental samples. The methodology considers both the number of replicates and the variance within each experimental group in clustering expression data, and provides a robust statistical estimation of the error of replicated microarray data. The value of SCC is revealed by its comparison with two other correlation coefficients that are currently the most widely-used (Pearson correlation coefficient and SD-weighted correlation coefficient) using statistical measures on both synthetic expression data as well as real gene expression data from Saccharomyces cerevisiae. Two leading clustering methods, hierarchical and k-means clustering were applied for the comparison. The comparison indicated that using SCC achieves better clustering performance. Applying SCC-based hierarchical clustering to the replicated microarray data obtained from germinating spores of the fern Ceratopteris richardii, we discovered two clusters of genes with shared expression patterns during spore germination. Functional analysis suggested that some of the genetic mechanisms that control germination in such diverse plant lineages as mosses and angiosperms are also conserved among ferns. 
This study shows that SCC is an alternative to the Pearson correlation coefficient and the SD-weighted correlation coefficient, and is particularly useful for clustering replicated microarray data. This computational approach should be generally useful for proteomic data or other high-throughput analysis methodology.
Goodman, Corey W; Major, Heather J; Walls, William D; Sheffield, Val C; Casavant, Thomas L; Darbro, Benjamin W
2015-04-01
Chromosomal microarrays (CMAs) are routinely used in both research and clinical laboratories; yet, little attention has been given to the estimation of genome-wide true and false negatives during the assessment of these assays and how such information could be used to calibrate various algorithmic metrics to improve performance. Low-throughput, locus-specific methods such as fluorescence in situ hybridization (FISH), quantitative PCR (qPCR), or multiplex ligation-dependent probe amplification (MLPA) preclude rigorous calibration of various metrics used by copy number variant (CNV) detection algorithms. To aid this task, we have established a comparative methodology, CNV-ROC, which is capable of performing a high throughput, low cost, analysis of CMAs that takes into consideration genome-wide true and false negatives. CNV-ROC uses a higher resolution microarray to confirm calls from a lower resolution microarray and provides for a true measure of genome-wide performance metrics at the resolution offered by microarray testing. CNV-ROC also provides for a very precise comparison of CNV calls between two microarray platforms without the need to establish an arbitrary degree of overlap. Comparison of CNVs across microarrays is done on a per-probe basis and receiver operator characteristic (ROC) analysis is used to calibrate algorithmic metrics, such as log2 ratio threshold, to enhance CNV calling performance. CNV-ROC addresses a critical and consistently overlooked aspect of analytical assessments of genome-wide techniques like CMAs which is the measurement and use of genome-wide true and false negative data for the calculation of performance metrics and comparison of CNV profiles between different microarray experiments. Copyright © 2015 Elsevier Inc. All rights reserved.
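The per-probe ROC calibration described above can be sketched as follows. This is only an illustration under stated assumptions, not the CNV-ROC implementation: the probe values are invented, the higher-resolution array is treated as the truth for each probe, and Youden's J (TPR minus FPR) is an assumed criterion for picking the cutoff, since the abstract does not say how the threshold is chosen.

```python
def roc_points(scores, truth):
    """Return (threshold, TPR, FPR) for each distinct score used as a cutoff."""
    positives = sum(truth)
    negatives = len(truth) - positives
    points = []
    for thr in sorted(set(scores), reverse=True):
        tp = sum(1 for s, t in zip(scores, truth) if s >= thr and t)
        fp = sum(1 for s, t in zip(scores, truth) if s >= thr and not t)
        points.append((thr, tp / positives, fp / negatives))
    return points


def best_threshold(points):
    """Pick the cutoff maximising Youden's J = TPR - FPR (assumed criterion)."""
    return max(points, key=lambda p: p[1] - p[2])[0]


# Hypothetical per-probe data: |log2 ratio| on the lower-resolution array, and
# whether the higher-resolution array places that probe inside a CNV.
abs_log2 = [0.05, 0.10, 0.12, 0.30, 0.45, 0.50, 0.08, 0.60]
in_cnv = [False, False, False, True, True, True, False, True]

curve = roc_points(abs_log2, in_cnv)
threshold = best_threshold(curve)
```

Because every probe is scored, probes correctly left uncalled count as true negatives, which is what allows a false positive rate to be computed genome-wide.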
Identifying Fishes through DNA Barcodes and Microarrays.
Kochzius, Marc; Seidel, Christian; Antoniou, Aglaia; Botla, Sandeep Kumar; Campo, Daniel; Cariani, Alessia; Vazquez, Eva Garcia; Hauschild, Janet; Hervet, Caroline; Hjörleifsdottir, Sigridur; Hreggvidsson, Gudmundur; Kappel, Kristina; Landi, Monica; Magoulas, Antonios; Marteinsson, Viggo; Nölte, Manfred; Planes, Serge; Tinti, Fausto; Turan, Cemal; Venugopal, Moleyur N; Weber, Hannes; Blohm, Dietmar
2010-09-07
International fish trade reached an import value of 62.8 billion Euro in 2006, of which the European Union accounted for 44.6%. Species identification is a key problem throughout the life cycle of fishes: from eggs and larvae to adults in fisheries research and control, as well as processed fish products in consumer protection. This study aims to evaluate the applicability of the three mitochondrial genes 16S rRNA (16S), cytochrome b (cyt b), and cytochrome oxidase subunit I (COI) for the identification of 50 European marine fish species by combining techniques of "DNA barcoding" and microarrays. In a DNA barcoding approach, neighbour-joining (NJ) phylogenetic trees of 369 16S, 212 cyt b, and 447 COI sequences indicated that cyt b and COI are suitable for unambiguous identification, whereas 16S failed to discriminate closely related flatfish and gurnard species. In the course of probe design for DNA microarray development, each of the markers yielded a high number of potentially species-specific probes in silico, although many of them were rejected based on microarray hybridisation experiments. None of the markers provided probes to discriminate the sibling flatfish and gurnard species. However, since 16S probes were less negatively influenced by the "position of label" effect and showed the lowest rejection rate and the highest mean signal intensity, 16S is more suitable for DNA microarray probe design than cyt b and COI. The large portion of COI probes rejected after hybridisation experiments (>90%) renders this DNA barcoding marker rather unsuitable for this high-throughput technology. Based on these data, a DNA microarray containing 64 functional oligonucleotide probes for the identification of 30 out of the 50 fish species investigated was developed. It represents the next step towards an automated and easy-to-handle method to identify fish, ichthyoplankton, and fish products.
DFP: a Bioconductor package for fuzzy profile identification and gene reduction of microarray data
Glez-Peña, Daniel; Álvarez, Rodrigo; Díaz, Fernando; Fdez-Riverola, Florentino
2009-01-01
Background: Expression profiling assays done by using DNA microarray technology generate enormous data sets that are not amenable to simple analysis. The greatest challenge in maximizing the use of this huge amount of data is to develop algorithms to interpret and interconnect results from different genes under different conditions. In this context, fuzzy logic can provide a systematic and unbiased way to both (i) find biologically significant insights relating to meaningful genes, thereby removing the need for expert knowledge in preliminary steps of microarray data analyses and (ii) reduce the cost and complexity of later applied machine learning techniques being able to achieve interpretable models. Results: DFP is a new Bioconductor R package that implements a method for discretizing and selecting differentially expressed genes based on the application of fuzzy logic. DFP takes advantage of fuzzy membership functions to assign linguistic labels to gene expression levels. The technique builds a reduced set of relevant genes (FP, Fuzzy Pattern) able to summarize and represent each underlying class (pathology). A last step constructs a biased set of genes (DFP, Discriminant Fuzzy Pattern) by intersecting existing fuzzy patterns in order to detect discriminative elements. In addition, the software provides new functions and visualisation tools that summarize achieved results and aid in the interpretation of differentially expressed genes from multiple microarray experiments. Conclusion: DFP integrates with other packages of the Bioconductor project, uses common data structures and is accompanied by ample documentation. It has the advantage that its parameters are highly configurable, facilitating the discovery of biologically relevant connections between sets of genes belonging to different pathologies. This information makes it possible to automatically filter irrelevant genes thereby reducing the large volume of data supplied by microarray experiments. 
Based on these contributions GENECBR, a successful tool for cancer diagnosis using microarray datasets, has recently been released. PMID:19178723
Retrieving relevant time-course experiments: a study on Arabidopsis microarrays.
Şener, Duygu Dede; Oğul, Hasan
2016-06-01
Understanding time-course regulation of genes in response to a stimulus is a major concern in current systems biology. The problem is usually approached by computational methods to model the gene behaviour or its networked interactions with the others by a set of latent parameters. The model parameters can be estimated through a meta-analysis of available data obtained from other relevant experiments. The key question here is how to find the relevant experiments which are potentially useful in analysing current data. In this study, the authors address this problem in the context of time-course gene expression experiments from an information retrieval perspective. To this end, they introduce a computational framework that takes a time-course experiment as a query and reports a list of relevant experiments retrieved from a given repository. These retrieved experiments can then be used to associate the environmental factors of query experiment with the findings previously reported. The model is tested using a set of time-course Arabidopsis microarrays. The experimental results show that relevant experiments can be successfully retrieved based on content similarity.
Expression profiling and pathway analysis of Krüppel-like factor 4 in mouse embryonic fibroblasts
Hagos, Engda G; Ghaleb, Amr M; Kumar, Amrita; Neish, Andrew S; Yang, Vincent W
2011-01-01
Background: Krüppel-like factor 4 (KLF4) is a zinc-finger transcription factor with diverse regulatory functions in proliferation, differentiation, and development. KLF4 also plays a role in inflammation, tumorigenesis, and reprogramming of somatic cells to induced pluripotent stem (iPS) cells. To gain insight into the mechanisms by which KLF4 regulates these processes, we conducted DNA microarray analyses to identify differentially expressed genes in mouse embryonic fibroblasts (MEFs) wild type and null for Klf4. Methods: Expression profiles of fibroblasts isolated from mouse embryos wild type or null for the Klf4 alleles were examined by DNA microarrays. Differentially expressed genes were subjected to the Database for Annotation, Visualization and Integrated Discovery (DAVID). The microarray data were also interrogated with the Ingenuity Pathway Analysis (IPA) and Gene Set Enrichment Analysis (GSEA) for pathway identification. Results obtained from the microarray analysis were confirmed by Western blotting for select genes with biological relevance to determine the correlation between mRNA and protein levels. Results: One hundred and sixty three up-regulated and 88 down-regulated genes were identified that demonstrated a fold-change of at least 1.5 and a P-value < 0.05 in Klf4-null MEFs compared to wild type MEFs. Many of the up-regulated genes in Klf4-null MEFs encode proto-oncogenes, growth factors, extracellular matrix, and cell cycle activators. In contrast, genes encoding tumor suppressors and those involved in JAK-STAT signaling pathways are down-regulated in Klf4-null MEFs. IPA and GSEA also identified various pathways that are regulated by KLF4. Lastly, Western blotting of select target genes confirmed the changes revealed by microarray data. 
Conclusions: These data are not only consistent with previous functional studies of KLF4's role in tumor suppression and somatic cell reprogramming, but also revealed novel target genes that mediate KLF4's functions. PMID:21892412
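The selection rule stated in the Results (a fold-change of at least 1.5 together with P < 0.05) can be written out as a small sketch. The gene names and numbers below are hypothetical, and the convention that fold change means the Klf4-null to wild-type expression ratio is an assumption; the abstract does not say how the ratio was oriented.

```python
def classify_gene(fold_change, p_value, min_fold=1.5, alpha=0.05):
    """Label a gene 'up', 'down', or 'unchanged'.

    fold_change is the expression ratio Klf4-null / wild type
    (an assumed convention, not stated in the abstract).
    """
    if p_value >= alpha:
        return "unchanged"
    if fold_change >= min_fold:
        return "up"
    if fold_change <= 1.0 / min_fold:
        return "down"
    return "unchanged"


# Hypothetical genes and values, for illustration only.
results = {
    "gene_a": (2.4, 0.003),
    "gene_b": (0.5, 0.010),
    "gene_c": (1.2, 0.020),
    "gene_d": (3.0, 0.200),
}
labels = {gene: classify_gene(fc, p) for gene, (fc, p) in results.items()}
```

A gene must pass both criteria: gene_c is significant but changes too little, and gene_d changes a lot but is not significant, so both stay "unchanged".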
2013-01-01
Background: Analysis of global gene expression by DNA microarrays is widely used in experimental molecular biology. However, the complexity of such high-dimensional data sets makes it difficult to fully understand the underlying biological features present in the data. The aim of this study is to introduce a method for DNA microarray analysis that provides an intuitive interpretation of data through dimension reduction and pattern recognition. We present the first “Archetypal Analysis” of global gene expression. The analysis is based on microarray data from five integrated studies of Pseudomonas aeruginosa isolated from the airways of cystic fibrosis patients. Results: Our analysis clustered samples into distinct groups with comprehensible characteristics since the archetypes representing the individual groups are closely related to samples present in the data set. Significant changes in gene expression between different groups identified adaptive changes of the bacteria residing in the cystic fibrosis lung. The analysis suggests a similar gene expression pattern between isolates with a high mutation rate (hypermutators) despite accumulation of different mutations for these isolates. This suggests positive selection in the cystic fibrosis lung environment, and changes in gene expression for these isolates are therefore most likely related to adaptation of the bacteria. Conclusions: Archetypal analysis succeeded in identifying adaptive changes of P. aeruginosa. The combination of clustering and matrix factorization made it possible to reveal minor similarities among different groups of data, which other analytical methods failed to identify. We suggest that this analysis could be used to supplement current methods used to analyze DNA microarray data. PMID:24059747
The statistics of identifying differentially expressed genes in Expresso and TM4: a comparison
Sioson, Allan A; Mane, Shrinivasrao P; Li, Pinghua; Sha, Wei; Heath, Lenwood S; Bohnert, Hans J; Grene, Ruth
2006-01-01
Background: Analysis of DNA microarray data takes as input spot intensity measurements from scanner software and returns differential expression of genes between two conditions, together with a statistical significance assessment. This process typically consists of two steps: data normalization and identification of differentially expressed genes through statistical analysis. The Expresso microarray experiment management system implements these steps with a two-stage, log-linear ANOVA mixed model technique, tailored to individual experimental designs. The complement of tools in TM4, on the other hand, is based on a number of preset design choices that limit its flexibility. In the TM4 microarray analysis suite, normalization, filter, and analysis methods form an analysis pipeline. TM4 computes integrated intensity values (IIV) from the average intensities and spot pixel counts returned by the scanner software as input to its normalization steps. By contrast, Expresso can use either IIV data or median intensity values (MIV). Here, we compare Expresso and TM4 analysis of two experiments and assess the results against qRT-PCR data. Results: The Expresso analysis using MIV data consistently identifies more genes as differentially expressed, when compared to Expresso analysis with IIV data. The typical TM4 normalization and filtering pipeline corrects systematic intensity-specific bias on a per microarray basis. Subsequent statistical analysis with Expresso or a TM4 t-test can effectively identify differentially expressed genes. The best agreement with qRT-PCR data is obtained through the use of Expresso analysis and MIV data. Conclusion: The results of this research are of practical value to biologists who analyze microarray data sets. The TM4 normalization and filtering pipeline corrects microarray-specific systematic bias and complements the normalization stage in Expresso analysis. The results of Expresso using MIV data have the best agreement with qRT-PCR results. 
In one experiment, MIV is a better choice than IIV as input to data normalization and statistical analysis methods, as it yields a greater number of statistically significant differentially expressed genes; TM4 does not support the choice of MIV input data. Overall, the more flexible and extensive statistical models of Expresso achieve more accurate analytical results, when judged by the yardstick of qRT-PCR data, in the context of an experimental design of modest complexity. PMID:16626497
Watson, Christopher M.; Crinnion, Laura A.; Gurgel‐Gianetti, Juliana; Harrison, Sally M.; Daly, Catherine; Antanavicuite, Agne; Lascelles, Carolina; Markham, Alexander F.; Pena, Sergio D. J.; Bonthron, David T.
2015-01-01
Autozygosity mapping is a powerful technique for the identification of rare, autosomal recessive, disease-causing genes. The ease with which this category of disease gene can be identified has greatly increased through the availability of genome-wide SNP genotyping microarrays and subsequently of exome sequencing. Although these methods have simplified the generation of experimental data, its analysis, particularly when disparate data types must be integrated, remains time consuming. Moreover, the huge volume of sequence variant data generated from next generation sequencing experiments opens up the possibility of using these data instead of microarray genotype data to identify disease loci. To allow these two types of data to be used in an integrated fashion, we have developed AgileVCFMapper, a program that performs both the mapping of disease loci by SNP genotyping and the analysis of potentially deleterious variants using exome sequence variant data, in a single step. This method does not require microarray SNP genotype data, although analysis with a combination of microarray and exome genotype data enables more precise delineation of disease loci, due to superior marker density and distribution. PMID:26037133
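The core idea behind autozygosity mapping, looking for long stretches of consecutive homozygous genotypes, can be shown with a deliberately simple sketch. This is not how AgileVCFMapper works internally; the function name, the genotype encoding and the `min_markers` cutoff are hypothetical. A real analysis would also tolerate occasional genotyping errors and compare affected with unaffected relatives.

```python
def homozygous_runs(genotypes, min_markers):
    """Return (start, end, n_markers) for each run of consecutive homozygous calls.

    genotypes: list of (position, call) sorted by position on one chromosome,
    where call is a two-letter string such as "AA" or "AG".
    """
    runs = []
    current = []
    for position, call in genotypes:
        if call[0] == call[1]:
            current.append(position)
            continue
        # A heterozygous call ends the current run.
        if len(current) >= min_markers:
            runs.append((current[0], current[-1], len(current)))
        current = []
    if len(current) >= min_markers:
        runs.append((current[0], current[-1], len(current)))
    return runs


# Invented genotype calls at evenly spaced positions.
calls = ["AA", "GG", "AG", "CC", "CC", "TT", "TT", "AG", "AA"]
genotypes = [(1000 * (i + 1), call) for i, call in enumerate(calls)]
runs = homozygous_runs(genotypes, min_markers=3)
```

Only the four-marker stretch survives the cutoff; the shorter homozygous stretches at either end are discarded as too short to be informative.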
Automatic Identification and Quantification of Extra-Well Fluorescence in Microarray Images.
Rivera, Robert; Wang, Jie; Yu, Xiaobo; Demirkan, Gokhan; Hopper, Marika; Bian, Xiaofang; Tahsin, Tasnia; Magee, D Mitchell; Qiu, Ji; LaBaer, Joshua; Wallstrom, Garrick
2017-11-03
In recent studies involving NAPPA microarrays, extra-well fluorescence is used as a key measure for identifying disease biomarkers because there is evidence to support that it is better correlated with strong antibody responses than statistical analysis involving intraspot intensity. Because this feature is not well quantified by traditional image analysis software, identification and quantification of extra-well fluorescence is performed manually, which is both time-consuming and highly susceptible to variation between raters. A system that could automate this task efficiently and effectively would greatly improve the process of data acquisition in microarray studies, thereby accelerating the discovery of disease biomarkers. In this study, we experimented with different machine learning methods, as well as novel heuristics, for identifying spots exhibiting extra-well fluorescence (rings) in microarray images and assigning each ring a grade of 1-5 based on its intensity and morphology. The sensitivity of our final system for identifying rings was found to be 72% at 99% specificity and 98% at 92% specificity. Our system performs this task significantly faster than a human, while maintaining high performance, and therefore represents a valuable tool for microarray image analysis.
Integrative missing value estimation for microarray data.
Hu, Jianjun; Li, Haifeng; Waterman, Michael S; Zhou, Xianghong Jasmine
2006-10-12
Missing value estimation is an important preprocessing step in microarray analysis. Although several methods have been developed to solve this problem, their performance is unsatisfactory for datasets with high rates of missing data, high measurement noise, or limited numbers of samples. In fact, more than 80% of the time-series datasets in the Stanford Microarray Database contain fewer than eight samples. We present the integrative Missing Value Estimation method (iMISS), which incorporates information from multiple reference microarray datasets to improve missing value estimation. For each gene with missing data, we derive a consistent neighbor-gene list by taking reference data sets into consideration. To determine whether the given reference data sets are sufficiently informative for integration, we use a submatrix imputation approach. Our experiments showed that iMISS can significantly and consistently improve the accuracy of the state-of-the-art Local Least Square (LLS) imputation algorithm, by up to 15% in our benchmark tests. We demonstrated that the order-statistics-based integrative imputation algorithms can achieve significant improvements over state-of-the-art missing value estimation approaches such as LLS and are especially good for imputing microarray datasets with a limited number of samples, high rates of missing data, or very noisy measurements. With the rapid accumulation of microarray datasets, the performance of our approach can be further improved by incorporating larger and more appropriate reference datasets.
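To make the notion of a neighbor-gene list concrete, here is a minimal sketch of neighbour-based imputation. It uses plain k-nearest-neighbour averaging, which is a simpler stand-in and is neither LLS nor iMISS; the function name, the toy expression matrix and the choice k=2 are hypothetical.

```python
import math


def knn_impute(matrix, k):
    """Fill None entries with the mean of the k most similar genes' values.

    matrix: list of gene rows, each a list of floats or None (missing).
    Similarity is the root-mean-square difference over shared observed columns.
    """
    filled = [row[:] for row in matrix]
    for g, row in enumerate(matrix):
        for col, value in enumerate(row):
            if value is not None:
                continue
            candidates = []
            for h, other in enumerate(matrix):
                if h == g or other[col] is None:
                    continue  # a neighbour must have a value in this column
                shared = [(a, b) for a, b in zip(row, other)
                          if a is not None and b is not None]
                if not shared:
                    continue
                dist = math.sqrt(sum((a - b) ** 2 for a, b in shared) / len(shared))
                candidates.append((dist, other[col]))
            candidates.sort()
            nearest = [v for _, v in candidates[:k]]
            if nearest:
                filled[g][col] = sum(nearest) / len(nearest)
    return filled


# Toy matrix: rows are genes, columns are samples; one value is missing.
expression = [
    [1.0, 2.0, None, 4.0],
    [1.1, 2.1, 3.1, 4.1],
    [0.9, 1.9, 2.9, 3.9],
    [5.0, 1.0, 9.0, 0.5],
]
imputed = knn_impute(expression, k=2)
```

The missing entry is estimated from the two genes whose profiles track the incomplete gene, while the dissimilar fourth gene is ignored. With very few samples the neighbour list becomes unreliable, which is the weakness that borrowing information from reference datasets is meant to address.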
Rode, Tone Mari; Berget, Ingunn; Langsrud, Solveig; Møretrø, Trond; Holck, Askild
2009-07-01
Microorganisms are constantly exposed to new and altered growth conditions, and respond by changing gene expression patterns. Several methods for studying gene expression exist. During the last decade, the analysis of microarrays has been one of the most common approaches applied for large scale gene expression studies. A relatively new method for gene expression analysis is MassARRAY, which combines real competitive-PCR and MALDI-TOF (matrix-assisted laser desorption/ionization time-of-flight) mass spectrometry. In contrast to microarray methods, MassARRAY technology is suitable for analysing a larger number of samples, though for a smaller set of genes. In this study we compare the results from MassARRAY with microarrays on gene expression responses of Staphylococcus aureus exposed to acid stress at pH 4.5. RNA isolated from the same stress experiments was analysed using both the MassARRAY and the microarray methods. The MassARRAY and microarray methods showed good correlation. Both MassARRAY and microarray estimated somewhat lower fold changes compared with quantitative real-time PCR (qRT-PCR). The results confirmed the up-regulation of the urease genes in acidic environments, and also indicated the importance of metal ion regulation. This study shows that the MassARRAY technology is suitable for gene expression analysis in prokaryotes, and has advantages when a set of genes is being analysed for an organism exposed to many different environmental conditions.
A proposed metric for assessing the measurement quality of individual microarrays
Kim, Kyoungmi; Page, Grier P; Beasley, T Mark; Barnes, Stephen; Scheirer, Katherine E; Allison, David B
2006-01-01
Background High-density microarray technology is increasingly applied to study gene expression levels on a large scale. Microarray experiments rely on several critical steps that may introduce error and uncertainty in analyses. These steps include mRNA sample extraction, amplification and labeling, hybridization, and scanning. In some cases this may be manifested as systematic spatial variation on the surface of microarray in which expression measurements within an individual array may vary as a function of geographic position on the array surface. Results We hypothesized that an index of the degree of spatiality of gene expression measurements associated with their physical geographic locations on an array could indicate the summary of the physical reliability of the microarray. We introduced a novel way to formulate this index using a statistical analysis tool. Our approach regressed gene expression intensity measurements on a polynomial response surface of the microarray's Cartesian coordinates. We demonstrated this method using a fixed model and presented results from real and simulated datasets. Conclusion We demonstrated the potential of such a quantitative metric for assessing the reliability of individual arrays. Moreover, we showed that this procedure can be incorporated into laboratory practice as a means to set quality control specifications and as a tool to determine whether an array has sufficient quality to be retained in terms of spatial correlation of gene expression measurements. PMID:16430768
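The regression of intensities on a polynomial surface of array coordinates can be sketched as follows. The abstract does not state the polynomial order or the exact form of the index, so this sketch assumes a full second-order surface and uses the coefficient of determination (R squared) as a stand-in spatial index; the function name `spatial_r2` is hypothetical.

```python
import numpy as np

def spatial_r2(x, y, intensity):
    """R^2 of a quadratic surface fitted to spot intensities.

    Assumptions: second-order polynomial in (x, y); R^2 used as the index.
    A value near 0 suggests little spatial structure, a value near 1 suggests
    intensities are largely explained by position on the array.
    """
    x, y, z = (np.asarray(a, dtype=float) for a in (x, y, intensity))
    # Design matrix columns: 1, x, y, x^2, x*y, y^2
    design = np.column_stack([np.ones_like(x), x, y, x**2, x * y, y**2])
    beta, *_ = np.linalg.lstsq(design, z, rcond=None)
    residual = z - design @ beta
    ss_res = float(residual @ residual)
    ss_tot = float(((z - z.mean()) ** 2).sum())
    return 1.0 - ss_res / ss_tot
```

A laboratory could compare such an index against a cutoff to flag arrays for review; any specific cutoff would have to be calibrated on its own data.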
The XBabelPhish MAGE-ML and XML translator.
Maier, Don; Wymore, Farrell; Sherlock, Gavin; Ball, Catherine A
2008-01-18
MAGE-ML has been promoted as a standard format for describing microarray experiments and the data they produce. Two characteristics of the MAGE-ML format compromise its use as a universal standard: First, MAGE-ML files are exceptionally large - too large to be easily read by most people, and often too large to be read by most software programs. Second, the MAGE-ML standard permits many ways of representing the same information. As a result, different producers of MAGE-ML create different documents describing the same experiment and its data. Recognizing all the variants is an unwieldy software engineering task, resulting in software packages that can read and process MAGE-ML from some, but not all producers. This Tower of MAGE-ML Babel bars the unencumbered exchange of microarray experiment descriptions couched in MAGE-ML. We have developed XBabelPhish - an XQuery-based technology for translating one MAGE-ML variant into another. XBabelPhish's use is not restricted to translating MAGE-ML documents. It can transform XML files independent of their DTD, XML schema, or semantic content. Moreover, it is designed to work on very large (> 200 Mb.) files, which are common in the world of MAGE-ML. XBabelPhish provides a way to inter-translate MAGE-ML variants for improved interchange of microarray experiment information. More generally, it can be used to transform most XML files, including very large ones that exceed the capacity of most XML tools.
Deciphering the Function of New Gonococcal Vaccine Antigens Using Phenotypic Microarrays
Baarda, Benjamin I.; Emerson, Sarah; Proteau, Philip J.
2017-01-01
ABSTRACT The function and extracellular location of cell envelope proteins make them attractive candidates for developing vaccines against bacterial diseases, including challenging drug-resistant pathogens, such as Neisseria gonorrhoeae. A proteomics-driven reverse vaccinology approach has delivered multiple gonorrhea vaccine candidates; however, the biological functions of many of them remain to be elucidated. Herein, the functions of six gonorrhea vaccine candidates—NGO2121, NGO1985, NGO2054, NGO2111, NGO1205, and NGO1344—in cell envelope homeostasis were probed using phenotype microarrays under 1,056 conditions and a ΔbamE mutant (Δngo1780) as a reference of perturbed outer membrane integrity. Optimal growth conditions for an N. gonorrhoeae phenotype microarray assay in defined liquid medium were developed, which can be useful in other applications, including rapid and thorough antimicrobial susceptibility assessment. Our studies revealed 91 conditions having uniquely positive or negative effects on one of the examined mutants. A cluster analysis of 37 and 57 commonly beneficial and detrimental compounds, respectively, revealed three separate phenotype groups: NGO2121 and NGO1985; NGO1344 and BamE; and the trio of NGO1205, NGO2111, and NGO2054, with the last protein forming an independent branch of this cluster. Similar phenotypes were associated with loss of these vaccine candidates in the highly antibiotic-resistant WHO X strain. Based on their extensive sensitivity phenomes, NGO1985 and NGO2121 appear to be the most promising vaccine candidates. This study establishes the principle that phenotype microarrays can be successfully applied to a fastidious bacterial organism, such as N. gonorrhoeae. IMPORTANCE Innovative approaches are required to develop vaccines against prevalent and neglected sexually transmitted infections, such as gonorrhea. 
Herein, we have utilized phenotype microarrays in the first such investigation into Neisseria gonorrhoeae to probe the function of proteome-derived vaccine candidates in cell envelope homeostasis. Information gained from this screening can feed the vaccine candidate decision tree by providing insights into the roles these proteins play in membrane permeability, integrity, and overall N. gonorrhoeae physiology. The optimized screening protocol can be applied in investigations into the function of other hypothetical proteins of N. gonorrhoeae discovered in the expanding number of whole-genome sequences, in addition to revealing phenotypic differences between clinical and laboratory strains. PMID:28630127
2014-01-01
Background The production of biofuels in photosynthetic microalgae and cyanobacteria is a promising alternative to the generation of fuels from fossil resources. To be economically competitive, producer strains need to be established that synthesize the targeted product at high yield and over a long time. Engineering cyanobacteria into forced fuel producers should considerably interfere with overall cell homeostasis, which in turn might counteract productivity and sustainability of the process. Therefore, in-depth characterization of the cellular response upon long-term production is of high interest for the targeted improvement of a desired strain. Results The transcriptome-wide response to continuous ethanol production was examined in Synechocystis sp. PCC6803 using high resolution microarrays. In two independent experiments, ethanol production rates of 0.0338% (v/v) ethanol d-1 and 0.0303% (v/v) ethanol d-1 were obtained over 18 consecutive days, measuring two sets of biological triplicates in fully automated photobioreactors. Ethanol production caused a significant (~40%) delay in biomass accumulation, the development of a bleaching phenotype and a down-regulation of light harvesting capacity. However, microarray analyses performed at day 4, 7, 11 and 18 of the experiment revealed only three mRNAs with a strongly modified accumulation level throughout the course of the experiment. In addition to the overexpressed adhA (slr1192) gene, this was an approximately 4 fold reduction in cpcB (sll1577) and 3 to 6 fold increase in rps8 (sll1809) mRNA levels. Much weaker modifications of expression level or modifications restricted to day 18 of the experiment were observed for genes involved in carbon assimilation (Ribulose bisphosphate carboxylase and Glutamate decarboxylase). Molecular analysis of the reduced cpcB levels revealed a post-transcriptional processing of the cpcBA operon mRNA leaving a truncated mRNA cpcA* likely not competent for translation. 
Moreover, western blots and zinc-enhanced bilin fluorescence blots confirmed a severe reduction in the amounts of both phycocyanin subunits, explaining the cause of the bleaching phenotype. Conclusions Changes in gene expression upon induction of long-term ethanol production in Synechocystis sp. PCC6803 are highly specific. In particular, we did not observe a comprehensive stress response as might have been expected. PMID:24502290
Applying dynamic Bayesian networks to perturbed gene expression data.
Dojer, Norbert; Gambin, Anna; Mizera, Andrzej; Wilczyński, Bartek; Tiuryn, Jerzy
2006-05-08
A central goal of molecular biology is to understand the regulatory mechanisms of gene transcription and protein synthesis. Because of their solid basis in statistics, which allows them to deal with the stochastic aspects of gene expression and noisy measurements in a natural way, Bayesian networks appear attractive for inferring the structure of gene interactions from microarray experiment data. However, the basic formalism has some disadvantages, e.g. it is sometimes hard to distinguish between the origin and the target of an interaction. Two kinds of microarray experiments yield data particularly rich in information regarding the direction of interactions: time series and perturbation experiments. In order to handle them correctly, the basic formalism must be modified. For example, dynamic Bayesian networks (DBN) apply to time series microarray data. To our knowledge the DBN technique has not been applied in the context of perturbation experiments. We extend the framework of dynamic Bayesian networks in order to incorporate perturbations. Moreover, an exact algorithm for inferring an optimal network is proposed and a discretization method specialized for time series data from perturbation experiments is introduced. We apply our procedure to data from realistic simulations. The results are compared with those obtained by standard DBN learning techniques. Moreover, the advantages of using an exact learning algorithm instead of heuristic methods are analyzed. We show that the quality of inferred networks dramatically improves when using data from perturbation experiments. We also conclude that the exact algorithm should be used when it is possible, i.e. when the considered set of genes is small enough.
MADGE: scalable distributed data management software for cDNA microarrays.
McIndoe, Richard A; Lanzen, Aaron; Hurtz, Kimberly
2003-01-01
The human genome project and the development of new high-throughput technologies have created unparalleled opportunities to study the mechanism of diseases, monitor the disease progression and evaluate effective therapies. Gene expression profiling is a critical tool to accomplish these goals. The use of nucleic acid microarrays to assess the gene expression of thousands of genes simultaneously has seen phenomenal growth over the past five years. Although commercial sources of microarrays exist, investigators wanting more flexibility in the genes represented on the array will turn to in-house production. The creation and use of cDNA microarrays is a complicated process that generates an enormous amount of information. Effective data management of this information is essential to efficiently access, analyze, troubleshoot and evaluate the microarray experiments. We have developed a distributable software package designed to track and store the various pieces of data generated by a cDNA microarray facility. This includes the clone collection storage data, annotation data, workflow queues, microarray data, data repositories, sample submission information, and project/investigator information. This application was designed using a 3-tier client server model. The data access layer (1st tier) contains the relational database system tuned to support a large number of transactions. The data services layer (2nd tier) is a distributed COM server with full database transaction support. The application layer (3rd tier) is an internet based user interface that contains both client and server side code for dynamic interactions with the user. This software is freely available to academic institutions and non-profit organizations at http://www.genomics.mcg.edu/niddkbtc.
Salehi, Reza; Tsoi, Stephen C M; Colazo, Marcos G; Ambrose, Divakar J; Robert, Claude; Dyck, Michael K
2017-01-30
Early embryonic loss is a large contributor to infertility in cattle. Moreover, cattle are an interesting model for studying human preimplantation embryo development because of their similar developmental process. Although genetic factors are known to affect early embryonic development, the discovery of such factors has been a serious challenge. Microarray technology allows quantitative measurement and gene expression profiling of transcript levels on a genome-wide basis. One of the main decisions that has to be made when planning a microarray experiment is whether to use a one- or two-color approach. A two-color design increases technical replication, minimizes variability, and improves sensitivity and accuracy, as well as allowing loop designs and the definition of common reference samples. Although the microarray is a powerful biological tool, there are potential pitfalls that can attenuate its power. Hence, in this technical paper we demonstrate an optimized protocol for RNA extraction, amplification, labeling, hybridization of the labeled amplified RNA to the array, array scanning and data analysis using the two-color analysis strategy.
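As a small illustration of the data analysis step in a two-color experiment, the sketch below computes the commonly used log-ratio (M) and average log-intensity (A) values from the two channel intensities. The abstract does not describe the authors' analysis pipeline, so this is only a generic sketch; the function name `ma_values` and the assumption of strictly positive, background-corrected intensities are mine.

```python
import numpy as np

def ma_values(red, green):
    """Return (M, A) for two-color spot intensities.

    M = log2(red) - log2(green)       (log fold change between channels)
    A = (log2(red) + log2(green)) / 2  (average log intensity)
    Assumption: intensities are positive and already background-corrected.
    """
    log_red = np.log2(np.asarray(red, dtype=float))
    log_green = np.log2(np.asarray(green, dtype=float))
    return log_red - log_green, 0.5 * (log_red + log_green)
```

For example, a spot with red intensity 4 and green intensity 1 has M = 2 (a fourfold difference) and A = 1.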
Metadata management and semantics in microarray repositories.
Kocabaş, F; Can, T; Baykal, N
2011-12-01
The number of microarray and other high-throughput experiments on primary repositories keeps increasing as do the size and complexity of the results in response to biomedical investigations. Initiatives have been started on standardization of content, object model, exchange format and ontology. However, there are backlogs and an inability to exchange data between microarray repositories, which indicate that there is a great need for a standard format and data management. We have introduced a metadata framework that includes a metadata card and semantic nets that make experimental results visible, understandable and usable. These are encoded in syntax encoding schemes and represented in RDF (Resource Description Framework), can be integrated with other metadata cards and semantic nets, and can be exchanged, shared and queried. We demonstrated the performance and potential benefits through a case study on a selected microarray repository. We concluded that the backlogs can be reduced and that exchange of information and asking of knowledge discovery questions can become possible with the use of this metadata framework.
Cross species analysis of microarray expression data
Lu, Yong; Huggins, Peter; Bar-Joseph, Ziv
2009-01-01
Motivation: Many biological systems operate in a similar manner across a large number of species or conditions. Cross-species analysis of sequence and interaction data is often applied to determine the function of new genes. In contrast to these static measurements, microarrays measure the dynamic, condition-specific response of complex biological systems. The recent exponential growth in microarray expression datasets allows researchers to combine expression experiments from multiple species to identify genes that are not only conserved in sequence but also operate in a similar way in the different species studied. Results: In this review we discuss the computational and technical challenges associated with these studies, the approaches that have been developed to address these challenges and the advantages of cross-species analysis of microarray data. We show how successful application of these methods leads to insights that cannot be obtained when analyzing data from a single species. We also highlight current open problems and discuss possible ways to address them. Contact: zivbj@cs.cmu.edu PMID:19357096
Clustering gene expression data based on predicted differential effects of GV interaction.
Pan, Hai-Yan; Zhu, Jun; Han, Dan-Fu
2005-02-01
Microarrays have become a popular biotechnology in biological and medical research. However, systematic and stochastic variabilities in microarray data are expected and unavoidable, with the result that the raw measurements carry inherent "noise" within microarray experiments. Currently, logarithmic ratios are usually analyzed by various clustering methods directly, which may introduce biased interpretation in identifying groups of genes or samples. In this paper, a statistical method based on mixed model approaches was proposed for microarray data cluster analysis. The underlying rationale of this method is to partition the observed total gene expression level into various variations caused by different factors using an ANOVA model, and to predict the differential effects of GV (gene by variety) interaction using the adjusted unbiased prediction (AUP) method. The predicted GV interaction effects can then be used as the inputs of cluster analysis. We illustrated the application of our method with a gene expression dataset and elucidated the utility of our approach using an external validation.
Wilkins, Ella J; Archibald, Alison D; Sahhar, Margaret A; White, Susan M
2016-11-01
Chromosomal microarray is an increasingly utilized diagnostic test, particularly in the pediatric setting. However, the clinical significance of copy number variants detected by this technology is not always understood, creating uncertainties in interpreting and communicating results. The aim of this study was to explore parents' experiences of an uncertain microarray result for their child. This research utilized a qualitative approach with a phenomenological methodology. Semi-structured interviews were conducted with nine parents of eight children who received an uncertain microarray result for their child, either a 16p11.2 microdeletion or 15q13.3 microdeletion. Interviews were transcribed verbatim and thematic analysis was used to identify themes within the data. Participants were unprepared for the abnormal test result. They had a complex perception of the extent of their child's condition and a mixed understanding of the clinical relevance of the result, but were accepting of the limitations of medical knowledge, and appeared to have adapted to the result. The test result was empowering for parents in terms of access to medical and educational services; however, they articulated significant unmet support needs. Participants expressed hope for the future, in particular that more information would become available over time. This research has demonstrated that parents of children who have an uncertain microarray result appeared to adapt to uncertainty and limited availability of information and valued honesty and empathic ongoing support from health professionals. Genetic health professionals are well positioned to provide such support and aid patients' and families' adaptation to their situation as well as promote empowerment. © 2016 Wiley Periodicals, Inc.
DNA microarray-based PCR ribotyping of Clostridium difficile.
Schneeberg, Alexander; Ehricht, Ralf; Slickers, Peter; Baier, Vico; Neubauer, Heinrich; Zimmermann, Stefan; Rabold, Denise; Lübke-Becker, Antina; Seyboldt, Christian
2015-02-01
This study presents a DNA microarray-based assay for fast and simple PCR ribotyping of Clostridium difficile strains. Hybridization probes were designed to query the modularly structured intergenic spacer region (ISR), which is also the template for conventional and PCR ribotyping with subsequent capillary gel electrophoresis (seq-PCR) ribotyping. The probes were derived from sequences available in GenBank as well as from theoretical ISR module combinations. A database of reference hybridization patterns was set up from a collection of 142 well-characterized C. difficile isolates representing 48 seq-PCR ribotypes. The reference hybridization patterns calculated by the arithmetic mean were compared using a similarity matrix analysis. The 48 investigated seq-PCR ribotypes revealed 27 array profiles that were clearly distinguishable. The most frequent human-pathogenic ribotypes 001, 014/020, 027, and 078/126 were discriminated by the microarray. C. difficile strains related to 078/126 (033, 045/FLI01, 078, 126, 126/FLI01, 413, 413/FLI01, 598, 620, 652, and 660) and 014/020 (014, 020, and 449) showed similar hybridization patterns, confirming their genetic relatedness, which was previously reported. A panel of 50 C. difficile field isolates was tested by seq-PCR ribotyping and the DNA microarray-based assay in parallel. Taking into account that the current version of the microarray does not discriminate some closely related seq-PCR ribotypes, all isolates were typed correctly. Moreover, seq-PCR ribotypes without reference profiles available in the database (ribotype 009 and 5 new types) were correctly recognized as new ribotypes, confirming the performance and expansion potential of the microarray. Copyright © 2015, American Society for Microbiology. All Rights Reserved.
Cross-species transcriptomic approach reveals genes in hamster implantation sites.
Lei, Wei; Herington, Jennifer; Galindo, Cristi L; Ding, Tianbing; Brown, Naoko; Reese, Jeff; Paria, Bibhash C
2014-12-01
The mouse model has greatly contributed to understanding molecular mechanisms involved in the regulation of the progesterone (P4) plus estrogen (E)-dependent blastocyst implantation process. However, little is known about contributory molecular mechanisms of the P4-only-dependent blastocyst implantation process that occurs in species such as hamsters, guinea pigs, rabbits, pigs, rhesus monkeys, and perhaps humans. We used the hamster as a model of P4-only-dependent blastocyst implantation and carried out cross-species microarray (CSM) analyses to reveal differentially expressed genes at the blastocyst implantation site (BIS), in order to advance the understanding of molecular mechanisms of implantation. Upregulation of 112 genes and downregulation of 77 genes at the BIS were identified using a mouse microarray platform, while use of the human microarray revealed 62 up- and 38 down-regulated genes at the BIS. Excitingly, a sizable number of genes (30 up- and 11 down-regulated genes) were identified as a shared pool by both CSMs. Real-time RT-PCR and in situ hybridization validated the expression patterns of several up- and down-regulated genes identified by both CSMs at the hamster and mouse BIS to demonstrate the merit of CSM findings across species, in addition to revealing genes specific to hamsters. Functional annotation analysis found that genes involved in the spliceosome, proteasome, and ubiquitination pathways are enriched at the hamster BIS, while genes associated with tight junction, SAPK/JNK signaling, and PPARα/RXRα signaling are repressed at the BIS. Overall, this study provides a pool of genes and evidence of their participation in up- and down-regulated cellular functions/pathways at the hamster BIS. © 2014 Society for Reproduction and Fertility.
Phelps, Jamie P; Dao, Philip; Jin, Hongfan; Rasochova, Lada
2007-02-01
Coat protein of the cowpea chlorotic mottle virus (CCMV), a plant bromovirus, has been expressed in a soluble form in a prokaryote, Pseudomonas fluorescens, and assembled into virus-like particles (VLPs) in vivo that were structurally similar to the native CCMV particles derived from plants. The CCMV VLPs were purified by PEG precipitation followed by separation on a sucrose density gradient and analyzed by size exclusion chromatography, UV spectrometry, and transmission electron microscopy. DNA microarray experiments revealed that the VLPs encapsulated very large numbers of different host RNAs in a non-specific manner. The development of a P. fluorescens expression system now enables production of CCMV VLPs by bacterial fermentation for use in pharmaceutical or nanotechnology applications.
Strakova, Eva; Zikova, Alice; Vohradsky, Jiri
2014-01-01
A computational model of gene expression was applied to a novel test set of microarray time series measurements to reveal regulatory interactions between transcriptional regulators represented by 45 sigma factors and the genes expressed during germination of a prokaryote Streptomyces coelicolor. Using microarrays, the first 5.5 h of the process was recorded in 13 time points, which provided a database of gene expression time series on genome-wide scale. The computational modeling of the kinetic relations between the sigma factors, individual genes and genes clustered according to the similarity of their expression kinetics identified kinetically plausible sigma factor-controlled networks. Using genome sequence annotations, functional groups of genes that were predominantly controlled by specific sigma factors were identified. Using external binding data complementing the modeling approach, specific genes involved in the control of the studied process were identified and their function suggested.
Plasmonically amplified fluorescence bioassay with microarray format
NASA Astrophysics Data System (ADS)
Gogalic, S.; Hageneder, S.; Ctortecka, C.; Bauch, M.; Khan, I.; Preininger, Claudia; Sauer, U.; Dostalek, J.
2015-05-01
Plasmonic amplification of fluorescence signal in bioassays with microarray detection format is reported. A crossed relief diffraction grating was designed to couple an excitation laser beam to surface plasmons at the wavelength overlapping with the absorption and emission bands of fluorophore Dy647 that was used as a label. The surface of periodically corrugated sensor chip was coated with surface plasmon-supporting gold layer and a thin SU8 polymer film carrying epoxy groups. These groups were employed for the covalent immobilization of capture antibodies at arrays of spots. The plasmonic amplification of fluorescence signal on the developed microarray chip was tested by using interleukin 8 sandwich immunoassay. The readout was performed ex situ after drying the chip by using a commercial scanner with high numerical aperture collecting lens. Obtained results reveal the enhancement of fluorescence signal by a factor of 5 when compared to a regular glass chip.
Bikel, Shirley; Jacobo-Albavera, Leonor; Sánchez-Muñoz, Fausto; Cornejo-Granados, Fernanda; Canizales-Quinteros, Samuel; Soberón, Xavier; Sotelo-Mundo, Rogerio R; Del Río-Navarro, Blanca E; Mendoza-Vargas, Alfredo; Sánchez, Filiberto; Ochoa-Leyva, Adrian
2017-01-01
In spite of the emergence of RNA sequencing (RNA-seq), microarrays remain in widespread use for gene expression analysis in the clinic. There are over 767,000 RNA microarrays from human samples in public repositories, which are an invaluable resource for biomedical research and personalized medicine. Absolute gene expression analysis allows the transcriptome profiling of all expressed genes under a specific biological condition without the need for a reference sample. However, background fluorescence makes it challenging to determine absolute gene expression in microarrays. Given that the Y chromosome is absent in female subjects, we used it as a new approach for absolute gene expression analysis in which the fluorescence of the Y chromosome genes of female subjects was used as the background fluorescence for all the probes in the microarray. This fluorescence was used to establish an absolute gene expression threshold, allowing the differentiation between expressed and non-expressed genes in microarrays. We extracted RNA from leukocyte samples of 16 children (nine males and seven females, ages 6-10 years). An Affymetrix Gene Chip Human Gene 1.0 ST Array was run for each sample and the fluorescence of 124 genes of the Y chromosome was used to calculate the absolute gene expression threshold. After that, several expressed and non-expressed genes according to our absolute gene expression threshold were compared against the expression obtained using real-time quantitative polymerase chain reaction (RT-qPCR). From the 124 genes of the Y chromosome, three genes (DDX3Y, TXLNG2P and EIF1AY) that displayed significant differences between sexes were used to calculate the absolute gene expression threshold. Using this threshold, we selected 13 expressed and non-expressed genes and confirmed their expression level by RT-qPCR. Then, we selected the top 5% most expressed genes and found that several KEGG pathways were significantly enriched. 
Interestingly, these pathways were related to the typical functions of leukocyte cells, such as antigen processing and presentation and natural killer cell mediated cytotoxicity. We also applied this method to obtain the absolute gene expression threshold in already published microarray data of liver cells, where the top 5% expressed genes showed an enrichment of typical KEGG pathways for liver cells. Our results suggest that the three selected genes of the Y chromosome can be used to calculate an absolute gene expression threshold, allowing a transcriptome profiling of microarray data without the need for an additional reference experiment. Our approach based on the establishment of a threshold for absolute gene expression analysis will allow a new way to analyze thousands of microarrays from public databases. This allows the study of different human diseases without the need for additional samples for relative expression experiments.
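The thresholding idea can be sketched in a few lines. The abstract does not state the formula used to turn the female Y-chromosome probe fluorescence into a threshold, so the rule below (mean plus a multiple of the standard deviation) is purely an assumption for illustration, as are the function names `y_background_threshold` and `call_expressed` and the parameter `n_sd`.

```python
import numpy as np

def y_background_threshold(female_y_intensities, n_sd=2.0):
    """Background threshold from Y-gene probe intensities in female samples.

    Assumed rule (not from the source): mean + n_sd * standard deviation.
    Input shape is (samples, Y probes); values are log-scale intensities.
    """
    values = np.asarray(female_y_intensities, dtype=float).ravel()
    return float(values.mean() + n_sd * values.std())

def call_expressed(intensities, threshold):
    """Boolean mask: True where a probe exceeds the background threshold."""
    return np.asarray(intensities, dtype=float) > threshold
```

Because female samples carry no Y chromosome, any signal on these probes is treated as background, so a probe whose intensity clears the threshold is called expressed without needing a separate reference sample.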
NASA Technical Reports Server (NTRS)
Khaoustov, V. I.; Risin, D.; Pellis, N. R.; Yoffe, B.; McIntire, L. V. (Principal Investigator)
2001-01-01
Developed at NASA, the rotary cell culture system (RCCS) allows the creation of a unique microgravity environment of low shear force and high mass transfer, and enables three-dimensional (3D) cell culture of dissimilar cell types. Recently we demonstrated that simulated microgravity is conducive to maintaining long-term cultures of functional hepatocytes and promotes 3D cell assembly. Using deoxyribonucleic acid (DNA) microarray technology, it is now possible to measure the levels of thousands of different messenger ribonucleic acids (mRNAs) in a single hybridization step. This technique is particularly powerful for comparing gene expression in the same tissue under different environmental conditions. The aim of this research was to analyze gene expression of a hepatoblastoma cell line (HepG2) during the early stage of 3D cell assembly in simulated microgravity. For this, mRNA from HepG2 cultured in the RCCS was analyzed by DNA microarray. Analyses of HepG2 mRNA using a 6K glass DNA microarray revealed changes in expression of 95 genes (overexpression of 85 genes and downregulation of 10 genes). Our preliminary results indicated that simulated microgravity modifies the expression of several genes and that microarray technology may provide new understanding of the fundamental biological questions of how gravity affects the development and function of individual cells.
Characterizing biomarkers in osteosarcoma metastasis based on an ego-network.
Liu, Zhen; Song, Yan
2017-06-01
To characterize biomarkers that underlie osteosarcoma (OS) metastasis based on an ego-network. From the microarray data, we obtained 13,326 genes. By combining PPI data and microarray data, 10,520 shared genes were found and constructed into ego-networks. 17 significant ego-networks were identified with p < 0.05. In the pathway enrichment analysis, seven ego-networks were identified with the most significant pathway. These significant ego-modules were potential biomarkers that reveal the potential mechanisms in OS metastasis, which may contribute to understanding cancer prognoses and providing new perspectives in the treatment of cancer.
Long noncoding RNA OR3A4 promotes metastasis and tumorigenicity in gastric cancer
Guo, Xiaobo; Yang, Ziguo; Zhi, Qiaoming; Wang, Dan; Guo, Lei; Li, Guimei; Miao, Ruizhen; Shi, Yulong; Kuang, Yuting
2016-01-01
The contribution of long noncoding RNAs (lncRNAs) to metastasis of gastric cancer remains largely unknown. We used microarray analysis to identify lncRNAs differentially expressed between normal gastric tissues and gastric cancer tissues and validated these differences in quantitative real-time (qRT)-PCR experiments. The expression levels of lncRNA olfactory receptor, family 3, subfamily A, member 4 (OR3A4) were significantly associated with lymphatic metastasis, the depth of cancer invasion, and distal metastasis in 130 paired gastric cancer tissues. The effects of OR3A4 were assessed by overexpressing and silencing OR3A4 in gastric cancer cells. OR3A4 promoted cancer cell growth, angiogenesis, metastasis, and tumorigenesis in vitro and in vivo. Global microarray analysis combined with RT-PCR, RNA immunoprecipitation, and RNA pull-down analyses after OR3A4 transfection demonstrated that OR3A4 influenced biologic functions in gastric cancer cells via regulating the activation of PDLIM2, MACC1, NTN4, and GNB2L1. Our results reveal OR3A4 as an oncogenic lncRNA that promotes tumor progression. Therefore, lncRNAs might function as key regulatory hubs in gastric cancer progression. PMID:26863570
Duan, Fenghai; Xu, Ye
2017-01-01
To analyze a microarray experiment to identify the genes with expressions varying after the diagnosis of breast cancer. A total of 44 928 probe sets in an Affymetrix microarray dataset publicly available on Gene Expression Omnibus from 249 patients with breast cancer were analyzed by nonparametric multivariate adaptive splines. Then, the identified genes with turning points were grouped by K-means clustering, and their network relationship was subsequently analyzed by the Ingenuity Pathway Analysis. In total, 1640 probe sets (genes) were reliably identified as having turning points along the age at diagnosis in their expression profiles, of which 927 were expressed at lower levels after the turning points and 713 at higher levels after the turning points. K-means clustered them into 3 groups with turning points centering at 54, 62.5, and 72, respectively. The pathway analysis showed that the identified genes were actively involved in various cancer-related functions or networks. In this article, we applied the nonparametric multivariate adaptive splines method to a publicly available gene expression dataset and successfully identified genes with expressions varying before and after breast cancer diagnosis.
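The K-means grouping step described above can be illustrated on scalar data. The following is only a minimal sketch of Lloyd's algorithm in one dimension, not the authors' pipeline: the function name kmeans_1d, the initialization rule, and the toy turning-point ages are all assumptions made for illustration, and the toy values were deliberately chosen so that the centers land on values like those reported.

```python
def kmeans_1d(values, k, iters=100):
    """Lloyd's algorithm on scalar data; returns the sorted cluster centers."""
    ordered = sorted(values)
    # Deterministic start (an assumption): spread initial centers over the sorted data.
    centers = [ordered[(2 * i + 1) * len(ordered) // (2 * k)] for i in range(k)]
    for _ in range(iters):
        groups = [[] for _ in range(k)]
        for v in ordered:
            # Assign each value to its nearest current center.
            nearest = min(range(k), key=lambda j: abs(v - centers[j]))
            groups[nearest].append(v)
        # Recompute each center as the mean of its group; keep the old center if empty.
        updated = [sum(g) / len(g) if g else centers[j] for j, g in enumerate(groups)]
        if updated == centers:
            break
        centers = updated
    return sorted(centers)


# Hypothetical turning-point ages in years; NOT data from the study.
turning_points = [52, 53, 54, 55, 56, 61, 62, 63, 64, 70, 71, 72, 73, 74]
centers = kmeans_1d(turning_points, k=3)
print(centers)
```

With real data, the number of clusters and the initialization would need to be chosen and checked rather than fixed as here.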
arrayCGHbase: an analysis platform for comparative genomic hybridization microarrays
Menten, Björn; Pattyn, Filip; De Preter, Katleen; Robbrecht, Piet; Michels, Evi; Buysse, Karen; Mortier, Geert; De Paepe, Anne; van Vooren, Steven; Vermeesch, Joris; Moreau, Yves; De Moor, Bart; Vermeulen, Stefan; Speleman, Frank; Vandesompele, Jo
2005-01-01
Background The availability of the human genome sequence as well as the large number of physically accessible oligonucleotides, cDNA, and BAC clones across the entire genome has triggered and accelerated the use of several platforms for analysis of DNA copy number changes, amongst others microarray comparative genomic hybridization (arrayCGH). One of the challenges inherent to this new technology is the management and analysis of large numbers of data points generated in each individual experiment. Results We have developed arrayCGHbase, a comprehensive analysis platform for arrayCGH experiments consisting of a MIAME (Minimal Information About a Microarray Experiment) supportive database using MySQL underlying a data mining web tool, to store, analyze, interpret, compare, and visualize arrayCGH results in a uniform and user-friendly format. Following its flexible design, arrayCGHbase is compatible with all existing and forthcoming arrayCGH platforms. Data can be exported in a multitude of formats, including BED files to map copy number information on the genome using the Ensembl or UCSC genome browser. Conclusion ArrayCGHbase is a web based and platform independent arrayCGH data analysis tool, that allows users to access the analysis suite through the internet or a local intranet after installation on a private server. ArrayCGHbase is available at . PMID:15910681
Framework for Parallel Preprocessing of Microarray Data Using Hadoop
2018-01-01
Microarray technology has become one of the popular ways to study gene expression and diagnose disease. The National Center for Biotechnology Information (NCBI) hosts public databases containing large volumes of biological data that must be preprocessed, since they carry high levels of noise and bias. Robust Multiarray Average (RMA) is one of the standard and popular methods used to preprocess the data and remove noise. Most preprocessing algorithms are time-consuming and cannot handle a large number of datasets with thousands of experiments. Parallel processing can be used to address these issues. Hadoop is a well-known distributed file system framework that provides a parallel environment in which to run the experiment. In this research, for the first time, the capability of Hadoop and the statistical power of R have been leveraged to parallelize the RMA preprocessing algorithm in order to process microarray data efficiently. The experiment was run on a cluster containing 5 nodes, each with 16 cores and 16 GB of memory. It compares the efficiency and performance of parallelized RMA using Hadoop with parallelized RMA using the affyPara package as well as with sequential RMA. The results show that the speed-up rate of the proposed approach outperforms both the sequential approach and the affyPara approach. PMID:29796018
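RMA is commonly described as three steps: background correction, quantile normalization, and probe-set summarization by median polish. As a rough illustration of the middle step only, the sketch below forces every array to share the same intensity distribution. It is a plain, sequential Python sketch under stated assumptions (hypothetical function name and toy intensities, ties broken by position), not the Hadoop/R implementation the abstract refers to.

```python
def quantile_normalize(arrays):
    """Quantile-normalize equal-length intensity lists, one list per microarray.

    Simplification (an assumption of this sketch): ties are broken by position
    instead of being averaged.
    """
    n_probes = len(arrays[0])
    sorted_arrays = [sorted(a) for a in arrays]
    # Reference distribution: mean of the r-th smallest intensity over all arrays.
    reference = [
        sum(s[r] for s in sorted_arrays) / len(arrays) for r in range(n_probes)
    ]
    normalized = []
    for a in arrays:
        # Probe indices ordered from lowest to highest intensity on this array.
        order = sorted(range(n_probes), key=lambda i: a[i])
        out = [0.0] * n_probes
        for rank, probe in enumerate(order):
            out[probe] = reference[rank]
        normalized.append(out)
    return normalized


# Hypothetical raw intensities for 4 probes on 3 arrays; not real data.
raw = [[5.0, 2.0, 3.0, 4.0], [4.0, 1.0, 4.5, 2.0], [3.0, 4.0, 6.0, 8.0]]
norm = quantile_normalize(raw)
print(norm)
```

After normalization every array contains the same set of values, only in a different probe order, which is what makes arrays comparable before summarization.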
Mutual information estimation reveals global associations between stimuli and biological processes
Suzuki, Taiji; Sugiyama, Masashi; Kanamori, Takafumi; Sese, Jun
2009-01-01
Background Although microarray gene expression analysis has become popular, it remains difficult to interpret the biological changes caused by stimuli or variation of conditions. Clustering genes and associating each group with biological functions are often-used methods. However, such methods only detect partial changes within cell processes. Herein, we propose a method for discovering global changes within a cell by associating observed conditions of gene expression with gene functions. Results To elucidate the association, we introduce a novel feature selection method called Least-Squares Mutual Information (LSMI), which computes mutual information without density estimation and can therefore detect nonlinear associations within a cell. We demonstrate the effectiveness of LSMI through comparison with existing methods. The results of the application to yeast microarray datasets reveal that non-natural stimuli affect various biological processes, whereas others have no significant relation to specific cell processes. Furthermore, we discover that biological processes can be categorized into four types according to their responses to various stimuli: DNA/RNA metabolism, gene expression, protein metabolism, and protein localization. Conclusion We proposed a novel feature selection method called LSMI, and applied LSMI to mining the association between conditions of yeast and biological processes through microarray datasets. In fact, LSMI allows us to elucidate the global organization of cellular process control. PMID:19208155
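To see why mutual information can pick up nonlinear associations that a linear measure misses, consider the small sketch below. It is NOT LSMI: it uses the plain plug-in estimate from empirical frequencies on discrete data, which is exactly the kind of density-based estimate that LSMI, by the abstract's description, avoids. The function names and the toy values are hypothetical.

```python
from collections import Counter
from math import log2


def mutual_information(xs, ys):
    """Plug-in mutual information (in bits) of two discrete sequences."""
    n = len(xs)
    px, py, pxy = Counter(xs), Counter(ys), Counter(zip(xs, ys))
    # I(X;Y) = sum over (x, y) of p(x,y) * log2( p(x,y) / (p(x) * p(y)) )
    return sum(
        (c / n) * log2(c * n / (px[x] * py[y])) for (x, y), c in pxy.items()
    )


def covariance(xs, ys):
    """Sample covariance (population form), a purely linear measure."""
    mx, my = sum(xs) / len(xs), sum(ys) / len(ys)
    return sum((x - mx) * (y - my) for x, y in zip(xs, ys)) / len(xs)


# Toy nonlinear relation y = x^2: zero covariance, but clearly dependent.
xs = [-2, -1, 0, 1, 2]
ys = [x * x for x in xs]
mi_quadratic = mutual_information(xs, ys)
cov_quadratic = covariance(xs, ys)

# Toy independent pair: every combination occurs equally often.
mi_independent = mutual_information([0, 0, 1, 1], [0, 1, 0, 1])
print(mi_quadratic, cov_quadratic, mi_independent)
```

In the quadratic example the covariance is zero while the mutual information is well above zero, which is the property that motivates using mutual information for feature selection.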
HOXB9 Expression Correlates with Histological Grade and Prognosis in LSCC
2017-01-01
The purpose of this study was to investigate the HOX gene expression profile in laryngeal squamous cell carcinoma (LSCC) and assess whether some genes are associated with the clinicopathological features and prognosis in LSCC patients. The HOX gene levels were tested by microarray and validated by qRT-PCR in paired cancerous and adjacent noncancerous LSCC tissue samples. The microarray testing data of 39 HOX genes revealed 15 HOX genes that were at least 2-fold upregulated and 2 that were downregulated. After qRT-PCR evaluation, the three most upregulated genes (HOXB9, HOXB13, and HOXD13) were selected for tissue microarray (TMA) analysis. The correlations between the HOXB9, HOXB13, and HOXD13 expression levels and both clinicopathological features and prognosis were analyzed. Three HOX gene expression levels were markedly increased in LSCC tissues compared with adjacent noncancerous tissues (P < 0.001). HOXB9 was found to correlate with histological grade (P < 0.01) and prognosis (P < 0.01) in LSCC. In conclusion, this study revealed that HOXB9, HOXB13, and HOXD13 were upregulated and may play important roles in LSCC. Moreover, HOXB9 may serve as a novel marker of poor prognosis and a potential therapeutic target in LSCC patients. PMID:28808656
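The 2-fold screening criterion mentioned above amounts to a simple ratio test on paired tumor and adjacent-tissue measurements. The sketch below is an illustration under assumptions: the function name, the gene labels, and the intensities are hypothetical, and the symmetric cutoff for downregulation (ratio at or below 1/2) is an assumption, since the abstract only states the threshold for upregulation.

```python
def classify_fold_change(tumor, normal, threshold=2.0):
    """Label a gene by its tumor/normal expression ratio (linear scale)."""
    ratio = tumor / normal
    if ratio >= threshold:
        return "up"
    if ratio <= 1.0 / threshold:  # assumed symmetric cutoff for downregulation
        return "down"
    return "unchanged"


# Hypothetical (tumor, normal) intensities; labels and values are not from the study.
measurements = {
    "gene_a": (840.0, 200.0),
    "gene_b": (150.0, 130.0),
    "gene_c": (45.0, 120.0),
}
calls = {g: classify_fold_change(t, n) for g, (t, n) in measurements.items()}
print(calls)
```

A ratio filter like this only nominates candidates; the study itself went on to validate its candidates by qRT-PCR and tissue microarray.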
Persson, Anna-Karin; Gebauer, Mathias; Jordan, Suzana; Metz-Weidmann, Christiane; Schulte, Anke M; Schneider, Hans-Christoph; Ding-Pfennigdorff, Danping; Thun, Jonas; Xu, Xiao-Jun; Wiesenfeld-Hallin, Zsuzsanna; Darvasi, Ariel; Fried, Kaj; Devor, Marshall
2009-01-01
Background Nerve injury-triggered hyperexcitability in primary sensory neurons is considered a major source of chronic neuropathic pain. The hyperexcitability, in turn, is thought to be related to transcriptional switching in afferent cell somata. Analysis using expression microarrays has revealed that many genes are regulated in the dorsal root ganglion (DRG) following axotomy. But which contribute to pain phenotype versus other nerve injury-evoked processes such as nerve regeneration? Using the L5 spinal nerve ligation model of neuropathy we examined differential changes in gene expression in the L5 (and L4) DRGs in five mouse strains with contrasting susceptibility to neuropathic pain. We sought genes for which the degree of regulation correlates with strain-specific pain phenotype. Results In an initial experiment six candidate genes previously identified as important in pain physiology were selected for in situ hybridization to DRG sections. Among these, regulation of the Na+ channel α subunit Scn11a correlated with levels of spontaneous pain behavior, and regulation of the cool receptor Trpm8 correlated with heat hypersensibility. In a larger scale experiment, mRNA extracted from individual mouse DRGs was processed on Affymetrix whole-genome expression microarrays. Overall, 2552 ± 477 transcripts were significantly regulated in the axotomized L5DRG 3 days postoperatively. However, in only a small fraction of these was the degree of regulation correlated with pain behavior across strains. Very few genes in the "uninjured" L4DRG showed altered expression (24 ± 28). Conclusion Correlational analysis based on in situ hybridization provided evidence that differential regulation of Scn11a and Trpm8 contributes to across-strain variability in pain phenotype. This does not, of course, constitute evidence that the others are unrelated to pain. 
Correlational analysis based on microarray data yielded a larger "look-up table" of genes whose regulation likely contributes to pain variability. While this list is enriched in genes of potential importance for pain physiology, and is relatively free of the bias inherent in the candidate gene approach, additional steps are required to clarify which transcripts on the list are in fact of functional importance. PMID:19228393
A formal concept analysis approach to consensus clustering of multi-experiment expression data
2014-01-01
Background Presently, with the increasing number and complexity of available gene expression datasets, the combination of data from multiple microarray studies addressing a similar biological question is gaining importance. The analysis and integration of multiple datasets are expected to yield more reliable and robust results since they are based on a larger number of samples and the effects of the individual study-specific biases are diminished. This is supported by recent studies suggesting that important biological signals are often preserved or enhanced by multiple experiments. An approach to combining data from different experiments is the aggregation of their clusterings into a consensus or representative clustering solution which increases the confidence in the common features of all the datasets and reveals the important differences among them. Results We propose a novel generic consensus clustering technique that applies Formal Concept Analysis (FCA) approach for the consolidation and analysis of clustering solutions derived from several microarray datasets. These datasets are initially divided into groups of related experiments with respect to a predefined criterion. Subsequently, a consensus clustering algorithm is applied to each group resulting in a clustering solution per group. These solutions are pooled together and further analysed by employing FCA which allows extracting valuable insights from the data and generating a gene partition over all the experiments. In order to validate the FCA-enhanced approach two consensus clustering algorithms are adapted to incorporate the FCA analysis. Their performance is evaluated on gene expression data from multi-experiment study examining the global cell-cycle control of fission yeast. 
The FCA results derived from both methods demonstrate that, although the two algorithms optimize different clustering characteristics, FCA is able to overcome and diminish these differences and preserve some relevant biological signals. Conclusions The proposed FCA-enhanced consensus clustering technique is a general approach to the combination of clustering algorithms with FCA for deriving clustering solutions from multiple gene expression matrices. The experimental results presented herein demonstrate that it is a robust data integration technique able to produce a good-quality clustering solution that is representative of the whole set of expression matrices. PMID:24885407
Peschl, Patrick; Ramberger, Melanie; Höftberger, Romana; Jöhrer, Karin; Baumann, Matthias; Rostásy, Kevin; Reindl, Markus
2017-01-01
Acute disseminated encephalomyelitis (ADEM) is a rare autoimmune-mediated demyelinating disease affecting mainly children and young adults. Differentiation from multiple sclerosis is not always possible, due to overlapping clinical symptoms and recurrent and multiphasic forms. To date, immunoglobulins reactive to myelin oligodendrocyte glycoprotein (MOG antibodies) have been found in a subset of patients with ADEM. However, there are still patients lacking autoantibodies, necessitating the identification of new autoantibodies as biomarkers in those patients. Therefore, we aimed to identify novel autoantibody targets in ADEM patients. Sixteen ADEM patients (11 seronegative, 5 seropositive for MOG antibodies) were analysed for potential new biomarkers, using a protein microarray and immunohistochemistry on rat brain tissue to identify antibodies against intracellular and surface neuronal and glial antigens. Nine candidate antigens were identified in the protein microarray analysis in at least two patients per group. Immunohistochemistry on rat brain tissue did not reveal new target antigens. Although no new autoantibody targets could be found here, future studies should aim to identify new biomarkers for therapeutic and prognostic purposes. The microarray analysis and immunohistochemistry methods used here have several limitations, which should be considered in future searches for biomarkers. PMID:28327523
Engelmann, Brett W
2017-01-01
The Src Homology 2 (SH2) domain family primarily recognizes phosphorylated tyrosine (pY) containing peptide motifs. The relative affinity preferences among competing SH2 domains for phosphopeptide ligands define "specificity space," and underpins many functional pY mediated interactions within signaling networks. The degree of promiscuity exhibited and the dynamic range of affinities supported by individual domains or phosphopeptides is best resolved by a carefully executed and controlled quantitative high-throughput experiment. Here, I describe the fabrication and application of a cellulose-peptide conjugate microarray (CPCMA) platform to the quantitative analysis of SH2 domain specificity space. Included herein are instructions for optimal experimental design with special attention paid to common sources of systematic error, phosphopeptide SPOT synthesis, microarray fabrication, analyte titrations, data capture, and analysis.
CLIC, a tool for expanding biological pathways based on co-expression across thousands of datasets
Li, Yang; Liu, Jun S.; Mootha, Vamsi K.
2017-01-01
In recent years, there has been a huge rise in the number of publicly available transcriptional profiling datasets. These massive compendia comprise billions of measurements and provide a special opportunity to predict the function of unstudied genes based on co-expression to well-studied pathways. Such analyses can be very challenging, however, since biological pathways are modular and may exhibit co-expression only in specific contexts. To overcome these challenges we introduce CLIC, CLustering by Inferred Co-expression. CLIC accepts as input a pathway consisting of two or more genes. It then uses a Bayesian partition model to simultaneously partition the input gene set into coherent co-expressed modules (CEMs), while assigning the posterior probability for each dataset in support of each CEM. CLIC then expands each CEM by scanning the transcriptome for additional co-expressed genes, quantified by an integrated log-likelihood ratio (LLR) score weighted for each dataset. As a byproduct, CLIC automatically learns the conditions (datasets) within which a CEM is operative. We implemented CLIC using a compendium of 1774 mouse microarray datasets (28628 microarrays) or 1887 human microarray datasets (45158 microarrays). CLIC analysis reveals that of 910 canonical biological pathways, 30% consist of strongly co-expressed gene modules for which new members are predicted. For example, CLIC predicts a functional connection between protein C7orf55 (FMC1) and the mitochondrial ATP synthase complex that we have experimentally validated. CLIC is freely available at www.gene-clic.org. We anticipate that CLIC will be valuable both for revealing new components of biological pathways as well as the conditions in which they are active. PMID:28719601
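The point that pathways may be co-expressed only in specific contexts can be illustrated with a per-dataset correlation. The sketch below is a deliberate simplification and not CLIC itself: CLIC is described as using a Bayesian partition model and an integrated log-likelihood ratio score, whereas this example swaps in a plain Pearson correlation. The dataset names and expression values are hypothetical.

```python
from math import sqrt


def pearson(a, b):
    """Pearson correlation coefficient of two equal-length numeric lists."""
    ma, mb = sum(a) / len(a), sum(b) / len(b)
    cov = sum((x - ma) * (y - mb) for x, y in zip(a, b))
    var_a = sum((x - ma) ** 2 for x in a)
    var_b = sum((y - mb) ** 2 for y in b)
    return cov / sqrt(var_a * var_b)


# Hypothetical expression of one pathway gene and one candidate gene,
# measured in two separate datasets (contexts). Values are made up.
datasets = {
    "dataset_1": ([1.0, 2.0, 3.0, 4.0], [2.0, 4.0, 6.0, 8.0]),
    "dataset_2": ([1.0, 2.0, 3.0, 4.0], [3.0, 1.0, 4.0, 2.0]),
}
per_dataset_r = {
    name: pearson(pathway, candidate)
    for name, (pathway, candidate) in datasets.items()
}
print(per_dataset_r)
```

Here the two genes are perfectly correlated in one dataset and uncorrelated in the other, so pooling all samples would blur a relationship that is clear within a single context.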
Chavan, Shweta S; Bauer, Michael A; Peterson, Erich A; Heuck, Christoph J; Johann, Donald J
2013-01-01
Transcriptome analysis by microarrays has produced important advances in biomedicine. For instance in multiple myeloma (MM), microarray approaches led to the development of an effective disease subtyping via cluster assignment, and a 70 gene risk score. Both enabled an improved molecular understanding of MM, and have provided prognostic information for the purposes of clinical management. Many researchers are now transitioning to Next Generation Sequencing (NGS) approaches and RNA-seq in particular, due to its discovery-based nature, improved sensitivity, and dynamic range. Additionally, RNA-seq allows for the analysis of gene isoforms, splice variants, and novel gene fusions. Given the voluminous amounts of historical microarray data, there is now a need to associate and integrate microarray and RNA-seq data via advanced bioinformatic approaches. Custom software was developed following a model-view-controller (MVC) approach to integrate Affymetrix probe set-IDs, and gene annotation information from a variety of sources. The tool/approach employs an assortment of strategies to integrate, cross reference, and associate microarray and RNA-seq datasets. Output from a variety of transcriptome reconstruction and quantitation tools (e.g., Cufflinks) can be directly integrated, and/or associated with Affymetrix probe set data, as well as necessary gene identifiers and/or symbols from a diversity of sources. Strategies are employed to maximize the annotation and cross referencing process. Custom gene sets (e.g., MM 70 risk score (GEP-70)) can be specified, and the tool can be directly assimilated into an RNA-seq pipeline. A novel bioinformatic approach to aid in the facilitation of both annotation and association of historic microarray data, in conjunction with richer RNA-seq data, is now assisting with the study of MM cancer biology.
Strauss, Christian; Endimiani, Andrea; Perreten, Vincent
2015-01-01
A rapid and simple DNA labeling system has been developed for disposable microarrays and has been validated for the detection of 117 antibiotic resistance genes abundant in Gram-positive bacteria. The DNA was fragmented and amplified using phi-29 polymerase and random primers with linkers. Labeling and further amplification were then performed by classic PCR amplification using biotinylated primers specific for the linkers. The microarray developed by Perreten et al. (Perreten, V., Vorlet-Fawer, L., Slickers, P., Ehricht, R., Kuhnert, P., Frey, J., 2005. Microarray-based detection of 90 antibiotic resistance genes of gram-positive bacteria. J.Clin.Microbiol. 43, 2291-2302.) was improved by additional oligonucleotides. A total of 244 oligonucleotides (26 to 37 nucleotide length and with similar melting temperatures) were spotted on the microarray, including genes conferring resistance to clinically important antibiotic classes like β-lactams, macrolides, aminoglycosides, glycopeptides and tetracyclines. Each antibiotic resistance gene is represented by at least 2 oligonucleotides designed from consensus sequences of gene families. The specificity of the oligonucleotides and the quality of the amplification and labeling were verified by analysis of a collection of 65 strains belonging to 24 species. Association between genotype and phenotype was verified for 6 antibiotics using 77 Staphylococcus strains belonging to different species and revealed 95% test specificity and a 93% predictive value of a positive test. The DNA labeling and amplification is independent of the species and of the target genes and could be used for different types of microarrays. This system has also the advantage to detect several genes within one bacterium at once, like in Staphylococcus aureus strain BM3318, in which up to 15 genes were detected. 
This new microarray-based detection system offers large potential for applications in clinical diagnostics, basic research, food safety and surveillance programs for antimicrobial resistance. Copyright © 2014 Elsevier B.V. All rights reserved.
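Test specificity and the predictive value of a positive test, as reported above for the genotype-phenotype comparison, follow from a 2x2 table of array results against phenotypic resistance. The sketch below shows the standard definitions only; the counts are hypothetical and are not the study's data.

```python
def specificity(tn, fp):
    """Fraction of phenotypically susceptible isolates with a negative array result."""
    return tn / (tn + fp)


def positive_predictive_value(tp, fp):
    """Fraction of positive array results that are phenotypically resistant."""
    return tp / (tp + fp)


# Hypothetical counts for one antibiotic; NOT taken from the study.
tp, fp, tn, fn = 40, 2, 38, 5
spec = specificity(tn, fp)
ppv = positive_predictive_value(tp, fp)
print(f"specificity={spec:.3f}, PPV={ppv:.3f}")
```

Note that the predictive value depends on how common resistance is in the tested collection, so it does not transfer directly to a collection with a different resistance prevalence.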
Evaluation of artificial time series microarray data for dynamic gene regulatory network inference.
Xenitidis, P; Seimenis, I; Kakolyris, S; Adamopoulos, A
2017-08-07
High-throughput technology like microarrays is widely used in the inference of gene regulatory networks (GRNs). We focused on time series data since we are interested in the dynamics of GRNs and the identification of dynamic networks. We evaluated the amount of information that exists in artificial time series microarray data and the ability of an inference process to produce accurate models based on them. We used dynamic artificial gene regulatory networks in order to create artificial microarray data. Key features that characterize microarray data such as the time separation of directly triggered genes, the percentage of directly triggered genes and the triggering function type were altered in order to reveal the limits that are imposed by the nature of microarray data on the inference process. We examined the effect of various factors on the inference performance such as the network size, the presence of noise in microarray data, and the network sparseness. We used a system theory approach and examined the relationship between the pole placement of the inferred system and the inference performance. We examined the relationship between the inference performance in the time domain and the true system parameter identification. Simulation results indicated that time separation and the percentage of directly triggered genes are crucial factors. Also, network sparseness, the triggering function type and noise in input data affect the inference performance. When two factors were simultaneously varied, it was found that variation of one parameter significantly affects the dynamic response of the other. Crucial factors were also examined using a real GRN and acquired results confirmed simulation findings with artificial data. Different initial conditions were also used as an alternative triggering approach. Relevant results confirmed that the number of datasets constitutes the most significant parameter with regard to the inference performance. 
Copyright © 2017 Elsevier Ltd. All rights reserved.
Brodsky, Leonid; Leontovich, Andrei; Shtutman, Michael; Feinstein, Elena
2004-01-01
Mathematical methods of analysis of microarray hybridizations deal with gene expression profiles as elementary units. However, some of these profiles do not reflect a biologically relevant transcriptional response, but rather stem from technical artifacts. Here, we describe two technically independent but rationally interconnected methods for identification of such artifactual profiles. Our diagnostics are based on detection of deviations from uniformity, which is assumed as the main underlying principle of microarray design. Method 1 is based on detection of non-uniformity of microarray distribution of printed genes that are clustered based on the similarity of their expression profiles. Method 2 is based on evaluation of the presence of gene-specific microarray spots within the slides’ areas characterized by an abnormal concentration of low/high differential expression values, which we define as ‘patterns of differentials’. Applying two novel algorithms, for nested clustering (method 1) and for pattern detection (method 2), we can make a dual estimation of the profile’s quality for almost every printed gene. Genes with artifactual profiles detected by method 1 may then be removed from further analysis. Suspicious differential expression values detected by method 2 may be either removed or weighted according to the probabilities of patterns that cover them, thus diminishing their input in any further data analysis. PMID:14999086
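The idea behind method 1, checking whether the genes of one expression cluster are spread uniformly over the slide, can be approximated with a goodness-of-fit statistic. The sketch below is a stand-in under assumptions, not the nested clustering algorithm of the paper: it uses a plain chi-square test against a uniform layout, and the block counts are hypothetical.

```python
def chi_square_uniform(counts):
    """Chi-square statistic of observed counts against a uniform expectation."""
    expected = sum(counts) / len(counts)
    return sum((c - expected) ** 2 / expected for c in counts)


# Chi-square critical value for 3 degrees of freedom at alpha = 0.05.
CRITICAL_DF3_P05 = 7.815

# Hypothetical number of genes from one expression cluster found in each of
# four print blocks of a slide.
even_layout = chi_square_uniform([5, 5, 5, 5])
skewed_layout = chi_square_uniform([17, 1, 1, 1])
print(even_layout, skewed_layout, skewed_layout > CRITICAL_DF3_P05)
```

A cluster whose members pile up in one region of the slide, as in the second example, is a candidate for a spatial artifact rather than a shared biological response.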
Imholte, Gregory; Gottardo, Raphael
2017-01-01
Summary The peptide microarray immunoassay simultaneously screens sample serum against thousands of peptides, determining the presence of antibodies bound to array probes. Peptide microarrays tiling immunogenic regions of pathogens (e.g. envelope proteins of a virus) are an important high throughput tool for querying and mapping antibody binding. Because of the assay’s many steps, from probe synthesis to incubation, peptide microarray data can be noisy with extreme outliers. In addition, subjects may produce different antibody profiles in response to an identical vaccine stimulus or infection, due to variability among subjects’ immune systems. We present a robust Bayesian hierarchical model for peptide microarray experiments, pepBayes, to estimate the probability of antibody response for each subject/peptide combination. Heavy-tailed error distributions accommodate outliers and extreme responses, and tailored random effect terms automatically incorporate technical effects prevalent in the assay. We apply our model to two vaccine trial datasets to demonstrate model performance. Our approach enjoys high sensitivity and specificity when detecting vaccine induced antibody responses. A simulation study shows an adaptive thresholding classification method has appropriate false discovery rate control with high sensitivity, and receiver operating characteristics generated on vaccine trial data suggest that pepBayes clearly separates responses from non-responses. PMID:27061097
Mothers' appreciation of chromosomal microarray analysis for autism spectrum disorder.
Giarelli, Ellen; Reiff, Marian
2015-10-01
The aim of this study was to examine mothers' experiences with chromosomal microarray analysis (CMA) for a child with autism spectrum disorder (ASD). This is a descriptive qualitative study using thematic content analysis of in-depth interviews with 48 mothers of children who had genetic testing for ASD. The principal theme, "something is missing," included missing knowledge about genetics, information on use of the results, explanations of the relevance to the diagnosis, and relevance to life-long care. Two subordinate themes were (a) disappreciation of the helpfulness of scientific information to explain the diagnosis, and (b) returning to personal experience for interpretation. The test "appreciated" in value when results could be linked to the phenotype. © 2015, Wiley Periodicals, Inc.
Detection of pathogenic Vibrio spp. in shellfish by using multiplex PCR and DNA microarrays.
Panicker, Gitika; Call, Douglas R; Krug, Melissa J; Bej, Asim K
2004-12-01
This study describes the development of a gene-specific DNA microarray coupled with multiplex PCR for the comprehensive detection of pathogenic vibrios that are natural inhabitants of warm coastal waters and shellfish. Multiplex PCR with vvh and viuB for Vibrio vulnificus, with ompU, toxR, tcpI, and hlyA for V. cholerae, and with tlh, tdh, trh, and open reading frame 8 for V. parahaemolyticus helped to ensure that total and pathogenic strains, including subtypes of the three Vibrio spp., could be detected and discriminated. For DNA microarrays, oligonucleotide probes for these targeted genes were deposited onto epoxysilane-derivatized, 12-well, Teflon-masked slides by using a MicroGrid II arrayer. Amplified PCR products were hybridized to arrays at 50°C and detected by using tyramide signal amplification with Alexa Fluor 546 fluorescent dye. Slides were imaged by using an arrayWoRx scanner. The detection sensitivity for pure cultures without enrichment was 10^2 to 10^3 CFU/ml, and the specificity was 100%. However, 5 h of sample enrichment followed by DNA extraction with Instagene matrix and multiplex PCR with microarray hybridization resulted in the detection of 1 CFU in 1 g of oyster tissue homogenate. Thus, enrichment of the bacterial pathogens permitted higher sensitivity in compliance with the Interstate Shellfish Sanitation Conference guideline. Application of the DNA microarray methodology to natural oysters revealed the presence of V. vulnificus (100%) and V. parahaemolyticus (83%). However, V. cholerae was not detected in natural oysters. An assay involving a combination of multiplex PCR and DNA microarray hybridization would help to ensure rapid and accurate detection of pathogenic vibrios in shellfish, thereby improving the microbiological safety of shellfish for consumers.
Detection of Pathogenic Vibrio spp. in Shellfish by Using Multiplex PCR and DNA Microarrays
Panicker, Gitika; Call, Douglas R.; Krug, Melissa J.; Bej, Asim K.
2004-01-01
This study describes the development of a gene-specific DNA microarray coupled with multiplex PCR for the comprehensive detection of pathogenic vibrios that are natural inhabitants of warm coastal waters and shellfish. Multiplex PCR with vvh and viuB for Vibrio vulnificus, with ompU, toxR, tcpI, and hlyA for V. cholerae, and with tlh, tdh, trh, and open reading frame 8 for V. parahaemolyticus helped to ensure that total and pathogenic strains, including subtypes of the three Vibrio spp., could be detected and discriminated. For DNA microarrays, oligonucleotide probes for these targeted genes were deposited onto epoxysilane-derivatized, 12-well, Teflon-masked slides by using a MicroGrid II arrayer. Amplified PCR products were hybridized to arrays at 50°C and detected by using tyramide signal amplification with Alexa Fluor 546 fluorescent dye. Slides were imaged by using an arrayWoRx scanner. The detection sensitivity for pure cultures without enrichment was 10^2 to 10^3 CFU/ml, and the specificity was 100%. However, 5 h of sample enrichment followed by DNA extraction with Instagene matrix and multiplex PCR with microarray hybridization resulted in the detection of 1 CFU in 1 g of oyster tissue homogenate. Thus, enrichment of the bacterial pathogens permitted higher sensitivity in compliance with the Interstate Shellfish Sanitation Conference guideline. Application of the DNA microarray methodology to natural oysters revealed the presence of V. vulnificus (100%) and V. parahaemolyticus (83%). However, V. cholerae was not detected in natural oysters. An assay involving a combination of multiplex PCR and DNA microarray hybridization would help to ensure rapid and accurate detection of pathogenic vibrios in shellfish, thereby improving the microbiological safety of shellfish for consumers. PMID:15574946
Bruno, D L; Ganesamoorthy, D; Schoumans, J; Bankier, A; Coman, D; Delatycki, M; Gardner, R J M; Hunter, M; James, P A; Kannu, P; McGillivray, G; Pachter, N; Peters, H; Rieubland, C; Savarirayan, R; Scheffer, I E; Sheffield, L; Tan, T; White, S M; Yeung, A; Bowman, Z; Ngo, C; Choy, K W; Cacheux, V; Wong, L; Amor, D J; Slater, H R
2009-02-01
Microarray genome analysis is realising its promise for improving detection of genetic abnormalities in individuals with mental retardation and congenital abnormality. Copy number variations (CNVs) are now readily detectable using a variety of platforms and a major challenge is the distinction of pathogenic from ubiquitous, benign polymorphic CNVs. The aim of this study was to investigate replacement of time consuming, locus specific testing for specific microdeletion and microduplication syndromes with microarray analysis, which theoretically should detect all known syndromes with CNV aetiologies as well as new ones. Genome wide copy number analysis was performed on 117 patients using Affymetrix 250K microarrays. 434 CNVs (195 losses and 239 gains) were found, including 18 pathogenic CNVs and 9 identified as "potentially pathogenic". Almost all pathogenic CNVs were larger than 500 kb, significantly larger than the median size of all CNVs detected. Segmental regions of loss of heterozygosity larger than 5 Mb were found in 5 patients. Genome microarray analysis has improved diagnostic success in this group of patients. Several examples of recently discovered "new syndromes" were found suggesting they are more common than previously suspected and collectively are likely to be a major cause of mental retardation. The findings have several implications for clinical practice. The study revealed the potential to make genetic diagnoses that were not evident in the clinical presentation, with implications for pretest counselling and the consent process. The importance of contributing novel CNVs to high quality databases for genotype-phenotype analysis and review of guidelines for selection of individuals for microarray analysis is emphasised.
Shekhar, M S; Gomathi, A; Gopikrishna, G; Ponniah, A G
2015-06-01
White spot syndrome virus (WSSV) continues to be the most devastating viral pathogen infecting penaeid shrimp the world over. The genome of WSSV has been deciphered and characterized from three geographical isolates and significant progress has been made in developing various molecular diagnostic methods to detect the virus. However, the information on host immune gene response to WSSV pathogenesis is limited. Microarray analysis was carried out as an approach to analyse the gene expression in black tiger shrimp Penaeus monodon in response to WSSV infection. Gill tissues collected from the WSSV infected shrimp at 6, 24, 48 h and moribund stage were analysed for differential gene expression. Shrimp cDNAs of 40,059 unique sequences were considered for designing the microarray chip. The Cy3-labeled cRNA derived from healthy and WSSV-infected shrimp was subjected to hybridization with all the DNA spots in the microarray which revealed 8,633 and 11,147 as up- and down-regulated genes respectively at different time intervals post infection. The altered expression of these numerous genes represented diverse functions such as immune response, osmoregulation, apoptosis, nucleic acid binding, energy and metabolism, signal transduction, stress response and molting. The changes in gene expression profiles observed by microarray analysis provide molecular insights and a framework of genes which are up- and down-regulated at different time intervals during WSSV infection in shrimp. The microarray data were validated by Real Time analysis of four differentially expressed genes involved in apoptosis (translationally controlled tumor protein, inhibitor of apoptosis protein, ubiquitin conjugated enzyme E2 and caspase) for gene expression levels. The role of apoptosis related genes in WSSV infected shrimp is discussed herein.
Development and application of antibody microarray for lymphocystis disease virus detection in fish.
Sheng, Xiuzhen; Xu, Xiaoli; Zhan, Wenbin
2013-05-01
Lymphocystis disease virus (LCDV) is the causative agent of lymphocystis disease affecting marine and freshwater fish worldwide. Here an antibody microarray was developed and employed to detect LCDV in fish. Rabbit anti-LCDV serum was arrayed on agarose gel-modified slides as capture antibody, and Cy3-conjugated anti-LCDV monoclonal antibodies (MAbs) were added as detection antibody. The signals were imaged with a laser chip scanner and analyzed by corresponding software. To improve the sensitivity, different substrate binders (poly-L-lysine, MPTS, aldehyde, APES and agarose gel modified slides, and commercially available amino-modified slides), markers (fluorescein isothiocyanate, Cy3, horseradish peroxidase, biotin or colloidal gold) conjugated to anti-LCDV MAbs, and storage time of the antibody were assessed. The results showed that the antibody microarrays based on agarose gel-modified slides gave a lower detection limit of 0.55 μg/ml of LCDV when Cy3 and HRP conjugated anti-LCDV MAbs were used as detection antibody; and the lowest detectable LCDV protein concentration was 0.0686 μg/ml when streptavidin-biotin conjugated to anti-LCDV MAbs served as detection antibody. The developed antibody microarray proved to have a high specificity for LCDV detection and a shelf-life of more than 8 months at -20°C. Furthermore, the LCDV detection results of the microarray in fish gills or fins (n=50) presented a concordance rate of 100% with enzyme-linked immunosorbent assay (ELISA) and 98% with immunofluorescence assay technique (IFAT). These results revealed that the developed antibody microarray could serve as an effective tool for diagnostic and epidemiological studies of LCDV in fish. Copyright © 2013 Elsevier B.V. All rights reserved.
Translating standards into practice - one Semantic Web API for Gene Expression.
Deus, Helena F; Prud'hommeaux, Eric; Miller, Michael; Zhao, Jun; Malone, James; Adamusiak, Tomasz; McCusker, Jim; Das, Sudeshna; Rocca Serra, Philippe; Fox, Ronan; Marshall, M Scott
2012-08-01
Sharing and describing experimental results unambiguously with sufficient detail to enable replication of results is a fundamental tenet of scientific research. In today's cluttered world of "-omics" sciences, data standards and standardized use of terminologies and ontologies for biomedical informatics play an important role in reporting high-throughput experiment results in formats that can be interpreted by both researchers and analytical tools. Increasing adoption of Semantic Web and Linked Data technologies for the integration of heterogeneous and distributed health care and life sciences (HCLSs) datasets has made the reuse of standards even more pressing; dynamic semantic query federation can be used for integrative bioinformatics when ontologies and identifiers are reused across data instances. We present here a methodology to integrate the results and experimental context of three different representations of microarray-based transcriptomic experiments: the Gene Expression Atlas, the W3C BioRDF task force approach to reporting Provenance of Microarray Experiments, and the HSCI blood genomics project. Our approach does not attempt to improve the expressivity of existing standards for genomics but, instead, to enable integration of existing datasets published from microarray-based transcriptomic experiments. SPARQL Construct is used to create a posteriori mappings of concepts and properties and linking rules that match entities based on query constraints. We discuss how our integrative approach can encourage reuse of the Experimental Factor Ontology (EFO) and the Ontology for Biomedical Investigations (OBIs) for the reporting of experimental context and results of gene expression studies. Copyright © 2012 Elsevier Inc. All rights reserved.
Methods for processing microarray data.
Ares, Manuel
2014-02-01
Quality control must be maintained at every step of a microarray experiment, from RNA isolation through statistical evaluation. Here we provide suggestions for analyzing microarray data. Because the utility of the results depends directly on the design of the experiment, the first critical step is to ensure that the experiment can be properly analyzed and interpreted. What is the biological question? What is the best way to perform the experiment? How many replicates will be required to obtain the desired statistical resolution? Next, the samples must be prepared, pass quality controls for integrity and representation, and be hybridized and scanned. Also, slides with defects, missing data, high background, or weak signal must be rejected. Data from individual slides must be normalized and combined so that the data are as free of systematic bias as possible. The third phase is to apply statistical filters and tests to the data to determine genes (1) expressed above background, (2) whose expression level changes in different samples, and (3) whose RNA-processing patterns or protein associations change. Next, a subset of the data should be validated by an alternative method, such as reverse transcription-polymerase chain reaction (RT-PCR). Provided that this endorses the general conclusions of the array analysis, gene sets whose expression, splicing, polyadenylation, protein binding, etc. change in different samples can be classified with respect to function, sequence motif properties, as well as other categories to extract hypotheses for their biological roles and regulatory logic.
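The normalization and background-filtering steps described above can be illustrated with a minimal Python sketch. It assumes log2 intensities, median centering as the normalization method, and a twofold signal-to-background cutoff; none of these choices comes from the abstract, and the function names and slide values are hypothetical.

```python
from statistics import median

def median_center(log_values):
    """Subtract the slide median so each slide is centered on zero."""
    m = median(log_values)
    return [v - m for v in log_values]

def above_background(signal, background, min_ratio=2.0):
    """Flag spots whose raw signal is at least min_ratio times background.

    min_ratio=2.0 is an illustrative cutoff, not a recommended value.
    """
    return [s >= min_ratio * b for s, b in zip(signal, background)]

# Two hypothetical slides measuring the same four genes (log2 intensities).
slide_a = [8.0, 9.0, 10.0, 11.0]
slide_b = [9.0, 10.0, 11.0, 12.0]  # same pattern, uniformly brighter scan

norm_a = median_center(slide_a)
norm_b = median_center(slide_b)
```

After centering, the two slides carry identical values, which is the sense in which this kind of normalization removes a uniform brightness bias before slides are combined.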
Haram, Kerstyn M; Peltier, Heidi J; Lu, Bin; Bhasin, Manoj; Otu, Hasan H; Choy, Bob; Regan, Meredith; Libermann, Towia A; Latham, Gary J; Sanda, Martin G; Arredouani, Mohamed S
2008-10-01
Translation of preclinical studies into effective human cancer therapy is hampered by the lack of defined molecular expression patterns in mouse models that correspond to the human counterpart. We sought to generate an open source TRAMP mouse microarray dataset and to use this array to identify differentially expressed genes from human prostate cancer (PCa) that have concordant expression in TRAMP tumors, and thereby represent lead targets for preclinical therapy development. We performed microarrays on total RNA extracted and amplified from eight TRAMP tumors and nine normal prostates. A subset of differentially expressed genes was validated by QRT-PCR. Differentially expressed TRAMP genes were analyzed for concordant expression in publicly available human prostate array datasets and a subset of resulting genes was analyzed by QRT-PCR. Cross-referencing differentially expressed TRAMP genes to public human prostate array datasets revealed 66 genes with concordant expression in mouse and human PCa; 56 between metastases and normal and 10 between primary tumor and normal tissues. Of these 10 genes, two, Sox4 and Tubb2a, were validated by QRT-PCR. Our analysis also revealed various dysregulations in major biologic pathways in the TRAMP prostates. We report a TRAMP microarray dataset of which a gene subset was validated by QRT-PCR with expression patterns consistent with previous gene-specific TRAMP studies. Concordance analysis between TRAMP and human PCa associated genes supports the utility of the model and suggests several novel molecular targets for preclinical therapy.
Reboiro-Jato, Miguel; Arrais, Joel P; Oliveira, José Luis; Fdez-Riverola, Florentino
2014-01-30
The diagnosis and prognosis of several diseases can be shortened through the use of different large-scale genome experiments. In this context, microarrays can generate expression data for a huge set of genes. However, to obtain solid statistical evidence from the resulting data, it is necessary to train and to validate many classification techniques in order to find the best discriminative method. This is a time-consuming process that normally depends on intricate statistical tools. geneCommittee is a web-based interactive tool for routinely evaluating the discriminative classification power of custom hypotheses in the form of biologically relevant gene sets. While the user can work with different gene set collections and several microarray data files to configure specific classification experiments, the tool is able to run several tests in parallel. Provided with a straightforward and intuitive interface, geneCommittee is able to render valuable information for diagnostic analyses and clinical management decisions based on systematically evaluating custom hypotheses over different data sets using complementary classifiers, a key aspect in clinical research. geneCommittee allows the enrichment of raw microarray data with gene functional annotations, producing integrated datasets that simplify the construction of better discriminative hypotheses, and allows the creation of a set of complementary classifiers. The trained committees can then be used for clinical research and diagnosis. Full documentation including common use cases and guided analysis workflows is freely available at http://sing.ei.uvigo.es/GC/.
Jupiter, Daniel; Chen, Hailin; VanBuren, Vincent
2009-01-01
Background Although expression microarrays have become a standard tool used by biologists, analysis of data produced by microarray experiments may still present challenges. Comparison of data from different platforms, organisms, and labs may involve complicated data processing, and inferring relationships between genes remains difficult. Results STARNET 2 is a new web-based tool that allows post hoc visual analysis of correlations that are derived from expression microarray data. STARNET 2 facilitates user discovery of putative gene regulatory networks in a variety of species (human, rat, mouse, chicken, zebrafish, Drosophila, C. elegans, S. cerevisiae, Arabidopsis and rice) by graphing networks of genes that are closely co-expressed across a large heterogeneous set of preselected microarray experiments. For each of the represented organisms, raw microarray data were retrieved from NCBI's Gene Expression Omnibus for a selected Affymetrix platform. All pairwise Pearson correlation coefficients were computed for expression profiles measured on each platform, respectively. These precompiled results were stored in a MySQL database, and supplemented by additional data retrieved from NCBI. A web-based tool allows user-specified queries of the database, centered at a gene of interest. The result of a query includes graphs of correlation networks, graphs of known interactions involving genes and gene products that are present in the correlation networks, and initial statistical analyses. Two analyses may be performed in parallel to compare networks, which is facilitated by the new HEATSEEKER module. Conclusion STARNET 2 is a useful tool for developing new hypotheses about regulatory relationships between genes and gene products, and has coverage for 10 species. 
Interpretation of the correlation networks is supported with a database of previously documented interactions, a test for enrichment of Gene Ontology terms, and heat maps of correlation distances that may be used to compare two networks. The list of genes in a STARNET network may be useful in developing a list of candidate genes to use for the inference of causal networks. The tool is freely available at , and does not require user registration. PMID:19828039
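The core computation named in this abstract, pairwise Pearson correlation between expression profiles followed by a query centered on one gene, can be sketched as below. This is only an illustration under stated assumptions: the gene names, profiles, and the 0.9 cutoff are hypothetical, and the sketch does not reproduce STARNET 2's database or interface.

```python
import math

def pearson(x, y):
    """Pearson correlation of two equal-length profiles.

    Raises ZeroDivisionError if either profile is constant.
    """
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)

def neighbours(profiles, centre, min_r=0.9):
    """Genes whose correlation with the centre gene is at least min_r."""
    ref = profiles[centre]
    return sorted(
        gene for gene, prof in profiles.items()
        if gene != centre and pearson(ref, prof) >= min_r
    )

# Hypothetical expression profiles across four experiments.
profiles = {
    "geneA": [1.0, 2.0, 3.0, 4.0],
    "geneB": [2.0, 4.0, 6.0, 8.5],   # rises with geneA
    "geneC": [4.0, 3.0, 2.0, 1.0],   # mirror image of geneA
}
co_expressed = neighbours(profiles, "geneA")
```

Here only geneB passes the cutoff; geneC is perfectly anti-correlated and would need a cutoff on the absolute value of r to be included.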
Weighted analysis of paired microarray experiments.
Kristiansson, Erik; Sjögren, Anders; Rudemo, Mats; Nerman, Olle
2005-01-01
In microarray experiments quality often varies, for example between samples and between arrays. The need for quality control is therefore strong. A statistical model and a corresponding analysis method is suggested for experiments with pairing, including designs with individuals observed before and after treatment and many experiments with two-colour spotted arrays. The model is of mixed type with some parameters estimated by an empirical Bayes method. Differences in quality are modelled by individual variances and correlations between repetitions. The method is applied to three real and several simulated datasets. Two of the real datasets are of Affymetrix type with patients profiled before and after treatment, and the third dataset is of two-colour spotted cDNA type. In all cases, the patients or arrays had different estimated variances, leading to distinctly unequal weights in the analysis. We suggest also plots which illustrate the variances and correlations that affect the weights computed by our analysis method. For simulated data the improvement relative to previously published methods without weighting is shown to be substantial.
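To show why unequal variances lead to unequal weights, here is a minimal Python sketch of an inverse-variance weighted mean of paired differences. It is a deliberate simplification: the variances are taken as known rather than estimated, the correlations between repetitions and the empirical Bayes step described in the abstract are omitted, and all numbers are hypothetical.

```python
def weighted_mean_difference(before, after, variances):
    """Inverse-variance weighted mean of paired (after - before) differences."""
    diffs = [a - b for b, a in zip(before, after)]
    weights = [1.0 / v for v in variances]
    return sum(w * d for w, d in zip(weights, diffs)) / sum(weights)

# Hypothetical log-expression of one gene in three patients.
before = [5.0, 5.0, 5.0]
after = [6.0, 6.0, 9.0]
variances = [1.0, 1.0, 100.0]  # the third pair comes from a poor-quality array

weighted = weighted_mean_difference(before, after, variances)
unweighted = sum(a - b for b, a in zip(before, after)) / len(before)
```

The noisy third pair pulls the unweighted mean up to 2.0, while the weighted estimate stays close to the difference of 1.0 seen in the two reliable pairs.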
Oneda, Beatrice; Baldinger, Rosa; Reissmann, Regina; Reshetnikova, Irina; Krejci, Pavel; Masood, Rahim; Ochsenbein-Kölble, Nicole; Bartholdi, Deborah; Steindl, Katharina; Morotti, Denise; Faranda, Marzia; Baumer, Alessandra; Asadollahi, Reza; Joset, Pascal; Niedrist, Dunja; Breymann, Christian; Hebisch, Gundula; Hüsler, Margaret; Mueller, René; Prentl, Elke; Wisser, Josef; Zimmermann, Roland; Rauch, Anita
2014-06-01
The objective of this study was to determine for the first time the reliability and the diagnostic power of high-resolution microarray testing in routine prenatal diagnostics. We applied high-resolution chromosomal microarray testing in 464 cytogenetically normal prenatal samples with any indication for invasive testing. High-resolution testing revealed a diagnostic yield of 6.9% and 1.6% in cases of fetal ultrasound anomalies and cases of advanced maternal age (AMA), respectively, which is similar to previous studies using low-resolution microarrays. In three (0.6%) additional cases with an indication of AMA, an aberration in susceptibility risk loci was detected. Moreover, one case (0.2%) showed an X-linked aberration in a female fetus, a finding relevant for future family planning. We found the rate of cases, in which the parents had to be tested for interpretation of unreported copy number variants (3.7%), and the rate of remaining variants of unknown significance (0.4%) acceptably low. Of note, these findings did not cause termination of pregnancy after expert genetic counseling. The 0.4% rate of confined placental mosaicism was similar to that observed by conventional karyotyping and notably involved a case of placental microdeletion. High-resolution prenatal microarray testing is a reliable technique that increases diagnostic yield by at least 17.3% when compared with conventional karyotyping, without an increase in the frequency of variants of uncertain significance. © 2014 John Wiley & Sons, Ltd.
Kim, Chang Sup; Seo, Jeong Hyun; Cha, Hyung Joon
2012-08-07
The development of analytical tools is important for understanding the infection mechanisms of pathogenic bacteria or viruses. In the present work, a functional carbohydrate microarray combined with a fluorescence immunoassay was developed to analyze the interactions of Vibrio cholerae toxin (ctx) proteins and GM1-related carbohydrates. Ctx proteins were loaded onto the surface-immobilized GM1 pentasaccharide and six related carbohydrates, and their binding affinities were detected immunologically. The analysis of the ctx-carbohydrate interactions revealed that the intrinsic selectivity of ctx was GM1 pentasaccharide ≫ GM2 tetrasaccharide > asialo GM1 tetrasaccharide ≥ GM3 trisaccharide, indicating that a two-finger grip formation and the terminal monosaccharides play important roles in the ctx-GM1 interaction. In addition, whole cholera toxin (ctxAB5) had a stricter substrate specificity and a stronger binding affinity than only the cholera toxin B subunit (ctxB). On the basis of the quantitative analysis, the carbohydrate microarray showed the sensitivity of detection of the ctxAB5-GM1 interaction with a limit-of-detection (LOD) of 2 ng mL^-1 (23 pM), which is comparable to other reported high sensitivity assay tools. In addition, the carbohydrate microarray successfully detected the actual toxin directly secreted from V. cholerae, without showing cross-reactivity to other bacteria. Collectively, these results demonstrate that the functional carbohydrate microarray is suitable for analyzing toxin protein-carbohydrate interactions and can be applied as a biosensor for toxin detection.
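The two forms of the detection limit quoted above (a mass concentration and a molar concentration) are linked by the molar mass of the analyte. The short Python sketch below works through that unit conversion; the abstract does not state the molar mass that was used, so the value here is only the one implied by the two reported figures, and the function name is hypothetical.

```python
def ng_per_ml_to_pM(ng_per_ml, molar_mass_g_per_mol):
    """Convert a mass concentration in ng/mL to a molar concentration in pM."""
    grams_per_litre = ng_per_ml * 1e-6  # 1 ng/mL = 1e-9 g / 1e-3 L = 1e-6 g/L
    return grams_per_litre / molar_mass_g_per_mol * 1e12  # mol/L -> pmol/L

# Molar mass implied by the reported pair of values (2 ng/mL and 23 pM).
implied_molar_mass = (2 * 1e-6) / (23 * 1e-12)  # g/mol

# Round trip: converting 2 ng/mL with that molar mass gives back about 23 pM.
lod_pM = ng_per_ml_to_pM(2, implied_molar_mass)
```

The implied molar mass comes out near 87,000 g/mol, so the two reported numbers are mutually consistent for a protein complex of roughly that size.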
Bacterial identification and subtyping using DNA microarray and DNA sequencing.
Al-Khaldi, Sufian F; Mossoba, Magdi M; Allard, Marc M; Lienau, E Kurt; Brown, Eric D
2012-01-01
The era of fast and accurate discovery of biological sequence motifs in prokaryotic and eukaryotic cells is here. The co-evolution of direct genome sequencing and DNA microarray strategies not only will identify, isotype, and serotype pathogenic bacteria, but also it will aid in the discovery of new gene functions by detecting gene expressions in different diseases and environmental conditions. Microarray bacterial identification has made great advances in working with pure and mixed bacterial samples. The technological advances have moved beyond bacterial gene expression to include bacterial identification and isotyping. Application of new tools such as mid-infrared chemical imaging improves detection of hybridization in DNA microarrays. The research in this field is promising and future work will reveal the potential of infrared technology in bacterial identification. On the other hand, DNA sequencing by using 454 pyrosequencing is so cost effective that the promise of $1,000 per bacterial genome sequence is becoming a reality. Pyrosequencing technology is a simple to use technique that can produce accurate and quantitative analysis of DNA sequences with a great speed. The deposition of massive amounts of bacterial genomic information in databanks is creating fingerprint phylogenetic analysis that will ultimately replace several technologies such as Pulsed Field Gel Electrophoresis. In this chapter, we will review (1) the use of DNA microarray using fluorescence and infrared imaging detection for identification of pathogenic bacteria, and (2) use of pyrosequencing in DNA cluster analysis to fingerprint bacterial phylogenetic trees.
Bikel, Shirley; Jacobo-Albavera, Leonor; Sánchez-Muñoz, Fausto; Cornejo-Granados, Fernanda; Canizales-Quinteros, Samuel; Soberón, Xavier; Sotelo-Mundo, Rogerio R.; del Río-Navarro, Blanca E.; Mendoza-Vargas, Alfredo; Sánchez, Filiberto
2017-01-01
Background In spite of the emergence of RNA sequencing (RNA-seq), microarrays remain in widespread use for gene expression analysis in the clinic. There are over 767,000 RNA microarrays from human samples in public repositories, which are an invaluable resource for biomedical research and personalized medicine. The absolute gene expression analysis allows the transcriptome profiling of all expressed genes under a specific biological condition without the need of a reference sample. However, the background fluorescence represents a challenge to determine the absolute gene expression in microarrays. Given that the Y chromosome is absent in female subjects, we used it as a new approach for absolute gene expression analysis in which the fluorescence of the Y chromosome genes of female subjects was used as the background fluorescence for all the probes in the microarray. This fluorescence was used to establish an absolute gene expression threshold, allowing the differentiation between expressed and non-expressed genes in microarrays. Methods We extracted RNA from leukocyte samples of 16 children (nine males and seven females, ages 6–10 years). An Affymetrix Gene Chip Human Gene 1.0 ST Array was run for each sample and the fluorescence of 124 genes of the Y chromosome was used to calculate the absolute gene expression threshold. After that, several expressed and non-expressed genes according to our absolute gene expression threshold were compared against the expression obtained using real-time quantitative polymerase chain reaction (RT-qPCR). Results From the 124 genes of the Y chromosome, three genes (DDX3Y, TXLNG2P and EIF1AY) that displayed significant differences between sexes were used to calculate the absolute gene expression threshold. Using this threshold, we selected 13 expressed and non-expressed genes and confirmed their expression level by RT-qPCR. 
Then, we selected the top 5% most expressed genes and found that several KEGG pathways were significantly enriched. Interestingly, these pathways were related to the typical functions of leukocytes cells, such as antigen processing and presentation and natural killer cell mediated cytotoxicity. We also applied this method to obtain the absolute gene expression threshold in already published microarray data of liver cells, where the top 5% expressed genes showed an enrichment of typical KEGG pathways for liver cells. Our results suggest that the three selected genes of the Y chromosome can be used to calculate an absolute gene expression threshold, allowing a transcriptome profiling of microarray data without the need of an additional reference experiment. Discussion Our approach based on the establishment of a threshold for absolute gene expression analysis will allow a new way to analyze thousands of microarrays from public databases. This allows the study of different human diseases without the need of having additional samples for relative expression experiments. PMID:29230367
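The thresholding idea in this abstract can be sketched in a few lines of Python. The abstract does not say how the threshold is derived from the female Y-chromosome signals, so the rule used here (mean plus two standard deviations) is purely an assumption for illustration, as are the function name, gene labels, and intensity values.

```python
from statistics import mean, stdev

def expression_threshold(background_signals, k=2.0):
    """Threshold = mean + k * SD of background signals (assumed rule, not the paper's)."""
    return mean(background_signals) + k * stdev(background_signals)

# Hypothetical log2 intensities of Y-chromosome probes measured in female samples,
# where no true Y-linked transcript can be present.
female_y_signals = [3.0, 3.2, 2.8, 3.1, 2.9]
threshold = expression_threshold(female_y_signals)

# Hypothetical probe intensities from a sample of interest.
sample = {"geneA": 7.5, "geneB": 3.0, "geneC": 3.2}
expressed = sorted(g for g, v in sample.items() if v > threshold)
```

Only geneA clears the threshold in this toy data; geneB and geneC sit within the range of signal seen for genes known to be absent, so they are called non-expressed.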
Loughridge, Alice B.; Greenwood, Benjamin N.; Day, Heidi E. W.; McQueen, Matthew B.; Fleshner, Monika
2013-01-01
Serotonin (5-HT) is implicated in the development of stress-related mood disorders in humans. Physical activity reduces the risk of developing stress-related mood disorders, such as depression and anxiety. In rats, 6 weeks of wheel running protects against stress-induced behaviors thought to resemble symptoms of human anxiety and depression. The mechanisms by which exercise confers protection against stress-induced behaviors, however, remain unknown. One way by which exercise could generate stress resistance is by producing plastic changes in gene expression in the dorsal raphe nucleus (DRN). The DRN has a high concentration of 5-HT neurons and is implicated in stress-related mood disorders. The goal of the current experiment was to identify changes in the expression of genes that could be novel targets of exercise-induced stress resistance in the DRN. Adult, male F344 rats were allowed voluntary access to running wheels for 6 weeks; exposed to inescapable stress or no stress; and sacrificed immediately and 2 h after stressor termination. Laser capture micro dissection selectively sampled the DRN. mRNA expression was measured using the whole genome Affymetrix microarray. Comprehensive data analyses of gene expression included differential gene expression, log fold change (LFC) contrast analyses with False Discovery Rate correction, KEGG and Wiki Web Gestalt pathway enrichment analyses, and Weighted Gene Correlational Network Analysis (WGCNA). Our results suggest that physically active rats exposed to stress modulate expression of twice the number of genes, and display a more rapid and strongly coordinated response, than sedentary rats. Bioinformatics analyses revealed several potential targets of stress resistance including genes that are related to immune processes, tryptophan metabolism, and circadian/diurnal rhythms. PMID:23717271
Sequential Multiplex Analyte Capturing for Phosphoprotein Profiling*
Poetz, Oliver; Henzler, Tanja; Hartmann, Michael; Kazmaier, Cornelia; Templin, Markus F.; Herget, Thomas; Joos, Thomas O.
2010-01-01
Microarray-based sandwich immunoassays can simultaneously detect dozens of proteins. However, their use in quantifying large numbers of proteins is hampered by cross-reactivity and incompatibilities caused by the immunoassays themselves. Sequential multiplex analyte capturing addresses these problems by repeatedly probing the same sample with different sets of antibody-coated, magnetic suspension bead arrays. As a miniaturized immunoassay format, suspension bead array-based assays fulfill the criteria of the ambient analyte theory, and our experiments reveal that the analyte concentrations are not significantly changed. The value of sequential multiplex analyte capturing was demonstrated by probing tumor cell line lysates for the abundance of seven different receptor tyrosine kinases and their degree of phosphorylation and by measuring the complex phosphorylation pattern of the epidermal growth factor receptor in the same sample from the same cavity. PMID:20682761
DigOut: viewing differential expression genes as outliers.
Yu, Hui; Tu, Kang; Xie, Lu; Li, Yuan-Yuan
2010-12-01
With regards to well-replicated two-conditional microarray datasets, the selection of differentially expressed (DE) genes is a well-studied computational topic, but for multi-conditional microarray datasets with limited or no replication, the same task is not properly addressed by previous studies. This paper adopts multivariate outlier analysis to analyze replication-lacking multi-conditional microarray datasets, finding that it performs significantly better than the widely used limit fold change (LFC) model in a simulated comparative experiment. Compared with the LFC model, the multivariate outlier analysis also demonstrates improved stability against sample variations in a series of manipulated real expression datasets. The reanalysis of a real non-replicated multi-conditional expression dataset series leads to satisfactory results. In conclusion, a multivariate outlier analysis algorithm, like DigOut, is particularly useful for selecting DE genes from non-replicated multi-conditional gene expression dataset.
Methylation oligonucleotide microarray: a novel tool to analyze methylation patterns
NASA Astrophysics Data System (ADS)
Hou, Peng; Ji, Meiju; He, Nongyao; Lu, Zuhong
2003-04-01
A new technique to analyze methylation patterns in several adjacent CpG sites was developed and is reported here. We selected a 336 bp segment of the 5'-untranslated region and the first exon of the p16Ink4a gene, which includes the most densely packed CpG fragment of the islands containing 32 CpG dinucleotides, as the investigated target. The probes that include all types of methylation patterns were designed to fabricate a DNA microarray to determine the methylation patterns of seven adjacent CpG dinucleotide sites. High accuracy and reproducibility were observed in several parallel experiments. The results led us to the conclusion that the methylation oligonucleotide microarray can be applied as a novel and powerful tool to map methylation patterns and changes in multiple CpG island loci in a variety of tumors.
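Covering all methylation patterns of seven adjacent CpG sites means covering every combination of methylated and unmethylated states. The Python sketch below simply enumerates those combinations, assuming each site has exactly two states; the M/U labels are hypothetical notation, and actual probe sequences are not derived here.

```python
from itertools import product

def methylation_patterns(n_sites):
    """All methylated (M) / unmethylated (U) combinations over n_sites CpG sites."""
    return ["".join(states) for states in product("MU", repeat=n_sites)]

patterns = methylation_patterns(7)
n_patterns = len(patterns)  # 2 ** 7 combinations
```

With two states per site, seven sites give 2^7 = 128 distinct patterns, from fully methylated (MMMMMMM) to fully unmethylated (UUUUUUU).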
Gene Expression Analysis: Teaching Students to Do 30,000 Experiments at Once with Microarray
ERIC Educational Resources Information Center
Carvalho, Felicia I.; Johns, Christopher; Gillespie, Marc E.
2012-01-01
Genome scale experiments routinely produce large data sets that require computational analysis, yet there are few student-based labs that illustrate the design and execution of these experiments. In order for students to understand and participate in the genomic world, teaching labs must be available where students generate and analyze large data…
Identification of Common Differentially Expressed Genes in Urinary Bladder Cancer
Zaravinos, Apostolos; Lambrou, George I.; Boulalas, Ioannis; Delakas, Dimitris; Spandidos, Demetrios A.
2011-01-01
Background Current diagnosis and treatment of urinary bladder cancer (BC) has shown great progress with the utilization of microarrays. Purpose Our goal was to identify common differentially expressed (DE) genes among clinically relevant subclasses of BC using microarrays. Methodology/Principal Findings BC samples and controls, both experimental and publicly available datasets, were analyzed by whole genome microarrays. We grouped the samples according to their histology and defined the DE genes in each sample individually, as well as in each tumor group. A dual analysis strategy was followed. First, experimental samples were analyzed and conclusions were formulated; and second, experimental sets were combined with publicly available microarray datasets and were further analyzed in search of common DE genes. The experimental dataset identified 831 genes that were DE in all tumor samples, simultaneously. Moreover, 33 genes were up-regulated and 85 genes were down-regulated in all 10 BC samples compared to the 5 normal tissues, simultaneously. Hierarchical clustering partitioned tumor groups in accordance to their histology. K-means clustering of all genes and all samples, as well as clustering of tumor groups, presented 49 clusters. K-means clustering of common DE genes in all samples revealed 24 clusters. Genes manifested various differential patterns of expression, based on PCA. YY1 and NFκB were among the most common transcription factors that regulated the expression of the identified DE genes. Chromosome 1 contained 32 DE genes, followed by chromosomes 2 and 11, which contained 25 and 23 DE genes, respectively. Chromosome 21 had the least number of DE genes. 
GO analysis revealed the prevalence of transport and binding genes in the common down-regulated DE genes; the prevalence of RNA metabolism and processing genes in the up-regulated DE genes; as well as the prevalence of genes responsible for cell communication and signal transduction in the DE genes that were down-regulated in T1-Grade III tumors and up-regulated in T2/T3-Grade III tumors. Combination of samples from all microarray platforms revealed 17 common DE genes (BMP4, CRYGD, DBH, GJB1, KRT83, MPZ, NHLH1, TACR3, ACTC1, MFAP4, SPARCL1, TAGLN, TPM2, CDC20, LHCGR, TM9SF1 and HCCS), 4 of which participate in numerous pathways. Conclusions/Significance The identification of the common DE genes among BC samples of different histology can provide further insight into the discovery of new putative markers. PMID:21483740
Reverse engineering biological networks: applications in immune responses to bio-toxins.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Martino, Anthony A.; Sinclair, Michael B.; Davidson, George S.
Our aim is to determine the network of events, or the regulatory network, that defines an immune response to a bio-toxin. As a model system, we are studying the T cell regulatory network triggered through tyrosine kinase receptor activation using a combination of pathway stimulation and time-series microarray experiments. Our approach is composed of five steps: (1) microarray experiments and data error analysis, (2) data clustering, (3) data smoothing and discretization, (4) network reverse engineering, and (5) network dynamics analysis and fingerprint identification. The technological outcome of this study is a suite of experimental protocols and computational tools that reverse engineer regulatory networks given gene expression data. The practical biological outcome of this work is an immune response fingerprint in terms of gene expression levels. Inferring regulatory networks from microarray data is a new field of investigation that is no more than five years old. To the best of our knowledge, this work is the first attempt that integrates experiments, error analyses, data clustering, inference, and network analysis to solve a practical problem. Our systematic approach of counting, enumeration, and sampling networks matching experimental data is new to the field of network reverse engineering. The resulting mathematical analyses and computational tools lead to new results on their own and should be useful to others who analyze and infer networks.
GenePublisher: Automated analysis of DNA microarray data.
Knudsen, Steen; Workman, Christopher; Sicheritz-Ponten, Thomas; Friis, Carsten
2003-07-01
GenePublisher, a system for automatic analysis of data from DNA microarray experiments, has been implemented with a web interface at http://www.cbs.dtu.dk/services/GenePublisher. Raw data are uploaded to the server together with a specification of the data. The server performs normalization, statistical analysis and visualization of the data. The results are run against databases of signal transduction pathways, metabolic pathways and promoter sequences in order to extract more information. The results of the entire analysis are summarized in report form and returned to the user.
Variation of gene expression in Bacillus subtilis samples of fermentation replicates.
Zhou, Ying; Yu, Wen-Bang; Ye, Bang-Ce
2011-06-01
Comprehensive gene expression profiling technologies have been widely used to compare wild and mutated microorganism samples or to assess molecular differences between various treatments. However, little is known about the normal variation of gene expression in microorganisms. In this study, an Agilent customized microarray representing 4,106 genes was used to quantify transcript levels of five replicate flasks to assess normal variation in Bacillus subtilis gene expression. CV analysis and analysis of variance were employed to investigate the normal variance of genes and the components of variance, respectively. The results showed that more than 80% of the total variation was caused by biological variance. For the 12 replicates, 451 of 4,106 genes exhibited variance with CV values over 10%. The functional category enrichment analysis demonstrated that these variable genes were mainly involved in cell type differentiation, cell type localization, cell cycle and DNA processing, and spore or cyst coat. Using power analysis, the minimal biological replicate number for a B. subtilis microarray experiment was determined to be six. The results contribute to the definition of the baseline level of variability in B. subtilis gene expression and emphasize the importance of replicate microarray experiments.
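The CV screen described in this abstract can be illustrated with a minimal Python sketch. It assumes the CV is the sample standard deviation divided by the mean of raw-scale intensities, and the gene names, intensities, and the helper functions `coefficient_of_variation` and `variable_genes` are hypothetical illustrations, not taken from the study.

```python
from statistics import mean, stdev


def coefficient_of_variation(values):
    """Return the sample CV (standard deviation / mean) of replicate values."""
    m = mean(values)
    if m == 0:
        raise ValueError("CV is undefined when the mean is zero")
    return stdev(values) / abs(m)


def variable_genes(expression, threshold=0.10):
    """Return the genes whose CV across replicates exceeds the threshold."""
    return [gene for gene, values in expression.items()
            if coefficient_of_variation(values) > threshold]


# Hypothetical intensities for three genes across five replicate flasks.
expression = {
    "geneA": [100.0, 102.0, 98.0, 101.0, 99.0],
    "geneB": [50.0, 70.0, 40.0, 65.0, 45.0],
    "geneC": [10.0, 10.0, 10.0, 10.0, 10.0],
}
flagged = variable_genes(expression)  # only geneB exceeds a CV of 10%
```

The 10% threshold mirrors the cutoff quoted in the abstract; whether the authors computed CV on raw or log-transformed values is not stated there.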
Recursive feature selection with significant variables of support vectors.
Tsai, Chen-An; Huang, Chien-Hsun; Chang, Ching-Wei; Chen, Chun-Houh
2012-01-01
The development of DNA microarrays enables researchers to screen thousands of genes simultaneously and also helps determine high- and low-expression level genes in normal and disease tissues. Selecting relevant genes for cancer classification is an important issue. Most gene selection methods use univariate ranking criteria and arbitrarily choose a threshold to select genes. However, the parameter setting may not be compatible with the selected classification algorithms. In this paper, we propose a new gene selection method (SVM-t) based on the use of t-statistics embedded in a support vector machine. We compared its performance to two similar SVM-based methods: SVM recursive feature elimination (SVMRFE) and recursive support vector machine (RSVM). The three methods were compared based on extensive simulation experiments and analyses of two published microarray datasets. In the simulation experiments, we found that the proposed method is more robust in selecting informative genes than SVMRFE and RSVM and capable of attaining good classification performance when the variations of informative and noninformative genes are different. In the analysis of two microarray datasets, the proposed method yields better performance in identifying fewer genes with good prediction accuracy, compared to SVMRFE and RSVM.
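For context, the univariate ranking baseline that this abstract contrasts with SVM-based selection can be sketched as follows. This is not the SVM-t method itself; it only ranks genes by the absolute Welch t-statistic between two classes, and the gene names, values, and the functions `welch_t` and `rank_genes` are hypothetical.

```python
from math import sqrt
from statistics import mean, variance


def welch_t(a, b):
    """Welch t-statistic for two samples (assumes non-zero pooled standard error)."""
    se = sqrt(variance(a) / len(a) + variance(b) / len(b))
    if se == 0:
        raise ValueError("both groups have zero variance")
    return (mean(a) - mean(b)) / se


def rank_genes(tumor, normal):
    """Return (gene, t) pairs sorted by decreasing absolute t-statistic."""
    scores = [(gene, welch_t(tumor[gene], normal[gene])) for gene in tumor]
    return sorted(scores, key=lambda pair: abs(pair[1]), reverse=True)


# Hypothetical log-expression values, three samples per class.
normal = {"gene1": [1.0, 1.2, 0.8], "gene2": [2.0, 2.1, 1.9], "gene3": [5.0, 4.0, 6.0]}
tumor = {"gene1": [3.0, 3.2, 2.8], "gene2": [2.0, 2.2, 1.8], "gene3": [6.0, 7.0, 5.0]}

ranking = rank_genes(tumor, normal)
top_genes = [gene for gene, _ in ranking]
```

Any cutoff applied to such a ranking (for example, keeping the top k genes) is the arbitrary threshold the abstract refers to.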
Seliger, Barbara; Dressler, Sven P.; Wang, Ena; Kellner, Roland; Recktenwald, Christian V.; Lottspeich, Friedrich; Marincola, Francesco M.; Baumgärtner, Maja; Atkins, Derek; Lichtenfels, Rudolf
2012-01-01
Results obtained from expression profilings of renal cell carcinoma using different “ome”-based approaches and comprehensive data analysis demonstrated that proteome-based technologies and cDNA microarray analyses complement each other during the discovery phase for disease-related candidate biomarkers. The integration of the respective data revealed the uniqueness and complementarities of the different technologies. While comparative cDNA microarray analyses, though restricted to upregulated targets, largely revealed genes involved in controlling gene/protein expression (19%) and signal transduction processes (13%), proteomics/PROTEOMEX-defined candidate biomarkers include enzymes of the cellular metabolism (36%), transport proteins (12%) and cell motility/structural molecules (10%). Candidate biomarkers defined by proteomics and PROTEOMEX are frequently shared, whereas the sharing rate between cDNA microarray and proteome-based profilings is limited. Putative candidate biomarkers provide insights into their cellular (dys)function and their diagnostic/prognostic value but still warrant further validation in larger patient numbers. Based on the fact that merely 3 candidate biomarkers were shared by all applied technologies, namely annexin A4, tubulin alpha-1A chain and ubiquitin carboxyl-terminal hydrolase L1, the analysis at a single hierarchical level of biological regulation seems to provide only limited results, thus emphasizing the importance and benefit of performing rather combinatorial screenings which can complement the standard clinical predictors. PMID:19235166
Abbey, Darren; Hickman, Meleah; Gresham, David; Berman, Judith
2011-01-01
Phenotypic diversity can arise rapidly through loss of heterozygosity (LOH) or by the acquisition of copy number variations (CNV) spanning whole chromosomes or shorter contiguous chromosome segments. In Candida albicans, a heterozygous diploid yeast pathogen with no known meiotic cycle, homozygosis and aneuploidy alter clinical characteristics, including drug resistance. Here, we developed a high-resolution microarray that simultaneously detects ∼39,000 single nucleotide polymorphism (SNP) alleles and ∼20,000 copy number variation loci across the C. albicans genome. An important feature of the array analysis is a computational pipeline that determines SNP allele ratios based upon chromosome copy number. Using the array and analysis tools, we constructed a haplotype map (hapmap) of strain SC5314 to assign SNP alleles to specific homologs, and we used it to follow the acquisition of loss of heterozygosity (LOH) and copy number changes in a series of derived laboratory strains. This high-resolution SNP/CGH microarray and the associated hapmap facilitated the phasing of alleles in lab strains and revealed detrimental genome changes that arose frequently during molecular manipulations of laboratory strains. Furthermore, it provided a useful tool for rapid, high-resolution, and cost-effective characterization of changes in allele diversity as well as changes in chromosome copy number in new C. albicans isolates. PMID:22384363
Rao, J; Liu, D; Zhang, N; He, H; Ge, F; Chen, C
2014-01-01
Fusarium wilt, caused by the soilborne pathogen Fusarium oxysporum f. sp. lilii, is the major disease of lily (Lilium L.). In order to isolate the genes differentially expressed in a resistant reaction to F. oxysporum in L. regale Wilson, a cDNA library was constructed from L. regale root during F. oxysporum infection using suppression subtractive hybridization (SSH), and a total of 585 unique expressed sequence tags (ESTs) were obtained. Furthermore, the gene expression profiles in the incompatible interaction between L. regale and F. oxysporum were revealed by oligonucleotide microarray analysis of the 585 unique ESTs in comparison to the compatible interaction between a susceptible Lilium Oriental Hybrid 'Siberia' and F. oxysporum. The result of expression profile analysis indicated that the genes encoding pathogenesis-related proteins (PRs), antioxidative stress enzymes, secondary metabolism enzymes, transcription factors, signal transduction proteins as well as a large number of unknown genes were involved in the early defense response of L. regale to F. oxysporum infection. Moreover, subsequent quantitative reverse transcription PCR (QRT-PCR) analysis confirmed the reliability of the oligonucleotide microarray data. In the present study, isolation of differentially expressed genes in L. regale during response to F. oxysporum helped to uncover the molecular mechanism associated with the resistance of L. regale against F. oxysporum.
Integrated analysis of chromosome copy number variation and gene expression in cervical carcinoma
Yan, Deng; Yi, Song; Chiu, Wang Chi; Qin, Liu Gui; Kin, Wong Hoi; Kwok Hung, Chung Tony; Linxiao, Han; Wai, Choy Kwong; Yi, Sui; Tao, Yang; Tao, Tang
2017-01-01
Objective This study was conducted to explore chromosomal copy number variations (CNV) and transcript expression and to examine pathways in cervical pathogenesis using genome-wide high resolution microarrays. Methods Genome-wide chromosomal CNVs were investigated in 6 cervical cancer cell lines by Human Genome CGH Microarray Kit (4x44K). Gene expression profiles in cervical cancer cell lines, primary cervical carcinoma and normal cervical epithelium tissues were also studied using the Whole Human Genome Microarray Kit (4x44K). Results Fifty common chromosomal CNVs were identified in the cervical cancer cell lines. Correlation analysis revealed that gene up-regulation or down-regulation is significantly correlated with genomic amplification (P=0.009) or deletion (P=0.006) events. Expression profiles were identified through cluster analysis. Gene annotation analysis indicated that cell cycle pathways were significantly (P=1.15E-08) affected in cervical cancer. Common CNVs were associated with cervical cancer. Conclusion Chromosomal CNVs may contribute to their transcript expression in cervical cancer. PMID:29312578
DOE Office of Scientific and Technical Information (OSTI.GOV)
Gardner, Shea N.; McLoughlin, Kevin; Be, Nicholas A.
Venezuelan equine encephalitis virus (VEEV) is a mosquito-borne alphavirus that has caused large outbreaks of severe illness in both horses and humans. New approaches are needed to rapidly infer the origin of a newly discovered VEEV strain, estimate its equine amplification and resultant epidemic potential, and predict human virulence phenotype. We performed whole genome single nucleotide polymorphism (SNP) analysis of all available VEE antigenic complex genomes, verified that a SNP-based phylogeny accurately captured the features of a phylogenetic tree based on multiple sequence alignment, and developed a high resolution genome-wide SNP microarray. We used the microarray to analyze a broad panel of VEEV isolates, found excellent concordance between array- and sequence-based SNP calls, genotyped unsequenced isolates, and placed them on a phylogeny with sequenced genomes. The microarray successfully genotyped VEEV directly from tissue samples of an infected mouse, bypassing the need for viral isolation, culture and genomic sequencing. Lastly, we identified genomic variants associated with serotypes and host species, revealing a complex relationship between genotype and phenotype.
Microarray analyses reveal distinct roles for Rel proteins in the Drosophila immune response
Pal, Subhamoy; Wu, Junlin; Wu, Louisa P.
2007-01-01
The NF-κB group of transcription factors plays an important role in mediating immune responses in organisms as diverse as insects and mammals. The fruit fly Drosophila melanogaster expresses three closely related NF-κB-like transcription factors: Dorsal, Dif, and Relish. To study their roles in vivo, we used microarrays to determine the effect of null mutations in individual Rel transcription factors on larval immune gene expression. Of the 188 genes that were significantly up-regulated in wildtype larvae upon bacterial challenge, overlapping but distinct groups of genes were affected in the Rel mutants. We also ectopically expressed Dorsal or Dif and used cDNA microarrays to determine the genes that were up-regulated in the presence of these transcription factors. This expression was sufficient to drive expression of some immune genes, suggesting redundancy in the regulation of these genes. Combining these data, we also identified novel genes that may be specific targets of Dif. PMID:17537510
De Hertogh, Benoît; De Meulder, Bertrand; Berger, Fabrice; Pierre, Michael; Bareke, Eric; Gaigneaux, Anthoula; Depiereux, Eric
2010-01-11
Recent reanalysis of spike-in datasets underscored the need for new and more accurate benchmark datasets for statistical microarray analysis. We present here a fresh method using biologically-relevant data to evaluate the performance of statistical methods. Our novel method ranks the probesets from a dataset composed of publicly-available biological microarray data and extracts subset matrices with precise information/noise ratios. Our method can be used to determine the capability of different methods to better estimate variance for a given number of replicates. The mean-variance and mean-fold change relationships of the matrices revealed a closer approximation of biological reality. Performance analysis refined the results from benchmarks published previously. We show that the Shrinkage t test (close to Limma) was the best of the methods tested, except when two replicates were examined, where the Regularized t test and the Window t test performed slightly better. The R scripts used for the analysis are available at http://urbm-cluster.urbm.fundp.ac.be/~bdemeulder/.
Kameue, Chiyoko; Tsukahara, Takamitsu; Ushida, Kazunari
2006-03-01
Butyrate induces apoptosis of various cancer cell lines in a p53-independent manner and inhibits the proliferation of cancer cells. In a previous report, we reported a significant reduction in tumor incidence in rat colon as a result of dietary sodium gluconate (GNA). The stimulation of apoptosis through enhanced butyrate production in the large intestine was involved in the antitumorigenic effect of GNA. In the present study, a cDNA microarray analysis was performed to investigate the particular mechanism involved in the antitumorigenic effect of GNA. Some up-regulated genes suggested by microarray analysis were further evaluated using real-time PCR. A microarray revealed that GNA regulates the expression of retinoic acid receptor (RAR) and retinoid X receptor (RXR), and several genes known as the target of retinoids in cancer cells. In other words, the antitumorigenic effect of GNA may involve the regulation of the retinoid signaling pathway by butyrate in a retinoid-independent manner.
Wang, Denong; Tang, Jin; Liu, Shaoyi
2015-01-01
Using carbohydrate microarrays, we explored potential natural ligands of antitumor monoclonal antibody HAE3. This antibody was raised against a murine mammary tumor antigen but was found to cross-react with a number of human epithelial tumors in tissues. Our carbohydrate microarray analysis reveals that HAE3 is specific for an O-glycan cryptic epitope that is normally hidden in the cores of blood group substances. Using HAE3 to screen tumor cell surface markers by flow cytometry, we found that the HAE3 glycoepitope, gpHAE3, was highly expressed by a number of human breast cancer cell lines, including some triple-negative cancers that lack the estrogen, progesterone, and Her2/neu receptors. Taken together, we demonstrate that HAE3 recognizes a conserved cryptic glycoepitope of blood group precursors, which is nevertheless selectively expressed and surface-exposed in certain breast tumor cells. The potential of this class of O-glycan cryptic antigens in breast cancer subtyping and targeted immunotherapy warrants further investigation. PMID:26539555
Integrated analysis of chromosome copy number variation and gene expression in cervical carcinoma.
Yan, Deng; Yi, Song; Chiu, Wang Chi; Qin, Liu Gui; Kin, Wong Hoi; Kwok Hung, Chung Tony; Linxiao, Han; Wai, Choy Kwong; Yi, Sui; Tao, Yang; Tao, Tang
2017-12-12
This study was conducted to explore chromosomal copy number variations (CNV) and transcript expression and to examine pathways in cervical pathogenesis using genome-wide high resolution microarrays. Genome-wide chromosomal CNVs were investigated in 6 cervical cancer cell lines by Human Genome CGH Microarray Kit (4x44K). Gene expression profiles in cervical cancer cell lines, primary cervical carcinoma and normal cervical epithelium tissues were also studied using the Whole Human Genome Microarray Kit (4x44K). Fifty common chromosomal CNVs were identified in the cervical cancer cell lines. Correlation analysis revealed that gene up-regulation or down-regulation is significantly correlated with genomic amplification (P=0.009) or deletion (P=0.006) events. Expression profiles were identified through cluster analysis. Gene annotation analysis indicated that cell cycle pathways were significantly (P=1.15E-08) affected in cervical cancer. Common CNVs were associated with cervical cancer. Chromosomal CNVs may contribute to their transcript expression in cervical cancer.
Mansourian, Robert; Mutch, David M; Antille, Nicolas; Aubert, Jerome; Fogel, Paul; Le Goff, Jean-Marc; Moulin, Julie; Petrov, Anton; Rytz, Andreas; Voegel, Johannes J; Roberts, Matthew-Alan
2004-11-01
Microarray technology has become a powerful research tool in many fields of study; however, the cost of microarrays often results in the use of a low number of replicates (k). Under circumstances where k is low, it becomes difficult to perform standard statistical tests to extract the most biologically significant experimental results. Other more advanced statistical tests have been developed; however, their use and interpretation often remain difficult to implement in routine biological research. The present work outlines a method that achieves sufficient statistical power for selecting differentially expressed genes under conditions of low k, while remaining as an intuitive and computationally efficient procedure. The present study describes a Global Error Assessment (GEA) methodology to select differentially expressed genes in microarray datasets, and was developed using an in vitro experiment that compared control and interferon-gamma treated skin cells. In this experiment, up to nine replicates were used to confidently estimate error, thereby enabling methods of different statistical power to be compared. Gene expression results of a similar absolute expression are binned, so as to enable a highly accurate local estimate of the mean squared error within conditions. The model then relates variability of gene expression in each bin to absolute expression levels and uses this in a test derived from the classical ANOVA. The GEA selection method is compared with both the classical and permutational ANOVA tests, and demonstrates an increased stability, robustness and confidence in gene selection. A subset of the selected genes were validated by real-time reverse transcription-polymerase chain reaction (RT-PCR). All these results suggest that GEA methodology is (i) suitable for selection of differentially expressed genes in microarray data, (ii) intuitive and computationally efficient and (iii) especially advantageous under conditions of low k. 
The GEA code for R software is freely available upon request to authors.
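The binning step described in this abstract (grouping genes of similar absolute expression to obtain a local estimate of the mean squared error) can be sketched in Python. This is a simplified illustration and not the authors' GEA implementation: it handles a single condition, assumes an equal number of replicates per gene so that the pooled error is the average of per-gene variances, and omits the ANOVA-derived test. The function `binned_mse`, the bin size, and the data are hypothetical.

```python
from statistics import mean, variance


def binned_mse(replicates, bin_size):
    """Pool within-condition variance over bins of genes with similar expression.

    replicates: dict mapping gene name -> replicate values for one condition.
    Returns a list of (mean expression of bin, pooled MSE of bin) tuples,
    ordered from the lowest to the highest expression bin.
    """
    # Per-gene (mean, variance) pairs, sorted by mean expression.
    stats = sorted((mean(v), variance(v)) for v in replicates.values())
    bins = []
    for start in range(0, len(stats), bin_size):
        chunk = stats[start:start + bin_size]
        bin_mean = mean([m for m, _ in chunk])
        pooled_mse = mean([s for _, s in chunk])
        bins.append((bin_mean, pooled_mse))
    return bins


# Hypothetical duplicate measurements for four genes in one condition.
replicates = {
    "g1": [1.0, 3.0],
    "g2": [2.0, 4.0],
    "g3": [10.0, 10.0],
    "g4": [11.0, 13.0],
}
bins = binned_mse(replicates, bin_size=2)
```

Pooling over a bin lets a gene with few replicates (such as g3, whose own variance estimate is zero) borrow an error estimate from genes expressed at a similar level, which is the motivation the abstract gives for low k.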
An efficient pseudomedian filter for tiling microrrays.
Royce, Thomas E; Carriero, Nicholas J; Gerstein, Mark B
2007-06-07
Tiling microarrays are becoming an essential technology in the functional genomics toolbox. They have been applied to the tasks of novel transcript identification, elucidation of transcription factor binding sites, detection of methylated DNA and several other applications in several model organisms. These experiments are being conducted at increasingly finer resolutions as the microarray technology enjoys increasingly greater feature densities. The increased densities naturally lead to increased data analysis requirements. Specifically, the most widely employed algorithm for tiling array analysis involves smoothing observed signals by computing pseudomedians within sliding windows, an O(n^2 log n) calculation in each window. This poor time complexity is an issue for tiling array analysis and could prove to be a real bottleneck as tiling microarray experiments become grander in scope and finer in resolution. We therefore implemented Monahan's HLQEST algorithm that reduces the runtime complexity for computing the pseudomedian of n numbers to O(n log n) from O(n^2 log n). For a representative tiling microarray dataset, this modification reduced the smoothing procedure's runtime by nearly 90%. We then leveraged the fact that elements within sliding windows remain largely unchanged in overlapping windows (as one slides across genomic space) to further reduce computation by an additional 43%. This was achieved by the application of skip lists to maintaining a sorted list of values from window to window. This sorted list could be maintained with simple O(log n) inserts and deletes. We illustrate the favorable scaling properties of our algorithms with both time complexity analysis and benchmarking on synthetic datasets. Tiling microarray analyses that rely upon a sliding window pseudomedian calculation can require many hours of computation. We have eased this requirement significantly by implementing efficient algorithms that scale well with genomic feature density. 
This result not only speeds the current standard analyses, but also makes possible ones where many iterations of the filter may be required, such as might be required in a bootstrap or parameter estimation setting. Source code and executables are available at http://tiling.gersteinlab.org/pseudomedian/.
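To make the baseline concrete, the following Python sketch shows the naive sliding-window pseudomedian that the abstract describes as the costly step. The pseudomedian (Hodges-Lehmann estimator) of n values is the median of all n(n+1)/2 pairwise averages, so sorting them costs O(n^2 log n) per window. This sketch does not implement Monahan's HLQEST algorithm or the skip-list optimization; the function names and the index-based window (rather than a window in genomic coordinates) are assumptions made for illustration.

```python
from statistics import median


def pseudomedian(values):
    """Naive pseudomedian: the median of all pairwise averages (x_i + x_j) / 2, i <= j."""
    n = len(values)
    walsh_averages = [(values[i] + values[j]) / 2
                      for i in range(n) for j in range(i, n)]
    return median(walsh_averages)


def sliding_pseudomedian(signal, half_width):
    """Smooth a signal by taking the pseudomedian in a window around each position."""
    smoothed = []
    for center in range(len(signal)):
        lo = max(0, center - half_width)
        hi = min(len(signal), center + half_width + 1)
        smoothed.append(pseudomedian(signal[lo:hi]))
    return smoothed


single = pseudomedian([1, 2, 10])                      # 3.75
smoothed = sliding_pseudomedian([0, 0, 9, 0, 0], 1)    # a lone spike is damped
```

Because neighbouring windows share most of their elements, recomputing every pairwise average from scratch repeats a great deal of work, which is the redundancy the abstract reports exploiting.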
An efficient pseudomedian filter for tiling microrrays
Royce, Thomas E; Carriero, Nicholas J; Gerstein, Mark B
2007-01-01
Background Tiling microarrays are becoming an essential technology in the functional genomics toolbox. They have been applied to the tasks of novel transcript identification, elucidation of transcription factor binding sites, detection of methylated DNA and several other applications in several model organisms. These experiments are being conducted at increasingly finer resolutions as the microarray technology enjoys increasingly greater feature densities. The increased densities naturally lead to increased data analysis requirements. Specifically, the most widely employed algorithm for tiling array analysis involves smoothing observed signals by computing pseudomedians within sliding windows, an O(n^2 log n) calculation in each window. This poor time complexity is an issue for tiling array analysis and could prove to be a real bottleneck as tiling microarray experiments become grander in scope and finer in resolution. Results We therefore implemented Monahan's HLQEST algorithm that reduces the runtime complexity for computing the pseudomedian of n numbers to O(n log n) from O(n^2 log n). For a representative tiling microarray dataset, this modification reduced the smoothing procedure's runtime by nearly 90%. We then leveraged the fact that elements within sliding windows remain largely unchanged in overlapping windows (as one slides across genomic space) to further reduce computation by an additional 43%. This was achieved by the application of skip lists to maintaining a sorted list of values from window to window. This sorted list could be maintained with simple O(log n) inserts and deletes. We illustrate the favorable scaling properties of our algorithms with both time complexity analysis and benchmarking on synthetic datasets. Conclusion Tiling microarray analyses that rely upon a sliding window pseudomedian calculation can require many hours of computation. 
We have eased this requirement significantly by implementing efficient algorithms that scale well with genomic feature density. This result not only speeds the current standard analyses, but also makes possible ones where many iterations of the filter may be required, such as might be required in a bootstrap or parameter estimation setting. Source code and executables are available at http://tiling.gersteinlab.org/pseudomedian/. PMID:17555595
Hierarchical Gene Selection and Genetic Fuzzy System for Cancer Microarray Data Classification
Nguyen, Thanh; Khosravi, Abbas; Creighton, Douglas; Nahavandi, Saeid
2015-01-01
This paper introduces a novel approach to gene selection based on a substantial modification of analytic hierarchy process (AHP). The modified AHP systematically integrates outcomes of individual filter methods to select the most informative genes for microarray classification. Five individual ranking methods including t-test, entropy, receiver operating characteristic (ROC) curve, Wilcoxon and signal to noise ratio are employed to rank genes. These ranked genes are then considered as inputs for the modified AHP. Additionally, a method that uses fuzzy standard additive model (FSAM) for cancer classification based on genes selected by AHP is also proposed in this paper. Traditional FSAM learning is a hybrid process comprising unsupervised structure learning and supervised parameter tuning. Genetic algorithm (GA) is incorporated in-between unsupervised and supervised training to optimize the number of fuzzy rules. The integration of GA enables FSAM to deal with the high-dimensional-low-sample nature of microarray data and thus enhance the efficiency of the classification. Experiments are carried out on numerous microarray datasets. Results demonstrate that the AHP-based gene selection outperforms the single ranking methods. Furthermore, the combination of AHP-FSAM shows high accuracy in microarray data classification compared to various competing classifiers. The proposed approach therefore is useful for medical practitioners and clinicians as a decision support system that can be implemented in real medical practice. PMID:25823003
Ontology-based, Tissue MicroArray oriented, image centered tissue bank
Viti, Federica; Merelli, Ivan; Caprera, Andrea; Lazzari, Barbara; Stella, Alessandra; Milanesi, Luciano
2008-01-01
Background Tissue MicroArray technique is becoming increasingly important in pathology for the validation of experimental data from transcriptomic analysis. This approach produces many images which need to be properly managed, if possible with an infrastructure able to support tissue sharing between institutes. Moreover, the available frameworks oriented to Tissue MicroArray provide good storage for clinical patient, sample treatment and block construction information, but their utility is limited by the lack of data integration with biomolecular information. Results In this work we propose a Tissue MicroArray web oriented system to support researchers in managing bio-samples and, through the use of ontologies, enables tissue sharing aimed at the design of Tissue MicroArray experiments and results evaluation. Indeed, our system provides ontological description both for pre-analysis tissue images and for post-process analysis image results, which is crucial for information exchange. Moreover, working on well-defined terms it is then possible to query web resources for literature articles to integrate both pathology and bioinformatics data. Conclusions Using this system, users associate an ontology-based description to each image uploaded into the database and also integrate results with the ontological description of biosequences identified in every tissue. Moreover, it is possible to integrate the ontological description provided by the user with a full compliant gene ontology definition, enabling statistical studies about correlation between the analyzed pathology and the most commonly related biological processes. PMID:18460177
Integrating Microarray Data and GRNs.
Koumakis, L; Potamias, G; Tsiknakis, M; Zervakis, M; Moustakis, V
2016-01-01
With the completion of the Human Genome Project and the emergence of high-throughput technologies, a vast amount of molecular and biological data are being produced. Two of the most important and significant data sources come from microarray gene-expression experiments and respective databanks (e.g., Gene Expression Omnibus-GEO (http://www.ncbi.nlm.nih.gov/geo)), and from molecular pathways and Gene Regulatory Networks (GRNs) stored and curated in public (e.g., Kyoto Encyclopedia of Genes and Genomes-KEGG (http://www.genome.jp/kegg/pathway.html), Reactome (http://www.reactome.org/ReactomeGWT/entrypoint.html)) as well as in commercial repositories (e.g., Ingenuity IPA (http://www.ingenuity.com/products/ipa)). The association of these two sources aims to give new insight into disease understanding and reveal new molecular targets in the treatment of specific phenotypes. Three major research lines and respective efforts that try to utilize and combine data from both of these sources could be identified, namely: (1) de novo reconstruction of GRNs, (2) identification of gene signatures, and (3) identification of differentially expressed GRN functional paths (i.e., sub-GRN paths that distinguish between different phenotypes). In this chapter, we give an overview of the existing methods that support the different types of gene-expression and GRN integration with a focus on methodologies that aim to identify phenotype-discriminant GRNs or subnetworks, and we also present our methodology.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Perkins, Timothy N.; Dentener, Mieke A.
Growth and development of the mature lung is a complex process orchestrated by a number of intricate developmental signaling pathways. Wingless-type MMTV-integration site (WNT) signaling plays critical roles in controlling branching morphogenesis, cell differentiation, and formation of the conducting and respiratory airways. In addition, WNT pathways are often re-activated in mature lungs during repair and regeneration. WNT signaling has been elucidated as a crucial contributor to the development of idiopathic pulmonary fibrosis as well as other hyper-proliferative lung diseases. Silicosis, a detrimental occupational lung disease caused by excessive inhalation of crystalline silica dust, is hallmarked by repeated cycles of damaging inflammation, epithelial hyperplasia, and formation of dense, hyalinized nodules of whorled collagen. However, mechanisms of epithelial cell hyperplasia and matrix deposition are not well understood, as most research efforts have focused on the pronounced inflammatory response. Microarray data from our previous studies have revealed a number of WNT-signaling and WNT-target genes altered by crystalline silica in human lung epithelial cells. In the present study, we utilize pathway analysis to designate connections between genes altered by silica in WNT-signaling networks. Furthermore, we confirm microarray findings by QRT-PCR and demonstrate both activation of canonical (β-catenin) and down-regulation of non-canonical (WNT5A) signaling in immortalized (BEAS-2B) and primary (PBEC) human bronchial epithelial cells. These findings suggest that WNT-signaling and cross-talk with other pathways (e.g. Notch) may contribute to proliferative, fibrogenic and inflammatory responses to silica in lung epithelial cells.
Highlights:
• Pathway analysis reveals silica-induced WNT-signaling in lung epithelial cells.
• Silica-induced canonical WNT-signaling is mediated by autocrine/paracrine signals.
• Crystalline silica decreases non-canonical WNT5A signaling.
• Microarray reveals WNT as a novel complex signaling network in silica-mediated injury.
Ma, Chuang; Wang, Xiangfeng
2012-09-01
One of the computational challenges in plant systems biology is to accurately infer transcriptional regulation relationships based on correlation analyses of gene expression patterns. Despite several correlation methods that are applied in biology to analyze microarray data, concerns regarding the compatibility of these methods with the gene expression data profiled by high-throughput RNA transcriptome sequencing (RNA-Seq) technology have been raised. These concerns are mainly due to the fact that the distribution of read counts in RNA-Seq experiments is different from that of fluorescence intensities in microarray experiments. Therefore, a comprehensive evaluation of the existing correlation methods and, if necessary, introduction of novel methods into biology is appropriate. In this study, we compared four existing correlation methods used in microarray analysis and one novel method called the Gini correlation coefficient on previously published microarray-based and sequencing-based gene expression data in Arabidopsis (Arabidopsis thaliana) and maize (Zea mays). The comparisons were performed on more than 11,000 regulatory relationships in Arabidopsis, including 8,929 pairs of transcription factors and target genes. Our analyses pinpointed the strengths and weaknesses of each method and indicated that the Gini correlation can compensate for the shortcomings of the Pearson correlation, the Spearman correlation, the Kendall correlation, and the Tukey's biweight correlation. The Gini correlation method, with the other four evaluated methods in this study, was implemented as an R package named rsgcc that can be utilized as an alternative option for biologists to perform clustering analyses of gene expression patterns or transcriptional network analyses. PMID:22797655
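To make the Gini correlation mentioned in the abstract above more concrete, here is a minimal Python sketch. It assumes the commonly cited rank-weighted form of the coefficient (the values of x ordered by the ranks of y, weighted by 2i - n - 1, divided by the same sum with x in its own order). The function name `gini_correlation` is hypothetical, ties are not treated specially, and this is not the rsgcc implementation.

```python
def gini_correlation(x, y):
    """Gini correlation of x with respect to the ranks of y (rank-weighted form).

    Illustrative sketch only; assumes no special handling of ties.
    """
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("x and y must have the same length (at least 2)")
    n = len(x)
    # Weight for the i-th smallest position (1-based): 2i - n - 1.
    weights = [2 * i - n - 1 for i in range(1, n + 1)]
    # x in its own ascending order (denominator).
    x_sorted_by_x = sorted(x)
    # x rearranged into the ascending order of y (numerator).
    order_of_y = sorted(range(n), key=lambda i: y[i])
    x_sorted_by_y = [x[i] for i in order_of_y]
    denominator = sum(w * v for w, v in zip(weights, x_sorted_by_x))
    if denominator == 0:
        raise ValueError("x is constant; the coefficient is undefined")
    numerator = sum(w * v for w, v in zip(weights, x_sorted_by_y))
    return numerator / denominator
```

The coefficient mixes the values of one variable with the ranks of the other, so `gini_correlation(x, y)` and `gini_correlation(y, x)` generally differ. A symmetric summary would need a rule for combining the two, which this sketch does not choose.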
Yi, Ming; Mudunuri, Uma; Che, Anney; Stephens, Robert M
2009-06-29
One of the challenges in the analysis of microarray data is to integrate and compare the selected (e.g., differential) gene lists from multiple experiments for common or unique underlying biological themes. A common way to approach this problem is to extract common genes from these gene lists and then subject these genes to enrichment analysis to reveal the underlying biology. However, the capacity of this approach is largely restricted by the limited number of common genes shared by datasets from multiple experiments, which could be caused by the complexity of the biological system itself. We now introduce a new Pathway Pattern Extraction Pipeline (PPEP), which extends the existing WPS application by providing a new pathway-level comparative analysis scheme. To facilitate comparing and correlating results from different studies and sources, PPEP contains new interfaces that allow evaluation of the pathway-level enrichment patterns across multiple gene lists. As an exploratory tool, this analysis pipeline may help reveal the underlying biological themes at both the pathway and gene levels. The analysis scheme provided by PPEP begins with multiple gene lists, which may be derived from different studies in terms of the biological contexts, applied technologies, or methodologies. These lists are then subjected to pathway-level comparative analysis for extraction of pathway-level patterns. This analysis pipeline helps to explore the commonality or uniqueness of these lists at the level of pathways or biological processes from different but relevant biological systems using a combination of statistical enrichment measurements, pathway-level pattern extraction, and graphical display of the relationships of genes and their associated pathways as Gene-Term Association Networks (GTANs) within the WPS platform. As a proof of concept, we have used the new method to analyze many datasets from our collaborators as well as some public microarray datasets. 
This tool provides a new pathway-level analysis scheme for integrative and comparative analysis of data derived from different but relevant systems. The tool is freely available as a Pathway Pattern Extraction Pipeline implemented in our existing software package WPS, which can be obtained at http://www.abcc.ncifcrf.gov/wps/wps_index.php.
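The pathway-level comparison described above can be sketched as follows. The abstract does not name the enrichment statistic used by PPEP, so this sketch assumes a one-sided hypergeometric test, a common choice for gene-list enrichment. The names `enrichment_p_value` and `pathway_pattern` are hypothetical and are not part of WPS.

```python
from math import comb


def enrichment_p_value(universe_size, pathway_size, list_size, overlap):
    """P(X >= overlap) for a hypergeometric draw of list_size genes."""
    max_overlap = min(pathway_size, list_size)
    total = comb(universe_size, list_size)
    tail = sum(
        comb(pathway_size, i) * comb(universe_size - pathway_size, list_size - i)
        for i in range(overlap, max_overlap + 1)
    )
    return tail / total


def pathway_pattern(gene_lists, pathways, universe_size):
    """Return {pathway: {list_name: p_value}} across several gene lists."""
    pattern = {}
    for pathway_name, members in pathways.items():
        member_set = set(members)
        row = {}
        for list_name, genes in gene_lists.items():
            gene_set = set(genes)
            overlap = len(gene_set & member_set)
            row[list_name] = enrichment_p_value(
                universe_size, len(member_set), len(gene_set), overlap
            )
        pattern[pathway_name] = row
    return pattern
```

Each row of the resulting table is one pathway's enrichment pattern across the lists, so two lists that share few genes can still show similar rows.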
Circular RNA and gene expression profiles in gastric cancer based on microarray chip technology.
Sui, Weiguo; Shi, Zhoufang; Xue, Wen; Ou, Minglin; Zhu, Ying; Chen, Jiejing; Lin, Hua; Liu, Fuhua; Dai, Yong
2017-03-01
The aim of the present study was to screen gastric cancer (GC) tissue and adjacent tissue for differences in mRNA and circular RNA (circRNA) expression, to analyze the differences in circRNA and mRNA expression, and to investigate the circRNA expression in gastric carcinoma and its mechanism. circRNA and mRNA differential expression profiles generated using Agilent microarray technology were analyzed in the GC tissues and adjacent tissues. qRT-PCR was used to verify the differential expression of circRNAs and mRNAs according to the interactions between circRNAs and miRNAs as well as the possible existence of miRNA and mRNA interactions. We found that: i) the circRNA expression profile revealed 1,285 significant differences in circRNA expression, with circRNA expression downregulated in 594 samples and upregulated in 691 samples via interactions with miRNAs. The qRT-PCR validation experiments showed that hsa_circRNA_400071, hsa_circRNA_000543 and hsa_circRNA_001959 expression was consistent with the microarray analysis results. ii) 29,112 genes were found in the GC tissues and adjacent tissues, including 5,460 differentially expressed genes. Among them, 2,390 differentially expressed genes were upregulated and 3,070 genes were downregulated. Gene Ontology (GO) analysis of the differentially expressed genes revealed that these genes were involved in biological process classification, cellular component classification and molecular function classification. Pathway analysis of the differentially expressed genes identified 83 significantly enriched genes, including 28 upregulated genes and 55 downregulated genes. iii) 69 differentially expressed circRNAs were found that might adsorb specific miRNAs to regulate the expression of their target gene mRNAs. The conclusions are: i) differentially expressed circRNAs had corresponding miRNA binding sites. 
These circRNAs regulated the expression of target genes through interactions with miRNAs and might become new molecular biomarkers for GC in the future. ii) Differentially expressed genes may be involved in the occurrence of GC via a variety of mechanisms. iii) CD44, CXXC5, MYH9, MALAT1 and other genes may have important implications for the occurrence and development of GC through the regulation, interaction, and mutual influence of circRNA-miRNA-mRNA via different mechanisms.
Isolation of Microarray-Grade Total RNA, MicroRNA, and DNA from a Single PAXgene Blood RNA Tube
Kruhøffer, Mogens; Dyrskjøt, Lars; Voss, Thorsten; Lindberg, Raija L.P.; Wyrich, Ralf; Thykjaer, Thomas; Orntoft, Torben F.
2007-01-01
We have developed a procedure for isolation of microRNA and genomic DNA in addition to total RNA from whole blood stabilized in PAXgene Blood RNA tubes. The procedure is based on automatic extraction on a BioRobot MDx and includes isolation of DNA from a fraction of the stabilized blood and recovery of small RNA species that are otherwise lost. The procedure presented here is suitable for large-scale experiments and is amenable to further automation. Procured total RNA and DNA were tested using Affymetrix Expression and single-nucleotide polymorphism GeneChips, respectively, and isolated microRNA was tested using spotted locked nucleic acid-based microarrays. We conclude that the yield and quality of total RNA, microRNA, and DNA from a single PAXgene blood RNA tube are sufficient for downstream microarray analysis. PMID:17690207
Finding Groups in Gene Expression Data
2005-01-01
The vast potential of the genomic insight offered by microarray technologies has led to their widespread use since they were introduced a decade ago. Application areas include gene function discovery, disease diagnosis, and inferring regulatory networks. Microarray experiments enable large-scale, high-throughput investigations of gene activity and have thus provided the data analyst with a distinctive, high-dimensional field of study. Many questions in this field relate to finding subgroups of data profiles which are very similar. A popular type of exploratory tool for finding subgroups is cluster analysis, and many different flavors of algorithms have been used and indeed tailored for microarray data. Cluster analysis, however, implies a partitioning of the entire data set, and this does not always match the objective. Sometimes pattern discovery or bump hunting tools are more appropriate. This paper reviews these various tools for finding interesting subgroups. PMID:16046827
Surface Glycosylation Profiles of Urine Extracellular Vesicles
Gerlach, Jared Q.; Krüger, Anja; Gallogly, Susan; Hanley, Shirley A.; Hogan, Marie C.; Ward, Christopher J.
2013-01-01
Urinary extracellular vesicles (uEVs) are released by cells throughout the nephron and contain biomolecules from their cells of origin. Although uEV-associated proteins and RNA have been studied in detail, little information exists regarding uEV glycosylation characteristics. Surface glycosylation profiling by flow cytometry and lectin microarray was applied to uEVs enriched from urine of healthy adults by ultracentrifugation and centrifugal filtration. The carbohydrate specificity of lectin microarray profiles was confirmed by competitive sugar inhibition and carbohydrate-specific enzyme hydrolysis. Glycosylation profiles of uEVs and purified Tamm Horsfall protein were compared. In both flow cytometry and lectin microarray assays, uEVs demonstrated surface binding, at low to moderate intensities, of a broad range of lectins whether prepared by ultracentrifugation or centrifugal filtration. In general, ultracentrifugation-prepared uEVs demonstrated higher lectin binding intensities than centrifugal filtration-prepared uEVs consistent with lesser amounts of co-purified non-vesicular proteins. The surface glycosylation profiles of uEVs showed little inter-individual variation and were distinct from those of Tamm Horsfall protein, which bound a limited number of lectins. In a pilot study, lectin microarray was used to compare uEVs from individuals with autosomal dominant polycystic kidney disease to those of age-matched controls. The lectin microarray profiles of polycystic kidney disease and healthy uEVs showed differences in binding intensity of 6/43 lectins. Our results reveal a complex surface glycosylation profile of uEVs that is accessible to lectin-based analysis following multiple uEV enrichment techniques, is distinct from co-purified Tamm Horsfall protein and may demonstrate disease-specific modifications. PMID:24069349
Jin, S J; Liu, M; Long, W J; Luo, X P
2016-12-02
Objective: To explore the clinical phenotypes and the genetic cause for a boy with unexplained growth retardation, nephrocalcinosis, auditory anomalies and multi-organ/system developmental disorders. Method: Routine G-banding and chromosome microarray analysis were applied to a child with unexplained growth retardation, nephrocalcinosis, auditory anomalies and multi-organ/system developmental disorders treated in the Department of Pediatrics of Tongji Hospital Affiliated to Tongji Medical College of Huazhong University of Science and Technology in September 2015 and his parents to conduct the chromosomal karyotype analysis and the whole genome scanning. Deleted genes were searched in the Decipher and NCBI databases, and their relationships with the clinical phenotypes were analyzed. Result: A six-month-old boy was referred to us because of unexplained growth retardation and feeding intolerance. The affected child presented with abnormal manifestations such as special face, umbilical hernia, growth retardation, hypothyroidism, congenital heart disease, right ear sensorineural deafness, hypercalcemia and nephrocalcinosis. The child's karyotype was 46, XY, 16qh+, and his parents' karyotypes were normal. Chromosome microarray analysis revealed a 1,436 kb deletion on the 7q11.23(72701098_74136633) region of the child. This region included 23 protein-coding genes, which were reported to correspond to Williams-Beuren syndrome and certain of its clinical phenotypes. His parents' results of chromosome microarray analysis were normal. Conclusion: A boy with characteristic manifestations of Williams-Beuren syndrome and rare nephrocalcinosis was diagnosed using chromosome microarray analysis. The deletion on the 7q11.23 might be related to the clinical phenotypes of Williams-Beuren syndrome, yet further studies are needed.
Yuen, Peter S.T.; Jo, Sang-Kyung; Holly, Mikaela K.; Hu, Xuzhen; Star, Robert A.
2006-01-01
Acute renal failure (ARF) has a high morbidity and mortality. In animal ARF models, effective treatments must be administered before or shortly after the insult, limiting their clinical potential. We used microarrays to identify early biomarkers that distinguish ischemic from nephrotoxic ARF, or biomarkers that detect both injury types. We compared rat kidney transcriptomes 2 and 8 hours after ischemia/reperfusion and after mercuric chloride. Quality control and statistical analyses were necessary to normalize microarrays from different lots, eliminate outliers, and exclude unaltered genes. Principal component analysis revealed distinct ischemic and nephrotoxic trajectories, and clear array groupings. Therefore, we used supervised analysis, t-tests and fold changes, to compile gene lists for each group, exclusive or non-exclusive, alone or in combination. There was little network connectivity, even in the largest group. Some microarray-identified genes were validated by TaqMan assay, ruling out artifacts. Western blotting confirmed that HO-1 and ATF3 proteins were upregulated; however, unexpectedly, their localization changed within the kidney. HO-1 staining shifted from cortical (early) to outer stripe of the outer medulla (late), primarily in detaching cells, after mercuric chloride, but not ischemia/reperfusion. ATF3 staining was similar, but with additional early transient expression in the outer stripe after ischemia/reperfusion. We conclude that microarray-identified genes must be evaluated not only for protein levels, but also for anatomical distribution among different zones, nephron segments, or cell types. Although protein detection reagents are limited, microarray data lay a rich foundation to explore biomarkers, therapeutics, and pathophysiology of ARF. PMID:16507785
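The supervised step named above (t-tests combined with fold changes) can be sketched in a few lines of Python. This is only an illustration under stated assumptions: it uses Welch's t-statistic rather than a p-value, expects positive linear-scale intensities, and the thresholds `min_abs_t` and `min_abs_log2fc` are hypothetical defaults, not values from the study.

```python
from math import log2, sqrt
from statistics import mean, variance


def welch_t(a, b):
    """Welch's t-statistic for two independent samples (each with n >= 2)."""
    standard_error = sqrt(variance(a) / len(a) + variance(b) / len(b))
    if standard_error == 0:
        raise ValueError("both groups are constant; t is undefined")
    return (mean(a) - mean(b)) / standard_error


def select_genes(expression, min_abs_t=4.0, min_abs_log2fc=1.0):
    """Return genes passing both the t-statistic and fold-change cutoffs.

    `expression` maps gene name -> (treated values, control values),
    given as positive linear-scale intensities.
    """
    selected = []
    for gene, (treated, control) in expression.items():
        t_stat = welch_t(treated, control)
        log2_fold_change = log2(mean(treated) / mean(control))
        if abs(t_stat) >= min_abs_t and abs(log2_fold_change) >= min_abs_log2fc:
            selected.append(gene)
    return selected
```

Requiring both criteria excludes genes with a large but noisy fold change as well as genes with a consistent but very small shift.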
DOE Office of Scientific and Technical Information (OSTI.GOV)
Andersen, G.L.; He, Z.; DeSantis, T.Z.
Microarrays have proven to be a useful and high-throughput method to provide targeted DNA sequence information for up to many thousands of specific genetic regions in a single test. A microarray consists of multiple DNA oligonucleotide probes that, under high stringency conditions, hybridize only to specific complementary nucleic acid sequences (targets). A fluorescent signal indicates the presence and, in many cases, the abundance of genetic regions of interest. In this chapter we will look at how microarrays are used in microbial ecology, especially with the recent increase in microbial community DNA sequence data. Of particular interest to microbial ecologists, phylogenetic microarrays are used for the analysis of phylotypes in a community and functional gene arrays are used for the analysis of functional genes, and, by inference, phylotypes in environmental samples. A phylogenetic microarray that has been developed by the Andersen laboratory, the PhyloChip, will be discussed as an example of a microarray that targets the known diversity within the 16S rRNA gene to determine microbial community composition. Using multiple, confirmatory probes to increase the confidence of detection and a mismatch probe for every perfect match probe to minimize the effect of cross-hybridization by non-target regions, the PhyloChip is able to simultaneously identify any of thousands of taxa present in an environmental sample. The PhyloChip is shown to reveal greater diversity within a community than rRNA gene sequencing due to the placement of the entire gene product on the microarray compared with the analysis of up to thousands of individual molecules by traditional sequencing methods. A functional gene array that has been developed by the Zhou laboratory, the GeoChip, will be discussed as an example of a microarray that dynamically identifies functional activities of multiple members within a community. 
The recent version of GeoChip contains more than 24,000 50mer oligonucleotide probes and covers more than 10,000 gene sequences in 150 gene categories involved in carbon, nitrogen, sulfur, and phosphorus cycling, metal resistance and reduction, and organic contaminant degradation. GeoChip can be used as a generic tool for microbial community analysis, and also link microbial community structure to ecosystem functioning. Examples of the application of both arrays in different environmental samples will be described in the two subsequent sections.
van Haaften, Rachel I M; Luceri, Cristina; van Erk, Arie; Evelo, Chris T A
2009-06-01
Omics technology used for large-scale measurements of gene expression is rapidly evolving. This work points out the need for extensive bioinformatics analysis for array quality assessment before and after gene expression clustering and pathway analysis. A study focused on the effect of red wine polyphenols on rat colon mucosa was used to test the impact of quality control and normalisation steps on the biological conclusions. The integration of data visualization, pathway analysis and clustering revealed an artifact problem that was solved with an adapted normalisation. We propose a possible point-to-point standard analysis procedure, based on a combination of clustering and data visualization, for the analysis of microarray data.
Workflows for microarray data processing in the Kepler environment.
Stropp, Thomas; McPhillips, Timothy; Ludäscher, Bertram; Bieda, Mark
2012-05-17
Microarray data analysis has been the subject of extensive and ongoing pipeline development due to its complexity, the availability of several options at each analysis step, and the development of new analysis demands, including integration with new data sources. Bioinformatics pipelines are usually custom built for different applications, making them typically difficult to modify, extend and repurpose. Scientific workflow systems are intended to address these issues by providing general-purpose frameworks in which to develop and execute such pipelines. The Kepler workflow environment is a well-established system under continual development that is employed in several areas of scientific research. Kepler provides a flexible graphical interface, featuring clear display of parameter values, for design and modification of workflows. It has capabilities for developing novel computational components in the R, Python, and Java programming languages, all of which are widely used for bioinformatics algorithm development, along with capabilities for invoking external applications and using web services. We developed a series of fully functional bioinformatics pipelines addressing common tasks in microarray processing in the Kepler workflow environment. These pipelines consist of a set of tools for GFF file processing of NimbleGen chromatin immunoprecipitation on microarray (ChIP-chip) datasets and more comprehensive workflows for Affymetrix gene expression microarray bioinformatics and basic primer design for PCR experiments, which are often used to validate microarray results. Although functional in themselves, these workflows can be easily customized, extended, or repurposed to match the needs of specific projects and are designed to be a toolkit and starting point for specific applications. 
These workflows illustrate a workflow programming paradigm focusing on local resources (programs and data) and therefore are close to traditional shell scripting or R/BioConductor scripting approaches to pipeline design. Finally, we suggest that microarray data processing task workflows may provide a basis for future example-based comparison of different workflow systems. We provide a set of tools and complete workflows for microarray data analysis in the Kepler environment, which has the advantages of offering graphical, clear display of conceptual steps and parameters and the ability to easily integrate other resources such as remote data and web services.
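As a rough illustration of the kind of GFF file processing mentioned above, the following Python sketch parses one line of the standard nine-column, tab-separated GFF layout and filters records by score. It is not taken from the Kepler workflows. The names `GffRecord`, `parse_gff_line` and `filter_by_score` are hypothetical, and the attributes column is left unparsed because its syntax differs between GFF versions.

```python
from typing import NamedTuple, Optional


class GffRecord(NamedTuple):
    seqid: str
    source: str
    feature_type: str
    start: int
    end: int
    score: Optional[float]  # None when the file gives "."
    strand: str
    phase: str
    attributes: str  # kept as raw text


def parse_gff_line(line):
    """Parse one GFF line; return None for blank lines and '#' comments."""
    if not line.strip() or line.startswith("#"):
        return None
    fields = line.rstrip("\n").split("\t")
    if len(fields) != 9:
        raise ValueError(f"expected 9 tab-separated columns, got {len(fields)}")
    score = None if fields[5] == "." else float(fields[5])
    return GffRecord(
        fields[0], fields[1], fields[2],
        int(fields[3]), int(fields[4]),
        score, fields[6], fields[7], fields[8],
    )


def filter_by_score(lines, min_score):
    """Yield records whose score is present and at least min_score."""
    for line in lines:
        record = parse_gff_line(line)
        if record is not None and record.score is not None and record.score >= min_score:
            yield record
```

A step like this could form one node of a larger pipeline, with normalisation or peak calling handled by separate components.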
Naiser, Thomas; Ehler, Oliver; Kayser, Jona; Mai, Timo; Michel, Wolfgang; Ott, Albrecht
2008-01-01
Background The high binding specificity of short 10 to 30 mer oligonucleotide probes enables single base mismatch (MM) discrimination and thus provides the basis for genotyping and resequencing microarray applications. Recent experiments indicate that the underlying principles governing DNA microarray hybridization – and in particular MM discrimination – are not completely understood. Microarrays usually address complex mixtures of DNA targets. In order to reduce the level of complexity and to study the problem of surface-based hybridization with point defects in more detail, we performed array based hybridization experiments in well controlled and simple situations. Results We performed microarray hybridization experiments with short 16 to 40 mer target and probe lengths (in situations without competitive hybridization) in order to systematically investigate the impact of point-mutations – varying defect type and position – on the oligonucleotide duplex binding affinity. The influence of single base bulges and single base MMs depends predominantly on position – it is largest in the middle of the strand. The position-dependent influence of base bulges is very similar to that of single base MMs; however, certain bulges give rise to an unexpectedly high binding affinity. Besides the defect (MM or bulge) type, which is the second most important contribution to hybridization affinity, there is also a sequence dependence, which extends beyond the defect next-neighbor and which is difficult to quantify. Direct comparison between binding affinities of DNA/DNA and RNA/DNA duplexes shows that RNA/DNA purine-purine MMs are more discriminating than corresponding DNA/DNA MMs. In DNA/DNA MM discrimination the affected base pair (C·G vs. A·T) is the pertinent parameter. We attribute these differences to the different structures of the duplexes (A vs. B form). 
Conclusion We have shown that DNA microarrays can resolve even subtle changes in hybridization affinity for simple target mixtures. We have further shown that the impact of point defects on oligonucleotide stability can be broken down to a hierarchy of effects. In order to explain our observations we propose DNA molecular dynamics – in form of zipping of the oligonucleotide duplex – to play an important role. PMID:18477387
Riis, Margit L. H.; Lüders, Torben; Markert, Elke K.; Haakensen, Vilde D.; Nesbakken, Anne-Jorun; Kristensen, Vessela N.; Bukholm, Ida R. K.
2012-01-01
Gene expression studies on breast cancer have generally been performed on tissue obtained at the time of surgery. In this study, we have compared the gene expression profiles in preoperative tissue (core needle biopsies) while tumor is still in its normal milieu to postoperative tissue from the same tumor obtained during surgery. Thirteen patients were included of which eleven had undergone sentinel node diagnosis procedure before operation. Microarray gene expression analysis was performed using total RNA from all the samples. Paired significance analysis of microarrays revealed 228 differently expressed genes, including several early response stress-related genes such as members of the fos and jun families as well as genes of which the expression has previously been associated with cancer. The expression profiles found in the analyses of breast cancer tissue must be evaluated with caution. Different profiles may simply be the result of differences in the surgical trauma and timing of when samples are taken and not necessarily associated with tumor biology. PMID:23227362
Schüler, Susann; Wenz, Ingrid; Wiederanders, B; Slickers, P; Ehricht, R
2006-06-12
Recent developments in DNA microarray technology led to a variety of open and closed devices and systems including high and low density microarrays for high-throughput screening applications as well as microarrays of lower density for specific diagnostic purposes. Besides predefined microarrays for specific applications, manufacturers offer the production of custom-designed microarrays adapted to customers' wishes. Array based assays demand complex procedures including several steps for sample preparation (RNA extraction, amplification and sample labelling), hybridization and detection, thus leading to a high variability between several approaches and resulting in the necessity of extensive standardization and normalization procedures. In the present work a custom designed human proteinase DNA microarray of lower density in ArrayTube format was established. This highly economic open platform only requires standard laboratory equipment and allows the study of the molecular regulation of cell behaviour by proteinases. We established a procedure for sample preparation and hybridization and verified the array based gene expression profile by quantitative real-time PCR (QRT-PCR). Moreover, we compared the results with the well established Affymetrix microarray. By application of standard labelling procedures with e.g. Klenow fragment exo-, single primer amplification (SPA) or In Vitro Transcription (IVT) we noticed a loss of signal conservation for some genes. To overcome this problem we developed a protocol in accordance with the SPA protocol, in which we included target specific primers designed individually for each spotted oligomer. Here we present a complete array based assay in which only the specific transcripts of interest are amplified in parallel and in a linear manner. The array represents a proof of principle which can be adapted to other species as well. 
As the designed protocol for amplifying mRNA starts from as little as 100 ng total RNA, it presents an alternative method for detecting even low expressed genes by microarray experiments in a highly reproducible and sensitive manner. Preservation of signal integrity is demonstrated out by QRT-PCR measurements. The little amounts of total RNA necessary for the analyses make this method applicable for investigations with limited material as in clinical samples from, for example, organ or tumour biopsies. Those are arguments in favour of the high potential of our assay compared to established procedures for amplification within the field of diagnostic expression profiling. Nevertheless, the screening character of microarray data must be mentioned, and independent methods should verify the results.
Galectins are human milk glycan receptors
Noll, Alexander J; Gourdine, Jean-Philippe; Yu, Ying; Lasanajak, Yi; Smith, David F; Cummings, Richard D
2016-01-01
The biological recognition of human milk glycans (HMGs) is poorly understood. Because HMGs are rich in galactose we explored whether they might interact with human galectins, which bind galactose-containing glycans and are highly expressed in epithelial cells and other cell types. We screened a number of human galectins for their binding to HMGs on a shotgun glycan microarray consisting of 247 HMGs derived from human milk, as well as to a defined HMG microarray. Recombinant human galectins (hGal)-1, -3, -4, -7, -8 and -9 bound selectively to glycans, with each galectin recognizing a relatively unique binding motif; by contrast hGal-2 did not recognize HMGs, but did bind to the human blood group A Type 2 determinants on other microarrays. Unlike other galectins, hGal-7 preferentially bound to glycans expressing a terminal Type 1 (Galβ1-3GlcNAc) sequence, a motif that had eluded detection on non-HMG glycan microarrays. Interactions with HMGs were confirmed in a solution setting by isothermal titration microcalorimetry and hapten inhibition experiments. These results demonstrate that galectins selectively bind to HMGs and suggest the possibility that galectin–HMG interactions may play a role in infant immunity. PMID:26747425
Estimating differential expression from multiple indicators
Ilmjärv, Sten; Hundahl, Christian Ansgar; Reimets, Riin; Niitsoo, Margus; Kolde, Raivo; Vilo, Jaak; Vasar, Eero; Luuk, Hendrik
2014-01-01
Regardless of the advent of high-throughput sequencing, microarrays remain central in current biomedical research. Conventional microarray analysis pipelines apply data reduction before the estimation of differential expression, which is likely to render the estimates susceptible to noise from signal summarization and reduce statistical power. We present a probe-level framework, which capitalizes on the high number of concurrent measurements to provide more robust differential expression estimates. The framework naturally extends to various experimental designs and target categories (e.g. transcripts, genes, genomic regions) as well as small sample sizes. Benchmarking in relation to popular microarray and RNA-sequencing data-analysis pipelines indicated high and stable performance on the Microarray Quality Control dataset and in a cell-culture model of hypoxia. Experimental data exhibiting long-range epigenetic silencing of gene expression were used to demonstrate the efficacy of detecting differential expression of genomic regions, a level of analysis not embraced by conventional workflows. Finally, we designed and conducted an experiment to identify hypothermia-responsive genes in terms of monotonic time-response. As a novel insight, hypothermia-dependent up-regulation of multiple genes of two major antioxidant pathways was identified and verified by quantitative real-time PCR. PMID:24586062
Kitchen, Robert R; Sabine, Vicky S; Simen, Arthur A; Dixon, J Michael; Bartlett, John M S; Sims, Andrew H
2011-12-01
Systematic processing noise, which includes batch effects, is very common in microarray experiments but is often ignored despite its potential to confound or compromise experimental results. Compromised results are most likely when re-analysing or integrating datasets from public repositories due to the different conditions under which each dataset is generated. To better understand the relative noise-contributions of various factors in experimental-design, we assessed several Illumina and Affymetrix datasets for technical variation between replicate hybridisations of Universal Human Reference (UHRR) and individual or pooled breast-tumour RNA. A varying degree of systematic noise was observed in each of the datasets, however in all cases the relative amount of variation between standard control RNA replicates was found to be greatest at earlier points in the sample-preparation workflow. For example, 40.6% of the total variation in reported expressions were attributed to replicate extractions, compared to 13.9% due to amplification/labelling and 10.8% between replicate hybridisations. Deliberate probe-wise batch-correction methods were effective in reducing the magnitude of this variation, although the level of improvement was dependent on the sources of noise included in the model. Systematic noise introduced at the chip, run, and experiment levels of a combined Illumina dataset were found to be highly dependent upon the experimental design. Both UHRR and pools of RNA, which were derived from the samples of interest, modelled technical variation well although the pools were significantly better correlated (4% average improvement) and better emulated the effects of systematic noise, over all probes, than the UHRRs. The effect of this noise was not uniform over all probes, with low GC-content probes found to be more vulnerable to batch variation than probes with a higher GC-content. 
The magnitude of systematic processing noise in a microarray experiment is variable across probes and experiments, however it is generally the case that procedures earlier in the sample-preparation workflow are liable to introduce the most noise. Careful experimental design is important to protect against noise, detailed meta-data should always be provided, and diagnostic procedures should be routinely performed prior to downstream analyses for the detection of bias in microarray studies.
2011-01-01
Background Systematic processing noise, which includes batch effects, is very common in microarray experiments but is often ignored despite its potential to confound or compromise experimental results. Compromised results are most likely when re-analysing or integrating datasets from public repositories due to the different conditions under which each dataset is generated. To better understand the relative noise-contributions of various factors in experimental-design, we assessed several Illumina and Affymetrix datasets for technical variation between replicate hybridisations of Universal Human Reference (UHRR) and individual or pooled breast-tumour RNA. Results A varying degree of systematic noise was observed in each of the datasets, however in all cases the relative amount of variation between standard control RNA replicates was found to be greatest at earlier points in the sample-preparation workflow. For example, 40.6% of the total variation in reported expressions were attributed to replicate extractions, compared to 13.9% due to amplification/labelling and 10.8% between replicate hybridisations. Deliberate probe-wise batch-correction methods were effective in reducing the magnitude of this variation, although the level of improvement was dependent on the sources of noise included in the model. Systematic noise introduced at the chip, run, and experiment levels of a combined Illumina dataset were found to be highly dependent upon the experimental design. Both UHRR and pools of RNA, which were derived from the samples of interest, modelled technical variation well although the pools were significantly better correlated (4% average improvement) and better emulated the effects of systematic noise, over all probes, than the UHRRs. The effect of this noise was not uniform over all probes, with low GC-content probes found to be more vulnerable to batch variation than probes with a higher GC-content.
Conclusions The magnitude of systematic processing noise in a microarray experiment is variable across probes and experiments, however it is generally the case that procedures earlier in the sample-preparation workflow are liable to introduce the most noise. Careful experimental design is important to protect against noise, detailed meta-data should always be provided, and diagnostic procedures should be routinely performed prior to downstream analyses for the detection of bias in microarray studies. PMID:22133085
DOE Office of Scientific and Technical Information (OSTI.GOV)
Chin, Mark H.; Qian, Weijun; Wang, Haixing
2008-02-10
The molecular mechanisms underlying the changes in the nigrostriatal pathway in Parkinson disease (PD) are not completely understood. Here we use mass spectrometry and microarrays to study the proteomic and transcriptomic changes in the striatum of two mouse models of PD, induced by the distinct neurotoxins 1-methyl-4-phenyl-1,2,3,6-tetrahydropyridine (MPTP) and methamphetamine (METH). Proteomic analyses resulted in the identification and relative quantification of 912 proteins with two or more unique peptides and 85 proteins with significant abundance changes following neurotoxin treatment. Similarly, microarray analyses revealed 181 genes with significant changes in mRNA following neurotoxin treatment. The combined protein and gene list provides a clearer picture of the potential mechanisms underlying neurodegeneration observed in PD. Functional analysis of this combined list revealed a number of significant categories, including mitochondrial dysfunction, oxidative stress response and apoptosis. Additionally, codon usage and miRNAs may play an important role in translational control in the striatum. These results constitute one of the largest datasets integrating protein and transcript changes for these neurotoxin models with many similar endpoint phenotypes but distinct mechanisms.
Sitras, V; Fenton, C; Acharya, G
2015-02-01
Cardiovascular disease (CVD) and preeclampsia (PE) share common clinical features. We aimed to identify common transcriptomic signatures involved in CVD and PE in humans. We performed a meta-analysis of individual raw microarray data deposited in GEO, obtained from blood samples of patients with CVD versus controls and placental samples from women with PE versus healthy women with uncomplicated pregnancies. Annotation of cases versus control samples was taken directly from the microarray documentation. Genes that showed a significant differential expression in the majority of experiments were selected for subsequent analysis. Hypergeometric gene list analysis was performed using the Bioconductor GOstats package. Bioinformatic analysis was performed in PANTHER. Seven studies in CVD and 5 studies in PE were eligible for meta-analysis. A total of 181 genes were found to be differentially expressed in microarray studies investigating gene expression in blood samples obtained from patients with CVD compared to controls and 925 genes were differentially expressed between preeclamptic and healthy placentas. Among these differentially expressed genes, 22 were common between CVD and PE. Bioinformatic analysis of these genes revealed oxidative stress, p53 pathway feedback, inflammation mediated by chemokines and cytokines, interleukin signaling, B-cell activation, PDGF signaling, Wnt signaling, integrin signaling and Alzheimer disease pathways to be involved in the pathophysiology of both CVD and PE. Metabolism, development, response to stimulus, immune response and cell communication were the associated biologic processes in both conditions. Gene set enrichment analysis showed the following overlapping pathways between CVD and PE: TGF-β-signaling, apoptosis, graft-versus-host disease, allograft rejection, chemokine signaling, steroid hormone synthesis, type I and II diabetes mellitus, VEGF signaling, pathways in cancer, GNRH signaling, Huntington's disease and Notch signaling.
CVD and PE share common traits in their gene expression profiles, indicating common pathways in their pathophysiology. Copyright © 2014 Elsevier Ltd. All rights reserved.
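To illustrate what a hypergeometric gene list test computes, the sketch below asks how surprising an overlap of 22 genes between lists of 181 and 925 genes would be by chance. This is not the GOstats analysis from the study: the function name is hypothetical and the gene universe of 20,000 is an assumed value that the abstract does not report, so the resulting probability is illustrative only.

```python
from math import comb


def hypergeom_tail(k, universe, list_a, list_b):
    """P(overlap >= k) when list_b genes are drawn at random from the universe.

    universe: total number of genes that could have been selected
    list_a:   size of the first gene list (the "marked" genes)
    list_b:   size of the second gene list (the random draw)
    """
    upper = min(list_a, list_b)
    hits = sum(
        comb(list_a, i) * comb(universe - list_a, list_b - i)
        for i in range(k, upper + 1)
    )
    return hits / comb(universe, list_b)


UNIVERSE = 20000  # assumption: number of genes measurable in both settings
expected = 181 * 925 / UNIVERSE  # mean overlap under random selection
p_value = hypergeom_tail(22, UNIVERSE, 181, 925)
print(f"expected overlap by chance: {expected:.1f}, P(overlap >= 22) = {p_value:.2e}")
```

The result depends strongly on the assumed universe: a smaller universe makes a given overlap less surprising, so the background gene set should be chosen to match what the platforms could actually detect.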
Smith, Maria W.; Herfort, Lydie; Tyrol, Kaitlin; Suciu, Dominic; Campbell, Victoria; Crump, Byron C.; Peterson, Tawnya D.; Zuber, Peter; Baptista, Antonio M.; Simon, Holly M.
2010-01-01
Through their metabolic activities, microbial populations mediate the impact of high gradient regions on ecological function and productivity of the highly dynamic Columbia River coastal margin (CRCM). A 2226-probe oligonucleotide DNA microarray was developed to investigate expression patterns for microbial genes involved in nitrogen and carbon metabolism in the CRCM. Initial experiments with the environmental microarrays were directed toward validation of the platform and yielded high reproducibility in multiple tests. Bioinformatic and experimental validation also indicated that >85% of the microarray probes were specific for their corresponding target genes and for a few homologs within the same microbial family. The validated probe set was used to query gene expression responses by microbial assemblages to environmental variability. Sixty-four samples from the river, estuary, plume, and adjacent ocean were collected in different seasons and analyzed to correlate the measured variability in chemical, physical and biological water parameters to differences in global gene expression profiles. The method produced robust seasonal profiles corresponding to pre-freshet spring (April) and late summer (August). Overall relative gene expression was high in both seasons and was consistent with high microbial abundance measured by total RNA, heterotrophic bacterial production, and chlorophyll a. Both seasonal patterns involved large numbers of genes that were highly expressed relative to background, yet each produced very different gene expression profiles. April patterns revealed high differential gene expression in the coastal margin samples (estuary, plume and adjacent ocean) relative to freshwater, while little differential gene expression was observed along the river-to-ocean transition in August. Microbial gene expression profiles appeared to relate, in part, to seasonal differences in nutrient availability and potential resource competition. 
Furthermore, our results suggest that highly-active particle-attached microbiota in the Columbia River water column may perform dissimilatory nitrate reduction (both denitrification and DNRA) within anoxic particle microniches. PMID:20967204
Kilicoglu, Halil; Shin, Dongwook; Rindflesch, Thomas C.
2014-01-01
Gene regulatory networks are a crucial aspect of systems biology in describing molecular mechanisms of the cell. Various computational models rely on random gene selection to infer such networks from microarray data. While incorporation of prior knowledge into data analysis has been deemed important, in practice, it has generally been limited to referencing genes in probe sets and using curated knowledge bases. We investigate the impact of augmenting microarray data with semantic relations automatically extracted from the literature, with the view that relations encoding gene/protein interactions eliminate the need for random selection of components in non-exhaustive approaches, producing a more accurate model of cellular behavior. A genetic algorithm is then used to optimize the strength of interactions using microarray data and an artificial neural network fitness function. The result is a directed and weighted network providing the individual contribution of each gene to its target. For testing, we used invasive ductile carcinoma of the breast to query the literature and a microarray set containing gene expression changes in these cells over several time points. Our model demonstrates significantly better fitness than the state-of-the-art model, which relies on an initial random selection of genes. Comparison to the component pathways of the KEGG Pathways in Cancer map reveals that the resulting networks contain both known and novel relationships. The p53 pathway results were manually validated in the literature. 60% of non-KEGG relationships were supported (74% for highly weighted interactions). The method was then applied to yeast data and our model again outperformed the comparison model. Our results demonstrate the advantage of combining gene interactions extracted from the literature in the form of semantic relations with microarray analysis in generating contribution-weighted gene regulatory networks. 
This methodology can make a significant contribution to understanding the complex interactions involved in cellular behavior and molecular physiology. PMID:24921649
Chen, Guocai; Cairelli, Michael J; Kilicoglu, Halil; Shin, Dongwook; Rindflesch, Thomas C
2014-06-01
Gene regulatory networks are a crucial aspect of systems biology in describing molecular mechanisms of the cell. Various computational models rely on random gene selection to infer such networks from microarray data. While incorporation of prior knowledge into data analysis has been deemed important, in practice, it has generally been limited to referencing genes in probe sets and using curated knowledge bases. We investigate the impact of augmenting microarray data with semantic relations automatically extracted from the literature, with the view that relations encoding gene/protein interactions eliminate the need for random selection of components in non-exhaustive approaches, producing a more accurate model of cellular behavior. A genetic algorithm is then used to optimize the strength of interactions using microarray data and an artificial neural network fitness function. The result is a directed and weighted network providing the individual contribution of each gene to its target. For testing, we used invasive ductile carcinoma of the breast to query the literature and a microarray set containing gene expression changes in these cells over several time points. Our model demonstrates significantly better fitness than the state-of-the-art model, which relies on an initial random selection of genes. Comparison to the component pathways of the KEGG Pathways in Cancer map reveals that the resulting networks contain both known and novel relationships. The p53 pathway results were manually validated in the literature. 60% of non-KEGG relationships were supported (74% for highly weighted interactions). The method was then applied to yeast data and our model again outperformed the comparison model. Our results demonstrate the advantage of combining gene interactions extracted from the literature in the form of semantic relations with microarray analysis in generating contribution-weighted gene regulatory networks. 
This methodology can make a significant contribution to understanding the complex interactions involved in cellular behavior and molecular physiology.
Initial Genomics of the Human Nucleolus
Németh, Attila; Conesa, Ana; Santoyo-Lopez, Javier; Medina, Ignacio; Montaner, David; Péterfia, Bálint; Solovei, Irina; Cremer, Thomas; Dopazo, Joaquin; Längst, Gernot
2010-01-01
We report for the first time the genomics of a nuclear compartment of the eukaryotic cell. 454 sequencing and microarray analysis revealed the pattern of nucleolus-associated chromatin domains (NADs) in the linear human genome and identified different gene families and certain satellite repeats as the major building blocks of NADs, which constitute about 4% of the genome. Bioinformatic evaluation showed that NAD–localized genes take part in specific biological processes, like the response to other organisms, odor perception, and tissue development. 3D FISH and immunofluorescence experiments illustrated the spatial distribution of NAD–specific chromatin within interphase nuclei and its alteration upon transcriptional changes. Altogether, our findings describe the nature of DNA sequences associated with the human nucleolus and provide insights into the function of the nucleolus in genome organization and establishment of nuclear architecture. PMID:20361057
Ghan, Ryan; Van Sluyter, Steven C; Hochberg, Uri; Degu, Asfaw; Hopper, Daniel W; Tillet, Richard L; Schlauch, Karen A; Haynes, Paul A; Fait, Aaron; Cramer, Grant R
2015-11-16
Grape cultivars and wines are distinguishable by their color, flavor and aroma profiles. Omic analyses (transcripts, proteins and metabolites) are powerful tools for assessing biochemical differences in biological systems. Berry skins of red- (Cabernet Sauvignon, Merlot, Pinot Noir) and white-skinned (Chardonnay, Semillon) wine grapes were harvested near optimum maturity (°Brix-to-titratable acidity ratio) from the same experimental vineyard. The cultivars were exposed to a mild, seasonal water-deficit treatment from fruit set until harvest in 2011. Identical sample aliquots were analyzed for transcripts by grapevine whole-genome oligonucleotide microarray and RNAseq technologies, proteins by nano-liquid chromatography-mass spectroscopy, and metabolites by gas chromatography-mass spectroscopy and liquid chromatography-mass spectroscopy. Principal components analysis of each of five Omic technologies showed similar results across cultivars in all Omic datasets. Comparison of the processed data of genes mapped in RNAseq and microarray data revealed a strong Pearson's correlation (0.80). The exclusion of probesets associated with genes with potential for cross-hybridization on the microarray improved the correlation to 0.93. The overall concordance of protein with transcript data was low with a Pearson's correlation of 0.27 and 0.24 for the RNAseq and microarray data, respectively. Integration of metabolite with protein and transcript data produced an expected model of phenylpropanoid biosynthesis, which distinguished red from white grapes, yet provided detail of individual cultivar differences. The mild water deficit treatment did not significantly alter the abundance of proteins or metabolites measured in the five cultivars, but did have a small effect on gene expression. The five Omic technologies were consistent in distinguishing cultivar variation. 
There was high concordance between transcriptomic technologies, but generally protein abundance did not correlate well with transcript abundance. The integration of multiple high-throughput Omic datasets revealed complex biochemical variation amongst five cultivars of an ancient and economically important crop species.
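The platform comparison described above can be sketched as a Pearson correlation computed before and after dropping probesets flagged for possible cross-hybridization. The values, the flag column and the helper name below are hypothetical and only show the mechanics; they do not reproduce the reported correlations of 0.80 and 0.93.

```python
from math import sqrt


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / sqrt(var_x * var_y)


# Hypothetical per-gene log2 ratios: (RNAseq, microarray, cross-hybridization flag).
genes = [
    (1.0, 1.2, False),
    (2.0, 1.9, False),
    (3.0, 3.1, False),
    (4.0, 4.2, False),
    (5.0, 2.0, True),   # flagged probeset: array signal may come from a paralog
    (1.5, 4.5, True),
]

r_all = pearson([g[0] for g in genes], [g[1] for g in genes])
clean = [g for g in genes if not g[2]]
r_clean = pearson([g[0] for g in clean], [g[1] for g in clean])
print(f"all probesets: r = {r_all:.2f}; unflagged only: r = {r_clean:.2f}")
```

Filtering on a flag assigned from probe sequence alone, before looking at the expression data, avoids selecting genes because they happen to agree.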
Pap, Domonkos; Sziksz, Erna; Kiss, Zoltán; Rokonay, Réka; Veres-Székely, Apor; Lippai, Rita; Takács, István Márton; Kis, Éva; Fekete, Andrea; Reusz, György; Szabó, Attila J; Vannay, Adam
2017-01-01
Congenital obstructive nephropathy (CON) is the main cause of pediatric chronic kidney diseases leading to renal fibrosis. The high morbidity and limited treatment options of CON call for a better understanding of the underlying molecular mechanisms. To identify the differentially expressed genes, microarray analysis was performed on the kidney samples of neonatal rats that underwent unilateral ureteral obstruction (UUO). Microarray results were then validated by real-time RT-PCR and bioinformatics analysis was carried out to identify the relevant genes, functional groups and pathways involved in the pathomechanism of CON. Renal expression of matrix metalloproteinase (MMP)-12 and interleukin (IL)-24 was evaluated by real-time RT-PCR, flow cytometry and immunohistochemical analysis. The effect of the main profibrotic factors on the expression of MMP-12 and IL-24 was investigated on HK-2 and HEK-293 cell lines. Finally, the effect of IL-24 treatment on the expression of pro-inflammatory cytokines and MMPs was tested in vitro. Microarray analysis revealed 880 transcripts showing >2.0-fold change following UUO, enriched mainly in immune response related processes. The most up-regulated genes were MMPs and members of the IL-20 cytokine subfamily, including MMP-3, MMP-7, MMP-12, IL-19 and IL-24. We found that while TGF-β treatment inhibits the expression of MMP-12 and IL-24, H2O2 or PDGF-B treatment induces the epithelial expression of MMP-12. We demonstrated that IL-24 treatment decreases the expression of IL-6 and MMP-3 in the renal epithelial cells. This study provides an extensive view of UUO induced changes in the gene expression profile of the developing kidney and describes novel molecules, which may play a significant role in the pathomechanism of CON. © 2017 The Author(s). Published by S. Karger AG, Basel.
Kesherwani, Varun; Shahshahan, Hamid R.
2017-01-01
Although diabetes mellitus (DM) causes cardiomyopathy and exacerbates heart failure, the underlying molecular mechanisms for diabetic cardiomyopathy/heart failure are poorly understood. Insulin2 mutant (Ins2+/-) Akita is a mouse model of T1DM, which manifests cardiac dysfunction. However, molecular changes at cardiac transcriptome level that lead to cardiomyopathy remain unclear. To understand the molecular changes in the heart of diabetic Akita mice, we profiled cardiac transcriptome of Ins2+/- Akita and Ins2+/+ control mice using next generation sequencing (NGS) and microarray, and determined the implications of differentially expressed genes on various heart failure signaling pathways using Ingenuity pathway (IPA) analysis. First, we validated hyperglycemia, increased cardiac fibrosis, and cardiac dysfunction in twelve-week male diabetic Akita. Then, we analyzed the transcriptome levels in the heart. NGS analyses on Akita heart revealed 137 differentially expressed transcripts, where Bone Morphogenic Protein-10 (BMP10) was the most upregulated and hairy and enhancer of split-related (HELT) was the most downregulated gene. Moreover, twelve long non-coding RNAs (lncRNAs) were upregulated. The microarray analyses on Akita heart showed 351 differentially expressed transcripts, where vomeronasal-1 receptor-180 (Vmn1r180) was the most upregulated and WD Repeat Domain 83 Opposite Strand (WDR83OS) was the most downregulated gene. Further, miR-101c and H19 lncRNA were upregulated but Neat1 lncRNA was downregulated in Akita heart. Eleven common genes were upregulated in Akita heart in both NGS and microarray analyses. IPA analyses revealed the role of these differentially expressed genes in key signaling pathways involved in diabetic cardiomyopathy. Our results provide a platform to initiate focused future studies by targeting these genes and/or non-coding RNAs, which are differentially expressed in Akita hearts and are involved in diabetic cardiomyopathy. PMID:28837672
Ni, Ming; Ye, Fuqiang; Zhu, Juanjuan; Li, Zongwei; Yang, Shuai; Yang, Bite; Han, Lu; Wu, Yongge; Chen, Ying; Li, Fei; Wang, Shengqi; Bo, Xiaochen
2014-12-01
Numerous public microarray datasets are valuable resources for the scientific communities. Several online tools have made great steps to use these data by querying related datasets with users' own gene signatures or expression profiles. However, dataset annotation and result exhibition still need to be improved. ExpTreeDB is a database that allows for queries on human and mouse microarray experiments from Gene Expression Omnibus with gene signatures or profiles. Compared with similar applications, ExpTreeDB pays more attention to dataset annotations and result visualization. We introduced a multiple-level annotation system to depict and organize original experiments. For example, a tamoxifen-treated cell line experiment is hierarchically annotated as 'agent→drug→estrogen receptor antagonist→tamoxifen'. Consequently, retrieved results are exhibited by an interactive tree-structured graphics, which provide an overview for related experiments and might enlighten users on key items of interest. The database is freely available at http://biotech.bmi.ac.cn/ExpTreeDB. Web site is implemented in Perl, PHP, R, MySQL and Apache. © The Author 2014. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
RubisCO Gene Clusters Found in a Metagenome Microarray from Acid Mine Drainage
Guo, Xue; Yin, Huaqun; Cong, Jing; Dai, Zhimin; Liang, Yili
2013-01-01
The enzyme responsible for carbon dioxide fixation in the Calvin cycle, ribulose-1,5-bisphosphate carboxylase/oxygenase (RubisCO), is always detected as a phylogenetic marker to analyze the distribution and activity of autotrophic bacteria. However, such an approach provides no indication as to the significance of genomic content and organization. Horizontal transfers of RubisCO genes occurring in eubacteria and plastids may seriously affect the credibility of this approach. Here, we presented a new method to analyze the diversity and genomic content of RubisCO genes in acid mine drainage (AMD). A metagenome microarray containing 7,776 large-insertion fosmids was constructed to quickly screen genome fragments containing RubisCO form I large-subunit genes (cbbL). Forty-six cbbL-containing fosmids were detected, and six fosmids were fully sequenced. To evaluate the reliability of the metagenome microarray and understand the microbial community in AMD, the diversities of cbbL and the 16S rRNA gene were analyzed. Fosmid sequences revealed that the form I RubisCO gene cluster could be subdivided into form IA and IB RubisCO gene clusters in AMD, because of significant divergences in molecular phylogenetics and conservative genomic organization. Interestingly, the form I RubisCO gene cluster coexisted with the form II RubisCO gene cluster in one fosmid genomic fragment. Phylogenetic analyses revealed that horizontal transfers of RubisCO genes may occur widely in AMD, which makes the evolutionary history of RubisCO difficult to reconcile with organismal phylogeny. PMID:23335778
Initiation of follicular atresia: gene networks during early atresia in pig ovaries.
Zhang, Jinbi; Liu, Yang; Yao, Wang; Li, Qifa; Liu, Hong-Lin; Pan, Zengxiang
2018-05-09
In mammals, more than 99% of ovarian follicles undergo a degenerative process known as atresia. The molecular events involved in atresia initiation remain incompletely understood. The objective of this study was to analyze differential gene expression profiles of medium antral ovarian follicles during early atresia in pigs. The transcriptome evaluation was performed on cDNA microarrays using healthy and early atretic follicle samples and was validated by quantitative PCR. Annotation analysis applying the current database (Sus scrofa 11.1) revealed 450 significantly differentially expressed genes between healthy and early atretic follicles. Among them, 142 were significantly up-regulated in the early atretic group relative to the healthy group and 308 were down-regulated. Similar expression trends were observed between microarray data and qRT-PCR confirmation, which indicated the reliability of the microarray analysis. Further analysis of the differentially expressed genes revealed the most significantly affected biological functions during early atresia, including blood vessel development, regulation of DNA-templated transcription in response to stress and negative regulation of cell adhesion. The pathway and interaction analysis suggested that atresia initiation is associated with 1) a crosstalk of cell apoptosis, autophagy, and ferroptosis rather than change of typical apoptosis markers, 2) dramatic shift of steroidogenic enzymes, 3) deficient glutathione metabolism, and 4) vascular degeneration. The novel gene candidates and pathways identified in the current study will lead to a comprehensive view of the molecular regulation of ovarian follicular atresia and a new understanding of atresia initiation.
2011-01-01
Background Although many biological databases are applying semantic web technologies, meaningful biological hypothesis testing cannot be easily achieved. Database-driven high throughput genomic hypothesis testing requires the capability both to obtain semantically relevant experimental data and to perform relevant statistical testing on the retrieved data. Tissue Microarray (TMA) data are semantically rich and contain many biologically important hypotheses waiting for high throughput conclusions. Methods An application-specific ontology was developed for managing TMA and DNA microarray databases by semantic web technologies. Data were represented as Resource Description Framework (RDF) according to the framework of the ontology. Applications for hypothesis testing (Xperanto-RDF) for TMA data were designed and implemented by (1) formulating the syntactic and semantic structures of the hypotheses derived from TMA experiments, (2) formulating SPARQLs to reflect the semantic structures of the hypotheses, and (3) performing statistical tests with the result sets returned by the SPARQLs. Results When a user designs a hypothesis in Xperanto-RDF and submits it, the hypothesis can be tested against TMA experimental data stored in Xperanto-RDF. When we evaluated four previously validated hypotheses as an illustration, all the hypotheses were supported by Xperanto-RDF. Conclusions We demonstrated the utility of high throughput biological hypothesis testing. We believe that preliminary investigation before performing highly controlled experiments can benefit from this approach. PMID:21342584
Dye bias correction in dual-labeled cDNA microarray gene expression measurements.
Rosenzweig, Barry A; Pine, P Scott; Domon, Olen E; Morris, Suzanne M; Chen, James J; Sistare, Frank D
2004-01-01
A significant limitation to the analytical accuracy and precision of dual-labeled spotted cDNA microarrays is the signal error due to dye bias. Transcript-dependent dye bias may be due to gene-specific differences of incorporation of two distinctly different chemical dyes and the resultant differential hybridization efficiencies of these two chemically different targets for the same probe. Several approaches were used to assess and minimize the effects of dye bias on fluorescent hybridization signals and maximize the experimental design efficiency of a cell culture experiment. Dye bias was measured at the individual transcript level within each batch of simultaneously processed arrays by replicate dual-labeled split-control sample hybridizations and accounted for a significant component of fluorescent signal differences. This transcript-dependent dye bias alone could introduce unacceptably high numbers of both false-positive and false-negative signals. We found that within a given set of concurrently processed hybridizations, the bias is remarkably consistent and therefore measurable and correctable. The additional microarrays and reagents required for paired technical replicate dye-swap corrections commonly performed to control for dye bias could be costly to end users. Incorporating split-control microarrays within a set of concurrently processed hybridizations to specifically measure dye bias can eliminate the need for technical dye swap replicates and reduce microarray and reagent costs while maintaining experimental accuracy and technical precision. These data support a practical and more efficient experimental design to measure and mathematically correct for dye bias. PMID:15033598
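As a rough illustration of measuring dye bias with split-control hybridizations and correcting for it, the sketch below assumes one simple scheme: the per-transcript bias is the mean log2(red/green) ratio across split-control arrays (where the true ratio is 1), and it is subtracted from the log ratios of experimental arrays processed in the same batch. The abstract does not give the authors' actual formula, so this subtraction, the function names, and the example intensities are assumptions.

```python
from math import log2
from statistics import mean


def estimate_dye_bias(split_control_arrays):
    """Mean log2(red/green) per gene across split-control arrays.

    Each array is a dict mapping gene -> (red_intensity, green_intensity).
    Both channels carry the same sample, so any non-zero ratio is treated as bias.
    """
    genes = split_control_arrays[0].keys()
    return {
        gene: mean(log2(arr[gene][0] / arr[gene][1]) for arr in split_control_arrays)
        for gene in genes
    }


def correct_log_ratios(array, bias):
    """Subtract the estimated per-gene bias from an experimental array's log ratios."""
    return {gene: log2(red / green) - bias[gene] for gene, (red, green) in array.items()}


# Invented intensities for two split-control arrays and one experimental array.
controls = [
    {"geneA": (200.0, 100.0), "geneB": (100.0, 100.0)},
    {"geneA": (400.0, 200.0), "geneB": (50.0, 50.0)},
]
experiment = {"geneA": (800.0, 100.0), "geneB": (100.0, 400.0)}

bias = estimate_dye_bias(controls)
corrected = correct_log_ratios(experiment, bias)
```

Because the bias is estimated per gene and per batch, the correction should only be applied to arrays hybridized concurrently with the split controls, which matches the abstract's observation that the bias is consistent within a batch.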
Casel, Pierrot; Moreews, François; Lagarrigue, Sandrine; Klopp, Christophe
2009-07-16
Microarrays are a powerful technology that enables the monitoring of tens of thousands of genes in a single experiment. Most microarrays now use oligo-sets. The design of the oligo-nucleotides is time-consuming and error-prone. Genome-wide microarray oligo-sets are designed using as large a set of transcripts as possible in order to monitor as many genes as possible. Depending on the genome sequencing state and on the assembly state, the knowledge of the existing transcripts can be very different. This knowledge evolves with the different genome builds and gene builds. Once the design is done, the microarrays are often used for several years. The biologists working in EADGENE expressed the need for up-to-date annotation files for the oligo-sets they share, including information about the orthologous genes of model species, the Gene Ontology, the corresponding pathways and the chromosomal location. The results of SigReannot on a chicken micro-array used in the EADGENE project, compared to the initial annotations, show that 23% of the oligo-nucleotide gene annotations were not confirmed, 2% were modified and 1% were added. The value of this up-to-date annotation procedure is demonstrated through the analysis of previously published real data. SigReannot uses the oligo-nucleotide design procedure criteria to validate the probe-gene link and the Ensembl transcripts as reference for annotation. It therefore produces a high quality annotation based on reference gene sets.
Genome-Wide Identification, Evolution and Expression Analysis of mTERF Gene Family in Maize
Zhao, Yanxin; Cai, Manjun; Zhang, Xiaobo; Li, Yurong; Zhang, Jianhua; Zhao, Hailiang; Kong, Fei; Zheng, Yonglian; Qiu, Fazhan
2014-01-01
Plant mitochondrial transcription termination factor (mTERF) genes comprise a large family with important roles in regulating organelle gene expression. In this study, a comprehensive database search yielded 31 potential mTERF genes in maize (Zea mays L.), most of which were targeted to mitochondria or chloroplasts. Maize mTERF genes were divided into nine main groups based on phylogenetic analysis, and group IX represented the mitochondria- and species-specific clade that diverged from other groups. Tandem and segmental duplication both contributed to the expansion of the mTERF gene family in the maize genome. Comprehensive expression analysis of these genes, using microarray data and RNA-seq data, revealed that these genes exhibit a variety of expression patterns. Environmental stimulus experiments revealed differential up- or down-regulation of maize mTERF genes in seedlings exposed to light/dark, salts and plant hormones, suggesting various important roles of maize mTERF genes in light acclimation and stress-related responses. These results will be useful for elucidating the roles of mTERF genes in the growth, development and stress response of maize. PMID:24718683
Direct Detection of Drug-Resistant Hepatitis B Virus in Serum Using a Dendron-Modified Microarray
Kim, Doo Hyun; Kang, Hong Seok; Hur, Seong-Suk; Sim, Seobo; Ahn, Sung Hyun; Park, Yong Kwang; Park, Eun-Sook; Lee, Ah Ram; Park, Soree; Kwon, So Young; Lee, Jeong-Hoon
2018-01-01
Background/Aims Direct sequencing is the gold standard for the detection of drug-resistance mutations in hepatitis B virus (HBV); however, this procedure is time-consuming, labor-intensive, and difficult to adapt to high-throughput screening. In this study, we aimed to develop a dendron-modified DNA microarray for the detection of genotypic resistance mutations and evaluate its efficiency. Methods The specificity, sensitivity, and selectivity of dendron-modified slides for the detection of representative drug-resistance mutations were evaluated and compared to those of conventional slides. The diagnostic accuracy was validated using sera obtained from 13 patients who developed viral breakthrough during lamivudine, adefovir, or entecavir therapy and compared with the accuracy of restriction fragment mass polymorphism and direct sequencing data. Results The dendron-modified slides significantly outperformed the conventional microarray slides and were able to detect HBV DNA at a very low level (1 copy/μL). Notably, HBV mutants could be detected in the chronic hepatitis B patient sera without virus purification. The validation of our data revealed that this technique is fully compatible with sequencing data of drug-resistant HBV. Conclusions We developed a novel diagnostic technique for the simultaneous detection of several drug-resistance mutations using a dendron-modified DNA microarray. This technique can be directly applied to sera from chronic hepatitis B patients who show resistance to several nucleos(t)ide analogues. PMID:29271185
A perspective on DNA microarray technology in food and nutritional science.
Kato, Hisanori; Saito, Kenji; Kimura, Takeshi
2005-09-01
The functions of nutrients and other foods have been revealed at the level of gene regulation. The advent of DNA microarray technology has enabled us to analyze the body's response to these factors in a much more holistic manner than before. This review is intended to overview the present status of this DNA microarray technology, hoping to provide food and nutrition scientists, especially those who are planning to introduce this technology, with hints and suggestions. The number of papers examining transcriptomics analysis in food and nutrition science has expanded over the last few years. The effects of some dietary conditions and administration of specific nutrients or food factors are studied in various animal models and cultured cells. The target food components range from macronutrients and micronutrients to other functional food factors. Such studies have already yielded fruitful results, which include discovery of novel functions of a food, uncovering hitherto unknown mechanisms of action, and analyses of food safety. The potency of DNA microarray technology in food and nutrition science is broadly recognized. This technique will surely continue to provide researchers and the public with valuable information on the beneficial and adverse effects of food factors. It should also be acknowledged, however, that there remain problems such as standardization of the data and sharing of the results among researchers in this field.
Kitchen, Robert R; Sabine, Vicky S; Sims, Andrew H; Macaskill, E Jane; Renshaw, Lorna; Thomas, Jeremy S; van Hemert, Jano I; Dixon, J Michael; Bartlett, John M S
2010-02-24
Microarray technology is a popular means of producing whole genome transcriptional profiles; however, the high cost and scarcity of mRNA have led many studies to be based on the analysis of single samples. We exploit the design of the Illumina platform, specifically multiple arrays on each chip, to evaluate intra-experiment technical variation using repeated hybridisations of universal human reference RNA (UHRR) and duplicate hybridisations of primary breast tumour samples from a clinical study. A clear batch-specific bias was detected in the measured expressions of both the UHRR and clinical samples. This bias was found to persist following standard microarray normalisation techniques. However, when mean-centering or empirical Bayes batch-correction methods (ComBat) were applied to the data, inter-batch variation in the UHRR and clinical samples was greatly reduced. Batch-correction using ComBat improved the correlation between replicate UHRR samples by two orders of magnitude (from 0.9833-0.9991 to 0.9997-0.9999) and increased the consistency of the gene-lists from the duplicate clinical samples, from 11.6% in quantile normalised data to 66.4% in batch-corrected data. The use of UHRR as an inter-batch calibrator provided a small additional benefit when used in conjunction with ComBat, further increasing the agreement between the two gene-lists, up to 74.1%. In the interests of practicalities and cost, these results suggest that single samples can generate reliable data, but only after careful compensation for technical bias in the experiment. We recommend that investigators appreciate the propensity for such variation in the design stages of a microarray experiment and that the use of suitable correction methods become routine during the statistical analysis of the data.
2010-01-01
Background Microarray technology is a popular means of producing whole genome transcriptional profiles; however, the high cost and scarcity of mRNA have led many studies to be based on the analysis of single samples. We exploit the design of the Illumina platform, specifically multiple arrays on each chip, to evaluate intra-experiment technical variation using repeated hybridisations of universal human reference RNA (UHRR) and duplicate hybridisations of primary breast tumour samples from a clinical study. Results A clear batch-specific bias was detected in the measured expressions of both the UHRR and clinical samples. This bias was found to persist following standard microarray normalisation techniques. However, when mean-centering or empirical Bayes batch-correction methods (ComBat) were applied to the data, inter-batch variation in the UHRR and clinical samples was greatly reduced. Batch-correction using ComBat improved the correlation between replicate UHRR samples by two orders of magnitude (from 0.9833-0.9991 to 0.9997-0.9999) and increased the consistency of the gene-lists from the duplicate clinical samples, from 11.6% in quantile normalised data to 66.4% in batch-corrected data. The use of UHRR as an inter-batch calibrator provided a small additional benefit when used in conjunction with ComBat, further increasing the agreement between the two gene-lists, up to 74.1%. Conclusion In the interests of practicalities and cost, these results suggest that single samples can generate reliable data, but only after careful compensation for technical bias in the experiment. We recommend that investigators appreciate the propensity for such variation in the design stages of a microarray experiment and that the use of suitable correction methods become routine during the statistical analysis of the data. PMID:20181233
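Of the two correction approaches named above, mean-centering is the simpler one and can be sketched in a few lines. The sketch below is not ComBat (which additionally shrinks batch estimates with an empirical Bayes model) and is not the authors' code; the data layout, function name, and example values are assumptions chosen for illustration.

```python
from collections import defaultdict
from statistics import mean


def mean_center_by_batch(values, batches):
    """Subtract each gene's within-batch mean from every sample in that batch.

    values:  dict sample -> dict gene -> log-scale expression
    batches: dict sample -> batch label
    """
    samples_in_batch = defaultdict(list)
    for sample, batch in batches.items():
        samples_in_batch[batch].append(sample)

    corrected = {sample: {} for sample in values}
    for samples in samples_in_batch.values():
        for gene in values[samples[0]]:
            batch_mean = mean(values[s][gene] for s in samples)
            for s in samples:
                corrected[s][gene] = values[s][gene] - batch_mean
    return corrected


# Invented log2 expression values: batch B reads 3 units higher than batch A.
expression = {
    "s1": {"g": 8.0}, "s2": {"g": 10.0},   # batch A
    "s3": {"g": 11.0}, "s4": {"g": 13.0},  # batch B
}
batch_of = {"s1": "A", "s2": "A", "s3": "B", "s4": "B"}
centered = mean_center_by_batch(expression, batch_of)
```

Mean-centering removes a constant per-gene offset between batches but also removes any real biological difference that happens to be confounded with batch, which is one reason balanced designs matter at the planning stage.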
NASA Astrophysics Data System (ADS)
Parro, V.; Rivas, L. A.; Rodríguez-Manfredi, J. A.; Blanco, Y.; de Diego-Castilla, G.; Cruz-Gil, P.; Moreno-Paz, M.; García-Villadangos, M.; Compostizo, C.; Herrero, P. L.
2009-04-01
Immunosensors have been used extensively for many years in environmental monitoring. Different technological platforms allow new biosensor designs and implementations. We have reported (Rivas et al., 2008) a shotgun approach for antibody production for biomarker detection in astrobiology and environmental monitoring, the production of 150 new polyclonal antibodies against microbial strains and environmental extracts, and the construction and validation of an antibody microarray (LDCHIP200, for "Life Detector Chip") containing 200 different antibodies. We have successfully used the LDCHIP200 for the detection of biological polymers in extreme environments in different parts of the world (e.g., a deep South African mine, Antarctica's Dry Valleys, Yellowstone, Iceland, and Rio Tinto). Clustering analysis associated similar immunopatterns with samples from apparently very different environments, indicating that they indeed share similar universal biomarkers. Besides revealing the presence of certain biomolecules, the redundancy in the number of antibodies against different target biomarkers yields a sample-specific immuno-profile, an "immuno-fingerprint", which may itself constitute an indirect biosignature. We will present a case study of immunoprofiling different iron-sulfur-rich as well as phyllosilicate-rich samples along the Rio Tinto river banks. Based on protein microarray technology, we designed and built the concept instrument called SOLID (for "Signs Of LIfe Detector"; Parro et al., 2005; 2008a, b; http://cab.inta.es/solid) for automatic in situ analysis of soil samples and detection of molecular biomarkers. A field prototype, SOLID2, was successfully tested for the analysis of ground core samples during the 2005 "MARTE" campaign of a Mars drilling simulation experiment by a sandwich microarray immunoassay (Parro et al., 2008b). We will show the new version of the instrument (SOLID3), which is able to perform both sandwich and competitive immunoassays.
SOLID3 consists of two separate functional units: a Sample Preparation Unit (SPU), for ten different extractions by ultrasonication, and a Sample Analysis Unit (SAU), for fluorescent immunoassays. The SAU consists of ten different flow cells, each of which holds one antibody microarray (up to 2000 spots), and is equipped with a uniquely designed optical package for fluorescent detection. We demonstrate the performance of SOLID3 for the detection of compounds across a broad range of molecular sizes, from amino acids, peptides and proteins to whole cells and spores, with sensitivities at the ppb level. References Parro, V., et al., 2005. Planetary and Space Science 53: 729-737. Parro, V., et al., 2008a. Space Science Reviews 135: 293-311. Parro, V., et al., 2008b. Astrobiology 8:987-99. Rivas, L. A., et al., 2008. Analytical Chemistry 80: 7970-7979.
Microarray missing data imputation based on a set theoretic framework and biological knowledge.
Gan, Xiangchao; Liew, Alan Wee-Chung; Yan, Hong
2006-01-01
Gene expressions measured using microarrays usually suffer from the missing value problem. However, in many data analysis methods, a complete data matrix is required. Although existing missing value imputation algorithms have shown good performance in dealing with missing values, they also have their limitations. For example, some algorithms have good performance only when strong local correlation exists in data while some provide the best estimate when data is dominated by global structure. In addition, these algorithms do not take into account any biological constraint in their imputation. In this paper, we propose a set theoretic framework based on projection onto convex sets (POCS) for missing data imputation. POCS allows us to incorporate different types of a priori knowledge about missing values into the estimation process. The main idea of POCS is to formulate every piece of prior knowledge into a corresponding convex set and then use a convergence-guaranteed iterative procedure to obtain a solution in the intersection of all these sets. In this work, we design several convex sets, taking into consideration the biological characteristics of the data: the first set mainly exploits the local correlation structure among genes in microarray data, while the second set captures the global correlation structure among arrays. The third set (actually a series of sets) exploits the biological phenomenon of synchronization loss in microarray experiments. In cyclic systems, synchronization loss is a common phenomenon and we construct a series of sets based on this phenomenon for our POCS imputation algorithm. Experiments show that our algorithm can achieve a significant reduction of error compared to the KNNimpute, SVDimpute and LSimpute methods.
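The core POCS idea described above, alternating projections until the estimate lies in the intersection of all constraint sets, can be illustrated with a toy example. The two sets used here (a box bounding each value and a hyperplane fixing the sum of the values) are stand-ins chosen because their projections have simple closed forms; they are not the gene-correlation, array-correlation, or synchronization-loss sets that the paper constructs, and all names and numbers are hypothetical.

```python
def project_box(x, lo, hi):
    """Project onto the convex set {x : lo <= x_i <= hi for every i}."""
    return [min(max(v, lo), hi) for v in x]


def project_sum(x, target):
    """Project onto the convex set {x : sum(x) == target} (a hyperplane)."""
    shift = (target - sum(x)) / len(x)
    return [v + shift for v in x]


def pocs(x, lo, hi, target, iterations=200):
    """Alternate the two projections; the final step enforces the box exactly."""
    for _ in range(iterations):
        x = project_sum(x, target)
        x = project_box(x, lo, hi)
    return x


# Start from an arbitrary guess that violates both constraints.
estimate = pocs([0.0, 5.0, 10.0], lo=0.0, hi=4.0, target=9.0)
```

In an imputation setting each piece of prior knowledge would contribute its own projection function, and the loop would cycle through all of them while leaving the observed entries untouched.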
Large-scale atlas of microarray data reveals biological landscape of gene expression in Arabidopsis
USDA-ARS's Scientific Manuscript database
Transcriptome datasets from thousands of samples of the model plant Arabidopsis thaliana have been collectively generated by multiple individual labs. Although integration and meta-analysis of these samples has become routine in the plant research community, it is often hampered by the lack of metad...
Characterization of genetic variability of Venezuelan equine encephalitis viruses
Gardner, Shea N.; McLoughlin, Kevin; Be, Nicholas A.; ...
2016-04-07
Venezuelan equine encephalitis virus (VEEV) is a mosquito-borne alphavirus that has caused large outbreaks of severe illness in both horses and humans. New approaches are needed to rapidly infer the origin of a newly discovered VEEV strain, estimate its equine amplification and resultant epidemic potential, and predict human virulence phenotype. We performed whole genome single nucleotide polymorphism (SNP) analysis of all available VEE antigenic complex genomes, verified that a SNP-based phylogeny accurately captured the features of a phylogenetic tree based on multiple sequence alignment, and developed a high resolution genome-wide SNP microarray. We used the microarray to analyze a broad panel of VEEV isolates, found excellent concordance between array- and sequence-based SNP calls, genotyped unsequenced isolates, and placed them on a phylogeny with sequenced genomes. The microarray successfully genotyped VEEV directly from tissue samples of an infected mouse, bypassing the need for viral isolation, culture and genomic sequencing. Lastly, we identified genomic variants associated with serotypes and host species, revealing a complex relationship between genotype and phenotype.
Quantification of the activity of biomolecules in microarrays obtained by direct laser transfer.
Dinca, V; Ranella, A; Farsari, M; Kafetzopoulos, D; Dinescu, M; Popescu, A; Fotakis, C
2008-10-01
The direct-writing technique laser-induced forward transfer has been employed for the micro-array printing of liquid solutions of the enzyme horseradish peroxidase (HRP) and the protein Titin on nitrocellulose solid surfaces. The effect of two UV laser pulse lengths, femtosecond and nanosecond, has been studied in relation to maintaining the activity of the transferred biomolecules. The quantification of the active biomolecules after transfer has been carried out using Bradford assay, quantitative colorimetric enzymatic assay and fluorescence techniques. Spectrophotometric measurements of the HRP and the Titin activity as well as chromatogenic and fluorescence assay studies have revealed a connection between the properties of the deposited, biologically active biomolecules, the experimental conditions and the target composition. The bioassays have shown that up to 78% of the biomolecules remained active after femtosecond laser transfer, while this value fell to 54% after nanosecond laser transfer. The addition of up to 70% glycerol to the solution to be transferred has contributed to the stabilization of the micro-array patterns and the increase of their resolution.
A low density microarray method for the identification of human papillomavirus type 18 variants.
Meza-Menchaca, Thuluz; Williams, John; Rodríguez-Estrada, Rocío B; García-Bravo, Aracely; Ramos-Ligonio, Ángel; López-Monteon, Aracely; Zepeda, Rossana C
2013-09-26
We describe a novel microarray-based method for the screening of oncogenic human papillomavirus 18 (HPV-18) molecular variants. Because sequencing methodology may underestimate samples containing more than one variant, we designed a specific and sensitive stacking DNA hybridization assay. This technology can be used to discriminate between three possible phylogenetic branches of HPV-18. Probes were attached covalently on glass slides and hybridized with single-stranded DNA targets. Prior to hybridization with the probes, the target strands were pre-annealed with the three auxiliary contiguous oligonucleotides flanking the target sequences. HPV-18 positive cell lines and cervical samples were used to evaluate the performance of this HPV DNA microarray. Our results demonstrate that the HPV-18 variants hybridized specifically to probes, with no detection of unspecific signals. Specific probes successfully revealed detectable point mutations in these variants. The present DNA oligoarray system can be used as a reliable, sensitive and specific method for HPV-18 variant screening. Furthermore, this simple assay allows the use of inexpensive equipment, making it accessible in resource-poor settings.
A Low Density Microarray Method for the Identification of Human Papillomavirus Type 18 Variants
Meza-Menchaca, Thuluz; Williams, John; Rodríguez-Estrada, Rocío B.; García-Bravo, Aracely; Ramos-Ligonio, Ángel; López-Monteon, Aracely; Zepeda, Rossana C.
2013-01-01
We describe a novel microarray-based method for the screening of oncogenic human papillomavirus 18 (HPV-18) molecular variants. Because sequencing methodology may underestimate samples containing more than one variant, we designed a specific and sensitive stacking DNA hybridization assay. This technology can be used to discriminate between three possible phylogenetic branches of HPV-18. Probes were attached covalently on glass slides and hybridized with single-stranded DNA targets. Prior to hybridization with the probes, the target strands were pre-annealed with the three auxiliary contiguous oligonucleotides flanking the target sequences. HPV-18 positive cell lines and cervical samples were used to evaluate the performance of this HPV DNA microarray. Our results demonstrate that the HPV-18 variants hybridized specifically to probes, with no detection of unspecific signals. Specific probes successfully revealed detectable point mutations in these variants. The present DNA oligoarray system can be used as a reliable, sensitive and specific method for HPV-18 variant screening. Furthermore, this simple assay allows the use of inexpensive equipment, making it accessible in resource-poor settings. PMID:24077317
Wang, Yang; Weng, Tingting; Gou, Deming; Chen, Zhongming; Chintagari, Narendranath Reddy; Liu, Lin
2007-01-24
An important mechanism for gene regulation utilizes small non-coding RNAs called microRNAs (miRNAs). These small RNAs play important roles in tissue development, cell differentiation and proliferation, lipid and fat metabolism, stem cells, exocytosis, diseases and cancers. To date, relatively little is known about the functions of miRNAs in the lung outside of lung cancer. In this study, we utilized a rat miRNA microarray containing 216 miRNA probes, printed in-house, to detect the expression of miRNAs in the rat lung compared to the rat heart, brain, liver, kidney and spleen. Statistical analysis using Significance Analysis of Microarrays (SAM) and Tukey Honestly Significant Difference (HSD) revealed 2 miRNAs (miR-195 and miR-200c) expressed specifically in the lung and 9 miRNAs co-expressed in the lung and another organ. Twelve selected miRNAs were verified by Northern blot analysis. The identified lung-specific miRNAs from this work will facilitate functional studies of miRNAs during normal physiological and pathophysiological processes of the lung.
Hartmann, Luise; Stephenson, Christine F; Verkamp, Stephanie R; Johnson, Krystal R; Burnworth, Bettina; Hammock, Kelle; Brodersen, Lisa Eidenschink; de Baca, Monica E; Wells, Denise A; Loken, Michael R; Zehentner, Barbara K
2014-12-01
Array comparative genomic hybridization (aCGH) has become a powerful tool for analyzing hematopoietic neoplasms and identifying genome-wide copy number changes in a single assay. aCGH also has superior resolution compared with fluorescence in situ hybridization (FISH) or conventional cytogenetics. Integration of single nucleotide polymorphism (SNP) probes with microarray analysis allows additional identification of acquired uniparental disomy, a copy neutral aberration with known potential to contribute to tumor pathogenesis. However, a limitation of microarray analysis has been the inability to detect clonal heterogeneity in a sample. This study comprised 16 samples (acute myeloid leukemia, myelodysplastic syndrome, chronic lymphocytic leukemia, plasma cell neoplasm) with complex cytogenetic features and evidence of clonal evolution. We used an integrated manual peak reassignment approach combining analysis of aCGH and SNP microarray data for characterization of subclonal abnormalities. We compared array findings with results obtained from conventional cytogenetic and FISH studies. Clonal heterogeneity was detected in 13 of 16 samples by microarray on the basis of log2 values. Use of the manual peak reassignment analysis approach improved resolution of the sample's clonal composition and genetic heterogeneity in 10 of 13 (77%) patients. Moreover, in 3 patients, clonal disease progression was revealed by array analysis that was not evident by cytogenetic or FISH studies. Genetic abnormalities originating from separate clonal subpopulations can be identified and further characterized by combining aCGH and SNP hybridization results from 1 integrated microarray chip by use of the manual peak reassignment technique. Its clinical utility in comparison to conventional cytogenetic or FISH studies is demonstrated. © 2014 American Association for Clinical Chemistry.
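The link between log2 values and clonal composition mentioned above rests on simple mixture arithmetic, sketched below. This is not the manual peak reassignment procedure itself, which the abstract does not describe; it only shows the expected log2 ratio when a fraction of cells carries a copy number change, assuming a diploid baseline, a single abnormal copy number in the subclone, and no signal compression. Function names and example values are hypothetical.

```python
from math import log2


def expected_log2_ratio(fraction, copies_in_clone, baseline=2):
    """Expected log2 ratio when `fraction` of cells carry `copies_in_clone` copies."""
    mean_copies = (1 - fraction) * baseline + fraction * copies_in_clone
    return log2(mean_copies / baseline)


def clone_fraction(log2_ratio, copies_in_clone, baseline=2):
    """Invert the mixture formula to estimate the fraction of abnormal cells."""
    if copies_in_clone == baseline:
        raise ValueError("a copy-neutral clone cannot be sized from the log2 ratio")
    return baseline * (2 ** log2_ratio - 1) / (copies_in_clone - baseline)


# A single-copy loss present in half of the cells, and the round trip back.
loss_ratio = expected_log2_ratio(0.5, copies_in_clone=1)
loss_fraction = clone_fraction(loss_ratio, copies_in_clone=1)
```

Two segments with clearly different implied fractions are a hint that the abnormalities sit in different subclones. Copy-neutral events such as acquired uniparental disomy leave the log2 ratio unchanged, which is why the SNP probes are needed to detect them.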
Microarray analysis of gene expression profiles in ripening pineapple fruits.
Koia, Jonni H; Moyle, Richard L; Botella, Jose R
2012-12-18
Pineapple (Ananas comosus) is a tropical fruit crop of significant commercial importance. Although the physiological changes that occur during pineapple fruit development have been well characterized, little is known about the molecular events that occur during the fruit ripening process. Understanding the molecular basis of pineapple fruit ripening will aid the development of new varieties via molecular breeding or genetic modification. In this study we developed a 9277 element pineapple microarray and used it to profile gene expression changes that occur during pineapple fruit ripening. Microarray analyses identified 271 unique cDNAs differentially expressed at least 1.5-fold between the mature green and mature yellow stages of pineapple fruit ripening. Among these 271 sequences, 184 share significant homology with genes encoding proteins of known function, 53 share homology with genes encoding proteins of unknown function and 34 share no significant homology with any database accession. Of the 237 pineapple sequences with homologs, 160 were up-regulated and 77 were down-regulated during pineapple fruit ripening. DAVID Functional Annotation Cluster (FAC) analysis of all 237 sequences with homologs revealed confident enrichment scores for redox activity, organic acid metabolism, metalloenzyme activity, glycolysis, vitamin C biosynthesis, antioxidant activity and cysteine peptidase activity, indicating the functional significance and importance of these processes and pathways during pineapple fruit development. Quantitative real-time PCR analysis validated the microarray expression results for nine out of ten genes tested. This is the first report of a microarray based gene expression study undertaken in pineapple. Our bioinformatic analyses of the transcript profiles have identified a number of genes, processes and pathways with putative involvement in the pineapple fruit ripening process. 
This study extends our knowledge of the molecular basis of pineapple fruit ripening and non-climacteric fruit ripening in general.
Microarray analysis of gene expression profiles in ripening pineapple fruits
2012-01-01
Background Pineapple (Ananas comosus) is a tropical fruit crop of significant commercial importance. Although the physiological changes that occur during pineapple fruit development have been well characterized, little is known about the molecular events that occur during the fruit ripening process. Understanding the molecular basis of pineapple fruit ripening will aid the development of new varieties via molecular breeding or genetic modification. In this study we developed a 9277 element pineapple microarray and used it to profile gene expression changes that occur during pineapple fruit ripening. Results Microarray analyses identified 271 unique cDNAs differentially expressed at least 1.5-fold between the mature green and mature yellow stages of pineapple fruit ripening. Among these 271 sequences, 184 share significant homology with genes encoding proteins of known function, 53 share homology with genes encoding proteins of unknown function and 34 share no significant homology with any database accession. Of the 237 pineapple sequences with homologs, 160 were up-regulated and 77 were down-regulated during pineapple fruit ripening. DAVID Functional Annotation Cluster (FAC) analysis of all 237 sequences with homologs revealed confident enrichment scores for redox activity, organic acid metabolism, metalloenzyme activity, glycolysis, vitamin C biosynthesis, antioxidant activity and cysteine peptidase activity, indicating the functional significance and importance of these processes and pathways during pineapple fruit development. Quantitative real-time PCR analysis validated the microarray expression results for nine out of ten genes tested. Conclusions This is the first report of a microarray based gene expression study undertaken in pineapple. Our bioinformatic analyses of the transcript profiles have identified a number of genes, processes and pathways with putative involvement in the pineapple fruit ripening process. 
This study extends our knowledge of the molecular basis of pineapple fruit ripening and non-climacteric fruit ripening in general. PMID:23245313
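The 1.5-fold criterion used above to call cDNAs differentially expressed can be sketched as a simple ratio filter. The study's full selection procedure (replicates, normalisation, any statistical testing) is not given in the abstract, so this shows only the threshold step; the identifiers and intensities are invented.

```python
def differentially_expressed(green, yellow, min_fold=1.5):
    """Return cDNAs whose yellow/green ratio changes at least `min_fold` either way.

    green, yellow: dicts mapping cDNA id -> normalised intensity at each stage.
    """
    hits = {}
    for cdna, green_value in green.items():
        ratio = yellow[cdna] / green_value
        if ratio >= min_fold:
            hits[cdna] = ("up", ratio)
        elif ratio <= 1 / min_fold:
            hits[cdna] = ("down", ratio)
    return hits


# Invented intensities for three hypothetical cDNAs.
mature_green = {"c1": 100.0, "c2": 300.0, "c3": 100.0}
mature_yellow = {"c1": 150.0, "c2": 100.0, "c3": 120.0}
changed = differentially_expressed(mature_green, mature_yellow)
```

Testing the ratio against both `min_fold` and its reciprocal treats up- and down-regulation symmetrically, which is equivalent to thresholding the absolute log ratio.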
Mlakar, Vid; Strazisar, Mojca; Sok, Mihael; Glavac, Damjan
2010-06-01
The purpose of this study was to find novel gene(s) involved in the development of lung adenocarcinoma (AD). Using DNA microarrays, we identified 31 up-regulated and 8 down-regulated genes in 12 AD samples. Real-time PCR was used to measure expression of VIPR1 and SPP1 mRNA and possible losses or gains of genes in 32 AD samples. We describe significant up-regulation of the SPP1 gene, down-regulation of VIPR1, and losses of the VIPR1 gene. Our findings complement a proposed VIPR1 tumor suppressor role, in which deletions in the 3p22 chromosome region are an important mechanism leading to loss of the VIPR1 gene.
Identification of the TFII-I family target genes in the vertebrate genome.
Chimge, Nyam-Osor; Makeyev, Aleksandr V; Ruddle, Frank H; Bayarsaihan, Dashzeveg
2008-07-01
GTF2I and GTF2IRD1 encode members of the TFII-I transcription factor family and are prime candidate genes in Williams syndrome, a complex neurodevelopmental disorder. Our previous expression microarray studies implicated TFII-I proteins in the regulation of a number of genes critical in various aspects of cell physiology. Here, we combined bioinformatics and microarray results to identify TFII-I downstream targets in the vertebrate genome. These results were validated by chromatin immunoprecipitation and siRNA analysis. The collected evidence revealed the complexity of TFII-I-mediated processes that involve distinct regulatory networks. Altogether, these results lead to a better understanding of specific molecular events, some of which may be responsible for the Williams syndrome phenotype.
NCBI GEO: mining tens of millions of expression profiles--database and tools update.
Barrett, Tanya; Troup, Dennis B; Wilhite, Stephen E; Ledoux, Pierre; Rudnev, Dmitry; Evangelista, Carlos; Kim, Irene F; Soboleva, Alexandra; Tomashevsky, Maxim; Edgar, Ron
2007-01-01
The Gene Expression Omnibus (GEO) repository at the National Center for Biotechnology Information (NCBI) archives and freely disseminates microarray and other forms of high-throughput data generated by the scientific community. The database has a minimum information about a microarray experiment (MIAME)-compliant infrastructure that captures fully annotated raw and processed data. Several data deposit options and formats are supported, including web forms, spreadsheets, XML and Simple Omnibus Format in Text (SOFT). In addition to data storage, a collection of user-friendly web-based interfaces and applications are available to help users effectively explore, visualize and download the thousands of experiments and tens of millions of gene expression patterns stored in GEO. This paper provides a summary of the GEO database structure and user facilities, and describes recent enhancements to database design, performance, submission format options, data query and retrieval utilities. GEO is accessible at http://www.ncbi.nlm.nih.gov/geo/
Ikeda, Akemi; Kojima-Aikawa, Kyoko; Taniguchi, Naoyuki; Varón Silva, Daniel; Feizi, Ten; Seeberger, Peter H.; Yamaguchi, Yoshiki
2018-01-01
ZG16p is a soluble mammalian lectin that interacts with mannose and heparan sulfate. Here we describe detailed analyses of the interactions of human ZG16p with mycobacterial phosphatidylinositol mannosides (PIMs), using glycan microarray and NMR. Pathogen-related glycan microarray analysis identified phosphatidylinositol mono- and di-mannosides (PIM1 and PIM2) as novel ligand candidates of ZG16p. Saturation Transfer Difference (STD) NMR and transferred NOE experiments with chemically synthesized PIM glycans indicate that PIMs interact with ZG16p preferentially through their mannose residues. The binding site of PIMs was identified by chemical shift perturbation experiments using uniformly 15N-labeled ZG16p. NMR results combined with docking simulations suggest a binding mode for ZG16p and the PIM glycan, which should help in assessing the physiological role of ZG16p. PMID:25919894
MSL: A Measure to Evaluate Three-dimensional Patterns in Gene Expression Data
Gutiérrez-Avilés, David; Rubio-Escudero, Cristina
2015-01-01
Microarray technology is widely used in biological research because it can monitor RNA concentration levels. Analyzing the data generated is a computational challenge because of the characteristics of these data. Clustering techniques are widely applied to create groups of genes that exhibit similar behavior. Biclustering relaxes the constraints for grouping, allowing genes to be evaluated only under a subset of the conditions. Triclustering was introduced for the analysis of longitudinal experiments in which the genes are evaluated under certain conditions at several time points. These triclusters provide hidden information in the form of behavior patterns from temporal experiments with microarrays relating subsets of genes, experimental conditions, and time points. We present an evaluation measure for triclusters called Multi Slope Measure, based on the similarity among the angles of the slopes of each profile defined by the genes, conditions, and times of the tricluster. PMID:26124630
TIPMaP: a web server to establish transcript isoform profiles from reliable microarray probes.
Chitturi, Neelima; Balagannavar, Govindkumar; Chandrashekar, Darshan S; Abinaya, Sadashivam; Srini, Vasan S; Acharya, Kshitish K
2013-12-27
Standard 3' Affymetrix gene expression arrays have contributed a significantly higher volume of existing gene expression data than other microarray platforms. These arrays were designed to identify differentially expressed genes, but not their alternatively spliced transcript forms. No resource can currently identify the expression patterns of specific mRNA forms using these microarray data, even though it is possible to do this. We report a web server for expression profiling of alternatively spliced transcripts using microarray data sets from 31 standard 3' Affymetrix arrays for human, mouse and rat species. The tool has been experimentally validated for mRNAs transcribed or not-detected in a human disease condition (non-obstructive azoospermia, a male infertility condition). About 4000 gene expression datasets were downloaded from a public repository. 'Good probes' with complete coverage and identity to the latest reference transcript sequences were first identified. Using them, 'Transcript specific probe-clusters' were derived for each platform and used to identify the expression status of possible transcripts. The web server can lead the user to datasets corresponding to specific tissues and conditions via identifiers of the microarray studies or hybridizations, keywords, official gene symbols or reference transcript identifiers. It can identify, in the tissues and conditions of interest, about 40% of known transcripts as 'transcribed', 'not-detected' or 'differentially regulated'. Corresponding additional information for probes, genes, transcripts and proteins can be viewed too. We identified the expression of transcripts in a specific clinical condition and validated a few of these transcripts by experiments (using reverse transcription followed by polymerase chain reaction). The experimental observations agreed with the web server results more often than they contradicted them. The tool is accessible at http://resource.ibab.ac.in/TIPMaP.
The newly developed online tool forms a reliable means for identification of alternatively spliced transcript-isoforms that may be differentially expressed in various tissues, cell types or physiological conditions. Thus, by making better use of existing data, TIPMaP avoids the dependence on precious tissue-samples, in experiments with a goal to establish expression profiles of alternative splice forms--at least in some cases.
Dielectrophoretic manipulation and separation of microparticles using microarray dot electrodes.
Yafouz, Bashar; Kadri, Nahrizul Adib; Ibrahim, Fatimah
2014-04-03
This paper introduces a dielectrophoretic system for the manipulation and separation of microparticles. The system is composed of five layers and utilizes microarray dot electrodes. We validated our system by conducting size-dependent manipulation and separation experiments on 1, 5 and 15 μm polystyrene particles. Our findings confirm the capability of the proposed device to rapidly and efficiently manipulate and separate microparticles of various dimensions, utilizing positive and negative dielectrophoresis (DEP) effects. Larger size particles were repelled and concentrated in the center of the dot by negative DEP, while the smaller sizes were attracted and collected by the edge of the dot by positive DEP.
Sequence verification as quality-control step for production of cDNA microarrays.
Taylor, E; Cogdell, D; Coombes, K; Hu, L; Ramdas, L; Tabor, A; Hamilton, S; Zhang, W
2001-07-01
To generate cDNA arrays in our core laboratory, we amplified about 2300 PCR products from a human, sequence-verified cDNA clone library. As a quality-control step, we sequenced the PCR products immediately before printing. The sequence information was used to search the GenBank database to confirm the identities. Although these clones were previously sequence verified by the company, we found that only 79% of the clones matched the original database after handling. Our experience strongly indicates the necessity to sequence verify the clones at the final stage before printing on microarray slides and to modify the gene list accordingly.
Spotting effect in microarray experiments
Mary-Huard, Tristan; Daudin, Jean-Jacques; Robin, Stéphane; Bitton, Frédérique; Cabannes, Eric; Hilson, Pierre
2004-01-01
Background Microarray data must be normalized because they suffer from multiple biases. We have identified a source of spatial experimental variability that significantly affects data obtained with Cy3/Cy5 spotted glass arrays. It yields a periodic pattern altering both signal (Cy3/Cy5 ratio) and intensity across the array. Results Using the variogram, a geostatistical tool, we characterized the observed variability, called here the spotting effect because it most probably arises during steps in the array printing procedure. Conclusions The spotting effect is not appropriately corrected by current normalization methods, even by those addressing spatial variability. Importantly, the spotting effect may alter differential and clustering analysis. PMID:15151695
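To make the variogram idea concrete, the following is a minimal sketch of an empirical semivariogram computed along a one-dimensional series of spots. The period-4 log-ratio pattern is a hypothetical stand-in for a print-order artifact; the function name and the 1-D simplification are assumptions for illustration, not the authors' implementation.

```python
def empirical_variogram(values, max_lag):
    """Semivariance gamma(h) for integer lags h = 1..max_lag.

    gamma(h) is half the mean squared difference between values that are
    h positions apart along the series.
    """
    gamma = {}
    for h in range(1, max_lag + 1):
        diffs = [(values[i + h] - values[i]) ** 2 for i in range(len(values) - h)]
        if diffs:
            gamma[h] = sum(diffs) / (2 * len(diffs))
    return gamma

# Hypothetical log-ratios carrying a periodic artifact with period 4
pattern = [0.2, 0.0, -0.2, 0.0]
log_ratios = pattern * 6

gamma = empirical_variogram(log_ratios, max_lag=8)
for lag in sorted(gamma):
    print(lag, round(gamma[lag], 4))
```

In this toy series the semivariance drops to zero at lags that are multiples of the period and peaks at half the period, which is the kind of oscillating variogram that signals a periodic spatial effect.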
Karsten, Stanislav L.; Van Deerlin, Vivianna M. D.; Sabatti, Chiara; Gill, Lisa H.; Geschwind, Daniel H.
2002-01-01
Archival formalin-fixed, paraffin-embedded and ethanol-fixed tissues represent a potentially invaluable resource for gene expression analysis, as they are the most widely available material for studies of human disease. Little data are available evaluating whether RNA obtained from fixed (archival) tissues could produce reliable and reproducible microarray expression data. Here we compare the use of RNA isolated from human archival tissues fixed in ethanol and formalin to frozen tissue in cDNA microarray experiments. Since an additional factor that can limit the utility of archival tissue is the often small quantities available, we also evaluate the use of the tyramide signal amplification method (TSA), which allows the use of small amounts of RNA. Detailed analysis indicates that TSA provides a consistent and reproducible signal amplification method for cDNA microarray analysis, across both arrays and the genes tested. Analysis of this method also highlights the importance of performing non-linear channel normalization and dye switching. Furthermore, archived, fixed specimens can perform well, but not surprisingly, produce more variable results than frozen tissues. Consistent results are more easily obtainable using ethanol-fixed tissues, whereas formalin-fixed tissue does not typically provide a useful substrate for cDNA synthesis and labeling. PMID:11788730
BioconductorBuntu: a Linux distribution that implements a web-based DNA microarray analysis server.
Geeleher, Paul; Morris, Dermot; Hinde, John P; Golden, Aaron
2009-06-01
BioconductorBuntu is a custom distribution of Ubuntu Linux that automatically installs a server-side microarray processing environment, providing a user-friendly web-based GUI to many of the tools developed by the Bioconductor Project, accessible locally or across a network. System installation is via booting off a CD image or by using a Debian package provided to upgrade an existing Ubuntu installation. In its current version, several microarray analysis pipelines are supported including oligonucleotide, dual- or single-dye experiments, including post-processing with Gene Set Enrichment Analysis. BioconductorBuntu is designed to be extensible, by server-side integration of further relevant Bioconductor modules as required, facilitated by its straightforward underlying Python-based infrastructure. BioconductorBuntu offers an ideal environment for the development of processing procedures to facilitate the analysis of next-generation sequencing datasets. BioconductorBuntu is available for download under a creative commons license along with additional documentation and a tutorial from (http://bioinf.nuigalway.ie).
Customizing chemotherapy for colon cancer: the potential of gene expression profiling.
Mariadason, John M; Arango, Diego; Augenlicht, Leonard H
2004-06-01
The value of gene expression profiling, or microarray analysis, for the classification and prognosis of multiple forms of cancer is now clearly established. For colon cancer, expression profiling can readily discriminate between normal and tumor tissue, and to some extent between tumors of different histopathological stage and prognosis. While a definitive in vivo study demonstrating the potential of this methodology for predicting response to chemotherapy is presently lacking, the ability of microarrays to distinguish other subtleties of colon cancer phenotype, as well as recent in vitro proof-of-principle experiments utilizing colon cancer cell lines, illustrate the potential of this methodology for predicting the probability of response to specific chemotherapeutic agents. This review discusses some of the recent advances in the use of microarray analysis for understanding and distinguishing colon cancer subtypes, and attempts to identify challenges that need to be overcome in order to achieve the goal of using gene expression profiling for customizing chemotherapy in colon cancer.
Dion, Johann; Advedissian, Tamara; Storozhylova, Nataliya; Dahbi, Samir; Lambert, Annie; Deshayes, Frédérique; Viguier, Mireille; Tellier, Charles; Poirier, Françoise; Téletchéa, Stéphane; Dussouy, Christophe; Tateno, Hiroaki; Hirabayashi, Jun; Grandjean, Cyrille
2017-12-14
Glycan microarrays are useful tools for lectin glycan profiling. The use of a glycan microarray based on evanescent-field fluorescence detection was herein further extended to the screening of lectin inhibitors in competitive experiments. The efficacy of this approach was tested with 2/3'-mono- and 2,3'-diaromatic type II lactosamine derivatives and galectins as targets and was validated by comparison with fluorescence anisotropy proposed as an orthogonal protein interaction measurement technique. We showed that subtle differences in the architecture of the inhibitor could be sensed that pointed out the preference of galectin-3 for 2'-arylamido derivatives over ureas, thioureas, and amines and that of galectin-7 for derivatives bearing an α substituent at the anomeric position of glucosamine. We eventually identified a diaromatic oxazoline as a highly specific inhibitor of galectin-3 versus galectin-1 and galectin-7. © 2017 Wiley-VCH Verlag GmbH & Co. KGaA, Weinheim.
Gene Expression Omnibus (GEO): Microarray data storage, submission, retrieval, and analysis
Barrett, Tanya
2006-01-01
The Gene Expression Omnibus (GEO) repository at the National Center for Biotechnology Information (NCBI) archives and freely distributes high-throughput molecular abundance data, predominantly gene expression data generated by DNA microarray technology. The database has a flexible design that can handle diverse styles of both unprocessed and processed data in a MIAME- (Minimum Information About a Microarray Experiment) supportive infrastructure that promotes fully annotated submissions. GEO currently stores about a billion individual gene expression measurements, derived from over 100 organisms, submitted by over 1,500 laboratories, addressing a wide range of biological phenomena. To maximize the utility of these data, several user-friendly Web-based interfaces and applications have been implemented that enable effective exploration, query, and visualization of these data, at the level of individual genes or entire studies. This chapter describes how the data are stored, submission procedures, and mechanisms for data retrieval and query. GEO is publicly accessible at http://www.ncbi.nlm.nih.gov/projects/geo/. PMID:16939800
English, Sangeeta B.; Shih, Shou-Ching; Ramoni, Marco F.; Smith, Lois E.; Butte, Atul J.
2014-01-01
Though genome-wide technologies, such as microarrays, are widely used, data from these methods are considered noisy; there is still varied success in downstream biological validation. We report a method that increases the likelihood of successfully validating microarray findings using real time RT-PCR, including genes at low expression levels and with small differences. We use a Bayesian network to identify the most relevant sources of noise based on the successes and failures in validation for an initial set of selected genes, and then improve our subsequent selection of genes for validation based on eliminating these sources of noise. The network displays the significant sources of noise in an experiment, and scores the likelihood of validation for every gene. We show how the method can significantly increase validation success rates. In conclusion, in this study, we have successfully added a new automated step to determine the contributory sources of noise that determine successful or unsuccessful downstream biological validation. PMID:18790084
DOE Office of Scientific and Technical Information (OSTI.GOV)
Jaing, C; Gardner, S
The goal of this project is to develop forensic genotyping assays for select agent viruses, enhancing the current capabilities for the viral bioforensics and law enforcement community. We used a multipronged approach combining bioinformatics analysis, PCR-enriched samples, microarrays and TaqMan assays to develop high resolution and cost effective genotyping methods for strain level forensic discrimination of viruses. We have leveraged substantial experience and efficiency gained through year 1 on software development, SNP discovery, TaqMan signature design and phylogenetic signature mapping to scale up the development of forensics signatures in year 2. In this report, we have summarized the whole genome wide SNP analysis and microarray probe design for forensics characterization of South American hemorrhagic fever viruses, tick-borne encephalitis viruses and henipaviruses, Old World Arenaviruses, filoviruses, Crimean-Congo hemorrhagic fever virus, Rift Valley fever virus and Japanese encephalitis virus.
He, Fei; Maslov, Sergei; Yoo, Shinjae; ...
2016-05-25
Here, transcriptome datasets from thousands of samples of the model plant Arabidopsis thaliana have been collectively generated by multiple individual labs. Although integration and meta-analysis of these samples has become routine in the plant research community, it is often hampered by the lack of metadata or differences in annotation styles by different labs. In this study, we carefully selected and integrated 6,057 Arabidopsis microarray expression samples from 304 experiments deposited to NCBI GEO. Metadata such as tissue type, growth condition, and developmental stage were manually curated for each sample. We then studied global expression landscape of the integrated dataset and found that samples of the same tissue tend to be more similar to each other than to samples of other tissues, even in different growth conditions or developmental stages. Root has the most distinct transcriptome compared to aerial tissues, but the transcriptome of cultured root is more similar to those of aerial tissues as the former samples lost their cellular identity. Using a simple computational classification method, we showed that the tissue type of a sample can be successfully predicted based on its expression profile, opening the door for automatic metadata extraction and facilitating re-use of plant transcriptome data. As a proof of principle we applied our automated annotation pipeline to 708 RNA-seq samples from public repositories and verified accuracy of our predictions with samples’ metadata provided by authors.
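The abstract does not say which "simple computational classification method" was used. As one plausible illustration only, the sketch below assigns a sample to the tissue whose mean expression profile it correlates with best (a nearest-centroid rule with Pearson correlation). The tissue labels, profiles and function names are all hypothetical.

```python
import math

def pearson(a, b):
    """Pearson correlation between two equal-length profiles."""
    n = len(a)
    mean_a, mean_b = sum(a) / n, sum(b) / n
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    return cov / math.sqrt(var_a * var_b)

def tissue_centroids(samples):
    """samples: list of (tissue, profile) -> {tissue: mean profile per gene}."""
    grouped = {}
    for tissue, profile in samples:
        grouped.setdefault(tissue, []).append(profile)
    return {
        tissue: [sum(col) / len(col) for col in zip(*profiles)]
        for tissue, profiles in grouped.items()
    }

def predict_tissue(profile, centroids):
    """Return the tissue whose centroid correlates best with the profile."""
    return max(centroids, key=lambda tissue: pearson(profile, centroids[tissue]))

# Hypothetical curated training samples (four genes each)
training = [
    ("root", [9.0, 2.0, 1.0, 5.0]),
    ("root", [8.0, 3.0, 1.0, 6.0]),
    ("leaf", [1.0, 8.0, 9.0, 2.0]),
    ("leaf", [2.0, 9.0, 8.0, 3.0]),
]
centroids = tissue_centroids(training)
print(predict_tissue([7.0, 2.0, 2.0, 5.0], centroids))
```

Correlation rather than Euclidean distance is used here because it is insensitive to overall intensity shifts between labs, which is one reason such a rule might carry over to differently scaled data; that choice is an assumption of this sketch.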
Pozhitkov, Alex E; Noble, Peter A; Bryk, Jarosław; Tautz, Diethard
2014-01-01
Although microarrays are analysis tools in biomedical research, they are known to yield noisy output that usually requires experimental confirmation. To tackle this problem, many studies have developed rules for optimizing probe design and devised complex statistical tools to analyze the output. However, less emphasis has been placed on systematically identifying the noise component as part of the experimental procedure. One source of noise is the variance in probe binding, which can be assessed by replicating array probes. The second source is poor probe performance, which can be assessed by calibrating the array based on a dilution series of target molecules. Using model experiments for copy number variation and gene expression measurements, we investigate here a revised design for microarray experiments that addresses both of these sources of variance. Two custom arrays were used to evaluate the revised design: one based on 25 mer probes from an Affymetrix design and the other based on 60 mer probes from an Agilent design. To assess experimental variance in probe binding, all probes were replicated ten times. To assess probe performance, the probes were calibrated using a dilution series of target molecules and the signal response was fitted to an adsorption model. We found that significant variance of the signal could be controlled by averaging across probes and removing probes that are nonresponsive or poorly responsive in the calibration experiment. Taking this into account, one can obtain a more reliable signal with the added option of obtaining absolute rather than relative measurements. The assessment of technical variance within the experiments, combined with the calibration of probes, makes it possible to remove poorly responding probes and yields more reliable signals for the remaining ones. Once an array is properly calibrated, absolute quantification of signals becomes straightforward, alleviating the need for normalization and reference hybridizations.
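The calibration step can be sketched as follows. The abstract only says "an adsorption model"; this sketch assumes a Langmuir isotherm, S = s_max * c / (k + c), and fits it by the double-reciprocal linearization. The parameter names and the dilution series are hypothetical.

```python
def fit_langmuir(concentrations, signals):
    """Fit S = s_max * c / (k + c) using 1/S = 1/s_max + (k/s_max) * (1/c).

    Ordinary least squares on the reciprocals; returns (s_max, k).
    Assumes all concentrations and signals are positive.
    """
    xs = [1.0 / c for c in concentrations]
    ys = [1.0 / s for s in signals]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx                  # equals k / s_max
    intercept = mean_y - slope * mean_x  # equals 1 / s_max
    s_max = 1.0 / intercept
    k = slope * s_max
    return s_max, k

def concentration_from_signal(signal, s_max, k):
    """Invert the fitted isotherm to estimate target concentration."""
    return k * signal / (s_max - signal)

# Hypothetical noise-free dilution series for one probe
concs = [0.5, 1.0, 2.0, 4.0, 8.0]
signals = [1200.0 * c / (3.0 + c) for c in concs]

s_max, k = fit_langmuir(concs, signals)
print(round(s_max, 3), round(k, 3))
print(round(concentration_from_signal(signals[2], s_max, k), 3))
```

The double-reciprocal fit is chosen here only because it needs no external libraries; it over-weights the lowest concentrations, so a nonlinear least-squares fit would usually be preferable for noisy data. Inverting the fitted curve is what gives the absolute rather than relative measurement the abstract refers to.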
NASA Astrophysics Data System (ADS)
Liu, Robin H.; Lodes, Mike; Fuji, H. Sho; Danley, David; McShea, Andrew
Microarray assays typically involve multistage sample processing and fluidic handling, which are generally labor-intensive and time-consuming. Automation of these processes would improve robustness, reduce run-to-run and operator-to-operator variation, and reduce costs. In this chapter, a fully integrated and self-contained microfluidic biochip device that has been developed to automate the fluidic handling steps for microarray-based gene expression or genotyping analysis is presented. The device consists of a semiconductor-based CustomArray® chip with 12,000 features and a microfluidic cartridge. The CustomArray was manufactured using a semiconductor-based in situ synthesis technology. The microfluidic cartridge consists of microfluidic pumps, mixers, valves, fluid channels, and reagent storage chambers. Microarray hybridization and subsequent fluidic handling and reactions (including a number of washing and labeling steps) were performed in this fully automated and miniature device before fluorescent image scanning of the microarray chip. Electrochemical micropumps were integrated in the cartridge to provide pumping of liquid solutions. A micromixing technique based on gas bubbling generated by electrochemical micropumps was developed. Low-cost check valves were implemented in the cartridge to prevent cross-talk of the stored reagents. Gene expression study of the human leukemia cell line (K562) and genotyping detection and sequencing of influenza A subtypes have been demonstrated using this integrated biochip platform. For gene expression assays, the microfluidic CustomArray device detected sample RNAs with a concentration as low as 0.375 pM. Detection was quantitative over more than three orders of magnitude. Experiments also showed that chip-to-chip variability was low, indicating that the integrated microfluidic devices eliminate manual fluidic handling steps that can be a significant source of variability in genomic analysis.
The genotyping results showed that the device identified influenza A hemagglutinin and neuraminidase subtypes and sequenced portions of both genes, demonstrating the potential of integrated microfluidic and microarray technology for multiple virus detection. The device provides a cost-effective solution to eliminate labor-intensive and time-consuming fluidic handling steps and allows microarray-based DNA analysis in a rapid and automated fashion.
MIPHENO: Data normalization for high throughput metabolic analysis.
High throughput methodologies such as microarrays, mass spectrometry and plate-based small molecule screens are increasingly used to facilitate discoveries from gene function to drug candidate identification. These large-scale experiments are typically carried out over the course...
DOE Office of Scientific and Technical Information (OSTI.GOV)
Handley, Kim M.; Wrighton, Kelly C.; Piceno, Yvette M.
2012-04-13
There is increasing interest in harnessing the functional capacities of indigenous microbial communities to transform and remediate a wide range of environmental contaminants. Information about which community members respond to stimulation can guide the interpretation and development of remediation approaches. To comprehensively determine community membership and abundance patterns among a suite of samples associated with uranium bioremediation experiments we employed a high-density microarray (PhyloChip). Samples were unstimulated, naturally reducing, or collected during Fe(III) (early) and sulfate reduction (late biostimulation) from an acetate re-amended/amended aquifer in Rifle, Colorado, and from laboratory experiments using field-collected materials. Deep community sampling with PhyloChip identified hundreds-to-thousands of operational taxonomic units (OTUs) present during amendment, and revealed close similarity among highly enriched taxa from drill-core and groundwater well-deployed column sediment. Overall, phylogenetic data suggested stimulated community membership was most affected by a carryover effect between annual stimulation events. Nevertheless, OTUs within the Fe(III)- and sulfate-reducing lineages, Desulfuromonadales and Desulfobacterales, were repeatedly stimulated. Less consistent, co-enriched taxa represented additional lineages associated with Fe(III) and sulfate reduction (for example, Desulfovibrionales; Syntrophobacterales; Peptococcaceae) and autotrophic sulfur oxidation (Sulfurovum; Campylobacterales). These data imply complex membership among highly stimulated taxa, and by inference biogeochemical responses to acetate, a non-fermentable substrate.
Ma, Y; Dai, X; Hong, T; Munk, G B; Libera, M
2016-12-19
Despite their many advantages and successes, molecular beacon (MB) hybridization probes have not been extensively used in microarray formats because of the complicating probe-substrate interactions that increase the background intensity. We have previously shown that tethering to surface-patterned microgels is an effective means for localizing MB probes to specific surface locations in a microarray format while simultaneously maintaining them in as water-like an environment as possible and minimizing probe-surface interactions. Here we extend this approach to include both real-time detection together with integrated NASBA amplification. We fabricate small (∼250 μm × 250 μm) simplex, duplex, and five-plex assays with microarray spots of controllable size (∼20 μm diameter), position, and shape to detect bacteria and fungi in a bloodstream-infection model. The targets, primers, and microgel-tethered probes can be combined in a single isothermal reaction chamber with no post-amplification labelling. We extract total RNA from clinical blood samples and differentiate between Gram-positive and Gram-negative bloodstream infection in a duplex assay to detect RNA- amplicons. The sensitivity based on our current protocols in a simplex assay to detect specific ribosomal RNA sequences within total RNA extracted from S. aureus and E. coli cultures corresponds to tens of bacteria per ml. We furthermore show that the platform can detect RNA- amplicons from synthetic target DNA with 1 fM sensitivity in sample volumes that contain about 12 000 DNA molecules. These experiments demonstrate an alternative approach that can enable rapid and real-time microarray-based molecular diagnostics.
Calling Biomarkers in Milk Using a Protein Microarray on Your Smartphone
Ludwig, Susann K. J.; Tokarski, Christian; Lang, Stefan N.; van Ginkel, Leendert A.; Zhu, Hongying; Ozcan, Aydogan; Nielen, Michel W. F.
2015-01-01
Here we present the concept of a protein microarray-based fluorescence immunoassay for multiple biomarker detection in milk extracts by an ordinary smartphone. A multiplex immunoassay was designed on a microarray chip, having built-in positive and negative quality controls. After the immunoassay procedure, the 48 microspots were labelled with Quantum Dots (QD) depending on the protein biomarker levels in the sample. QD-fluorescence was subsequently detected by the smartphone camera under UV light excitation from LEDs embedded in a simple 3D-printed opto-mechanical smartphone attachment. The somewhat aberrant images obtained under such conditions were corrected by newly developed Android-based software on the same smartphone, and protein biomarker profiles were calculated. The indirect detection of recombinant bovine somatotropin (rbST) in milk extracts based on the altered biomarker profile of anti-rbST antibodies was selected as a real-life challenge. RbST-treated and untreated cows clearly showed reproducible treatment-dependent biomarker profiles in milk, in excellent agreement with results from a flow cytometer reference method. In a pilot experiment, anti-rbST antibody detection was multiplexed with the detection of another rbST-dependent biomarker, insulin-like growth factor 1 (IGF-1). Milk extract IGF-1 levels were found to be increased after rbST treatment and correlated with the results obtained from the reference method. These data clearly demonstrate the potential of the portable protein microarray concept towards simultaneous detection of multiple biomarkers. We envisage broad application of this ‘protein microarray on a smartphone’-concept for on-site testing, e.g., in food safety, environment and health monitoring. PMID:26308444
Systematic Omics Analysis Review (SOAR) Tool to Support Risk Assessment
McConnell, Emma R.; Bell, Shannon M.; Cote, Ila; Wang, Rong-Lin; Perkins, Edward J.; Garcia-Reyero, Natàlia; Gong, Ping; Burgoon, Lyle D.
2014-01-01
Environmental health risk assessors are challenged to understand and incorporate new data streams as the field of toxicology continues to adopt new molecular and systems biology technologies. Systematic screening reviews can help risk assessors and assessment teams determine which studies to consider for inclusion in a human health assessment. A tool for systematic reviews should be standardized and transparent in order to consistently determine which studies meet minimum quality criteria prior to performing in-depth analyses of the data. The Systematic Omics Analysis Review (SOAR) tool is focused on assisting risk assessment support teams in performing systematic reviews of transcriptomic studies. SOAR is a spreadsheet tool of 35 objective questions developed by domain experts, focused on transcriptomic microarray studies, and including four main topics: test system, test substance, experimental design, and microarray data. The tool will be used as a guide to identify studies that meet basic published quality criteria, such as those defined by the Minimum Information About a Microarray Experiment standard and the Toxicological Data Reliability Assessment Tool. Seven scientists were recruited to test the tool by using it to independently rate 15 published manuscripts that study chemical exposures with microarrays. Using their feedback, questions were weighted based on importance of the information and a suitability cutoff was set for each of the four topic sections. The final validation resulted in 100% agreement between the users on four separate manuscripts, showing that the SOAR tool may be used to facilitate the standardized and transparent screening of microarray literature for environmental human health risk assessment. PMID:25531884
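The screening logic (weighted questions grouped into four topic sections, each with its own suitability cutoff) might be sketched as below. The weights, cutoffs, number of questions per section and the use of a weighted fraction as the score are all hypothetical; the abstract does not give the actual SOAR values or formula.

```python
SECTIONS = ("test system", "test substance", "experimental design", "microarray data")

def section_score(answers, weights):
    """Weighted fraction of questions answered 'yes' within one section."""
    earned = sum(w for answered_yes, w in zip(answers, weights) if answered_yes)
    return earned / sum(weights)

def screen_study(answers_by_section, weights_by_section, cutoffs):
    """Return (passes, scores); a study passes only if every section meets its cutoff."""
    scores = {
        section: section_score(answers_by_section[section], weights_by_section[section])
        for section in SECTIONS
    }
    passes = all(scores[section] >= cutoffs[section] for section in SECTIONS)
    return passes, scores

# Hypothetical weights and cutoffs (not the published SOAR values)
weights = {
    "test system": [2, 1],
    "test substance": [1, 1],
    "experimental design": [3, 1, 1],
    "microarray data": [2, 2, 1],
}
cutoffs = {section: 0.6 for section in SECTIONS}

study = {
    "test system": [True, False],
    "test substance": [True, True],
    "experimental design": [True, False, True],
    "microarray data": [True, False, False],
}
passes, scores = screen_study(study, weights, cutoffs)
print(passes, {s: round(v, 2) for s, v in scores.items()})
```

Applying a cutoff per section rather than to one pooled total means a study cannot compensate for poorly reported microarray data with a well-described test system, which matches the abstract's description of a cutoff for each of the four topics.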
Extraction and labeling methods for microarrays using small amounts of plant tissue.
Stimpson, Alexander J; Pereira, Rhea S; Kiss, John Z; Correll, Melanie J
2009-03-01
Procedures were developed to maximize the yield of high-quality RNA from small amounts of plant biomass for microarrays. Two disruption techniques (bead milling and pestle and mortar) were compared for the yield and the quality of RNA extracted from 1-week-old Arabidopsis thaliana seedlings (approximately 0.5-30 mg total biomass). The pestle and mortar method of extraction showed enhanced RNA quality at the smaller biomass samples compared with the bead milling technique, although the quality in the bead milling could be improved with additional cooling steps. The RNA extracted from the pestle and mortar technique was further tested to determine if the small quantity of RNA (500 ng-7 microg) was appropriate for microarray analyses. A new method of low-quantity RNA labeling for microarrays (NuGEN Technologies, Inc.) was used on five 7-day-old seedlings (approximately 2.5 mg fresh weight total) of Arabidopsis that were grown in the dark and exposed to 1 h of red light or continued dark. Microarray analyses were performed on a small plant sample (five seedlings; approximately 2.5 mg) using these methods and compared with extractions performed with larger biomass samples (approximately 500 roots). Many well-known light-regulated genes between the small plant samples and the larger biomass samples overlapped in expression changes, and the relative expression levels of selected genes were confirmed with quantitative real-time polymerase chain reaction, suggesting that these methods can be used for plant experiments where the biomass is extremely limited (i.e. spaceflight studies).
Malenke, J R; Milash, B; Miller, A W; Dearing, M D
2013-07-01
Massively parallel sequencing has enabled the creation of novel, in-depth genetic tools for nonmodel, ecologically important organisms. We present the de novo transcriptome sequencing, analysis and microarray development for a vertebrate herbivore, the woodrat (Neotoma spp.). This genus is of ecological and evolutionary interest, especially with respect to ingestion and hepatic metabolism of potentially toxic plant secondary compounds. We generated a liver transcriptome of the desert woodrat (Neotoma lepida) using the Roche 454 platform. The assembled contigs were well annotated using rodent references (99.7% annotation), and biotransformation function was reflected in the gene ontology. The transcriptome was used to develop a custom microarray (eArray, Agilent). We tested the microarray with three experiments: one across species with similar habitat (thus, dietary) niches, one across species with different habitat niches and one across populations within a species. The resulting one-colour arrays had high technical and biological quality. Probes designed from the woodrat transcriptome performed significantly better than functionally similar probes from the Norway rat (Rattus norvegicus). There were a multitude of expression differences across the woodrat treatments, many of which related to biotransformation processes and activities. The pattern and function of the differences indicate shared ecological pressures, and not merely phylogenetic distance, play an important role in shaping gene expression profiles of woodrat species and populations. The quality and functionality of the woodrat transcriptome and custom microarray suggest these tools will be valuable for expanding the scope of herbivore biology, as well as the exploration of conceptual topics in ecology. © 2013 John Wiley & Sons Ltd.
2014-01-01
Background Uncovering the complex transcriptional regulatory networks (TRNs) that underlie plant and animal development remains a challenge. However, a vast amount of data from public microarray experiments is available, which can be subject to inference algorithms in order to recover reliable TRN architectures. Results In this study we present a simple bioinformatics methodology that uses public, carefully curated microarray data and the mutual information algorithm ARACNe in order to obtain a database of transcriptional interactions. We used data from Arabidopsis thaliana root samples to show that the transcriptional regulatory networks derived from this database successfully recover previously identified root transcriptional modules and to propose new transcription factors for the SHORT ROOT/SCARECROW and PLETHORA pathways. We further show that these networks are a powerful tool to integrate and analyze high-throughput expression data, as exemplified by our analysis of a SHORT ROOT induction time-course microarray dataset, and are a reliable source for the prediction of novel root gene functions. In particular, we used our database to predict novel genes involved in root secondary cell-wall synthesis and identified the MADS-box TF XAL1/AGL12 as an unexpected participant in this process. Conclusions This study demonstrates that network inference using carefully curated microarray data yields reliable TRN architectures. In contrast to previous efforts to obtain root TRNs, that have focused on particular functional modules or tissues, our root transcriptional interactions provide an overview of the transcriptional pathways present in Arabidopsis thaliana roots and will likely yield a plethora of novel hypotheses to be tested experimentally. PMID:24739361
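The inference step described in this abstract can be illustrated with a small sketch. ARACNe scores gene pairs by mutual information and then uses the data processing inequality (DPI) to discard the weakest edge of each fully connected gene triplet. The code below is a minimal illustration under stated assumptions, not the authors' pipeline: the rank-based binning estimator, the function names, and the `tolerance` parameter are hypothetical choices made for this example.

```python
import math
from collections import Counter
from itertools import combinations


def discretize(values, bins=3):
    """Equal-frequency (rank-based) binning of one expression profile."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    labels = [0] * len(values)
    for rank, idx in enumerate(order):
        labels[idx] = rank * bins // len(values)
    return labels


def mutual_information(x, y, bins=3):
    """Plug-in mutual information estimate (in nats) between two profiles."""
    dx, dy = discretize(x, bins), discretize(y, bins)
    n = len(dx)
    joint = Counter(zip(dx, dy))
    px, py = Counter(dx), Counter(dy)
    return sum(
        (c / n) * math.log(c * n / (px[a] * py[b]))
        for (a, b), c in joint.items()
    )


def prune_with_dpi(mi, tolerance=0.0):
    """Remove the weakest edge of every fully connected gene triplet.

    `mi` maps frozenset({gene_a, gene_b}) to a mutual information score.
    """
    genes = sorted({g for pair in mi for g in pair})
    removed = set()
    for a, b, c in combinations(genes, 3):
        edges = [frozenset((a, b)), frozenset((b, c)), frozenset((a, c))]
        if all(e in mi for e in edges):
            weakest = min(edges, key=lambda e: mi[e])
            others = [mi[e] for e in edges if e != weakest]
            # The weakest edge is treated as an indirect interaction.
            if mi[weakest] < min(others) * (1 - tolerance):
                removed.add(weakest)
    return {e: v for e, v in mi.items() if e not in removed}
```

In practice the pairwise scores would be computed for every gene pair in the curated compendium and filtered for significance before pruning; the sketch omits that filtering step.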
Linking microarray reporters with protein functions.
Gaj, Stan; van Erk, Arie; van Haaften, Rachel I M; Evelo, Chris T A
2007-09-26
The analysis of microarray experiments requires accurate and up-to-date functional annotation of the microarray reporters to optimize the interpretation of the biological processes involved. Pathway visualization tools are used to connect gene expression data with existing biological pathways by using specific database identifiers that link reporters with elements in the pathways. This paper proposes a novel method that aims to improve microarray reporter annotation by BLASTing the original reporter sequences against a species-specific EMBL subset, that was derived from and crosslinked back to the highly curated UniProt database. The resulting alignments were filtered using high quality alignment criteria and further compared with the outcome of a more traditional approach, where reporter sequences were BLASTed against EnsEMBL followed by locating the corresponding protein (UniProt) entry for the high quality hits. Combining the results of both methods resulted in successful annotation of > 58% of all reporter sequences with UniProt IDs on two commercial array platforms, increasing the amount of Incyte reporters that could be coupled to Gene Ontology terms from 32.7% to 58.3% and to a local GenMAPP pathway from 9.6% to 16.7%. For Agilent, 35.3% of the total reporters are now linked towards GO nodes and 7.1% on local pathways. Our methods increased the annotation quality of microarray reporter sequences and allowed us to visualize more reporters using pathway visualization tools. Even in cases where the original reporter annotation showed the correct description the new identifiers often allowed improved pathway and Gene Ontology linking. These methods are freely available at http://www.bigcat.unimaas.nl/public/publications/Gaj_Annotation/.
Computational approaches were developed to identify factors that regulate Nrf2 in a large gene expression compendium of microarray profiles including >2000 comparisons which queried the effects of chemicals, genes, diets, and infectious agents on gene expression in the mouse l...
Amber J. Vanden Wymelenberg; Jill A. Gaskell; Michael D. Mozuch; Philip J. Kersten; Grzegorz Sabat; Diego Martinez; Daniel Cullen
2009-01-01
The wood decay basidiomycete Phanerochaete chrysosporium was grown under standard ligninolytic or cellulolytic conditions and subjected to whole-genome expression microarray analysis and liquid chromatography-tandem mass spectrometry of extracellular proteins. A total of 545 genes were flagged on the basis of significant changes in transcript accumulation and/or...
DOE Office of Scientific and Technical Information (OSTI.GOV)
Nelson, T.A.; Holmes, S.; Alekseyenko, A.V.
Irritable bowel syndrome (IBS) is a chronic, episodic gastrointestinal disorder that is prevalent in a significant fraction of western human populations; and changes in the microbiota of the large bowel have been implicated in the pathology of the disease. Using a novel comprehensive, high-density DNA microarray (PhyloChip) we performed a phylogenetic analysis of the microbial community of the large bowel in a rat model in which intracolonic acetic acid in neonates was used to induce long lasting colonic hypersensitivity and decreased stool water content and frequency, representing the equivalent of human constipation-predominant IBS. Our results revealed a significantly increased compositional difference in the microbial communities in rats with neonatal irritation as compared with controls. Even more striking was the dramatic change in the ratio of Firmicutes relative to Bacteroidetes, where neonatally irritated rats were enriched more with Bacteroidetes and also contained a different composition of species within this phylum. Our study also revealed differences at the level of bacterial families and species. The PhyloChip is a useful and convenient method to study enteric microflora. Further, this rat model system may be a useful experimental platform to study the causes and consequences of changes in microbial community composition associated with IBS.
Jones, D L; Petty, J; Hoyle, D C; Hayes, A; Ragni, E; Popolo, L; Oliver, S G; Stateva, L I
2003-12-16
Often changes in gene expression levels have been considered significant only when above/below some arbitrarily chosen threshold. We investigated the effect of applying a purely statistical approach to microarray analysis and demonstrated that small changes in gene expression have biological significance. Whole genome microarray analysis of a pde2Delta mutant, constructed in the Saccharomyces cerevisiae reference strain FY23, revealed altered expression of approximately 11% of protein encoding genes. The mutant, characterized by constitutive activation of the Ras/cAMP pathway, has increased sensitivity to stress, reduced ability to assimilate nonfermentable carbon sources, and some cell wall integrity defects. Applying the Munich Information Centre for Protein Sequences (MIPS) functional categories revealed increased expression of genes related to ribosome biogenesis and downregulation of genes in the cell rescue, defense, cell death and aging category, suggesting a decreased response to stress conditions. A reduced level of gene expression in the unfolded protein response pathway (UPR) was observed. Cell wall genes whose expression was affected by this mutation were also identified. Several of the cAMP-responsive orphan genes, upon further investigation, revealed cell wall functions; others had previously unidentified phenotypes assigned to them. This investigation provides a statistical global transcriptome analysis of the cellular response to constitutive activation of the Ras/cAMP pathway.
Guan, Zheng; Tan, Jing; Gao, Wei; Li, Xin; Yang, Yuandong; Li, Xiaogang; Li, Yingchao; Wang, Qiang
2018-06-19
Recent studies have revealed that circular RNAs (circRNAs) play important roles in the tumorigenesis of human cancers, including hepatocellular carcinoma (HCC). In the present study, we screened circular RNA expression profiles in HCC tissue and investigated their molecular roles in HCC tumorigenesis. Human circRNA microarray analysis showed a total of 1,245 differentially expressed circular RNAs, including 756 up-regulated and 489 down-regulated circRNAs, in three pairs of HCC tissue and adjacent normal tissue. Hsa_circ_0016788 was identified as up-regulated in both HCC tissue and cell lines. Loss-of-function experiments revealed that hsa_circ_0016788 silencing inhibited proliferation and invasion and promoted apoptosis in vitro, and inhibited tumor growth in vivo. Bioinformatics tools and a luciferase reporter assay validated that miR-486 targeted both hsa_circ_0016788 and CDK4, with negatively correlated expression, suggesting a hsa_circ_0016788/miR-486/CDK4 pathway. Receiver operating characteristic (ROC) curve analysis showed that hsa_circ_0016788 had high diagnostic value (AUC = 0.851). In summary, the results reveal the role of hsa_circ_0016788/miR-486/CDK4 in HCC tumorigenesis, providing a novel therapeutic target for HCC. © 2018 Wiley Periodicals, Inc.
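For readers unfamiliar with the AUC figure quoted above, the area under a ROC curve equals the probability that a randomly chosen positive sample receives a higher score than a randomly chosen negative sample, with ties counted as one half. The sketch below computes it directly from that definition. The function name and the toy scores are hypothetical and are not data from the study.

```python
def roc_auc(scores, labels):
    """Area under the ROC curve from its pairwise-ranking definition.

    `labels` uses 1 for positive (e.g. tumour) and 0 for negative samples.
    """
    positives = [s for s, lab in zip(scores, labels) if lab == 1]
    negatives = [s for s, lab in zip(scores, labels) if lab == 0]
    if not positives or not negatives:
        raise ValueError("need at least one positive and one negative sample")
    wins = 0.0
    for p in positives:
        for n in negatives:
            if p > n:
                wins += 1.0      # positive ranked above negative
            elif p == n:
                wins += 0.5      # ties count as half
    return wins / (len(positives) * len(negatives))


# Toy expression scores, invented for illustration only.
example_auc = roc_auc([0.9, 0.8, 0.4, 0.3], [1, 0, 1, 0])
```

An AUC of 0.5 corresponds to a marker with no discriminating power and 1.0 to perfect separation.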
Chatonnet, Fabrice; Guyot, Romain; Picou, Frédéric; Bondesson, Maria; Flamant, Frederic
2012-01-01
Thyroid hormone (T3) has a major influence on cerebellum post-natal development. The major phenotypic landmark of exposure to low levels of T3 during development (hypothyroidism) in the cerebellum is the retarded inward migration of the most numerous cell type, granular neurons. In order to identify the direct genetic regulation exerted by T3 on cerebellar neurons and their precursors, we used microarray RNA hybridization to perform a time course analysis of T3 induced gene expression in primary cultures of cerebellar neuronal cells. These experiments suggest that we identified a small set of genes which are directly regulated, both in vivo and in vitro, during cerebellum post-natal development. These modest changes suggest that T3 does not act directly on granular neurons and mainly indirectly influences the cellular interactions taking place during development. PMID:22586439
Aberrant expression of the PHF14 gene in biliary tract cancer cells
AKAZAWA, TAKAKO; YASUI, KOHICHIROH; GEN, YASUYUKI; YAMADA, NOBUHISA; TOMIE, AKIRA; DOHI, OSAMU; MITSUYOSHI, HIRONORI; YAGI, NOBUAKI; ITOH, YOSHITO; NAITO, YUJI; YOSHIKAWA, TOSHIKAZU
2013-01-01
DNA copy number aberrations in human biliary tract cancer (BTC) cell lines were investigated using a high-density oligonucleotide microarray. A novel homozygous deletion was detected at chromosomal region 7p21.3 in the OZ cell line. Further validation experiments using genomic PCR revealed a homozygous deletion of a single gene, plant homeodomain (PHD) finger protein 14 (PHF14). No PHF14 mRNA or protein expression was detected, thus demonstrating the absence of PHF14 expression in the OZ cell line. Although the PHD finger protein is considered to be involved in chromatin-mediated transcriptional regulation, little is known about the function of PHF14 in cancer. The present study observed that the knock down of PHF14 using small interfering RNA (siRNA) enhanced the growth of the BTC cells. These observations suggest that aberrant PHF14 expression may have a role in the tumorigenesis of BTC. PMID:23833654
The Role of Cytokine PF4 in the Antiviral Immune Response of Shrimp
Chen, Yulei; Cao, Jiao; Zhang, Xiaobo
2016-01-01
During viral infection in vertebrates, cytokines play important roles in the host defense against the virus. However, the function of cytokines in invertebrates has not been well characterized. In this study, shrimp cytokines involved in viral infection were screened using a cytokine antibody microarray. The results showed that three cytokines, the Fas receptor (Fas), platelet factor 4 (PF4) and interleukin-22 (IL-22), were significantly upregulated in the white spot syndrome virus (WSSV)-challenged shrimp, suggesting that these cytokines played positive regulatory roles in the immune response of shrimp against the virus. Further experiments revealed that PF4 had positive effects on the antiviral immunity of shrimp by enhancing the shrimp phagocytic activity and inhibiting the apoptotic activity of virus-infected hemocytes. Therefore, our study presented a novel mechanism of cytokines in the innate immunity of invertebrates. PMID:27631372
Schluttenhofer, Craig; Pattanaik, Sitakanta; Patra, Barunava; Yuan, Ling
2014-06-20
To combat infection and biotic stress, plants elicit the biosynthesis of numerous natural products, many of which are valuable pharmaceutical compounds. Jasmonate is a central regulator of defense response to pathogens and accumulation of specialized metabolites. Catharanthus roseus produces a large number of terpenoid indole alkaloids (TIAs) and is an excellent model for understanding the regulation of this class of valuable compounds. Recent work illustrates a possible role for the Catharanthus WRKY transcription factors (TFs) in regulating TIA biosynthesis. In Arabidopsis and other plants, the WRKY TF family is also shown to play an important role in controlling tolerance to biotic and abiotic stresses, as well as secondary metabolism. Here, we describe the WRKY TF families in response to jasmonate in Arabidopsis and Catharanthus. Publicly available Arabidopsis microarrays revealed that at least 30% (22 of 72) of WRKY TFs respond to jasmonate treatments. Microarray analysis identified at least six jasmonate responsive Arabidopsis WRKY genes (AtWRKY7, AtWRKY20, AtWRKY26, AtWRKY45, AtWRKY48, and AtWRKY72) that have not been previously reported. The Catharanthus WRKY TF family is comprised of at least 48 members. Phylogenetic clustering reveals 11 group I, 32 group II, and 5 group III WRKY TFs. Furthermore, we found that at least 25% (12 of 48) were jasmonate responsive, and 75% (9 of 12) of the jasmonate responsive CrWRKYs are orthologs of AtWRKYs known to be regulated by jasmonate. Overall, the CrWRKY family, ascertained from transcriptome sequences, contains approximately 75% of the number of WRKYs found in other sequenced asterid species (pepper, tomato, potato, and bladderwort). Microarray and transcriptomic data indicate that expression of WRKY TFs in Arabidopsis and Catharanthus is under tight spatio-temporal and developmental control, and that these TFs potentially have a significant role in jasmonate signaling. Profiling of CrWRKY expression in response to jasmonate treatment revealed potential associations with secondary metabolism. This study provides a foundation for further characterization of WRKY TFs in jasmonate responses and regulation of natural product biosynthesis.
Anisimov, Sergey V; Khavinson, Vladimir Kh; Anisimov, Vladimir N
2004-01-01
Aging is associated with significant alterations in gene expression in numerous organs and tissues. Anti-aging therapy with peptide bioregulators holds much promise for the correction of age-associated changes, making a screening for their molecular targets in tissues an important question of modern gerontology. The synthetic tetrapeptide Cortagen (Ala-Glu-Asp-Pro) was obtained by directed synthesis based on amino acid analysis of the natural brain cortex peptide preparation Cortexin. In humans, Cortagen demonstrated a pronounced therapeutic effect upon the structural and functional posttraumatic recovery of peripheral nerve tissue. Importantly, other effects were also observed in cardiovascular and cerebrovascular parameters. Based on these latter observations, we hypothesized that an acute course of Cortagen treatment, large-scale transcriptome analysis, and identification of transcripts with altered expression in heart would facilitate our understanding of the mechanisms responsible for this peptide's biological effects. We therefore used cDNA microarrays to analyze the expression of 15,247 transcripts in the heart of female 6-month-old CBA mice receiving injections of Cortagen for 5 consecutive days. Comparative analysis of cDNA microarray hybridisation with heart samples from control and experimental groups revealed 234 clones (1.53% of the total number of clones) with significant changes of expression that matched 110 known genes belonging to various functional categories. Maximum up- and down-regulation was +5.42 and -2.86, respectively. Intercomparison of changes in cardiac expression profile induced by synthetic peptides (Cortagen, Vilon, Epitalon) and pineal peptide hormone melatonin revealed both common and specific effects of Cortagen upon gene expression in heart.
Mohr, Roland; Neckel, Peter; Zhang, Ying; Stachon, Susanne; Nothelfer, Katharina; Schaeferhoff, Karin; Obermayr, Florian; Bonin, Michael; Just, Lothar
2013-11-01
Thyroid hormones play important roles in the development of neural cells in the central nervous system. Even minor changes to normal thyroid hormone levels affect dendritic and axonal outgrowth, sprouting and myelination and might even lead to irreversible damages such as cretinism. Despite our knowledge of the influence on the mammalian CNS, the role of thyroid hormones in the development of the enteric nervous system (ENS) still needs to be elucidated. In this study we have analyzed for the first time the influence of 3,5,3'-triiodothyronine (T3) on ENS progenitor cells using cell biological assays and a microarray technique. In our in vitro model, T3 inhibited cell proliferation and stimulated neurite outgrowth of differentiating ENS progenitor cells. Microarray analysis revealed a group of 338 genes that were regulated by T3 in differentiating enterospheres. 67 of these genes are involved in function and development of the nervous system. 14 of them belong to genes that are involved in axonal guidance or neurite outgrowth. Interestingly, T3 regulated the expression of netrin G1 and endothelin 3, two guidance molecules that are involved in human enteric dysganglionoses. The results of our study give first insights how T3 may affect the enteric nervous system. T3 is involved in proliferation and differentiation processes in enterospheres. Microarray analysis revealed several interesting gene candidates that might be involved in the observed effects on enterosphere differentiation. Future studies need to be conducted to better understand the gene to gene interactions. © 2013.
Joint mapping of genes and conditions via multidimensional unfolding analysis
Van Deun, Katrijn; Marchal, Kathleen; Heiser, Willem J; Engelen, Kristof; Van Mechelen, Iven
2007-01-01
Background Microarray compendia profile the expression of genes in a number of experimental conditions. Such data compendia are useful not only to group genes and conditions based on their similarity in overall expression over profiles but also to gain information on more subtle relations between genes and conditions. Getting a clear visual overview of all these patterns in a single easy-to-grasp representation is a useful preliminary analysis step: We propose to use for this purpose an advanced exploratory method, called multidimensional unfolding. Results We present a novel algorithm for multidimensional unfolding that overcomes both general problems and problems that are specific for the analysis of gene expression data sets. Applying the algorithm to two publicly available microarray compendia illustrates its power as a tool for exploratory data analysis: The unfolding analysis of a first data set resulted in a two-dimensional representation which clearly reveals temporal regulation patterns for the genes and a meaningful structure for the time points, while the analysis of a second data set showed the algorithm's ability to go beyond a mere identification of those genes that discriminate between different patient or tissue types. Conclusion Multidimensional unfolding offers a useful tool for preliminary explorations of microarray data: By relying on an easy-to-grasp low-dimensional geometric framework, relations among genes, among conditions and between genes and conditions are simultaneously represented in an accessible way which may reveal interesting patterns in the data. An additional advantage of the method is that it can be applied to the raw data without necessitating the choice of suitable genewise transformations of the data. PMID:17550582
Tanaka, Yohei; Nakayama, Jun
2018-05-01
Water-filtered broad-spectrum near-infrared irradiation can induce various biological effects, as our previous clinical, histological, and biochemical investigations have shown. However, few studies have examined the changes thus induced in gene expression. The aim was to investigate the changes in gene expression in a 3-dimensional reconstructed epidermal tissue culture exposed to water-filtered broad-spectrum near-infrared irradiation. DNA microarray and quantitative real-time polymerase chain reaction (PCR) analysis was used to assess gene expression levels in a 3-dimensional reconstructed epidermal model composed of normal human epidermal cells exposed to water-filtered broad-spectrum near-infrared irradiation. The water filter allowed 1000-1800 nm wavelengths and excluded 1400-1500 nm wavelengths, and cells were exposed to 5 or 10 rounds of near-infrared irradiation at 10 J/cm². A DNA microarray with over 50 000 different probes showed 18 genes that were upregulated or downregulated by at least twofold after irradiation. Quantitative real-time PCR revealed that, relative to control cells, the gene encoding La ribonucleoprotein domain family member 6 (LARP6), which regulates collagen expression, was significantly and dose-dependently upregulated (P < 0.05) by water-filtered broad-spectrum near-infrared exposure. Genes encoding transcripts of collagen type I were significantly upregulated compared with controls (P < 0.05). This study demonstrates the ability of water-filtered broad-spectrum near-infrared irradiation to stimulate the production of type I collagen. © 2017 The Australasian College of Dermatologists.
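The "at least twofold" criterion used in this abstract is a simple ratio filter, sketched below. This is a minimal illustration that assumes already normalized, strictly positive intensities; the function name, the probe identifiers, and the intensity values are hypothetical and do not come from the study.

```python
import math


def twofold_changed(treated, control, threshold=2.0):
    """Return {probe: log2 ratio} for probes changed by at least `threshold`-fold.

    `treated` and `control` map probe IDs to normalized, positive intensities.
    """
    hits = {}
    for probe, treated_value in treated.items():
        ratio = treated_value / control[probe]
        # Keep probes that are up by >= threshold or down by <= 1/threshold.
        if ratio >= threshold or ratio <= 1.0 / threshold:
            hits[probe] = math.log2(ratio)
    return hits


# Invented intensities for three hypothetical probes.
irradiated = {"probe_a": 400.0, "probe_b": 110.0, "probe_c": 50.0}
untreated = {"probe_a": 100.0, "probe_b": 100.0, "probe_c": 100.0}
changed = twofold_changed(irradiated, untreated)
```

A fold-change filter of this kind says nothing about statistical significance, which is why hits are typically confirmed by a second method such as quantitative real-time PCR.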
Wang, Jia-Chi; Boyar, Fatih Z
2016-01-01
Chromosomal microarray analysis (CMA) has been recommended and practiced routinely in the large reference laboratories of U.S.A. as the first-tier test for the postnatal evaluation of individuals with intellectual disability, autism spectrum disorders, and/or multiple congenital anomalies. Using CMA as a diagnostic tool and without a routine setting of fluorescence in situ hybridization with labeled bacterial artificial chromosome probes (BAC-FISH) in the large reference laboratories becomes a challenge in the characterization of chromosome 9 pericentric region. This region has a very complex genomic structure and contains a variety of heterochromatic and euchromatic polymorphic variants. These variants were usually studied by G-banding, C-banding and BAC-FISH analysis. Chromosomal microarray analysis (CMA) was not recommended since it may lead to false positive results. Here, we presented a cohort of four cases, in which high-resolution CMA was used as the first-tier test or simultaneously with G-banding analysis on the proband to identify pathogenic copy number variants (CNVs) in the whole genome. CMA revealed large pathogenic CNVs from chromosome 9 in 3 cases which also revealed different G-banding patterns between the two chromosome 9 homologues. Although we demonstrated that high-resolution CMA played an important role in the identification of pathogenic copy number variants in chromosome 9 pericentric regions, the lack of BAC-FISH analysis or other useful tools renders significant challenges in the characterization of chromosome 9 pericentric regions. None; it is not a clinical trial, and the cases were retrospectively collected and analyzed.
Reconstructing the temporal ordering of biological samples using microarray data.
Magwene, Paul M; Lizardi, Paul; Kim, Junhyong
2003-05-01
Accurate time series for biological processes are difficult to estimate due to problems of synchronization, temporal sampling and rate heterogeneity. Methods are needed that can utilize multi-dimensional data, such as those resulting from DNA microarray experiments, in order to reconstruct time series from unordered or poorly ordered sets of observations. We present a set of algorithms for estimating temporal orderings from unordered sets of sample elements. The techniques we describe are based on modifications of a minimum-spanning tree calculated from a weighted, undirected graph. We demonstrate the efficacy of our approach by applying these techniques to an artificial data set as well as several gene expression data sets derived from DNA microarray experiments. In addition to estimating orderings, the techniques we describe also provide useful heuristics for assessing relevant properties of sample datasets such as noise and sampling intensity, and we show how a data structure called a PQ-tree can be used to represent uncertainty in a reconstructed ordering. Academic implementations of the ordering algorithms are available as source code (in the programming language Python) on our web site, along with documentation on their use. The artificial 'jelly roll' data set upon which the algorithm was tested is also available from this web site. The publicly available gene expression data may be found at http://genome-www.stanford.edu/cellcycle/ and http://caulobacter.stanford.edu/CellCycle/.
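The abstract states that the ordering algorithms start from a minimum-spanning tree over a weighted, undirected graph of samples. One simple way to turn such a tree into an ordering, assumed here purely for illustration, is to read off the longest path through the tree. The sketch below uses Euclidean distance between expression vectors and Prim's algorithm; the function names are hypothetical, samples that branch off the longest path are simply left out, and the published algorithms (including the PQ-tree representation of uncertainty) are more elaborate.

```python
import math


def minimum_spanning_tree(points):
    """Prim's algorithm on the complete Euclidean graph; returns an adjacency map."""
    n = len(points)
    in_tree = [False] * n
    best = [math.inf] * n
    parent = [-1] * n
    best[0] = 0.0
    adjacency = {i: [] for i in range(n)}
    for _ in range(n):
        u = min((i for i in range(n) if not in_tree[i]), key=lambda i: best[i])
        in_tree[u] = True
        if parent[u] >= 0:
            adjacency[u].append((parent[u], best[u]))
            adjacency[parent[u]].append((u, best[u]))
        for v in range(n):
            if not in_tree[v]:
                d = math.dist(points[u], points[v])
                if d < best[v]:
                    best[v], parent[v] = d, u
    return adjacency


def farthest_from(adjacency, start):
    """Return (node, path) for the tree node farthest from `start`."""
    stack = [(start, -1, 0.0, [start])]
    far_node, far_dist, far_path = start, 0.0, [start]
    while stack:
        node, prev, dist, path = stack.pop()
        if dist > far_dist:
            far_node, far_dist, far_path = node, dist, path
        for nxt, weight in adjacency[node]:
            if nxt != prev:
                stack.append((nxt, node, dist + weight, path + [nxt]))
    return far_node, far_path


def longest_path_ordering(points):
    """Order sample indices along the longest path of the minimum spanning tree."""
    tree = minimum_spanning_tree(points)
    end, _ = farthest_from(tree, 0)      # first sweep finds one end of the path
    _, path = farthest_from(tree, end)   # second sweep walks to the other end
    return path
```

The recovered ordering has no intrinsic direction, so a known early or late sample is needed to decide which end of the path is the start.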
2013-01-01
Background Hybridization based assays and capture systems depend on the specificity of hybridization between a probe and its intended target. A common guideline in the construction of DNA microarrays, for instance, is that avoiding complementary stretches of more than 15 nucleic acids in a 50 or 60-mer probe will eliminate sequence specific cross-hybridization reactions. Here we present a study of the behavior of partially matched oligonucleotide pairs with complementary stretches starting well below this threshold complementarity length – in silico, in solution, and at the microarray surface. The modeled behavior of pairs of oligonucleotide probes and their targets suggests that even a complementary stretch of sequence 12 nt in length would give rise to specific cross-hybridization. We designed a set of binding partners to a 50-mer oligonucleotide containing complementary stretches from 6 nt to 21 nt in length. Results Solution melting experiments demonstrate that stable partial duplexes can form when only 12 bp of complementary sequence are present; surface hybridization experiments confirm that a signal close in magnitude to full-strength signal can be obtained from hybridization of a 12 bp duplex within a 50mer oligonucleotide. Conclusions Microarray and other molecular capture strategies that rely on a 15 nt lower complementarity bound for eliminating specific cross-hybridization may not be sufficiently conservative. PMID:23445545
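The design guideline discussed above depends on the length of the longest contiguous complementary stretch between a probe and a non-target sequence. A minimal sketch of that check follows: it finds the longest common substring between the probe and the reverse complement of the other sequence, both written 5' to 3'. The function names and example sequences are hypothetical, and the sketch considers perfect Watson-Crick matches only, ignoring thermodynamics, mismatches and bulges.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Reverse complement of a DNA sequence written 5' to 3'."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def longest_complementary_stretch(probe, other):
    """Length of the longest contiguous perfectly base-paired stretch."""
    a = probe.upper()
    b = reverse_complement(other)
    best = 0
    previous = [0] * (len(b) + 1)
    for i in range(1, len(a) + 1):
        current = [0] * (len(b) + 1)
        for j in range(1, len(b) + 1):
            if a[i - 1] == b[j - 1]:
                # Extend the match that ended at the previous position pair.
                current[j] = previous[j - 1] + 1
                best = max(best, current[j])
        previous = current
    return best


def may_cross_hybridize(probe, other, bound=12):
    """Flag pairs whose complementary stretch reaches `bound` nucleotides."""
    return longest_complementary_stretch(probe, other) >= bound
```

The default `bound=12` reflects the 12 nt stretch that the abstract reports as sufficient for a strong signal; a design pipeline would choose its own threshold.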
Lim, Hye-Sun; Ha, Hyekyung; Shin, Hyeun-Kyoo; Jeong, Soo-Jin
2015-09-01
Saussurea lappa has been reported to possess anti-atopic properties. In this study, we confirmed the anti-atopic properties of S. lappa in Nc/Nga mice and used microarrays to investigate candidate genes related to these properties. We determined the target gene using real-time PCR in an in vitro experiment. S. lappa significantly reduced the atopic dermatitis (AD) score and immunoglobulin E levels compared with AD-induced Nc/Nga mice. Microarray analysis of back skin obtained from the animals showed that the properties of S. lappa are closely associated with cytokine-cytokine receptor interaction and the JAK-STAT signaling pathway. Consistent with the microarray data, real-time RT-PCR confirmed these modulations at the mRNA level in skin tissues from S. lappa-treated mice. Among these genes, PI3Kca and IL20Rβ were significantly downregulated by S. lappa treatment in the Nc/Nga mouse model. In an in vitro experiment using HaCaT cells, we found that the S. lappa components alantolactone, caryophyllene, costic acid, costunolide and dehydrocostus lactone significantly decreased the expression of PI3Kca but not IL20Rβ. Therefore, our study suggests that PI3Kca-related signaling is closely related to the protective effects of S. lappa against the development of atopic dermatitis.
Simpson, Julie E; Hosny, Ola; Wharton, Stephen B; Heath, Paul R; Holden, Hazel; Fernando, Malee S; Matthews, Fiona; Forster, Gill; O'Brien, John T; Barber, Robert; Kalaria, Raj N; Brayne, Carol; Shaw, Pamela J; Lewis, Claire E; Ince, Paul G
2009-02-01
White matter lesions (WML) in brain aging are linked to dementia and depression. Ischemia contributes to their pathogenesis but other mechanisms may contribute. We used RNA microarray analysis with functional pathway grouping as an unbiased approach to investigate evidence for additional pathogenetic mechanisms. WML were identified by MRI and pathology in brains donated to the Medical Research Council Cognitive Function and Ageing Study. RNA was extracted to compare WML with nonlesional white matter samples from cases with lesions (WM[L]), and from cases with no lesions (WM[C]) using RNA microarray and pathway analysis. Functional pathways were validated for selected genes by quantitative real-time polymerase chain reaction and immunocytochemistry. We identified 8 major pathways in which multiple genes showed altered RNA transcription (immune regulation, cell cycle, apoptosis, proteolysis, ion transport, cell structure, electron transport, metabolism) among 502 genes that were differentially expressed in WML compared to WM[C]. In WM[L], 409 genes were altered involving the same pathways. Genes selected to validate these microarray data all showed the expected changes in RNA levels and immunohistochemical expression of protein. WML represent areas with a complex molecular phenotype. From this and previous evidence, WML may arise through tissue ischemia but may also reflect the contribution of additional factors like blood-brain barrier dysfunction. Differential expression of genes in WM[L] compared to WM[C] indicates a "field effect" in the seemingly normal surrounding white matter.
2011-01-01
Background Phytohormones organize plant development and environmental adaptation through cell-to-cell signal transduction, and their action involves transcriptional activation. Recent international efforts to establish and maintain public databases of Arabidopsis microarray data have enabled the use of these data in the analysis of various phytohormone responses, providing genome-wide identification of promoters targeted by phytohormones. Results We used such microarray data to predict cis-regulatory elements with an octamer-based approach. Our test prediction for the drought-responsive RD29A promoter, made with the aid of microarray data for response to drought, ABA and overexpression of DREB1A, a key regulator of cold and drought response, provided reasonable results that fit the experimentally identified regulatory elements. Following this success, we expanded the prediction to various phytohormone responses, including those for abscisic acid, auxin, cytokinin, ethylene, brassinosteroid, jasmonic acid, and salicylic acid, as well as for hydrogen peroxide, drought and DREB1A overexpression. In total, 622 promoters that are activated by phytohormones were subjected to the prediction. In addition, we have assigned putative functions to 53 octamers of the Regulatory Element Group (REG) that have been extracted as position-dependent cis-regulatory elements on the basis of their preferential appearance in the promoter region. Conclusions Our prediction of Arabidopsis cis-regulatory elements for phytohormone responses provides guidance for experimental analysis of promoters to reveal the basis of the transcriptional network of phytohormone responses. PMID:21349196
Multi-membership gene regulation in pathway based microarray analysis
2011-01-01
Background Gene expression analysis has been intensively researched for more than a decade. Recently, there has been elevated interest in the integration of microarray data analysis with other types of biological knowledge in a holistic analytical approach. We propose a methodology that can be facilitated for pathway based microarray data analysis, based on the observation that a substantial proportion of genes present in biochemical pathway databases are members of a number of distinct pathways. Our methodology aims towards establishing the state of individual pathways, by identifying those truly affected by the experimental conditions based on the behaviour of such genes. For that purpose it considers all the pathways in which a gene participates and the general census of gene expression per pathway. Results We utilise hill climbing, simulated annealing and a genetic algorithm to analyse the consistency of the produced results, through the application of fuzzy adjusted rand indexes and hamming distance. All algorithms produce highly consistent genes to pathways allocations, revealing the contribution of genes to pathway functionality, in agreement with current pathway state visualisation techniques, with the simulated annealing search proving slightly superior in terms of efficiency. Conclusions We show that the expression values of genes, which are members of a number of biochemical pathways or modules, are the net effect of the contribution of each gene to these biochemical processes. We show that by manipulating the pathway and module contribution of such genes to follow underlying trends we can interpret microarray results centred on the behaviour of these genes. PMID:21939531
Multi-membership gene regulation in pathway based microarray analysis.
Pavlidis, Stelios P; Payne, Annette M; Swift, Stephen M
2011-09-22
Gene expression analysis has been intensively researched for more than a decade. Recently, there has been elevated interest in the integration of microarray data analysis with other types of biological knowledge in a holistic analytical approach. We propose a methodology that can be facilitated for pathway based microarray data analysis, based on the observation that a substantial proportion of genes present in biochemical pathway databases are members of a number of distinct pathways. Our methodology aims towards establishing the state of individual pathways, by identifying those truly affected by the experimental conditions based on the behaviour of such genes. For that purpose it considers all the pathways in which a gene participates and the general census of gene expression per pathway. We utilise hill climbing, simulated annealing and a genetic algorithm to analyse the consistency of the produced results, through the application of fuzzy adjusted rand indexes and hamming distance. All algorithms produce highly consistent genes to pathways allocations, revealing the contribution of genes to pathway functionality, in agreement with current pathway state visualisation techniques, with the simulated annealing search proving slightly superior in terms of efficiency. We show that the expression values of genes, which are members of a number of biochemical pathways or modules, are the net effect of the contribution of each gene to these biochemical processes. We show that by manipulating the pathway and module contribution of such genes to follow underlying trends we can interpret microarray results centred on the behaviour of these genes.
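The consistency check mentioned in this abstract can be illustrated with a minimal sketch of the Hamming distance between two gene-to-pathway allocations. The data layout (a dictionary mapping each gene to one pathway), the gene and pathway names, and the function name are assumptions made for illustration; the abstract does not describe the authors' actual representation, and the fuzzy adjusted Rand index is not shown.

```python
def hamming_distance(alloc_a, alloc_b):
    """Count the genes that two allocations assign to different pathways."""
    if alloc_a.keys() != alloc_b.keys():
        raise ValueError("allocations must cover the same genes")
    return sum(1 for gene in alloc_a if alloc_a[gene] != alloc_b[gene])


# Hypothetical allocations returned by two different search runs.
run_hill = {"geneA": "glycolysis", "geneB": "apoptosis", "geneC": "cell_cycle"}
run_anneal = {"geneA": "glycolysis", "geneB": "cell_cycle", "geneC": "cell_cycle"}

# A small distance means the two searches agree on most gene placements.
distance = hamming_distance(run_hill, run_anneal)
```

Under this assumed encoding, a distance of zero would mean that two search algorithms produced identical allocations.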
2013-01-01
Background The Grooved Carpet shell clam Ruditapes decussatus is the autochthonous European clam and the most appreciated from a gastronomic and economic point of view. The production is in decline due to several factors such as Perkinsiosis and habitat invasion and competition by the introduced exotic species, the manila clam Ruditapes philippinarum. After sequencing the R. decussatus transcriptome, we designed an oligo microarray to provide some clues on the molecular response of the clam to Perkinsiosis. Results A database consisting of 41,119 unique transcripts was constructed, of which 12,479 (30.3%) were annotated by similarity. An oligo-DNA microarray platform was then designed and applied to profile gene expression in R. decussatus heavily infected by Perkinsus olseni. Functional annotation of differentially expressed genes between those two conditions was performed by gene set enrichment analysis. As expected, microarrays unveil genes related with stress/infectious agents such as hydrolases, proteases and others. The extensive role of the innate immune system was also analyzed and the effect of parasitosis upon expression of important molecules such as lectins reviewed. Conclusions This study represents a first attempt to characterize the Ruditapes decussatus transcriptome, an important marine resource for European aquaculture. The transcriptome sequencing and consequent annotation will increase the available tools and resources for this species, introducing the possibility of high throughput experiments such as microarray analysis. In this specific case the microarray approach was used to unveil some important aspects of the host-parasite interaction between the Carpet shell clam and Perkinsus, two non-model species, highlighting some genes associated with this interaction. Ample information was obtained to identify biological processes significantly enriched among differentially expressed genes in Perkinsus infected versus non-infected gills.
An overview on the genes related with the immune system on R. decussatus transcriptome is also reported. PMID:24168212
Leite, Ricardo B; Milan, Massimo; Coppe, Alessandro; Bortoluzzi, Stefania; dos Anjos, António; Reinhardt, Richard; Saavedra, Carlos; Patarnello, Tomaso; Cancela, M Leonor; Bargelloni, Luca
2013-10-29
The Grooved Carpet shell clam Ruditapes decussatus is the autochthonous European clam and the most appreciated from a gastronomic and economic point of view. The production is in decline due to several factors such as Perkinsiosis and habitat invasion and competition by the introduced exotic species, the manila clam Ruditapes philippinarum. After sequencing the R. decussatus transcriptome, we designed an oligo microarray to provide some clues on the molecular response of the clam to Perkinsiosis. A database consisting of 41,119 unique transcripts was constructed, of which 12,479 (30.3%) were annotated by similarity. An oligo-DNA microarray platform was then designed and applied to profile gene expression in R. decussatus heavily infected by Perkinsus olseni. Functional annotation of differentially expressed genes between those two conditions was performed by gene set enrichment analysis. As expected, microarrays unveil genes related with stress/infectious agents such as hydrolases, proteases and others. The extensive role of the innate immune system was also analyzed and the effect of parasitosis upon expression of important molecules such as lectins reviewed. This study represents a first attempt to characterize the Ruditapes decussatus transcriptome, an important marine resource for European aquaculture. The transcriptome sequencing and consequent annotation will increase the available tools and resources for this species, introducing the possibility of high throughput experiments such as microarray analysis. In this specific case the microarray approach was used to unveil some important aspects of the host-parasite interaction between the Carpet shell clam and Perkinsus, two non-model species, highlighting some genes associated with this interaction. Ample information was obtained to identify biological processes significantly enriched among differentially expressed genes in Perkinsus infected versus non-infected gills.
An overview on the genes related with the immune system on R. decussatus transcriptome is also reported.
2009-01-01
Background Large discrepancies in signature composition and outcome concordance have been observed between different microarray breast cancer expression profiling studies. This is often ascribed to differences in array platform as well as biological variability. We conjecture that other reasons for the observed discrepancies are the measurement error associated with each feature and the choice of preprocessing method. Microarray data are known to be subject to technical variation and the confidence intervals around individual point estimates of expression levels can be wide. Furthermore, the estimated expression values also vary depending on the selected preprocessing scheme. In microarray breast cancer classification studies, however, these two forms of feature variability are almost always ignored and hence their exact role is unclear. Results We have performed a comprehensive sensitivity analysis of microarray breast cancer classification under the two types of feature variability mentioned above. We used data from six state-of-the-art preprocessing methods, using a compendium consisting of eight different datasets, involving 1131 hybridizations, containing data from both one- and two-color array technology. For a wide range of classifiers, we performed a joint study on performance, concordance and stability. In the stability analysis we explicitly tested classifiers for their noise tolerance by using perturbed expression profiles that are based on uncertainty information directly related to the preprocessing methods. Our results indicate that signature composition is strongly influenced by feature variability, even if the array platform and the stratification of patient samples are identical. In addition, we show that there is often a high level of discordance between individual class assignments for signatures constructed on data coming from different preprocessing schemes, even if the actual signature composition is identical.
Conclusion Feature variability can have a strong impact on breast cancer signature composition, as well as the classification of individual patient samples. We therefore strongly recommend that feature variability is considered in analyzing data from microarray breast cancer expression profiling experiments. PMID:19941644
McArt, Darragh G.; Dunne, Philip D.; Blayney, Jaine K.; Salto-Tellez, Manuel; Van Schaeybroeck, Sandra; Hamilton, Peter W.; Zhang, Shu-Dong
2013-01-01
The advent of next generation sequencing (NGS) technologies has expanded the area of genomic research, offering high coverage and increased sensitivity over older microarray platforms. Although the current cost of next generation sequencing still exceeds that of microarray approaches, the rapid advances in NGS will likely make it the platform of choice for future research in differential gene expression. Connectivity mapping is a procedure for examining the connections among diseases, genes and drugs by differential gene expression. It was initially based on microarray technology, with which a large collection of compound-induced reference gene expression profiles has been accumulated. In this work, we aim to test the feasibility of incorporating NGS RNA-Seq data into the current connectivity mapping framework by utilizing the microarray based reference profiles and the construction of a differentially expressed gene signature from an NGS dataset. This would allow for the establishment of connections between the NGS gene signature and those microarray reference profiles, avoiding the cost of re-creating drug profiles with NGS technology. We examined the connectivity mapping approach on a publicly available NGS dataset with androgen stimulation of LNCaP cells in order to extract candidate compounds that could inhibit the proliferative phenotype of LNCaP cells and to elucidate their potential in a laboratory setting. In addition, we also analyzed an independent microarray dataset of similar experimental settings. We found a high level of concordance between the top compounds identified using the gene signatures from the two datasets. The nicotine derivative cotinine was returned as the top candidate among the overlapping compounds with potential to suppress this proliferative phenotype. Subsequent lab experiments validated this connectivity mapping hit, showing that cotinine inhibits cell proliferation in an androgen dependent manner.
Thus the results in this study suggest a promising prospect of integrating NGS data with connectivity mapping. PMID:23840550
Kondo, S; Kamei, A; Xiao, J Z; Iwatsuki, K; Abe, K
2013-09-01
We previously reported that supplementation with Bifidobacterium breve B-3 reduced body weight gain and accumulation of visceral fat in a dose-dependent manner, and improved serum levels of total cholesterol, glucose and insulin in a mouse model of diet-induced obesity. In this study, we investigated the expression of genes in the liver using DNA microarray analysis and q-PCR to reveal the mechanism of these anti-obesity effects in this mouse model. Administration of B. breve B-3 led to regulated gene expression of pathways involved in lipid metabolism and response to stress. The results indicate that these regulations in the liver are related to the anti-metabolic syndrome effects of B. breve B-3.
COMPARISON OF COMPARATIVE GENOMIC HYBRIDIZATIONS TECHNOLOGIES ACROSS MICROARRAY PLATFORMS
Comparative Genomic Hybridization (CGH) measures DNA copy number differences between a reference genome and a test genome. The DNA samples are differentially labeled and hybridized to an immobilized substrate. In early CGH experiments, the DNA targets were hybridized to metaphase...
Andrews, C D; Payne, J F; Rise, M L
2014-01-01
Functional genomic studies were carried out on the inner ear of Atlantic salmon Salmo salar following exposure to a seismic airgun. Microarray analyses revealed 79 unique transcripts (passing background threshold), with 42 reproducibly up-regulated and 37 reproducibly down-regulated in exposed v. control fish. Regarding the potential effects on cellular energetics and cellular respiration, altered transcripts included those with roles in oxygen transport, the glycolytic pathway, the Krebs cycle and the electron transport chain. Of these, a number of transcripts encoding haemoglobins that are important in oxygen transport were up-regulated and among the most highly expressed. Up-regulation of transcripts encoding nicotinamide riboside kinase 2, which is also important in energy production and linked to nerve cell damage, points to evidence of neuronal damage in the ear following noise exposure. Transcripts related to protein modification or degradation also indicated potential damaging effects of sound on ear tissues. Notable in this regard were transcripts associated with the proteasome–ubiquitin pathway, which is involved in protein degradation, with the transcript encoding ubiquitin family domain-containing protein 1 displaying the highest response to exposure. The differential expression of transcripts observed for some immune responses could potentially be linked to the rupture of cell membranes. Meanwhile, the altered expression of transcripts for cytoskeletal proteins that contribute to the structural integrity of the inner ear could point to repair or regeneration of ear tissues including auditory hair cells. Regarding potential effects on hormones and vitamins, the protein carrier for thyroxine and retinol (vitamin A), namely transthyretin, was altered at the transcript expression level and it has been suggested from studies in mammalian systems that retinoic acid may play a role in the regeneration of damaged hair cells. 
The microarray experiment identified the transcript encoding growth hormone I as up-regulated by loud sound, supporting previous evidence linking growth hormone to hair cell regeneration in fishes. Quantitative (q) reverse transcription (RT) polymerase chain reaction (qRT-PCR) analyses confirmed dysregulation of some microarray-identified transcripts and in some cases revealed a high level of biological variability in the exposed group. These results support the potential utility of molecular biomarkers to evaluate the effect of seismic surveys on fishes with studies on the ears being placed in a priority category for development of exposure–response relationships. Knowledge of such relationships is necessary for addressing the question of potential size of injury zones. PMID:24814183
Aberrant expression of long noncoding RNAs in cumulus cells isolated from PCOS patients.
Huang, Xin; Hao, Cuifang; Bao, Hongchu; Wang, Meimei; Dai, Huangguan
2016-01-01
To describe the long noncoding RNA (lncRNA) profiles in cumulus cells isolated from polycystic ovary syndrome (PCOS) patients by employing a microarray and in-depth bioinformatics analysis. This information will help us understand the occurrence and development of PCOS. In this study, we used a microarray to describe lncRNA profiles in cumulus cells isolated from ten patients (five PCOS and five normal women). Several differentially expressed lncRNAs were chosen to validate the microarray results by quantitative RT-PCR (qRT-PCR). Then, the differentially expressed lncRNAs were classified into three subgroups (HOX loci lncRNA, enhancer-like lncRNA, and lincRNA) to deduce their potential features. Furthermore, a lncRNA/mRNA co-expression network was constructed by using the Cytoscape software (V2.8.3, http://www.cytoscape.org/ ). We observed that 623 lncRNAs and 260 messenger RNAs (mRNAs) were significantly up- or down-regulated (≥2-fold change), and these differences could be used to discriminate cumulus cells of PCOS from those of normal patients. Five differentially expressed lncRNAs (XLOC_011402, ENST00000454271, ENST00000433673, ENST00000450294, and ENST00000432431) were selected to validate the microarray results using quantitative RT-PCR (qRT-PCR). The qRT-PCR results were consistent with the microarray data. Further analysis indicated that many differentially expressed lncRNAs were transcribed from chromosome 2 and may act as enhancers to regulate their neighboring protein-coding genes. Forty-three lncRNAs and 29 mRNAs were used to construct the coding-non-coding gene co-expression network. Most pairs positively correlated, and one mRNA correlated with one or more lncRNAs. Our study is the first to determine genome-wide lncRNA expression patterns in cumulus cells isolated from PCOS patients by microarray. 
The results show that clusters of lncRNAs were aberrantly expressed in cumulus cells of PCOS patients compared with those of normal women, which revealed that lncRNAs differentially expressed in PCOS and normal women may contribute to the occurrence of PCOS and affect oocyte development.
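The ≥2-fold change criterion reported in this abstract can be sketched as a simple filter on group means. This is a minimal illustration only: the function name, the transcript names and the intensity values are hypothetical, and the abstract does not state which additional statistical test or normalization the authors applied, so none is shown here.

```python
import math


def differentially_expressed(case_mean, control_mean, min_fold=2.0):
    """Return log2 fold changes for transcripts changed by at least min_fold."""
    threshold = math.log2(min_fold)
    hits = {}
    for name, case in case_mean.items():
        # Positive values are up-regulated in cases, negative are down-regulated.
        log_fc = math.log2(case / control_mean[name])
        if abs(log_fc) >= threshold:
            hits[name] = log_fc
    return hits


# Hypothetical mean intensities for three transcripts in each group.
pcos = {"lnc1": 800.0, "lnc2": 110.0, "lnc3": 50.0}
normal = {"lnc1": 200.0, "lnc2": 100.0, "lnc3": 200.0}

hits = differentially_expressed(pcos, normal)
```

Working on the log2 scale treats up- and down-regulation symmetrically, so a 4-fold increase and a 4-fold decrease both pass the same threshold.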
Nowrousian, Minou; Ringelberg, Carol; Dunlap, Jay C; Loros, Jennifer J; Kück, Ulrich
2005-04-01
The filamentous fungus Sordaria macrospora forms complex three-dimensional fruiting bodies that protect the developing ascospores and ensure their proper discharge. Several regulatory genes essential for fruiting body development were previously isolated by complementation of the sterile mutants pro1, pro11 and pro22. To establish the genetic relationships between these genes and to identify downstream targets, we have conducted cross-species microarray hybridizations using cDNA arrays derived from the closely related fungus Neurospora crassa and RNA probes prepared from wild-type S. macrospora and the three developmental mutants. Of the 1,420 genes which gave a signal with the probes from all the strains used, 172 (12%) were regulated differently in at least one of the three mutants compared to the wild type, and 17 (1.2%) were regulated differently in all three mutant strains. Microarray data were verified by Northern analysis or quantitative real time PCR. Among the genes that are up- or down-regulated in the mutant strains are genes encoding the pheromone precursors, enzymes involved in melanin biosynthesis and a lectin-like protein. Analysis of gene expression in double mutants revealed a complex network of interaction between the pro gene products.
Song, Xuezheng; Lasanajak, Yi; Rivera-Marrero, Carlos; Luyai, Anthony; Willard, Margaret; Smith, David F; Cummings, Richard D
2009-12-15
Glycan microarray technology has become a successful tool for studying protein-carbohydrate interactions, but a limitation has been the laborious synthesis of glycan structures by enzymatic and chemical methods. Here we describe a new method to generate quantifiable glycan libraries from natural sources by combining widely used protease digestion of glycoproteins and Fmoc chemistry. Glycoproteins including chicken ovalbumin, bovine fetuin, and horseradish peroxidase (HRP) were digested by Pronase, protected by FmocCl, and efficiently separated by 2D-HPLC. We show that glycans from HRP glycopeptides separated by HPLC and fluorescence monitoring retained their natural reducing end structures, mostly core alpha1,3-fucose and core alpha1,2-xylose. After simple Fmoc deprotection, the glycans were printed on NHS-activated glass slides. The glycans were interrogated using plant lectins and antibodies in sera from mice infected with Schistosoma mansoni, which revealed the presence of both IgM and IgG antibody responses to HRP glycopeptides. This simple approach to glycopeptide purification and conjugation allows for the development of natural glycopeptide microarrays without the need to remove and derivatize glycans and potentially compromise their reducing end determinants.
DNA microarray analysis is plagued by a lack of data reproducibility and by limits to the detectability of transcripts by hybridization. To mitigate these limitations, we employed transcriptional coupling within the S. typhimurium genome. This genome has 2664 transcriptionally co...
USDA-ARS's Scientific Manuscript database
C. jejuni colonizes the intestinal mucosa, and the severity of disease in different strains is correlated with host cell interaction and invasion. A microarray screen to identify genes differentially regulated during C. jejuni interaction with tissue culture cells revealed the up-regulation of a two...
Huang, Hui-Ling; Wu, Yu-Chung; Su, Li-Jen; Huang, Yun-Ju; Charoenkwan, Phasit; Chen, Wen-Liang; Lee, Hua-Chin; Chu, William Cheng-Chung; Ho, Shinn-Ying
2015-02-21
Few studies have investigated prognostic biomarkers of distant metastases of lung cancer. One of the central difficulties in identifying biomarkers from microarray data is the availability of only a small number of samples, which results in overtraining. Recently obtained evidence reveals that epithelial-mesenchymal transition (EMT) of tumor cells causes metastasis, which is detrimental to patients' survival. This work proposes a novel optimization approach to discovering EMT-related prognostic biomarkers to predict the distant metastasis of lung cancer using both microarray and survival data. Its weighted objective function maximizes both the accuracy of prediction of distant metastasis and the area between the disease-free survival curves of the non-distant and distant metastases. Seventy-eight patients with lung cancer and a follow-up time of 120 months are used to identify a set of gene markers and an independent cohort of 26 patients is used to evaluate the identified biomarkers. The medical records of the 78 patients show a significant difference between the disease-free survival times of the 37 non-distant- and the 41 distant-metastasis patients. The experimental results thus obtained are as follows. 1) The use of disease-free survival curves can compensate for the shortcoming of insufficient samples and greatly increase the test accuracy by 11.10%; and 2) the support vector machine with a set of 17 transcripts, such as CCL16 and CDKN2AIP, can yield a leave-one-out cross-validation accuracy of 93.59%, a test accuracy of 76.92%, a large disease-free survival area of 74.81%, and a mean survival prediction error of 3.99 months. The identified putative biomarkers are examined using related studies and signaling pathways to reveal the potential effectiveness of the biomarkers in prospective confirmatory studies.
The proposed new optimization approach to identifying prognostic biomarkers by combining multiple sources of data (microarray and survival) can facilitate the accurate selection of biomarkers that are most relevant to the disease while solving the problem of insufficient samples.
Alshamlan, Hala; Badr, Ghada; Alohali, Yousef
2015-01-01
An artificial bee colony (ABC) is a relatively recent swarm intelligence optimization approach. In this paper, we propose the first attempt at applying the ABC algorithm in analyzing a microarray gene expression profile. In addition, we propose an innovative feature selection algorithm, minimum redundancy maximum relevance (mRMR), and combine it with an ABC algorithm, mRMR-ABC, to select informative genes from a microarray profile. The new approach is based on a support vector machine (SVM) algorithm to measure the classification accuracy for selected genes. We evaluate the performance of the proposed mRMR-ABC algorithm by conducting extensive experiments on six binary and multiclass gene expression microarray datasets. Furthermore, we compare our proposed mRMR-ABC algorithm with previously known techniques. We reimplemented two of these techniques for the sake of a fair comparison using the same parameters. These two techniques are mRMR when combined with a genetic algorithm (mRMR-GA) and mRMR when combined with a particle swarm optimization algorithm (mRMR-PSO). The experimental results prove that the proposed mRMR-ABC algorithm achieves accurate classification performance using a small number of predictive genes when tested using both datasets and compared to previously suggested methods. This shows that mRMR-ABC is a promising approach for solving gene selection and cancer classification problems. PMID:25961028
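The mRMR filtering step named in this abstract can be sketched as a greedy selection that scores each candidate gene by its mutual information with the class labels minus its mean mutual information with the genes already chosen. This is a minimal sketch under stated assumptions: expression values are taken to be already discretized, the function names and toy data are hypothetical, and the ABC search and SVM evaluation described by the authors are not reproduced.

```python
import math
from collections import Counter


def mutual_information(xs, ys):
    """Mutual information in bits between two equal-length discrete sequences."""
    n = len(xs)
    px, py, pxy = Counter(xs), Counter(ys), Counter(zip(xs, ys))
    return sum(
        (c / n) * math.log2(c * n / (px[x] * py[y]))
        for (x, y), c in pxy.items()
    )


def mrmr_select(features, labels, k):
    """Greedily pick k features by relevance minus mean redundancy."""
    relevance = {name: mutual_information(col, labels) for name, col in features.items()}
    selected = []
    while len(selected) < min(k, len(features)):
        best_name, best_score = None, -math.inf
        for name, col in features.items():
            if name in selected:
                continue
            # Redundancy is the mean MI with the features chosen so far.
            redundancy = (
                sum(mutual_information(col, features[s]) for s in selected) / len(selected)
                if selected
                else 0.0
            )
            score = relevance[name] - redundancy
            if score > best_score:
                best_name, best_score = name, score
        selected.append(best_name)
    return selected


# Toy data: g1 and g3 are independent and jointly determine the class,
# while g2 is a noisy copy of g1 and therefore largely redundant with it.
labels = [0, 0, 1, 1, 2, 2, 3, 3]
features = {
    "g1": [0, 0, 0, 0, 1, 1, 1, 1],
    "g2": [0, 0, 0, 1, 1, 1, 1, 1],
    "g3": [0, 0, 1, 1, 0, 0, 1, 1],
}
selected = mrmr_select(features, labels, k=2)
```

In this toy case the redundant gene g2 is passed over in favour of g3, which carries information about the class that g1 does not.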
Swimley, Michelle S.; Taylor, Amber W.; Dawson, Erica D.
2011-01-01
Abstract Shiga toxin–producing Escherichia coli O157 is a leading cause of foodborne illness worldwide. To evaluate better methods to rapidly detect and genotype E. coli O157 strains, the present study evaluated the use of ampliPHOX, a novel colorimetric detection method based on photopolymerization, for pathogen identification with DNA microarrays. A low-density DNA oligonucleotide microarray was designed to target stx1 and stx2 genes encoding Shiga toxin production, the eae gene coding for adherence membrane protein, and the per gene encoding the O157-antigen perosamine synthetase. Results from the validation experiments demonstrated that the use of ampliPHOX allowed the accurate genotyping of the tested E. coli strains, and positive hybridization signals were observed for only probes targeting virulence genes present in the reference strains. Quantification showed that the average signal-to-noise ratio values ranged from 47.73 ± 7.12 to 76.71 ± 8.33, whereas average signal-to-noise ratio values below 2.5 were determined for probes where no polymer was formed due to lack of specific hybridization. Sensitivity tests demonstrated that the sensitivity threshold for E. coli O157 detection was 100–1000 CFU/mL. Thus, the use of DNA microarrays in combination with photopolymerization allowed the rapid and accurate genotyping of E. coli O157 strains. PMID:21288130
High throughput gene expression profiling: a molecular approach to integrative physiology
Liang, Mingyu; Cowley, Allen W; Greene, Andrew S
2004-01-01
Integrative physiology emphasizes the importance of understanding multiple pathways with overlapping, complementary, or opposing effects and their interactions in the context of intact organisms. The DNA microarray technology, the most commonly used method for high-throughput gene expression profiling, has been touted as an integrative tool that provides insights into regulatory pathways. However, the physiology community has been slow in acceptance of these techniques because of early failure in generating useful data and the lack of a cohesive theoretical framework in which experiments can be analysed. With recent advances in both technology and analysis, we propose a concept of multidimensional integration of physiology that incorporates data generated by DNA microarray and other functional, genomic, and proteomic approaches to achieve a truly integrative understanding of physiology. Analysis of several studies performed in simpler organisms or in mammalian model animals supports the feasibility of such multidimensional integration and demonstrates the power of DNA microarray as an indispensable molecular tool for such integration. Evaluation of DNA microarray techniques indicates that these techniques, despite limitations, have advanced to a point where the question-driven profiling research has become a feasible complement to the conventional, hypothesis-driven research. With a keen sense of homeostasis, global regulation, and quantitative analysis, integrative physiologists are uniquely positioned to apply these techniques to enhance the understanding of complex physiological functions. PMID:14678487
An expression database for roots of the model legume Medicago truncatula under salt stress
2009-01-01
Background Medicago truncatula is a model legume whose genome is currently being sequenced by an international consortium. Abiotic stresses such as salt stress limit plant growth and crop productivity, including those of legumes. We anticipate that studies on M. truncatula will shed light on other economically important legumes across the world. Here, we report the development of a database called MtED that contains gene expression profiles of the roots of M. truncatula based on time-course salt stress experiments using the Affymetrix Medicago GeneChip. Our hope is that MtED will provide information to assist in improving abiotic stress resistance in legumes. Description The results of our microarray experiment with roots of M. truncatula under 180 mM sodium chloride were deposited in the MtED database. Additionally, sequence and annotation information regarding microarray probe sets were included. MtED provides functional category analysis based on Gene and GeneBins Ontology, and other Web-based tools for querying and retrieving query results, browsing pathways and transcription factor families, showing metabolic maps, and comparing and visualizing expression profiles. Utilities like mapping probe sets to genome of M. truncatula and In-Silico PCR were implemented by BLAT software suite, which were also available through MtED database. Conclusion MtED was built in the PHP script language and as a MySQL relational database system on a Linux server. It has an integrated Web interface, which facilitates ready examination and interpretation of the results of microarray experiments. It is intended to help in selecting gene markers to improve abiotic stress resistance in legumes. MtED is available at http://bioinformatics.cau.edu.cn/MtED/. PMID:19906315
An expression database for roots of the model legume Medicago truncatula under salt stress.
Li, Daofeng; Su, Zhen; Dong, Jiangli; Wang, Tao
2009-11-11
Medicago truncatula is a model legume whose genome is currently being sequenced by an international consortium. Abiotic stresses such as salt stress limit plant growth and crop productivity, including those of legumes. We anticipate that studies on M. truncatula will shed light on other economically important legumes across the world. Here, we report the development of a database called MtED that contains gene expression profiles of the roots of M. truncatula based on time-course salt stress experiments using the Affymetrix Medicago GeneChip. Our hope is that MtED will provide information to assist in improving abiotic stress resistance in legumes. The results of our microarray experiment with roots of M. truncatula under 180 mM sodium chloride were deposited in the MtED database. Additionally, sequence and annotation information regarding microarray probe sets were included. MtED provides functional category analysis based on Gene and GeneBins Ontology, and other Web-based tools for querying and retrieving query results, browsing pathways and transcription factor families, showing metabolic maps, and comparing and visualizing expression profiles. Utilities like mapping probe sets to genome of M. truncatula and In-Silico PCR were implemented by BLAT software suite, which were also available through MtED database. MtED was built in the PHP script language and as a MySQL relational database system on a Linux server. It has an integrated Web interface, which facilitates ready examination and interpretation of the results of microarray experiments. It is intended to help in selecting gene markers to improve abiotic stress resistance in legumes. MtED is available at http://bioinformatics.cau.edu.cn/MtED/.
Holloway, Andrew J; Oshlack, Alicia; Diyagama, Dileepa S; Bowtell, David DL; Smyth, Gordon K
2006-01-01
Background Concerns are often raised about the accuracy of microarray technologies and the degree of cross-platform agreement, but there are yet no methods which can unambiguously evaluate precision and sensitivity for these technologies on a whole-array basis. Results A methodology is described for evaluating the precision and sensitivity of whole-genome gene expression technologies such as microarrays. The method consists of an easy-to-construct titration series of RNA samples and an associated statistical analysis using non-linear regression. The method evaluates the precision and responsiveness of each microarray platform on a whole-array basis, i.e., using all the probes, without the need to match probes across platforms. An experiment is conducted to assess and compare four widely used microarray platforms. All four platforms are shown to have satisfactory precision but the commercial platforms are superior for resolving differential expression for genes at lower expression levels. The effective precision of the two-color platforms is improved by allowing for probe-specific dye-effects in the statistical model. The methodology is used to compare three data extraction algorithms for the Affymetrix platforms, demonstrating poor performance for the commonly used proprietary algorithm relative to the other algorithms. For probes which can be matched across platforms, the cross-platform variability is decomposed into within-platform and between-platform components, showing that platform disagreement is almost entirely systematic rather than due to measurement variability. Conclusion The results demonstrate good precision and sensitivity for all the platforms, but highlight the need for improved probe annotation. They quantify the extent to which cross-platform measures can be expected to be less accurate than within-platform comparisons for predicting disease progression or outcome. PMID:17118209
Linking microarray reporters with protein functions
Gaj, Stan; van Erk, Arie; van Haaften, Rachel IM; Evelo, Chris TA
2007-01-01
Background The analysis of microarray experiments requires accurate and up-to-date functional annotation of the microarray reporters to optimize the interpretation of the biological processes involved. Pathway visualization tools are used to connect gene expression data with existing biological pathways by using specific database identifiers that link reporters with elements in the pathways. Results This paper proposes a novel method that aims to improve microarray reporter annotation by BLASTing the original reporter sequences against a species-specific EMBL subset, that was derived from and crosslinked back to the highly curated UniProt database. The resulting alignments were filtered using high quality alignment criteria and further compared with the outcome of a more traditional approach, where reporter sequences were BLASTed against EnsEMBL followed by locating the corresponding protein (UniProt) entry for the high quality hits. Combining the results of both methods resulted in successful annotation of > 58% of all reporter sequences with UniProt IDs on two commercial array platforms, increasing the amount of Incyte reporters that could be coupled to Gene Ontology terms from 32.7% to 58.3% and to a local GenMAPP pathway from 9.6% to 16.7%. For Agilent, 35.3% of the total reporters are now linked towards GO nodes and 7.1% on local pathways. Conclusion Our methods increased the annotation quality of microarray reporter sequences and allowed us to visualize more reporters using pathway visualization tools. Even in cases where the original reporter annotation showed the correct description the new identifiers often allowed improved pathway and Gene Ontology linking. These methods are freely available at http://www.bigcat.unimaas.nl/public/publications/Gaj_Annotation/. PMID:17897448
Pan, Deyun; Sun, Ning; Cheung, Kei-Hoi; Guan, Zhong; Ma, Ligeng; Holford, Matthew; Deng, Xingwang; Zhao, Hongyu
2003-11-07
To date, many genomic and pathway-related tools and databases have been developed to analyze microarray data. In published web-based applications to date, however, complex pathways have been displayed with static image files that may not be up-to-date or are time-consuming to rebuild. In addition, gene expression analyses focus on individual probes and genes with little or no consideration of pathways. These approaches reveal little information about pathways that are key to a full understanding of the building blocks of biological systems. Therefore, there is a need to provide useful tools that can generate pathways without manually building images and allow gene expression data to be integrated and analyzed at pathway levels for such experimental organisms as Arabidopsis. We have developed PathMAPA, a web-based application written in Java that can be easily accessed over the Internet. An Oracle database is used to store, query, and manipulate the large amounts of data that are involved. PathMAPA allows its users to (i) upload and populate microarray data into a database; (ii) integrate gene expression with enzymes of the pathways; (iii) generate pathway diagrams without building image files manually; (iv) visualize gene expressions for each pathway at enzyme, locus, and probe levels; and (v) perform statistical tests at pathway, enzyme and gene levels. PathMAPA can be used to examine Arabidopsis thaliana gene expression patterns associated with metabolic pathways. PathMAPA provides two unique features for the gene expression analysis of Arabidopsis thaliana: (i) automatic generation of pathways associated with gene expression and (ii) statistical tests at pathway level. The first feature allows for the periodical updating of genomic data for pathways, while the second feature can provide insight into how treatments affect relevant pathways for the selected experiment(s).
Pan, Deyun; Sun, Ning; Cheung, Kei-Hoi; Guan, Zhong; Ma, Ligeng; Holford, Matthew; Deng, Xingwang; Zhao, Hongyu
2003-01-01
Background To date, many genomic and pathway-related tools and databases have been developed to analyze microarray data. In published web-based applications to date, however, complex pathways have been displayed with static image files that may not be up-to-date or are time-consuming to rebuild. In addition, gene expression analyses focus on individual probes and genes with little or no consideration of pathways. These approaches reveal little information about pathways that are key to a full understanding of the building blocks of biological systems. Therefore, there is a need to provide useful tools that can generate pathways without manually building images and allow gene expression data to be integrated and analyzed at pathway levels for such experimental organisms as Arabidopsis. Results We have developed PathMAPA, a web-based application written in Java that can be easily accessed over the Internet. An Oracle database is used to store, query, and manipulate the large amounts of data that are involved. PathMAPA allows its users to (i) upload and populate microarray data into a database; (ii) integrate gene expression with enzymes of the pathways; (iii) generate pathway diagrams without building image files manually; (iv) visualize gene expressions for each pathway at enzyme, locus, and probe levels; and (v) perform statistical tests at pathway, enzyme and gene levels. PathMAPA can be used to examine Arabidopsis thaliana gene expression patterns associated with metabolic pathways. Conclusion PathMAPA provides two unique features for the gene expression analysis of Arabidopsis thaliana: (i) automatic generation of pathways associated with gene expression and (ii) statistical tests at pathway level. The first feature allows for the periodical updating of genomic data for pathways, while the second feature can provide insight into how treatments affect relevant pathways for the selected experiment(s). PMID:14604444
Dai, Yilin; Guo, Ling; Li, Meng; Chen, Yi-Bu
2012-06-08
Microarray data analysis presents a significant challenge to researchers who are unable to use the powerful Bioconductor and its numerous tools due to their lack of knowledge of R language. Among the few existing software programs that offer a graphic user interface to Bioconductor packages, none have implemented a comprehensive strategy to address the accuracy and reliability issue of microarray data analysis due to the well known probe design problems associated with many widely used microarray chips. There is also a lack of tools that would expedite the functional analysis of microarray results. We present Microarray Я US, an R-based graphical user interface that implements over a dozen popular Bioconductor packages to offer researchers a streamlined workflow for routine differential microarray expression data analysis without the need to learn R language. In order to enable a more accurate analysis and interpretation of microarray data, we incorporated the latest custom probe re-definition and re-annotation for Affymetrix and Illumina chips. A versatile microarray results output utility tool was also implemented for easy and fast generation of input files for over 20 of the most widely used functional analysis software programs. Coupled with a well-designed user interface, Microarray Я US leverages cutting edge Bioconductor packages for researchers with no knowledge in R language. It also enables a more reliable and accurate microarray data analysis and expedites downstream functional analysis of microarray results.
Lactobacillus reuteri 100-23 modulates urea hydrolysis in the murine stomach.
Wilson, Charlotte M; Loach, Diane; Lawley, Blair; Bell, Tracey; Sims, Ian M; O'Toole, Paul W; Zomer, Aldert; Tannock, Gerald W
2014-10-01
Comparisons of in vivo (mouse stomach) and in vitro (laboratory culture) transcriptomes of Lactobacillus reuteri strain 100-23 were made by microarray analysis. These comparisons revealed the upregulation of genes associated with acid tolerance, including urease production, in the mouse stomach. Inactivation of the ureC gene reduced the acid tolerance of strain 100-23 in vitro, and the mutant was outcompeted by the wild type in the gut of ex-Lactobacillus-free mice. Urine analysis showed that stable isotope-labeled urea, administered by gavage, was metabolized to a greater extent in Lactobacillus-free mice than animals colonized by strain 100-23. This surprising observation was associated with higher levels of urease activity and fecal-type bacteria in the stomach digesta of Lactobacillus-free mice. Despite the modulation of urea hydrolysis in the stomach, recycling of urea nitrogen in the murine host was not affected since the essential amino acid isoleucine, labeled with a stable isotope, was detected in the livers of both Lactobacillus-free and 100-23-colonized animals. Therefore, our experiments reveal a new and unexpected impact of Lactobacillus colonization on urea hydrolysis in the murine gut. Copyright © 2014, American Society for Microbiology. All Rights Reserved.
Lactobacillus reuteri 100-23 Modulates Urea Hydrolysis in the Murine Stomach
Wilson, Charlotte M.; Loach, Diane; Lawley, Blair; Bell, Tracey; Sims, Ian M.; O'Toole, Paul W.; Zomer, Aldert
2014-01-01
Comparisons of in vivo (mouse stomach) and in vitro (laboratory culture) transcriptomes of Lactobacillus reuteri strain 100-23 were made by microarray analysis. These comparisons revealed the upregulation of genes associated with acid tolerance, including urease production, in the mouse stomach. Inactivation of the ureC gene reduced the acid tolerance of strain 100-23 in vitro, and the mutant was outcompeted by the wild type in the gut of ex-Lactobacillus-free mice. Urine analysis showed that stable isotope-labeled urea, administered by gavage, was metabolized to a greater extent in Lactobacillus-free mice than animals colonized by strain 100-23. This surprising observation was associated with higher levels of urease activity and fecal-type bacteria in the stomach digesta of Lactobacillus-free mice. Despite the modulation of urea hydrolysis in the stomach, recycling of urea nitrogen in the murine host was not affected since the essential amino acid isoleucine, labeled with a stable isotope, was detected in the livers of both Lactobacillus-free and 100-23-colonized animals. Therefore, our experiments reveal a new and unexpected impact of Lactobacillus colonization on urea hydrolysis in the murine gut. PMID:25063664
DOE Office of Scientific and Technical Information (OSTI.GOV)
Eric E. Roden
2009-07-08
This report summarizes research conducted in conjunction with a project entitled “Integrated Nucleic Acid System for In-Field Monitoring of Microbial Community Dynamics and Metabolic Activity”, which was funded through the Integrative Studies Element of the former NABIR Program (now the Environmental Remediation Sciences Program) within the Office of Biological and Environmental Research. Dr. Darrell Chandler (originally at Argonne National Laboratory, now with Akonni Biosystems) was the overall PI/PD for the project. The overall project goals were to (1) apply a model iron-reducer and sulfate-reducer microarray and instrumentation systems to sediment and groundwater samples from the Scheibe et al. FRC Area 2 field site, UMTRA sediments, and other DOE contaminated sites; (2) continue development and expansion of a 16S rRNA/rDNA-targeted probe suite for microbial community dynamics as new sequences are obtained from DOE-relevant sites; and (3) address the fundamental molecular biology and analytical chemistry associated with the extraction, purification and analysis of functional genes and mRNA in environmental samples. Work on the UW subproject focused on conducting detailed batch and semicontinuous culture reactor experiments with uranium-contaminated FRC Area 2 sediment. The reactor experiments were designed to provide coherent geochemical and microbiological data in support of microarray analyses of microbial communities in Area 2 sediments undergoing biostimulation with ethanol. A total of four major experiments were conducted (one batch and three semicontinuous culture), three of which (the batch and two semicontinuous culture) provided samples for DNA microarray analysis. A variety of other molecular analyses (clone libraries, 16S PhyloChip, RT-PCR, and T-RFLP) were conducted on parallel samples from the various experiments in order to provide independent information on microbial community response to biostimulation.
ArrayInitiative - a tool that simplifies creating custom Affymetrix CDFs
2011-01-01
Background Probes on a microarray represent a frozen view of a genome and are quickly outdated when new sequencing studies extend our knowledge, resulting in significant measurement error when analyzing any microarray experiment. There are several bioinformatics approaches to improve probe assignments, but without in-house programming expertise, standardizing these custom array specifications as a usable file (e.g. as Affymetrix CDFs) is difficult, owing mostly to the complexity of the specification file format. However, without correctly standardized files there is a significant barrier for testing competing analysis approaches since this file is one of the required inputs for many commonly used algorithms. The need to test combinations of probe assignments and analysis algorithms led us to develop ArrayInitiative, a tool for creating and managing custom array specifications. Results ArrayInitiative is a standalone, cross-platform, rich client desktop application for creating correctly formatted, custom versions of manufacturer-provided (default) array specifications, requiring only minimal knowledge of the array specification rules and file formats. Users can import default array specifications, import probe sequences for a default array specification, design and import a custom array specification, export any array specification to multiple output formats, export the probe sequences for any array specification and browse high-level information about the microarray, such as version and number of probes. The initial release of ArrayInitiative supports the Affymetrix 3' IVT expression arrays we currently analyze, but as an open source application, we hope that others will contribute modules for other platforms. Conclusions ArrayInitiative allows researchers to create new array specifications, in a standard format, based upon their own requirements. This makes it easier to test competing design and analysis strategies that depend on probe definitions. 
Since the custom array specifications are easily exported to the manufacturer's standard format, researchers can analyze these customized microarray experiments using established software tools, such as those available in Bioconductor. PMID:21548938
Stannous Fluoride Effects on Gene Expression of Streptococcus mutans and Actinomyces viscosus.
Shi, Y; Li, R; White, D J; Biesbrock, A R
2018-02-01
A genome-wide transcriptional analysis was performed to elucidate the bacterial cellular response of Streptococcus mutans and Actinomyces viscosus to NaF and SnF2. The minimal inhibitory concentration (MIC) and minimal bactericidal concentration (MBC) of SnF2 were predetermined before the microarray study. Gene expression profiling microarray experiments were carried out in the absence (control) and presence (experimental) of 10 ppm and 100 ppm Sn2+ (in the form of SnF2) and fluoride controls for 10-min exposures (4 biological replicates/treatment). These Sn2+ levels and treatment time were chosen because they have been shown to slow bacterial growth of S. mutans (10 ppm) and A. viscosus (100 ppm) without affecting cell viability. All data generated by microarray experiments were analyzed with bioinformatics tools by applying the following criteria: 1) a q value should be ≤0.05, and 2) an absolute fold change in transcript level should be ≥1.5. Microarray results showed SnF2 significantly inhibited several genes encoding enzymes of the galactose pathway upon a 10-min exposure versus a negative control: lacA and lacB (A and B subunits of the galactose-6-P isomerase), lacC (tagatose-6-P kinase), lacD (tagatose-1,6-bP aldolase), galK (galactokinase), galT (galactose-1-phosphate uridylyltransferase), and galE (UDP-glucose 4-epimerase). A gene fruK encoding fructose-1-phosphate kinase in the fructose pathway was also significantly inhibited. Several genes encoding fructose/mannose-specific enzyme IIABC components in the phosphotransferase system (PTS) were also downregulated, as was ldh encoding lactate dehydrogenase, a key enzyme involved in lactic acid synthesis. SnF2 downregulated the transcription of most key enzyme genes involved in the galactose pathway and also suppressed several key genes involved in the PTS, which transports sugars into the cell in the first step of glycolysis.
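The two selection criteria stated in the abstract above (q value ≤0.05 and absolute fold change ≥1.5) amount to a simple joint filter, sketched here. The record layout, gene names, and signal values are hypothetical placeholders, not data from the study, and the sketch assumes fold change is computed as a ratio of mean signals on the linear scale.

```python
Q_MAX = 0.05         # criterion 1 from the abstract: q value <= 0.05
MIN_ABS_FOLD = 1.5   # criterion 2 from the abstract: absolute fold change >= 1.5


def absolute_fold_change(control, treated):
    """Fold change expressed as a ratio >= 1, regardless of direction."""
    ratio = treated / control
    return ratio if ratio >= 1 else 1 / ratio


def is_differentially_expressed(control, treated, q_value):
    """True only when both criteria hold at the same time."""
    return (
        q_value <= Q_MAX
        and absolute_fold_change(control, treated) >= MIN_ABS_FOLD
    )


# Hypothetical records: (gene, mean control signal, mean treated signal, q value)
records = [
    ("gene_1", 1000.0, 400.0, 0.010),  # 2.5-fold down, small q
    ("gene_2", 800.0, 700.0, 0.001),   # small q, but only about 1.14-fold
    ("gene_3", 500.0, 1500.0, 0.200),  # 3-fold up, but q too large
    ("gene_4", 300.0, 450.0, 0.050),   # sits exactly on both thresholds
]
hits = [
    name
    for name, control, treated, q in records
    if is_differentially_expressed(control, treated, q)
]
print(hits)
```

Taking the reciprocal for ratios below 1 treats a 2.5-fold decrease and a 2.5-fold increase symmetrically, which is what "absolute fold change" is assumed to mean here.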
NCBI GEO: archive for high-throughput functional genomic data.
Barrett, Tanya; Troup, Dennis B; Wilhite, Stephen E; Ledoux, Pierre; Rudnev, Dmitry; Evangelista, Carlos; Kim, Irene F; Soboleva, Alexandra; Tomashevsky, Maxim; Marshall, Kimberly A; Phillippy, Katherine H; Sherman, Patti M; Muertter, Rolf N; Edgar, Ron
2009-01-01
The Gene Expression Omnibus (GEO) at the National Center for Biotechnology Information (NCBI) is the largest public repository for high-throughput gene expression data. Additionally, GEO hosts other categories of high-throughput functional genomic data, including those that examine genome copy number variations, chromatin structure, methylation status and transcription factor binding. These data are generated by the research community using high-throughput technologies like microarrays and, more recently, next-generation sequencing. The database has a flexible infrastructure that can capture fully annotated raw and processed data, enabling compliance with major community-derived scientific reporting standards such as 'Minimum Information About a Microarray Experiment' (MIAME). In addition to serving as a centralized data storage hub, GEO offers many tools and features that allow users to effectively explore, analyze and download expression data from both gene-centric and experiment-centric perspectives. This article summarizes the GEO repository structure, content and operating procedures, as well as recently introduced data mining features. GEO is freely accessible at http://www.ncbi.nlm.nih.gov/geo/.
NCBI GEO: mining millions of expression profiles--database and tools.
Barrett, Tanya; Suzek, Tugba O; Troup, Dennis B; Wilhite, Stephen E; Ngau, Wing-Chi; Ledoux, Pierre; Rudnev, Dmitry; Lash, Alex E; Fujibuchi, Wataru; Edgar, Ron
2005-01-01
The Gene Expression Omnibus (GEO) at the National Center for Biotechnology Information (NCBI) is the largest fully public repository for high-throughput molecular abundance data, primarily gene expression data. The database has a flexible and open design that allows the submission, storage and retrieval of many data types. These data include microarray-based experiments measuring the abundance of mRNA, genomic DNA and protein molecules, as well as non-array-based technologies such as serial analysis of gene expression (SAGE) and mass spectrometry proteomic technology. GEO currently holds over 30,000 submissions representing approximately half a billion individual molecular abundance measurements, for over 100 organisms. Here, we describe recent database developments that facilitate effective mining and visualization of these data. Features are provided to examine data from both experiment- and gene-centric perspectives using user-friendly Web-based interfaces accessible to those without computational or microarray-related analytical expertise. The GEO database is publicly accessible through the World Wide Web at http://www.ncbi.nlm.nih.gov/geo.
Linear model for fast background subtraction in oligonucleotide microarrays.
Kroll, K Myriam; Barkema, Gerard T; Carlon, Enrico
2009-11-16
One important preprocessing step in the analysis of microarray data is background subtraction. In high-density oligonucleotide arrays this is recognized as a crucial step for the global performance of the data analysis from raw intensities to expression values. We propose here an algorithm for background estimation based on a model in which the cost function is quadratic in a set of fitting parameters such that minimization can be performed through linear algebra. The model incorporates two effects: 1) Correlated intensities between neighboring features in the chip and 2) sequence-dependent affinities for non-specific hybridization fitted by an extended nearest-neighbor model. The algorithm has been tested on 360 GeneChips from publicly available data of recent expression experiments. The algorithm is fast and accurate. Strong correlations between the fitted values for different experiments as well as between the free-energy parameters and their counterparts in aqueous solution indicate that the model captures a significant part of the underlying physical chemistry.
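The remark above that a cost function quadratic in its fitting parameters can be minimised "through linear algebra" can be shown on a deliberately tiny case. The sketch assumes a hypothetical two-parameter background model, b = a + c·x, where x stands in for some per-feature covariate (for example, a neighbouring-feature intensity); it is not the authors' model, which has many more parameters. Setting the gradient of the cost to zero yields linear normal equations, solved here in closed form.

```python
def fit_quadratic_cost(xs, ys):
    """Minimise sum((y - a - c*x)**2) over (a, c).

    The gradient of this quadratic cost vanishes where
        n*a     + sum(x)*c   = sum(y)
        sum(x)*a + sum(x*x)*c = sum(x*y)
    which is a 2x2 linear system, solved below with Cramer's rule.
    """
    n = len(xs)
    sx, sy = sum(xs), sum(ys)
    sxx = sum(x * x for x in xs)
    sxy = sum(x * y for x, y in zip(xs, ys))
    det = n * sxx - sx * sx
    if det == 0:
        raise ValueError("covariate is constant; parameters are not identifiable")
    a = (sy * sxx - sx * sxy) / det
    c = (n * sxy - sx * sy) / det
    return a, c


# Hypothetical noiseless data generated from a = 50, c = 0.5.
covariate = [1.0, 2.0, 3.0, 4.0]
observed = [50.5, 51.0, 51.5, 52.0]
offset, slope = fit_quadratic_cost(covariate, observed)
background = [offset + slope * x for x in covariate]
print(offset, slope)
```

Because no iterative optimisation is involved, the cost of the fit is that of one linear solve, which is the property that makes this class of model fast.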
Fast gene ontology based clustering for microarray experiments.
Ovaska, Kristian; Laakso, Marko; Hautaniemi, Sampsa
2008-11-21
Analysis of a microarray experiment often results in a list of hundreds of disease-associated genes. In order to suggest common biological processes and functions for these genes, Gene Ontology annotations with statistical testing are widely used. However, these analyses can produce a very large number of significantly altered biological processes. Thus, it is often challenging to interpret GO results and identify novel testable biological hypotheses. We present fast software for advanced gene annotation using semantic similarity for Gene Ontology terms combined with clustering and heat map visualisation. The methodology allows rapid identification of genes sharing the same Gene Ontology cluster. Our R based semantic similarity open-source package has a speed advantage of over 2000-fold compared to existing implementations. From the resulting hierarchical clustering dendrogram genes sharing a GO term can be identified, and their differences in the gene expression patterns can be seen from the heat map. These methods facilitate advanced annotation of genes resulting from data analysis.
Probabilistic segmentation and intensity estimation for microarray images.
Gottardo, Raphael; Besag, Julian; Stephens, Matthew; Murua, Alejandro
2006-01-01
We describe a probabilistic approach to simultaneous image segmentation and intensity estimation for complementary DNA microarray experiments. The approach overcomes several limitations of existing methods. In particular, it (a) uses a flexible Markov random field approach to segmentation that allows for a wider range of spot shapes than existing methods, including relatively common 'doughnut-shaped' spots; (b) models the image directly as background plus hybridization intensity, and estimates the two quantities simultaneously, avoiding the common logical error that estimates of foreground may be less than those of the corresponding background if the two are estimated separately; and (c) uses a probabilistic modeling approach to simultaneously perform segmentation and intensity estimation, and to compute spot quality measures. We describe two approaches to parameter estimation: a fast algorithm, based on the expectation-maximization and the iterated conditional modes algorithms, and a fully Bayesian framework. These approaches produce comparable results, and both appear to offer some advantages over other methods. We use an HIV experiment to compare our approach to two commercial software products: Spot and Arrayvision.
Kerr, Kathleen F; Serikawa, Kyle A; Wei, Caimiao; Peters, Mette A; Bumgarner, Roger E
2007-01-01
The reference design is a practical and popular choice for microarray studies using two-color platforms. In the reference design, the reference RNA uses half of all array resources, leading investigators to ask: What is the best reference RNA? We propose a novel method for evaluating reference RNAs and present the results of an experiment that was specially designed to evaluate three common choices of reference RNA. We found no compelling evidence in favor of any particular reference. In particular, a commercial reference showed no advantage in our data. Our experimental design also enabled a new way to test the effectiveness of pre-processing methods for two-color arrays. Our results favor using intensity normalization and forgoing background subtraction. Finally, we evaluate the sensitivity and specificity of data quality filters, and we propose a new filter that can be applied to any experimental design and does not rely on replicate hybridizations.
Tronser, Tina; Popova, Anna A; Jaggy, Mona; Bastmeyer, Martin; Levkin, Pavel A
2017-12-01
Over the past decades, stem cells have attracted growing interest in fundamental biological and biomedical research as well as in regenerative medicine, due to their unique ability to self-renew and differentiate into various cell types. Long-term maintenance of the self-renewal ability and inhibition of spontaneous differentiation, however, still remain challenging and are not fully understood. Uncontrolled spontaneous differentiation of stem cells makes high-throughput screening of stem cells also difficult. This further hinders investigation of the underlying mechanisms of stem cell differentiation and the factors that might affect it. In this work, a dual functionality of nanoporous superhydrophobic-hydrophilic micropatterns is demonstrated in their ability to inhibit differentiation of mouse embryonic stem cells (mESCs) and at the same time enable formation of arrays of microdroplets (droplet microarray) via the effect of discontinuous dewetting. Such combination makes high-throughput screening of undifferentiated mouse embryonic stem cells possible. The droplet microarray is used to investigate the development, differentiation, and maintenance of stemness of mESC, revealing the dependence of stem cell behavior on droplet volume in nano- and microliter scale. The inhibition of spontaneous differentiation of mESCs cultured on the droplet microarray for up to 72 h is observed. In addition, up to fourfold increased cell growth rate of mESCs cultured on our platform has been observed. The difference in the behavior of mESCs is attributed to the porosity and roughness of the polymer surface. This work demonstrates that the droplet microarray possesses the potential for the screening of mESCs under conditions of prolonged inhibition of stem cells' spontaneous differentiation. 
Such a platform can be useful for applications in the field of stem cell research, pharmacological testing of drug efficacy and toxicity, biomedical research as well as in the field of regenerative medicine and tissue engineering. © 2017 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Gene Expression Analyses of Subchondral Bone in Early Experimental Osteoarthritis by Microarray
Chen, YuXian; Shen, Jun; Lu, HuaDing; Zeng, Chun; Ren, JianHua; Zeng, Hua; Li, ZhiFu; Chen, ShaoMing; Cai, DaoZhang; Zhao, Qing
2012-01-01
Osteoarthritis (OA) is a degenerative joint disease that affects both cartilage and bone. A better understanding of the early molecular changes in subchondral bone may help elucidate the pathogenesis of OA. We used microarray technology to investigate the time course of molecular changes in the subchondral bone in the early stages of experimental osteoarthritis in a rat model. We identified 2,234 differentially expressed (DE) genes at 1 week, 1,944 at 2 weeks and 1,517 at 4 weeks post-surgery. Further analyses of the dysregulated genes indicated that the events underlying subchondral bone remodeling occurred sequentially and in a time-dependent manner at the gene expression level. Some of the dysregulated genes identified have suspected roles in bone development or remodeling; these genes include Alp, Igf1, Tgf β1, Postn, Mmp3, Tnfsf11, Acp5, Bmp5, Aspn and Ihh. The differences in the expression of these genes were confirmed by real-time PCR, and the results indicated that our microarray data accurately reflected gene expression patterns characteristic of early OA. To validate the results of our microarray analysis at the protein level, immunohistochemistry staining was used to investigate the expression of Mmp3 and Aspn protein in tissue sections. These analyses indicate that Mmp3 protein expression completely matched the results of both the microarray and real-time PCR analyses; however, Aspn protein expression was not observed to differ at any time. In summary, our study demonstrated a simple method for separating subchondral bone samples from the rat knee joint, which can effectively avoid bone RNA degradation. These findings also revealed the gene expression profiles of subchondral bone in the rat OA model at multiple time points post-surgery and identified important DE genes with known or suspected roles in bone development or remodeling. These genes may be novel diagnostic markers or therapeutic targets for OA. PMID:22384228
RNA sequencing: current and prospective uses in metabolic research.
Vikman, Petter; Fadista, Joao; Oskolkov, Nikolay
2014-10-01
Previous global RNA analysis was restricted to known transcripts in species with a defined transcriptome. Next generation sequencing has transformed transcriptomics by making it possible to analyse expressed genes with exon-level resolution from any tissue in any species without any a priori knowledge of which genes are being expressed, their splice patterns or their nucleotide sequence. In addition, RNA sequencing is a more sensitive technique than microarrays, with a larger dynamic range, and it also allows for investigation of imprinting and allele-specific expression. This can be done for a cost that is able to compete with that of a microarray, making RNA sequencing a technique available to most researchers. Therefore RNA sequencing has recently become the state of the art with regard to large-scale RNA investigations and has to a large extent replaced microarrays. The only drawback is the large amount of data produced, which together with the complexity of the data can make a researcher spend far more time on analysis than on performing the actual experiment. © 2014 Society for Endocrinology.
Development of an electro-responsive platform for the controlled transfection of mammalian cells
NASA Astrophysics Data System (ADS)
Hook, Andrew L.; Thissen, Helmut W.; Hayes, Jason P.; Voelcker, Nicolas H.
2005-02-01
The recent development of living microarrays as novel tools for the analysis of gene expression in an in-situ environment promises to unravel gene function within living organisms. In order to significantly enhance microarray performance, we are working towards electro-responsive DNA transfection chips. This study focuses on the control of DNA adsorption and desorption by appropriate surface modification of highly doped p++ silicon. Silicon was modified by plasma polymerisation of allylamine (ALAPP), a non-toxic surface that sustains cell growth. Subsequent high surface density grafting of poly(ethylene oxide) formed a layer resistant to biomolecule adsorption and cell attachment. Spatially controlled excimer laser ablation of the surface produced micron resolution patterns of re-exposed plasma polymer whilst the rest of the surface remained non-fouling. We observed electro-stimulated preferential adsorption of DNA to the ALAPP surface and subsequent desorption by the application of a negative bias. Cell culture experiments with HEK 293 cells demonstrated efficient and controlled transfection of cells using the expression of green fluorescent protein as a reporter. Thus, these chemically patterned surfaces are promising platforms for use as living microarrays.
SoFoCles: feature filtering for microarray classification based on gene ontology.
Papachristoudis, Georgios; Diplaris, Sotiris; Mitkas, Pericles A
2010-02-01
Marker gene selection has been an important research topic in the classification analysis of gene expression data. Current methods try to reduce the "curse of dimensionality" by using statistical intra-feature set calculations, or classifiers that are based on the given dataset. In this paper, we present SoFoCles, an interactive tool that enables semantic feature filtering in microarray classification problems with the use of external, well-defined knowledge retrieved from the Gene Ontology. The notion of semantic similarity is used to derive genes that are involved in the same biological path during the microarray experiment, by enriching a feature set that has been initially produced with legacy methods. Among its other functionalities, SoFoCles offers a large repository of semantic similarity methods that are used in order to derive feature sets and marker genes. The structure and functionality of the tool are discussed in detail, as well as its ability to improve classification accuracy. Through experimental evaluation, SoFoCles is shown to outperform other classification schemes in terms of classification accuracy in two real datasets using different semantic similarity computation approaches.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Katika, Madhumohan R.; Department of Health Risk Analysis and Toxicology, Maastricht University; Netherlands Toxicogenomics Centre
Deoxynivalenol (DON) or vomitoxin is a commonly encountered type-B trichothecene mycotoxin, produced by Fusarium species predominantly found in cereals and grains. DON is known to exert toxic effects on the gastrointestinal, reproductive and neuroendocrine systems, and particularly on the immune system. Depending on dose and exposure time, it can either stimulate or suppress immune function. The main objective of this study was to obtain a deeper insight into DON-induced effects on lymphoid cells. For this, we exposed the human T-lymphocyte cell line Jurkat and human peripheral blood mononuclear cells (PBMCs) to various concentrations of DON for various times and examined gene expression changes by DNA microarray analysis. Jurkat cells were exposed to 0.25 and 0.5 μM DON for 3, 6 and 24 h. Biological interpretation of the microarray data indicated that DON affects various processes in these cells: It upregulates genes involved in ribosome structure and function, RNA/protein synthesis and processing, endoplasmic reticulum (ER) stress, calcium-mediated signaling, mitochondrial function, oxidative stress, the NFAT and NF-κB/TNF-α pathways, T cell activation and apoptosis. The effects of DON on the expression of genes involved in ER stress, NFAT activation and apoptosis were confirmed by qRT-PCR. Other biochemical experiments confirmed that DON activates calcium-dependent proteins such as calcineurin and M-calpain that are known to be involved in T cell activation and apoptosis. Induction of T cell activation was also confirmed by demonstrating that DON activates NFATC1 and induces its translocation from the cytoplasm to the nucleus. For the gene expression profiling of PBMCs, cells were exposed to 2 and 4 μM DON for 6 and 24 h. Comparison of the Jurkat microarray data with those obtained with PBMCs showed that most of the processes affected by DON in the Jurkat cell line were also affected in the PBMCs.
-- Highlights: ► The human T cell line Jurkat and human PBMCs were exposed to DON. ► Whole-genome microarray experiments were performed. ► Microarray data indicates that DON affects ribosome and RNA/protein synthesis. ► DON treatment induces ER stress, calcium mediated signaling, NFAT and NF-κB. ► Exposure to DON induces T cell activation, oxidative stress and apoptosis.
NASA Astrophysics Data System (ADS)
Liu, Yingshuai; Li, Xuelian; Bao, Shujuan; Lu, Zhisong; Li, Qing; Li, Chang Ming
2013-05-01
Superparamagnetic iron oxide nanoparticles (SPIONs) (about 15 nm) were synthesized via a hydrothermal method and characterized by field emission scanning electron microscopy, transmission electron microscopy, dynamic light scattering, x-ray diffraction, and vibrating sample magnetometry. The molecular pathways of SPION-induced nanotoxicity were further investigated by protein microarrays on a plastic substrate from evaluation of cell viability, reactive oxygen species (ROS) generation and cell apoptosis. The experimental results reveal that SPION levels of 50 μg ml⁻¹ or higher cause significant loss of cell viability, considerable generation of ROS and cell apoptosis. It is proposed that high levels of SPIONs could induce cell apoptosis via a mitochondria-mediated intrinsic pathway by activation of caspase 9 and caspase 3, an increase of the Bax/Bcl-2 ratio, and down-regulation of HSP70 and HSP90 survivor factors.
Pro-oncogene Pokemon promotes breast cancer progression by upregulating survivin expression.
Zu, Xuyu; Ma, Jun; Liu, Hongxia; Liu, Feng; Tan, Chunyan; Yu, Lingling; Wang, Jue; Xie, Zhenhua; Cao, Deliang; Jiang, Yuyang
2011-03-10
Pokemon is an oncogenic transcription factor involved in cell growth, differentiation and oncogenesis, but little is known about its role in human breast cancer. In this study, we aimed to reveal the role of Pokemon in breast cancer progression and patient survival and to understand its underlying mechanisms. Tissue microarray analysis of breast cancer tissues from patients with complete clinicopathological data and more than 20 years of follow-up was used to evaluate Pokemon expression and its correlation with the progression and prognosis of the disease. DNA microarray analysis of MCF-7 cells that overexpress Pokemon was used to identify Pokemon target genes. Chromatin immunoprecipitation (ChIP) and site-directed mutagenesis were utilized to determine how Pokemon regulates the expression of survivin, a target gene. Pokemon was found to be overexpressed in 158 (86.8%) of 182 breast cancer tissues, and its expression was correlated with tumor size (P = 0.0148) and lymph node metastasis (P = 0.0014). Pokemon expression led to worse overall (n = 175, P = 0.01) and disease-related (n = 79, P = 0.0134) patient survival. DNA microarray analyses revealed that in MCF-7 breast cancer cells, Pokemon regulates the expression of at least 121 genes involved in several signaling and metabolic pathways, including anti-apoptotic survivin. In clinical specimens, Pokemon and survivin expression were highly correlated (n = 49, r = 0.6799, P < 0.0001). ChIP and site-directed mutagenesis indicated that Pokemon induces survivin expression by binding to the GT boxes in its promoter. Pokemon promotes breast cancer progression by upregulating survivin expression and thus may be a potential target for the treatment of this malignancy.
Pro-oncogene Pokemon promotes breast cancer progression by upregulating survivin expression
2011-01-01
Introduction Pokemon is an oncogenic transcription factor involved in cell growth, differentiation and oncogenesis, but little is known about its role in human breast cancer. In this study, we aimed to reveal the role of Pokemon in breast cancer progression and patient survival and to understand its underlying mechanisms. Methods Tissue microarray analysis of breast cancer tissues from patients with complete clinicopathological data and more than 20 years of follow-up was used to evaluate Pokemon expression and its correlation with the progression and prognosis of the disease. DNA microarray analysis of MCF-7 cells that overexpress Pokemon was used to identify Pokemon target genes. Chromatin immunoprecipitation (ChIP) and site-directed mutagenesis were utilized to determine how Pokemon regulates the expression of survivin, a target gene. Results Pokemon was found to be overexpressed in 158 (86.8%) of 182 breast cancer tissues, and its expression was correlated with tumor size (P = 0.0148) and lymph node metastasis (P = 0.0014). Pokemon expression led to worse overall (n = 175, P = 0.01) and disease-related (n = 79, P = 0.0134) patient survival. DNA microarray analyses revealed that in MCF-7 breast cancer cells, Pokemon regulates the expression of at least 121 genes involved in several signaling and metabolic pathways, including anti-apoptotic survivin. In clinical specimens, Pokemon and survivin expression were highly correlated (n = 49, r = 0.6799, P < 0.0001). ChIP and site-directed mutagenesis indicated that Pokemon induces survivin expression by binding to the GT boxes in its promoter. Conclusions Pokemon promotes breast cancer progression by upregulating survivin expression and thus may be a potential target for the treatment of this malignancy. PMID:21392388
Adan, Aysun; Baran, Yusuf
2015-11-01
Fisetin and hesperetin, flavonoids from various plants, have several pharmaceutical activities including antioxidative, anti-inflammatory, and anticancer effects. However, studies elucidating the role and the mechanism(s) of action of fisetin and hesperetin in acute promyelocytic leukemia are absent. In this study, we investigated the mechanism of the antiproliferative and apoptotic actions exerted by fisetin and hesperetin on human HL60 acute promyelocytic leukemia cells. We evaluated the viability of HL60 cells using the MTT assay, apoptosis by annexin V/propidium iodide (PI) staining, cell cycle distribution using flow cytometry, and changes in caspase-3 enzyme activity and mitochondrial transmembrane potential. Moreover, we performed whole-genome microarray gene expression analysis to reveal genes affected by fisetin and hesperetin that can be important for developing future targeted therapies. Based on data obtained from microarray analysis, we also described biological networks modulated after fisetin and hesperetin treatment by KEGG and IPA analysis. Fisetin and hesperetin treatment showed a concentration- and time-dependent inhibition of proliferation and induced G2/M arrest for both agents and G0/G1 arrest for hesperetin at only the highest concentrations. There was a disruption of mitochondrial membrane potential together with increased caspase-3 activity. Furthermore, fisetin- and hesperetin-triggered apoptosis was confirmed by annexin V/PI analysis. The microarray gene profiling analysis revealed some important biological pathways altered by fisetin and hesperetin treatment, including mitogen-activated protein kinase (MAPK) and inhibitor of DNA binding (ID) signaling pathways, and yielded a list of genes modulated ≥2-fold that are involved in cell proliferation, cell division, and apoptosis. Altogether, the data suggested that fisetin and hesperetin have anticancer properties and deserve further investigation.
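The "modulated ≥2-fold" criterion mentioned in the abstract above can be sketched as a simple filter on log2 fold change. This is only an illustrative outline under assumed inputs (linear-scale expression values keyed by gene); the gene labels, values, and function name are hypothetical and do not come from the study.

```python
import math

def fold_change_hits(treated, control, min_fold=2.0):
    """Return {gene: log2 fold change} for genes changed at least
    min_fold in either direction (up- or down-regulated)."""
    threshold = math.log2(min_fold)
    hits = {}
    for gene, t in treated.items():
        lfc = math.log2(t / control[gene])
        if abs(lfc) >= threshold:
            hits[gene] = lfc
    return hits

# Hypothetical linear-scale expression values
control = {"geneA": 100.0, "geneB": 200.0, "geneC": 50.0}
treated = {"geneA": 400.0, "geneB": 210.0, "geneC": 12.5}
hits = fold_change_hits(treated, control)
```

Working on the log2 scale treats a fourfold increase and a fourfold decrease symmetrically (+2 and -2), which is why the threshold is applied to the absolute value.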
High Frequency of Copy-Neutral Loss of Heterozygosity in Patients with Myelofibrosis.
Rego de Paula Junior, Milton; Nonino, Alexandre; Minuncio Nascimento, Juliana; Bonadio, Raphael S; Pic-Taylor, Aline; de Oliveira, Silviene F; Wellerson Pereira, Rinaldo; do Couto Mascarenhas, Cintia; Forte Mazzeu, Juliana
2018-01-01
Myelofibrosis is the rarest and most severe type of Philadelphia-negative classical myeloproliferative neoplasms. Although mutually exclusive driver mutations in JAK2, MPL, or CALR that activate JAK-STAT pathway have been related to the pathogenesis of the disease, chromosome abnormalities have also been associated with the phenotype and prognosis of the disease. Here, we report the use of a chromosomal microarray platform consisting of both oligo and SNP probes to improve the detection of chromosome abnormalities in patients with myelofibrosis. Sixteen patients with myelofibrosis were tested, and the results were compared to karyotype analysis. Driver mutations in JAK2, MPL, or CALR were investigated by PCR and MLPA. Conventional cytogenetics revealed chromosome abnormalities in 3 out of 16 cases (18.7%), while chromosomal microarray analysis detected copy-number variations (CNV) or copy-neutral loss of heterozygosity (CN-LOH) alterations in 11 out of 16 (68.7%) patients. These included 43 CN-LOH, 14 deletions, 1 trisomy, and 1 duplication. Ten patients showed multiple chromosomal abnormalities, varying from 2 to 13 CNVs or CN-LOHs. Mutational status for JAK2, CALR, and MPL by MLPA revealed a total of 3/16 (18.7%) patients positive for the JAK2 V617F mutation, 9 with CALR deletion or insertion and 1 positive for MPL mutation. Considering that most of the CNVs identified were smaller than the karyotype resolution and the high frequency of CN-LOHs in our study, we propose that chromosomal microarray platforms that combine oligos and SNP should be used as a first-tier genetic test in patients with myelofibrosis. © 2018 S. Karger AG, Basel.
caCORRECT2: Improving the accuracy and reliability of microarray data in the presence of artifacts
2011-01-01
Background In previous work, we reported the development of caCORRECT, a novel microarray quality control system built to identify and correct spatial artifacts commonly found on Affymetrix arrays. We have made recent improvements to caCORRECT, including the development of a model-based data-replacement strategy and integration with typical microarray workflows via caCORRECT's web portal and caBIG grid services. In this report, we demonstrate that caCORRECT improves the reproducibility and reliability of experimental results across several common Affymetrix microarray platforms. caCORRECT represents an advance over state-of-art quality control methods such as Harshlighting, and acts to improve gene expression calculation techniques such as PLIER, RMA and MAS5.0, because it incorporates spatial information into outlier detection as well as outlier information into probe normalization. The ability of caCORRECT to recover accurate gene expressions from low quality probe intensity data is assessed using a combination of real and synthetic artifacts with PCR follow-up confirmation and the affycomp spike in data. The caCORRECT tool can be accessed at the website: http://cacorrect.bme.gatech.edu. Results We demonstrate that (1) caCORRECT's artifact-aware normalization avoids the undesirable global data warping that happens when any damaged chips are processed without caCORRECT; (2) When used upstream of RMA, PLIER, or MAS5.0, the data imputation of caCORRECT generally improves the accuracy of microarray gene expression in the presence of artifacts more than using Harshlighting or not using any quality control; (3) Biomarkers selected from artifactual microarray data which have undergone the quality control procedures of caCORRECT are more likely to be reliable, as shown by both spike in and PCR validation experiments. 
Finally, we present a case study of the use of caCORRECT to reliably identify biomarkers for renal cell carcinoma, yielding two diagnostic biomarkers with potential clinical utility, PRKAB1 and NNMT. Conclusions caCORRECT is shown to improve the accuracy of gene expression, and the reproducibility of experimental results in clinical application. This study suggests that caCORRECT will be useful to clean up possible artifacts in new as well as archived microarray data. PMID:21957981
2012-01-01
Background DNA microarrays are used both for research and for diagnostics. In research, Affymetrix arrays are commonly used for genome wide association studies, resequencing, and for gene expression analysis. These arrays provide large amounts of data. This data is analyzed using statistical methods that quite often discard a large portion of the information. Most of the information that is lost comes from probes that systematically fail across chips and from batch effects. The aim of this study was to develop a comprehensive model for hybridization that predicts probe intensities for Affymetrix arrays and that could provide a basis for improved microarray analysis and probe development. The first part of the model calculates probe binding affinities to all the possible targets in the hybridization solution using the Langmuir isotherm. In the second part of the model we integrate details that are specific to each experiment and contribute to the differences between hybridization in solution and on the microarray. These details include fragmentation, wash stringency, temperature, salt concentration, and scanner settings. Furthermore, the model fits probe synthesis efficiency and target concentration parameters directly to the data. All the parameters used in the model have a well-established physical origin. Results For the 302 chips that were analyzed the mean correlation between expected and observed probe intensities was 0.701 with a range of 0.88 to 0.55. All available chips were included in the analysis regardless of the data quality. Our results show that batch effects arise from differences in probe synthesis, scanner settings, wash strength, and target fragmentation. We also show that probe synthesis efficiencies for different nucleotides are not uniform. Conclusions To date this is the most complete model for binding on microarrays. This is the first model that includes both probe synthesis efficiency and hybridization kinetics/cross-hybridization. 
These two factors are sequence dependent and have a large impact on probe intensity. The results presented here provide novel insight into the effect of probe synthesis errors on Affymetrix microarrays; furthermore, the algorithms developed in this work provide useful tools for the analysis of cross-hybridization, probe synthesis efficiency, fragmentation, wash stringency, temperature, and salt concentration on microarray intensities. PMID:23270536
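The Langmuir isotherm named in the abstract above relates equilibrium probe occupancy to target concentration c and binding affinity K as θ = Kc / (1 + Kc). The sketch below shows only that textbook relation; the intensity function, with its `saturation` and `background` parameters, is a hypothetical simplification and does not reproduce the experiment-specific terms (fragmentation, wash stringency, scanner settings) that the authors describe.

```python
def langmuir_occupancy(concentration, affinity):
    """Fraction of probe sites bound at equilibrium: K*c / (1 + K*c)."""
    kc = affinity * concentration
    return kc / (1.0 + kc)

def expected_intensity(concentration, affinity, saturation=1.0, background=0.0):
    """Hypothetical signal model: background plus occupancy scaled by
    the saturation intensity of the probe."""
    return background + saturation * langmuir_occupancy(concentration, affinity)

# Occupancy is 0 with no target, one half when K*c = 1, and approaches 1 at saturation
half = langmuir_occupancy(1.0, 1.0)
signal = expected_intensity(1.0, 1.0, saturation=1000.0, background=50.0)
```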
Loch, Christian M; Strickler, James E
2012-11-01
Substrate ubiquitylation is a reversible process critical to cellular homeostasis that is often dysregulated in many human pathologies including cancer and neurodegeneration. Elucidating the mechanistic details of this pathway could unlock a large store of information useful to the design of diagnostic and therapeutic interventions. Proteomic approaches to the questions at hand have generally utilized mass spectrometry (MS), which has been successful in identifying both ubiquitylation substrates and profiling pan-cellular chain linkages, but is generally unable to connect the two. Interacting partners of the deubiquitylating enzymes (DUBs) have also been reported by MS, although substrates of catalytically competent DUBs generally cannot be. Where they have been used towards the study of ubiquitylation, protein microarrays have usually functioned as platforms for the identification of substrates for specific E3 ubiquitin ligases. Here, we report on the first use of protein microarrays to identify substrates of DUBs, and in so doing demonstrate the first example of microarray proteomics involving multiple (i.e., distinct, sequential and opposing) enzymatic activities. This technique demonstrates the selectivity of DUBs for both substrate and type (mono- versus poly-) of ubiquitylation. This work shows that the vast majority of DUBs are monoubiquitylated in vitro, and are incapable of removing this modification from themselves. This work also underscores the critical role of utilizing both ubiquitin chains and substrates when attempting to characterize DUBs. This article is part of a Special Issue entitled: Ubiquitin Drug Discovery and Diagnostics. Copyright © 2012 Elsevier B.V. All rights reserved.
Tiwari, Jagesh Kumar; Devi, Sapna; Sundaresha, S; Chandel, Poonam; Ali, Nilofer; Singh, Brajesh; Bhardwaj, Vinay; Singh, Bir Pal
2015-06-01
Genes involved in photoassimilate partitioning and changes in hormonal balance are important for potato tuberization. In the present study, we investigated gene expression patterns in the tuber-bearing potato somatic hybrid (E1-3) and control non-tuberous wild species Solanum etuberosum (Etb) by microarray. Plants were grown under controlled conditions and leaves were collected at eight tuber developmental stages for microarray analysis. A t-test analysis identified a total of 468 genes (94 up-regulated and 374 down-regulated) that were statistically significant (p ≤ 0.05) and differentially expressed in E1-3 and Etb. Gene Ontology (GO) characterization of the 468 genes revealed that 145 were annotated and 323 were of unknown function. Further, these 145 genes were grouped based on GO biological processes followed by molecular function and (or) PGSC description into 15 gene sets, namely (1) transport, (2) metabolic process, (3) biological process, (4) photosynthesis, (5) oxidation-reduction, (6) transcription, (7) translation, (8) binding, (9) protein phosphorylation, (10) protein folding, (11) ubiquitin-dependent protein catabolic process, (12) RNA processing, (13) negative regulation of protein, (14) methylation, and (15) mitosis. RT-PCR analysis of 10 selected highly significant genes (p ≤ 0.01) confirmed the microarray results. Overall, we show that candidate genes induced in leaves of E1-3 were implicated in tuberization processes such as transport, carbohydrate metabolism, phytohormones, and transcription/translation/binding functions. Hence, our results provide an insight into the candidate genes induced in leaf tissues during tuberization in E1-3.
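For readers unfamiliar with the per-gene t-test mentioned in the abstract above, the following sketch computes a two-sample t statistic for one gene. The abstract does not say which variant was used; Welch's unequal-variance form is assumed here purely for illustration, and the sample values are invented. Converting the statistic to a p-value requires the t distribution, which is left to a statistics library.

```python
import math
from statistics import mean, variance

def welch_t(x, y):
    """Welch's t statistic and Welch-Satterthwaite degrees of freedom
    for two independent samples of expression values."""
    vx = variance(x) / len(x)  # squared standard error of mean(x)
    vy = variance(y) / len(y)
    t = (mean(x) - mean(y)) / math.sqrt(vx + vy)
    df = (vx + vy) ** 2 / (vx ** 2 / (len(x) - 1) + vy ** 2 / (len(y) - 1))
    return t, df

# Invented log-expression values for one gene in two groups
t_stat, dof = welch_t([1.0, 2.0, 3.0], [4.0, 5.0, 6.0])
```

When thousands of genes are tested at once, a per-gene threshold such as p ≤ 0.05 is usually paired with a multiple-testing correction.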
Hess, Jonathan L.; Tylee, Daniel S.; Barve, Rahul; de Jong, Simone; Ophoff, Roel A.; Kumarasinghe, Nishantha; Tooney, Paul; Schall, Ulrich; Gardiner, Erin; Beveridge, Natalie Jane; Scott, Rodney J.; Yasawardene, Surangi; Perera, Antionette; Mendis, Jayan; Carr, Vaughan; Kelly, Brian; Cairns, Murray; Tsuang, Ming T.; Glatt, Stephen J.
2016-01-01
The application of microarray technology in schizophrenia research was heralded as paradigm-shifting, as it allowed for high-throughput assessment of cell and tissue function. This technology was widely adopted, initially in studies of postmortem brain tissue, and later in studies of peripheral blood. The collective body of schizophrenia microarray literature contains apparent inconsistencies between studies, with failures to replicate top hits, in part due to small sample sizes, cohort-specific effects, differences in array types, and other confounders. In an attempt to summarize existing studies of schizophrenia cases and non-related comparison subjects, we performed two mega-analyses of a combined set of microarray data from postmortem prefrontal cortices (n = 315) and from ex-vivo blood tissues (n = 578). We adjusted regression models per gene to remove non-significant covariates, providing best-estimates of transcripts dysregulated in schizophrenia. We also examined dysregulation of functionally related gene sets and gene co-expression modules, and assessed enrichment of cell types and genetic risk factors. The identities of the most significantly dysregulated genes were largely distinct for each tissue, but the findings indicated common emergent biological functions (e.g. immunity) and regulatory factors (e.g., predicted targets of transcription factors and miRNA species across tissues). Our network-based analyses converged upon similar patterns of heightened innate immune gene expression in both brain and blood in schizophrenia. We also constructed generalizable machine-learning classifiers using the blood-based microarray data. Our study provides an informative atlas for future pathophysiologic and biomarker studies of schizophrenia. PMID:27450777
Hess, Jonathan L; Tylee, Daniel S; Barve, Rahul; de Jong, Simone; Ophoff, Roel A; Kumarasinghe, Nishantha; Tooney, Paul; Schall, Ulrich; Gardiner, Erin; Beveridge, Natalie Jane; Scott, Rodney J; Yasawardene, Surangi; Perera, Antionette; Mendis, Jayan; Carr, Vaughan; Kelly, Brian; Cairns, Murray; Tsuang, Ming T; Glatt, Stephen J
2016-10-01
The application of microarray technology in schizophrenia research was heralded as paradigm-shifting, as it allowed for high-throughput assessment of cell and tissue function. This technology was widely adopted, initially in studies of postmortem brain tissue, and later in studies of peripheral blood. The collective body of schizophrenia microarray literature contains apparent inconsistencies between studies, with failures to replicate top hits, in part due to small sample sizes, cohort-specific effects, differences in array types, and other confounders. In an attempt to summarize existing studies of schizophrenia cases and non-related comparison subjects, we performed two mega-analyses of a combined set of microarray data from postmortem prefrontal cortices (n=315) and from ex-vivo blood tissues (n=578). We adjusted regression models per gene to remove non-significant covariates, providing best-estimates of transcripts dysregulated in schizophrenia. We also examined dysregulation of functionally related gene sets and gene co-expression modules, and assessed enrichment of cell types and genetic risk factors. The identities of the most significantly dysregulated genes were largely distinct for each tissue, but the findings indicated common emergent biological functions (e.g. immunity) and regulatory factors (e.g., predicted targets of transcription factors and miRNA species across tissues). Our network-based analyses converged upon similar patterns of heightened innate immune gene expression in both brain and blood in schizophrenia. We also constructed generalizable machine-learning classifiers using the blood-based microarray data. Our study provides an informative atlas for future pathophysiologic and biomarker studies of schizophrenia. Published by Elsevier B.V.
Yang, Mingxing; Li, Xiumin; Li, Zhibin; Ou, Zhimin; Liu, Ming; Liu, Suhuan; Li, Xuejun; Yang, Shuyu
2013-01-01
DNA microarray analysis is characterized by obtaining a large number of gene variables from a small number of observations. Cluster analysis is widely used to analyze DNA microarray data to make classification and diagnosis of disease. Because there are so many irrelevant and insignificant genes in a dataset, a feature selection approach must be employed in data analysis. The performance of cluster analysis of this high-throughput data depends on whether the feature selection approach chooses the most relevant genes associated with disease classes. Here we proposed a new method using multiple Orthogonal Partial Least Squares-Discriminant Analysis (mOPLS-DA) models and S-plots to select the most relevant genes to conduct three-class disease classification and prediction. We tested our method using Golub's leukemia microarray data. For three classes with subtypes, we proposed hierarchical orthogonal partial least squares-discriminant analysis (OPLS-DA) models and S-plots to select features for two main classes and their subtypes. For three classes in parallel, we employed three OPLS-DA models and S-plots to choose marker genes for each class. The power of feature selection to classify and predict three-class disease was evaluated using cluster analysis. Further, the general performance of our method was tested using four public datasets and compared with those of four other feature selection methods. The results revealed that our method effectively selected the most relevant features for disease classification and prediction, and its performance was better than that of the other methods.
Short, Michael D.; Abell, Guy C. J.; Bodrossy, Levente; van den Akker, Ben
2013-01-01
We report on the first study trialling a newly-developed, functional gene microarray (FGA) for characterising bacterial and archaeal ammonia oxidisers in activated sludge. Mixed liquor (ML) and media biofilm samples from a full-scale integrated fixed-film activated sludge (IFAS) plant were analysed with the FGA to profile the diversity and relative abundance of ammonia-oxidising archaea and bacteria (AOA and AOB respectively). FGA analyses of AOA and AOB communities revealed ubiquitous distribution of AOA across all samples – an important finding for these newly-discovered and poorly characterised organisms. Results also revealed striking differences in the functional ecology of attached versus suspended communities within the IFAS reactor. Quantitative assessment of AOB and AOA functional gene abundance revealed a dominance of AOB in the ML and approximately equal distribution of AOA and AOB in the media-attached biofilm. Subsequent correlations of functional gene abundance data with key water quality parameters suggested an important functional role for media-attached AOB in particular for IFAS reactor nitrification performance and indicate possible functional redundancy in some IFAS ammonia oxidiser communities. Results from this investigation demonstrate the capacity of the FGA to resolve subtle ecological shifts in key microbial communities in nitrifying activated sludge and indicate its value as a tool for better understanding the linkages between the ecology and performance of these engineered systems. PMID:24155925
2011-01-01
Background: Understanding the genetic elements that contribute to key aspects of coffee biology will have an impact on future agronomical improvements for this economically important tree. During the past years, EST collections were generated in coffee, opening the possibility to create new tools for functional genomics. Results: The "PUCE CAFE" Project, organized by the scientific consortium NESTLE/IRD/CIRAD, has developed an oligo-based microarray using 15,721 unigenes derived from published coffee EST sequences mostly obtained from different stages of fruit development and leaves in Coffea canephora (Robusta). Hybridizations for two independent experiments served to compare global gene expression profiles in three types of tissue (mature beans, leaves and flowers) in C. canephora as well as in the leaves of three different coffee species (C. canephora, C. eugenioides and C. arabica). Microarray construction, statistical analyses and validation by Q-PCR analysis are presented in this study. Conclusion: We have generated the first 15 K coffee array during this PUCE CAFE project, funded by Génoplante (the French consortium for plant genomics). This new tool will help study functional genomics in a wide range of experiments on various plant tissues, such as analyzing bean maturation or resistance to pathogens or drought. Furthermore, the use of this array has proven to be valid in different coffee species (diploid or tetraploid), drastically enlarging its impact for high-throughput gene expression in the community of coffee research. PMID:21208403
Privat, Isabelle; Bardil, Amélie; Gomez, Aureliano Bombarely; Severac, Dany; Dantec, Christelle; Fuentes, Ivanna; Mueller, Lukas; Joët, Thierry; Pot, David; Foucrier, Séverine; Dussert, Stéphane; Leroy, Thierry; Journot, Laurent; de Kochko, Alexandre; Campa, Claudine; Combes, Marie-Christine; Lashermes, Philippe; Bertrand, Benoit
2011-01-05
Understanding the genetic elements that contribute to key aspects of coffee biology will have an impact on future agronomical improvements for this economically important tree. During the past years, EST collections were generated in coffee, opening the possibility to create new tools for functional genomics. The "PUCE CAFE" Project, organized by the scientific consortium NESTLE/IRD/CIRAD, has developed an oligo-based microarray using 15,721 unigenes derived from published coffee EST sequences mostly obtained from different stages of fruit development and leaves in Coffea canephora (Robusta). Hybridizations for two independent experiments served to compare global gene expression profiles in three types of tissue (mature beans, leaves and flowers) in C. canephora as well as in the leaves of three different coffee species (C. canephora, C. eugenioides and C. arabica). Microarray construction, statistical analyses and validation by Q-PCR analysis are presented in this study. We have generated the first 15 K coffee array during this PUCE CAFE project, funded by Génoplante (the French consortium for plant genomics). This new tool will help study functional genomics in a wide range of experiments on various plant tissues, such as analyzing bean maturation or resistance to pathogens or drought. Furthermore, the use of this array has proven to be valid in different coffee species (diploid or tetraploid), drastically enlarging its impact for high-throughput gene expression in the community of coffee research.
2008 Microarray Research Group (MARG Survey): Sensing the State of Microarray Technology
Over the past several years, the field of microarrays has grown and evolved drastically. In its continued efforts to track this evolution and transformation, the ABRF-MARG has once again conducted a survey of international microarray facilities and individual microarray users. Th...
Talkowski, Michael E; Ernst, Carl; Heilbut, Adrian; Chiang, Colby; Hanscom, Carrie; Lindgren, Amelia; Kirby, Andrew; Liu, Shangtao; Muddukrishna, Bhavana; Ohsumi, Toshiro K; Shen, Yiping; Borowsky, Mark; Daly, Mark J; Morton, Cynthia C; Gusella, James F
2011-04-08
The contribution of balanced chromosomal rearrangements to complex disorders remains unclear because they are not detected routinely by genome-wide microarrays and clinical localization is imprecise. Failure to consider these events bypasses a potentially powerful complement to single nucleotide polymorphism and copy-number association approaches to complex disorders, where much of the heritability remains unexplained. To capitalize on this genetic resource, we have applied optimized sequencing and analysis strategies to test whether these potentially high-impact variants can be mapped at reasonable cost and throughput. By using a whole-genome multiplexing strategy, rearrangement breakpoints could be delineated at a fraction of the cost of standard sequencing. For rearrangements already mapped regionally by karyotyping and fluorescence in situ hybridization, a targeted approach enabled capture and sequencing of multiple breakpoints simultaneously. Importantly, this strategy permitted capture and unique alignment of up to 97% of repeat-masked sequences in the targeted regions. Genome-wide analyses estimate that only 3.7% of bases should be routinely omitted from genomic DNA capture experiments. Illustrating the power of these approaches, the rearrangement breakpoints were rapidly defined to base pair resolution and revealed unexpected sequence complexity, such as co-occurrence of inversion and translocation as an underlying feature of karyotypically balanced alterations. These findings have implications ranging from genome annotation to de novo assemblies and could enable sequencing screens for structural variations at a cost comparable to that of microarrays in standard clinical practice. Copyright © 2011 The American Society of Human Genetics. Published by Elsevier Inc. All rights reserved.
Characterization of microRNA profile in mammary tissue of dairy and beef breed heifers.
Wicik, Z; Gajewska, M; Majewska, A; Walkiewicz, D; Osińska, E; Motyl, T
2016-02-01
MicroRNAs (miRNAs) are small non-coding RNAs that participate in the regulation of gene expression. Their role during mammary gland development is still largely unknown. In this study, we performed a microarray analysis to identify miRNAs associated with high mammogenic potential of the bovine mammary gland. We identified 54 significantly differentially expressed miRNAs between the mammary tissue of dairy (Holstein-Friesian, HF) and beef (Limousin, LM) postpubertal heifers. Fifty-two miRNAs had higher expression in the mammary tissue of LM heifers. The expression of the top candidate miRNAs (bta-miR-10b, bta-miR-29b, bta-miR-101, bta-miR-375, bta-miR-2285t, bta-miR-146b, bta-let7b, bta-miR-107, bta-miR-1434-3p) identified in the microarray experiment was additionally evaluated by qPCR. Enrichment analyses for targeted genes revealed that the major differences between miRNA expression in the mammary gland of HF versus LM were associated with the regulation of signalling pathways that are crucial for mammary gland development, such as TGF-beta, insulin, WNT and inflammatory pathways. Moreover, a number of genes potentially targeted by significantly differentially expressed miRNAs were associated with the activity of mammary stem cells. These data indicate that the high developmental potential of the mammary gland in dairy cattle, leading to high milk productivity, depends also on a specific miRNA expression pattern. © 2015 Blackwell Verlag GmbH.
Ye, Yibiao; Chen, Jie; Zhou, Yu; Fu, Zhiqiang; Zhou, Quanbo; Wang, YingXue; Gao, Wenchao; Zheng, ShangYou; Zhao, Xiaohui; Chen, Tao; Chen, Rufu
2015-04-30
Pancreatic ductal adenocarcinoma (PDAC) is still a lethal malignancy. Long noncoding RNAs (lncRNAs) have been shown to play a critical role in cancer development and progression. Here we identified overexpression of the lncRNA AFAP1-AS1 in PDAC patients and evaluated its prognostic and functional relevance. The global lncRNA expression profile in PDAC was measured by lncRNA microarray. Expression of AFAP1-AS1 was evaluated by reverse-transcriptase quantitative polymerase chain reaction (RT-qPCR) in 90 PDAC tissue samples and adjacent normal tissues. The impact of AFAP1-AS1 expression on cell proliferation, migration, and invasion was evaluated in vitro using knockdown and ectopic expression strategies. Microarray analysis revealed up-regulation of AFAP1-AS1 expression in PDAC tissues compared with normal adjacent tissues, which was confirmed by RT-qPCR in 69/90 cases (76.7%). Its overexpression was associated with lymph node metastasis, perineural invasion, and poor survival. When using AFAP1-AS1 as a prognostic marker, the areas under ROC curves were 0.8669 and 0.9370 for predicting tumor progression within 6 months and 1 year, respectively. In vitro knockdown of AFAP1-AS1 attenuated PDAC cell proliferation, migration, and invasion, whereas ectopic expression of AFAP1-AS1 promoted them. AFAP1-AS1 is a potential novel prognostic marker to predict the clinical outcome of PDAC patients after surgery and may be a rational target for therapy.
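The area under the ROC curve reported above can be read as the probability that a randomly chosen patient who progressed has a higher marker level than a randomly chosen patient who did not. The following is a minimal sketch of that rank-based calculation; the expression values and the names `progressed` and `stable` are hypothetical and are not taken from the study.

```python
def roc_auc(scores_pos, scores_neg):
    """Rank-based AUC: share of (positive, negative) pairs in which the
    positive case scores higher; ties count as half a win."""
    wins = 0.0
    for p in scores_pos:
        for n in scores_neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(scores_pos) * len(scores_neg))


# Hypothetical relative expression levels, for illustration only.
progressed = [3.1, 2.4, 4.0, 1.9]   # patients with tumor progression
stable = [1.2, 2.0, 0.8, 2.4]       # patients without progression

auc = roc_auc(progressed, stable)
print(f"AUC = {auc:.5f}")
```

The pairwise loop is quadratic in the number of samples, which is acceptable for cohort sizes such as the 90 cases mentioned here; a sort-based version would be preferable for much larger data.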
Gastroesophageal reflux activates the NF-κB pathway and impairs esophageal barrier function in mice
Fang, Yu; Chen, Hao; Hu, Yuhui; Djukic, Zorka; Tevebaugh, Whitney; Shaheen, Nicholas J.; Orlando, Roy C.; Hu, Jianguo
2013-01-01
The barrier function of the esophageal epithelium is a major defense against gastroesophageal reflux disease. Previous studies have shown that reflux damage is reflected in a decrease in transepithelial electrical resistance associated with tight junction alterations in the esophageal epithelium. To develop novel therapies, it is critical to understand the molecular mechanisms whereby contact with a refluxate impairs esophageal barrier function. In this study, surgical models of duodenal and mixed reflux were developed in mice. Mouse esophageal epithelium was analyzed by gene microarray. Gene set enrichment analysis showed upregulation of inflammation-related gene sets and the NF-κB pathway due to reflux. Significance analysis of microarrays revealed upregulation of NF-κB target genes. Overexpression of NF-κB subunits (p50 and p65) and NF-κB target genes (matrix metalloproteinases-3 and -9, IL-1β, IL-6, and IL-8) confirmed activation of the NF-κB pathway in the esophageal epithelium. In addition, real-time PCR, Western blotting, and immunohistochemical staining also showed downregulation and mislocalization of claudins-1 and -4. In a second animal experiment, treatment with an NF-κB inhibitor, BAY 11-7085 (20 mg·kg−1·day−1 ip for 10 days), counteracted the effects of duodenal and mixed reflux on epithelial resistance and NF-κB-regulated cytokines. We conclude that gastroesophageal reflux activates the NF-κB pathway and impairs esophageal barrier function in mice and that targeting the NF-κB pathway may strengthen esophageal barrier function against reflux. PMID:23639809
Guerra-Laso, José M; Raposo-García, Sara; García-García, Silvia; Diez-Tascón, Cristina; Rivero-Lezcano, Octavio M
2015-02-01
Differences in the activity of monocytes/macrophages, important target cells of Mycobacterium tuberculosis, might influence tuberculosis progression. With the purpose of identifying candidate genes for tuberculosis susceptibility we infected monocytes from both healthy elderly individuals (a tuberculosis susceptibility group) and elderly tuberculosis patients with M. tuberculosis, and performed a microarray experiment. We detected 78 differentially expressed transcripts and confirmed these results by quantitative PCR of selected genes. We found that monocytes from tuberculosis patients showed similar expression patterns for these genes, regardless of whether they were obtained from younger or older patients. Only one of the detected genes corresponded to a cytokine: IL26, a member of the interleukin-10 (IL-10) cytokine family which we found to be down-regulated in infected monocytes from tuberculosis patients. Non-infected monocytes secreted IL-26 constitutively but they reacted strongly to M. tuberculosis infection by decreasing IL-26 production. Furthermore, IL-26 serum concentrations appeared to be lower in the tuberculosis patients. When whole blood was infected, IL-26 inhibited the observed pathogen-killing capability. Although lymphocytes expressed IL26R, the receptor mRNA was not detected in either monocytes or neutrophils, suggesting that the inhibition of anti-mycobacterial activity may be mediated by lymphocytes. Additionally, IL-2 concentrations in infected blood were lower in the presence of IL-26. The negative influence of IL-26 on the anti-mycobacterial activity and its constitutive presence in both serum and monocyte supernatants prompt us to propose IL26 as a candidate gene for tuberculosis susceptibility. © 2014 John Wiley & Sons Ltd.
Zhao, Yangyang; Qian, Guoliang; Chen, Yuan; Du, Liangcheng; Liu, Fengquan
2017-01-01
Lysobacter enzymogenes is a ubiquitous, beneficial, plant-associated bacterium emerging as a novel biological control agent. It has the potential to become a new source of antimicrobial secondary metabolites such as the Heat-Stable Antifungal Factor (HSAF), which is a broad-spectrum antimycotic with a novel mode of action. However, very little information about how L. enzymogenes detects and responds to fungi or oomycetes has been reported. An in vitro confrontation bioassay between the pathogenic oomycete Pythium aphanidermatum and the biocontrol bacterial strain L. enzymogenes OH11 was used to analyze the transcriptional changes in the bacteria that were induced by the oomycetes. Analysis was performed at three time points of the interaction, starting before inhibition zone formation until inhibition zone formation. A L. enzymogenes OH11 DNA microarray was constructed for the analysis. Microarray analysis indicated that a wide range of genes belonging to 14 diverse functions in L. enzymogenes were affected by P. aphanidermatum as critical antagonistic effects occurred. L. enzymogenes detected and responded to the presence of P. aphanidermatum early, but alteration of gene expression typically occurred after inhibition zone formation. The presence of P. aphanidermatum increased the twitching motility and HSAF production in L. enzymogenes. We also performed a contact interaction between L. enzymogenes and P. aphanidermatum, and found that HSAF played a critical role in the interaction. Our experiments demonstrated that L. enzymogenes displayed transcriptional and antagonistic responses to P. aphanidermatum in order to gain advantages in the competition with this oomycete. This study revealed new insights into the interactions between bacteria and oomycete. PMID:28634478
THE ABRF-MARG MICROARRAY SURVEY 2004: TAKING THE PULSE OF THE MICROARRAY FIELD
Over the past several years, the field of microarrays has grown and evolved drastically. In its continued efforts to track this evolution, the ABRF-MARG has once again conducted a survey of international microarray facilities and individual microarray users. The goal of the surve...
Contributions to Statistical Problems Related to Microarray Data
ERIC Educational Resources Information Center
Hong, Feng
2009-01-01
Microarrays are a high-throughput technology for measuring gene expression. Analysis of microarray data brings many interesting and challenging problems. This thesis consists of three studies related to microarray data. First, we propose a Bayesian model for microarray data and use Bayes Factors to identify differentially expressed genes. Second, we…
NASA Astrophysics Data System (ADS)
Bogdanov, Valery L.; Boyce-Jacino, Michael
1999-05-01
Confined arrays of biochemical probes deposited on a solid support surface (analytical microarray or 'chip') provide an opportunity to analyze multiple reactions simultaneously. Microarrays are increasingly used in genetics, medicine and environmental scanning as research and analytical instruments. The power of microarray technology comes from its parallelism, which grows with array miniaturization, minimization of reagent volume per reaction site and reaction multiplexing. An optical detector of microarray signals should combine high sensitivity with spatial and spectral resolution. Additionally, low cost and a high processing rate are needed to transfer microarray technology into biomedical practice. We designed an imager that provides confocal and complete-spectrum detection of an entire fluorescently labeled microarray in parallel. The imager uses a microlens array, a non-slit spectral decomposer, and a high-sensitivity detector (cooled CCD). Two imaging channels provide simultaneous detection of localization, integrated and spectral intensities for each reaction site in the microarray. Dimensional matching between the microarray and the imager's optics eliminates all moving parts in the instrumentation, enabling highly informative, fast and low-cost microarray detection. We report the theory of confocal hyperspectral imaging with a microlens array and experimental data from using the developed imager to detect a fluorescently labeled microarray with a density of approximately 10^3 sites per cm^2.
An efficient ensemble learning method for gene microarray classification.
Osareh, Alireza; Shadgar, Bita
2013-01-01
Gene microarray analysis and classification have proven to be an effective approach for the diagnosis of diseases and cancers. However, it has also been revealed that the basic classification techniques have intrinsic drawbacks in achieving accurate gene classification and cancer diagnosis. On the other hand, classifier ensembles have received increasing attention in various applications. Here, we address the gene classification issue using the RotBoost ensemble methodology. This method is a combination of the Rotation Forest and AdaBoost techniques, which in turn preserves both desirable features of an ensemble architecture, that is, accuracy and diversity. To select a concise subset of informative genes, 5 different feature selection algorithms are considered. To assess the efficiency of RotBoost, other nonensemble/ensemble techniques including Decision Trees, Support Vector Machines, Rotation Forest, AdaBoost, and Bagging are also deployed. Experimental results have revealed that the combination of the fast correlation-based feature selection method with the ICA-based RotBoost ensemble is highly effective for gene classification. In fact, the proposed method can create ensemble classifiers which outperform not only the classifiers produced by conventional machine learning but also the classifiers generated by two widely used conventional ensemble learning methods, that is, Bagging and AdaBoost.
The genome-wide expression profile of Curcuma longa-treated cisplatin-stimulated HEK293 cells
Sohn, Sung-Hwa; Ko, Eunjung; Chung, Hwan-Suck; Lee, Eun-Young; Kim, Sung-Hoon; Shin, Minkyu; Hong, Moochang; Bae, Hyunsu
2010-01-01
AIM: The rhizome of turmeric, Curcuma longa (CL), is a herbal medicine used in many traditional prescriptions. It has previously been shown that CL treatment produced greater than 47% recovery from cisplatin-induced cell damage in human kidney HEK 293 cells. This study was conducted to evaluate the recovery mechanisms of CL that occur during cisplatin-induced nephrotoxicity by examining the genome-wide mRNA expression profiles of HEK 293 cells. METHOD: Recovery mechanisms of CL that occur during cisplatin-induced nephrotoxicity were determined by microarray, real-time PCR, immunofluorescent confocal microscopy and Western blot analysis. RESULTS: The results of microarray analysis and real-time PCR revealed that NFκB pathway-related genes and apoptosis-related genes were down-regulated in CL-treated HEK 293 cells. In addition, immunofluorescent confocal microscopy and Western blot analysis revealed that NFκB p65 nuclear translocation was inhibited in CL-treated HEK 293 cells. Therefore, the mechanism responsible for the effects of CL on HEK 293 cells is closely associated with regulation of the NFκB pathway. CONCLUSION: CL possesses novel therapeutic agents that can be used for the prevention or treatment of cisplatin-induced renal disorders. PMID:20840446
NASA Astrophysics Data System (ADS)
Reicher, Naama; Segev, Lior; Rudich, Yinon
2018-01-01
The WeIzmann Supercooled Droplets Observation on Microarray (WISDOM) is a new setup for studying ice nucleation in an array of monodisperse droplets for atmospheric implications. WISDOM combines microfluidics techniques for droplet production and a cryo-optic stage for observation and characterization of freezing events of individual droplets. This setup is designed to explore heterogeneous ice nucleation in the immersion freezing mode, down to the homogeneous freezing of water (235 K) at various cooling rates (typically 0.1-10 K min^-1). It can also be used for studying homogeneous freezing of aqueous solutions at colder temperatures. Frozen fraction, ice nucleation active surface site densities and freezing kinetics can be obtained from WISDOM measurements for hundreds of individual droplets in a single freezing experiment. Calibration experiments using eutectic solutions and previously studied materials are described. WISDOM also allows repeatable cycles of cooling and heating for the same array of droplets. This paper describes the WISDOM setup, its temperature calibration, validation experiments and measurement uncertainties. Finally, application of WISDOM to study the ice nucleating particle (INP) properties of size-selected ambient Saharan dust particles is presented.
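The frozen fraction and the ice nucleation active surface site density mentioned above are commonly related by n_s(T) = -ln(1 - f_ice(T)) / A, where A is the particle surface area immersed in each droplet. The following is a minimal sketch of both quantities, assuming each droplet is summarized by the temperature at which it froze; the droplet temperatures and the surface area are hypothetical values, not WISDOM measurements.

```python
import math


def frozen_fraction(freezing_temps_k, temperature_k):
    """Fraction of droplets that have frozen once the array has been
    cooled down to temperature_k (droplets freezing at or above it)."""
    frozen = sum(1 for t in freezing_temps_k if t >= temperature_k)
    return frozen / len(freezing_temps_k)


def active_site_density(fraction, surface_area_per_droplet):
    """n_s(T) = -ln(1 - f_ice(T)) / A, in sites per unit surface area."""
    if fraction >= 1.0:
        raise ValueError("n_s is undefined once every droplet has frozen")
    return -math.log(1.0 - fraction) / surface_area_per_droplet


# Hypothetical freezing temperatures (K) of eight droplets.
temps = [252.1, 250.4, 249.8, 248.0, 247.5, 245.2, 243.9, 240.3]
area_cm2 = 1.0e-6  # assumed particle surface area per droplet, in cm^2

f_ice = frozen_fraction(temps, 248.0)
n_s = active_site_density(f_ice, area_cm2)
print(f"f_ice(248 K) = {f_ice:.2f}, n_s = {n_s:.3e} cm^-2")
```

Evaluating the two functions over a grid of temperatures would give the frozen-fraction curve and the n_s(T) spectrum for one cooling run.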
Wang, Yun; Huang, Fangzhou
2018-01-01
The selection of feature genes with high recognition ability from the gene expression profiles has gained great significance in biology. However, most of the existing methods have a high time complexity and poor classification performance. Motivated by this, an effective feature selection method, called supervised locally linear embedding and Spearman's rank correlation coefficient (SLLE-SC2), is proposed which is based on the concept of locally linear embedding and correlation coefficient algorithms. Supervised locally linear embedding takes into account class label information and improves the classification performance. Furthermore, Spearman's rank correlation coefficient is used to remove the coexpression genes. The experiment results obtained on four public tumor microarray datasets illustrate that our method is valid and feasible. PMID:29666661
Xu, Jiucheng; Mu, Huiyu; Wang, Yun; Huang, Fangzhou
2018-01-01
The selection of feature genes with high recognition ability from the gene expression profiles has gained great significance in biology. However, most of the existing methods have a high time complexity and poor classification performance. Motivated by this, an effective feature selection method, called supervised locally linear embedding and Spearman's rank correlation coefficient (SLLE-SC2), is proposed which is based on the concept of locally linear embedding and correlation coefficient algorithms. Supervised locally linear embedding takes into account class label information and improves the classification performance. Furthermore, Spearman's rank correlation coefficient is used to remove the coexpression genes. The experiment results obtained on four public tumor microarray datasets illustrate that our method is valid and feasible.
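The idea of using Spearman's rank correlation coefficient to remove coexpressed genes can be illustrated with a small sketch. This is not the SLLE-SC2 procedure itself: the greedy filter, the 0.9 threshold, the function name `drop_coexpressed`, and the expression values are all assumptions made for illustration, and the genes are assumed to arrive already ordered by relevance.

```python
def ranks(values):
    """1-based ranks; tied values share the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            result[order[k]] = mean_rank
        i = j + 1
    return result


def pearson(x, y):
    """Pearson correlation; assumes neither profile is constant."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    var_x = sum((a - mx) ** 2 for a in x)
    var_y = sum((b - my) ** 2 for b in y)
    return cov / (var_x * var_y) ** 0.5


def spearman(x, y):
    """Spearman's rho is the Pearson correlation of the ranks."""
    return pearson(ranks(x), ranks(y))


def drop_coexpressed(genes, threshold=0.9):
    """Greedy filter: keep a gene only if |rho| with every gene kept so far
    stays below the threshold. `genes` maps name -> expression profile and
    is assumed to be ordered from most to least relevant."""
    kept = []
    for name, profile in genes.items():
        if all(abs(spearman(profile, genes[k])) < threshold for k in kept):
            kept.append(name)
    return kept


# Hypothetical expression profiles across five samples.
expression = {
    "geneA": [1.0, 2.0, 3.0, 4.0, 5.0],
    "geneB": [2.1, 3.9, 6.2, 8.0, 9.7],  # rises monotonically with geneA
    "geneC": [3.0, 1.0, 4.0, 1.5, 2.0],
}
selected = drop_coexpressed(expression)
print(selected)
```

Because the correlation is computed on ranks, geneB is treated as redundant with geneA even though the two are not linearly identical.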
DOE Office of Scientific and Technical Information (OSTI.GOV)
Katoh, Hironori; Fujita, Keiko; Takuhara, Yuki
2011-02-18
Highlights: VIGG is an ER stress-induced protein in plants. We examine the characteristics of VIGG-overexpressing Arabidopsis plants. VIGG-overexpressing plants reveal growth retardation and robustness to ER stress. VIGG disturbs cation homeostasis in plants. Abstract: VIGG is a putative endoplasmic reticulum (ER) resident protein induced by virus infection and ER stress, and is correlated with fruit quality in grapevine. The present study was undertaken to determine the biological function of VIGG in grapevine. Experiments using fluorescent protein-VIGG fusion protein demonstrated that VIGG is localized in the ER and that the ER targeting sequence is in the N-terminus. The overexpression of VIGG in Arabidopsis plants led to growth retardation. The rosette leaves of VIGG-overexpressing plants were smaller than those of the control plants and rolled at 42 days after seeding. VIGG-overexpressing plants revealed robustness to ER stress as well as the low expression of ER stress marker proteins, such as the luminal binding proteins. These characteristics of VIGG-overexpressing plants were supported by a microarray experiment that demonstrated the disruption of genes related to ER stress response and flowering, as well as cation mobility, in the plants. Finally, cation homeostasis in the plants was disturbed by the overexpression of VIGG. Taken together, these results suggest that VIGG may disturb cation homeostasis in plants, which is correlated with the robustness to ER stress and growth retardation.
Nishimura, Jihei; Dewa, Yasuaki; Muguruma, Masako; Kuroiwa, Yuichi; Yasuno, Hiroaki; Shima, Tomomi; Jin, Mailan; Takahashi, Miwa; Umemura, Takashi; Mitsumori, Kunitoshi
2007-05-01
To investigate the relationship between fenofibrate (FF) and oxidative stress, enzymatic, histopathological, and molecular biological analyses were performed in the liver of male F344 rats fed 2 doses of FF (Experiment 1; 0 and 6000 ppm) for 3 weeks and 3 doses (Experiment 2; 0, 3000, and 6000 ppm) for 9 weeks. FF treatment increased the activity of enzymes such as carnitine acetyltransferase, carnitine palmitoyltransferase, fatty acyl-CoA oxidizing system, and catalase in the liver. However, it decreased those of superoxide dismutase in the liver in both experiments. Increased 8-hydroxy-2'-deoxyguanosine levels in liver DNA and lipofuscin accumulation were observed in the treated rats of Experiment 2. In vitro measurement of reactive oxygen species (ROS) in rat liver microsomes revealed a dose-dependent increase due to FF treatment. Microarray (only Experiment 1) or real-time reverse transcription-polymerase chain reaction analyses revealed that the expression levels of metabolism and DNA repair-related genes such as Aco, Cyp4a1, Cat, Yc2, Gpx2, Apex1, Xrcc5, Mgmt, Mlh1, Gadd45a, and Nbn were increased in FF-treated rats. These results provide evidence of a direct or indirect relationship between oxidative stress and FF treatment. In addition, increases in the expression levels of cell cycle-related genes such as Chek1, Cdc25a, and Ccdn1; increases in the expression levels of cell proliferation-related genes such as Hdgfrp3 and Vegfb; and fluctuations in the expression levels of apoptosis-related genes such as Casp11 and Trp53inp1 were observed in these rats. This suggests that cell proliferation induction, apoptosis suppression, and DNA damage due to oxidative stresses are probably involved in the mechanism of hepatocarcinogenesis due to FF in rats.
Chemiluminescence microarrays in analytical chemistry: a critical review.
Seidel, Michael; Niessner, Reinhard
2014-09-01
Multi-analyte immunoassays on microarrays and on multiplex DNA microarrays have been described for quantitative analysis of small organic molecules (e.g., antibiotics, drugs of abuse, small molecule toxins), proteins (e.g., antibodies or protein toxins), and microorganisms, viruses, and eukaryotic cells. In analytical chemistry, multi-analyte detection by use of analytical microarrays has become an innovative research topic because of the possibility of generating several sets of quantitative data for different analyte classes in a short time. Chemiluminescence (CL) microarrays are powerful tools for rapid multiplex analysis of complex matrices. A wide range of applications for CL microarrays is described in the literature dealing with analytical microarrays. The motivation for this review is to summarize the current state of CL-based analytical microarrays. Combining analysis of different compound classes on CL microarrays reduces analysis time, cost of reagents, and use of laboratory space. Applications are discussed, with examples from food safety, water safety, environmental monitoring, diagnostics, forensics, toxicology, and biosecurity. The potential and limitations of research on multiplex analysis by use of CL microarrays are discussed in this review.
Analysis of High-Throughput ELISA Microarray Data
DOE Office of Scientific and Technical Information (OSTI.GOV)
White, Amanda M.; Daly, Don S.; Zangar, Richard C.
Our research group develops analytical methods and software for the high-throughput analysis of quantitative enzyme-linked immunosorbent assay (ELISA) microarrays. ELISA microarrays differ from DNA microarrays in several fundamental aspects and most algorithms for analysis of DNA microarray data are not applicable to ELISA microarrays. In this review, we provide an overview of the steps involved in ELISA microarray data analysis and how the statistically sound algorithms we have developed provide an integrated software suite to address the needs of each data-processing step. The algorithms discussed are available in a set of open-source software tools (http://www.pnl.gov/statistics/ProMAT).
A Model-Based Joint Identification of Differentially Expressed Genes and Phenotype-Associated Genes
Seo, Minseok; Shin, Su-kyung; Kwon, Eun-Young; Kim, Sung-Eun; Bae, Yun-Jung; Lee, Seungyeoun; Sung, Mi-Kyung; Choi, Myung-Sook; Park, Taesung
2016-01-01
Over the last decade, many analytical methods and tools have been developed for microarray data. The detection of differentially expressed genes (DEGs) among different treatment groups is often a primary purpose of microarray data analysis. In addition, association studies investigating the relationship between genes and a phenotype of interest such as survival time are also popular in microarray data analysis. Phenotype association analysis provides a list of phenotype-associated genes (PAGs). However, it is sometimes necessary to identify genes that are both DEGs and PAGs. We consider the joint identification of DEGs and PAGs in microarray data analyses. The first approach we used was a naïve approach that detects DEGs and PAGs separately and then identifies the genes in the intersection of the lists of PAGs and DEGs. The second approach we considered was a hierarchical approach that detects DEGs first and then chooses PAGs from among the DEGs or vice versa. In this study, we propose a new model-based approach for the joint identification of DEGs and PAGs. Unlike the previous two-step approaches, the proposed method simultaneously identifies genes that are both DEGs and PAGs. This method uses standard regression models but adopts a different null hypothesis from ordinary regression models, which allows us to perform joint identification in one step. The proposed model-based methods were evaluated using experimental data and simulation studies. The proposed methods were used to analyze a microarray experiment in which the main interest lies in detecting genes that are both DEGs and PAGs, where DEGs are identified between two diet groups and PAGs are associated with four phenotypes reflecting the expression of leptin, adiponectin, insulin-like growth factor 1, and insulin. Model-based approaches provided a larger number of genes that are both DEGs and PAGs than other methods. Simulation studies showed that they have more power than other methods.
Through analysis of data from experimental microarrays and simulation studies, the proposed model-based approach was shown to provide a more powerful result than the naïve approach and the hierarchical approach. Since our approach is model-based, it is very flexible and can easily handle different types of covariates. PMID:26964035
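The two two-step strategies described above (naïve intersection and hierarchical selection) can be sketched in a few lines of Python. This is a minimal illustration, not the authors' implementation: the gene names, the p-values, and the use of a Bonferroni correction are assumptions made here only to show how the two strategies can give different gene lists.

```python
def naive_joint(deg_p, pag_p, alpha=0.05):
    """Naive approach: test DEGs and PAGs separately, then intersect.

    Both lists are Bonferroni-corrected over all genes (an assumption).
    """
    m = len(deg_p)
    degs = {g for g, p in deg_p.items() if p < alpha / m}
    pags = {g for g, p in pag_p.items() if p < alpha / m}
    return degs & pags


def hierarchical_joint(deg_p, pag_p, alpha=0.05):
    """Hierarchical approach: detect DEGs first, then choose PAGs among them.

    The second step is corrected only over the DEGs found in the first step.
    """
    m = len(deg_p)
    degs = {g for g, p in deg_p.items() if p < alpha / m}
    if not degs:
        return set()
    return {g for g in degs if pag_p[g] < alpha / len(degs)}


# Hypothetical per-gene p-values (illustrative only).
deg_p = {"g1": 0.001, "g2": 0.004, "g3": 0.2, "g4": 0.0005}
pag_p = {"g1": 0.002, "g2": 0.015, "g3": 0.001, "g4": 0.3}

naive_hits = naive_joint(deg_p, pag_p)
hierarchical_hits = hierarchical_joint(deg_p, pag_p)
```

In this toy example the hierarchical strategy keeps one gene that the naïve intersection drops, because its second step corrects over fewer tests. The model-based method proposed in the abstract replaces both steps with a single test and is not reproduced here.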
Big Results from Small Samples: Evaluation of Amplification Protocols for Gene Expression Profiling
Microarrays have revolutionized many areas of biology due to our technical ability to quantify tens of thousands of transcripts within a single experiment. However, there are still many areas that cannot benefit from this technology due to the amount of biological material needed...
Waveguide-excited fluorescence microarray
NASA Astrophysics Data System (ADS)
Sagarzazu, Gabriel; Bedu, Mélanie; Martinelli, Lucio; Ha, Khoi-Nguyen; Pelletier, Nicolas; Safarov, Viatcheslav I.; Weisbuch, Claude; Gacoin, Thierry; Benisty, Henri
2008-04-01
Signal-to-noise ratio is a crucial issue in microarray fluorescence read-out. Several strategies are proposed for its improvement. First, light collection in conventional microarray scanners is quite limited. It was recently shown that almost full collection can be achieved in an integrated lens-free biosensor, with labelled species hybridizing practically on the surface of a sensitive silicon detector [L. Martinelli et al. Appl. Phys. Lett. 91, 083901 (2007)]. However, even with such an improvement, the ultimate goal of real-time measurements during hybridization is challenging: the detector is dazzled by the large fluorescence of labelled species in the solution. In the present paper we show that this unwanted signal can effectively be reduced if the excitation light is confined in a waveguide. Moreover, the concentration of excitation light in a waveguide results in a huge signal gain. In our experiment we realized a structure consisting of a high-index sol-gel waveguide deposited on a low-index substrate. The fluorescent molecules deposited on the surface of the waveguide were excited by the evanescent part of a wave travelling in the guide. The comparison with free-space excitation schemes confirms a huge gain (by several orders of magnitude) in favour of waveguide-based excitation. An optical guide deposited onto an integrated biosensor thus combines both advantages of ideal light collection and enhanced surface-localized excitation without compromising the imaging properties. Modelling predicts a negligible penalty from spatial cross-talk in practical applications. We believe that such a system would bring microarrays to hitherto unattained sensitivities.
Samolski, Ilanit; de Luis, Alberto; Vizcaíno, Juan Antonio; Monte, Enrique; Suárez, M Belén
2009-10-13
Background: It has recently been shown that the Trichoderma fungal species used for biocontrol of plant diseases are capable of interacting with plant roots directly, behaving as symbiotic microorganisms. With a view to providing further information at transcriptomic level about the early response of Trichoderma to a host plant, we developed a high-density oligonucleotide (HDO) microarray encompassing 14,081 Expressed Sequence Tag (EST)-based transcripts from eight Trichoderma spp. and 9,121 genome-derived transcripts of T. reesei, and we have used this microarray to examine the gene expression of T. harzianum either alone or in the presence of tomato plants, chitin, or glucose. Results: Global microarray analysis revealed 1,617 probe sets showing differential expression in T. harzianum mycelia under at least one of the culture conditions tested as compared with one another. Hierarchical clustering and heat map representation showed that the expression patterns obtained in glucose medium clustered separately from the expression patterns observed in the presence of tomato plants and chitin. Annotations using the Blast2GO suite identified 85 of the 257 transcripts whose probe sets afforded up-regulated expression in response to tomato plants. Some of these transcripts were predicted to encode proteins related to Trichoderma-host (fungus or plant) associations, such as Sm1/Elp1 protein, proteases P6281 and PRA1, endochitinase CHIT42, or QID74 protein, although previously uncharacterized genes were also identified, including those responsible for the possible biosynthesis of nitric oxide, xenobiotic detoxification, mycelium development, or those related to the formation of infection structures in plant tissues. Conclusion: The effectiveness of the Trichoderma HDO microarray to detect different gene responses under different growth conditions in the fungus T. harzianum strongly indicates that this tool should be useful for further assays that include different stages of plant colonization, as well as for expression studies in other Trichoderma spp. represented on it. Using this microarray, we have been able to define a number of genes probably involved in the transcriptional response of T. harzianum within the first hours of contact with tomato plant roots, which may provide new insights into the mechanisms and roles of this fungus in the Trichoderma-plant interaction. PMID:19825185
Rise, Matthew L.; von Schalburg, Kristian R.; Brown, Gordon D.; Mawer, Melanie A.; Devlin, Robert H.; Kuipers, Nathanael; Busby, Maura; Beetz-Sargent, Marianne; Alberto, Roberto; Gibbs, A. Ross; Hunt, Peter; Shukin, Robert; Zeznik, Jeffrey A.; Nelson, Colleen; Jones, Simon R.M.; Smailus, Duane E.; Jones, Steven J.M.; Schein, Jacqueline E.; Marra, Marco A.; Butterfield, Yaron S.N.; Stott, Jeff M.; Ng, Siemon H.S.; Davidson, William S.; Koop, Ben F.
2004-01-01
We report 80,388 ESTs from 23 Atlantic salmon (Salmo salar) cDNA libraries (61,819 ESTs), 6 rainbow trout (Oncorhynchus mykiss) cDNA libraries (14,544 ESTs), 2 chinook salmon (Oncorhynchus tshawytscha) cDNA libraries (1317 ESTs), 2 sockeye salmon (Oncorhynchus nerka) cDNA libraries (1243 ESTs), and 2 lake whitefish (Coregonus clupeaformis) cDNA libraries (1465 ESTs). The majority of these are 3′ sequences, allowing discrimination between paralogs arising from a recent genome duplication in the salmonid lineage. Sequence assembly reveals 28,710 different S. salar, 8981 O. mykiss, 1085 O. tshawytscha, 520 O. nerka, and 1176 C. clupeaformis putative transcripts. We annotate the submitted portion of our EST database by molecular function. Higher- and lower-molecular-weight fractions of libraries are shown to contain distinct gene sets, and higher rates of gene discovery are associated with higher-molecular weight libraries. Pyloric caecum library group annotations indicate this organ may function in redox control and as a barrier against systemic uptake of xenobiotics. A microarray is described, containing 7356 salmonid elements representing 3557 different cDNAs. Analyses of cross-species hybridizations to this cDNA microarray indicate that this resource may be used for studies involving all salmonids. PMID:14962987
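The library totals quoted above can be checked with simple arithmetic. The short sketch below sums the per-species EST counts given in the abstract and computes the average number of array elements per cDNA; the dictionary keys are only labels chosen here for readability.

```python
# EST counts per species, as stated in the abstract.
est_counts = {
    "Salmo salar": 61_819,
    "Oncorhynchus mykiss": 14_544,
    "Oncorhynchus tshawytscha": 1_317,
    "Oncorhynchus nerka": 1_243,
    "Coregonus clupeaformis": 1_465,
}

# The five library groups should add up to the reported total of 80,388 ESTs.
total_ests = sum(est_counts.values())

# 7356 array elements represent 3557 different cDNAs,
# i.e. roughly two elements per cDNA on average.
elements_per_cdna = 7356 / 3557
```

The sum reproduces the reported total, and the ratio is consistent with most cDNAs being represented by about two elements on the array.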
Holliday, Jason A; Ralph, Steven G; White, Richard; Bohlmann, Jörg; Aitken, Sally N
2008-01-01
Cold acclimation in conifers is a complex process, the timing and extent of which reflects local adaptation and varies widely along latitudinal gradients for many temperate and boreal tree species. Despite their ecological and economic importance, little is known about the global changes in gene expression that accompany autumn cold acclimation in conifers. Using three populations of Sitka spruce (Picea sitchensis) spanning the species range, and a Picea cDNA microarray with 21,840 unique elements, within- and among-population gene expression was monitored during the autumn. Microarray data were validated for selected genes using real-time PCR. Similar numbers of genes were significantly twofold upregulated (1257) and downregulated (967) between late summer and early winter. Among those upregulated were dehydrins, pathogenesis-related/antifreeze genes, carbohydrate and lipid metabolism genes, and genes involved in signal transduction and transcriptional regulation. Among-population microarray hybridizations at early and late autumn time points revealed substantial variation in the autumn transcriptome, some of which may reflect local adaptation. These results demonstrate the complexity of cold acclimation in conifers, highlight similarities and differences to cold tolerance in annual plants, and provide a solid foundation for functional and genetic studies of this important adaptive process.
Bijangi-Vishehsaraei, Khadijeh; Blum, Kevin; Zhang, Hongji; Safa, Ahmad R; Halum, Stacey L
2016-03-01
The pathophysiology of recurrent laryngeal nerve (RLN) transection injury is rare in that it is characteristically followed by a high degree of spontaneous reinnervation, with reinnervation of the laryngeal adductor complex (AC) preceding that of the abducting posterior cricoarytenoid (PCA) muscle. Here, we aim to elucidate the differentially expressed myogenic factors following RLN injury that may be at least partially responsible for the spontaneous reinnervation. F344 male rats underwent RLN injury (n = 12) or sham surgery (n = 12). One week after RLN injury, larynges were harvested following euthanasia. The mRNA was extracted from PCA and AC muscles bilaterally, and microarray analysis was performed using a full rat genome array. Microarray analysis of denervated AC and PCA muscles demonstrated dramatic differences in gene expression profiles, with 205 individual probes that were differentially expressed between the denervated AC and PCA muscles and only 14 genes with similar expression patterns. The differential expression patterns of the AC and PCA suggest different mechanisms of reinnervation. The PCA showed the gene patterns of Wallerian degeneration, while the AC expressed the gene patterns of reinnervation by adjacent axonal sprouting. This finding may reveal important therapeutic targets applicable to RLN and other peripheral nerve injuries.
A gene expression signature associated with survival in metastatic melanoma
Mandruzzato, Susanna; Callegaro, Andrea; Turcatel, Gianluca; Francescato, Samuela; Montesco, Maria C; Chiarion-Sileni, Vanna; Mocellin, Simone; Rossi, Carlo R; Bicciato, Silvio; Wang, Ena; Marincola, Francesco M; Zanovello, Paola
2006-01-01
Background: Current clinical and histopathological criteria used to define the prognosis of melanoma patients are inadequate for accurate prediction of clinical outcome. We investigated whether genome screening by means of high-throughput gene microarray might provide clinically useful information on patient survival. Methods: Forty-three tumor tissues from 38 patients with stage III and stage IV melanoma were profiled with a 17,500 element cDNA microarray. Expression data were analyzed using significance analysis of microarrays (SAM) to identify genes associated with patient survival, and supervised principal components (SPC) to determine survival prediction. Results: SAM analysis revealed a set of 80 probes, corresponding to 70 genes, associated with survival, i.e. 45 probes characterizing longer and 35 shorter survival times, respectively. These transcripts were included in a survival prediction model designed using SPC and cross-validation which allowed identifying 30 predicting probes out of the 80 associated with survival. Conclusion: The longer-survival group of genes included those expressed in immune cells, both innate and acquired, confirming the interplay between immunological mechanisms and the natural history of melanoma. Genes linked to immune cells were totally lacking in the poor-survival group, which was instead associated with a number of genes related to highly proliferative and invasive tumor cells. PMID:17129373
Chromosome r(10)(p15.3q26.12) in a newborn child: case report.
Gunnarsson, Cecilia; Graffmann, Barbara; Jonasson, Jon
2009-12-07
Ring chromosome 10 is a rare cytogenetic finding. Of the fewer than 10 reported cases we have found in the literature, none was characterized using high-resolution microarray analysis. Ring chromosomes are frequently unstable due to sister chromatid exchanges and mitotic failures. When mosaicism is present, the interpretation of genotype-phenotype correlations becomes extremely difficult. We report on a newborn girl with growth retardation, microcephaly, congenital heart defects, dysmorphic features and psychomotor retardation. Karyotyping revealed a non-mosaic, apparently stable ring chromosome 10 replacing one of the normal homologues in all analyzed metaphases. High-resolution oligonucleotide microarray analysis showed a de novo approximately 12.5 Mb terminal deletion 10q26.12 -> qter and a corresponding 285 kb terminal deletion of 10pter -> p15.3. This case demonstrates that an increased nuchal translucency thickness detected by early ultrasonography should preferably lead to not only QF-PCR for the diagnosis of Down syndrome but also karyotyping. In the future, microarray analysis, which needs further evaluation, might become the method of choice. The clinical phenotype of our patient was in agreement with that of patients with a terminal 10q deletion. For the purpose of genotype-phenotype analysis, there seems to be no need for a "ring syndrome" concept.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Handley, Kim M.; Wrighton, Kelly E.; Piceno, Y. M.
2012-06-13
There is increasing interest in harnessing the functional diversity of indigenous microbial communities to transform and remediate a wide range of environmental contaminants. Understanding the response of communities to stimulation, including flanking taxa, presents important opportunities for optimizing remediation approaches. We used high-density PhyloChip microarray analysis to comprehensively determine community membership and abundance patterns amongst a suite of samples from U(VI) bioremediation experiments. Samples were unstimulated or collected during Fe(III) and sulfate reduction from an acetate-augmented aquifer in Rifle, Colorado, and from laboratory experiments using field-collected materials. Results showed the greatest diversity in abundant SRB lineages was present in naturally reduced sediment. Desulfuromonadales and Desulfobacterales were consistently identified as the dominant Fe(III)- and sulfate-reducing bacteria (IRB and SRB) throughout acetate amendment experiments. Stimulated communities also exhibited a high degree of functional redundancy amongst enriched flanking members. Not surprisingly, competition for both sulfate and iron was evident amongst abundant taxa, but the distribution and abundance of these ancillary SRB (Peptococcaceae, Desulfovibrionales and Syntrophobacterales), and lineages containing IRB (excluding Desulfobacteraceae) was heterogeneous amongst sample types. Interestingly, amongst the most abundant taxa, particularly during sulfate reduction, were Epsilonproteobacteria that perform microaerobic or nitrate-dependent sulfur oxidation, and a number of bacteria other than Geobacteraceae that may enzymatically reduce U(VI). Finally, in-depth community probing with PhyloChip determined the efficacy of experimental approaches, notably revealing striking similarity amongst stimulated sediment (from drill cores and in-situ columns) and groundwater communities, and demonstrating that sediment-packed in-situ (down-well) columns served as an ideal method for subsurface biostimulation.
Herwig, Annika; Campbell, Gill; Mayer, Claus-Dieter; Boelen, Anita; Anderson, Richard A.; Ross, Alexander W.; Mercer, Julian G.
2014-01-01
Background: The thyroid hormone triiodothyronine (T3) is known to affect energy balance. Recent evidence points to an action of T3 in the hypothalamus, a key area of the brain involved in energy homeostasis, but the components and mechanisms are far from understood. The aim of this study was to identify components in the hypothalamus that may be involved in the action of T3 on energy balance regulatory mechanisms. Methods: Sprague Dawley rats were made hypothyroid by giving 0.025% methimazole (MMI) in their drinking water for 22 days. On day 21, half the MMI-treated rats received a saline injection, whereas the others were injected with T3. Food intake and body weight measurements were taken daily. Body composition was determined by magnetic resonance imaging, gene expression was analyzed by in situ hybridization, and T3-induced gene expression was determined by microarray analysis of MMI-treated compared to MMI-T3-injected hypothalamic RNA. Results: Post mortem serum thyroid hormone levels showed that MMI treatment decreased circulating thyroid hormones and increased thyrotropin (TSH). MMI treatment decreased food intake and body weight. Body composition analysis revealed reduced lean and fat mass in thyroidectomized rats from day 14 of the experiment. MMI treatment caused a decrease in circulating triglyceride concentrations, an increase in nonesterified fatty acids, and decreased insulin levels. A glucose tolerance test showed impaired glucose clearance in the thyroidectomized animals. In the brain, in situ hybridization revealed marked changes in gene expression, including genes such as Mct8, a thyroid hormone transporter, and Agrp, a key component in energy balance regulation. Microarray analysis revealed 110 genes to be up- or downregulated with T3 treatment (±1.3-fold change, p<0.05). 
Three genes chosen from the differentially expressed genes were verified by in situ hybridization to be activated by T3 in cells located at or close to the hypothalamic ventricular ependymal layer and differentially expressed in animal models of long- and short-term body weight regulation. Conclusion: This study identified genes regulated by T3 in the hypothalamus, a key area of the brain involved in homeostasis and neuroendocrine functions. These include genes hitherto not known to be regulated by thyroid status. PMID:25087834
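The selection rule quoted above (±1.3-fold change, p<0.05) can be written as a small filter. This is only a sketch under stated assumptions: "±1.3-fold" is read here as a T3/saline expression ratio of at least 1.3 or at most 1/1.3, and the function name and inputs are hypothetical, not taken from the study.

```python
def is_regulated(ratio, p_value, min_fold=1.3, alpha=0.05):
    """Return True if a gene passes a symmetric fold-change and p-value cut-off.

    ratio   -- expression ratio (treated / control), must be positive
    p_value -- p-value of the treated-versus-control comparison
    """
    if ratio <= 0:
        raise ValueError("expression ratio must be positive")
    # Treat up- and down-regulation symmetrically: 0.5 counts as 2-fold down.
    fold = ratio if ratio >= 1.0 else 1.0 / ratio
    return fold >= min_fold and p_value < alpha
```

For example, a ratio of 0.7 corresponds to about 1.43-fold down-regulation and passes the fold-change cut-off, whereas a ratio of 0.8 (1.25-fold) does not.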
DOE Office of Scientific and Technical Information (OSTI.GOV)
Zhou, J.; Wu, L.; Gentry, T.
2006-04-05
To effectively monitor microbial populations involved in various important processes, a 50-mer-based oligonucleotide microarray was developed based on known genes and pathways involved in: biodegradation, metal resistance and reduction, denitrification, nitrification, nitrogen fixation, methane oxidation, methanogenesis, carbon polymer decomposition, and sulfate reduction. This array contains approximately 2000 unique and group-specific probes with <85% similarity to their non-target sequences. Based on artificial probes, our results showed that at hybridization conditions of 50 °C and 50% formamide, the 50-mer microarray hybridization can differentiate sequences having <88% similarity. Specificity tests with representative pure cultures indicated that the designed probes on the arrays appeared to be specific to their corresponding target genes. Detection limits were about 5-10 ng genomic DNA in the absence of background DNA, and 50-100 ng (approximately 1.3 × 10^7 cells) in the presence of background DNA. Strong linear relationships between signal intensity and target DNA and RNA concentration were observed (r^2 = 0.95-0.99). Application of this microarray to naphthalene-amended enrichments and soil microcosms demonstrated that composition of the microflora varied depending on incubation conditions. While the naphthalene-degrading genes from Rhodococcus-type microorganisms were dominant in enrichments, the genes involved in naphthalene degradation from Gram-negative microorganisms such as Ralstonia, Comamonas, and Burkholderia were most abundant in the soil microcosms (as well as those for polyaromatic hydrocarbon and nitrotoluene degradation). Although naphthalene degradation is widely known and studied in Pseudomonas, Pseudomonas genes were not detected in either system. Real-time PCR analysis of 4 representative genes was consistent with microarray-based quantification (r^2 = 0.95). Currently, we are also applying this microarray to the study of several different microbial communities and processes at the NABIR-FRC in Oak Ridge, TN. One project involves monitoring the development and dynamics of the microbial community of a fluidized bed reactor (FBR) used for reducing nitrate, and the other project monitors microbial community responses to stimulation of uranium-reducing populations via ethanol donor additions in situ and in a model system. Additionally, we are developing novel strategies for increasing microarray hybridization sensitivity. Finally, great improvements to our methods of probe design were made by the development of a new computer program, CommOligo. CommOligo designs unique and group-specific oligo probes for whole genomes, metagenomes, and groups of environmental sequences and uses a new global alignment algorithm to design single or multiple probes for each gene or group. We are now using this program to design a more comprehensive functional gene array for environmental studies. Overall, our results indicate that the 50-mer-based microarray technology has potential as a specific and quantitative tool to reveal the composition of microbial communities and their dynamics important to processes within contaminated environments.
Intra-Platform Repeatability and Inter-Platform Comparability of MicroRNA Microarray Technology
Sato, Fumiaki; Tsuchiya, Soken; Terasawa, Kazuya; Tsujimoto, Gozoh
2009-01-01
Over the last decade, DNA microarray technology has contributed greatly to the life sciences. The MicroArray Quality Control (MAQC) project demonstrated how expression microarray data should be analyzed. Recently, microarray technology has been used for comprehensive microRNA expression profiling. Currently, several platforms of microRNA microarray chips are commercially available. Thus, we compared repeatability and comparability of five different microRNA microarray platforms (Agilent, Ambion, Exiqon, Invitrogen and Toray) using 309 microRNA probes, and the Taqman microRNA system using 142 microRNA probes. This study demonstrated that microRNA microarrays have high intra-platform repeatability and good comparability to quantitative RT-PCR of microRNA. Among the five platforms, the Agilent and Toray arrays performed relatively better than the others. However, the current lineup of commercially available microRNA microarray systems fails to show good inter-platform concordance, probably because of the lack of an adequate normalization method and severe divergence in the stringency of detection call criteria between different platforms. This study provided basic information about the performance of, and the problems specific to, the current microRNA microarray systems. PMID:19436744
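One common way to quantify intra-platform repeatability is the correlation between replicate hybridizations of the same sample. The abstract does not state which statistic the authors used, so the sketch below, which computes a Pearson correlation on hypothetical log2 intensities, is an illustration of the idea rather than the study's method.

```python
from math import sqrt


def pearson(x, y):
    """Pearson correlation coefficient between two equal-length sequences."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sd_x = sqrt(sum((a - mean_x) ** 2 for a in x))
    sd_y = sqrt(sum((b - mean_y) ** 2 for b in y))
    return cov / (sd_x * sd_y)


# Hypothetical log2 signal intensities for the same probes on two
# replicate arrays of one platform (values invented for illustration).
replicate_a = [6.1, 7.4, 9.0, 10.2, 12.5]
replicate_b = [6.3, 7.1, 9.2, 10.0, 12.8]

repeatability = pearson(replicate_a, replicate_b)
```

The same function could be applied to measurements of shared probes on two different platforms to gauge inter-platform concordance, provided the signals are first put on a comparable scale.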
Living Cell Microarrays: An Overview of Concepts
Jonczyk, Rebecca; Kurth, Tracy; Lavrentieva, Antonina; Walter, Johanna-Gabriela; Scheper, Thomas; Stahl, Frank
2016-01-01
Living cell microarrays are a highly efficient cellular screening system. Due to the low number of cells required per spot, cell microarrays enable the use of primary and stem cells and provide resolution close to the single-cell level. Apart from a variety of conventional static designs, microfluidic microarray systems have also been established. An alternative format is a microarray consisting of three-dimensional cell constructs ranging from cell spheroids to cells encapsulated in hydrogel. These systems provide an in vivo-like microenvironment and are preferably used for the investigation of cellular physiology, cytotoxicity, and drug screening. Thus, many different high-tech microarray platforms are currently available. Disadvantages of many systems include their high cost, the requirement of specialized equipment for their manufacture, and the poor comparability of results between different platforms. In this article, we provide an overview of static, microfluidic, and 3D cell microarrays. In addition, we describe a simple method for the printing of living cell microarrays on modified microscope glass slides using standard DNA microarray equipment available in most laboratories. Applications in research and diagnostics are discussed, e.g., the selective and sensitive detection of biomarkers. Finally, we highlight current limitations and the future prospects of living cell microarrays. PMID:27600077
Bock, I; Raveh-Amit, H; Losonczi, E; Carstea, A C; Feher, A; Mashayekhi, K; Matyas, S; Dinnyes, A; Pribenszky, C
2016-04-01
The efficiency of various assisted reproductive techniques can be improved by preconditioning the gametes and embryos with sublethal hydrostatic pressure treatment. However, the underlying molecular mechanism responsible for this protective effect remains unknown and requires further investigation. Here, we studied the effect of optimised hydrostatic pressure treatment on the global gene expression of mouse oocytes after embryonic genome activation. Based on a gene expression microarray analysis, a significant effect of treatment was observed in 4-cell embryos derived from treated oocytes, revealing a transcriptional footprint of hydrostatic pressure-affected genes. Functional analysis identified numerous genes involved in protein synthesis that were downregulated in 4-cell embryos in response to hydrostatic pressure treatment, suggesting that regulation of translation has a major role in optimised hydrostatic pressure-induced stress tolerance. We present a comprehensive microarray analysis and further delineate a potential mechanism responsible for the protective effect of hydrostatic pressure treatment.
Anisimov, S V; Bokheler, K R; Khavinson, V Kh; Anisimov, V N
2002-03-01
Expression of 15,247 clones from a cDNA library in the heart of mice receiving Vilon and Epithalon was studied by DNA-microarray technology. We revealed 300 clones (1.94% of the total count) whose expression changed by more than twofold. Vilon changed expression of 36 clones, while Epithalon modulated expression of 98 clones. Combined treatment with Vilon and Epithalon changed expression of 144 clones. Vilon alone or in combination with Epithalon activated expression of 157 clones (by up to 6.13-fold) and inhibited expression of 23 clones (by up to 2.79-fold). Epithalon alone or in combination with Vilon activated expression of 194 clones (by up to 6.61-fold) and inhibited expression of 48 clones (by up to 2.71-fold). Our results demonstrate the specific effects of Epithalon and Vilon on gene expression.
Gao, Jian-Jie; Peng, Ri-He; Zhu, Bo; Wang, Bo; Wang, Li-Juan; Xu, Jing; Sun, Miao; Yao, Quan-Hong
2015-10-01
Acrylamide (ACR) is a widely used industrial chemical. However, it is a dangerous compound because it has neurotoxic effects in humans and acts as a reproductive toxicant and carcinogen in many animal species. In the environment, acrylamide has high soil mobility and may travel via groundwater. Phytoremediation is an effective method for removing environmental pollutants, but the mechanism of the plant response to acrylamide remains unknown. To assess the remediation potential of plants for acrylamide, we examined acrylamide uptake by the model plant Arabidopsis grown on contaminated substrates using high performance liquid chromatography (HPLC) analysis. The results revealed that acrylamide could be absorbed and degraded by Arabidopsis. Further microarray analysis showed that 527 transcripts were up-regulated within 2 days of acrylamide exposure. We found many potential acrylamide-induced genes that play a major role in plant metabolism and phytoremediation. Copyright © 2015 Elsevier Inc. All rights reserved.
Garg, Rohini; Tyagi, Akhilesh K.; Jain, Mukesh
2012-01-01
Hormones exert pleiotropic effects on plant growth and development throughout the life cycle. Many of these effects are mediated at the molecular level by altering gene expression. In this study, we investigated the effect of exogenous plant hormones, including auxin, cytokinin, abscisic acid, ethylene, salicylic acid and jasmonic acid, on the transcription of rice genes at the whole-genome level using microarrays. Our analysis identified a total of 4171 genes involved in several biological processes, whose expression was altered significantly in the presence of different hormones. Further, 28% of these genes exhibited overlapping transcriptional responses in the presence of any two hormones, indicating crosstalk among plant hormones. In addition, we identified genes showing only a particular hormone-specific response, which can be used as hormone-specific markers. The results of this study will facilitate further studies in hormone biology in rice. PMID:22827941
Mapping of Epitopes Occurring in Bovine α(s1)-Casein Variants by Peptide Microarray Immunoassay.
Lisson, Maria; Erhardt, Georg
2016-01-01
Immunoglobulin E epitope mapping of milk proteins reveals important information about their immunologic properties. Genetic variants of αS1-casein, one of the major allergens in bovine milk, have so far not been considered when discussing the allergenic potential. Here we describe the complete procedure to assess the allergenicity of αS1-casein variants B and C, which are frequent in most breeds, starting from milk with identification and purification of casein variants by isoelectric focusing (IEF) and anion-exchange chromatography, followed by in vitro gastrointestinal digestion of the casein variants, identification of the resulting peptides by matrix-assisted laser desorption/ionization time-of-flight mass spectrometry (MALDI-TOF-MS), in silico analysis of the variant-specific peptides as allergenic epitopes, and determination of their IgE-binding properties by microarray immunoassay with sera from humans allergic to cow's milk.
CoPub: a literature-based keyword enrichment tool for microarray data analysis.
Frijters, Raoul; Heupers, Bart; van Beek, Pieter; Bouwhuis, Maurice; van Schaik, René; de Vlieg, Jacob; Polman, Jan; Alkema, Wynand
2008-07-01
Medline is a rich information source, from which links between genes and keywords describing biological processes, pathways, drugs, pathologies and diseases can be extracted. We developed a publicly available tool called CoPub that uses the information in the Medline database for the biological interpretation of microarray data. CoPub allows batch input of multiple human, mouse or rat genes and produces lists of keywords from several biomedical thesauri that are significantly correlated with the set of input genes. These lists link to Medline abstracts in which the co-occurring input genes and correlated keywords are highlighted. Furthermore, CoPub can graphically visualize differentially expressed genes and over-represented keywords in a network, providing detailed insight in the relationships between genes and keywords, and revealing the most influential genes as highly connected hubs. CoPub is freely accessible at http://services.nbic.nl/cgi-bin/copub/CoPub.pl.
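The idea of a keyword being over-represented in a set of input genes can be sketched with a hypergeometric upper-tail probability. This is a generic illustration of one common enrichment score, not CoPub's own statistic, which the abstract does not specify; the function and parameter names are assumptions made for this example.

```python
from math import comb


def hypergeom_upper_tail(population, annotated, sample, overlap):
    """P(X >= overlap) when drawing `sample` genes without replacement
    from `population` genes, of which `annotated` carry the keyword."""
    total = comb(population, sample)
    upper = min(annotated, sample)
    hits = sum(
        comb(annotated, i) * comb(population - annotated, sample - i)
        for i in range(overlap, upper + 1)
    )
    return hits / total


# Hypothetical toy numbers: 10 genes in total, 3 linked to a keyword,
# 3 genes in the input list, all 3 of them linked to the keyword.
p_value = hypergeom_upper_tail(population=10, annotated=3, sample=3, overlap=3)
```

A small probability suggests that the keyword co-occurs with the input genes more often than expected by chance; in practice such scores would also need correction for testing many keywords.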
RNA-Seq Profiling Reveals Novel Hepatic Gene Expression Pattern in Aflatoxin B1 Treated Rats
Merrick, B. Alex; Phadke, Dhiral P.; Auerbach, Scott S.; Mav, Deepak; Stiegelmeyer, Suzy M.; Shah, Ruchir R.; Tice, Raymond R.
2013-01-01
Deep sequencing was used to investigate the subchronic effects of 1 ppm aflatoxin B1 (AFB1), a potent hepatocarcinogen, on the male rat liver transcriptome prior to onset of histopathological lesions or tumors. We hypothesized RNA-Seq would reveal more differentially expressed genes (DEG) than microarray analysis, including low copy and novel transcripts related to AFB1’s carcinogenic activity compared to feed controls (CTRL). Paired-end reads were mapped to the rat genome (Rn4) with TopHat and further analyzed by DESeq and Cufflinks-Cuffdiff pipelines to identify differentially expressed transcripts, new exons and unannotated transcripts. PCA and cluster analysis of DEGs showed clear separation between AFB1 and CTRL treatments and concordance among group replicates. qPCR of eight high and medium DEGs and three low DEGs showed good comparability among RNA-Seq and microarray transcripts. DESeq analysis identified 1,026 differentially expressed transcripts at greater than two-fold change (p<0.005) compared to 626 transcripts by microarray due to base pair resolution of transcripts by RNA-Seq, probe placement within transcripts or an absence of probes to detect novel transcripts, splice variants and exons. Pathway analysis among DEGs revealed signaling of Ahr, Nrf2, GSH, xenobiotic, cell cycle, extracellular matrix, and cell differentiation networks consistent with pathways leading to AFB1 carcinogenesis, including almost 200 upregulated transcripts controlled by E2f1-related pathways related to kinetochore structure, mitotic spindle assembly and tissue remodeling. We report 49 novel, differentially-expressed transcripts including confirmation by PCR-cloning of two unique, unannotated, hepatic AFB1-responsive transcripts (HAfT’s) on chromosomes 1.q55 and 15.q11, overexpressed by 10 to 25-fold. Several potentially novel exons were found and exon refinements were made including AFB1 exon-specific induction of homologous family members, Ugt1a6 and Ugt1a7c. 
We find the rat transcriptome contains many previously unidentified, AFB1-responsive exons and transcripts supporting RNA-Seq’s capabilities to provide new insights into AFB1-mediated gene expression leading to hepatocellular carcinoma. PMID:23630614
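The selection rule quoted in this abstract (greater than two-fold change, p<0.005) can be sketched as a simple filter. The transcript names and values are hypothetical, and the abstract does not say whether the p-value threshold was applied to raw or adjusted p-values, so the sketch simply takes whatever p-value it is given.

```python
import math


def is_differentially_expressed(log2_fold_change, p_value,
                                min_fold=2.0, max_p=0.005):
    """True when the change exceeds min_fold in either direction
    and the p-value is below max_p."""
    return abs(log2_fold_change) > math.log2(min_fold) and p_value < max_p


# Hypothetical (name, log2 fold change, p-value) records.
records = [
    ("tx1", 3.5, 0.0001),   # large change, significant
    ("tx2", 0.4, 0.0001),   # significant but below two-fold
    ("tx3", -1.8, 0.02),    # above two-fold but not significant
    ("tx4", -2.2, 0.001),   # strongly down-regulated, significant
]
selected = [name for name, lfc, p in records
            if is_differentially_expressed(lfc, p)]
```

Working on the log2 scale treats up- and down-regulation symmetrically: a two-fold change in either direction corresponds to an absolute log2 fold change of 1.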
2014-01-01
Background KIAA1199 is a recently identified novel gene that is up-regulated in human cancer with poor survival. Our proteomic study on signaling polarity in chemotactic cells revealed KIAA1199 as a novel protein target that may be involved in cellular chemotaxis and motility. In the present study, we examined the functional significance of KIAA1199 expression in breast cancer growth, motility and invasiveness. Methods We validated the previous microarray observation by tissue microarray immunohistochemistry using a TMA slide containing 12 breast tumor tissue cores and 12 corresponding normal tissues. We performed the shRNA-mediated knockdown of KIAA1199 in MDA-MB-231 and HS578T cells to study the role of this protein in cell proliferation, migration and apoptosis in vitro. We studied the effects of KIAA1199 knockdown in vivo in two groups of mice (n = 5). We carried out the SILAC LC-MS/MS based proteomic studies on the involvement of KIAA1199 in breast cancer. Results KIAA1199 mRNA and protein was significantly overexpressed in breast tumor specimens and cell lines as compared with non-neoplastic breast tissues from large-scale microarray and studies of breast cancer cell lines and tumors. To gain deeper insights into the novel role of KIAA1199 in breast cancer, we modulated KIAA1199 expression using shRNA-mediated knockdown in two breast cancer cell lines (MDA-MB-231 and HS578T), expressing higher levels of KIAA1199. The KIAA1199 knockdown cells showed reduced motility and cell proliferation in vitro. Moreover, when the knockdown cells were injected into the mammary fat pads of female athymic nude mice, there was a significant decrease in tumor incidence and growth. In addition, quantitative proteomic analysis revealed that knockdown of KIAA1199 in breast cancer (MDA-MB-231) cells affected a broad range of cellular functions including apoptosis, metabolism and cell motility. 
Conclusions Our findings indicate that KIAA1199 may play an important role in breast tumor growth and invasiveness, and that it may represent a novel target for biomarker development and a novel therapeutic target for breast cancer. PMID:24628760
Gluck, Christian; Min, Sangwon; Oyelakin, Akinsola; Smalley, Kirsten; Sinha, Satrajit; Romano, Rose-Anne
2016-11-16
Mouse models have served a valuable role in deciphering various facets of Salivary Gland (SG) biology, from normal developmental programs to diseased states. To facilitate such studies, gene expression profiling maps have been generated for various stages of SG organogenesis. However, these prior studies fall short of capturing the transcriptional complexity due to the limited scope of gene-centric microarray-based technology. Compared to microarray, RNA-sequencing (RNA-seq) offers unbiased detection of novel transcripts, broader dynamic range and high specificity and sensitivity for detection of genes, transcripts, and differential gene expression. Although RNA-seq data, particularly under the auspices of the ENCODE project, have covered a large number of biological specimens, studies on the SG have been lacking. To better appreciate the wide spectrum of gene expression profiles, we isolated RNA from mouse submandibular salivary glands at different embryonic and adult stages. In parallel, we processed RNA-seq data for 24 organs and tissues obtained from the mouse ENCODE consortium and calculated the average gene expression values. To identify molecular players and pathways likely to be relevant for SG biology, we performed functional gene enrichment analysis, network construction and hierarchical clustering of the RNA-seq datasets obtained from different stages of SG development and maturation, and other mouse organs and tissues. Our bioinformatics-based data analysis not only reaffirmed known modulators of SG morphogenesis but also revealed novel transcription factors and signaling pathways unique to mouse SG biology and function. Finally, we demonstrated that the unique SG gene signature obtained from our mouse studies is also well conserved and can demarcate features of the human SG transcriptome that differ from those of other tissues.
Our RNA-seq based Atlas has revealed a high-resolution cartographic view of the dynamic transcriptomic landscape of the mouse SG at various stages. These RNA-seq datasets will complement pre-existing microarray-based datasets, including the Salivary Gland Molecular Anatomy Project, by offering a broader systems-biology based perspective rather than the classical gene-centric view. Ultimately, such resources will provide a useful toolkit to better understand how the diverse cell populations of the SG are organized and controlled during development and differentiation.
A New Oligonucleotide Microarray for Detection of Pathogenic and Non-Pathogenic Legionella spp.
Cao, Boyang; Liu, Xiangqian; Yu, Xiang; Chen, Min; Feng, Lu; Wang, Lei
2014-01-01
Legionella pneumophila has been recognized as the major cause of legionellosis since the discovery of the deadly disease. Legionella spp. other than L. pneumophila were later found to be responsible for many non-pneumophila infections. The non-L. pneumophila infections are likely under-detected because of a lack of effective diagnosis. In this report, we have sequenced the 16S-23S rRNA gene internal transcribed spacer (ITS) of 10 Legionella species and subspecies, including L. anisa, L. bozemanii, L. dumoffii, L. fairfieldensis, L. gormanii, L. jordanis, L. maceachernii, L. micdadei, L. pneumophila subspp. fraseri and L. pneumophila subspp. pasculleii, and developed a rapid oligonucleotide microarray detection technique accordingly to identify the 12 most common Legionella spp., which consist of 11 pathogenic species of L. anisa, L. bozemanii, L. dumoffii, L. gormanii, L. jordanis, L. longbeachae, L. maceachernii, L. micdadei, and L. pneumophila (including subspp. pneumophila, subspp. fraseri, and subspp. pasculleii) and one non-pathogenic species, L. fairfieldensis. Twenty-nine probes that reproducibly detected multiple Legionella species with high specificity were included in the array. A total of 52 strains, including 30 target pathogens and 22 non-target bacteria, were used to verify the oligonucleotide microarray assay. The sensitivity of the detection was 1.0 ng with genomic DNA or 13 CFU/100 mL with Legionella cultures. The microarray detected seven samples of air conditioner-condensed water with 100% accuracy, validating the technique as a promising method for applications in basic microbiology, clinical diagnosis, food safety, and epidemiological surveillance. The phylogenetic study based on the ITS has also revealed that the non-pathogenic L. fairfieldensis is closer to L. pneumophila than the nine other pathogenic Legionella spp. PMID:25469776
Matussek, A; Jernberg, C; Einemo, I-M; Monecke, S; Ehricht, R; Engelmann, I; Löfgren, S; Mernelius, S
2017-08-01
Shiga toxin (Stx)-producing Escherichia coli (STECs) cause non-bloody diarrhea, hemorrhagic colitis, and hemolytic uremic syndrome, and are the primary cause of acute renal failure in children worldwide. This study investigated the correlation of genetic makeup of STEC strains as revealed by DNA microarray to clinical symptoms and the duration of STEC shedding. All STEC isolated (n = 96) from patients <10 years of age in Jönköping County, Sweden from 2003 to 2015 were included. Isolates were characterized by DNA microarray, including almost 280 genes. Clinical data were collected through a questionnaire and by reviewing medical records. Of the 96 virulence genes (including stx) in the microarray, 62 genes were present in at least one isolate. Statistically significant differences in prevalence were observed for 21 genes when comparing patients with bloody diarrhea (BD) and with non-bloody stool (18 of 21 associated with BD). Most genes encode toxins (e.g., stx2 alleles, astA, toxB), adhesion factors (i.e. espB_O157, tir, eae), or secretion factors (e.g., espA, espF, espJ, etpD, nleA, nleB, nleC, tccP). Seven genes were associated with prolonged stx shedding; the presence of three genes (lpfA, senB, and stx1) and the absence of four genes (espB_O157, espF, astA, and intI1). We found STEC genes that might predict severe disease outcome already at diagnosis. This can be used to develop diagnostic tools for risk assessment of disease outcome. Furthermore, genes associated with the duration of stx shedding were detected, enabling a possible better prediction of length of STEC carriage after infection.
[Study of generational risk in deafness inflicted couples using deafness gene microarray technique].
Wang, Ping; Zhao, Jia; Yu, Shu-yuan; Jin, Peng; Zhu, Wei; DU, Bo
2011-06-01
This study explored the significance of screening for deafness-related gene mutations in deaf-mute families using a DNA microarray. A total of 52 deaf-mute couples, with an average age of (58.3 ± 6.7) years (x̄ ± s), were recruited from the Changchun deaf-mute community. Blood samples were obtained with informed consent. Genomic DNA was extracted from peripheral blood and PCR was performed. Nine hot-spot mutations in the four most common deafness-related genes (GJB2, GJB3, PDS and the mtDNA 12S rRNA gene) were examined with the DNA microarray. At the same time, the results were verified by conventional sequencing. Fifty normal individuals served as a control group. All patients were diagnosed with non-syndromic sensorineural hearing loss by subjective pure tone audiometry. Thirty-two of 104 cases carried GJB2 gene mutations (30.7%); the mutation sites included 35delG, 176del16, 235delC and 299delAT. Eighteen of the 32 cases with GJB2 mutations carried 235delC (59.1%). Seven of 104 cases carried the SLC26A4 gene IVS7-2 A > G mutation. Questionnaire survey and gene diagnosis revealed that four of 52 families had deaf offspring (7.6%). When both members of a couple carried the same gene mutation, the risk of deafness in their children was 100%. The results were confirmed by conventional sequencing. There is a high risk of deafness when a deaf-mute couple plans to have a child. DNA microarray screening is important and helpful for preventing further deaf newborns in deaf-mute families.
Hou, Jing; Liu, Xinhui; Wang, Juan; Zhao, Shengnan; Cui, Baoshan
2015-02-03
The effects of heavy metals in agricultural soils have received special attention due to their potential for accumulation in crops, which can affect species at all trophic levels. Therefore, there is a critical need for reliable bioassays for assessing risk levels due to heavy metals in agricultural soil. In the present study, we used microarrays to investigate changes in gene expression of Lycopersicon esculentum in response to Cd-, Cr-, Hg-, or Pb-spiked soil. Exposure to 1/10 of the median lethal concentration (LC50) of Cd, Cr, Hg, or Pb for 7 days resulted in expression changes in 29 Cd-specific, 58 Cr-specific, 192 Hg-specific and 864 Pb-specific genes as determined by microarray analysis, whereas conventional morphological and physiological bioassays did not reveal any toxicant stresses. Hierarchical clustering analysis showed that the characteristic gene expression profiles induced by Cd, Cr, Hg, and Pb were distinct from not only the control but also one another. Furthermore, a total of three genes related to "ion transport" for Cd, 14 genes related to "external encapsulating structure organization", "reproductive developmental process", "lipid metabolic process" and "response to stimulus" for Cr, 11 genes related to "cellular metabolic process" and "cellular response to stimulus" for Hg, and 78 genes related to 20 biological processes (e.g., DNA metabolic process, monosaccharide catabolic process, cell division) for Pb were identified and selected as their potential biomarkers. These findings demonstrated that microarray-based analysis of Lycopersicon esculentum was a sensitive tool for the early detection of potential toxicity of heavy metals in agricultural soil, as well as an effective tool for identifying the heavy metal-specific genes, which should be useful for assessing risk levels due to heavy metals in agricultural soil.
Haas, Christian S; Creighton, Chad J; Pi, Xiujun; Maine, Ira; Koch, Alisa E; Haines, G Kenneth; Ling, Song; Chinnaiyan, Arul M; Holoshitz, Joseph
2006-07-01
To identify disease-specific gene expression profiles in patients with rheumatoid arthritis (RA), using complementary DNA (cDNA) microarray analyses on lymphoblastoid B cell lines (LCLs) derived from RA-discordant monozygotic (MZ) twins. The cDNA was prepared from LCLs derived from the peripheral blood of 11 pairs of RA-discordant MZ twins. The RA twin cDNA was labeled with cy5 fluorescent dye, and the cDNA of the healthy co-twin was labeled with cy3. To determine relative expression profiles, cDNA from each twin pair was combined and hybridized on 20,000-element microarray chips. Immunohistochemistry and real-time polymerase chain reaction were used to detect the expression of selected gene products in synovial tissue from patients with RA compared with patients with osteoarthritis and normal healthy controls. In RA twin LCLs compared with healthy co-twin LCLs, 1,163 transcripts were significantly differentially expressed. Of these, 747 were overexpressed and 416 were underexpressed. Gene ontology analysis revealed many genes known to play a role in apoptosis, angiogenesis, proteolysis, and signaling. The 3 most significantly overexpressed genes were laeverin (a novel enzyme with sequence homology to CD13), 11beta-hydroxysteroid dehydrogenase type 2 (a steroid pathway enzyme), and cysteine-rich, angiogenic inducer 61 (a known angiogenic factor). The products of these genes, heretofore uncharacterized in RA, were all abundantly expressed in RA synovial tissues. Microarray cDNA analysis of peripheral blood-derived LCLs from well-controlled patient populations is a useful tool to detect RA-relevant genes and could help in identifying novel therapeutic targets.
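The two-color design described here (RA twin labeled with cy5, healthy co-twin with cy3, both hybridized to the same array) is usually summarized per gene as a log2 ratio of the two channels. The sketch below shows that calculation and an average across twin pairs; the intensities are hypothetical, and the abstract does not describe the normalization or statistics the authors actually used.

```python
import math


def log2_ratio(cy5, cy3):
    """log2(Cy5 / Cy3) for one spot; positive means higher signal
    in the Cy5-labeled sample (here, the RA twin)."""
    if cy5 <= 0 or cy3 <= 0:
        raise ValueError("channel intensities must be positive")
    return math.log2(cy5 / cy3)


def mean_log2_ratio(pairs):
    """Average the per-pair log2 ratios for one gene across twin pairs."""
    ratios = [log2_ratio(cy5, cy3) for cy5, cy3 in pairs]
    return sum(ratios) / len(ratios)


# Hypothetical (Cy5, Cy3) intensities for one gene in three twin pairs.
gene_pairs = [(400.0, 100.0), (800.0, 200.0), (200.0, 100.0)]
average = mean_log2_ratio(gene_pairs)
```

Because each twin pair shares one array, the ratio is formed within a pair before averaging, which is what makes the co-twin act as a matched control.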
Ulrich, Reiner; Puff, Christina; Wewetzer, Konstantin; Kalkuhl, Arno; Deschl, Ulrich; Baumgärtner, Wolfgang
2014-01-01
Canine distemper virus (CDV)-induced demyelinating leukoencephalitis in dogs (Canis familiaris) is suggested to represent a naturally occurring translational model for subacute sclerosing panencephalitis and multiple sclerosis in humans. The aim of this study was a hypothesis-free microarray analysis of the transcriptional changes within cerebellar specimens of five cases of acute, six cases of subacute demyelinating, and three cases of chronic demyelinating and inflammatory CDV leukoencephalitis as compared to twelve non-infected control dogs. Frozen cerebellar specimens were used for analysis of histopathological changes including demyelination, transcriptional changes employing microarrays, and presence of CDV nucleoprotein RNA and protein using microarrays, RT-qPCR and immunohistochemistry. Microarray analysis revealed 780 differentially expressed probe sets. The dominating change was an up-regulation of genes related to the innate and the humoral immune response, and less distinct the cytotoxic T-cell-mediated immune response in all subtypes of CDV leukoencephalitis as compared to controls. Multiple myelin genes including myelin basic protein and proteolipid protein displayed a selective down-regulation in subacute CDV leukoencephalitis, suggestive of an oligodendrocyte dystrophy. In contrast, a marked up-regulation of multiple immunoglobulin-like expressed sequence tags and the delta polypeptide of the CD3 antigen was observed in chronic CDV leukoencephalitis, in agreement with the hypothesis of an immune-mediated demyelination in the late inflammatory phase of the disease. Analysis of pathways intimately linked to demyelination as determined by morphometry employing correlation-based Gene Set Enrichment Analysis highlighted the pathomechanistic importance of up-regulated genes comprised by the gene ontology terms “viral replication” and “humoral immune response” as well as down-regulated genes functionally related to “metabolite and energy generation”. 
PMID:24755553
Wang, Hongyang; Owens, James D; Shih, Joanna H; Li, Ming-Chung; Bonner, Robert F; Mushinski, J Frederic
2006-04-27
Gene expression profiling by microarray analysis of cells enriched by laser capture microdissection (LCM) faces several technical challenges. Frozen sections yield higher quality RNA than paraffin-imbedded sections, but even with frozen sections, the staining methods used for histological identification of cells of interest could still damage the mRNA in the cells. To study the contribution of staining methods to degradation of results from gene expression profiling of LCM samples, we subjected pellets of the mouse plasma cell tumor cell line TEPC 1165 to direct RNA extraction and to parallel frozen sectioning for LCM and subsequent RNA extraction. We used microarray hybridization analysis to compare gene expression profiles of RNA from cell pellets with gene expression profiles of RNA from frozen sections that had been stained with hematoxylin and eosin (H&E), Nissl Stain (NS), and for immunofluorescence (IF) as well as with the plasma cell-revealing methyl green pyronin (MGP) stain. All RNAs were amplified with two rounds of T7-based in vitro transcription and analyzed by two-color expression analysis on 10-K cDNA microarrays. The MGP-stained samples showed the least introduction of mRNA loss, followed by H&E and immunofluorescence. Nissl staining was significantly more detrimental to gene expression profiles, presumably owing to an aqueous step in which RNA may have been damaged by endogenous or exogenous RNAases. RNA damage can occur during the staining steps preparatory to laser capture microdissection, with the consequence of loss of representation of certain genes in microarray hybridization analysis. Inclusion of RNAase inhibitor in aqueous staining solutions appears to be important in protecting RNA from loss of gene transcripts.
Wang, Hongyang; Owens, James D; Shih, Joanna H; Li, Ming-Chung; Bonner, Robert F; Mushinski, J Frederic
2006-01-01
Background Gene expression profiling by microarray analysis of cells enriched by laser capture microdissection (LCM) faces several technical challenges. Frozen sections yield higher quality RNA than paraffin-imbedded sections, but even with frozen sections, the staining methods used for histological identification of cells of interest could still damage the mRNA in the cells. To study the contribution of staining methods to degradation of results from gene expression profiling of LCM samples, we subjected pellets of the mouse plasma cell tumor cell line TEPC 1165 to direct RNA extraction and to parallel frozen sectioning for LCM and subsequent RNA extraction. We used microarray hybridization analysis to compare gene expression profiles of RNA from cell pellets with gene expression profiles of RNA from frozen sections that had been stained with hematoxylin and eosin (H&E), Nissl Stain (NS), and for immunofluorescence (IF) as well as with the plasma cell-revealing methyl green pyronin (MGP) stain. All RNAs were amplified with two rounds of T7-based in vitro transcription and analyzed by two-color expression analysis on 10-K cDNA microarrays. Results The MGP-stained samples showed the least introduction of mRNA loss, followed by H&E and immunofluorescence. Nissl staining was significantly more detrimental to gene expression profiles, presumably owing to an aqueous step in which RNA may have been damaged by endogenous or exogenous RNAases. Conclusion RNA damage can occur during the staining steps preparatory to laser capture microdissection, with the consequence of loss of representation of certain genes in microarray hybridization analysis. Inclusion of RNAase inhibitor in aqueous staining solutions appears to be important in protecting RNA from loss of gene transcripts. PMID:16643667
High-Throughput Cloning and Expression Library Creation for Functional Proteomics
Festa, Fernanda; Steel, Jason; Bian, Xiaofang; Labaer, Joshua
2013-01-01
The study of protein function usually requires the use of a cloned version of the gene for protein expression and functional assays. This strategy is particularly important when the information available regarding function is limited. The functional characterization of the thousands of newly identified proteins revealed by genomics requires faster methods than traditional single-gene experiments, creating the need for fast, flexible and reliable cloning systems. These collections of open reading frame (ORF) clones can be coupled with high-throughput proteomics platforms, such as protein microarrays and cell-based assays, to answer biological questions. In this tutorial we provide the background for DNA cloning, discuss the major high-throughput cloning systems (Gateway® Technology, Flexi® Vector Systems, and Creator™ DNA Cloning System) and compare them side-by-side. We also report an example of a high-throughput cloning study and its application in functional proteomics. This tutorial is part of the International Proteomics Tutorial Programme (IPTP12). Details can be found at http://www.proteomicstutorials.org. PMID:23457047
Zhang, Guang Lan; Keskin, Derin B.; Lin, Hsin-Nan; Lin, Hong Huang; DeLuca, David S.; Leppanen, Scott; Milford, Edgar L.; Reinherz, Ellis L.; Brusic, Vladimir
2014-01-01
Human leukocyte antigens (HLA) are important biomarkers because multiple diseases, drug toxicity, and vaccine responses reveal strong HLA associations. Current clinical HLA typing is an elimination process requiring serial testing. We present an alternative in situ synthesized DNA-based microarray method that contains hundreds of thousands of probes representing a complete overlapping set covering 1,610 clinically relevant HLA class I alleles accompanied by computational tools for assigning HLA type to 4-digit resolution. Our proof-of-concept experiment included 21 blood samples, 18 cell lines, and multiple controls. The method is accurate, robust, and amenable to automation. Typing errors were restricted to homozygous samples or those with very closely related alleles from the same locus, but readily resolved by targeted DNA sequencing validation of flagged samples. High-throughput HLA typing technologies that are effective, yet inexpensive, can be used to analyze the world’s populations, benefiting both global public health and personalized health care. PMID:25505899
A New Approach for Mining Order-Preserving Submatrices Based on All Common Subsequences.
Xue, Yun; Liao, Zhengling; Li, Meihang; Luo, Jie; Kuang, Qiuhua; Hu, Xiaohui; Li, Tiechen
2015-01-01
Order-preserving submatrices (OPSMs) have been applied in many fields, such as DNA microarray data analysis, automatic recommendation systems, and target marketing systems, as an important unsupervised learning model. Unfortunately, because the underlying problem is NP-complete, most existing methods are heuristic algorithms that cannot reveal all OPSMs. In particular, deep OPSMs, corresponding to long patterns with few supporting sequences, incur explosive computational costs and are completely pruned by most popular methods. In this paper, we propose an exact method to discover all OPSMs based on frequent sequential pattern mining. First, an existing algorithm was adjusted to disclose all common subsequences (ACS) between every two row sequences, so that no deep OPSM is missed. Then, an improved prefix-tree data structure was used to store and traverse the ACS, and the Apriori principle was employed to mine the frequent sequential patterns efficiently. Finally, experiments were carried out on gene and synthetic datasets. Results demonstrated the effectiveness and efficiency of this method.
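The notions of row sequence, common subsequence, and supporting row used in this abstract can be illustrated with a small sketch. It assumes each matrix row is converted to the sequence of its column indices sorted by increasing value; the function names and the toy matrix are hypothetical, and the enumeration is a brute-force illustration for tiny inputs, not the prefix-tree algorithm the authors describe.

```python
from itertools import combinations

def column_order(row):
    """Return the column indices of a row sorted by increasing value."""
    return tuple(sorted(range(len(row)), key=lambda j: row[j]))

def is_subsequence(pattern, sequence):
    """True if pattern occurs in sequence in order (gaps allowed)."""
    it = iter(sequence)
    return all(col in it for col in pattern)

def all_common_subsequences(seq_a, seq_b, min_len=2):
    """Enumerate every common subsequence of two row sequences.

    Brute force and exponential in the sequence length: illustration only.
    """
    found = set()
    for k in range(min_len, len(seq_a) + 1):
        for idx in combinations(range(len(seq_a)), k):
            pattern = tuple(seq_a[i] for i in idx)
            if is_subsequence(pattern, seq_b):
                found.add(pattern)
    return found

def supporting_rows(matrix, pattern):
    """Rows whose values increase along the columns listed in pattern."""
    return [i for i, row in enumerate(matrix)
            if is_subsequence(pattern, column_order(row))]

# Hypothetical toy expression matrix (rows: genes, columns: conditions).
matrix = [
    [1.0, 4.0, 2.0, 3.0],
    [0.5, 9.0, 1.5, 7.0],
    [5.0, 1.0, 2.0, 3.0],
]

shared = all_common_subsequences(column_order(matrix[0]), column_order(matrix[2]))
deep_pattern = column_order(matrix[0])
print(shared, supporting_rows(matrix, deep_pattern))
```

In this toy matrix the full four-column pattern is supported by only two rows, which is the kind of long, weakly supported ("deep") pattern that the abstract says heuristic methods tend to prune.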
Dehydroxymethylepoxyquinomicin selectively ablates T-CAEBV cells.
Zhang, Hui; Yang, Wen-Tao; Wang, Zhao; Yao, Chun-Mei; Wang, Xiao-Fang; Tian, Zhi-Qing; Jin, Ying-Ying; Wang, Lin-Lin; Chen, Tong-Xin
2015-01-01
Chronic active Epstein-Barr virus infection (CAEBV) represents a new subtype of lymphoproliferative disorders characterized by high morbidity and mortality rates and often leads to malignant transformation of infected cells. Efficient therapeutic strategies are presently unavailable; therefore, the development of therapies to prevent CAEBV-mediated transformation and disease progression is crucial. Here, we used microarray analysis and luciferase reporter assays to reveal the potential role of activated nuclear factor kappa B (NF-kB) in T-cell-type CAEBV (T-CAEBV) infection. Using a series of cellular and molecular experiments, we demonstrated that dehydroxymethylepoxyquinomicin (DHMEQ), a novel NF-kB inhibitor, can selectively induce apoptosis in SNT-16 cells infected with CAEBV. Mechanistic studies suggested that DHMEQ induces SNT-16 cell apoptosis through NF-kB inhibition coupled with oxidative stress generation. Thus, activated NF-kB could be a new target for CAEBV therapeutics. Owing to its selective targeting ability, DHMEQ may be a candidate for a novel therapeutic regimen to control the progression of CAEBV infections.
High-throughput cloning and expression library creation for functional proteomics.
Festa, Fernanda; Steel, Jason; Bian, Xiaofang; Labaer, Joshua
2013-05-01
The study of protein function usually requires the use of a cloned version of the gene for protein expression and functional assays. This strategy is particularly important when the information available regarding function is limited. The functional characterization of the thousands of newly identified proteins revealed by genomics requires faster methods than traditional single-gene experiments, creating the need for fast, flexible, and reliable cloning systems. These collections of ORF clones can be coupled with high-throughput proteomics platforms, such as protein microarrays and cell-based assays, to answer biological questions. In this tutorial, we provide the background for DNA cloning, discuss the major high-throughput cloning systems (Gateway® Technology, Flexi® Vector Systems, and Creator™ DNA Cloning System) and compare them side-by-side. We also report an example of a high-throughput cloning study and its application in functional proteomics. This tutorial is part of the International Proteomics Tutorial Programme (IPTP12). © 2013 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Mapping and analysis of Caenorhabditis elegans transcription factor sequence specificities
Narasimhan, Kamesh; Lambert, Samuel A; Yang, Ally WH; Riddell, Jeremy; Mnaimneh, Sanie; Zheng, Hong; Albu, Mihai; Najafabadi, Hamed S; Reece-Hoyes, John S; Fuxman Bass, Juan I; Walhout, Albertha JM; Weirauch, Matthew T; Hughes, Timothy R
2015-01-01
Caenorhabditis elegans is a powerful model for studying gene regulation, as it has a compact genome and a wealth of genomic tools. However, identification of regulatory elements has been limited, as DNA-binding motifs are known for only 71 of the estimated 763 sequence-specific transcription factors (TFs). To address this problem, we performed protein binding microarray experiments on representatives of canonical TF families in C. elegans, obtaining motifs for 129 TFs. Additionally, we predict motifs for many TFs that have DNA-binding domains similar to those already characterized, increasing coverage of binding specificities to 292 C. elegans TFs (∼40%). These data highlight the diversification of binding motifs for the nuclear hormone receptor and C2H2 zinc finger families and reveal unexpected diversity of motifs for T-box and DM families. Motif enrichment in promoters of functionally related genes is consistent with known biology and also identifies putative regulatory roles for unstudied TFs. DOI: http://dx.doi.org/10.7554/eLife.06967.001 PMID:25905672
Statistical methodology for the analysis of dye-switch microarray experiments
Mary-Huard, Tristan; Aubert, Julie; Mansouri-Attia, Nadera; Sandra, Olivier; Daudin, Jean-Jacques
2008-01-01
Background: In individually dye-balanced microarray designs, each biological sample is hybridized on two different slides, once with Cy3 and once with Cy5. While this strategy ensures an automatic correction of the gene-specific labelling bias, it also induces dependencies between log-ratio measurements that must be taken into account in the statistical analysis. Results: We present two original statistical procedures for the statistical analysis of individually balanced designs. These procedures are compared with the usual ML and REML mixed model procedures proposed in most statistical toolboxes, on both simulated and real data. Conclusion: The UP procedure we propose as an alternative to usual mixed model procedures is more efficient and significantly faster to compute. This result provides some useful guidelines for the analysis of complex designs. PMID:18271965
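Why a dye switch corrects the gene-specific labelling bias can be seen in a short worked example. It assumes a simplified additive model on the log2 scale, in which each observed log-ratio is the true effect plus a gene-specific dye bias; the intensities and function names are hypothetical, and the sketch does not implement the UP procedure or the mixed-model analyses compared in the abstract.

```python
import math

def log_ratio(cy5, cy3):
    """Observed log2 ratio of the Cy5 channel over the Cy3 channel."""
    return math.log2(cy5 / cy3)

def dye_swap_estimate(m_forward, m_swapped):
    """Separate the effect from the dye bias for one gene.

    Assumed model: m_forward = effect + bias, m_swapped = -effect + bias.
    """
    effect = (m_forward - m_swapped) / 2
    dye_bias = (m_forward + m_swapped) / 2
    return effect, dye_bias

# Hypothetical gene: sample A is 4x sample B (true log2 effect = 2), and the
# Cy5 channel reads twice as bright as it should for this gene (bias = 1).
m_forward = log_ratio(cy5=800.0, cy3=100.0)  # slide 1: A in Cy5, B in Cy3
m_swapped = log_ratio(cy5=200.0, cy3=400.0)  # slide 2: B in Cy5, A in Cy3

effect, dye_bias = dye_swap_estimate(m_forward, m_swapped)
print(effect, dye_bias)
```

Because both slides measure the same pair of samples, the two log-ratios are not independent observations, which is the dependency the abstract says the statistical analysis must account for.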
McLachlan, G J; Bean, R W; Jones, L Ben-Tovim
2006-07-01
An important problem in microarray experiments is the detection of genes that are differentially expressed in a given number of classes. We provide a straightforward and easily implemented method for estimating the posterior probability that an individual gene is null. The problem can be expressed in a two-component mixture framework, using an empirical Bayes approach. Current implementations of this approach are either limited by the minimal assumptions they make or, when they make more specific assumptions, computationally intensive. We convert the value of the test statistic used to test the significance of each gene to a z-score and propose a simple two-component normal mixture that adequately models the distribution of this score. The usefulness of our approach is demonstrated on three real datasets.
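The two-component normal mixture on z-scores can be sketched as follows. This is a minimal illustration, assuming a theoretical N(0, 1) null component, a single normal component for the non-null genes, and a plain EM fit; the starting values, function names, and simulated z-scores are all hypothetical and are not taken from the paper.

```python
import math
import random

def normal_pdf(z, mean, sd):
    """Density of a normal distribution with the given mean and sd."""
    return math.exp(-0.5 * ((z - mean) / sd) ** 2) / (sd * math.sqrt(2 * math.pi))

def posterior_null(z, pi0, mu1, sd1):
    """Posterior probability that a gene with z-score z is null."""
    f0 = pi0 * normal_pdf(z, 0.0, 1.0)
    f1 = (1.0 - pi0) * normal_pdf(z, mu1, sd1)
    return f0 / (f0 + f1)

def fit_null_mixture(z_scores, n_iter=200):
    """EM fit of pi0 * N(0, 1) + (1 - pi0) * N(mu1, sd1^2)."""
    pi0, mu1, sd1 = 0.9, 2.0, 1.0  # assumed starting values
    n = len(z_scores)
    for _ in range(n_iter):
        # E-step: posterior null probability of every gene.
        tau0 = [posterior_null(z, pi0, mu1, sd1) for z in z_scores]
        # M-step: update the mixing proportion and the non-null component.
        w1 = [1.0 - t for t in tau0]
        total1 = max(sum(w1), 1e-12)
        pi0 = sum(tau0) / n
        mu1 = sum(w * z for w, z in zip(w1, z_scores)) / total1
        var1 = sum(w * (z - mu1) ** 2 for w, z in zip(w1, z_scores)) / total1
        sd1 = max(math.sqrt(var1), 1e-3)
    return pi0, mu1, sd1

# Simulated z-scores: 900 null genes and 100 differentially expressed genes.
random.seed(0)
z_scores = [random.gauss(0.0, 1.0) for _ in range(900)]
z_scores += [random.gauss(3.0, 1.0) for _ in range(100)]

pi0, mu1, sd1 = fit_null_mixture(z_scores)
print(round(pi0, 3), round(mu1, 3), round(sd1, 3))
print(posterior_null(0.0, pi0, mu1, sd1), posterior_null(5.0, pi0, mu1, sd1))
```

Genes can then be ranked by their posterior probability of being null, and a cutoff on that probability gives a list of candidate differentially expressed genes.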
Ensink, Elliot; Sinha, Jessica; Sinha, Arkadeep; Tang, Huiyuan; Calderone, Heather M; Hostetter, Galen; Winter, Jordan; Cherba, David; Brand, Randall E; Allen, Peter J; Sempere, Lorenzo F; Haab, Brian B
2015-10-06
Experiments involving the high-throughput quantification of image data require algorithms for automation. A challenge in the development of such algorithms is to properly interpret signals over a broad range of image characteristics, without the need for manual adjustment of parameters. Here we present a new approach for locating signals in image data, called Segment and Fit Thresholding (SFT). The method assesses statistical characteristics of small segments of the image and determines the best-fit trends between the statistics. Based on the relationships, SFT identifies segments belonging to background regions; analyzes the background to determine optimal thresholds; and analyzes all segments to identify signal pixels. We optimized the initial settings for locating background and signal in antibody microarray and immunofluorescence data and found that SFT performed well over multiple, diverse image characteristics without readjustment of settings. When used for the automated analysis of multicolor, tissue-microarray images, SFT correctly found the overlap of markers with known subcellular localization, and it performed better than a fixed threshold and Otsu's method for selected images. SFT promises to advance the goal of full automation in image analysis.
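For context, Otsu's method, one of the two baselines that SFT is compared against, can be sketched in a few lines. This is the standard histogram-based formulation for integer intensities and is not an implementation of SFT; the function name and the pixel values are hypothetical.

```python
def otsu_threshold(pixels, levels=256):
    """Return the threshold t that maximizes between-class variance.

    Pixels with intensity <= t are treated as background, the rest as signal.
    """
    hist = [0] * levels
    for p in pixels:
        hist[p] += 1
    total = len(pixels)
    sum_all = sum(i * h for i, h in enumerate(hist))

    best_t, best_between = 0, -1.0
    w_bg, sum_bg = 0, 0
    for t in range(levels):
        w_bg += hist[t]
        sum_bg += t * hist[t]
        if w_bg == 0:
            continue  # no background pixels yet
        w_fg = total - w_bg
        if w_fg == 0:
            break  # every pixel is already in the background class
        mean_bg = sum_bg / w_bg
        mean_fg = (sum_all - sum_bg) / w_fg
        between = w_bg * w_fg * (mean_bg - mean_fg) ** 2
        if between > best_between:
            best_between, best_t = between, t
    return best_t

# Hypothetical 8-bit image flattened to a list: dim background, bright spots.
pixels = [10] * 50 + [12] * 50 + [200] * 20 + [210] * 20
threshold = otsu_threshold(pixels)
signal = [p for p in pixels if p > threshold]
print(threshold, len(signal))
```

Otsu's method picks one global threshold from the intensity histogram, whereas the abstract describes SFT as deriving its thresholds from the statistics of small image segments identified as background.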