A genome-wide 20 K citrus microarray for gene expression analysis
Martinez-Godoy, M Angeles; Mauri, Nuria; Juarez, Jose; Marques, M Carmen; Santiago, Julia; Forment, Javier; Gadea, Jose
2008-01-01
Background Understanding of genetic elements that contribute to key aspects of citrus biology will impact future improvements in this economically important crop. Global gene expression analysis demands microarray platforms with a high genome coverage. In the last years, genome-wide EST collections have been generated in citrus, opening the possibility to create new tools for functional genomics in this crop plant. Results We have designed and constructed a publicly available genome-wide cDNA microarray that include 21,081 putative unigenes of citrus. As a functional companion to the microarray, a web-browsable database [1] was created and populated with information about the unigenes represented in the microarray, including cDNA libraries, isolated clones, raw and processed nucleotide and protein sequences, and results of all the structural and functional annotation of the unigenes, like general description, BLAST hits, putative Arabidopsis orthologs, microsatellites, putative SNPs, GO classification and PFAM domains. We have performed a Gene Ontology comparison with the full set of Arabidopsis proteins to estimate the genome coverage of the microarray. We have also performed microarray hybridizations to check its usability. Conclusion This new cDNA microarray replaces the first 7K microarray generated two years ago and allows gene expression analysis at a more global scale. We have followed a rational design to minimize cross-hybridization while maintaining its utility for different citrus species. Furthermore, we also provide access to a website with full structural and functional annotation of the unigenes represented in the microarray, along with the ability to use this site to directly perform gene expression analysis using standard tools at different publicly available servers. Furthermore, we show how this microarray offers a good representation of the citrus genome and present the usefulness of this genomic tool for global studies in citrus by using it to catalogue genes expressed in citrus globular embryos. PMID:18598343
Microarray as a First Genetic Test in Global Developmental Delay: A Cost-Effectiveness Analysis
ERIC Educational Resources Information Center
Trakadis, Yannis; Shevell, Michael
2011-01-01
Aim: Microarray technology has a significantly higher clinical yield than karyotyping in individuals with global developmental delay (GDD). Despite this, it has not yet been routinely implemented as a screening test owing to the perception that this approach is more expensive. We aimed to evaluate the effect that replacing karyotype with…
Development of DNA Microarrays for Metabolic Pathway and Bioprocess Monitoring
DOE Office of Scientific and Technical Information (OSTI.GOV)
Gregory Stephanopoulos
Transcriptional profiling experiments utilizing DNA microarrays to study the intracellular accumulation of PHB in Synechocystis has proved difficult in large part because strains that show significant differences in PHB which would justify global analysis of gene expression have not been isolated.
Construction of a cDNA microarray derived from the ascidian Ciona intestinalis.
Azumi, Kaoru; Takahashi, Hiroki; Miki, Yasufumi; Fujie, Manabu; Usami, Takeshi; Ishikawa, Hisayoshi; Kitayama, Atsusi; Satou, Yutaka; Ueno, Naoto; Satoh, Nori
2003-10-01
A cDNA microarray was constructed from a basal chordate, the ascidian Ciona intestinalis. The draft genome of Ciona has been read and inferred to contain approximately 16,000 protein-coding genes, and cDNAs for transcripts of 13,464 genes have been characterized and compiled as the "Ciona intestinalis Gene Collection Release I". In the present study, we constructed a cDNA microarray of these 13,464 Ciona genes. A preliminary experiment with Cy3- and Cy5-labeled probes showed extensive differential gene expression between fertilized eggs and larvae. In addition, there was a good correlation between results obtained by the present microarray analysis and those from previous EST analyses. This first microarray of a large collection of Ciona intestinalis cDNA clones should facilitate the analysis of global gene expression and gene networks during the embryogenesis of basal chordates.
Gong, Wei; He, Kun; Covington, Mike; Dinesh-Kumar, S. P.; Snyder, Michael; Harmer, Stacey L.; Zhu, Yu-Xian; Deng, Xing Wang
2009-01-01
We used our collection of Arabidopsis transcription factor (TF) ORFeome clones to construct protein microarrays containing as many as 802 TF proteins. These protein microarrays were used for both protein-DNA and protein-protein interaction analyses. For protein-DNA interaction studies, we examined AP2/ERF family TFs and their cognate cis-elements. By careful comparison of the DNA-binding specificity of 13 TFs on the protein microarray with previous non-microarray data, we showed that protein microarrays provide an efficient and high throughput tool for genome-wide analysis of TF-DNA interactions. This microarray protein-DNA interaction analysis allowed us to derive a comprehensive view of DNA-binding profiles of AP2/ERF family proteins in Arabidopsis. It also revealed four TFs that bound the EE (evening element) and had the expected phased gene expression under clock-regulation, thus providing a basis for further functional analysis of their roles in clock regulation of gene expression. We also developed procedures for detecting protein interactions using this TF protein microarray and discovered four novel partners that interact with HY5, which can be validated by yeast two-hybrid assays. Thus, plant TF protein microarrays offer an attractive high-throughput alternative to traditional techniques for TF functional characterization on a global scale. PMID:19802365
Emerging Use of Gene Expression Microarrays in Plant Physiology
Wullschleger, Stan D.; Difazio, Stephen P.
2003-01-01
Microarrays have become an important technology for the global analysis of gene expression in humans, animals, plants, and microbes. Implemented in the context of a well-designed experiment, cDNA and oligonucleotide arrays can provide highthroughput, simultaneous analysis of transcript abundance for hundreds, if not thousands, of genes. However, despite widespread acceptance, the use of microarrays as a tool to better understand processes of interest to the plant physiologist is still being explored. To help illustrate current uses of microarrays in the plant sciences, several case studies that we believe demonstrate the emerging application of gene expression arrays in plant physiology weremore » selected from among the many posters and presentations at the 2003 Plant and Animal Genome XI Conference. Based on this survey, microarrays are being used to assess gene expression in plants exposed to the experimental manipulation of air temperature, soil water content and aluminium concentration in the root zone. Analysis often includes characterizing transcript profiles for multiple post-treatment sampling periods and categorizing genes with common patterns of response using hierarchical clustering techniques. In addition, microarrays are also providing insights into developmental changes in gene expression associated with fibre and root elongation in cotton and maize, respectively. Technical and analytical limitations of microarrays are discussed and projects attempting to advance areas of microarray design and data analysis are highlighted. Finally, although much work remains, we conclude that microarrays are a valuable tool for the plant physiologist interested in the characterization and identification of individual genes and gene families with potential application in the fields of agriculture, horticulture and forestry.« less
2013-01-01
Background Analysis of global gene expression by DNA microarrays is widely used in experimental molecular biology. However, the complexity of such high-dimensional data sets makes it difficult to fully understand the underlying biological features present in the data. The aim of this study is to introduce a method for DNA microarray analysis that provides an intuitive interpretation of data through dimension reduction and pattern recognition. We present the first “Archetypal Analysis” of global gene expression. The analysis is based on microarray data from five integrated studies of Pseudomonas aeruginosa isolated from the airways of cystic fibrosis patients. Results Our analysis clustered samples into distinct groups with comprehensible characteristics since the archetypes representing the individual groups are closely related to samples present in the data set. Significant changes in gene expression between different groups identified adaptive changes of the bacteria residing in the cystic fibrosis lung. The analysis suggests a similar gene expression pattern between isolates with a high mutation rate (hypermutators) despite accumulation of different mutations for these isolates. This suggests positive selection in the cystic fibrosis lung environment, and changes in gene expression for these isolates are therefore most likely related to adaptation of the bacteria. Conclusions Archetypal analysis succeeded in identifying adaptive changes of P. aeruginosa. The combination of clustering and matrix factorization made it possible to reveal minor similarities among different groups of data, which other analytical methods failed to identify. We suggest that this analysis could be used to supplement current methods used to analyze DNA microarray data. PMID:24059747
Jain, Ruchi; Dey, Bappaditya; Tyagi, Anil K
2012-10-02
The Guinea pig (Cavia porcellus) is one of the most extensively used animal models to study infectious diseases. However, despite its tremendous contribution towards understanding the establishment, progression and control of a number of diseases in general and tuberculosis in particular, the lack of fully annotated guinea pig genome sequence as well as appropriate molecular reagents has severely hampered detailed genetic and immunological analysis in this animal model. By employing the cross-species hybridization technique, we have developed an oligonucleotide microarray with 44,000 features assembled from different mammalian species, which to the best of our knowledge is the first attempt to employ microarray to study the global gene expression profile in guinea pigs. To validate and demonstrate the merit of this microarray, we have studied, as an example, the expression profile of guinea pig lungs during the advanced phase of M. tuberculosis infection. A significant upregulation of 1344 genes and a marked down regulation of 1856 genes in the lungs identified a disease signature of pulmonary tuberculosis infection. We report the development of first comprehensive microarray for studying the global gene expression profile in guinea pigs and validation of its usefulness with tuberculosis as a case study. An important gap in the area of infectious diseases has been addressed and a valuable molecular tool is provided to optimally harness the potential of guinea pig model to develop better vaccines and therapies against human diseases.
Global gene expression in channel catfish after vaccination with an attenuated Edwardsiella ictaluri
USDA-ARS?s Scientific Manuscript database
To understand the global gene expression in channel catfish after immersion vaccination with an attenuated Edwardsiella ictaluri (AquaVac ESCTM), microarray analysis of 65,182 UniGene transcripts were performed. With a filter of false-discovery rate less than 0.05 and fold change greater than 2, a t...
Wimmer, Isabella; Tröscher, Anna R; Brunner, Florian; Rubino, Stephen J; Bien, Christian G; Weiner, Howard L; Lassmann, Hans; Bauer, Jan
2018-04-20
Formalin-fixed paraffin-embedded (FFPE) tissues are valuable resources commonly used in pathology. However, formalin fixation modifies nucleic acids challenging the isolation of high-quality RNA for genetic profiling. Here, we assessed feasibility and reliability of microarray studies analysing transcriptome data from fresh, fresh-frozen (FF) and FFPE tissues. We show that reproducible microarray data can be generated from only 2 ng FFPE-derived RNA. For RNA quality assessment, fragment size distribution (DV200) and qPCR proved most suitable. During RNA isolation, extending tissue lysis time to 10 hours reduced high-molecular-weight species, while additional incubation at 70 °C markedly increased RNA yields. Since FF- and FFPE-derived microarrays constitute different data entities, we used indirect measures to investigate gene signal variation and relative gene expression. Whole-genome analyses revealed high concordance rates, while reviewing on single-genes basis showed higher data variation in FFPE than FF arrays. Using an experimental model, gene set enrichment analysis (GSEA) of FFPE-derived microarrays and fresh tissue-derived RNA-Seq datasets yielded similarly affected pathways confirming the applicability of FFPE tissue in global gene expression analysis. Our study provides a workflow comprising RNA isolation, quality assessment and microarray profiling using minimal RNA input, thus enabling hypothesis-generating pathway analyses from limited amounts of precious, pathologically significant FFPE tissues.
Development and application of a DNA microarray-based yeast two-hybrid system
Suter, Bernhard; Fontaine, Jean-Fred; Yildirimman, Reha; Raskó, Tamás; Schaefer, Martin H.; Rasche, Axel; Porras, Pablo; Vázquez-Álvarez, Blanca M.; Russ, Jenny; Rau, Kirstin; Foulle, Raphaele; Zenkner, Martina; Saar, Kathrin; Herwig, Ralf; Andrade-Navarro, Miguel A.; Wanker, Erich E.
2013-01-01
The yeast two-hybrid (Y2H) system is the most widely applied methodology for systematic protein–protein interaction (PPI) screening and the generation of comprehensive interaction networks. We developed a novel Y2H interaction screening procedure using DNA microarrays for high-throughput quantitative PPI detection. Applying a global pooling and selection scheme to a large collection of human open reading frames, proof-of-principle Y2H interaction screens were performed for the human neurodegenerative disease proteins huntingtin and ataxin-1. Using systematic controls for unspecific Y2H results and quantitative benchmarking, we identified and scored a large number of known and novel partner proteins for both huntingtin and ataxin-1. Moreover, we show that this parallelized screening procedure and the global inspection of Y2H interaction data are uniquely suited to define specific PPI patterns and their alteration by disease-causing mutations in huntingtin and ataxin-1. This approach takes advantage of the specificity and flexibility of DNA microarrays and of the existence of solid-related statistical methods for the analysis of DNA microarray data, and allows a quantitative approach toward interaction screens in human and in model organisms. PMID:23275563
Kawaura, Kanako; Mochida, Keiichi; Yamazaki, Yukiko; Ogihara, Yasunari
2006-04-01
In this study, we constructed a 22k wheat oligo-DNA microarray. A total of 148,676 expressed sequence tags of common wheat were collected from the database of the Wheat Genomics Consortium of Japan. These were grouped into 34,064 contigs, which were then used to design an oligonucleotide DNA microarray. Following a multistep selection of the sense strand, 21,939 60-mer oligo-DNA probes were selected for attachment on the microarray slide. This 22k oligo-DNA microarray was used to examine the transcriptional response of wheat to salt stress. More than 95% of the probes gave reproducible hybridization signals when targeted with RNAs extracted from salt-treated wheat shoots and roots. With the microarray, we identified 1,811 genes whose expressions changed more than 2-fold in response to salt. These included genes known to mediate response to salt, as well as unknown genes, and they were classified into 12 major groups by hierarchical clustering. These gene expression patterns were also confirmed by real-time reverse transcription-PCR. Many of the genes with unknown function were clustered together with genes known to be involved in response to salt stress. Thus, analysis of gene expression patterns combined with gene ontology should help identify the function of the unknown genes. Also, functional analysis of these wheat genes should provide new insight into the response to salt stress. Finally, these results indicate that the 22k oligo-DNA microarray is a reliable method for monitoring global gene expression patterns in wheat.
Stephenson, Kathryn E.; Neubauer, George H.; Reimer, Ulf; ...
2014-11-14
An effective vaccine against human immunodeficiency virus type 1 (HIV-1) will have to provide protection against a vast array of different HIV-1 strains. Current methods to measure HIV-1-specific binding antibodies following immunization typically focus on determining the magnitude of antibody responses, but the epitope diversity of antibody responses has remained largely unexplored. Here we describe the development of a global HIV-1 peptide microarray that contains 6564 peptides from across the HIV-1 proteome and covers the majority of HIV-1 sequences in the Los Alamos National Laboratory global HIV-1 sequence database. Using this microarray, we quantified the magnitude, breadth, and depth ofmore » IgG binding to linear HIV-1 sequences in HIV-1-infected humans and HIV-1-vaccinated humans, rhesus monkeys and guinea pigs. The microarray measured potentially important differences in antibody epitope diversity, particularly regarding the depth of epitope variants recognized at each binding site. Our data suggest that the global HIV-1 peptide microarray may be a useful tool for both preclinical and clinical HIV-1 research.« less
DOE Office of Scientific and Technical Information (OSTI.GOV)
Stephenson, Kathryn E.; Neubauer, George H.; Reimer, Ulf
An effective vaccine against human immunodeficiency virus type 1 (HIV-1) will have to provide protection against a vast array of different HIV-1 strains. Current methods to measure HIV-1-specific binding antibodies following immunization typically focus on determining the magnitude of antibody responses, but the epitope diversity of antibody responses has remained largely unexplored. Here we describe the development of a global HIV-1 peptide microarray that contains 6564 peptides from across the HIV-1 proteome and covers the majority of HIV-1 sequences in the Los Alamos National Laboratory global HIV-1 sequence database. Using this microarray, we quantified the magnitude, breadth, and depth ofmore » IgG binding to linear HIV-1 sequences in HIV-1-infected humans and HIV-1-vaccinated humans, rhesus monkeys and guinea pigs. The microarray measured potentially important differences in antibody epitope diversity, particularly regarding the depth of epitope variants recognized at each binding site. Our data suggest that the global HIV-1 peptide microarray may be a useful tool for both preclinical and clinical HIV-1 research.« less
Lo, Miranda; Cordwell, Stuart J; Bulach, Dieter M; Adler, Ben
2009-12-08
Leptospirosis is a global zoonosis affecting millions of people annually. Transcriptional changes in response to temperature were previously investigated using microarrays to identify genes potentially expressed upon host entry. Past studies found that various leptospiral outer membrane proteins are differentially expressed at different temperatures. However, our microarray studies highlighted a divergence between protein abundance and transcript levels for some proteins. Given the abundance of post-transcriptional expression control mechanisms, this finding highlighted the importance of global protein analysis systems. To complement our previous transcription study, we evaluated differences in the proteins of the leptospiral outer membrane fraction in response to temperature upshift. Outer membrane protein-enriched fractions from Leptospira interrogans grown at 30 degrees C or overnight upshift to 37 degrees C were isolated and the relative abundance of each protein was determined by iTRAQ analysis coupled with two-dimensional liquid chromatography and tandem mass spectrometry (2-DLC/MS-MS). We identified 1026 proteins with 99% confidence; 27 and 66 were present at elevated and reduced abundance respectively. Protein abundance changes were compared with transcriptional differences determined from the microarray studies. While there was some correlation between the microarray and iTRAQ data, a subset of genes that showed no differential expression by microarray was found to encode temperature-regulated proteins. This set of genes is of particular interest as it is likely that regulation of their expression occurs post-transcriptionally, providing an opportunity to develop hypotheses about the molecular dynamics of the outer membrane of Leptospira in response to changing environments. This is the first study to compare transcriptional and translational responses to temperature shift in L. interrogans. The results thus provide an insight into the mechanisms used by L. interrogans to adapt to conditions encountered in the host and to cause disease. Our results suggest down-regulation of protein expression in response to temperature, and decreased expression of outer membrane proteins may facilitate minimal interaction with host immune mechanisms.
Mutual information estimation reveals global associations between stimuli and biological processes
Suzuki, Taiji; Sugiyama, Masashi; Kanamori, Takafumi; Sese, Jun
2009-01-01
Background Although microarray gene expression analysis has become popular, it remains difficult to interpret the biological changes caused by stimuli or variation of conditions. Clustering of genes and associating each group with biological functions are often used methods. However, such methods only detect partial changes within cell processes. Herein, we propose a method for discovering global changes within a cell by associating observed conditions of gene expression with gene functions. Results To elucidate the association, we introduce a novel feature selection method called Least-Squares Mutual Information (LSMI), which computes mutual information without density estimaion, and therefore LSMI can detect nonlinear associations within a cell. We demonstrate the effectiveness of LSMI through comparison with existing methods. The results of the application to yeast microarray datasets reveal that non-natural stimuli affect various biological processes, whereas others are no significant relation to specific cell processes. Furthermore, we discover that biological processes can be categorized into four types according to the responses of various stimuli: DNA/RNA metabolism, gene expression, protein metabolism, and protein localization. Conclusion We proposed a novel feature selection method called LSMI, and applied LSMI to mining the association between conditions of yeast and biological processes through microarray datasets. In fact, LSMI allows us to elucidate the global organization of cellular process control. PMID:19208155
The Importance of Normalization on Large and Heterogeneous Microarray Datasets
DNA microarray technology is a powerful functional genomics tool increasingly used for investigating global gene expression in environmental studies. Microarrays can also be used in identifying biological networks, as they give insight on the complex gene-to-gene interactions, ne...
DNA microarray analysis is plagued by a lack of data reproducibility and by limits to the detectability of transcripts by hybridization. To mitigate these limitations, we employed transcriptional coupling within the S. typhimurium genome. This genome has 2664 transcriptionally co...
Biomarkers of the Hedgehog/Smoothened pathway in healthy volunteers
Kadam, Sunil K; Patel, Bharvin K R; Jones, Emma; Nguyen, Tuan S; Verma, Lalit K; Landschulz, Katherine T; Stepaniants, Sergey; Li, Bin; Brandt, John T; Brail, Leslie H
2012-01-01
The Hedgehog (Hh) pathway is involved in oncogenic transformation and tumor maintenance. The primary objective of this study was to select surrogate tissue to measure messenger ribonucleic acid (mRNA) levels of Hh pathway genes for measurement of pharmacodynamic effect. Expression of Hh pathway specific genes was measured by quantitative real time polymerase chain reaction (qRT-PCR) and global gene expression using Affymetrix U133 microarrays. Correlations were made between the expression of specific genes determined by qRT-PCR and normalized microarray data. Gene ontology analysis using microarray data for a broader set of Hh pathway genes was performed to identify additional Hh pathway-related markers in the surrogate tissue. RNA extracted from blood, hair follicle, and skin obtained from healthy subjects was analyzed by qRT-PCR for 31 genes, whereas 8 samples were analyzed for a 7-gene subset. Twelve sample sets, each with ≤500 ng total RNA derived from hair, skin, and blood, were analyzed using Affymetrix U133 microarrays. Transcripts for several Hh pathway genes were undetectable in blood using qRT-PCR. Skin was the most desirable matrix, followed by hair follicle. Whether processed by robust multiarray average or microarray suite 5 (MAS5), expression patterns of individual samples showed co-clustered signals; both normalization methods were equally effective for unsupervised analysis. The MAS5- normalized probe sets appeared better suited for supervised analysis. This work provides the basis for selection of a surrogate tissue and an expression analysis-based approach to evaluate pathway-related genes as markers of pharmacodynamic effect with novel inhibitors of the Hh pathway. PMID:22611475
Cloud-scale genomic signals processing classification analysis for gene expression microarray data.
Harvey, Benjamin; Soo-Yeon Ji
2014-01-01
As microarray data available to scientists continues to increase in size and complexity, it has become overwhelmingly important to find multiple ways to bring inference though analysis of DNA/mRNA sequence data that is useful to scientists. Though there have been many attempts to elucidate the issue of bringing forth biological inference by means of wavelet preprocessing and classification, there has not been a research effort that focuses on a cloud-scale classification analysis of microarray data using Wavelet thresholding in a Cloud environment to identify significantly expressed features. This paper proposes a novel methodology that uses Wavelet based Denoising to initialize a threshold for determination of significantly expressed genes for classification. Additionally, this research was implemented and encompassed within cloud-based distributed processing environment. The utilization of Cloud computing and Wavelet thresholding was used for the classification 14 tumor classes from the Global Cancer Map (GCM). The results proved to be more accurate than using a predefined p-value for differential expression classification. This novel methodology analyzed Wavelet based threshold features of gene expression in a Cloud environment, furthermore classifying the expression of samples by analyzing gene patterns, which inform us of biological processes. Moreover, enabling researchers to face the present and forthcoming challenges that may arise in the analysis of data in functional genomics of large microarray datasets.
A microarray analysis of potential genes underlying the neurosensitivity of mice to propofol.
Lowes, Damon A; Galley, Helen F; Lowe, Peter R; Rikke, Brad A; Johnson, Thomas E; Webster, Nigel R
2005-09-01
Establishing the mechanism of action of general anesthetics at the molecular level is difficult because of the multiple targets with which these drugs are associated. Inbred short sleep (ISS) and long sleep (ILS) mice are differentially sensitive in response to ethanol and other sedative hypnotics and contain a single quantitative trait locus (Lorp1) that accounts for the genetic variance of loss-of-righting reflex in response to propofol (LORP). In this study, we used high-density oligonucleotide microarrays to identify global gene expression and candidate genes differentially expressed within the Lorp1 region that may give insight into the molecular mechanism underlying LORP. Microarray analysis was performed using Affymetrix MG-U74Av2 Genechips and a selection of differentially expressed genes was confirmed by semiquantitative reverse transcription-polymerase chain reaction. Global expression in the brains of ILS and ISS mice revealed 3423 genes that were significantly expressed, of which 139 (4%) were differentially expressed. Analysis of genes located within the Lorp1 region showed that 26 genes were significantly expressed and that just 2 genes (7%) were differentially expressed. These genes encoded for the proteins AWP1 (associated with protein kinase 1) and "BTB (POZ) domain containing 1," whose functions are largely uncharacterized. Genes differentially expressed outside Lorp1 included seven genes with previously characterized neuronal functions and thus stand out as additional candidate genes that may be involved in mediating the neurosensitivity differences between ISS and ILS.
Shi, Xiang Yang; Dumenyo, C Korsi; Hernandez-Martinez, Rufina; Azad, Hamid; Cooksey, Donald A
2007-11-01
Many virulence genes in plant bacterial pathogens are coordinately regulated by "global" regulatory genes. Conducting DNA microarray analysis of bacterial mutants of such genes, compared with the wild type, can help to refine the list of genes that may contribute to virulence in bacterial pathogens. The regulatory gene algU, with roles in stress response and regulation of the biosynthesis of the exopolysaccharide alginate in Pseudomonas aeruginosa and many other bacteria, has been extensively studied. The role of algU in Xylella fastidiosa, the cause of Pierce's disease of grapevines, was analyzed by mutation and whole-genome microarray analysis to define its involvement in aggregation, biofilm formation, and virulence. In this study, an algU::nptII mutant had reduced cell-cell aggregation, attachment, and biofilm formation and lower virulence in grapevines. Microarray analysis showed that 42 genes had significantly lower expression in the algU::nptII mutant than in the wild type. Among these are several genes that could contribute to cell aggregation and biofilm formation, as well as other physiological processes such as virulence, competition, and survival.
USDA-ARS?s Scientific Manuscript database
In the current study, we compared chicken gene transcriptional profiles following primary and secondary infections with Eimeria acervulina using a 9.6K avian intestinal intraepithelial lymphocyte cDNA microarray (AVIELA). Gene Ontology analysis showed that primary infection significantly modulated ...
Harvey, Benjamin Simeon; Ji, Soo-Yeon
2017-01-01
As microarray data available to scientists continues to increase in size and complexity, it has become overwhelmingly important to find multiple ways to bring forth oncological inference to the bioinformatics community through the analysis of large-scale cancer genomic (LSCG) DNA and mRNA microarray data that is useful to scientists. Though there have been many attempts to elucidate the issue of bringing forth biological interpretation by means of wavelet preprocessing and classification, there has not been a research effort that focuses on a cloud-scale distributed parallel (CSDP) separable 1-D wavelet decomposition technique for denoising through differential expression thresholding and classification of LSCG microarray data. This research presents a novel methodology that utilizes a CSDP separable 1-D method for wavelet-based transformation in order to initialize a threshold which will retain significantly expressed genes through the denoising process for robust classification of cancer patients. Additionally, the overall study was implemented and encompassed within CSDP environment. The utilization of cloud computing and wavelet-based thresholding for denoising was used for the classification of samples within the Global Cancer Map, Cancer Cell Line Encyclopedia, and The Cancer Genome Atlas. The results proved that separable 1-D parallel distributed wavelet denoising in the cloud and differential expression thresholding increased the computational performance and enabled the generation of higher quality LSCG microarray datasets, which led to more accurate classification results.
The effect of column purification on cDNA indirect labelling for microarrays
Molas, M Lia; Kiss, John Z
2007-01-01
Background The success of the microarray reproducibility is dependent upon the performance of standardized procedures. Since the introduction of microarray technology for the analysis of global gene expression, reproducibility of results among different laboratories has been a major problem. Two of the main contributors to this variability are the use of different microarray platforms and different laboratory practices. In this paper, we address the latter question in terms of how variation in one of the steps of a labelling procedure affects the cDNA product prior to microarray hybridization. Results We used a standard procedure to label cDNA for microarray hybridization and employed different types of column chromatography for cDNA purification. After purifying labelled cDNA, we used the Agilent 2100 Bioanalyzer and agarose gel electrophoresis to assess the quality of the labelled cDNA before its hybridization onto a microarray platform. There were major differences in the cDNA profile (i.e. cDNA fragment lengths and abundance) as a result of using four different columns for purification. In addition, different columns have different efficiencies to remove rRNA contamination. This study indicates that the appropriate column to use in this type of protocol has to be experimentally determined. Finally, we present new evidence establishing the importance of testing the method of purification used during an indirect labelling procedure. Our results confirm the importance of assessing the quality of the sample in the labelling procedure prior to hybridization onto a microarray platform. Conclusion Standardization of column purification systems to be used in labelling procedures will improve the reproducibility of microarray results among different laboratories. In addition, implementation of a quality control check point of the labelled samples prior to microarray hybridization will prevent hybridizing a poor quality sample to expensive micorarrays. PMID:17597522
The effect of column purification on cDNA indirect labelling for microarrays.
Molas, M Lia; Kiss, John Z
2007-06-27
The success of the microarray reproducibility is dependent upon the performance of standardized procedures. Since the introduction of microarray technology for the analysis of global gene expression, reproducibility of results among different laboratories has been a major problem. Two of the main contributors to this variability are the use of different microarray platforms and different laboratory practices. In this paper, we address the latter question in terms of how variation in one of the steps of a labelling procedure affects the cDNA product prior to microarray hybridization. We used a standard procedure to label cDNA for microarray hybridization and employed different types of column chromatography for cDNA purification. After purifying labelled cDNA, we used the Agilent 2100 Bioanalyzer and agarose gel electrophoresis to assess the quality of the labelled cDNA before its hybridization onto a microarray platform. There were major differences in the cDNA profile (i.e. cDNA fragment lengths and abundance) as a result of using four different columns for purification. In addition, different columns have different efficiencies to remove rRNA contamination. This study indicates that the appropriate column to use in this type of protocol has to be experimentally determined. Finally, we present new evidence establishing the importance of testing the method of purification used during an indirect labelling procedure. Our results confirm the importance of assessing the quality of the sample in the labelling procedure prior to hybridization onto a microarray platform. Standardization of column purification systems to be used in labelling procedures will improve the reproducibility of microarray results among different laboratories. In addition, implementation of a quality control check point of the labelled samples prior to microarray hybridization will prevent hybridizing a poor quality sample to expensive micorarrays.
Global transcriptional responses of Bacillus subtilis to xenocoumacin 1.
Zhou, T; Zeng, H; Qiu, D; Yang, X; Wang, B; Chen, M; Guo, L; Wang, S
2011-09-01
To determine the global transcriptional response of Bacillus subtilis to an antimicrobial agent, xenocoumacin 1 (Xcn1). Subinhibitory concentration of Xcn1 applied to B. subtilis was measured according to Hutter's method for determining optimal concentrations. cDNA microarray technology was used to study the global transcriptional response of B. subtilis to Xcn1. Real-time RT-PCR was employed to verify alterations in the transcript levels of six genes. The subinhibitory concentration was determined to be 1 μg ml(-1). The microarray data demonstrated that Xcn1 treatment of B. subtilis led to more than a 2.0-fold up-regulation of 480 genes and more than a 2.0-fold down-regulation of 479 genes (q ≤ 0.05). The transcriptional responses of B. subtilis to Xcn1 were determined, and several processes were affected by Xcn1. Additionally, cluster analysis of gene expression profiles after treatment with Xcn1 or 37 previously studied antibiotics indicated that Xcn1 has similar mechanisms of action to protein synthesis inhibitors. These microarray data showed alterations of gene expression in B. subtilis after exposure to Xcn1. From the results, we identified various processes affected by Xcn1. This study provides a whole-genome perspective to elucidate the action of Xcn1 as a potential antimicrobial agent. © 2011 The Authors. Journal of Applied Microbiology © 2011 The Society for Applied Microbiology.
USDA-ARS?s Scientific Manuscript database
Technological developments in both the collection and analysis of molecular genetic data over the past few years have provided new opportunities for an improved understanding of the global response to pathogen exposure. Such developments are particularly dramatic for scientists studying the pig, whe...
Global Gene Expression Analysis of Yeast Cells during Sake Brewing▿ †
Wu, Hong; Zheng, Xiaohong; Araki, Yoshio; Sahara, Hiroshi; Takagi, Hiroshi; Shimoi, Hitoshi
2006-01-01
During the brewing of Japanese sake, Saccharomyces cerevisiae cells produce a high concentration of ethanol compared with other ethanol fermentation methods. We analyzed the gene expression profiles of yeast cells during sake brewing using DNA microarray analysis. This analysis revealed some characteristics of yeast gene expression during sake brewing and provided a scaffold for a molecular level understanding of the sake brewing process. PMID:16997994
Wolff, Alexander; Bayerlová, Michaela; Gaedcke, Jochen; Kube, Dieter; Beißbarth, Tim
2018-01-01
Pipeline comparisons for gene expression data are highly valuable for applied real data analyses, as they enable the selection of suitable analysis strategies for the dataset at hand. Such pipelines for RNA-Seq data should include mapping of reads, counting and differential gene expression analysis or preprocessing, normalization and differential gene expression in case of microarray analysis, in order to give a global insight into pipeline performances. Four commonly used RNA-Seq pipelines (STAR/HTSeq-Count/edgeR, STAR/RSEM/edgeR, Sailfish/edgeR, TopHat2/Cufflinks/CuffDiff)) were investigated on multiple levels (alignment and counting) and cross-compared with the microarray counterpart on the level of gene expression and gene ontology enrichment. For these comparisons we generated two matched microarray and RNA-Seq datasets: Burkitt Lymphoma cell line data and rectal cancer patient data. The overall mapping rate of STAR was 98.98% for the cell line dataset and 98.49% for the patient dataset. Tophat's overall mapping rate was 97.02% and 96.73%, respectively, while Sailfish had only an overall mapping rate of 84.81% and 54.44%. The correlation of gene expression in microarray and RNA-Seq data was moderately worse for the patient dataset (ρ = 0.67-0.69) than for the cell line dataset (ρ = 0.87-0.88). An exception were the correlation results of Cufflinks, which were substantially lower (ρ = 0.21-0.29 and 0.34-0.53). For both datasets we identified very low numbers of differentially expressed genes using the microarray platform. For RNA-Seq we checked the agreement of differentially expressed genes identified in the different pipelines and of GO-term enrichment results. In conclusion the combination of STAR aligner with HTSeq-Count followed by STAR aligner with RSEM and Sailfish generated differentially expressed genes best suited for the dataset at hand and in agreement with most of the other transcriptomics pipelines.
High throughput gene expression profiling: a molecular approach to integrative physiology
Liang, Mingyu; Cowley, Allen W; Greene, Andrew S
2004-01-01
Integrative physiology emphasizes the importance of understanding multiple pathways with overlapping, complementary, or opposing effects and their interactions in the context of intact organisms. The DNA microarray technology, the most commonly used method for high-throughput gene expression profiling, has been touted as an integrative tool that provides insights into regulatory pathways. However, the physiology community has been slow in acceptance of these techniques because of early failure in generating useful data and the lack of a cohesive theoretical framework in which experiments can be analysed. With recent advances in both technology and analysis, we propose a concept of multidimensional integration of physiology that incorporates data generated by DNA microarray and other functional, genomic, and proteomic approaches to achieve a truly integrative understanding of physiology. Analysis of several studies performed in simpler organisms or in mammalian model animals supports the feasibility of such multidimensional integration and demonstrates the power of DNA microarray as an indispensable molecular tool for such integration. Evaluation of DNA microarray techniques indicates that these techniques, despite limitations, have advanced to a point where the question-driven profiling research has become a feasible complement to the conventional, hypothesis-driven research. With a keen sense of homeostasis, global regulation, and quantitative analysis, integrative physiologists are uniquely positioned to apply these techniques to enhance the understanding of complex physiological functions. PMID:14678487
Phadtare, Sangita; Kato, Ikunoshin; Inouye, Masayori
2002-01-01
We carried out DNA microarray-based global transcript profiling of Escherichia coli in response to 4,5-dihydroxy-2-cyclopenten-1-one to explore the manifestation of its antibacterial activity. We show that it has widespread effects in E. coli affecting genes encoding proteins involved in cell metabolism and membrane synthesis and functions. Genes belonging to the regulon involved in synthesis of Cys are upregulated. In addition, rpoS and RpoS-regulated genes responding to various stresses and a number of genes responding to oxidative stress are upregulated. PMID:12426362
Microarray Meta-Analysis of RNA-Binding Protein Functions in Alternative Polyadenylation
Hu, Wenchao; Liu, Yuting; Yan, Jun
2014-01-01
Alternative polyadenylation (APA) is a post-transcriptional mechanism to generate diverse mRNA transcripts with different 3′UTRs from the same gene. In this study, we systematically searched for the APA events with differential expression in public mouse microarray data. Hundreds of genes with over-represented differential APA events and the corresponding experiments were identified. We further revealed that global APA differential expression occurred prevalently in tissues such as brain comparing to peripheral tissues, and biological processes such as development, differentiation and immune responses. Interestingly, we also observed widespread differential APA events in RNA-binding protein (RBP) genes such as Rbm3, Eif4e2 and Elavl1. Given the fact that RBPs are considered as the main regulators of differential APA expression, we constructed a co-expression network between APAs and RBPs using the microarray data. Further incorporation of CLIP-seq data of selected RBPs showed that Nova2 represses and Mbnl1 promotes the polyadenylation of closest poly(A) sites respectively. Altogether, our study is the first microarray meta-analysis in a mammal on the regulation of APA by RBPs that integrated massive mRNA expression data under a wide-range of biological conditions. Finally, we present our results as a comprehensive resource in an online website for the research community. PMID:24622240
BμG@Sbase—a microbial gene expression and comparative genomic database
Witney, Adam A.; Waldron, Denise E.; Brooks, Lucy A.; Tyler, Richard H.; Withers, Michael; Stoker, Neil G.; Wren, Brendan W.; Butcher, Philip D.; Hinds, Jason
2012-01-01
The reducing cost of high-throughput functional genomic technologies is creating a deluge of high volume, complex data, placing the burden on bioinformatics resources and tool development. The Bacterial Microarray Group at St George's (BμG@S) has been at the forefront of bacterial microarray design and analysis for over a decade and while serving as a hub of a global network of microbial research groups has developed BμG@Sbase, a microbial gene expression and comparative genomic database. BμG@Sbase (http://bugs.sgul.ac.uk/bugsbase/) is a web-browsable, expertly curated, MIAME-compliant database that stores comprehensive experimental annotation and multiple raw and analysed data formats. Consistent annotation is enabled through a structured set of web forms, which guide the user through the process following a set of best practices and controlled vocabulary. The database currently contains 86 expertly curated publicly available data sets (with a further 124 not yet published) and full annotation information for 59 bacterial microarray designs. The data can be browsed and queried using an explorer-like interface; integrating intuitive tree diagrams to present complex experimental details clearly and concisely. Furthermore the modular design of the database will provide a robust platform for integrating other data types beyond microarrays into a more Systems analysis based future. PMID:21948792
BμG@Sbase--a microbial gene expression and comparative genomic database.
Witney, Adam A; Waldron, Denise E; Brooks, Lucy A; Tyler, Richard H; Withers, Michael; Stoker, Neil G; Wren, Brendan W; Butcher, Philip D; Hinds, Jason
2012-01-01
The reducing cost of high-throughput functional genomic technologies is creating a deluge of high volume, complex data, placing the burden on bioinformatics resources and tool development. The Bacterial Microarray Group at St George's (BμG@S) has been at the forefront of bacterial microarray design and analysis for over a decade and while serving as a hub of a global network of microbial research groups has developed BμG@Sbase, a microbial gene expression and comparative genomic database. BμG@Sbase (http://bugs.sgul.ac.uk/bugsbase/) is a web-browsable, expertly curated, MIAME-compliant database that stores comprehensive experimental annotation and multiple raw and analysed data formats. Consistent annotation is enabled through a structured set of web forms, which guide the user through the process following a set of best practices and controlled vocabulary. The database currently contains 86 expertly curated publicly available data sets (with a further 124 not yet published) and full annotation information for 59 bacterial microarray designs. The data can be browsed and queried using an explorer-like interface; integrating intuitive tree diagrams to present complex experimental details clearly and concisely. Furthermore the modular design of the database will provide a robust platform for integrating other data types beyond microarrays into a more Systems analysis based future.
The relationship between chemical structure and biological activity has been examined for various compounds and endpoints for decades. To explore this question relative to global gene expression, we performed microarray analysis of Salmonella TA100 after treatment under condition...
Dai, Yilin; Guo, Ling; Li, Meng; Chen, Yi-Bu
2012-06-08
Microarray data analysis presents a significant challenge to researchers who are unable to use the powerful Bioconductor and its numerous tools due to their lack of knowledge of R language. Among the few existing software programs that offer a graphic user interface to Bioconductor packages, none have implemented a comprehensive strategy to address the accuracy and reliability issue of microarray data analysis due to the well known probe design problems associated with many widely used microarray chips. There is also a lack of tools that would expedite the functional analysis of microarray results. We present Microarray Я US, an R-based graphical user interface that implements over a dozen popular Bioconductor packages to offer researchers a streamlined workflow for routine differential microarray expression data analysis without the need to learn R language. In order to enable a more accurate analysis and interpretation of microarray data, we incorporated the latest custom probe re-definition and re-annotation for Affymetrix and Illumina chips. A versatile microarray results output utility tool was also implemented for easy and fast generation of input files for over 20 of the most widely used functional analysis software programs. Coupled with a well-designed user interface, Microarray Я US leverages cutting edge Bioconductor packages for researchers with no knowledge in R language. It also enables a more reliable and accurate microarray data analysis and expedites downstream functional analysis of microarray results.
RNA sequencing: current and prospective uses in metabolic research.
Vikman, Petter; Fadista, Joao; Oskolkov, Nikolay
2014-10-01
Previous global RNA analysis was restricted to known transcripts in species with a defined transcriptome. Next generation sequencing has transformed transcriptomics by making it possible to analyse expressed genes with an exon level resolution from any tissue in any species without any a priori knowledge of which genes that are being expressed, splice patterns or their nucleotide sequence. In addition, RNA sequencing is a more sensitive technique compared with microarrays with a larger dynamic range, and it also allows for investigation of imprinting and allele-specific expression. This can be done for a cost that is able to compete with that of a microarray, making RNA sequencing a technique available to most researchers. Therefore RNA sequencing has recently become the state of the art with regards to large-scale RNA investigations and has to a large extent replaced microarrays. The only drawback is the large data amounts produced, which together with the complexity of the data can make a researcher spend far more time on analysis than performing the actual experiment. © 2014 Society for Endocrinology.
Popescu, Sorina C.; Popescu, George V.; Bachan, Shawn; Zhang, Zimei; Seay, Montrell; Gerstein, Mark; Snyder, Michael; Dinesh-Kumar, S. P.
2007-01-01
Calmodulins (CaMs) are the most ubiquitous calcium sensors in eukaryotes. A number of CaM-binding proteins have been identified through classical methods, and many proteins have been predicted to bind CaMs based on their structural homology with known targets. However, multicellular organisms typically contain many CaM-like (CML) proteins, and a global identification of their targets and specificity of interaction is lacking. In an effort to develop a platform for large-scale analysis of proteins in plants we have developed a protein microarray and used it to study the global analysis of CaM/CML interactions. An Arabidopsis thaliana expression collection containing 1,133 ORFs was generated and used to produce proteins with an optimized medium-throughput plant-based expression system. Protein microarrays were prepared and screened with several CaMs/CMLs. A large number of previously known and novel CaM/CML targets were identified, including transcription factors, receptor and intracellular protein kinases, F-box proteins, RNA-binding proteins, and proteins of unknown function. Multiple CaM/CML proteins bound many binding partners, but the majority of targets were specific to one or a few CaMs/CMLs indicating that different CaM family members function through different targets. Based on our analyses, the emergent CaM/CML interactome is more extensive than previously predicted. Our results suggest that calcium functions through distinct CaM/CML proteins to regulate a wide range of targets and cellular activities. PMID:17360592
Klein, Hans-Ulrich; Ruckert, Christian; Kohlmann, Alexander; Bullinger, Lars; Thiede, Christian; Haferlach, Torsten; Dugas, Martin
2009-12-15
Multiple gene expression signatures derived from microarray experiments have been published in the field of leukemia research. A comparison of these signatures with results from new experiments is useful for verification as well as for interpretation of the results obtained. Currently, the percentage of overlapping genes is frequently used to compare published gene signatures against a signature derived from a new experiment. However, it has been shown that the percentage of overlapping genes is of limited use for comparing two experiments due to the variability of gene signatures caused by different array platforms or assay-specific influencing parameters. Here, we present a robust approach for a systematic and quantitative comparison of published gene expression signatures with an exemplary query dataset. A database storing 138 leukemia-related published gene signatures was designed. Each gene signature was manually annotated with terms according to a leukemia-specific taxonomy. Two analysis steps are implemented to compare a new microarray dataset with the results from previous experiments stored and curated in the database. First, the global test method is applied to assess gene signatures and to constitute a ranking among them. In a subsequent analysis step, the focus is shifted from single gene signatures to chromosomal aberrations or molecular mutations as modeled in the taxonomy. Potentially interesting disease characteristics are detected based on the ranking of gene signatures associated with these aberrations stored in the database. Two example analyses are presented. An implementation of the approach is freely available as web-based application. The presented approach helps researchers to systematically integrate the knowledge derived from numerous microarray experiments into the analysis of a new dataset. By means of example leukemia datasets we demonstrate that this approach detects related experiments as well as related molecular mutations and may help to interpret new microarray data.
Glycome Diagnosis of Human Induced Pluripotent Stem Cells Using Lectin Microarray*
Tateno, Hiroaki; Toyota, Masashi; Saito, Shigeru; Onuma, Yasuko; Ito, Yuzuru; Hiemori, Keiko; Fukumura, Mihoko; Matsushima, Asako; Nakanishi, Mio; Ohnuma, Kiyoshi; Akutsu, Hidenori; Umezawa, Akihiro; Horimoto, Katsuhisa; Hirabayashi, Jun; Asashima, Makoto
2011-01-01
Induced pluripotent stem cells (iPSCs) can now be produced from various somatic cell (SC) lines by ectopic expression of the four transcription factors. Although the procedure has been demonstrated to induce global change in gene and microRNA expressions and even epigenetic modification, it remains largely unknown how this transcription factor-induced reprogramming affects the total glycan repertoire expressed on the cells. Here we performed a comprehensive glycan analysis using 114 types of human iPSCs generated from five different SCs and compared their glycomes with those of human embryonic stem cells (ESCs; nine cell types) using a high density lectin microarray. In unsupervised cluster analysis of the results obtained by lectin microarray, both undifferentiated iPSCs and ESCs were clustered as one large group. However, they were clearly separated from the group of differentiated SCs, whereas all of the four SCs had apparently distinct glycome profiles from one another, demonstrating that SCs with originally distinct glycan profiles have acquired those similar to ESCs upon induction of pluripotency. Thirty-eight lectins discriminating between SCs and iPSCs/ESCs were statistically selected, and characteristic features of the pluripotent state were then obtained at the level of the cellular glycome. The expression profiles of relevant glycosyltransferase genes agreed well with the results obtained by lectin microarray. Among the 38 lectins, rBC2LCN was found to detect only undifferentiated iPSCs/ESCs and not differentiated SCs. Hence, the high density lectin microarray has proved to be valid for not only comprehensive analysis of glycans but also diagnosis of stem cells under the concept of the cellular glycome. PMID:21471226
Bock, I; Raveh-Amit, H; Losonczi, E; Carstea, A C; Feher, A; Mashayekhi, K; Matyas, S; Dinnyes, A; Pribenszky, C
2016-04-01
The efficiency of various assisted reproductive techniques can be improved by preconditioning the gametes and embryos with sublethal hydrostatic pressure treatment. However, the underlying molecular mechanism responsible for this protective effect remains unknown and requires further investigation. Here, we studied the effect of optimised hydrostatic pressure treatment on the global gene expression of mouse oocytes after embryonic genome activation. Based on a gene expression microarray analysis, a significant effect of treatment was observed in 4-cell embryos derived from treated oocytes, revealing a transcriptional footprint of hydrostatic pressure-affected genes. Functional analysis identified numerous genes involved in protein synthesis that were downregulated in 4-cell embryos in response to hydrostatic pressure treatment, suggesting that regulation of translation has a major role in optimised hydrostatic pressure-induced stress tolerance. We present a comprehensive microarray analysis and further delineate a potential mechanism responsible for the protective effect of hydrostatic pressure treatment.
BIOPHYSICAL PROPERTIES OF NUCLEIC ACIDS AT SURFACES RELEVANT TO MICROARRAY PERFORMANCE.
Rao, Archana N; Grainger, David W
2014-04-01
Both clinical and analytical metrics produced by microarray-based assay technology have recognized problems in reproducibility, reliability and analytical sensitivity. These issues are often attributed to poor understanding and control of nucleic acid behaviors and properties at solid-liquid interfaces. Nucleic acid hybridization, central to DNA and RNA microarray formats, depends on the properties and behaviors of single strand (ss) nucleic acids (e.g., probe oligomeric DNA) bound to surfaces. ssDNA's persistence length, radius of gyration, electrostatics, conformations on different surfaces and under various assay conditions, its chain flexibility and curvature, charging effects in ionic solutions, and fluorescent labeling all influence its physical chemistry and hybridization under assay conditions. Nucleic acid (e.g., both RNA and DNA) target interactions with immobilized ssDNA strands are highly impacted by these biophysical states. Furthermore, the kinetics, thermodynamics, and enthalpic and entropic contributions to DNA hybridization reflect global probe/target structures and interaction dynamics. Here we review several biophysical issues relevant to oligomeric nucleic acid molecular behaviors at surfaces and their influences on duplex formation that influence microarray assay performance. Correlation of biophysical aspects of single and double-stranded nucleic acids with their complexes in bulk solution is common. Such analysis at surfaces is not commonly reported, despite its importance to microarray assays. We seek to provide further insight into nucleic acid-surface challenges facing microarray diagnostic formats that have hindered their clinical adoption and compromise their research quality and value as genomics tools.
BIOPHYSICAL PROPERTIES OF NUCLEIC ACIDS AT SURFACES RELEVANT TO MICROARRAY PERFORMANCE
Rao, Archana N.; Grainger, David W.
2014-01-01
Both clinical and analytical metrics produced by microarray-based assay technology have recognized problems in reproducibility, reliability and analytical sensitivity. These issues are often attributed to poor understanding and control of nucleic acid behaviors and properties at solid-liquid interfaces. Nucleic acid hybridization, central to DNA and RNA microarray formats, depends on the properties and behaviors of single strand (ss) nucleic acids (e.g., probe oligomeric DNA) bound to surfaces. ssDNA’s persistence length, radius of gyration, electrostatics, conformations on different surfaces and under various assay conditions, its chain flexibility and curvature, charging effects in ionic solutions, and fluorescent labeling all influence its physical chemistry and hybridization under assay conditions. Nucleic acid (e.g., both RNA and DNA) target interactions with immobilized ssDNA strands are highly impacted by these biophysical states. Furthermore, the kinetics, thermodynamics, and enthalpic and entropic contributions to DNA hybridization reflect global probe/target structures and interaction dynamics. Here we review several biophysical issues relevant to oligomeric nucleic acid molecular behaviors at surfaces and their influences on duplex formation that influence microarray assay performance. Correlation of biophysical aspects of single and double-stranded nucleic acids with their complexes in bulk solution is common. Such analysis at surfaces is not commonly reported, despite its importance to microarray assays. We seek to provide further insight into nucleic acid-surface challenges facing microarray diagnostic formats that have hindered their clinical adoption and compromise their research quality and value as genomics tools. PMID:24765522
Jiménez-Guerrero, Irene; Acosta-Jurado, Sebastián; Navarro-Gómez, Pilar; López-Baena, Francisco Javier; Ollero, Francisco Javier
2017-01-01
Simultaneous quantification of transcripts of the whole bacterial genome allows the analysis of the global transcriptional response under changing conditions. RNA-seq and microarrays are the most used techniques to measure these transcriptomic changes, and both complement each other in transcriptome profiling. In this review, we exhaustively compiled the symbiosis-related transcriptomic reports (microarrays and RNA sequencing) carried out hitherto in rhizobia. This review is specially focused on transcriptomic changes that takes place when five rhizobial species, Bradyrhizobium japonicum (=diazoefficiens) USDA 110, Rhizobium leguminosarum biovar viciae 3841, Rhizobium tropici CIAT 899, Sinorhizobium (=Ensifer) meliloti 1021 and S. fredii HH103, recognize inducing flavonoids, plant-exuded phenolic compounds that activate the biosynthesis and export of Nod factors (NF) in all analysed rhizobia. Interestingly, our global transcriptomic comparison also indicates that each rhizobial species possesses its own arsenal of molecular weapons accompanying the set of NF in order to establish a successful interaction with host legumes. PMID:29267254
DOE Office of Scientific and Technical Information (OSTI.GOV)
Hatazawa, Yukino; Research Fellow of Japan Society for the Promotion of Science, Tokyo; Minami, Kimiko
The expression of the transcriptional coactivator PGC1α is increased in skeletal muscles during exercise. Previously, we showed that increased PGC1α leads to prolonged exercise performance (the duration for which running can be continued) and, at the same time, increases the expression of branched-chain amino acid (BCAA) metabolism-related enzymes and genes that are involved in supplying substrates for the TCA cycle. We recently created mice with PGC1α knockout specifically in the skeletal muscles (PGC1α KO mice), which show decreased mitochondrial content. In this study, global gene expression (microarray) analysis was performed in the skeletal muscles of PGC1α KO mice compared withmore » that of wild-type control mice. As a result, decreased expression of genes involved in the TCA cycle, oxidative phosphorylation, and BCAA metabolism were observed. Compared with previously obtained microarray data on PGC1α-overexpressing transgenic mice, each gene showed the completely opposite direction of expression change. Bioinformatic analysis of the promoter region of genes with decreased expression in PGC1α KO mice predicted the involvement of several transcription factors, including a nuclear receptor, ERR, in their regulation. As PGC1α KO microarray data in this study show opposing findings to the PGC1α transgenic data, a loss-of-function experiment, as well as a gain-of-function experiment, revealed PGC1α’s function in the oxidative energy metabolism of skeletal muscles. - Highlights: • Microarray analysis was performed in the skeletal muscle of PGC1α KO mice. • Expression of genes in the oxidative energy metabolism was decreased. • Bioinformatic analysis of promoter region of the genes predicted involvement of ERR. • PGC1α KO microarray data in this study show the mirror image of transgenic data.« less
Analysis of High-Throughput ELISA Microarray Data
DOE Office of Scientific and Technical Information (OSTI.GOV)
White, Amanda M.; Daly, Don S.; Zangar, Richard C.
Our research group develops analytical methods and software for the high-throughput analysis of quantitative enzyme-linked immunosorbent assay (ELISA) microarrays. ELISA microarrays differ from DNA microarrays in several fundamental aspects and most algorithms for analysis of DNA microarray data are not applicable to ELISA microarrays. In this review, we provide an overview of the steps involved in ELISA microarray data analysis and how the statistically sound algorithms we have developed provide an integrated software suite to address the needs of each data-processing step. The algorithms discussed are available in a set of open-source software tools (http://www.pnl.gov/statistics/ProMAT).
Zhu, Yuerong; Zhu, Yuelin; Xu, Wei
2008-01-01
Background Though microarray experiments are very popular in life science research, managing and analyzing microarray data are still challenging tasks for many biologists. Most microarray programs require users to have sophisticated knowledge of mathematics, statistics and computer skills for usage. With accumulating microarray data deposited in public databases, easy-to-use programs to re-analyze previously published microarray data are in high demand. Results EzArray is a web-based Affymetrix expression array data management and analysis system for researchers who need to organize microarray data efficiently and get data analyzed instantly. EzArray organizes microarray data into projects that can be analyzed online with predefined or custom procedures. EzArray performs data preprocessing and detection of differentially expressed genes with statistical methods. All analysis procedures are optimized and highly automated so that even novice users with limited pre-knowledge of microarray data analysis can complete initial analysis quickly. Since all input files, analysis parameters, and executed scripts can be downloaded, EzArray provides maximum reproducibility for each analysis. In addition, EzArray integrates with Gene Expression Omnibus (GEO) and allows instantaneous re-analysis of published array data. Conclusion EzArray is a novel Affymetrix expression array data analysis and sharing system. EzArray provides easy-to-use tools for re-analyzing published microarray data and will help both novice and experienced users perform initial analysis of their microarray data from the location of data storage. We believe EzArray will be a useful system for facilities with microarray services and laboratories with multiple members involved in microarray data analysis. EzArray is freely available from . PMID:18218103
Analysis of Protein Expression in Cell Microarrays: A Tool for Antibody-based Proteomics
Andersson, Ann-Catrin; Strömberg, Sara; Bäckvall, Helena; Kampf, Caroline; Uhlen, Mathias; Wester, Kenneth; Pontén, Fredrik
2006-01-01
Tissue microarray (TMA) technology provides a possibility to explore protein expression patterns in a multitude of normal and disease tissues in a high-throughput setting. Although TMAs have been used for analysis of tissue samples, robust methods for studying in vitro cultured cell lines and cell aspirates in a TMA format have been lacking. We have adopted a technique to homogeneously distribute cells in an agarose gel matrix, creating an artificial tissue. This enables simultaneous profiling of protein expression in suspension- and adherent-grown cell samples assembled in a microarray. In addition, the present study provides an optimized strategy for the basic laboratory steps to efficiently produce TMAs. Presented modifications resulted in an improved quality of specimens and a higher section yield compared with standard TMA production protocols. Sections from the generated cell TMAs were tested for immunohistochemical staining properties using 20 well-characterized antibodies. Comparison of immunoreactivity in cultured dispersed cells and corresponding cells in tissue samples showed congruent results for all tested antibodies. We conclude that a modified TMA technique, including cell samples, provides a valuable tool for high-throughput analysis of protein expression, and that this technique can be used for global approaches to explore the human proteome. PMID:16957166
Song, Yajian; Xue, Yanfen; Ma, Yanhe
2013-01-01
The alkaliphilic hemicellulolytic bacterium Bacillus sp. N16-5 has a broad substrate spectrum and exhibits the capacity to utilize complex carbohydrates such as galactomannan, xylan, and pectin. In the monosaccharide mixture, sequential utilization by Bacillus sp. N16-5 was observed. Glucose appeared to be its preferential monosaccharide, followed by fructose, mannose, arabinose, xylose, and galactose. Global transcription profiles of the strain were determined separately for growth on six monosaccharides (glucose, fructose, mannose, galactose, arabinose, and xylose) and four polysaccharides (galactomannan, xylan, pectin, and sodium carboxymethylcellulose) using one-color microarrays. Numerous genes potentially related to polysaccharide degradation, sugar transport, and monosaccharide metabolism were found to respond to a specific substrate. Putative gene clusters for different carbohydrates were identified according to transcriptional patterns and genome annotation. Identification and analysis of these gene clusters contributed to pathway reconstruction for carbohydrate utilization in Bacillus sp. N16-5. Several genes encoding putative sugar transporters were highly expressed during growth on specific sugars, suggesting their functional roles. Two phosphoenolpyruvate-dependent phosphotransferase systems were identified as candidate transporters for mannose and fructose, and a major facilitator superfamily transporter was identified as a candidate transporter for arabinose and xylose. Five carbohydrate uptake transporter 1 family ATP-binding cassette transporters were predicted to participate in the uptake of hemicellulose and pectin degradation products. Collectively, microarray data improved the pathway reconstruction involved in carbohydrate utilization of Bacillus sp. N16-5 and revealed that the organism precisely regulates gene transcription in response to fluctuations in energy resources. PMID:23326578
Friedrich, Torben; Rahmann, Sven; Weigel, Wilfried; Rabsch, Wolfgang; Fruth, Angelika; Ron, Eliora; Gunzer, Florian; Dandekar, Thomas; Hacker, Jörg; Müller, Tobias; Dobrindt, Ulrich
2010-10-21
The Enterobacteriaceae comprise a large number of clinically relevant species with several individual subspecies. Overlapping virulence-associated gene pools and the high overall genome plasticity often interferes with correct enterobacterial strain typing and risk assessment. Array technology offers a fast, reproducible and standardisable means for bacterial typing and thus provides many advantages for bacterial diagnostics, risk assessment and surveillance. The development of highly discriminative broad-range microbial diagnostic microarrays remains a challenge, because of marked genome plasticity of many bacterial pathogens. We developed a DNA microarray for strain typing and detection of major antimicrobial resistance genes of clinically relevant enterobacteria. For this purpose, we applied a global genome-wide probe selection strategy on 32 available complete enterobacterial genomes combined with a regression model for pathogen classification. The discriminative power of the probe set was further tested in silico on 15 additional complete enterobacterial genome sequences. DNA microarrays based on the selected probes were used to type 92 clinical enterobacterial isolates. Phenotypic tests confirmed the array-based typing results and corroborate that the selected probes allowed correct typing and prediction of major antibiotic resistances of clinically relevant Enterobacteriaceae, including the subspecies level, e.g. the reliable distinction of different E. coli pathotypes. Our results demonstrate that the global probe selection approach based on longest common factor statistics as well as the design of a DNA microarray with a restricted set of discriminative probes enables robust discrimination of different enterobacterial variants and represents a proof of concept that can be adopted for diagnostics of a wide range of microbial pathogens. Our approach circumvents misclassifications arising from the application of virulence markers, which are highly affected by horizontal gene transfer. Moreover, a broad range of pathogens have been covered by an efficient probe set size enabling the design of high-throughput diagnostics.
Chemiluminescence microarrays in analytical chemistry: a critical review.
Seidel, Michael; Niessner, Reinhard
2014-09-01
Multi-analyte immunoassays on microarrays and on multiplex DNA microarrays have been described for quantitative analysis of small organic molecules (e.g., antibiotics, drugs of abuse, small molecule toxins), proteins (e.g., antibodies or protein toxins), and microorganisms, viruses, and eukaryotic cells. In analytical chemistry, multi-analyte detection by use of analytical microarrays has become an innovative research topic because of the possibility of generating several sets of quantitative data for different analyte classes in a short time. Chemiluminescence (CL) microarrays are powerful tools for rapid multiplex analysis of complex matrices. A wide range of applications for CL microarrays is described in the literature dealing with analytical microarrays. The motivation for this review is to summarize the current state of CL-based analytical microarrays. Combining analysis of different compound classes on CL microarrays reduces analysis time, cost of reagents, and use of laboratory space. Applications are discussed, with examples from food safety, water safety, environmental monitoring, diagnostics, forensics, toxicology, and biosecurity. The potential and limitations of research on multiplex analysis by use of CL microarrays are discussed in this review.
mRNA expression profiling of laser microbeam microdissected cells from slender embryonic structures.
Scheidl, Stefan J; Nilsson, Sven; Kalén, Mattias; Hellström, Mats; Takemoto, Minoru; Håkansson, Joakim; Lindahl, Per
2002-03-01
Microarray hybridization has rapidly evolved as an important tool for genomic studies and studies of gene regulation at the transcriptome level. Expression profiles from homogenous samples such as yeast and mammalian cell cultures are currently extending our understanding of biology, whereas analyses of multicellular organisms are more difficult because of tissue complexity. The combination of laser microdissection, RNA amplification, and microarray hybridization has the potential to provide expression profiles from selected populations of cells in vivo. In this article, we present and evaluate an experimental procedure for global gene expression analysis of slender embryonic structures using laser microbeam microdissection and laser pressure catapulting. As a proof of principle, expression profiles from 1000 cells in the mouse embryonic (E9.5) dorsal aorta were generated and compared with profiles for captured mesenchymal cells located one cell diameter further away from the aortic lumen. A number of genes were overexpressed in the aorta, including 11 previously known markers for blood vessels. Among the blood vessel markers were endoglin, tie-2, PDGFB, and integrin-beta1, that are important regulators of blood vessel formation. This demonstrates that microarray analysis of laser microbeam micro-dissected cells is sufficiently sensitive for identifying genes with regulative functions.
Microarray missing data imputation based on a set theoretic framework and biological knowledge.
Gan, Xiangchao; Liew, Alan Wee-Chung; Yan, Hong
2006-01-01
Gene expressions measured using microarrays usually suffer from the missing value problem. However, in many data analysis methods, a complete data matrix is required. Although existing missing value imputation algorithms have shown good performance to deal with missing values, they also have their limitations. For example, some algorithms have good performance only when strong local correlation exists in data while some provide the best estimate when data is dominated by global structure. In addition, these algorithms do not take into account any biological constraint in their imputation. In this paper, we propose a set theoretic framework based on projection onto convex sets (POCS) for missing data imputation. POCS allows us to incorporate different types of a priori knowledge about missing values into the estimation process. The main idea of POCS is to formulate every piece of prior knowledge into a corresponding convex set and then use a convergence-guaranteed iterative procedure to obtain a solution in the intersection of all these sets. In this work, we design several convex sets, taking into consideration the biological characteristic of the data: the first set mainly exploit the local correlation structure among genes in microarray data, while the second set captures the global correlation structure among arrays. The third set (actually a series of sets) exploits the biological phenomenon of synchronization loss in microarray experiments. In cyclic systems, synchronization loss is a common phenomenon and we construct a series of sets based on this phenomenon for our POCS imputation algorithm. Experiments show that our algorithm can achieve a significant reduction of error compared to the KNNimpute, SVDimpute and LSimpute methods.
cDNA microarray analysis of esophageal cancer: discoveries and prospects.
Shimada, Yutaka; Sato, Fumiaki; Shimizu, Kazuharu; Tsujimoto, Gozoh; Tsukada, Kazuhiro
2009-07-01
Recent progress in molecular biology has revealed many genetic and epigenetic alterations that are involved in the development and progression of esophageal cancer. Microarray analysis has also revealed several genetic networks that are involved in esophageal cancer. However, clinical application of microarray techniques and use of microarray data have not yet occurred. In this review, we focus on the recent developments and problems with microarray analysis of esophageal cancer.
Biomarkers of Selenium Action in Prostate Cancer
2005-01-01
secretory by conventional methods according to published literature. In addition, we have determined the similarities and differences in global gene...transition zone tissue of a 42-year-old man ac- arrays in the resulting data tables were ordered by their cording to previously described methods [4]. The pre...hundred fifteen genes identified by ELISA method . Replicating the conditions used for the SAM analysis showed significant differential expres- microarray
Importing MAGE-ML format microarray data into BioConductor.
Durinck, Steffen; Allemeersch, Joke; Carey, Vincent J; Moreau, Yves; De Moor, Bart
2004-12-12
The microarray gene expression markup language (MAGE-ML) is a widely used XML (eXtensible Markup Language) standard for describing and exchanging information about microarray experiments. It can describe microarray designs, microarray experiment designs, gene expression data and data analysis results. We describe RMAGEML, a new Bioconductor package that provides a link between cDNA microarray data stored in MAGE-ML format and the Bioconductor framework for preprocessing, visualization and analysis of microarray experiments. http://www.bioconductor.org. Open Source.
Liu, Wan-Ting; Wang, Yang; Zhang, Jing; Ye, Fei; Huang, Xiao-Hui; Li, Bin; He, Qing-Yu
2018-07-01
Lung adenocarcinoma (LAC) is the most lethal cancer and the leading cause of cancer-related death worldwide. The identification of meaningful clusters of co-expressed genes or representative biomarkers may help improve the accuracy of LAC diagnoses. Public databases, such as the Gene Expression Omnibus (GEO), provide rich resources of valuable information for clinics, however, the integration of multiple microarray datasets from various platforms and institutes remained a challenge. To determine potential indicators of LAC, we performed genome-wide relative significance (GWRS), genome-wide global significance (GWGS) and support vector machine (SVM) analyses progressively to identify robust gene biomarker signatures from 5 different microarray datasets that included 330 samples. The top 200 genes with robust signatures were selected for integrative analysis according to "guilt-by-association" methods, including protein-protein interaction (PPI) analysis and gene co-expression analysis. Of these 200 genes, only 10 genes showed both intensive PPI network and high gene co-expression correlation (r > 0.8). IPA analysis of this regulatory networks suggested that the cell cycle process is a crucial determinant of LAC. CENPA, as well as two linked hub genes CDK1 and CDC20, are determined to be potential indicators of LAC. Immunohistochemical staining showed that CENPA, CDK1 and CDC20 were highly expressed in LAC cancer tissue with co-expression patterns. A Cox regression model indicated that LAC patients with CENPA + /CDK1 + and CENPA + /CDC20 + were high-risk groups in terms of overall survival. In conclusion, our integrated microarray analysis demonstrated that CENPA, CDK1 and CDC20 might serve as novel cluster of prognostic biomarkers for LAC, and the cooperative unit of three genes provides a technically simple approach for identification of LAC patients. Copyright © 2018 Elsevier B.V. All rights reserved.
Identification of significant features by the Global Mean Rank test.
Klammer, Martin; Dybowski, J Nikolaj; Hoffmann, Daniel; Schaab, Christoph
2014-01-01
With the introduction of omics-technologies such as transcriptomics and proteomics, numerous methods for the reliable identification of significantly regulated features (genes, proteins, etc.) have been developed. Experimental practice requires these tests to successfully deal with conditions such as small numbers of replicates, missing values, non-normally distributed expression levels, and non-identical distributions of features. With the MeanRank test we aimed at developing a test that performs robustly under these conditions, while favorably scaling with the number of replicates. The test proposed here is a global one-sample location test, which is based on the mean ranks across replicates, and internally estimates and controls the false discovery rate. Furthermore, missing data is accounted for without the need of imputation. In extensive simulations comparing MeanRank to other frequently used methods, we found that it performs well with small and large numbers of replicates, feature dependent variance between replicates, and variable regulation across features on simulation data and a recent two-color microarray spike-in dataset. The tests were then used to identify significant changes in the phosphoproteomes of cancer cells induced by the kinase inhibitors erlotinib and 3-MB-PP1 in two independently published mass spectrometry-based studies. MeanRank outperformed the other global rank-based methods applied in this study. Compared to the popular Significance Analysis of Microarrays and Linear Models for Microarray methods, MeanRank performed similar or better. Furthermore, MeanRank exhibits more consistent behavior regarding the degree of regulation and is robust against the choice of preprocessing methods. MeanRank does not require any imputation of missing values, is easy to understand, and yields results that are easy to interpret. The software implementing the algorithm is freely available for academic and commercial use.
Cell cycle arrest and gene expression profiling of testis in mice exposed to fluoride.
Su, Kai; Sun, Zilong; Niu, Ruiyan; Lei, Ying; Cheng, Jing; Wang, Jundong
2017-05-01
Exposure to fluoride results in low reproductive capacity; however, the mechanism underlying the impact of fluoride on male productive system still remains obscure. To assess the potential toxicity in testis of mice administrated with fluoride, global genome microarray and real-time PCR were performed to detect and identify the altered transcriptions. The results revealed that 763 differentially expressed genes were identified, including 330 up-regulated and 433 down-regulated genes, which were involved in spermatogenesis, apoptosis, DNA damage, DNA replication, and cell differentiation. Twelve differential expressed genes were selected to confirm the microarray results using real-time PCR, and the result kept the same tendency with that of microarray. Furthermore, compared with the control group, more apoptotic spermatogenic cells were observed in the fluoride group, and the spermatogonium were markedly increased in S phase and decreased in G2/M phase by fluoride. Our findings suggested global genome microarray provides an insight into the reproductive toxicity induced by fluoride, and several important biological clues for further investigations. © 2016 Wiley Periodicals, Inc. Environ Toxicol 32: 1558-1565, 2017. © 2016 Wiley Periodicals, Inc.
Killion, Patrick J; Sherlock, Gavin; Iyer, Vishwanath R
2003-01-01
Background The power of microarray analysis can be realized only if data is systematically archived and linked to biological annotations as well as analysis algorithms. Description The Longhorn Array Database (LAD) is a MIAME compliant microarray database that operates on PostgreSQL and Linux. It is a fully open source version of the Stanford Microarray Database (SMD), one of the largest microarray databases. LAD is available at Conclusions Our development of LAD provides a simple, free, open, reliable and proven solution for storage and analysis of two-color microarray data. PMID:12930545
mRNA Expression Profiling of Laser Microbeam Microdissected Cells from Slender Embryonic Structures
Scheidl, Stefan J.; Nilsson, Sven; Kalén, Mattias; Hellström, Mats; Takemoto, Minoru; Håkansson, Joakim; Lindahl, Per
2002-01-01
Microarray hybridization has rapidly evolved as an important tool for genomic studies and studies of gene regulation at the transcriptome level. Expression profiles from homogenous samples such as yeast and mammalian cell cultures are currently extending our understanding of biology, whereas analyses of multicellular organisms are more difficult because of tissue complexity. The combination of laser microdissection, RNA amplification, and microarray hybridization has the potential to provide expression profiles from selected populations of cells in vivo. In this article, we present and evaluate an experimental procedure for global gene expression analysis of slender embryonic structures using laser microbeam microdissection and laser pressure catapulting. As a proof of principle, expression profiles from 1000 cells in the mouse embryonic (E9.5) dorsal aorta were generated and compared with profiles for captured mesenchymal cells located one cell diameter further away from the aortic lumen. A number of genes were overexpressed in the aorta, including 11 previously known markers for blood vessels. Among the blood vessel markers were endoglin, tie-2, PDGFB, and integrin-β1, that are important regulators of blood vessel formation. This demonstrates that microarray analysis of laser microbeam micro-dissected cells is sufficiently sensitive for identifying genes with regulative functions. PMID:11891179
The Microarray Revolution: Perspectives from Educators
ERIC Educational Resources Information Center
Brewster, Jay L.; Beason, K. Beth; Eckdahl, Todd T.; Evans, Irene M.
2004-01-01
In recent years, microarray analysis has become a key experimental tool, enabling the analysis of genome-wide patterns of gene expression. This review approaches the microarray revolution with a focus upon four topics: 1) the early development of this technology and its application to cancer diagnostics; 2) a primer of microarray research,…
Barton, G; Abbott, J; Chiba, N; Huang, DW; Huang, Y; Krznaric, M; Mack-Smith, J; Saleem, A; Sherman, BT; Tiwari, B; Tomlinson, C; Aitman, T; Darlington, J; Game, L; Sternberg, MJE; Butcher, SA
2008-01-01
Background Microarray experimentation requires the application of complex analysis methods as well as the use of non-trivial computer technologies to manage the resultant large data sets. This, together with the proliferation of tools and techniques for microarray data analysis, makes it very challenging for a laboratory scientist to keep up-to-date with the latest developments in this field. Our aim was to develop a distributed e-support system for microarray data analysis and management. Results EMAAS (Extensible MicroArray Analysis System) is a multi-user rich internet application (RIA) providing simple, robust access to up-to-date resources for microarray data storage and analysis, combined with integrated tools to optimise real time user support and training. The system leverages the power of distributed computing to perform microarray analyses, and provides seamless access to resources located at various remote facilities. The EMAAS framework allows users to import microarray data from several sources to an underlying database, to pre-process, quality assess and analyse the data, to perform functional analyses, and to track data analysis steps, all through a single easy to use web portal. This interface offers distance support to users both in the form of video tutorials and via live screen feeds using the web conferencing tool EVO. A number of analysis packages, including R-Bioconductor and Affymetrix Power Tools have been integrated on the server side and are available programmatically through the Postgres-PLR library or on grid compute clusters. Integrated distributed resources include the functional annotation tool DAVID, GeneCards and the microarray data repositories GEO, CELSIUS and MiMiR. EMAAS currently supports analysis of Affymetrix 3' and Exon expression arrays, and the system is extensible to cater for other microarray and transcriptomic platforms. Conclusion EMAAS enables users to track and perform microarray data management and analysis tasks through a single easy-to-use web application. The system architecture is flexible and scalable to allow new array types, analysis algorithms and tools to be added with relative ease and to cope with large increases in data volume. PMID:19032776
An Introduction to MAMA (Meta-Analysis of MicroArray data) System.
Zhang, Zhe; Fenstermacher, David
2005-01-01
Analyzing microarray data across multiple experiments has been proven advantageous. To support this kind of analysis, we are developing a software system called MAMA (Meta-Analysis of MicroArray data). MAMA utilizes a client-server architecture with a relational database on the server-side for the storage of microarray datasets collected from various resources. The client-side is an application running on the end user's computer that allows the user to manipulate microarray data and analytical results locally. MAMA implementation will integrate several analytical methods, including meta-analysis within an open-source framework offering other developers the flexibility to plug in additional statistical algorithms.
Parafioriti, Antonina; Bason, Caterina; Armiraglio, Elisabetta; Calciano, Lucia; Daolio, Primo Andrea; Berardocco, Martina; Di Bernardo, Andrea; Colosimo, Alessia; Luksch, Roberto; Berardi, Anna C
2016-04-30
The molecular mechanism responsible for Ewing's Sarcoma (ES) remains largely unknown. MicroRNAs (miRNAs), a class of small non-coding RNAs able to regulate gene expression, are deregulated in tumors and may serve as a tool for diagnosis and prediction. However, the status of miRNAs in ES has not yet been thoroughly investigated. This study compared global miRNAs expression in paraffin-embedded tumor tissue samples from 20 ES patients, affected by primary untreated tumors, with miRNAs expressed in normal human mesenchymal stromal cells (MSCs) by microarray analysis. A miRTarBase database was used to identify the predicted target genes for differentially expressed miRNAs. The miRNAs microarray analysis revealed distinct patterns of miRNAs expression between ES samples and normal MSCs. 58 of the 954 analyzed miRNAs were significantly differentially expressed in ES samples compared to MSCs. Moreover, the qRT-PCR analysis carried out on three selected miRNAs showed that miR-181b, miR-1915 and miR-1275 were significantly aberrantly regulated, confirming the microarray results. Bio-database analysis identified BCL-2 as a bona fide target gene of the miR-21, miR-181a, miR-181b, miR-29a, miR-29b, miR-497, miR-195, miR-let-7a, miR-34a and miR-1915. Using paraffin-embedded tissues from ES patients, this study has identified several potential target miRNAs and one gene that might be considered a novel critical biomarker for ES pathogenesis.
Grubaugh, Nathan D.; McMenamy, Scott S.; Turell, Michael J.; Lee, John S.
2013-01-01
Background Arthropod-borne viruses are important emerging pathogens world-wide. Viruses transmitted by mosquitoes, such as dengue, yellow fever, and Japanese encephalitis viruses, infect hundreds of millions of people and animals each year. Global surveillance of these viruses in mosquito vectors using molecular based assays is critical for prevention and control of the associated diseases. Here, we report an oligonucleotide DNA microarray design, termed ArboChip5.1, for multi-gene detection and identification of mosquito-borne RNA viruses from the genera Flavivirus (family Flaviviridae), Alphavirus (Togaviridae), Orthobunyavirus (Bunyaviridae), and Phlebovirus (Bunyaviridae). Methodology/Principal Findings The assay utilizes targeted PCR amplification of three genes from each virus genus for electrochemical detection on a portable, field-tested microarray platform. Fifty-two viruses propagated in cell-culture were used to evaluate the specificity of the PCR primer sets and the ArboChip5.1 microarray capture probes. The microarray detected all of the tested viruses and differentiated between many closely related viruses such as members of the dengue, Japanese encephalitis, and Semliki Forest virus clades. Laboratory infected mosquitoes were used to simulate field samples and to determine the limits of detection. Additionally, we identified dengue virus type 3, Japanese encephalitis virus, Tembusu virus, Culex flavivirus, and a Quang Binh-like virus from mosquitoes collected in Thailand in 2011 and 2012. Conclusions/Significance We demonstrated that the described assay can be utilized in a comprehensive field surveillance program by the broad-range amplification and specific identification of arboviruses from infected mosquitoes. Furthermore, the microarray platform can be deployed in the field and viral RNA extraction to data analysis can occur in as little as 12 h. The information derived from the ArboChip5.1 microarray can help to establish public health priorities, detect disease outbreaks, and evaluate control programs. PMID:23967358
Linear model for fast background subtraction in oligonucleotide microarrays.
Kroll, K Myriam; Barkema, Gerard T; Carlon, Enrico
2009-11-16
One important preprocessing step in the analysis of microarray data is background subtraction. In high-density oligonucleotide arrays this is recognized as a crucial step for the global performance of the data analysis from raw intensities to expression values. We propose here an algorithm for background estimation based on a model in which the cost function is quadratic in a set of fitting parameters such that minimization can be performed through linear algebra. The model incorporates two effects: 1) Correlated intensities between neighboring features in the chip and 2) sequence-dependent affinities for non-specific hybridization fitted by an extended nearest-neighbor model. The algorithm has been tested on 360 GeneChips from publicly available data of recent expression experiments. The algorithm is fast and accurate. Strong correlations between the fitted values for different experiments as well as between the free-energy parameters and their counterparts in aqueous solution indicate that the model captures a significant part of the underlying physical chemistry.
Biologically relevant effects of mRNA amplification on gene expression profiles.
van Haaften, Rachel I M; Schroen, Blanche; Janssen, Ben J A; van Erk, Arie; Debets, Jacques J M; Smeets, Hubert J M; Smits, Jos F M; van den Wijngaard, Arthur; Pinto, Yigal M; Evelo, Chris T A
2006-04-11
Gene expression microarray technology permits the analysis of global gene expression profiles. The amount of sample needed limits the use of small excision biopsies and/or needle biopsies from human or animal tissues. Linear amplification techniques have been developed to increase the amount of sample derived cDNA. These amplified samples can be hybridised on microarrays. However, little information is available whether microarrays based on amplified and unamplified material yield comparable results. In the present study we compared microarray data obtained from amplified mRNA derived from biopsies of rat cardiac left ventricle and non-amplified mRNA derived from the same organ. Biopsies were linearly amplified to acquire enough material for a microarray experiment. Both amplified and unamplified samples were hybridized to the Rat Expression Set 230 Array of Affymetrix. Analysis of the microarray data showed that unamplified material of two different left ventricles had 99.6% identical gene expression. Gene expression patterns of two biopsies obtained from the same parental organ were 96.3% identical. Similarly, gene expression pattern of two biopsies from dissimilar organs were 92.8% identical to each other.Twenty-one percent of reporters called present in parental left ventricular tissue disappeared after amplification in the biopsies. Those reporters were predominantly seen in the low intensity range. Sequence analysis showed that reporters that disappeared after amplification had a GC-content of 53.7+/-4.0%, while reporters called present in biopsy- and whole LV-samples had an average GC content of 47.8+/-5.5% (P <0.001). Those reporters were also predicted to form significantly more (0.76+/-0.07 versus 0.38+/-0.1) and longer (9.4+/-0.3 versus 8.4+/-0.4) hairpins as compared to representative control reporters present before and after amplification. This study establishes that the gene expression profile obtained after amplification of mRNA of left ventricular biopsies is representative for the whole left ventricle of the rat heart. However, specific gene transcripts present in parental tissues were undetectable in the minute left ventricular biopsies. Transcripts that were lost due to the amplification process were not randomly distributed, but had higher GC-content and hairpins in the sequence and were mainly found in the lower intensity range which includes many transcription factors from specific signalling pathways.
Biologically relevant effects of mRNA amplification on gene expression profiles
van Haaften, Rachel IM; Schroen, Blanche; Janssen, Ben JA; van Erk, Arie; Debets, Jacques JM; Smeets, Hubert JM; Smits, Jos FM; van den Wijngaard, Arthur; Pinto, Yigal M; Evelo, Chris TA
2006-01-01
Background Gene expression microarray technology permits the analysis of global gene expression profiles. The amount of sample needed limits the use of small excision biopsies and/or needle biopsies from human or animal tissues. Linear amplification techniques have been developed to increase the amount of sample derived cDNA. These amplified samples can be hybridised on microarrays. However, little information is available whether microarrays based on amplified and unamplified material yield comparable results. In the present study we compared microarray data obtained from amplified mRNA derived from biopsies of rat cardiac left ventricle and non-amplified mRNA derived from the same organ. Biopsies were linearly amplified to acquire enough material for a microarray experiment. Both amplified and unamplified samples were hybridized to the Rat Expression Set 230 Array of Affymetrix. Results Analysis of the microarray data showed that unamplified material of two different left ventricles had 99.6% identical gene expression. Gene expression patterns of two biopsies obtained from the same parental organ were 96.3% identical. Similarly, gene expression pattern of two biopsies from dissimilar organs were 92.8% identical to each other. Twenty-one percent of reporters called present in parental left ventricular tissue disappeared after amplification in the biopsies. Those reporters were predominantly seen in the low intensity range. Sequence analysis showed that reporters that disappeared after amplification had a GC-content of 53.7+/-4.0%, while reporters called present in biopsy- and whole LV-samples had an average GC content of 47.8+/-5.5% (P <0.001). Those reporters were also predicted to form significantly more (0.76+/-0.07 versus 0.38+/-0.1) and longer (9.4+/-0.3 versus 8.4+/-0.4) hairpins as compared to representative control reporters present before and after amplification. Conclusion This study establishes that the gene expression profile obtained after amplification of mRNA of left ventricular biopsies is representative for the whole left ventricle of the rat heart. However, specific gene transcripts present in parental tissues were undetectable in the minute left ventricular biopsies. Transcripts that were lost due to the amplification process were not randomly distributed, but had higher GC-content and hairpins in the sequence and were mainly found in the lower intensity range which includes many transcription factors from specific signalling pathways. PMID:16608515
Bengtsson, Henrik; Jönsson, Göran; Vallon-Christersson, Johan
2004-11-12
Non-linearities in observed log-ratios of gene expressions, also known as intensity dependent log-ratios, can often be accounted for by global biases in the two channels being compared. Any step in a microarray process may introduce such offsets and in this article we study the biases introduced by the microarray scanner and the image analysis software. By scanning the same spotted oligonucleotide microarray at different photomultiplier tube (PMT) gains, we have identified a channel-specific bias present in two-channel microarray data. For the scanners analyzed it was in the range of 15-25 (out of 65,535). The observed bias was very stable between subsequent scans of the same array although the PMT gain was greatly adjusted. This indicates that the bias does not originate from a step preceding the scanner detector parts. The bias varies slightly between arrays. When comparing estimates based on data from the same array, but from different scanners, we have found that different scanners introduce different amounts of bias. So do various image analysis methods. We propose a scanning protocol and a constrained affine model that allows us to identify and estimate the bias in each channel. Backward transformation removes the bias and brings the channels to the same scale. The result is that systematic effects such as intensity dependent log-ratios are removed, but also that signal densities become much more similar. The average scan, which has a larger dynamical range and greater signal-to-noise ratio than individual scans, can then be obtained. The study shows that microarray scanners may introduce a significant bias in each channel. Such biases have to be calibrated for, otherwise systematic effects such as intensity dependent log-ratios will be observed. The proposed scanning protocol and calibration method is simple to use and is useful for evaluating scanner biases or for obtaining calibrated measurements with extended dynamical range and better precision. The cross-platform R package aroma, which implements all described methods, is available for free from http://www.maths.lth.se/bioinformatics/.
Gene expression profiling of two distinct neuronal populations in the rodent spinal cord.
Ryge, Jesper; Westerdahl, Ann-Charlotte; Alstrøm, Preben; Kiehn, Ole
2008-01-01
In the field of neuroscience microarray gene expression profiles on anatomically defined brain structures are being used increasingly to study both normal brain functions as well as pathological states. Fluorescent tracing techniques in brain tissue that identifies distinct neuronal populations can in combination with global gene expression profiling potentially increase the resolution and specificity of such studies to shed new light on neuronal functions at the cellular level. We examine the microarray gene expression profiles of two distinct neuronal populations in the spinal cord of the neonatal rat, the principal motor neurons and specific interneurons involved in motor control. The gene expression profiles of the respective cell populations were obtained from amplified mRNA originating from 50-250 fluorescently identified and laser microdissected cells. In the data analysis we combine a new microarray normalization procedure with a conglomerate measure of significant differential gene expression. Using our methodology we find 32 genes to be more expressed in the interneurons compared to the motor neurons that all except one have not previously been associated with this neuronal population. As a validation of our method we find 17 genes to be more expressed in the motor neurons than in the interneurons and of these only one had not previously been described in this population. We provide an optimized experimental protocol that allows isolation of gene transcripts from fluorescent retrogradely labeled cell populations in fresh tissue, which can be used to generate amplified aRNA for microarray hybridization from as few as 50 laser microdissected cells. Using this optimized experimental protocol in combination with our microarray analysis methodology we find 49 differentially expressed genes between the motor neurons and the interneurons that reflect the functional differences between these two cell populations in generating and transmitting the motor output in the rodent spinal cord.
Gene Expression Profiling of Two Distinct Neuronal Populations in the Rodent Spinal Cord
Alstrøm, Preben; Kiehn, Ole
2008-01-01
Background In the field of neuroscience microarray gene expression profiles on anatomically defined brain structures are being used increasingly to study both normal brain functions as well as pathological states. Fluorescent tracing techniques in brain tissue that identifies distinct neuronal populations can in combination with global gene expression profiling potentially increase the resolution and specificity of such studies to shed new light on neuronal functions at the cellular level. Methodology/Principal Findings We examine the microarray gene expression profiles of two distinct neuronal populations in the spinal cord of the neonatal rat, the principal motor neurons and specific interneurons involved in motor control. The gene expression profiles of the respective cell populations were obtained from amplified mRNA originating from 50–250 fluorescently identified and laser microdissected cells. In the data analysis we combine a new microarray normalization procedure with a conglomerate measure of significant differential gene expression. Using our methodology we find 32 genes to be more expressed in the interneurons compared to the motor neurons that all except one have not previously been associated with this neuronal population. As a validation of our method we find 17 genes to be more expressed in the motor neurons than in the interneurons and of these only one had not previously been described in this population. Conclusions/Significance We provide an optimized experimental protocol that allows isolation of gene transcripts from fluorescent retrogradely labeled cell populations in fresh tissue, which can be used to generate amplified aRNA for microarray hybridization from as few as 50 laser microdissected cells. Using this optimized experimental protocol in combination with our microarray analysis methodology we find 49 differentially expressed genes between the motor neurons and the interneurons that reflect the functional differences between these two cell populations in generating and transmitting the motor output in the rodent spinal cord. PMID:18923679
Li, Dongmei; Le Pape, Marc A; Parikh, Nisha I; Chen, Will X; Dye, Timothy D
2013-01-01
Microarrays are widely used for examining differential gene expression, identifying single nucleotide polymorphisms, and detecting methylation loci. Multiple testing methods in microarray data analysis aim at controlling both Type I and Type II error rates; however, real microarray data do not always fit their distribution assumptions. Smyth's ubiquitous parametric method, for example, inadequately accommodates violations of normality assumptions, resulting in inflated Type I error rates. The Significance Analysis of Microarrays, another widely used microarray data analysis method, is based on a permutation test and is robust to non-normally distributed data; however, the Significance Analysis of Microarrays method fold change criteria are problematic, and can critically alter the conclusion of a study, as a result of compositional changes of the control data set in the analysis. We propose a novel approach, combining resampling with empirical Bayes methods: the Resampling-based empirical Bayes Methods. This approach not only reduces false discovery rates for non-normally distributed microarray data, but it is also impervious to fold change threshold since no control data set selection is needed. Through simulation studies, sensitivities, specificities, total rejections, and false discovery rates are compared across the Smyth's parametric method, the Significance Analysis of Microarrays, and the Resampling-based empirical Bayes Methods. Differences in false discovery rates controls between each approach are illustrated through a preterm delivery methylation study. The results show that the Resampling-based empirical Bayes Methods offer significantly higher specificity and lower false discovery rates compared to Smyth's parametric method when data are not normally distributed. The Resampling-based empirical Bayes Methods also offers higher statistical power than the Significance Analysis of Microarrays method when the proportion of significantly differentially expressed genes is large for both normally and non-normally distributed data. Finally, the Resampling-based empirical Bayes Methods are generalizable to next generation sequencing RNA-seq data analysis.
Transfection microarray and the applications.
Miyake, Masato; Yoshikawa, Tomohiro; Fujita, Satoshi; Miyake, Jun
2009-05-01
Microarray transfection has been extensively studied for high-throughput functional analysis of mammalian cells. However, control of efficiency and reproducibility are the critical issues for practical use. By using solid-phase transfection accelerators and nano-scaffold, we provide a highly efficient and reproducible microarray-transfection device, "transfection microarray". The device would be applied to the limited number of available primary cells and stem cells not only for large-scale functional analysis but also reporter-based time-lapse cellular event analysis.
Zhang, Zhaowei; Li, Peiwu; Hu, Xiaofeng; Zhang, Qi; Ding, Xiaoxia; Zhang, Wen
2012-01-01
Chemical contaminants in food have caused serious health issues in both humans and animals. Microarray technology is an advanced technique suitable for the analysis of chemical contaminates. In particular, immuno-microarray approach is one of the most promising methods for chemical contaminants analysis. The use of microarrays for the analysis of chemical contaminants is the subject of this review. Fabrication strategies and detection methods for chemical contaminants are discussed in detail. Application to the analysis of mycotoxins, biotoxins, pesticide residues, and pharmaceutical residues is also described. Finally, future challenges and opportunities are discussed.
Burgos, Carmen Mesas; Uggla, Andreas Ringman; Fagerström-Billai, Fredrik; Eklöf, Ann-Christine; Frenckner, Björn; Nord, Magnus
2010-07-01
Pulmonary hypoplasia and persistent pulmonary hypertension are the main causes of mortality and morbidity in newborns with congenital diaphragmatic hernia (CDH). Nitrofen is well known to induce CDH and lung hypoplasia in a rat model, but the mechanism remains unknown. To increase the understanding of the underlying pathogenesis of CDH, we performed a global gene expression analysis using microarray technology. Pregnant rats were given 100 mg nitrofen on gestational day 9.5 to create CDH. On day 21, fetuses after nitrofen administration and control fetuses were removed; and lungs were harvested. Global gene expression analysis was performed using Affymetrix Platform and the RAE 230 set arrays. For validation of microarray data, we performed real-time polymerase chain reaction and Western blot analysis. Significantly decreased genes after nitrofen administration included several growth factors and growth factors receptors involved in lung development, transcription factors, water and ion channels, and genes involved in angiogenesis and extracellular matrix. These results could be confirmed with real-time polymerase chain reaction and protein expression studies. The pathogenesis of lung hypoplasia and CDH in the nitrofen model includes alteration at a molecular level of several pathways involved in lung development. The complexity of the nitrofen mechanism of action reminds of human CDH; and the picture is consistent with lung hypoplasia and vascular disease, both important contributors to the high mortality and morbidity in CDH. Increased understanding of the molecular mechanisms that control lung growth may be the key to develop novel therapeutic techniques to stimulate pre- and postnatal lung growth. Copyright 2010 Elsevier Inc. All rights reserved.
Investigating the epigenetic effects of a prototype smoke-derived carcinogen in human cells.
Tommasi, Stella; Kim, Sang-in; Zhong, Xueyan; Wu, Xiwei; Pfeifer, Gerd P; Besaratinia, Ahmad
2010-05-12
Global loss of DNA methylation and locus/gene-specific gain of DNA methylation are two distinct hallmarks of carcinogenesis. Aberrant DNA methylation is implicated in smoking-related lung cancer. In this study, we have comprehensively investigated the modulation of DNA methylation consequent to chronic exposure to a prototype smoke-derived carcinogen, benzo[a]pyrene diol epoxide (B[a]PDE), in genomic regions of significance in lung cancer, in normal human cells. We have used a pulldown assay for enrichment of the CpG methylated fraction of cellular DNA combined with microarray platforms, followed by extensive validation through conventional bisulfite-based analysis. Here, we demonstrate strikingly similar patterns of DNA methylation in non-transformed B[a]PDE-treated cells vs control using high-throughput microarray-based DNA methylation profiling confirmed by conventional bisulfite-based DNA methylation analysis. The absence of aberrant DNA methylation in our model system within a timeframe that precedes cellular transformation suggests that following carcinogen exposure, other as yet unknown factors (secondary to carcinogen treatment) may help initiate global loss of DNA methylation and region-specific gain of DNA methylation, which can, in turn, contribute to lung cancer development. Unveiling the initiating events that cause aberrant DNA methylation in lung cancer has tremendous public health relevance, as it can help define future strategies for early detection and prevention of this highly lethal disease.
Investigating the Epigenetic Effects of a Prototype Smoke-Derived Carcinogen in Human Cells
Tommasi, Stella; Kim, Sang-in; Zhong, Xueyan; Wu, Xiwei; Pfeifer, Gerd P.; Besaratinia, Ahmad
2010-01-01
Global loss of DNA methylation and locus/gene-specific gain of DNA methylation are two distinct hallmarks of carcinogenesis. Aberrant DNA methylation is implicated in smoking-related lung cancer. In this study, we have comprehensively investigated the modulation of DNA methylation consequent to chronic exposure to a prototype smoke-derived carcinogen, benzo[a]pyrene diol epoxide (B[a]PDE), in genomic regions of significance in lung cancer, in normal human cells. We have used a pulldown assay for enrichment of the CpG methylated fraction of cellular DNA combined with microarray platforms, followed by extensive validation through conventional bisulfite-based analysis. Here, we demonstrate strikingly similar patterns of DNA methylation in non-transformed B[a]PDE-treated cells vs control using high-throughput microarray-based DNA methylation profiling confirmed by conventional bisulfite-based DNA methylation analysis. The absence of aberrant DNA methylation in our model system within a timeframe that precedes cellular transformation suggests that following carcinogen exposure, other as yet unknown factors (secondary to carcinogen treatment) may help initiate global loss of DNA methylation and region-specific gain of DNA methylation, which can, in turn, contribute to lung cancer development. Unveiling the initiating events that cause aberrant DNA methylation in lung cancer has tremendous public health relevance, as it can help define future strategies for early detection and prevention of this highly lethal disease. PMID:20485678
Soule, Tanya; Gao, Qunjie; Stout, Valerie; Garcia-Pichel, Ferran
2013-01-01
Cyanobacteria in nature are exposed not only to the visible spectrum of sunlight but also to its harmful ultraviolet components (UVA and UVB). We used Nostoc punctiforme ATCC 29133 as a model to study the UVA response by analyzing global gene expression patterns using genomic microarrays. UVA exposure resulted in the statistically detectable differential expression of 573 genes of the 6903 that were probed, compared with that of the control cultures. Of those genes, 473 were up-regulated, while only 100 were down-regulated. Many of the down-regulated genes were involved in photosynthetic pigment biosynthesis, indicating a significant shift in this metabolism. As expected, we detected the up-regulation of genes encoding antioxidant enzymes and the sunscreen, scytonemin. However, a majority of the up-regulated genes, 47%, were unassignable bioinformatically to known functional categories, suggesting that the UVA stress response is not well understood. Interestingly, the most dramatic up-regulation involved several contiguous genes of unassigned metabolism on plasmid A. This is the first global UVA stress response analysis of any phototrophic microorganism and the differential expression of 8% of the genes of the Nostoc genome indicates that adaptation to UVA in Nostoc has been an evolutionary force of significance. © 2012 Wiley Periodicals, Inc. Photochemistry and Photobiology © 2012 The American Society of Photobiology.
Contributions to Statistical Problems Related to Microarray Data
ERIC Educational Resources Information Center
Hong, Feng
2009-01-01
Microarray is a high throughput technology to measure the gene expression. Analysis of microarray data brings many interesting and challenging problems. This thesis consists three studies related to microarray data. First, we propose a Bayesian model for microarray data and use Bayes Factors to identify differentially expressed genes. Second, we…
2010-01-01
Background Infection by infectious laryngotracheitis virus (ILTV; gallid herpesvirus 1) causes acute respiratory diseases in chickens often with high mortality. To better understand host-ILTV interactions at the host transcriptional level, a microarray analysis was performed using 4 × 44 K Agilent chicken custom oligo microarrays. Results Microarrays were hybridized using the two color hybridization method with total RNA extracted from ILTV infected chicken embryo lung cells at 0, 1, 3, 5, and 7 days post infection (dpi). Results showed that 789 genes were differentially expressed in response to ILTV infection that include genes involved in the immune system (cytokines, chemokines, MHC, and NF-κB), cell cycle regulation (cyclin B2, CDK1, and CKI3), matrix metalloproteinases (MMPs) and cellular metabolism. Differential expression for 20 out of 789 genes were confirmed by quantitative reverse transcription-PCR (qRT-PCR). A bioinformatics tool (Ingenuity Pathway Analysis) used to analyze biological functions and pathways on the group of 789 differentially expressed genes revealed that 21 possible gene networks with intermolecular connections among 275 functionally identified genes. These 275 genes were classified into a number of functional groups that included cancer, genetic disorder, cellular growth and proliferation, and cell death. Conclusion The results of this study provide comprehensive knowledge on global gene expression, and biological functionalities of differentially expressed genes in chicken embryo lung cells in response to ILTV infections. PMID:20663125
Microarray platform for omics analysis
NASA Astrophysics Data System (ADS)
Mecklenburg, Michael; Xie, Bin
2001-09-01
Microarray technology has revolutionized genetic analysis. However, limitations in genome analysis has lead to renewed interest in establishing 'omic' strategies. As we enter the post-genomic era, new microarray technologies are needed to address these new classes of 'omic' targets, such as proteins, as well as lipids and carbohydrates. We have developed a microarray platform that combines self- assembling monolayers with the biotin-streptavidin system to provide a robust, versatile immobilization scheme. A hydrophobic film is patterned on the surface creating an array of tension wells that eliminates evaporation effects thereby reducing the shear stress to which biomolecules are exposed to during immobilization. The streptavidin linker layer makes it possible to adapt and/or develop microarray based assays using virtually any class of biomolecules including: carbohydrates, peptides, antibodies, receptors, as well as them ore traditional DNA based arrays. Our microarray technology is designed to furnish seamless compatibility across the various 'omic' platforms by providing a common blueprint for fabricating and analyzing arrays. The prototype microarray uses a microscope slide footprint patterned with 2 by 96 flat wells. Data on the microarray platform will be presented.
Microarrays in brain research: the good, the bad and the ugly.
Mirnics, K
2001-06-01
Making sense of microarray data is a complex process, in which the interpretation of findings will depend on the overall experimental design and judgement of the investigator performing the analysis. As a result, differences in tissue harvesting, microarray types, sample labelling and data analysis procedures make post hoc sharing of microarray data a great challenge. To ensure rapid and meaningful data exchange, we need to create some order out of the existing chaos. In these ground-breaking microarray standardization and data sharing efforts, NIH agencies should take a leading role
Trivedi, Prinal; Edwards, Jode W; Wang, Jelai; Gadbury, Gary L; Srinivasasainagendra, Vinodh; Zakharkin, Stanislav O; Kim, Kyoungmi; Mehta, Tapan; Brand, Jacob P L; Patki, Amit; Page, Grier P; Allison, David B
2005-04-06
Many efforts in microarray data analysis are focused on providing tools and methods for the qualitative analysis of microarray data. HDBStat! (High-Dimensional Biology-Statistics) is a software package designed for analysis of high dimensional biology data such as microarray data. It was initially developed for the analysis of microarray gene expression data, but it can also be used for some applications in proteomics and other aspects of genomics. HDBStat! provides statisticians and biologists a flexible and easy-to-use interface to analyze complex microarray data using a variety of methods for data preprocessing, quality control analysis and hypothesis testing. Results generated from data preprocessing methods, quality control analysis and hypothesis testing methods are output in the form of Excel CSV tables, graphs and an Html report summarizing data analysis. HDBStat! is a platform-independent software that is freely available to academic institutions and non-profit organizations. It can be downloaded from our website http://www.soph.uab.edu/ssg_content.asp?id=1164.
Plant-pathogen interactions: what microarray tells about it?
Lodha, T D; Basak, J
2012-01-01
Plant defense responses are mediated by elementary regulatory proteins that affect expression of thousands of genes. Over the last decade, microarray technology has played a key role in deciphering the underlying networks of gene regulation in plants that lead to a wide variety of defence responses. Microarray is an important tool to quantify and profile the expression of thousands of genes simultaneously, with two main aims: (1) gene discovery and (2) global expression profiling. Several microarray technologies are currently in use; most include a glass slide platform with spotted cDNA or oligonucleotides. Till date, microarray technology has been used in the identification of regulatory genes, end-point defence genes, to understand the signal transduction processes underlying disease resistance and its intimate links to other physiological pathways. Microarray technology can be used for in-depth, simultaneous profiling of host/pathogen genes as the disease progresses from infection to resistance/susceptibility at different developmental stages of the host, which can be done in different environments, for clearer understanding of the processes involved. A thorough knowledge of plant disease resistance using successful combination of microarray and other high throughput techniques, as well as biochemical, genetic, and cell biological experiments is needed for practical application to secure and stabilize yield of many crop plants. This review starts with a brief introduction to microarray technology, followed by the basics of plant-pathogen interaction, the use of DNA microarrays over the last decade to unravel the mysteries of plant-pathogen interaction, and ends with the future prospects of this technology.
Protein Microarray Analysis in Patients With Asthma*
Kim, Hyo-Bin; Kim, Chang-Keun; Iijima, Koji; Kobayashi, Takao; Kita, Hirohito
2010-01-01
Background Microarray technology offers a new opportunity to gain insight into global gene and protein expression profiles in asthma. To identify novel factors produced in the asthmatic airway, we analyzed sputum samples by using a membrane-based human cytokine microarray technology in patients with bronchial asthma (BA). Methods Induced sputum was obtained from 28 BA subjects, 20 nonasthmatic atopic control (AC) subjects, and 38 nonasthmatic nonatopic normal control (NC) subjects. The microarray samples of subjects were randomly selected from nine BA subjects, three AC subjects, and six NC subjects. Sputum supernatants were analyzed using a custom human cytokine array (RayBio Custom Human Cytokine Array; RayBiotech; Norcross, GA) designed to analyze 79 specific cytokines simultaneously. The levels of growth-regulated oncogene (GRO)-α, eotaxin-2, and pulmonary and activation-regulated chemokine (PARC)/CCL18 were measured by sandwich enzyme-linked immunosorbent assays (ELISAs), and eosinophil-derived neurotoxin (EDN) was measured by radioimmunoassay. Results By microarray, the signal intensities for GRO-α, eotaxin-2, and PARC were significantly higher in BA subjects than in AC and NC subjects (p = 0.036, p = 0.042, and p = 0.033, respectively). By ELISA, the sputum PARC protein levels were significantly higher in BA subjects than in AC and NC subjects (p < 0.0001). Furthermore, PARC levels correlated significantly with sputum eosinophil percentages (r = 0.570, p < 0.0001) and the levels of EDN(r = 0.633, p < 0.0001), the regulated upon activation, normal T cell expressed and secreted cytokine (r = 0.440, p < 0.001), interleukin-4 (r = 0.415, p < 0.01), and interferon-γ (r = 0.491, p < 0.001). Conclusions By a nonbiased screening approach, a chemokine, PARC, is elevated in sputum specimens from patients with asthma. PARC may play important roles in development of airway eosinophilic inflammation in asthma. PMID:19017877
ArrayNinja: An Open Source Platform for Unified Planning and Analysis of Microarray Experiments.
Dickson, B M; Cornett, E M; Ramjan, Z; Rothbart, S B
2016-01-01
Microarray-based proteomic platforms have emerged as valuable tools for studying various aspects of protein function, particularly in the field of chromatin biochemistry. Microarray technology itself is largely unrestricted in regard to printable material and platform design, and efficient multidimensional optimization of assay parameters requires fluidity in the design and analysis of custom print layouts. This motivates the need for streamlined software infrastructure that facilitates the combined planning and analysis of custom microarray experiments. To this end, we have developed ArrayNinja as a portable, open source, and interactive application that unifies the planning and visualization of microarray experiments and provides maximum flexibility to end users. Array experiments can be planned, stored to a private database, and merged with the imaged results for a level of data interaction and centralization that is not currently attainable with available microarray informatics tools. © 2016 Elsevier Inc. All rights reserved.
WebArray: an online platform for microarray data analysis
Xia, Xiaoqin; McClelland, Michael; Wang, Yipeng
2005-01-01
Background Many cutting-edge microarray analysis tools and algorithms, including commonly used limma and affy packages in Bioconductor, need sophisticated knowledge of mathematics, statistics and computer skills for implementation. Commercially available software can provide a user-friendly interface at considerable cost. To facilitate the use of these tools for microarray data analysis on an open platform we developed an online microarray data analysis platform, WebArray, for bench biologists to utilize these tools to explore data from single/dual color microarray experiments. Results The currently implemented functions were based on limma and affy package from Bioconductor, the spacings LOESS histogram (SPLOSH) method, PCA-assisted normalization method and genome mapping method. WebArray incorporates these packages and provides a user-friendly interface for accessing a wide range of key functions of limma and others, such as spot quality weight, background correction, graphical plotting, normalization, linear modeling, empirical bayes statistical analysis, false discovery rate (FDR) estimation, chromosomal mapping for genome comparison. Conclusion WebArray offers a convenient platform for bench biologists to access several cutting-edge microarray data analysis tools. The website is freely available at . It runs on a Linux server with Apache and MySQL. PMID:16371165
Global gene expression analysis by combinatorial optimization.
Ameur, Adam; Aurell, Erik; Carlsson, Mats; Westholm, Jakub Orzechowski
2004-01-01
Generally, there is a trade-off between methods of gene expression analysis that are precise but labor-intensive, e.g. RT-PCR, and methods that scale up to global coverage but are not quite as quantitative, e.g. microarrays. In the present paper, we show how how a known method of gene expression profiling (K. Kato, Nucleic Acids Res. 23, 3685-3690 (1995)), which relies on a fairly small number of steps, can be turned into a global gene expression measurement by advanced data post-processing, with potentially little loss of accuracy. Post-processing here entails solving an ancillary combinatorial optimization problem. Validation is performed on in silico experiments generated from the FANTOM data base of full-length mouse cDNA. We present two variants of the method. One uses state-of-the-art commercial software for solving problems of this kind, the other a code developed by us specifically for this purpose, released in the public domain under GPL license.
Han, Junwei; Li, Chunquan; Yang, Haixiu; Xu, Yanjun; Zhang, Chunlong; Ma, Jiquan; Shi, Xinrui; Liu, Wei; Shang, Desi; Yao, Qianlan; Zhang, Yunpeng; Su, Fei; Feng, Li; Li, Xia
2015-01-01
Identifying dysregulated pathways from high-throughput experimental data in order to infer underlying biological insights is an important task. Current pathway-identification methods focus on single pathways in isolation; however, consideration of crosstalk between pathways could improve our understanding of alterations in biological states. We propose a novel method of pathway analysis based on global influence (PAGI) to identify dysregulated pathways, by considering both within-pathway effects and crosstalk between pathways. We constructed a global gene–gene network based on the relationships among genes extracted from a pathway database. We then evaluated the extent of differential expression for each gene, and mapped them to the global network. The random walk with restart algorithm was used to calculate the extent of genes affected by global influence. Finally, we used cumulative distribution functions to determine the significance values of the dysregulated pathways. We applied the PAGI method to five cancer microarray datasets, and compared our results with gene set enrichment analysis and five other methods. Based on these analyses, we demonstrated that PAGI can effectively identify dysregulated pathways associated with cancer, with strong reproducibility and robustness. We implemented PAGI using the freely available R-based and Web-based tools (http://bioinfo.hrbmu.edu.cn/PAGI). PMID:25551156
Bizama, Carolina; Benavente, Felipe; Salvatierra, Edgardo; Gutiérrez-Moraga, Ana; Espinoza, Jaime A; Fernández, Elmer A; Roa, Iván; Mazzolini, Guillermo; Sagredo, Eduardo A; Gidekel, Manuel; Podhajcer, Osvaldo L
2014-02-15
Studies on the low-abundance transcriptome are of paramount importance for identifying the intimate mechanisms of tumor progression that can lead to novel therapies. The aim of the present study was to identify novel markers and targetable genes and pathways in advanced human gastric cancer through analyses of the low-abundance transcriptome. The procedure involved an initial subtractive hybridization step, followed by global gene expression analysis using microarrays. We observed profound differences, both at the single gene and gene ontology levels, between the low-abundance transcriptome and the whole transcriptome. Analysis of the low-abundance transcriptome led to the identification and validation by tissue microarrays of novel biomarkers, such as LAMA3 and TTN; moreover, we identified cancer type-specific intracellular pathways and targetable genes, such as IRS2, IL17, IFNγ, VEGF-C, WISP1, FZD5 and CTBP1 that were not detectable by whole transcriptome analyses. We also demonstrated that knocking down the expression of CTBP1 sensitized gastric cancer cells to mainstay chemotherapeutic drugs. We conclude that the analysis of the low-abundance transcriptome provides useful insights into the molecular basis and treatment of cancer. © 2013 UICC.
Global gene expression analysis of apple fruit development from the floral bud to ripe fruit
Janssen, Bart J; Thodey, Kate; Schaffer, Robert J; Alba, Rob; Balakrishnan, Lena; Bishop, Rebecca; Bowen, Judith H; Crowhurst, Ross N; Gleave, Andrew P; Ledger, Susan; McArtney, Steve; Pichler, Franz B; Snowden, Kimberley C; Ward, Shayna
2008-01-01
Background Apple fruit develop over a period of 150 days from anthesis to fully ripe. An array representing approximately 13000 genes (15726 oligonucleotides of 45–55 bases) designed from apple ESTs has been used to study gene expression over eight time points during fruit development. This analysis of gene expression lays the groundwork for a molecular understanding of fruit growth and development in apple. Results Using ANOVA analysis of the microarray data, 1955 genes showed significant changes in expression over this time course. Expression of genes is coordinated with four major patterns of expression observed: high in floral buds; high during cell division; high when starch levels and cell expansion rates peak; and high during ripening. Functional analysis associated cell cycle genes with early fruit development and three core cell cycle genes are significantly up-regulated in the early stages of fruit development. Starch metabolic genes were associated with changes in starch levels during fruit development. Comparison with microarrays of ethylene-treated apple fruit identified a group of ethylene induced genes also induced in normal fruit ripening. Comparison with fruit development microarrays in tomato has been used to identify 16 genes for which expression patterns are similar in apple and tomato and these genes may play fundamental roles in fruit development. The early phase of cell division and tissue specification that occurs in the first 35 days after pollination has been associated with up-regulation of a cluster of genes that includes core cell cycle genes. Conclusion Gene expression in apple fruit is coordinated with specific developmental stages. The array results are reproducible and comparisons with experiments in other species has been used to identify genes that may play a fundamental role in fruit development. PMID:18279528
Global gene expression analysis of apple fruit development from the floral bud to ripe fruit.
Janssen, Bart J; Thodey, Kate; Schaffer, Robert J; Alba, Rob; Balakrishnan, Lena; Bishop, Rebecca; Bowen, Judith H; Crowhurst, Ross N; Gleave, Andrew P; Ledger, Susan; McArtney, Steve; Pichler, Franz B; Snowden, Kimberley C; Ward, Shayna
2008-02-17
Apple fruit develop over a period of 150 days from anthesis to fully ripe. An array representing approximately 13000 genes (15726 oligonucleotides of 45-55 bases) designed from apple ESTs has been used to study gene expression over eight time points during fruit development. This analysis of gene expression lays the groundwork for a molecular understanding of fruit growth and development in apple. Using ANOVA analysis of the microarray data, 1955 genes showed significant changes in expression over this time course. Expression of genes is coordinated with four major patterns of expression observed: high in floral buds; high during cell division; high when starch levels and cell expansion rates peak; and high during ripening. Functional analysis associated cell cycle genes with early fruit development and three core cell cycle genes are significantly up-regulated in the early stages of fruit development. Starch metabolic genes were associated with changes in starch levels during fruit development. Comparison with microarrays of ethylene-treated apple fruit identified a group of ethylene induced genes also induced in normal fruit ripening. Comparison with fruit development microarrays in tomato has been used to identify 16 genes for which expression patterns are similar in apple and tomato and these genes may play fundamental roles in fruit development. The early phase of cell division and tissue specification that occurs in the first 35 days after pollination has been associated with up-regulation of a cluster of genes that includes core cell cycle genes. Gene expression in apple fruit is coordinated with specific developmental stages. The array results are reproducible and comparisons with experiments in other species has been used to identify genes that may play a fundamental role in fruit development.
Ontology-based meta-analysis of global collections of high-throughput public data.
Kupershmidt, Ilya; Su, Qiaojuan Jane; Grewal, Anoop; Sundaresh, Suman; Halperin, Inbal; Flynn, James; Shekar, Mamatha; Wang, Helen; Park, Jenny; Cui, Wenwu; Wall, Gregory D; Wisotzkey, Robert; Alag, Satnam; Akhtari, Saeid; Ronaghi, Mostafa
2010-09-29
The investigation of the interconnections between the molecular and genetic events that govern biological systems is essential if we are to understand the development of disease and design effective novel treatments. Microarray and next-generation sequencing technologies have the potential to provide this information. However, taking full advantage of these approaches requires that biological connections be made across large quantities of highly heterogeneous genomic datasets. Leveraging the increasingly huge quantities of genomic data in the public domain is fast becoming one of the key challenges in the research community today. We have developed a novel data mining framework that enables researchers to use this growing collection of public high-throughput data to investigate any set of genes or proteins. The connectivity between molecular states across thousands of heterogeneous datasets from microarrays and other genomic platforms is determined through a combination of rank-based enrichment statistics, meta-analyses, and biomedical ontologies. We address data quality concerns through dataset replication and meta-analysis and ensure that the majority of the findings are derived using multiple lines of evidence. As an example of our strategy and the utility of this framework, we apply our data mining approach to explore the biology of brown fat within the context of the thousands of publicly available gene expression datasets. Our work presents a practical strategy for organizing, mining, and correlating global collections of large-scale genomic data to explore normal and disease biology. Using a hypothesis-free approach, we demonstrate how a data-driven analysis across very large collections of genomic data can reveal novel discoveries and evidence to support existing hypothesis.
Song, Jie; Hu, Yajie; Hu, Yunguang; Wang, Jingjing; Zhang, Xiaolong; Wang, Lichun; Guo, Lei; Wang, Yancui; Ning, Ruotong; Liao, Yun; Zhang, Ying; Zheng, Huiwen; Shi, Haijing; He, Zhanlong; Li, Qihan; Liu, Longding
2016-03-02
Coxsackievirus A16 (CA16) is a dominant pathogen that results in hand, foot, and mouth disease and causes outbreaks worldwide, particularly in the Asia-Pacific region. However, the underlying molecular mechanisms remain unclear. Our previous study has demonstrated that the basic CA16 pathogenic process was successfully mimicked in rhesus monkey infant. The present study focused on the global gene expression changes in peripheral blood mononuclear cells of rhesus monkey infants with hand, foot, and mouth disease induced by CA16 infection at different time points. Genome-wide expression analysis was performed with Agilent whole-genome microarrays and established bioinformatics tools. Nine hundred and forty-eight significant differentially expressed genes that were associated with 5 gene ontology categories, including cell communication, cell cycle, immune system process, regulation of transcription and metabolic process were identified. Subsequently, the mapping of genes related to the immune system process by PANTHER pathway analysis revealed the predominance of inflammation mediated by chemokine and cytokine signaling pathways and the interleukin signaling pathway. Ultimately, co-expressed genes and their networks were analyzed. The results revealed the gene expression profile of the immune system in response to CA16 in rhesus monkey infants and suggested that such an immune response was generated as a result of the positive mobilization of the immune system. This initial microarray study will provide insights into the molecular mechanism of CA16 infection and will facilitate the identification of biomarkers for the evaluation of vaccines against this virus. Copyright © 2016 Elsevier B.V. All rights reserved.
Issues in the analysis of oligonucleotide tiling microarrays for transcript mapping
NASA Technical Reports Server (NTRS)
Royce, Thomas E.; Rozowsky, Joel S.; Bertone, Paul; Samanta, Manoj; Stolc, Viktor; Weissman, Sherman; Snyder, Michael; Gerstein, Mark
2005-01-01
Traditional microarrays use probes complementary to known genes to quantitate the differential gene expression between two or more conditions. Genomic tiling microarray experiments differ in that probes that span a genomic region at regular intervals are used to detect the presence or absence of transcription. This difference means the same sets of biases and the methods for addressing them are unlikely to be relevant to both types of experiment. We introduce the informatics challenges arising in the analysis of tiling microarray experiments as open problems to the scientific community and present initial approaches for the analysis of this nascent technology.
Karyotype versus microarray testing for genetic abnormalities after stillbirth.
Reddy, Uma M; Page, Grier P; Saade, George R; Silver, Robert M; Thorsten, Vanessa R; Parker, Corette B; Pinar, Halit; Willinger, Marian; Stoll, Barbara J; Heim-Hall, Josefine; Varner, Michael W; Goldenberg, Robert L; Bukowski, Radek; Wapner, Ronald J; Drews-Botsch, Carolyn D; O'Brien, Barbara M; Dudley, Donald J; Levy, Brynn
2012-12-06
Genetic abnormalities have been associated with 6 to 13% of stillbirths, but the true prevalence may be higher. Unlike karyotype analysis, microarray analysis does not require live cells, and it detects small deletions and duplications called copy-number variants. The Stillbirth Collaborative Research Network conducted a population-based study of stillbirth in five geographic catchment areas. Standardized postmortem examinations and karyotype analyses were performed. A single-nucleotide polymorphism array was used to detect copy-number variants of at least 500 kb in placental or fetal tissue. Variants that were not identified in any of three databases of apparently unaffected persons were then classified into three groups: probably benign, clinical significance unknown, or pathogenic. We compared the results of karyotype and microarray analyses of samples obtained after delivery. In our analysis of samples from 532 stillbirths, microarray analysis yielded results more often than did karyotype analysis (87.4% vs. 70.5%, P<0.001) and provided better detection of genetic abnormalities (aneuploidy or pathogenic copy-number variants, 8.3% vs. 5.8%; P=0.007). Microarray analysis also identified more genetic abnormalities among 443 antepartum stillbirths (8.8% vs. 6.5%, P=0.02) and 67 stillbirths with congenital anomalies (29.9% vs. 19.4%, P=0.008). As compared with karyotype analysis, microarray analysis provided a relative increase in the diagnosis of genetic abnormalities of 41.9% in all stillbirths, 34.5% in antepartum stillbirths, and 53.8% in stillbirths with anomalies. Microarray analysis is more likely than karyotype analysis to provide a genetic diagnosis, primarily because of its success with nonviable tissue, and is especially valuable in analyses of stillbirths with congenital anomalies or in cases in which karyotype results cannot be obtained. (Funded by the Eunice Kennedy Shriver National Institute of Child Health and Human Development.).
Samolski, Ilanit; de Luis, Alberto; Vizcaíno, Juan Antonio; Monte, Enrique; Suárez, M Belén
2009-10-13
It has recently been shown that the Trichoderma fungal species used for biocontrol of plant diseases are capable of interacting with plant roots directly, behaving as symbiotic microorganisms. With a view to providing further information at transcriptomic level about the early response of Trichoderma to a host plant, we developed a high-density oligonucleotide (HDO) microarray encompassing 14,081 Expressed Sequence Tag (EST)-based transcripts from eight Trichoderma spp. and 9,121 genome-derived transcripts of T. reesei, and we have used this microarray to examine the gene expression of T. harzianum either alone or in the presence of tomato plants, chitin, or glucose. Global microarray analysis revealed 1,617 probe sets showing differential expression in T. harzianum mycelia under at least one of the culture conditions tested as compared with one another. Hierarchical clustering and heat map representation showed that the expression patterns obtained in glucose medium clustered separately from the expression patterns observed in the presence of tomato plants and chitin. Annotations using the Blast2GO suite identified 85 of the 257 transcripts whose probe sets afforded up-regulated expression in response to tomato plants. Some of these transcripts were predicted to encode proteins related to Trichoderma-host (fungus or plant) associations, such as Sm1/Elp1 protein, proteases P6281 and PRA1, enchochitinase CHIT42, or QID74 protein, although previously uncharacterized genes were also identified, including those responsible for the possible biosynthesis of nitric oxide, xenobiotic detoxification, mycelium development, or those related to the formation of infection structures in plant tissues. The effectiveness of the Trichoderma HDO microarray to detect different gene responses under different growth conditions in the fungus T. harzianum strongly indicates that this tool should be useful for further assays that include different stages of plant colonization, as well as for expression studies in other Trichoderma spp. represented on it. Using this microarray, we have been able to define a number of genes probably involved in the transcriptional response of T. harzianum within the first hours of contact with tomato plant roots, which may provide new insights into the mechanisms and roles of this fungus in the Trichoderma-plant interaction.
2009-01-01
Background It has recently been shown that the Trichoderma fungal species used for biocontrol of plant diseases are capable of interacting with plant roots directly, behaving as symbiotic microorganisms. With a view to providing further information at transcriptomic level about the early response of Trichoderma to a host plant, we developed a high-density oligonucleotide (HDO) microarray encompassing 14,081 Expressed Sequence Tag (EST)-based transcripts from eight Trichoderma spp. and 9,121 genome-derived transcripts of T. reesei, and we have used this microarray to examine the gene expression of T. harzianum either alone or in the presence of tomato plants, chitin, or glucose. Results Global microarray analysis revealed 1,617 probe sets showing differential expression in T. harzianum mycelia under at least one of the culture conditions tested as compared with one another. Hierarchical clustering and heat map representation showed that the expression patterns obtained in glucose medium clustered separately from the expression patterns observed in the presence of tomato plants and chitin. Annotations using the Blast2GO suite identified 85 of the 257 transcripts whose probe sets afforded up-regulated expression in response to tomato plants. Some of these transcripts were predicted to encode proteins related to Trichoderma-host (fungus or plant) associations, such as Sm1/Elp1 protein, proteases P6281 and PRA1, enchochitinase CHIT42, or QID74 protein, although previously uncharacterized genes were also identified, including those responsible for the possible biosynthesis of nitric oxide, xenobiotic detoxification, mycelium development, or those related to the formation of infection structures in plant tissues. Conclusion The effectiveness of the Trichoderma HDO microarray to detect different gene responses under different growth conditions in the fungus T. harzianum strongly indicates that this tool should be useful for further assays that include different stages of plant colonization, as well as for expression studies in other Trichoderma spp. represented on it. Using this microarray, we have been able to define a number of genes probably involved in the transcriptional response of T. harzianum within the first hours of contact with tomato plant roots, which may provide new insights into the mechanisms and roles of this fungus in the Trichoderma-plant interaction. PMID:19825185
A Java-based tool for the design of classification microarrays.
Meng, Da; Broschat, Shira L; Call, Douglas R
2008-08-04
Classification microarrays are used for purposes such as identifying strains of bacteria and determining genetic relationships to understand the epidemiology of an infectious disease. For these cases, mixed microarrays, which are composed of DNA from more than one organism, are more effective than conventional microarrays composed of DNA from a single organism. Selection of probes is a key factor in designing successful mixed microarrays because redundant sequences are inefficient and limited representation of diversity can restrict application of the microarray. We have developed a Java-based software tool, called PLASMID, for use in selecting the minimum set of probe sequences needed to classify different groups of plasmids or bacteria. The software program was successfully applied to several different sets of data. The utility of PLASMID was illustrated using existing mixed-plasmid microarray data as well as data from a virtual mixed-genome microarray constructed from different strains of Streptococcus. Moreover, use of data from expression microarray experiments demonstrated the generality of PLASMID. In this paper we describe a new software tool for selecting a set of probes for a classification microarray. While the tool was developed for the design of mixed microarrays-and mixed-plasmid microarrays in particular-it can also be used to design expression arrays. The user can choose from several clustering methods (including hierarchical, non-hierarchical, and a model-based genetic algorithm), several probe ranking methods, and several different display methods. A novel approach is used for probe redundancy reduction, and probe selection is accomplished via stepwise discriminant analysis. Data can be entered in different formats (including Excel and comma-delimited text), and dendrogram, heat map, and scatter plot images can be saved in several different formats (including jpeg and tiff). Weights generated using stepwise discriminant analysis can be stored for analysis of subsequent experimental data. Additionally, PLASMID can be used to construct virtual microarrays with genomes from public databases, which can then be used to identify an optimal set of probes.
Tra, Yolande V; Evans, Irene M
2010-01-01
BIO2010 put forth the goal of improving the mathematical educational background of biology students. The analysis and interpretation of microarray high-dimensional data can be very challenging and is best done by a statistician and a biologist working and teaching in a collaborative manner. We set up such a collaboration and designed a course on microarray data analysis. We started using Genome Consortium for Active Teaching (GCAT) materials and Microarray Genome and Clustering Tool software and added R statistical software along with Bioconductor packages. In response to student feedback, one microarray data set was fully analyzed in class, starting from preprocessing to gene discovery to pathway analysis using the latter software. A class project was to conduct a similar analysis where students analyzed their own data or data from a published journal paper. This exercise showed the impact that filtering, preprocessing, and different normalization methods had on gene inclusion in the final data set. We conclude that this course achieved its goals to equip students with skills to analyze data from a microarray experiment. We offer our insight about collaborative teaching as well as how other faculty might design and implement a similar interdisciplinary course.
Evans, Irene M.
2010-01-01
BIO2010 put forth the goal of improving the mathematical educational background of biology students. The analysis and interpretation of microarray high-dimensional data can be very challenging and is best done by a statistician and a biologist working and teaching in a collaborative manner. We set up such a collaboration and designed a course on microarray data analysis. We started using Genome Consortium for Active Teaching (GCAT) materials and Microarray Genome and Clustering Tool software and added R statistical software along with Bioconductor packages. In response to student feedback, one microarray data set was fully analyzed in class, starting from preprocessing to gene discovery to pathway analysis using the latter software. A class project was to conduct a similar analysis where students analyzed their own data or data from a published journal paper. This exercise showed the impact that filtering, preprocessing, and different normalization methods had on gene inclusion in the final data set. We conclude that this course achieved its goals to equip students with skills to analyze data from a microarray experiment. We offer our insight about collaborative teaching as well as how other faculty might design and implement a similar interdisciplinary course. PMID:20810954
Chromosomal Microarray versus Karyotyping for Prenatal Diagnosis
Wapner, Ronald J.; Martin, Christa Lese; Levy, Brynn; Ballif, Blake C.; Eng, Christine M.; Zachary, Julia M.; Savage, Melissa; Platt, Lawrence D.; Saltzman, Daniel; Grobman, William A.; Klugman, Susan; Scholl, Thomas; Simpson, Joe Leigh; McCall, Kimberly; Aggarwal, Vimla S.; Bunke, Brian; Nahum, Odelia; Patel, Ankita; Lamb, Allen N.; Thom, Elizabeth A.; Beaudet, Arthur L.; Ledbetter, David H.; Shaffer, Lisa G.; Jackson, Laird
2013-01-01
Background Chromosomal microarray analysis has emerged as a primary diagnostic tool for the evaluation of developmental delay and structural malformations in children. We aimed to evaluate the accuracy, efficacy, and incremental yield of chromosomal microarray analysis as compared with karyotyping for routine prenatal diagnosis. Methods Samples from women undergoing prenatal diagnosis at 29 centers were sent to a central karyotyping laboratory. Each sample was split in two; standard karyotyping was performed on one portion and the other was sent to one of four laboratories for chromosomal microarray. Results We enrolled a total of 4406 women. Indications for prenatal diagnosis were advanced maternal age (46.6%), abnormal result on Down’s syndrome screening (18.8%), structural anomalies on ultrasonography (25.2%), and other indications (9.4%). In 4340 (98.8%) of the fetal samples, microarray analysis was successful; 87.9% of samples could be used without tissue culture. Microarray analysis of the 4282 nonmosaic samples identified all the aneuploidies and unbalanced rearrangements identified on karyotyping but did not identify balanced translocations and fetal triploidy. In samples with a normal karyotype, microarray analysis revealed clinically relevant deletions or duplications in 6.0% with a structural anomaly and in 1.7% of those whose indications were advanced maternal age or positive screening results. Conclusions In the context of prenatal diagnostic testing, chromosomal microarray analysis identified additional, clinically significant cytogenetic information as compared with karyotyping and was equally efficacious in identifying aneuploidies and unbalanced rearrangements but did not identify balanced translocations and triploidies. (Funded by the Eunice Kennedy Shriver National Institute of Child Health and Human Development and others; ClinicalTrials.gov number, NCT01279733.) PMID:23215555
Kumar, Mukesh; Rath, Nitish Kumar; Rath, Santanu Kumar
2016-04-01
Microarray-based gene expression profiling has emerged as an efficient technique for classification, prognosis, diagnosis, and treatment of cancer. Frequent changes in the behavior of this disease generates an enormous volume of data. Microarray data satisfies both the veracity and velocity properties of big data, as it keeps changing with time. Therefore, the analysis of microarray datasets in a small amount of time is essential. They often contain a large amount of expression, but only a fraction of it comprises genes that are significantly expressed. The precise identification of genes of interest that are responsible for causing cancer are imperative in microarray data analysis. Most existing schemes employ a two-phase process such as feature selection/extraction followed by classification. In this paper, various statistical methods (tests) based on MapReduce are proposed for selecting relevant features. After feature selection, a MapReduce-based K-nearest neighbor (mrKNN) classifier is also employed to classify microarray data. These algorithms are successfully implemented in a Hadoop framework. A comparative analysis is done on these MapReduce-based models using microarray datasets of various dimensions. From the obtained results, it is observed that these models consume much less execution time than conventional models in processing big data. Copyright © 2016 Elsevier Inc. All rights reserved.
Mirmirani, P.; Consolo, M.; Oyetakin-White, P.; Baron, E.; Leahy, P.; Karnik, P.
2014-01-01
Summary Background There are regional variations in scalp hair miniaturization seen in androgenetic alopecia (AGA). Use of topical minoxidil can lead to reversal of miniaturization in the vertex scalp. However, its effects on other scalp regions are less well studied. Methods A placebo controlled double-blinded prospective pilot study of minoxidil topical foam 5% (MTF) vs placebo was conducted in sixteen healthy men ages 18-49 with Hamilton-Norwood type IV-V thinning. The subjects were asked to apply the treatment (active drug or placebo) to the scalp twice daily for eight weeks. Stereotactic scalp photographs were taken at the baseline and final visits to monitor global hair growth. Scalp biopsies were done at the leading edge of hair loss from the frontal and vertex scalp before and after treatment with MTF and placebo and microarray analysis was done using the Affymetrix GeneChip HG U133 Plus 2.0. Results Global stereotactic photographs showed that MTF induced hair growth in both the frontal and vertex scalp of AGA patients. Regional differences in gene expression profiles were observed before treatment. However, MTF treatment induced the expression of hair keratin associated genes and decreased the expression of epidermal differentiation complex (EDC) and inflammatory genes in both scalp regions. Conclusions These data suggest that MTF is effective in the treatment of both the frontal and vertex scalp of AGA patients. PMID:25204361
BRIC-17 Mapping Spaceflight-Induced Hypoxic Signaling and Response in Plants
NASA Technical Reports Server (NTRS)
Gilroy, Simon; Choi, Won-Gyu; Swanson, Sarah
2012-01-01
Goals of this work are: (1) Define global changes in gene expression patterns in Arabidopsis plants grown in microgravity using whole genome microarrays (2) Compare to mutants resistant to low oxygen challenge using whole genome microarrays Also measuring root and shoot size Outcomes from this research are: (1) Provide fundamental information on plant responses to the stresses inherent in spaceflight (2) Potential for informing on genetic strategies to engineer plants for optimal growth in space
Direct labeling of serum proteins by fluorescent dye for antibody microarray.
Klimushina, M V; Gumanova, N G; Metelskaya, V A
2017-05-06
Analysis of serum proteome by antibody microarray is used to identify novel biomarkers and to study signaling pathways including protein phosphorylation and protein-protein interactions. Labeling of serum proteins is important for optimal performance of the antibody microarray. Proper choice of fluorescent label and optimal concentration of protein loaded on the microarray ensure good quality of imaging that can be reliably scanned and processed by the software. We have optimized direct serum protein labeling using fluorescent dye Arrayit Green 540 (Arrayit Corporation, USA) for antibody microarray. Optimized procedure produces high quality images that can be readily scanned and used for statistical analysis of protein composition of the serum. Copyright © 2017 Elsevier Inc. All rights reserved.
DOE Office of Scientific and Technical Information (OSTI.GOV)
White, Amanda M.; Daly, Don S.; Willse, Alan R.
The Automated Microarray Image Analysis (AMIA) Toolbox for MATLAB is a flexible, open-source microarray image analysis tool that allows the user to customize analysis of sets of microarray images. This tool provides several methods of identifying and quantify spot statistics, as well as extensive diagnostic statistics and images to identify poor data quality or processing. The open nature of this software allows researchers to understand the algorithms used to provide intensity estimates and to modify them easily if desired.
Convergence in probiotic Lactobacillus gut-adaptive responses in humans and mice.
Marco, Maria L; de Vries, Maaike C; Wels, Michiel; Molenaar, Douwe; Mangell, Peter; Ahrne, Siv; de Vos, Willem M; Vaughan, Elaine E; Kleerebezem, Michiel
2010-11-01
Probiotic bacteria provide unique opportunities to study the global responses and molecular mechanisms underlying the effects of gut-associated microorganisms in the human digestive tract. In this study, we show by comparative transcriptome analysis using DNA microarrays that the established probiotic Lactobacillus plantarum 299v specifically adapts its metabolic capacity in the human intestine for carbohydrate acquisition and expression of exopolysaccharide and proteinaceous cell surface compounds. This report constitutes the first application of global gene expression profiling of a commensal microorganism in the human gut. A core L. plantarum transcriptome expressed in the mammalian intestine was also determined through comparisons of L. plantarum 299v activities in humans to those found for L. plantarum WCFS1 in germ-free mice. These results identify the niche-specific adaptations of a dietary microorganism to the intestinal ecosystem and provide novel targets for molecular analysis of microbial-host interactions which affect human health.
ELISA-BASE: An Integrated Bioinformatics Tool for Analyzing and Tracking ELISA Microarray Data
DOE Office of Scientific and Technical Information (OSTI.GOV)
White, Amanda M.; Collett, James L.; Seurynck-Servoss, Shannon L.
ELISA-BASE is an open-source database for capturing, organizing and analyzing protein enzyme-linked immunosorbent assay (ELISA) microarray data. ELISA-BASE is an extension of the BioArray Soft-ware Environment (BASE) database system, which was developed for DNA microarrays. In order to make BASE suitable for protein microarray experiments, we developed several plugins for importing and analyzing quantitative ELISA microarray data. Most notably, our Protein Microarray Analysis Tool (ProMAT) for processing quantita-tive ELISA data is now available as a plugin to the database.
2010-01-01
Background Recent developments in high-throughput methods of analyzing transcriptomic profiles are promising for many areas of biology, including ecophysiology. However, although commercial microarrays are available for most common laboratory models, transcriptome analysis in non-traditional model species still remains a challenge. Indeed, the signal resulting from heterologous hybridization is low and difficult to interpret because of the weak complementarity between probe and target sequences, especially when no microarray dedicated to a genetically close species is available. Results We show here that transcriptome analysis in a species genetically distant from laboratory models is made possible by using MAXRS, a new method of analyzing heterologous hybridization on microarrays. This method takes advantage of the design of several commercial microarrays, with different probes targeting the same transcript. To illustrate and test this method, we analyzed the transcriptome of king penguin pectoralis muscle hybridized to Affymetrix chicken microarrays, two organisms separated by an evolutionary distance of approximately 100 million years. The differential gene expression observed between different physiological situations computed by MAXRS was confirmed by real-time PCR on 10 genes out of 11 tested. Conclusions MAXRS appears to be an appropriate method for gene expression analysis under heterologous hybridization conditions. PMID:20509979
A Human Lectin Microarray for Sperm Surface Glycosylation Analysis *
Sun, Yangyang; Cheng, Li; Gu, Yihua; Xin, Aijie; Wu, Bin; Zhou, Shumin; Guo, Shujuan; Liu, Yin; Diao, Hua; Shi, Huijuan; Wang, Guangyu; Tao, Sheng-ce
2016-01-01
Glycosylation is one of the most abundant and functionally important protein post-translational modifications. As such, technology for efficient glycosylation analysis is in high demand. Lectin microarrays are a powerful tool for such investigations and have been successfully applied for a variety of glycobiological studies. However, most of the current lectin microarrays are primarily constructed from plant lectins, which are not well suited for studies of human glycosylation because of the extreme complexity of human glycans. Herein, we constructed a human lectin microarray with 60 human lectin and lectin-like proteins. All of the lectins and lectin-like proteins were purified from yeast, and most showed binding to human glycans. To demonstrate the applicability of the human lectin microarray, human sperm were probed on the microarray and strong bindings were observed for several lectins, including galectin-1, 7, 8, GalNAc-T6, and ERGIC-53 (LMAN1). These bindings were validated by flow cytometry and fluorescence immunostaining. Further, mass spectrometry analysis showed that galectin-1 binds several membrane-associated proteins including heat shock protein 90. Finally, functional assays showed that binding of galectin-8 could significantly enhance the acrosome reaction within human sperms. To our knowledge, this is the first construction of a human lectin microarray, and we anticipate it will find wide use for a range of human or mammalian studies, alone or in combination with plant lectin microarrays. PMID:27364157
Spaceflight Alters Bacterial Gene Expression and Virulence and Reveals Role for Global Regulator Hfq
NASA Technical Reports Server (NTRS)
Wilson, J. W.; Ott, C. M.; zuBentrup, K. Honer; Ramamurthy R.; Quick, L.; Porwollik, S.; Cheng, P.; McClellan, M.; Tsaprailis, G.; Radabaugh, T.;
2007-01-01
A comprehensive analysis of both the molecular genetic and phenotypic responses of any organism to the spaceflight environment has never been accomplished due to significant technological and logistical hurdles. Moreover, the effects of spaceflight on microbial pathogenicity and associated infectious disease risks have not been studied. The bacterial pathogen Salmonella typhimurium was grown aboard Space Shuttle mission STS-115 and compared to identical ground control cultures. Global microarray and proteomic analyses revealed 167 transcripts and 73 proteins changed expression with the conserved RNA-binding protein Hfq identified as a likely global regulator involved in the response to this environment. Hfq involvement was confirmed with a ground based microgravity culture model. Spaceflight samples exhibited enhanced virulence in a murine infection model and extracellular matrix accumulation consistent with a biofilm. Strategies to target Hfq and related regulators could potentially decrease infectious disease risks during spaceflight missions and provide novel therapeutic options on Earth.
Yuan, Haiming; Meng, Zhe; Zhang, Lina; Luo, Xiangyang; Liu, Liping; Chen, Mengfan; Li, Xinwei; Zhao, Weiwei; Liang, Liyang
2016-01-01
Interstitial duplications distal to 15q13 are very rare. Here, we reported a 14-year-old boy with severe short stature, delayed bone age, hypogonadism, global developmental delay and intellectual disability. His had distinctive facial features including macrocephaly, broad forehead, deep-set and widely spaced eyes, broad nose bridge, shallow philtrum and thick lips. A de novo 6.4 Mb interstitial duplication of 15q15.3q21.2 was detected by chromosomal microarray analysis. We compared our patient's clinical phenotypes with those of several individuals with overlapping duplications and several candidate genes responsible for the phenotypes were identified as well. The results suggest a novel contiguous gene duplication syndrome characterized with shared features including short stature, hypogonadism, global developmental delay and other congenital anomalies.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Gentry, T.; Schadt, C.; Zhou, J.
Microarray technology has the unparalleled potential tosimultaneously determine the dynamics and/or activities of most, if notall, of the microbial populations in complex environments such as soilsand sediments. Researchers have developed several types of arrays thatcharacterize the microbial populations in these samples based on theirphylogenetic relatedness or functional genomic content. Several recentstudies have used these microarrays to investigate ecological issues;however, most have only analyzed a limited number of samples withrelatively few experiments utilizing the full high-throughput potentialof microarray analysis. This is due in part to the unique analyticalchallenges that these samples present with regard to sensitivity,specificity, quantitation, and data analysis. Thismore » review discussesspecific applications of microarrays to microbial ecology research alongwith some of the latest studies addressing the difficulties encounteredduring analysis of complex microbial communities within environmentalsamples. With continued development, microarray technology may ultimatelyachieve its potential for comprehensive, high-throughput characterizationof microbial populations in near real-time.« less
Fully automated analysis of multi-resolution four-channel micro-array genotyping data
NASA Astrophysics Data System (ADS)
Abbaspour, Mohsen; Abugharbieh, Rafeef; Podder, Mohua; Tebbutt, Scott J.
2006-03-01
We present a fully-automated and robust microarray image analysis system for handling multi-resolution images (down to 3-micron with sizes up to 80 MBs per channel). The system is developed to provide rapid and accurate data extraction for our recently developed microarray analysis and quality control tool (SNP Chart). Currently available commercial microarray image analysis applications are inefficient, due to the considerable user interaction typically required. Four-channel DNA microarray technology is a robust and accurate tool for determining genotypes of multiple genetic markers in individuals. It plays an important role in the state of the art trend where traditional medical treatments are to be replaced by personalized genetic medicine, i.e. individualized therapy based on the patient's genetic heritage. However, fast, robust, and precise image processing tools are required for the prospective practical use of microarray-based genetic testing for predicting disease susceptibilities and drug effects in clinical practice, which require a turn-around timeline compatible with clinical decision-making. In this paper we have developed a fully-automated image analysis platform for the rapid investigation of hundreds of genetic variations across multiple genes. Validation tests indicate very high accuracy levels for genotyping results. Our method achieves a significant reduction in analysis time, from several hours to just a few minutes, and is completely automated requiring no manual interaction or guidance.
Ko, Kwan Soo; Park, Sulhee; Oh, Won Sup; Suh, Ji-Yoeun; Oh, Taejeong; Ahn, Sungwhan; Chun, Jongsik; Song, Jae-Hoon
2006-02-28
The global pattern of growth-dependent gene expres-sion in Streptococcus pneumoniae strains was evalu-ated using a high-density DNA microarray. Total RNAs obtained from an avirulent S. pneumoniae strain R6 and a virulent strain AMC96-6 were used to compare the expression patterns at seven time points (2.5, 3.5, 4.5, 5.5, 6.0, 6.5, and 8.0 h). The expression profile of strain R6 changed between log and station-ary growth (the Log-Stat switch). There were clear differences between the growth-dependent gene ex-pression profiles of the virulent and avirulent pneumo-coccal strains in 367 of 1,112 genes. Transcripts of genes associated with bacterial competence and capsular polysaccharide formation, as well as clpP and cbpA, were higher in the virulent strain. Our data suggest that late log or early stationary phase may be the most virulent phase of S. pneumoniae.
2010-01-01
Background Burkholderia pseudomallei is the causative agent of melioidosis where the highest reported incidence world wide is in the Northeast of Thailand, where saline soil and water are prevalent. Moreover, recent reports indicate a potential pathogenic role for B. pseudomallei in cystic fibrosis lung disease, where an increased sodium chloride (NaCl) concentration in airway surface liquid has been proposed. These observations raise the possibility that high salinity may represent a favorable niche for B. pseudomallei. We therefore investigated the global transcriptional response of B. pseudomallei to increased salinity using microarray analysis. Results Transcriptome analysis of B. pseudomallei under salt stress revealed several genes significantly up-regulated in the presence of 320 mM NaCl including genes associated with the bsa-derived Type III secretion system (T3SS). Microarray data were verified by reverse transcriptase-polymerase chain reactions (RT-PCR). Western blot analysis confirmed the increased expression and secretion of the invasion-associated type III secreted proteins BipD and BopE in B. pseudomallei cultures at 170 and 320 mM NaCl relative to salt-free medium. Furthermore, salt-treated B. pseudomallei exhibited greater invasion efficiency into the lung epithelial cell line A549 in a manner partly dependent on a functional Bsa system. Conclusions B. pseudomallei responds to salt stress by modulating the transcription of a relatively small set of genes, among which is the bsa locus associated with invasion and virulence. Expression and secretion of Bsa-secreted proteins was elevated in the presence of exogenous salt and the invasion efficiency was enhanced. Our data indicate that salinity has the potential to influence the virulence of B. pseudomallei. PMID:20540813
PRACTICAL STRATEGIES FOR PROCESSING AND ANALYZING SPOTTED OLIGONUCLEOTIDE MICROARRAY DATA
Thoughtful data analysis is as important as experimental design, biological sample quality, and appropriate experimental procedures for making microarrays a useful supplement to traditional toxicology. In the present study, spotted oligonucleotide microarrays were used to profile...
2011-01-01
Background Sporadic amyotrophic lateral sclerosis (sALS) is a motor neuron disease with poorly understood etiology. Results of gene expression profiling studies of whole blood from ALS patients have not been validated and are difficult to relate to ALS pathogenesis because gene expression profiles depend on the relative abundance of the different cell types present in whole blood. We conducted microarray analyses using Agilent Human Whole Genome 4 × 44k Arrays on a more homogeneous cell population, namely purified peripheral blood lymphocytes (PBLs), from ALS patients and healthy controls to identify molecular signatures possibly relevant to ALS pathogenesis. Methods Differentially expressed genes were determined by LIMMA (Linear Models for MicroArray) and SAM (Significance Analysis of Microarrays) analyses. The SAFE (Significance Analysis of Function and Expression) procedure was used to identify molecular pathway perturbations. Proteasome inhibition assays were conducted on cultured peripheral blood mononuclear cells (PBMCs) from ALS patients to confirm alteration of the Ubiquitin/Proteasome System (UPS). Results For the first time, using SAFE in a global gene ontology analysis (gene set size 5-100), we show significant perturbation of the KEGG (Kyoto Encyclopedia of Genes and Genomes) ALS pathway of motor neuron degeneration in PBLs from ALS patients. This was the only KEGG disease pathway significantly upregulated among 25, and contributing genes, including SOD1, represented 54% of the encoded proteins or protein complexes of the KEGG ALS pathway. Further SAFE analysis, including gene set sizes >100, showed that only neurodegenerative diseases (4 out of 34 disease pathways) including ALS were significantly upregulated. Changes in UBR2 expression correlated inversely with time since onset of disease and directly with ALSFRS-R, implying that UBR2 was increased early in the course of ALS. Cultured PBMCs from ALS patients accumulated more ubiquitinated proteins than PBMCs from healthy controls in a serum-dependent manner confirming changes in this pathway. Conclusions Our study indicates that PBLs from sALS patients are strong responders to systemic signals or local signals acquired by cell trafficking, representing changes in gene expression similar to those present in brain and spinal cord of sALS patients. PBLs may provide a useful means to study ALS pathogenesis. PMID:22027401
[Oligonucleotide microarray for subtyping avian influenza virus].
Xueqing, Han; Xiangmei, Lin; Yihong, Hou; Shaoqiang, Wu; Jian, Liu; Lin, Mei; Guangle, Jia; Zexiao, Yang
2008-09-01
Avian influenza viruses are important human and animal respiratory pathogens and rapid diagnosis of novel emerging avian influenza viruses is vital for effective global influenza surveillance. We developed an oligonucleotide microarray-based method for subtyping all avian influenza virus (16 HA and 9 NA subtypes). In total 25 pairs of primers specific for different subtypes and 1 pair of universal primers were carefully designed based on the genomic sequences of influenza A viruses retrieved from GenBank database. Several multiplex RT-PCR methods were then developed, and the target cDNAs of 25 subtype viruses were amplified by RT-PCR or overlapping PCR for evaluating the microarray. Further 52 oligonucleotide probes specific for all 25 subtype viruses were designed according to published gene sequences of avian influenza viruses in amplified target cDNAs domains, and a microarray for subtyping influenza A virus was developed. Then its specificity and sensitivity were validated by using different subtype strains and 2653 samples from 49 different areas. The results showed that all the subtypes of influenza virus could be identified simultaneously on this microarray with high sensitivity, which could reach to 2.47 pfu/mL virus or 2.5 ng target DNA. Furthermore, there was no cross reaction with other avian respiratory virus. An oligonucleotide microarray-based strategy for detection of avian influenza viruses has been developed. Such a diagnostic microarray will be useful in discovering and identifying all subtypes of avian influenza virus.
Transcription profile of brewery yeast under fermentation conditions.
James, T C; Campbell, S; Donnelly, D; Bond, U
2003-01-01
Yeast strains, used in the brewing industry, experience distinctive physiological conditions. During a brewing fermentation, yeast are exposed to anaerobic conditions, high pressure, high specific gravity and low temperatures. The purpose of this study was to examine the global gene expression profile of yeast subjected to brewing stress. We have carried out a microarray analysis of a typical brewer's yeast during the course of an 8-day fermentation in 15 degrees P wort. We used the probes derived from Saccharomyces cerevisiae genomic DNA on the chip and RNA isolated from three stages of brewing. This analysis shows a high level of expression of genes involved in fatty acid and ergosterol biosynthesis early in fermentation. Furthermore, genes involved in respiration and mitochondrial protein synthesis also show higher levels of expression. Surprisingly, we observed a complete repression of many stress response genes and genes involved in protein synthesis throughout the 8-day period compared with that at the start of fermentation. This microarray data set provides an analysis of gene expression under brewing fermentation conditions. The data provide an insight into the various metabolic processes altered or activated by brewing conditions of growth. This study leads to future experiments whereby selective alterations in brewing conditions could be introduced to take advantage of the changing transcript profile to improve the quality of the brew.
Jones, D L; Petty, J; Hoyle, D C; Hayes, A; Ragni, E; Popolo, L; Oliver, S G; Stateva, L I
2003-12-16
Often changes in gene expression levels have been considered significant only when above/below some arbitrarily chosen threshold. We investigated the effect of applying a purely statistical approach to microarray analysis and demonstrated that small changes in gene expression have biological significance. Whole genome microarray analysis of a pde2Delta mutant, constructed in the Saccharomyces cerevisiae reference strain FY23, revealed altered expression of approximately 11% of protein encoding genes. The mutant, characterized by constitutive activation of the Ras/cAMP pathway, has increased sensitivity to stress, reduced ability to assimilate nonfermentable carbon sources, and some cell wall integrity defects. Applying the Munich Information Centre for Protein Sequences (MIPS) functional categories revealed increased expression of genes related to ribosome biogenesis and downregulation of genes in the cell rescue, defense, cell death and aging category, suggesting a decreased response to stress conditions. A reduced level of gene expression in the unfolded protein response pathway (UPR) was observed. Cell wall genes whose expression was affected by this mutation were also identified. Several of the cAMP-responsive orphan genes, upon further investigation, revealed cell wall functions; others had previously unidentified phenotypes assigned to them. This investigation provides a statistical global transcriptome analysis of the cellular response to constitutive activation of the Ras/cAMP pathway.
GeneXplorer: an interactive web application for microarray data visualization and analysis.
Rees, Christian A; Demeter, Janos; Matese, John C; Botstein, David; Sherlock, Gavin
2004-10-01
When publishing large-scale microarray datasets, it is of great value to create supplemental websites where either the full data, or selected subsets corresponding to figures within the paper, can be browsed. We set out to create a CGI application containing many of the features of some of the existing standalone software for the visualization of clustered microarray data. We present GeneXplorer, a web application for interactive microarray data visualization and analysis in a web environment. GeneXplorer allows users to browse a microarray dataset in an intuitive fashion. It provides simple access to microarray data over the Internet and uses only HTML and JavaScript to display graphic and annotation information. It provides radar and zoom views of the data, allows display of the nearest neighbors to a gene expression vector based on their Pearson correlations and provides the ability to search gene annotation fields. The software is released under the permissive MIT Open Source license, and the complete documentation and the entire source code are freely available for download from CPAN http://search.cpan.org/dist/Microarray-GeneXplorer/.
The tissue microarray OWL schema: An open-source tool for sharing tissue microarray data
Kang, Hyunseok P.; Borromeo, Charles D.; Berman, Jules J.; Becich, Michael J.
2010-01-01
Background: Tissue microarrays (TMAs) are enormously useful tools for translational research, but incompatibilities in database systems between various researchers and institutions prevent the efficient sharing of data that could help realize their full potential. Resource Description Framework (RDF) provides a flexible method to represent knowledge in triples, which take the form Subject-Predicate-Object. All data resources are described using Uniform Resource Identifiers (URIs), which are global in scope. We present an OWL (Web Ontology Language) schema that expands upon the TMA data exchange specification to address this issue and assist in data sharing and integration. Methods: A minimal OWL schema was designed containing only concepts specific to TMA experiments. More general data elements were incorporated from predefined ontologies such as the NCI thesaurus. URIs were assigned using the Linked Data format. Results: We present examples of files utilizing the schema and conversion of XML data (similar to the TMA DES) to OWL. Conclusion: By utilizing predefined ontologies and global unique identifiers, this OWL schema provides a solution to the limitations of XML, which represents concepts defined in a localized setting. This will help increase the utilization of tissue resources, facilitating collaborative translational research efforts. PMID:20805954
Hatazawa, Yukino; Minami, Kimiko; Yoshimura, Ryoji; Onishi, Takumi; Manio, Mark Christian; Inoue, Kazuo; Sawada, Naoki; Suzuki, Osamu; Miura, Shinji; Kamei, Yasutomi
2016-12-09
The expression of the transcriptional coactivator PGC1α is increased in skeletal muscles during exercise. Previously, we showed that increased PGC1α leads to prolonged exercise performance (the duration for which running can be continued) and, at the same time, increases the expression of branched-chain amino acid (BCAA) metabolism-related enzymes and genes that are involved in supplying substrates for the TCA cycle. We recently created mice with PGC1α knockout specifically in the skeletal muscles (PGC1α KO mice), which show decreased mitochondrial content. In this study, global gene expression (microarray) analysis was performed in the skeletal muscles of PGC1α KO mice compared with that of wild-type control mice. As a result, decreased expression of genes involved in the TCA cycle, oxidative phosphorylation, and BCAA metabolism were observed. Compared with previously obtained microarray data on PGC1α-overexpressing transgenic mice, each gene showed the completely opposite direction of expression change. Bioinformatic analysis of the promoter region of genes with decreased expression in PGC1α KO mice predicted the involvement of several transcription factors, including a nuclear receptor, ERR, in their regulation. As PGC1α KO microarray data in this study show opposing findings to the PGC1α transgenic data, a loss-of-function experiment, as well as a gain-of-function experiment, revealed PGC1α's function in the oxidative energy metabolism of skeletal muscles. Copyright © 2016 Elsevier Inc. All rights reserved.
Le, Mai Q; Pagter, Majken; Hincha, Dirk K
2015-01-01
During cold acclimation plants increase in freezing tolerance in response to low non-freezing temperatures. This is accompanied by many physiological, biochemical and molecular changes that have been extensively investigated. In addition, plants of many species, including Arabidopsis thaliana, become more freezing tolerant during exposure to mild, non-damaging sub-zero temperatures after cold acclimation. There is hardly any information available about the molecular basis of this adaptation. Here, we have used microarrays and a qRT-PCR primer platform covering 1,880 genes encoding transcription factors (TFs) to monitor changes in gene expression in the Arabidopsis accessions Columbia-0, Rschew and Tenela during the first 3 days of sub-zero acclimation at -3 °C. The results indicate that gene expression during sub-zero acclimation follows a tighly controlled time-course. Especially AP2/EREBP and WRKY TFs may be important regulators of sub-zero acclimation, although the CBF signal transduction pathway seems to be less important during sub-zero than during cold acclimation. Globally, we estimate that approximately 5% of all Arabidopsis genes are regulated during sub-zero acclimation. Particularly photosynthesis-related genes are down-regulated and genes belonging to the functional classes of cell wall biosynthesis, hormone metabolism and RNA regulation of transcription are up-regulated. Collectively, these data provide the first global analysis of gene expression during sub-zero acclimation and allow the identification of candidate genes for forward and reverse genetic studies into the molecular mechanisms of sub-zero acclimation.
Nanotechnology: moving from microarrays toward nanoarrays.
Chen, Hua; Li, Jun
2007-01-01
Microarrays are important tools for high-throughput analysis of biomolecules. The use of microarrays for parallel screening of nucleic acid and protein profiles has become an industry standard. A few limitations of microarrays are the requirement for relatively large sample volumes and elongated incubation time, as well as the limit of detection. In addition, traditional microarrays make use of bulky instrumentation for the detection, and sample amplification and labeling are quite laborious, which increase analysis cost and delays the time for obtaining results. These problems limit microarray techniques from point-of-care and field applications. One strategy for overcoming these problems is to develop nanoarrays, particularly electronics-based nanoarrays. With further miniaturization, higher sensitivity, and simplified sample preparation, nanoarrays could potentially be employed for biomolecular analysis in personal healthcare and monitoring of trace pathogens. In this chapter, it is intended to introduce the concept and advantage of nanotechnology and then describe current methods and protocols for novel nanoarrays in three aspects: (1) label-free nucleic acids analysis using nanoarrays, (2) nanoarrays for protein detection by conventional optical fluorescence microscopy as well as by novel label-free methods such as atomic force microscopy, and (3) nanoarray for enzymatic-based assay. These nanoarrays will have significant applications in drug discovery, medical diagnosis, genetic testing, environmental monitoring, and food safety inspection.
A meta-data based method for DNA microarray imputation.
Jörnsten, Rebecka; Ouyang, Ming; Wang, Hui-Yu
2007-03-29
DNA microarray experiments are conducted in logical sets, such as time course profiling after a treatment is applied to the samples, or comparisons of the samples under two or more conditions. Due to cost and design constraints of spotted cDNA microarray experiments, each logical set commonly includes only a small number of replicates per condition. Despite the vast improvement of the microarray technology in recent years, missing values are prevalent. Intuitively, imputation of missing values is best done using many replicates within the same logical set. In practice, there are few replicates and thus reliable imputation within logical sets is difficult. However, it is in the case of few replicates that the presence of missing values, and how they are imputed, can have the most profound impact on the outcome of downstream analyses (e.g. significance analysis and clustering). This study explores the feasibility of imputation across logical sets, using the vast amount of publicly available microarray data to improve imputation reliability in the small sample size setting. We download all cDNA microarray data of Saccharomyces cerevisiae, Arabidopsis thaliana, and Caenorhabditis elegans from the Stanford Microarray Database. Through cross-validation and simulation, we find that, for all three species, our proposed imputation using data from public databases is far superior to imputation within a logical set, sometimes to an astonishing degree. Furthermore, the imputation root mean square error for significant genes is generally a lot less than that of non-significant ones. Since downstream analysis of significant genes, such as clustering and network analysis, can be very sensitive to small perturbations of estimated gene effects, it is highly recommended that researchers apply reliable data imputation prior to further analysis. Our method can also be applied to cDNA microarray experiments from other species, provided good reference data are available.
2010-01-01
Background The development of DNA microarrays has facilitated the generation of hundreds of thousands of transcriptomic datasets. The use of a common reference microarray design allows existing transcriptomic data to be readily compared and re-analysed in the light of new data, and the combination of this design with large datasets is ideal for 'systems'-level analyses. One issue is that these datasets are typically collected over many years and may be heterogeneous in nature, containing different microarray file formats and gene array layouts, dye-swaps, and showing varying scales of log2- ratios of expression between microarrays. Excellent software exists for the normalisation and analysis of microarray data but many data have yet to be analysed as existing methods struggle with heterogeneous datasets; options include normalising microarrays on an individual or experimental group basis. Our solution was to develop the Batch Anti-Banana Algorithm in R (BABAR) algorithm and software package which uses cyclic loess to normalise across the complete dataset. We have already used BABAR to analyse the function of Salmonella genes involved in the process of infection of mammalian cells. Results The only input required by BABAR is unprocessed GenePix or BlueFuse microarray data files. BABAR provides a combination of 'within' and 'between' microarray normalisation steps and diagnostic boxplots. When applied to a real heterogeneous dataset, BABAR normalised the dataset to produce a comparable scaling between the microarrays, with the microarray data in excellent agreement with RT-PCR analysis. When applied to a real non-heterogeneous dataset and a simulated dataset, BABAR's performance in identifying differentially expressed genes showed some benefits over standard techniques. Conclusions BABAR is an easy-to-use software tool, simplifying the simultaneous normalisation of heterogeneous two-colour common reference design cDNA microarray-based transcriptomic datasets. We show BABAR transforms real and simulated datasets to allow for the correct interpretation of these data, and is the ideal tool to facilitate the identification of differentially expressed genes or network inference analysis from transcriptomic datasets. PMID:20128918
Genome-wide analysis of the heat stress response in Zebu (Sahiwal) cattle.
Mehla, Kusum; Magotra, Ankit; Choudhary, Jyoti; Singh, A K; Mohanty, A K; Upadhyay, R C; Srinivasan, Surendran; Gupta, Pankaj; Choudhary, Neelam; Antony, Bristo; Khan, Farheen
2014-01-10
Environmental-induced hyperthermia compromises animal production with drastic economic consequences to global animal agriculture and jeopardizes animal welfare. Heat stress is a major stressor that occurs as a result of an imbalance between heat production within the body and its dissipation and it affects animals at cellular, molecular and ecological levels. The molecular mechanism underlying the physiology of heat stress in the cattle remains undefined. The present study sought to evaluate mRNA expression profiles in the cattle blood in response to heat stress. In this study we report the genes that were differentially expressed in response to heat stress using global scale genome expression technology (Microarray). Four Sahiwal heifers were exposed to 42°C with 90% humidity for 4h followed by normothermia. Gene expression changes include activation of heat shock transcription factor 1 (HSF1), increased expression of heat shock proteins (HSP) and decreased expression and synthesis of other proteins, immune system activation via extracellular secretion of HSP. A cDNA microarray analysis found 140 transcripts to be up-regulated and 77 down-regulated in the cattle blood after heat treatment (P<0.05). But still a comprehensive explanation for the direction of fold change and the specific genes involved in response to acute heat stress still remains to be explored. These findings may provide insights into the underlying mechanism of physiology of heat stress in cattle. Understanding the biology and mechanisms of heat stress is critical to developing approaches to ameliorate current production issues for improving animal performance and agriculture economics. © 2013 Elsevier B.V. All rights reserved.
2010-01-01
Background Myxococcus xanthus is a Gram negative bacterium that can differentiate into metabolically quiescent, environmentally resistant spores. Little is known about the mechanisms involved in differentiation in part because sporulation is normally initiated at the culmination of a complex starvation-induced developmental program and only inside multicellular fruiting bodies. To obtain a broad overview of the sporulation process and to identify novel genes necessary for differentiation, we instead performed global transcriptome analysis of an artificial chemically-induced sporulation process in which addition of glycerol to vegetatively growing liquid cultures of M. xanthus leads to rapid and synchronized differentiation of nearly all cells into myxospore-like entities. Results Our analyses identified 1 486 genes whose expression was significantly regulated at least two-fold within four hours of chemical-induced differentiation. Most of the previously identified sporulation marker genes were significantly upregulated. In contrast, most genes that are required to build starvation-induced multicellular fruiting bodies, but which are not required for sporulation per se, were not significantly regulated in our analysis. Analysis of functional gene categories significantly over-represented in the regulated genes, suggested large rearrangements in core metabolic pathways, and in genes involved in protein synthesis and fate. We used the microarray data to identify a novel operon of eight genes that, when mutated, rendered cells unable to produce viable chemical- or starvation-induced spores. Importantly, these mutants displayed no defects in building fruiting bodies, suggesting these genes are necessary for the core sporulation process. Furthermore, during the starvation-induced developmental program, these genes were expressed in fruiting bodies but not in peripheral rods, a subpopulation of developing cells which do not sporulate. Conclusions These results suggest that microarray analysis of chemical-induced spore formation is an excellent system to specifically identify genes necessary for the core sporulation process of a Gram negative model organism for differentiation. PMID:20420673
Dupl'áková, Nikoleta; Renák, David; Hovanec, Patrik; Honysová, Barbora; Twell, David; Honys, David
2007-07-23
Microarray technologies now belong to the standard functional genomics toolbox and have undergone massive development leading to increased genome coverage, accuracy and reliability. The number of experiments exploiting microarray technology has markedly increased in recent years. In parallel with the rapid accumulation of transcriptomic data, on-line analysis tools are being introduced to simplify their use. Global statistical data analysis methods contribute to the development of overall concepts about gene expression patterns and to query and compose working hypotheses. More recently, these applications are being supplemented with more specialized products offering visualization and specific data mining tools. We present a curated gene family-oriented gene expression database, Arabidopsis Gene Family Profiler (aGFP; http://agfp.ueb.cas.cz), which gives the user access to a large collection of normalised Affymetrix ATH1 microarray datasets. The database currently contains NASC Array and AtGenExpress transcriptomic datasets for various tissues at different developmental stages of wild type plants gathered from nearly 350 gene chips. The Arabidopsis GFP database has been designed as an easy-to-use tool for users needing an easily accessible resource for expression data of single genes, pre-defined gene families or custom gene sets, with the further possibility of keyword search. Arabidopsis Gene Family Profiler presents a user-friendly web interface using both graphic and text output. Data are stored at the MySQL server and individual queries are created in PHP script. The most distinguishable features of Arabidopsis Gene Family Profiler database are: 1) the presentation of normalized datasets (Affymetrix MAS algorithm and calculation of model-based gene-expression values based on the Perfect Match-only model); 2) the choice between two different normalization algorithms (Affymetrix MAS4 or MAS5 algorithms); 3) an intuitive interface; 4) an interactive "virtual plant" visualizing the spatial and developmental expression profiles of both gene families and individual genes. Arabidopsis GFP gives users the possibility to analyze current Arabidopsis developmental transcriptomic data starting with simple global queries that can be expanded and further refined to visualize comparative and highly selective gene expression profiles.
van Huet, Ramon A. C.; Pierrache, Laurence H.M.; Meester-Smoor, Magda A.; Klaver, Caroline C.W.; van den Born, L. Ingeborgh; Hoyng, Carel B.; de Wijs, Ilse J.; Collin, Rob W. J.; Hoefsloot, Lies H.
2015-01-01
Purpose To determine the efficacy of multiple versions of a commercially available arrayed primer extension (APEX) microarray chip for autosomal recessive retinitis pigmentosa (arRP). Methods We included 250 probands suspected of arRP who were genetically analyzed with the APEX microarray between January 2008 and November 2013. The mode of inheritance had to be autosomal recessive according to the pedigree (including isolated cases). If the microarray identified a heterozygous mutation, we performed Sanger sequencing of exons and exon–intron boundaries of that specific gene. The efficacy of this microarray chip with the additional Sanger sequencing approach was determined by the percentage of patients that received a molecular diagnosis. We also collected data from genetic tests other than the APEX analysis for arRP to provide a detailed description of the molecular diagnoses in our study cohort. Results The APEX microarray chip for arRP identified the molecular diagnosis in 21 (8.5%) of the patients in our cohort. Additional Sanger sequencing yielded a second mutation in 17 patients (6.8%), thereby establishing the molecular diagnosis. In total, 38 patients (15.2%) received a molecular diagnosis after analysis using the microarray and additional Sanger sequencing approach. Further genetic analyses after a negative result of the arRP microarray (n = 107) resulted in a molecular diagnosis of arRP (n = 23), autosomal dominant RP (n = 5), X-linked RP (n = 2), and choroideremia (n = 1). Conclusions The efficacy of the commercially available APEX microarray chips for arRP appears to be low, most likely caused by the limitations of this technique and the genetic and allelic heterogeneity of RP. Diagnostic yields up to 40% have been reported for next-generation sequencing (NGS) techniques that, as expected, thereby outperform targeted APEX analysis. PMID:25999674
Zenoni, Sara; D'Agostino, Nunzio; Tornielli, Giovanni B; Quattrocchio, Francesca; Chiusano, Maria L; Koes, Ronald; Zethof, Jan; Guzzo, Flavia; Delledonne, Massimo; Frusciante, Luigi; Gerats, Tom; Pezzotti, Mario
2011-10-01
Petunia is an excellent model system, especially for genetic, physiological and molecular studies. Thus far, however, genome-wide expression analysis has been applied rarely because of the lack of sequence information. We applied next-generation sequencing to generate, through de novo read assembly, a large catalogue of transcripts for Petunia axillaris and Petunia inflata. On the basis of both transcriptomes, comprehensive microarray chips for gene expression analysis were established and used for the analysis of global- and organ-specific gene expression in Petunia axillaris and Petunia inflata and to explore the molecular basis of the seed coat defects in a Petunia hybrida mutant, anthocyanin 11 (an11), lacking a WD40-repeat (WDR) transcription regulator. Among the transcripts differentially expressed in an11 seeds compared with wild type, many expected targets of AN11 were found but also several interesting new candidates that might play a role in morphogenesis of the seed coat. Our results validate the combination of next-generation sequencing with microarray analyses strategies to identify the transcriptome of two petunia species without previous knowledge of their genome, and to develop comprehensive chips as useful tools for the analysis of gene expression in P. axillaris, P. inflata and P. hybrida. © 2011 The Authors. The Plant Journal © 2011 Blackwell Publishing Ltd.
Scholten, Johannes C M; Culley, David E; Nie, Lei; Munn, Kyle J; Chow, Lely; Brockman, Fred J; Zhang, Weiwen
2007-06-29
The application of DNA microarray technology to investigate multiple-species microbial communities presents great challenges. In this study, we reported the design and quality assessment of four whole genome oligonucleotide microarrays for two syntroph bacteria, Desulfovibrio vulgaris and Syntrophobacter fumaroxidans, and two archaeal methanogens, Methanosarcina barkeri, and Methanospirillum hungatei, and their application to analyze global gene expression in a four-species microbial community in response to oxidative stress. In order to minimize the possibility of cross-hybridization, cross-genome comparison was performed to assure all probes unique to each genome so that the microarrays could provide species-level resolution. Microarray quality was validated by the good reproducibility of experimental measurements of multiple biological and analytical replicates. This study showed that S. fumaroxidans and M. hungatei responded to the oxidative stress with up-regulation of several genes known to be involved in reactive oxygen species (ROS) detoxification, such as catalase and rubrerythrin in S. fumaroxidans and thioredoxin and heat shock protein Hsp20 in M. hungatei. However, D. vulgaris seemed to be less sensitive to the oxidative stress as a member of a four-species community, since no gene involved in ROS detoxification was up-regulated. Our work demonstrated the successful application of microarrays to a multiple-species microbial community, and our preliminary results indicated that this approach could provide novel insights on the metabolism within microbial communities.
Principles of gene microarray data analysis.
Mocellin, Simone; Rossi, Carlo Riccardo
2007-01-01
The development of several gene expression profiling methods, such as comparative genomic hybridization (CGH), differential display, serial analysis of gene expression (SAGE), and gene microarray, together with the sequencing of the human genome, has provided an opportunity to monitor and investigate the complex cascade of molecular events leading to tumor development and progression. The availability of such large amounts of information has shifted the attention of scientists towards a nonreductionist approach to biological phenomena. High throughput technologies can be used to follow changing patterns of gene expression over time. Among them, gene microarray has become prominent because it is easier to use, does not require large-scale DNA sequencing, and allows for the parallel quantification of thousands of genes from multiple samples. Gene microarray technology is rapidly spreading worldwide and has the potential to drastically change the therapeutic approach to patients affected with tumor. Therefore, it is of paramount importance for both researchers and clinicians to know the principles underlying the analysis of the huge amount of data generated with microarray technology.
Schönmann, Susan; Loy, Alexander; Wimmersberger, Céline; Sobek, Jens; Aquino, Catharine; Vandamme, Peter; Frey, Beat; Rehrauer, Hubert; Eberl, Leo
2009-04-01
For cultivation-independent and highly parallel analysis of members of the genus Burkholderia, an oligonucleotide microarray (phylochip) consisting of 131 hierarchically nested 16S rRNA gene-targeted oligonucleotide probes was developed. A novel primer pair was designed for selective amplification of a 1.3 kb 16S rRNA gene fragment of Burkholderia species prior to microarray analysis. The diagnostic performance of the microarray for identification and differentiation of Burkholderia species was tested with 44 reference strains of the genera Burkholderia, Pandoraea, Ralstonia and Limnobacter. Hybridization patterns based on presence/absence of probe signals were interpreted semi-automatically using the novel likelihood-based strategy of the web-tool Phylo- Detect. Eighty-eight per cent of the reference strains were correctly identified at the species level. The evaluated microarray was applied to investigate shifts in the Burkholderia community structure in acidic forest soil upon addition of cadmium, a condition that selected for Burkholderia species. The microarray results were in agreement with those obtained from phylogenetic analysis of Burkholderia 16S rRNA gene sequences recovered from the same cadmiumcontaminated soil, demonstrating the value of the Burkholderia phylochip for determinative and environmental studies.
Support vector machine and principal component analysis for microarray data classification
NASA Astrophysics Data System (ADS)
Astuti, Widi; Adiwijaya
2018-03-01
Cancer is a leading cause of death worldwide although a significant proportion of it can be cured if it is detected early. In recent decades, technology called microarray takes an important role in the diagnosis of cancer. By using data mining technique, microarray data classification can be performed to improve the accuracy of cancer diagnosis compared to traditional techniques. The characteristic of microarray data is small sample but it has huge dimension. Since that, there is a challenge for researcher to provide solutions for microarray data classification with high performance in both accuracy and running time. This research proposed the usage of Principal Component Analysis (PCA) as a dimension reduction method along with Support Vector Method (SVM) optimized by kernel functions as a classifier for microarray data classification. The proposed scheme was applied on seven data sets using 5-fold cross validation and then evaluation and analysis conducted on term of both accuracy and running time. The result showed that the scheme can obtained 100% accuracy for Ovarian and Lung Cancer data when Linear and Cubic kernel functions are used. In term of running time, PCA greatly reduced the running time for every data sets.
Experimental Approaches to Microarray Analysis of Tumor Samples
ERIC Educational Resources Information Center
Furge, Laura Lowe; Winter, Michael B.; Meyers, Jacob I.; Furge, Kyle A.
2008-01-01
Comprehensive measurement of gene expression using high-density nucleic acid arrays (i.e. microarrays) has become an important tool for investigating the molecular differences in clinical and research samples. Consequently, inclusion of discussion in biochemistry, molecular biology, or other appropriate courses of microarray technologies has…
Multiplex cDNA quantification method that facilitates the standardization of gene expression data
Gotoh, Osamu; Murakami, Yasufumi; Suyama, Akira
2011-01-01
Microarray-based gene expression measurement is one of the major methods for transcriptome analysis. However, current microarray data are substantially affected by microarray platforms and RNA references because of the microarray method can provide merely the relative amounts of gene expression levels. Therefore, valid comparisons of the microarray data require standardized platforms, internal and/or external controls and complicated normalizations. These requirements impose limitations on the extensive comparison of gene expression data. Here, we report an effective approach to removing the unfavorable limitations by measuring the absolute amounts of gene expression levels on common DNA microarrays. We have developed a multiplex cDNA quantification method called GEP-DEAN (Gene expression profiling by DCN-encoding-based analysis). The method was validated by using chemically synthesized DNA strands of known quantities and cDNA samples prepared from mouse liver, demonstrating that the absolute amounts of cDNA strands were successfully measured with a sensitivity of 18 zmol in a highly multiplexed manner in 7 h. PMID:21415008
Spot detection and image segmentation in DNA microarray data.
Qin, Li; Rueda, Luis; Ali, Adnan; Ngom, Alioune
2005-01-01
Following the invention of microarrays in 1994, the development and applications of this technology have grown exponentially. The numerous applications of microarray technology include clinical diagnosis and treatment, drug design and discovery, tumour detection, and environmental health research. One of the key issues in the experimental approaches utilising microarrays is to extract quantitative information from the spots, which represent genes in a given experiment. For this process, the initial stages are important and they influence future steps in the analysis. Identifying the spots and separating the background from the foreground is a fundamental problem in DNA microarray data analysis. In this review, we present an overview of state-of-the-art methods for microarray image segmentation. We discuss the foundations of the circle-shaped approach, adaptive shape segmentation, histogram-based methods and the recently introduced clustering-based techniques. We analytically show that clustering-based techniques are equivalent to the one-dimensional, standard k-means clustering algorithm that utilises the Euclidean distance.
Split-plot microarray experiments: issues of design, power and sample size.
Tsai, Pi-Wen; Lee, Mei-Ling Ting
2005-01-01
This article focuses on microarray experiments with two or more factors in which treatment combinations of the factors corresponding to the samples paired together onto arrays are not completely random. A main effect of one (or more) factor(s) is confounded with arrays (the experimental blocks). This is called a split-plot microarray experiment. We utilise an analysis of variance (ANOVA) model to assess differentially expressed genes for between-array and within-array comparisons that are generic under a split-plot microarray experiment. Instead of standard t- or F-test statistics that rely on mean square errors of the ANOVA model, we use a robust method, referred to as 'a pooled percentile estimator', to identify genes that are differentially expressed across different treatment conditions. We illustrate the design and analysis of split-plot microarray experiments based on a case application described by Jin et al. A brief discussion of power and sample size for split-plot microarray experiments is also presented.
Bruno, D L; Ganesamoorthy, D; Schoumans, J; Bankier, A; Coman, D; Delatycki, M; Gardner, R J M; Hunter, M; James, P A; Kannu, P; McGillivray, G; Pachter, N; Peters, H; Rieubland, C; Savarirayan, R; Scheffer, I E; Sheffield, L; Tan, T; White, S M; Yeung, A; Bowman, Z; Ngo, C; Choy, K W; Cacheux, V; Wong, L; Amor, D J; Slater, H R
2009-02-01
Microarray genome analysis is realising its promise for improving detection of genetic abnormalities in individuals with mental retardation and congenital abnormality. Copy number variations (CNVs) are now readily detectable using a variety of platforms and a major challenge is the distinction of pathogenic from ubiquitous, benign polymorphic CNVs. The aim of this study was to investigate replacement of time consuming, locus specific testing for specific microdeletion and microduplication syndromes with microarray analysis, which theoretically should detect all known syndromes with CNV aetiologies as well as new ones. Genome wide copy number analysis was performed on 117 patients using Affymetrix 250K microarrays. 434 CNVs (195 losses and 239 gains) were found, including 18 pathogenic CNVs and 9 identified as "potentially pathogenic". Almost all pathogenic CNVs were larger than 500 kb, significantly larger than the median size of all CNVs detected. Segmental regions of loss of heterozygosity larger than 5 Mb were found in 5 patients. Genome microarray analysis has improved diagnostic success in this group of patients. Several examples of recently discovered "new syndromes" were found suggesting they are more common than previously suspected and collectively are likely to be a major cause of mental retardation. The findings have several implications for clinical practice. The study revealed the potential to make genetic diagnoses that were not evident in the clinical presentation, with implications for pretest counselling and the consent process. The importance of contributing novel CNVs to high quality databases for genotype-phenotype analysis and review of guidelines for selection of individuals for microarray analysis is emphasised.
Bouras, Toula; Southey, Melissa C; Chang, Andy C; Reddel, Roger R; Willhite, Dorian; Glynne, Richard; Henderson, Michael A; Armes, Jane E; Venter, Deon J
2002-03-01
Differences in gene expression are likely to explain the phenotypic variation between hormone-responsive and hormone-unresponsive breast cancers. In this study, DNA microarray analysis of approximately 10,000 known genes and 25,000 expressed sequence tag clusters was performed to identify genes induced by estrogen and repressed by the pure antiestrogen ICI 182 780 in vitro that correlated with estrogen receptor (ER) expression in primary breast carcinomas in vivo. Stanniocalcin (STC) 2 was identified as one of the genes that fulfilled these criteria. DNA microarray hybridization showed a 3-fold induction of STC2 mRNA expression in MCF-7 cells in < or = 3 h of estrogen exposure and a 3-fold repression in the presence of antiestrogen (one-way ANOVA, P < 0.0005). In 13 ER-positive and 12 ER-negative breast carcinomas, the microarray-derived mRNA levels observed for STC2 correlated with tumor ER mRNA (Pearson's correlation, r = 0.85; P < 0.0001) and ER protein status (Spearman's rank correlation, r = 0.73; P < 0.0001). The expression profile of STC2 was further confirmed by in situ hybridization and immunohistochemistry on a larger cohort of 236 unselected breast carcinomas using tissue microarrays. STC2 mRNA and protein expression were found to be associated with tumor ER status (Fisher's exact test, P < 0.005). The related gene, STC1, was also examined and shown to be associated with ER status in breast carcinomas (Fisher's exact test, P < 0.05). This study demonstrates the feasibility of using global gene expression data derived from an in vitro model to pinpoint novel estrogen-responsive genes of potential clinical relevance.
Salinas, Yasmmyn D.; Shi, YiJun; Greenwood, Michael; Hoe, See Ziau; Murphy, David; Gainer, Harold
2015-01-01
Magnocellular neurons (MCNs) in the hypothalamo-neurohypophysial system (HNS) are highly specialized to release large amounts of arginine vasopressin (Avp) or oxytocin (Oxt) into the blood stream and play critical roles in the regulation of body fluid homeostasis. The MCNs are osmosensory neurons and are excited by exposure to hypertonic solutions and inhibited by hypotonic solutions. The MCNs respond to systemic hypertonic and hypotonic stimulation with large changes in the expression of their Avp and Oxt genes, and microarray studies have shown that these osmotic perturbations also cause large changes in global gene expression in the HNS. In this paper, we examine gene expression in the rat supraoptic nucleus (SON) under normosmotic and chronic salt-loading SL) conditions by the first time using “new-generation”, RNA sequencing (RNA-Seq) methods. We reliably detect 9,709 genes as present in the SON by RNA-Seq, and 552 of these genes were changed in expression as a result of chronic SL. These genes reflect diverse functions, and 42 of these are involved in either transcriptional or translational processes. In addition, we compare the SON transcriptomes resolved by RNA-Seq methods with the SON transcriptomes determined by Affymetrix microarray methods in rats under the same osmotic conditions, and find that there are 6,466 genes present in the SON that are represented in both data sets, although 1,040 of the expressed genes were found only in the microarray data, and 2,762 of the expressed genes are selectively found in the RNA-Seq data and not the microarray data. These data provide the research community a comprehensive view of the transcriptome in the SON under normosmotic conditions and the changes in specific gene expression evoked by salt loading. PMID:25897513
DOE Office of Scientific and Technical Information (OSTI.GOV)
He, Zhili; Deng, Ye; Nostrand, Joy Van
2010-05-17
Microarray-based genomic technology has been widely used for microbial community analysis, and it is expected that microarray-based genomic technologies will revolutionize the analysis of microbial community structure, function and dynamics. A new generation of functional gene arrays (GeoChip 3.0) has been developed, with 27,812 probes covering 56,990 gene variants from 292 functional gene families involved in carbon, nitrogen, phosphorus and sulfur cycles, energy metabolism, antibiotic resistance, metal resistance, and organic contaminant degradation. Those probes were derived from 2,744, 140, and 262 species for bacteria, archaea, and fungi, respectively. GeoChip 3.0 has several other distinct features, such as a common oligomore » reference standard (CORS) for data normalization and comparison, a software package for data management and future updating, and the gyrB gene for phylogenetic analysis. Our computational evaluation of probe specificity indicated that all designed probes had a high specificity to their corresponding targets. Also, experimental analysis with synthesized oligonucleotides and genomic DNAs showed that only 0.0036percent-0.025percent false positive rates were observed, suggesting that the designed probes are highly specific under the experimental conditions examined. In addition, GeoChip 3.0 was applied to analyze soil microbial communities in a multifactor grassland ecosystem in Minnesota, USA, which demonstrated that the structure, composition, and potential activity of soil microbial communities significantly changed with the plant species diversity. All results indicate that GeoChip 3.0 is a high throughput powerful tool for studying microbial community functional structure, and linking microbial communities to ecosystem processes and functioning. To our knowledge, GeoChip 3.0 is the most comprehensive microarrays currently available for studying microbial communities associated with geobiochemical cycling, global climate change, bioenergy, agricuture, land use, ecosystem management, environmental cleanup and restoration, bioreactor systems, and human health.« less
Wang, Li-jia; Bai, Yu; Bao, Zhao-shi; Chen, Yan; Yan, Zhuo-hong; Zhang, Wei; Zhang, Quan-geng
2013-01-01
Glioblastoma is the most common and lethal cancer of the central nervous system. Global genomic hypomethylation and some CpG island hypermethylation are common hallmarks of these malignancies, but the effects of these methylation abnormalities on glioblastomas are still largely unclear. Methylation of the O6-methylguanine-DNA methyltransferase promoter is currently an only confirmed molecular predictor of better outcome in temozolomide treatment. To better understand the relationship between CpG island methylation status and patient outcome, this study launched DNA methylation profiles for thirty-three primary glioblastomas (pGBMs) and nine secondary glioblastomas (sGBMs) with the expectation to identify valuable prognostic and therapeutic targets. We evaluated the methylation status of testis derived transcript (TES) gene promoter by microarray analysis of glioblastomas and the prognostic value for TES methylation in the clinical outcome of pGBM patients. Significance analysis of microarrays was used for genes significantly differently methylated between 33 pGBM and nine sGBM. Survival curves were calculated according to the Kaplan-Meier method, and differences between curves were assessed using the log-rank test. Then, we treated glioblastoma cell lines (U87 and U251) with 5-aza-2-deoxycytidines (5-aza-dC) and detected cell biological behaviors. Microarray data analysis identified TES promoter was hypermethylated in pGBMs compared with sGBMs (P < 0.05). Survival curves from the Kaplan-Meier method analysis revealed that the patients with TES hypermethylation had a short overall survival (P < 0.05). This abnormality is also confirmed in glioblastoma cell lines (U87 and U251). Treating these cells with 5-aza-dC released TES protein expression resulted in significant inhibition of cell growth (P = 0.013). Hypermethylation of TES gene promoter highly correlated with worse outcome in pGBM patients. TES might represent a valuable prognostic marker for glioblastoma.
Rode, Tone Mari; Berget, Ingunn; Langsrud, Solveig; Møretrø, Trond; Holck, Askild
2009-07-01
Microorganisms are constantly exposed to new and altered growth conditions, and respond by changing gene expression patterns. Several methods for studying gene expression exist. During the last decade, the analysis of microarrays has been one of the most common approaches applied for large scale gene expression studies. A relatively new method for gene expression analysis is MassARRAY, which combines real competitive-PCR and MALDI-TOF (matrix-assisted laser desorption/ionization time-of-flight) mass spectrometry. In contrast to microarray methods, MassARRAY technology is suitable for analysing a larger number of samples, though for a smaller set of genes. In this study we compare the results from MassARRAY with microarrays on gene expression responses of Staphylococcus aureus exposed to acid stress at pH 4.5. RNA isolated from the same stress experiments was analysed using both the MassARRAY and the microarray methods. The MassARRAY and microarray methods showed good correlation. Both MassARRAY and microarray estimated somewhat lower fold changes compared with quantitative real-time PCR (qRT-PCR). The results confirmed the up-regulation of the urease genes in acidic environments, and also indicated the importance of metal ion regulation. This study shows that the MassARRAY technology is suitable for gene expression analysis in prokaryotes, and has advantages when a set of genes is being analysed for an organism exposed to many different environmental conditions.
Microarray analysis of potential genes in the pathogenesis of recurrent oral ulcer.
Han, Jingying; He, Zhiwei; Li, Kun; Hou, Lu
2015-01-01
Recurrent oral ulcer seriously threatens patients' daily life and health. This study investigated potential genes and pathways that participate in the pathogenesis of recurrent oral ulcer by high throughput bioinformatic analysis. RT-PCR and Western blot were applied to further verify screened interleukins effect. Recurrent oral ulcer related genes were collected from websites and papers, and further found out from Human Genome 280 6.0 microarray data. Each pathway of recurrent oral ulcer related genes were got through chip hybridization. RT-PCR was applied to test four recurrent oral ulcer related genes to verify the microarray data. Data transformation, scatter plot, clustering analysis, and expression pattern analysis were used to analyze recurrent oral ulcer related gene expression changes. Recurrent oral ulcer gene microarray was successfully established. Microarray showed that 551 genes involved in recurrent oral ulcer activity and 196 genes were recurrent oral ulcer related genes. Of them, 76 genes up-regulated, 62 genes down-regulated, and 58 genes up-/down-regulated. Total expression level up-regulated 752 times (60%) and down-regulated 485 times (40%). IL-2 plays an important role in the occurrence, development and recurrence of recurrent oral ulcer on the mRNA and protein levels. Gene microarray can be used to analyze potential genes and pathways in recurrent oral ulcer. IL-2 may be involved in the pathogenesis of recurrent oral ulcer.
2010-01-01
Background Analysis of gene expression and gene mutation may add information to be different from ordinary pathological tissue diagnosis. Since samples obtained endoscopically are very small, it is desired that more sensitive technology is developed for gene analysis. We investigated whether gene expression and gene mutation analysis by newly developed ultra-sensitive three-dimensional (3D) microarray is possible using small amount samples from endoscopic ultrasound-guided fine-needle aspiration (EUS-FNA) specimens and pancreatic juices. Methods Small amount samples from 17 EUS-FNA specimens and 16 pancreatic juices were obtained. After nucleic acid extraction, the samples were amplified with labeling and analyzed by the 3D microarray. Results The analyzable rate with the microarray was 46% (6/13) in EUS-FNA specimens of RNAlater® storage, and RNA degradations were observed in all the samples of frozen storage. In pancreatic juices, the analyzable rate was 67% (4/6) in frozen storage samples and 20% (2/10) in RNAlater® storage. EUS-FNA specimens were classified into cancer and non-cancer by gene expression analysis and K-ras codon 12 mutations were also detected using the 3D microarray. Conclusions Gene analysis from small amount samples obtained endoscopically was possible by newly developed 3D microarray technology. High quality RNA from EUS-FNA samples were obtained and remained in good condition only using RNA stabilizer. In contrast, high quality RNA from pancreatic juice samples were obtained only in frozen storage without RNA stabilizer. PMID:20416107
MAAMD: a workflow to standardize meta-analyses and comparison of affymetrix microarray data
2014-01-01
Background Mandatory deposit of raw microarray data files for public access, prior to study publication, provides significant opportunities to conduct new bioinformatics analyses within and across multiple datasets. Analysis of raw microarray data files (e.g. Affymetrix CEL files) can be time consuming, complex, and requires fundamental computational and bioinformatics skills. The development of analytical workflows to automate these tasks simplifies the processing of, improves the efficiency of, and serves to standardize multiple and sequential analyses. Once installed, workflows facilitate the tedious steps required to run rapid intra- and inter-dataset comparisons. Results We developed a workflow to facilitate and standardize Meta-Analysis of Affymetrix Microarray Data analysis (MAAMD) in Kepler. Two freely available stand-alone software tools, R and AltAnalyze were embedded in MAAMD. The inputs of MAAMD are user-editable csv files, which contain sample information and parameters describing the locations of input files and required tools. MAAMD was tested by analyzing 4 different GEO datasets from mice and drosophila. MAAMD automates data downloading, data organization, data quality control assesment, differential gene expression analysis, clustering analysis, pathway visualization, gene-set enrichment analysis, and cross-species orthologous-gene comparisons. MAAMD was utilized to identify gene orthologues responding to hypoxia or hyperoxia in both mice and drosophila. The entire set of analyses for 4 datasets (34 total microarrays) finished in ~ one hour. Conclusions MAAMD saves time, minimizes the required computer skills, and offers a standardized procedure for users to analyze microarray datasets and make new intra- and inter-dataset comparisons. PMID:24621103
Gene stage-specific expression in the microenvironment of pediatric myelodysplastic syndromes.
Roela, Rosimeire A; Carraro, Dirce M; Brentani, Helena P; Kaiano, Jane H L; Simão, Daniel F; Guarnieiro, Roberto; Lopes, Luiz Fernando; Borojevic, Radovan; Brentani, M Mitzi
2007-05-01
Using cDNA microarray assays we have observed a clear difference in the gene expression pattern between bone marrow stromal cells obtained from healthy children (CT) and from pediatric patients with either myelodysplastic syndromes (MDS) or acute myeloid leukemia (AML) associated with MDS (MDS-AML). The global gene function profiling analysis indicated that in the pediatric MDS microenvironment the disease stages may be characterized mainly by underexpression of genes associated with biological processes such as transport. Furthermore, a subset of downregulated genes related to endocytosis and protein secretion was able to discriminate MDS from MDS-AML.
Directed module detection in a large-scale expression compendium.
Fu, Qiang; Lemmens, Karen; Sanchez-Rodriguez, Aminael; Thijs, Inge M; Meysman, Pieter; Sun, Hong; Fierro, Ana Carolina; Engelen, Kristof; Marchal, Kathleen
2012-01-01
Public online microarray databases contain tremendous amounts of expression data. Mining these data sources can provide a wealth of information on the underlying transcriptional networks. In this chapter, we illustrate how the web services COLOMBOS and DISTILLER can be used to identify condition-dependent coexpression modules by exploring compendia of public expression data. COLOMBOS is designed for user-specified query-driven analysis, whereas DISTILLER generates a global regulatory network overview. The user is guided through both web services by means of a case study in which condition-dependent coexpression modules comprising a gene of interest (i.e., "directed") are identified.
USDA-ARS?s Scientific Manuscript database
The amount of microarray gene expression data in public repositories has been increasing exponentially for the last couple of decades. High-throughput microarray data integration and analysis has become a critical step in exploring the large amount of expression data for biological discovery. Howeve...
NASA Astrophysics Data System (ADS)
Bogdanov, Valery L.; Boyce-Jacino, Michael
1999-05-01
Confined arrays of biochemical probes deposited on a solid support surface (analytical microarray or 'chip') provide an opportunity to analysis multiple reactions simultaneously. Microarrays are increasingly used in genetics, medicine and environment scanning as research and analytical instruments. A power of microarray technology comes from its parallelism which grows with array miniaturization, minimization of reagent volume per reaction site and reaction multiplexing. An optical detector of microarray signals should combine high sensitivity, spatial and spectral resolution. Additionally, low-cost and a high processing rate are needed to transfer microarray technology into biomedical practice. We designed an imager that provides confocal and complete spectrum detection of entire fluorescently-labeled microarray in parallel. Imager uses microlens array, non-slit spectral decomposer, and high- sensitive detector (cooled CCD). Two imaging channels provide a simultaneous detection of localization, integrated and spectral intensities for each reaction site in microarray. A dimensional matching between microarray and imager's optics eliminates all in moving parts in instrumentation, enabling highly informative, fast and low-cost microarray detection. We report theory of confocal hyperspectral imaging with microlenses array and experimental data for implementation of developed imager to detect fluorescently labeled microarray with a density approximately 103 sites per cm2.
WholePathwayScope: a comprehensive pathway-based analysis tool for high-throughput data
Yi, Ming; Horton, Jay D; Cohen, Jonathan C; Hobbs, Helen H; Stephens, Robert M
2006-01-01
Background Analysis of High Throughput (HTP) Data such as microarray and proteomics data has provided a powerful methodology to study patterns of gene regulation at genome scale. A major unresolved problem in the post-genomic era is to assemble the large amounts of data generated into a meaningful biological context. We have developed a comprehensive software tool, WholePathwayScope (WPS), for deriving biological insights from analysis of HTP data. Result WPS extracts gene lists with shared biological themes through color cue templates. WPS statistically evaluates global functional category enrichment of gene lists and pathway-level pattern enrichment of data. WPS incorporates well-known biological pathways from KEGG (Kyoto Encyclopedia of Genes and Genomes) and Biocarta, GO (Gene Ontology) terms as well as user-defined pathways or relevant gene clusters or groups, and explores gene-term relationships within the derived gene-term association networks (GTANs). WPS simultaneously compares multiple datasets within biological contexts either as pathways or as association networks. WPS also integrates Genetic Association Database and Partial MedGene Database for disease-association information. We have used this program to analyze and compare microarray and proteomics datasets derived from a variety of biological systems. Application examples demonstrated the capacity of WPS to significantly facilitate the analysis of HTP data for integrative discovery. Conclusion This tool represents a pathway-based platform for discovery integration to maximize analysis power. The tool is freely available at . PMID:16423281
NASA Astrophysics Data System (ADS)
Brazhnik, Kristina; Sokolova, Zinaida; Baryshnikova, Maria; Bilan, Regina; Nabiev, Igor; Sukhanova, Alyona
Multiplexed analysis of cancer markers is crucial for early tumor diagnosis and screening. We have designed lab-on-a-bead microarray for quantitative detection of three breast cancer markers in human serum. Quantum dots were used as bead-bound fluorescent tags for identifying each marker by means of flow cytometry. Antigen-specific beads reliably detected CA 15-3, CEA, and CA 125 in serum samples, providing clear discrimination between the samples with respect to the antigen levels. The novel microarray is advantageous over the routine single-analyte ones due to the simultaneous detection of various markers. Therefore the developed microarray is a promising tool for serum tumor marker profiling.
Mansourian, Robert; Mutch, David M; Antille, Nicolas; Aubert, Jerome; Fogel, Paul; Le Goff, Jean-Marc; Moulin, Julie; Petrov, Anton; Rytz, Andreas; Voegel, Johannes J; Roberts, Matthew-Alan
2004-11-01
Microarray technology has become a powerful research tool in many fields of study; however, the cost of microarrays often results in the use of a low number of replicates (k). Under circumstances where k is low, it becomes difficult to perform standard statistical tests to extract the most biologically significant experimental results. Other more advanced statistical tests have been developed; however, their use and interpretation often remain difficult to implement in routine biological research. The present work outlines a method that achieves sufficient statistical power for selecting differentially expressed genes under conditions of low k, while remaining as an intuitive and computationally efficient procedure. The present study describes a Global Error Assessment (GEA) methodology to select differentially expressed genes in microarray datasets, and was developed using an in vitro experiment that compared control and interferon-gamma treated skin cells. In this experiment, up to nine replicates were used to confidently estimate error, thereby enabling methods of different statistical power to be compared. Gene expression results of a similar absolute expression are binned, so as to enable a highly accurate local estimate of the mean squared error within conditions. The model then relates variability of gene expression in each bin to absolute expression levels and uses this in a test derived from the classical ANOVA. The GEA selection method is compared with both the classical and permutational ANOVA tests, and demonstrates an increased stability, robustness and confidence in gene selection. A subset of the selected genes were validated by real-time reverse transcription-polymerase chain reaction (RT-PCR). All these results suggest that GEA methodology is (i) suitable for selection of differentially expressed genes in microarray data, (ii) intuitive and computationally efficient and (iii) especially advantageous under conditions of low k. The GEA code for R software is freely available upon request to authors.
SAMMD: Staphylococcus aureus microarray meta-database.
Nagarajan, Vijayaraj; Elasri, Mohamed O
2007-10-02
Staphylococcus aureus is an important human pathogen, causing a wide variety of diseases ranging from superficial skin infections to severe life threatening infections. S. aureus is one of the leading causes of nosocomial infections. Its ability to resist multiple antibiotics poses a growing public health problem. In order to understand the mechanism of pathogenesis of S. aureus, several global expression profiles have been developed. These transcriptional profiles included regulatory mutants of S. aureus and growth of wild type under different growth conditions. The abundance of these profiles has generated a large amount of data without a uniform annotation system to comprehensively examine them. We report the development of the Staphylococcus aureus Microarray meta-database (SAMMD) which includes data from all the published transcriptional profiles. SAMMD is a web-accessible database that helps users to perform a variety of analysis against and within the existing transcriptional profiles. SAMMD is a relational database that uses MySQL as the back end and PHP/JavaScript/DHTML as the front end. The database is normalized and consists of five tables, which holds information about gene annotations, regulated gene lists, experimental details, references, and other details. SAMMD data is collected from the peer-reviewed published articles. Data extraction and conversion was done using perl scripts while data entry was done through phpMyAdmin tool. The database is accessible via a web interface that contains several features such as a simple search by ORF ID, gene name, gene product name, advanced search using gene lists, comparing among datasets, browsing, downloading, statistics, and help. The database is licensed under General Public License (GPL). SAMMD is hosted and available at http://www.bioinformatics.org/sammd/. Currently there are over 9500 entries for regulated genes, from 67 microarray experiments. SAMMD will help staphylococcal scientists to analyze their expression data and understand it at global level. It will also allow scientists to compare and contrast their transcriptome to that of the other published transcriptomes.
SAMMD: Staphylococcus aureus Microarray Meta-Database
Nagarajan, Vijayaraj; Elasri, Mohamed O
2007-01-01
Background Staphylococcus aureus is an important human pathogen, causing a wide variety of diseases ranging from superficial skin infections to severe life threatening infections. S. aureus is one of the leading causes of nosocomial infections. Its ability to resist multiple antibiotics poses a growing public health problem. In order to understand the mechanism of pathogenesis of S. aureus, several global expression profiles have been developed. These transcriptional profiles included regulatory mutants of S. aureus and growth of wild type under different growth conditions. The abundance of these profiles has generated a large amount of data without a uniform annotation system to comprehensively examine them. We report the development of the Staphylococcus aureus Microarray meta-database (SAMMD) which includes data from all the published transcriptional profiles. SAMMD is a web-accessible database that helps users to perform a variety of analysis against and within the existing transcriptional profiles. Description SAMMD is a relational database that uses MySQL as the back end and PHP/JavaScript/DHTML as the front end. The database is normalized and consists of five tables, which holds information about gene annotations, regulated gene lists, experimental details, references, and other details. SAMMD data is collected from the peer-reviewed published articles. Data extraction and conversion was done using perl scripts while data entry was done through phpMyAdmin tool. The database is accessible via a web interface that contains several features such as a simple search by ORF ID, gene name, gene product name, advanced search using gene lists, comparing among datasets, browsing, downloading, statistics, and help. The database is licensed under General Public License (GPL). Conclusion SAMMD is hosted and available at . Currently there are over 9500 entries for regulated genes, from 67 microarray experiments. SAMMD will help staphylococcal scientists to analyze their expression data and understand it at global level. It will also allow scientists to compare and contrast their transcriptome to that of the other published transcriptomes. PMID:17910768
Implementation of GenePattern within the Stanford Microarray Database.
Hubble, Jeremy; Demeter, Janos; Jin, Heng; Mao, Maria; Nitzberg, Michael; Reddy, T B K; Wymore, Farrell; Zachariah, Zachariah K; Sherlock, Gavin; Ball, Catherine A
2009-01-01
Hundreds of researchers across the world use the Stanford Microarray Database (SMD; http://smd.stanford.edu/) to store, annotate, view, analyze and share microarray data. In addition to providing registered users at Stanford access to their own data, SMD also provides access to public data, and tools with which to analyze those data, to any public user anywhere in the world. Previously, the addition of new microarray data analysis tools to SMD has been limited by available engineering resources, and in addition, the existing suite of tools did not provide a simple way to design, execute and share analysis pipelines, or to document such pipelines for the purposes of publication. To address this, we have incorporated the GenePattern software package directly into SMD, providing access to many new analysis tools, as well as a plug-in architecture that allows users to directly integrate and share additional tools through SMD. In this article, we describe our implementation of the GenePattern microarray analysis software package into the SMD code base. This extension is available with the SMD source code that is fully and freely available to others under an Open Source license, enabling other groups to create a local installation of SMD with an enriched data analysis capability.
Cooper, Nichola H; Balachandra, Jeya P; Hardman, Matthew J
2015-12-01
The skin's mechanical integrity is maintained by an organized and robust dermal extracellular matrix (ECM). Resistance to mechanical disruption hinges primarily on homeostasis of the dermal collagen fibril architecture, which is regulated, at least in part, by members of the small leucine-rich proteoglycan (SLRP) family. Here we present data linking protein kinase C alpha (PKCα) to the regulated expression of multiple ECM components including SLRPs. Global microarray profiling reveals deficiencies in ECM gene expression in PKCα-/- skin correlating with abnormal collagen fibril morphology, disorganized dermal architecture, and reduced skin strength. Detailed analysis of the skin and wounds from wild-type and PKCα-/- mice reveals a failure to upregulate collagen and other ECM components in response to injury, resulting in delayed granulation tissue deposition in PKCα-/- wounds. Thus, our data reveal a previously unappreciated role for PKCα in the regulation of ECM structure and deposition during skin wound healing.
Differential global gene expression in red and white skeletal muscle
NASA Technical Reports Server (NTRS)
Campbell, W. G.; Gordon, S. E.; Carlson, C. J.; Pattison, J. S.; Hamilton, M. T.; Booth, F. W.
2001-01-01
The differences in gene expression among the fiber types of skeletal muscle have long fascinated scientists, but for the most part, previous experiments have only reported differences of one or two genes at a time. The evolving technology of global mRNA expression analysis was employed to determine the potential differential expression of approximately 3,000 mRNAs between the white quad (white muscle) and the red soleus muscle (mixed red muscle) of female ICR mice (30-35 g). Microarray analysis identified 49 mRNA sequences that were differentially expressed between white and mixed red skeletal muscle, including newly identified differential expressions between muscle types. For example, the current findings increase the number of known, differentially expressed mRNAs for transcription factors/coregulators by nine and signaling proteins by three. The expanding knowledge of the diversity of mRNA expression between white and mixed red muscle suggests that there could be quite a complex regulation of phenotype between muscles of different fiber types.
Clarke, Luka A; Botelho, Hugo M; Sousa, Lisete; Falcao, Andre O; Amaral, Margarida D
2015-11-01
A meta-analysis of 13 independent microarray data sets was performed and gene expression profiles from cystic fibrosis (CF), similar disorders (COPD: chronic obstructive pulmonary disease, IPF: idiopathic pulmonary fibrosis, asthma), environmental conditions (smoking, epithelial injury), related cellular processes (epithelial differentiation/regeneration), and non-respiratory "control" conditions (schizophrenia, dieting), were compared. Similarity among differentially expressed (DE) gene lists was assessed using a permutation test, and a clustergram was constructed, identifying common gene markers. Global gene expression values were standardized using a novel approach, revealing that similarities between independent data sets run deeper than shared DE genes. Correlation of gene expression values identified putative gene regulators of the CF transmembrane conductance regulator (CFTR) gene, of potential therapeutic significance. Our study provides a novel perspective on CF epithelial gene expression in the context of other lung disorders and conditions, and highlights the contribution of differentiation/EMT and injury to gene signatures of respiratory disease. Copyright © 2015 Elsevier Inc. All rights reserved.
Genome-scale cluster analysis of replicated microarrays using shrinkage correlation coefficient.
Yao, Jianchao; Chang, Chunqi; Salmi, Mari L; Hung, Yeung Sam; Loraine, Ann; Roux, Stanley J
2008-06-18
Currently, clustering with some form of correlation coefficient as the gene similarity metric has become a popular method for profiling genomic data. The Pearson correlation coefficient and the standard deviation (SD)-weighted correlation coefficient are the two most widely-used correlations as the similarity metrics in clustering microarray data. However, these two correlations are not optimal for analyzing replicated microarray data generated by most laboratories. An effective correlation coefficient is needed to provide statistically sufficient analysis of replicated microarray data. In this study, we describe a novel correlation coefficient, shrinkage correlation coefficient (SCC), that fully exploits the similarity between the replicated microarray experimental samples. The methodology considers both the number of replicates and the variance within each experimental group in clustering expression data, and provides a robust statistical estimation of the error of replicated microarray data. The value of SCC is revealed by its comparison with two other correlation coefficients that are currently the most widely-used (Pearson correlation coefficient and SD-weighted correlation coefficient) using statistical measures on both synthetic expression data as well as real gene expression data from Saccharomyces cerevisiae. Two leading clustering methods, hierarchical and k-means clustering were applied for the comparison. The comparison indicated that using SCC achieves better clustering performance. Applying SCC-based hierarchical clustering to the replicated microarray data obtained from germinating spores of the fern Ceratopteris richardii, we discovered two clusters of genes with shared expression patterns during spore germination. Functional analysis suggested that some of the genetic mechanisms that control germination in such diverse plant lineages as mosses and angiosperms are also conserved among ferns. This study shows that SCC is an alternative to the Pearson correlation coefficient and the SD-weighted correlation coefficient, and is particularly useful for clustering replicated microarray data. This computational approach should be generally useful for proteomic data or other high-throughput analysis methodology.
Fabrication of Carbohydrate Microarrays by Boronate Formation.
Adak, Avijit K; Lin, Ting-Wei; Li, Ben-Yuan; Lin, Chun-Cheng
2017-01-01
The interactions between soluble carbohydrates and/or surface displayed glycans and protein receptors are essential to many biological processes and cellular recognition events. Carbohydrate microarrays provide opportunities for high-throughput quantitative analysis of carbohydrate-protein interactions. Over the past decade, various techniques have been implemented for immobilizing glycans on solid surfaces in a microarray format. Herein, we describe a detailed protocol for fabricating carbohydrate microarrays that capitalizes on the intrinsic reactivity of boronic acid toward carbohydrates to form stable boronate diesters. A large variety of unprotected carbohydrates ranging in structure from simple disaccharides and trisaccharides to considerably more complex human milk and blood group (oligo)saccharides have been covalently immobilized in a single step on glass slides, which were derivatized with high-affinity boronic acid ligands. The immobilized ligands in these microarrays maintain the receptor-binding activities including those of lectins and antibodies according to the structures of their pendant carbohydrates for rapid analysis of a number of carbohydrate-recognition events within 30 h. This method facilitates the direct construction of otherwise difficult to obtain carbohydrate microarrays from underivatized glycans.
The Glycan Microarray Story from Construction to Applications.
Hyun, Ji Young; Pai, Jaeyoung; Shin, Injae
2017-04-18
Not only are glycan-mediated binding processes in cells and organisms essential for a wide range of physiological processes, but they are also implicated in various pathological processes. As a result, elucidation of glycan-associated biomolecular interactions and their consequences is of great importance in basic biological research and biomedical applications. In 2002, we and others were the first to utilize glycan microarrays in efforts aimed at the rapid analysis of glycan-associated recognition events. Because they contain a number of glycans immobilized in a dense and orderly manner on a solid surface, glycan microarrays enable multiple parallel analyses of glycan-protein binding events while utilizing only small amounts of glycan samples. Therefore, this microarray technology has become a leading edge tool in studies aimed at elucidating roles played by glycans and glycan binding proteins in biological systems. In this Account, we summarize our efforts on the construction of glycan microarrays and their applications in studies of glycan-associated interactions. Immobilization strategies of functionalized and unmodified glycans on derivatized glass surfaces are described. Although others have developed immobilization techniques, our efforts have focused on improving the efficiencies and operational simplicity of microarray construction. The microarray-based technology has been most extensively used for rapid analysis of the glycan binding properties of proteins. In addition, glycan microarrays have been employed to determine glycan-protein interactions quantitatively, detect pathogens, and rapidly assess substrate specificities of carbohydrate-processing enzymes. More recently, the microarrays have been employed to identify functional glycans that elicit cell surface lectin-mediated cellular responses. Owing to these efforts, it is now possible to use glycan microarrays to expand the understanding of roles played by glycans and glycan binding proteins in biological systems.
Holliday, Jason A; Ralph, Steven G; White, Richard; Bohlmann, Jörg; Aitken, Sally N
2008-01-01
Cold acclimation in conifers is a complex process, the timing and extent of which reflects local adaptation and varies widely along latitudinal gradients for many temperate and boreal tree species. Despite their ecological and economic importance, little is known about the global changes in gene expression that accompany autumn cold acclimation in conifers. Using three populations of Sitka spruce (Picea sitchensis) spanning the species range, and a Picea cDNA microarray with 21,840 unique elements, within- and among-population gene expression was monitored during the autumn. Microarray data were validated for selected genes using real-time PCR. Similar numbers of genes were significantly twofold upregulated (1257) and downregulated (967) between late summer and early winter. Among those upregulated were dehydrins, pathogenesis-related/antifreeze genes, carbohydrate and lipid metabolism genes, and genes involved in signal transduction and transcriptional regulation. Among-population microarray hybridizations at early and late autumn time points revealed substantial variation in the autumn transcriptome, some of which may reflect local adaptation. These results demonstrate the complexity of cold acclimation in conifers, highlight similarities and differences to cold tolerance in annual plants, and provide a solid foundation for functional and genetic studies of this important adaptive process.
EDGE3: A web-based solution for management and analysis of Agilent two color microarray experiments
Vollrath, Aaron L; Smith, Adam A; Craven, Mark; Bradfield, Christopher A
2009-01-01
Background The ability to generate transcriptional data on the scale of entire genomes has been a boon both in the improvement of biological understanding and in the amount of data generated. The latter, the amount of data generated, has implications when it comes to effective storage, analysis and sharing of these data. A number of software tools have been developed to store, analyze, and share microarray data. However, a majority of these tools do not offer all of these features nor do they specifically target the commonly used two color Agilent DNA microarray platform. Thus, the motivating factor for the development of EDGE3 was to incorporate the storage, analysis and sharing of microarray data in a manner that would provide a means for research groups to collaborate on Agilent-based microarray experiments without a large investment in software-related expenditures or extensive training of end-users. Results EDGE3 has been developed with two major functions in mind. The first function is to provide a workflow process for the generation of microarray data by a research laboratory or a microarray facility. The second is to store, analyze, and share microarray data in a manner that doesn't require complicated software. To satisfy the first function, EDGE3 has been developed as a means to establish a well defined experimental workflow and information system for microarray generation. To satisfy the second function, the software application utilized as the user interface of EDGE3 is a web browser. Within the web browser, a user is able to access the entire functionality, including, but not limited to, the ability to perform a number of bioinformatics based analyses, collaborate between research groups through a user-based security model, and access to the raw data files and quality control files generated by the software used to extract the signals from an array image. Conclusion Here, we present EDGE3, an open-source, web-based application that allows for the storage, analysis, and controlled sharing of transcription-based microarray data generated on the Agilent DNA platform. In addition, EDGE3 provides a means for managing RNA samples and arrays during the hybridization process. EDGE3 is freely available for download at . PMID:19732451
Vollrath, Aaron L; Smith, Adam A; Craven, Mark; Bradfield, Christopher A
2009-09-04
The ability to generate transcriptional data on the scale of entire genomes has been a boon both in the improvement of biological understanding and in the amount of data generated. The latter, the amount of data generated, has implications when it comes to effective storage, analysis and sharing of these data. A number of software tools have been developed to store, analyze, and share microarray data. However, a majority of these tools do not offer all of these features nor do they specifically target the commonly used two color Agilent DNA microarray platform. Thus, the motivating factor for the development of EDGE(3) was to incorporate the storage, analysis and sharing of microarray data in a manner that would provide a means for research groups to collaborate on Agilent-based microarray experiments without a large investment in software-related expenditures or extensive training of end-users. EDGE(3) has been developed with two major functions in mind. The first function is to provide a workflow process for the generation of microarray data by a research laboratory or a microarray facility. The second is to store, analyze, and share microarray data in a manner that doesn't require complicated software. To satisfy the first function, EDGE3 has been developed as a means to establish a well defined experimental workflow and information system for microarray generation. To satisfy the second function, the software application utilized as the user interface of EDGE(3) is a web browser. Within the web browser, a user is able to access the entire functionality, including, but not limited to, the ability to perform a number of bioinformatics based analyses, collaborate between research groups through a user-based security model, and access to the raw data files and quality control files generated by the software used to extract the signals from an array image. Here, we present EDGE(3), an open-source, web-based application that allows for the storage, analysis, and controlled sharing of transcription-based microarray data generated on the Agilent DNA platform. In addition, EDGE(3) provides a means for managing RNA samples and arrays during the hybridization process. EDGE(3) is freely available for download at http://edge.oncology.wisc.edu/.
Villeneuve, L; Wang, Rong-Lin; Bencic, David C; Biales, Adam D; Martinović, Dalma; Lazorchak, James M; Toth, Gregory; Ankley, Gerald T
2009-08-01
As part of a research effort examining system-wide responses of the hypothalamic-pituitary-gonadal (HPG) axis in fish to endocrine-active chemicals (EACs) with different modes of action, zebrafish (Danio rerio) were exposed to 25 or 100 microg/L of the aromatase inhibitor fadrozole for 24, 48, or 96 h. Global transcriptional response in brain and ovarian tissue of fish exposed to 25 microg/L of fadrozole was compared to that in control fish using a commercially available, 22,000-gene oligonucleotide microarray. Transcripts altered in brain were functionally linked to differentiation, development, DNA replication, and cell cycle. Additionally, multiple genes associated with the one-carbon pool by folate pathway (KEGG 00670) were significantly up-regulated. Transcripts altered in ovary were functionally linked to cell-cell adhesion, extracellular matrix, vasculogenesis, and development. Promoter motif analysis identified GATA-binding factor 2, Ikaros 2, alcohol dehydrogenase gene regulator 1, myoblast-determining factor, and several heat shock factors as being associated with coexpressed gene clusters that were differentially expressed following exposure to fadrozole. Based on the transcriptional changes observed, it was hypothesized that fadrozole elicits neurodegenerative stress in brain tissue and that fish cope with this stress through proliferation of radial glial cells. Additionally, it was hypothesized that changes of gene expression in the ovary of fadrozole-exposed zebrafish reflect disruption of oocyte maturation and ovulation because of impaired vitellogenesis. These hypotheses and others derived from the microarray results provide a foundation for future studies aimed at understanding responses of the HPG axis to EACs and other chemical stressors.
Lee, Min-Young; Yu, Ji Hea; Kim, Ji Yeon; Seo, Jung Hwa; Park, Eun Sook; Kim, Chul Hoon; Kim, Hyongbum; Cho, Sung-Rae
2013-01-01
Housing animals in an enriched environment (EE) enhances behavioral function. However, the mechanism underlying this EE-mediated functional improvement and the resultant changes in gene expression have yet to be elucidated. We attempted to investigate the underlying mechanisms associated with long-term exposure to an EE by evaluating gene expression patterns. We housed 6-week-old CD-1 (ICR) mice in standard cages or an EE comprising a running wheel, novel objects, and social interaction for 2 months. Motor and cognitive performances were evaluated using the rotarod test and passive avoidance test, and gene expression profile was investigated in the cerebral hemispheres using microarray and gene set enrichment analysis (GSEA). In behavioral assessment, an EE significantly enhanced rotarod performance and short-term working memory. Microarray analysis revealed that genes associated with neuronal activity were significantly altered by an EE. GSEA showed that genes involved in synaptic transmission and postsynaptic signal transduction were globally upregulated, whereas those associated with reuptake by presynaptic neurotransmitter transporters were downregulated. In particular, both microarray and GSEA demonstrated that EE exposure increased opioid signaling, acetylcholine release cycle, and postsynaptic neurotransmitter receptors but decreased Na+ / Cl- -dependent neurotransmitter transporters, including dopamine transporter Slc6a3 in the brain. Western blotting confirmed that SLC6A3, DARPP32 (PPP1R1B), and P2RY12 were largely altered in a region-specific manner. An EE enhanced motor and cognitive function through the alteration of synaptic activity-regulating genes, improving the efficient use of neurotransmitters and synaptic plasticity by the upregulation of genes associated with postsynaptic receptor activity and downregulation of presynaptic reuptake by neurotransmitter transporters.
Gupta, Surya; De Puysseleyr, Veronic; Van der Heyden, José; Maddelein, Davy; Lemmens, Irma; Lievens, Sam; Degroeve, Sven; Tavernier, Jan; Martens, Lennart
2017-05-01
Protein-protein interaction (PPI) studies have dramatically expanded our knowledge about cellular behaviour and development in different conditions. A multitude of high-throughput PPI techniques have been developed to achieve proteome-scale coverage for PPI studies, including the microarray based Mammalian Protein-Protein Interaction Trap (MAPPIT) system. Because such high-throughput techniques typically report thousands of interactions, managing and analysing the large amounts of acquired data is a challenge. We have therefore built the MAPPIT cell microArray Protein Protein Interaction-Data management & Analysis Tool (MAPPI-DAT) as an automated data management and analysis tool for MAPPIT cell microarray experiments. MAPPI-DAT stores the experimental data and metadata in a systematic and structured way, automates data analysis and interpretation, and enables the meta-analysis of MAPPIT cell microarray data across all stored experiments. MAPPI-DAT is developed in Python, using R for data analysis and MySQL as data management system. MAPPI-DAT is cross-platform and can be ran on Microsoft Windows, Linux and OS X/macOS. The source code and a Microsoft Windows executable are freely available under the permissive Apache2 open source license at https://github.com/compomics/MAPPI-DAT. jan.tavernier@vib-ugent.be or lennart.martens@vib-ugent.be. Supplementary data are available at Bioinformatics online. © The Author 2017. Published by Oxford University Press.
What is the study? This study is the first to use microarray analysis in the Ames strains of Salmonella. The microarray chips were custom-designed for this study and are not commercially available, and we evaluated the well-studied drinking water mutagen, MX. Because much inform...
MICROARRAY ANALYSIS OF DICHLOROACETIC ACID-INDUCED CHANGES IN GENE EXPRESSION
MICROARRAY ANALYSIS OF DICHLOROACETIC ACID-INDUCED CHANGES IN GENE EXPRESSION
Dichloroacetic acid (DCA) is a major by-product of water disinfection by chlorination. Several studies have demonstrated the hepatocarcinogenicity of DCA in rodents when administered in dri...
Wilson, J. W.; Ott, C. M.; zu Bentrup, K. Höner; Ramamurthy, R.; Quick, L.; Porwollik, S.; Cheng, P.; McClelland, M.; Tsaprailis, G.; Radabaugh, T.; Hunt, A.; Fernandez, D.; Richter, E.; Shah, M.; Kilcoyne, M.; Joshi, L.; Nelman-Gonzalez, M.; Hing, S.; Parra, M.; Dumars, P.; Norwood, K.; Bober, R.; Devich, J.; Ruggles, A.; Goulart, C.; Rupert, M.; Stodieck, L.; Stafford, P.; Catella, L.; Schurr, M. J.; Buchanan, K.; Morici, L.; McCracken, J.; Allen, P.; Baker-Coleman, C.; Hammond, T.; Vogel, J.; Nelson, R.; Pierson, D. L.; Stefanyshyn-Piper, H. M.; Nickerson, C. A.
2007-01-01
A comprehensive analysis of both the molecular genetic and phenotypic responses of any organism to the space flight environment has never been accomplished because of significant technological and logistical hurdles. Moreover, the effects of space flight on microbial pathogenicity and associated infectious disease risks have not been studied. The bacterial pathogen Salmonella typhimurium was grown aboard Space Shuttle mission STS-115 and compared with identical ground control cultures. Global microarray and proteomic analyses revealed that 167 transcripts and 73 proteins changed expression with the conserved RNA-binding protein Hfq identified as a likely global regulator involved in the response to this environment. Hfq involvement was confirmed with a ground-based microgravity culture model. Space flight samples exhibited enhanced virulence in a murine infection model and extracellular matrix accumulation consistent with a biofilm. Strategies to target Hfq and related regulators could potentially decrease infectious disease risks during space flight missions and provide novel therapeutic options on Earth. PMID:17901201
Robust gene selection methods using weighting schemes for microarray data analysis.
Kang, Suyeon; Song, Jongwoo
2017-09-02
A common task in microarray data analysis is to identify informative genes that are differentially expressed between two different states. Owing to the high-dimensional nature of microarray data, identification of significant genes has been essential in analyzing the data. However, the performances of many gene selection techniques are highly dependent on the experimental conditions, such as the presence of measurement error or a limited number of sample replicates. We have proposed new filter-based gene selection techniques, by applying a simple modification to significance analysis of microarrays (SAM). To prove the effectiveness of the proposed method, we considered a series of synthetic datasets with different noise levels and sample sizes along with two real datasets. The following findings were made. First, our proposed methods outperform conventional methods for all simulation set-ups. In particular, our methods are much better when the given data are noisy and sample size is small. They showed relatively robust performance regardless of noise level and sample size, whereas the performance of SAM became significantly worse as the noise level became high or sample size decreased. When sufficient sample replicates were available, SAM and our methods showed similar performance. Finally, our proposed methods are competitive with traditional methods in classification tasks for microarrays. The results of simulation study and real data analysis have demonstrated that our proposed methods are effective for detecting significant genes and classification tasks, especially when the given data are noisy or have few sample replicates. By employing weighting schemes, we can obtain robust and reliable results for microarray data analysis.
Sharma, Pankaj; Gupta, Neerja; Chowdhury, Madhumita Roy; Sapra, Savita; Ghosh, Manju; Gulati, Sheffali; Kabra, Madhulika
2016-09-15
Intellectual disability (ID)/Global developmental delay (GDD) is a diverse group of disorders in terms of cognitive and non-cognitive functions and can occur with or without associated co-morbidities. It affects 1-3% of individuals globally and in at least 30-50% of cases the etiology remains unexplained. The widespread use of chromosomal microarray analysis (CMA) in a clinical setting has allowed the identification of submicroscopic copy number variations (CNVs), throughout the genome, associated with neurodevelopmental phenotypes including ID/GDD. In this study we investigated the utility of CMA in the detection of CNVs in 106 patients with unexplained ID/DD, dysmorphism with or without multiple congenital anomalies (MCA). CMA study was carried out using Agilent 8×60K chips and Illumina Human CytoSNP-12 chips. Pathogenic CNVs were found in 15 (14.2%) patients. In these patients, CNVs on single chromosome were detected in 10 patients while 5 patients showed co-occurrence CNVs on two chromosomes. The size of these CNVs ranged between 322kb to 13Mb. The yield of pathogenic CNVs was similar for both mild and severe ID/GDD cases. One patient described in this paper is considered to harbour a likely pathogenic CNV with deletion in 17q22 region. Only few cases have been described in literature for 17q22 deletion and patient reported here was found to have an atypical deletion in 17q22 region (Case 90). This study re-affirms the view point that CMA is a powerful diagnostic tool in the evaluation of idiopathic ID/GDD patients irrespective of the degree of severity. Identifying pathogenic CNVs helps in counseling and prenatal diagnosis if desired. Copyright © 2016 Elsevier B.V. All rights reserved.
Martyniuk, Christopher J.; Spade, Daniel J.; Blum, Jason L.; Kroll, Kevin J.; Denslow, Nancy D.
2011-01-01
Methoxychlor (MXC) is an organochlorine pesticide that has been shown to have estrogenic activity by activating estrogen receptors and inducing vitellogenin production in male fish. Previous studies report that exposure to MXC induces changes in mRNA abundance of reproductive genes in the liver and testes of largemouth bass (Micropterus salmoides). The objective of the present study was to better characterize the mode of action of MXC by measuring the global transcriptomic response in the male largemouth liver using an oligonucleotide microarray. Microarray analysis identified highly significant changes in the expression of 37 transcripts (p<0.001) (20 induced and 17 decreased) in the liver after MXC injection and a total of 900 expression changes (p<0.05) in transcripts with high homology to known genes. Largemouth bass estrogen receptor alpha (esr1) and androgen receptor (ar) were among the transcripts that were increased in the liver after MXC treatment. Functional enrichment analysis identified the molecular functions of steroid binding and androgen receptor activity as well as steroid hormone receptor activity as being significantly over-represented gene ontology terms. Pathway analysis identified c-fos signaling as being putatively affected through both estrogen and androgen signaling. This study provides evidence that MXC elicits transcriptional effects through the estrogen receptor as well as androgen receptor-mediated pathways in the liver. PMID:21276474
Kumari, Bharti; Jain, Pratistha; Das, Shaoli; Ghosal, Suman; Hazra, Bibhabasu; Trivedi, Ashish Chandra; Basu, Anirban; Chakrabarti, Jayprokas; Vrati, Sudhanshu; Banerjee, Arup
2016-01-01
Microglia cells in the brain play essential role during Japanese Encephalitis Virus (JEV) infection and may lead to change in microRNA (miRNA) and mRNA profile. These changes may together control disease outcome. Using Affymetrix microarray platform, we profiled cellular miRNA and mRNA expression at multiple time points during viral infection in human microglial (CHME3) cells. In silico analysis of microarray data revealed a phased pattern of miRNAs expression, associated with JEV replication and provided unique signatures of infection. Target prediction and pathway enrichment analysis identified anti correlation between differentially expressed miRNA and the gene expression at multiple time point which ultimately affected diverse signaling pathways including Notch signaling pathways in microglia. Activation of Notch pathway during JEV infection was demonstrated in vitro and in vivo. The expression of a subset of miRNAs that target multiple genes in Notch signaling pathways were suppressed and their overexpression could affect JEV induced immune response. Further analysis provided evidence for the possible presence of cellular competing endogenous RNA (ceRNA) associated with innate immune response. Collectively, our data provide a uniquely comprehensive view of the changes in the host miRNAs induced by JEV during cellular infection and identify Notch pathway in modulating microglia mediated inflammation. PMID:26838068
Kumari, Bharti; Jain, Pratistha; Das, Shaoli; Ghosal, Suman; Hazra, Bibhabasu; Trivedi, Ashish Chandra; Basu, Anirban; Chakrabarti, Jayprokas; Vrati, Sudhanshu; Banerjee, Arup
2016-02-03
Microglia cells in the brain play essential role during Japanese Encephalitis Virus (JEV) infection and may lead to change in microRNA (miRNA) and mRNA profile. These changes may together control disease outcome. Using Affymetrix microarray platform, we profiled cellular miRNA and mRNA expression at multiple time points during viral infection in human microglial (CHME3) cells. In silico analysis of microarray data revealed a phased pattern of miRNAs expression, associated with JEV replication and provided unique signatures of infection. Target prediction and pathway enrichment analysis identified anti correlation between differentially expressed miRNA and the gene expression at multiple time point which ultimately affected diverse signaling pathways including Notch signaling pathways in microglia. Activation of Notch pathway during JEV infection was demonstrated in vitro and in vivo. The expression of a subset of miRNAs that target multiple genes in Notch signaling pathways were suppressed and their overexpression could affect JEV induced immune response. Further analysis provided evidence for the possible presence of cellular competing endogenous RNA (ceRNA) associated with innate immune response. Collectively, our data provide a uniquely comprehensive view of the changes in the host miRNAs induced by JEV during cellular infection and identify Notch pathway in modulating microglia mediated inflammation.
Coda, Alvin B; Icen, Murat; Smith, Jason R; Sinha, Animesh A
2012-07-01
There are major gaps in our knowledge regarding the exact mechanisms and genetic basis of psoriasis. To investigate the pathogenesis of psoriasis, gene expression in 10 skin (5 lesional, 5 nonlesional) and 11 blood (6 psoriatic, 5 nonpsoriatic) samples were examined using Affymetrix HG-U95A microarrays. We detected 535 (425 upregulated, 110 downregulated) DEGs in lesional skin at 1% false discovery rate (FDR). Combining nine microarray studies comparing lesional and nonlesional psoriatic skin, 34.5% of dysregulated genes were overlapped in multiple studies. We further identified 20 skin and 2 blood associated transcriptional "hot spots" at specified genomic locations. At 5% FDR, 11.8% skin and 10.4% blood DEGs in our study mapped to one of the 12 PSORS loci. DEGs that overlap with PSORS loci may offer prioritized targets for downstream genetic fine mapping studies. Novel DEG "hot spots" may provide new targets for defining susceptibility loci in future studies. Copyright © 2012 Elsevier Inc. All rights reserved.
Quantitative proteomic analysis in breast cancer.
Tabchy, A; Hennessy, B T; Gonzalez-Angulo, A M; Bernstam, F M; Lu, Y; Mills, G B
2011-02-01
Much progress has recently been made in the genomic and transcriptional characterization of tumors. However, historically the characterization of cells at the protein level has suffered limitations in reproducibility, scalability and robustness. Recent technological advances have made it possible to accurately and reproducibly portray the global levels and active states of cellular proteins. Protein microarrays examine the native post-translational conformations of proteins including activated phosphorylated states, in a comprehensive high-throughput mode, and can map activated pathways and networks of proteins inside the cells. The reverse-phase protein microarray (RPPA) offers a unique opportunity to study signal transduction networks in small biological samples such as human biopsy material and can provide critical information for therapeutic decision-making and the monitoring of patients for targeted molecular medicine. By providing the key missing link to the story generated from genomic and gene expression characterization efforts, functional proteomics offer the promise of a comprehensive understanding of cancer. Several initial successes in breast cancer are showing that such information is clinically relevant. Copyright 2011 Prous Science, S.A.U. or its licensors. All rights reserved.
The application of DNA microarrays in gene expression analysis.
van Hal, N L; Vorst, O; van Houwelingen, A M; Kok, E J; Peijnenburg, A; Aharoni, A; van Tunen, A J; Keijer, J
2000-03-31
DNA microarray technology is a new and powerful technology that will substantially increase the speed of molecular biological research. This paper gives a survey of DNA microarray technology and its use in gene expression studies. The technical aspects and their potential improvements are discussed. These comprise array manufacturing and design, array hybridisation, scanning, and data handling. Furthermore, it is discussed how DNA microarrays can be applied in the working fields of: safety, functionality and health of food and gene discovery and pathway engineering in plants.
Implementation of mutual information and bayes theorem for classification microarray data
NASA Astrophysics Data System (ADS)
Dwifebri Purbolaksono, Mahendra; Widiastuti, Kurnia C.; Syahrul Mubarok, Mohamad; Adiwijaya; Aminy Ma’ruf, Firda
2018-03-01
Microarray Technology is one of technology which able to read the structure of gen. The analysis is important for this technology. It is for deciding which attribute is more important than the others. Microarray technology is able to get cancer information to diagnose a person’s gen. Preparation of microarray data is a huge problem and takes a long time. That is because microarray data contains high number of insignificant and irrelevant attributes. So, it needs a method to reduce the dimension of microarray data without eliminating important information in every attribute. This research uses Mutual Information to reduce dimension. System is built with Machine Learning approach specifically Bayes Theorem. This theorem uses a statistical and probability approach. By combining both methods, it will be powerful for Microarray Data Classification. The experiment results show that system is good to classify Microarray data with highest F1-score using Bayesian Network by 91.06%, and Naïve Bayes by 88.85%.
Gene ARMADA: an integrated multi-analysis platform for microarray data implemented in MATLAB.
Chatziioannou, Aristotelis; Moulos, Panagiotis; Kolisis, Fragiskos N
2009-10-27
The microarray data analysis realm is ever growing through the development of various tools, open source and commercial. However there is absence of predefined rational algorithmic analysis workflows or batch standardized processing to incorporate all steps, from raw data import up to the derivation of significantly differentially expressed gene lists. This absence obfuscates the analytical procedure and obstructs the massive comparative processing of genomic microarray datasets. Moreover, the solutions provided, heavily depend on the programming skills of the user, whereas in the case of GUI embedded solutions, they do not provide direct support of various raw image analysis formats or a versatile and simultaneously flexible combination of signal processing methods. We describe here Gene ARMADA (Automated Robust MicroArray Data Analysis), a MATLAB implemented platform with a Graphical User Interface. This suite integrates all steps of microarray data analysis including automated data import, noise correction and filtering, normalization, statistical selection of differentially expressed genes, clustering, classification and annotation. In its current version, Gene ARMADA fully supports 2 coloured cDNA and Affymetrix oligonucleotide arrays, plus custom arrays for which experimental details are given in tabular form (Excel spreadsheet, comma separated values, tab-delimited text formats). It also supports the analysis of already processed results through its versatile import editor. Besides being fully automated, Gene ARMADA incorporates numerous functionalities of the Statistics and Bioinformatics Toolboxes of MATLAB. In addition, it provides numerous visualization and exploration tools plus customizable export data formats for seamless integration by other analysis tools or MATLAB, for further processing. Gene ARMADA requires MATLAB 7.4 (R2007a) or higher and is also distributed as a stand-alone application with MATLAB Component Runtime. Gene ARMADA provides a highly adaptable, integrative, yet flexible tool which can be used for automated quality control, analysis, annotation and visualization of microarray data, constituting a starting point for further data interpretation and integration with numerous other tools.
Microarray data mining using Bioconductor packages.
Nie, Haisheng; Neerincx, Pieter B T; van der Poel, Jan; Ferrari, Francesco; Bicciato, Silvio; Leunissen, Jack A M; Groenen, Martien A M
2009-07-16
This paper describes the results of a Gene Ontology (GO) term enrichment analysis of chicken microarray data using the Bioconductor packages. By checking the enriched GO terms in three contrasts, MM8-PM8, MM8-MA8, and MM8-MM24, of the provided microarray data during this workshop, this analysis aimed to investigate the host reactions in chickens occurring shortly after a secondary challenge with either a homologous or heterologous species of Eimeria. The results of GO enrichment analysis using GO terms annotated to chicken genes and GO terms annotated to chicken-human orthologous genes were also compared. Furthermore, a locally adaptive statistical procedure (LAP) was performed to test differentially expressed chromosomal regions, rather than individual genes, in the chicken genome after Eimeria challenge. GO enrichment analysis identified significant (raw p-value < 0.05) GO terms for all three contrasts included in the analysis. Some of the GO terms linked to, generally, primary immune responses or secondary immune responses indicating the GO enrichment analysis is a useful approach to analyze microarray data. The comparisons of GO enrichment results using chicken gene information and chicken-human orthologous gene information showed more refined GO terms related to immune responses when using chicken-human orthologous gene information, this suggests that using chicken-human orthologous gene information has higher power to detect significant GO terms with more refined functionality. Furthermore, three chromosome regions were identified to be significantly up-regulated in contrast MM8-PM8 (q-value < 0.01). Overall, this paper describes a practical approach to analyze microarray data in farm animals where the genome information is still incomplete. For farm animals, such as chicken, with currently limited gene annotation, borrowing gene annotation information from orthologous genes in well-annotated species, such as human, will help improve the pathway analysis results substantially. Furthermore, LAP analysis approach is a relatively new and very useful way to be applied in microarray analysis.
Ramirez-Córdova, Jesús; Drnevich, Jenny; Madrigal-Pulido, Jaime Alberto; Arrizon, Javier; Allen, Kirk; Martínez-Velázquez, Moisés; Alvarez-Maya, Ikuri
2012-08-01
During ethanol fermentation, yeast cells are exposed to stress due to the accumulation of ethanol, cell growth is altered and the output of the target product is reduced. For Agave beverages, like tequila, no reports have been published on the global gene expression under ethanol stress. In this work, we used microarray analysis to identify Saccharomyces cerevisiae genes involved in the ethanol response. Gene expression of a tequila yeast strain of S. cerevisiae (AR5) was explored by comparing global gene expression with that of laboratory strain S288C, both after ethanol exposure. Additionally, we used two different culture conditions, cells grown in Agave tequilana juice as a natural fermentation media or grown in yeast-extract peptone dextrose as artificial media. Of the 6368 S. cerevisiae genes in the microarray, 657 genes were identified that had different expression responses to ethanol stress due to strain and/or media. A cluster of 28 genes was found over-expressed specifically in the AR5 tequila strain that could be involved in the adaptation to tequila yeast fermentation, 14 of which are unknown such as yor343c, ylr162w, ygr182c, ymr265c, yer053c-a or ydr415c. These could be the most suitable genes for transforming tequila yeast to increase ethanol tolerance in the tequila fermentation process. Other genes involved in response to stress (RFC4, TSA1, MLH1, PAU3, RAD53) or transport (CYB2, TIP20, QCR9) were expressed in the same cluster. Unknown genes could be good candidates for the development of recombinant yeasts with ethanol tolerance for use in industrial tequila fermentation.
Yasuike, Motoshige; Fujiwara, Atushi; Nakamura, Yoji; Iwasaki, Yuki; Nishiki, Issei; Sugaya, Takuma; Shimizu, Akio; Sano, Motohiko; Kobayashi, Takanori; Ototake, Mitsuru
2016-02-01
Bluefin tunas are one of the most important fishery resources worldwide. Because of high market values, bluefin tuna farming has been rapidly growing during recent years. At present, the most common form of the tuna farming is based on the stocking of wild-caught fish. Therefore, concerns have been raised about the negative impact of the tuna farming on wild stocks. Recently, the Pacific bluefin tuna (PBT), Thunnus orientalis, has succeeded in completing the reproduction cycle under aquaculture conditions, but production bottlenecks remain to be solved because of very little biological information on bluefin tunas. Functional genomics approaches promise to rapidly increase our knowledge on biological processes in the bluefin tuna. Here, we describe the development of the first 44K PBT oligonucleotide microarray (oligo-array), based on whole-genome shotgun (WGS) sequencing and large-scale expressed sequence tags (ESTs) data. In addition, we also introduce an initial 44K PBT oligo-array experiment using in vitro grown peripheral blood leukocytes (PBLs) stimulated with immunostimulants such as lipopolysaccharide (LPS: a cell wall component of Gram-negative bacteria) or polyinosinic:polycytidylic acid (poly I:C: a synthetic mimic of viral infection). This pilot 44K PBT oligo-array analysis successfully addressed distinct immune processes between LPS- and poly I:C- stimulated PBLs. Thus, we expect that this oligo-array will provide an excellent opportunity to analyze global gene expression profiles for a better understanding of diseases and stress, as well as for reproduction, development and influence of nutrition on tuna aquaculture production. Copyright © 2015 The Authors. Published by Elsevier B.V. All rights reserved.
Mirmirani, P; Consolo, M; Oyetakin-White, P; Baron, E; Leahy, P; Karnik, P
2015-06-01
There are regional variations in the scalp hair miniaturization seen in androgenetic alopecia (AGA). Use of topical minoxidil can lead to reversal of miniaturization in the vertex scalp. However, its effects on other scalp regions have been less well studied. To determine whether scalp biopsies from men with AGA show variable gene expression before and after 8 weeks of treatment with minoxidil topical foam 5% (MTF) vs. placebo. A placebo-controlled double-blinded prospective pilot study of MTF vs. placebo was conducted in 16 healthy men aged 18-49 years with Hamilton-Norwood type IV-V thinning. The subjects were asked to apply the treatment (active drug or placebo) to the scalp twice daily for 8 weeks. Stereotactic scalp photographs were taken at the baseline and final visits, to monitor global hair growth. Scalp biopsies were taken at the leading edge of hair loss from the frontal and vertex scalp before and after treatment with MTF and placebo, and microarray analysis was performed using the Affymetrix GeneChip HG U133 Plus 2.0. Global stereotactic photographs showed that MTF induced hair growth in both the frontal and vertex scalp of patients with AGA. Regional differences in gene expression profiles were observed before treatment. However, MTF treatment induced the expression of hair keratin-associated genes and decreased the expression of epidermal differentiation complex and inflammatory genes in both scalp regions. These data suggest that MTF is effective in the treatment of both the frontal and vertex scalp of patients with AGA. © 2014 British Association of Dermatologists.
Polyadenylation state microarray (PASTA) analysis.
Beilharz, Traude H; Preiss, Thomas
2011-01-01
Nearly all eukaryotic mRNAs terminate in a poly(A) tail that serves important roles in mRNA utilization. In the cytoplasm, the poly(A) tail promotes both mRNA stability and translation, and these functions are frequently regulated through changes in tail length. To identify the scope of poly(A) tail length control in a transcriptome, we developed the polyadenylation state microarray (PASTA) method. It involves the purification of mRNA based on poly(A) tail length using thermal elution from poly(U) sepharose, followed by microarray analysis of the resulting fractions. In this chapter we detail our PASTA approach and describe some methods for bulk and mRNA-specific poly(A) tail length measurements of use to monitor the procedure and independently verify the microarray data.
The Use of Atomic Force Microscopy for 3D Analysis of Nucleic Acid Hybridization on Microarrays.
Dubrovin, E V; Presnova, G V; Rubtsova, M Yu; Egorov, A M; Grigorenko, V G; Yaminsky, I V
2015-01-01
Oligonucleotide microarrays are considered today to be one of the most efficient methods of gene diagnostics. The capability of atomic force microscopy (AFM) to characterize the three-dimensional morphology of single molecules on a surface allows one to use it as an effective tool for the 3D analysis of a microarray for the detection of nucleic acids. The high resolution of AFM offers ways to decrease the detection threshold of target DNA and increase the signal-to-noise ratio. In this work, we suggest an approach to the evaluation of the results of hybridization of gold nanoparticle-labeled nucleic acids on silicon microarrays based on an AFM analysis of the surface both in air and in liquid which takes into account of their three-dimensional structure. We suggest a quantitative measure of the hybridization results which is based on the fraction of the surface area occupied by the nanoparticles.
The Utility of Chromosomal Microarray Analysis in Developmental and Behavioral Pediatrics
ERIC Educational Resources Information Center
Beaudet, Arthur L.
2013-01-01
Chromosomal microarray analysis (CMA) has emerged as a powerful new tool to identify genomic abnormalities associated with a wide range of developmental disabilities including congenital malformations, cognitive impairment, and behavioral abnormalities. CMA includes array comparative genomic hybridization (CGH) and single nucleotide polymorphism…
Quintela, Telma; Gonçalves, Isabel; Carreto, Laura C; Santos, Manuel A S; Marcelino, Helena; Patriarca, Filipa M; Santos, Cecília R A
2013-01-01
The choroid plexus (CP) are highly vascularized branched structures that protrude into the ventricles of the brain, and form a unique interface between the blood and the cerebrospinal fluid (CSF), the blood-CSF barrier, that are the main site of production and secretion of CSF. Sex hormones are widely recognized as neuroprotective agents against several neurodegenerative diseases, and the presence of sex hormones cognate receptors suggest that it may be a target for these hormones. In an effort to provide further insight into the neuroprotective mechanisms triggered by sex hormones we analyzed gene expression differences in the CP of female and male rats subjected to gonadectomy, using microarray technology. In gonadectomized female and male animals, 3045 genes were differentially expressed by 1.5-fold change, compared to sham controls. Analysis of the CP transcriptome showed that the top-five pathways significantly regulated by the sex hormone background are olfactory transduction, taste transduction, metabolism, steroid hormone biosynthesis and circadian rhythm pathways. These results represent the first overview of global expression changes in CP of female and male rats induced by gonadectomy and suggest that sex hormones are implicated in pathways with central roles in CP functions and CSF homeostasis.
Quintela, Telma; Gonçalves, Isabel; Carreto, Laura C.; Santos, Manuel A. S.; Marcelino, Helena; Patriarca, Filipa M.; Santos, Cecília R. A.
2013-01-01
The choroid plexus (CP) are highly vascularized branched structures that protrude into the ventricles of the brain, and form a unique interface between the blood and the cerebrospinal fluid (CSF), the blood-CSF barrier, that are the main site of production and secretion of CSF. Sex hormones are widely recognized as neuroprotective agents against several neurodegenerative diseases, and the presence of sex hormones cognate receptors suggest that it may be a target for these hormones. In an effort to provide further insight into the neuroprotective mechanisms triggered by sex hormones we analyzed gene expression differences in the CP of female and male rats subjected to gonadectomy, using microarray technology. In gonadectomized female and male animals, 3045 genes were differentially expressed by 1.5-fold change, compared to sham controls. Analysis of the CP transcriptome showed that the top-five pathways significantly regulated by the sex hormone background are olfactory transduction, taste transduction, metabolism, steroid hormone biosynthesis and circadian rhythm pathways. These results represent the first overview of global expression changes in CP of female and male rats induced by gonadectomy and suggest that sex hormones are implicated in pathways with central roles in CP functions and CSF homeostasis. PMID:23585832
Wentz, Elisabet; Vujic, Mihailo; Kärrstedt, Ewa-Lotta; Erlandsson, Anna; Gillberg, Christopher
2014-05-01
Autism spectrum disorder, severe behaviour problems and duplication of the Xq12 to Xq13 region have recently been described in three male relatives. To describe the psychiatric comorbidity and dysmorphic features, including craniosynostosis, of two male siblings with autism and duplication of the Xq13 to Xq21 region, and attempt to narrow down the number of duplicated genes proposed to be leading to global developmental delay and autism. We performed DNA sequencing of certain exons of the TWIST1 gene, the FGFR2 gene and the FGFR3 gene. We also performed microarray analysis of the DNA. In addition to autism, the two male siblings exhibited severe learning disability, self-injurious behaviour, temper tantrums and hyperactivity, and had no communicative language. Chromosomal analyses were normal. Neither of the two siblings showed mutations of the sequenced exons known to produce craniosynostosis. The microarray analysis detected an extra copy of a region on the long arm of chromosome X, chromosome band Xq13.1-q21.1. Comparison of our two cases with previously described patients allowed us to identify three genes predisposing for autism in the duplicated chromosomal region. Sagittal craniosynostosis is also a new finding linked to the duplication.
Severino, Patricia; Alvares, Adriana M; Michaluart, Pedro; Okamoto, Oswaldo K; Nunes, Fabio D; Moreira-Filho, Carlos A; Tajara, Eloiza H
2008-01-01
Background Oral squamous cell carcinoma (OSCC) is a frequent neoplasm, which is usually aggressive and has unpredictable biological behavior and unfavorable prognosis. The comprehension of the molecular basis of this variability should lead to the development of targeted therapies as well as to improvements in specificity and sensitivity of diagnosis. Results Samples of primary OSCCs and their corresponding surgical margins were obtained from male patients during surgery and their gene expression profiles were screened using whole-genome microarray technology. Hierarchical clustering and Principal Components Analysis were used for data visualization and One-way Analysis of Variance was used to identify differentially expressed genes. Samples clustered mostly according to disease subsite, suggesting molecular heterogeneity within tumor stages. In order to corroborate our results, two publicly available datasets of microarray experiments were assessed. We found significant molecular differences between OSCC anatomic subsites concerning groups of genes presently or potentially important for drug development, including mRNA processing, cytoskeleton organization and biogenesis, metabolic process, cell cycle and apoptosis. Conclusion Our results corroborate literature data on molecular heterogeneity of OSCCs. Differences between disease subsites and among samples belonging to the same TNM class highlight the importance of gene expression-based classification and challenge the development of targeted therapies. PMID:19014556
Long noncoding RNA OR3A4 promotes metastasis and tumorigenicity in gastric cancer
Guo, Xiaobo; Yang, Ziguo; Zhi, Qiaoming; Wang, Dan; Guo, Lei; Li, Guimei; Miao, Ruizhen; Shi, Yulong; Kuang, Yuting
2016-01-01
The contribution of long noncoding RNAs (lncRNAs) to metastasis of gastric cancer remains largely unknown. We used microarray analysis to identify lncRNAs differentially expressed between normal gastric tissues and gastric cancer tissues and validated these differences in quantitative real-time (qRT)-PCR experiments. The expression levels of lncRNA olfactory receptor, family 3, subfamily A, member 4 (OR3A4) were significantly associated with lymphatic metastasis, the depth of cancer invasion, and distal metastasis in 130 paired gastric cancer tissues. The effects of OR3A4 were assessed by overexpressing and silencing OR3A4 in gastric cancer cells. OR3A4 promoted cancer cell growth, angiogenesis, metastasis, and tumorigenesis in vitro and in vivo. Global microarray analysis combined with RT-PCR, RNA immunoprecipitation, and RNA pull-down analyses after OR3A4 transfection demonstrated that OR3A4 influenced biologic functions in gastric cancer cells via regulating the activation of PDLIM2, MACC1, NTN4, and GNB2L1. Our results reveal OR3A4 as an oncogenic lncRNA that promotes tumor progression, Therefore, lncRNAs might function as key regulatory hubs in gastric cancer progression. PMID:26863570
Identification of a transcriptional signature for the wound healing continuum
Peake, Matthew A; Caley, Mathew; Giles, Peter J; Wall, Ivan; Enoch, Stuart; Davies, Lindsay C; Kipling, David; Thomas, David W; Stephens, Phil
2014-01-01
There is a spectrum/continuum of adult human wound healing outcomes ranging from the enhanced (nearly scarless) healing observed in oral mucosa to scarring within skin and the nonhealing of chronic skin wounds. Central to these outcomes is the role of the fibroblast. Global gene expression profiling utilizing microarrays is starting to give insight into the role of such cells during the healing process, but no studies to date have produced a gene signature for this wound healing continuum. Microarray analysis of adult oral mucosal fibroblast (OMF), normal skin fibroblast (NF), and chronic wound fibroblast (CWF) at 0 and 6 hours post-serum stimulation was performed. Genes whose expression increases following serum exposure in the order OMF < NF < CWF are candidates for a negative/impaired healing phenotype (the dysfunctional healing group), whereas genes with the converse pattern are potentially associated with a positive/preferential healing phenotype (the enhanced healing group). Sixty-six genes in the enhanced healing group and 38 genes in the dysfunctional healing group were identified. Overrepresentation analysis revealed pathways directly and indirectly associated with wound healing and aging and additional categories associated with differentiation, development, and morphogenesis. Knowledge of this wound healing continuum gene signature may in turn assist in the therapeutic assessment/treatment of a patient's wounds. PMID:24844339
2011-01-01
Background Cytogenetic evaluation is a key component of the diagnosis and prognosis of chronic lymphocytic leukemia (CLL). We performed oligonucleotide-based comparative genomic hybridization microarray analysis on 34 samples with CLL and known abnormal karyotypes previously determined by cytogenetics and/or fluorescence in situ hybridization (FISH). Results Using a custom designed microarray that targets >1800 genes involved in hematologic disease and other malignancies, we identified additional cryptic aberrations and novel findings in 59% of cases. These included gains and losses of genes associated with cell cycle regulation, apoptosis and susceptibility loci on 3p21.31, 5q35.2q35.3, 10q23.31q23.33, 11q22.3, and 22q11.23. Conclusions Our results show that microarray analysis will detect known aberrations, including microscopic and cryptic alterations. In addition, novel genomic changes will be uncovered that may become important prognostic predictors or treatment targets for CLL in the future. PMID:22087757
Interim report on updated microarray probes for the LLNL Burkholderia pseudomallei SNP array
DOE Office of Scientific and Technical Information (OSTI.GOV)
Gardner, S; Jaing, C
2012-03-27
The overall goal of this project is to forensically characterize 100 unknown Burkholderia isolates in the US-Australia collaboration. We will identify genome-wide single nucleotide polymorphisms (SNPs) from B. pseudomallei and near neighbor species including B. mallei, B. thailandensis and B. oklahomensis. We will design microarray probes to detect these SNP markers and analyze 100 Burkholderia genomic DNAs extracted from environmental, clinical and near neighbor isolates from Australian collaborators on the Burkholderia SNP microarray. We will analyze the microarray genotyping results to characterize the genetic diversity of these new isolates and triage the samples for whole genome sequencing. In this interimmore » report, we described the SNP analysis and the microarray probe design for the Burkholderia SNP microarray.« less
Generalized Correlation Coefficient for Non-Parametric Analysis of Microarray Time-Course Data.
Tan, Qihua; Thomassen, Mads; Burton, Mark; Mose, Kristian Fredløv; Andersen, Klaus Ejner; Hjelmborg, Jacob; Kruse, Torben
2017-06-06
Modeling complex time-course patterns is a challenging issue in microarray study due to complex gene expression patterns in response to the time-course experiment. We introduce the generalized correlation coefficient and propose a combinatory approach for detecting, testing and clustering the heterogeneous time-course gene expression patterns. Application of the method identified nonlinear time-course patterns in high agreement with parametric analysis. We conclude that the non-parametric nature in the generalized correlation analysis could be an useful and efficient tool for analyzing microarray time-course data and for exploring the complex relationships in the omics data for studying their association with disease and health.
Huerta, Mario; Munyi, Marc; Expósito, David; Querol, Enric; Cedano, Juan
2014-06-15
The microarrays performed by scientific teams grow exponentially. These microarray data could be useful for researchers around the world, but unfortunately they are underused. To fully exploit these data, it is necessary (i) to extract these data from a repository of the high-throughput gene expression data like Gene Expression Omnibus (GEO) and (ii) to make the data from different microarrays comparable with tools easy to use for scientists. We have developed these two solutions in our server, implementing a database of microarray marker genes (Marker Genes Data Base). This database contains the marker genes of all GEO microarray datasets and it is updated monthly with the new microarrays from GEO. Thus, researchers can see whether the marker genes of their microarray are marker genes in other microarrays in the database, expanding the analysis of their microarray to the rest of the public microarrays. This solution helps not only to corroborate the conclusions regarding a researcher's microarray but also to identify the phenotype of different subsets of individuals under investigation, to frame the results with microarray experiments from other species, pathologies or tissues, to search for drugs that promote the transition between the studied phenotypes, to detect undesirable side effects of the treatment applied, etc. Thus, the researcher can quickly add relevant information to his/her studies from all of the previous analyses performed in other studies as long as they have been deposited in public repositories. Marker-gene database tool: http://ibb.uab.es/mgdb © The Author 2014. Published by Oxford University Press.
Zeller, Tanja; Wild, Philipp S.; Truong, Vinh; Trégouët, David-Alexandre; Munzel, Thomas; Ziegler, Andreas; Cambien, François; Blankenberg, Stefan; Tiret, Laurence
2011-01-01
Background The hypothesis of dosage compensation of genes of the X chromosome, supported by previous microarray studies, was recently challenged by RNA-sequencing data. It was suggested that microarray studies were biased toward an over-estimation of X-linked expression levels as a consequence of the filtering of genes below the detection threshold of microarrays. Methodology/Principal Findings To investigate this hypothesis, we used microarray expression data from circulating monocytes in 1,467 individuals. In total, 25,349 and 1,156 probes were unambiguously assigned to autosomes and the X chromosome, respectively. Globally, there was a clear shift of X-linked expressions toward lower levels than autosomes. We compared the ratio of expression levels of X-linked to autosomal transcripts (X∶AA) using two different filtering methods: 1. gene expressions were filtered out using a detection threshold irrespective of gene chromosomal location (the standard method in microarrays); 2. equal proportions of genes were filtered out separately on the X and on autosomes. For a wide range of filtering proportions, the X∶AA ratio estimated with the first method was not significantly different from 1, the value expected if dosage compensation was achieved, whereas it was significantly lower than 1 with the second method, leading to the rejection of the hypothesis of dosage compensation. We further showed in simulated data that the choice of the most appropriate method was dependent on biological assumptions regarding the proportion of actively expressed genes on the X chromosome comparative to the autosomes and the extent of dosage compensation. Conclusion/Significance This study shows that the method used for filtering out lowly expressed genes in microarrays may have a major impact according to the hypothesis investigated. The hypothesis of dosage compensation of X-linked genes cannot be firmly accepted or rejected using microarray-based data. PMID:21912656
Bălăcescu, Loredana; Bălăcescu, O; Crişan, N; Fetica, B; Petruţ, B; Bungărdean, Cătălina; Rus, Meda; Tudoran, Oana; Meurice, G; Irimie, Al; Dragoş, N; Berindan-Neagoe, Ioana
2011-01-01
Prostate cancer represents the first leading cause of cancer among western male population, with different clinical behavior ranging from indolent to metastatic disease. Although many molecules and deregulated pathways are known, the molecular mechanisms involved in the development of prostate cancer are not fully understood. The aim of this study was to explore the molecular variation underlying the prostate cancer, based on microarray analysis and bioinformatics approaches. Normal and prostate cancer tissues were collected by macrodissection from prostatectomy pieces. All prostate cancer specimens used in our study were Gleason score 7. Gene expression microarray (Agilent Technologies) was used for Whole Human Genome evaluation. The bioinformatics and functional analysis were based on Limma and Ingenuity software. The microarray analysis identified 1119 differentially expressed genes between prostate cancer and normal prostate, which were up- or down-regulated at least 2-fold. P-values were adjusted for multiple testing using Benjamini-Hochberg method with a false discovery rate of 0.01. These genes were analyzed with Ingenuity Pathway Analysis software and were established 23 genetic networks. Our microarray results provide new information regarding the molecular networks in prostate cancer stratified as Gleason 7. These data highlighted gene expression profiles for better understanding of prostate cancer progression.
Saka, Ernur; Harrison, Benjamin J; West, Kirk; Petruska, Jeffrey C; Rouchka, Eric C
2017-12-06
Since the introduction of microarrays in 1995, researchers world-wide have used both commercial and custom-designed microarrays for understanding differential expression of transcribed genes. Public databases such as ArrayExpress and the Gene Expression Omnibus (GEO) have made millions of samples readily available. One main drawback to microarray data analysis involves the selection of probes to represent a specific transcript of interest, particularly in light of the fact that transcript-specific knowledge (notably alternative splicing) is dynamic in nature. We therefore developed a framework for reannotating and reassigning probe groups for Affymetrix® GeneChip® technology based on functional regions of interest. This framework addresses three issues of Affymetrix® GeneChip® data analyses: removing nonspecific probes, updating probe target mapping based on the latest genome knowledge and grouping probes into gene, transcript and region-based (UTR, individual exon, CDS) probe sets. Updated gene and transcript probe sets provide more specific analysis results based on current genomic and transcriptomic knowledge. The framework selects unique probes, aligns them to gene annotations and generates a custom Chip Description File (CDF). The analysis reveals only 87% of the Affymetrix® GeneChip® HG-U133 Plus 2 probes uniquely align to the current hg38 human assembly without mismatches. We also tested new mappings on the publicly available data series using rat and human data from GSE48611 and GSE72551 obtained from GEO, and illustrate that functional grouping allows for the subtle detection of regions of interest likely to have phenotypical consequences. Through reanalysis of the publicly available data series GSE48611 and GSE72551, we profiled the contribution of UTR and CDS regions to the gene expression levels globally. The comparison between region and gene based results indicated that the detected expressed genes by gene-based and region-based CDFs show high consistency and regions based results allows us to detection of changes in transcript formation.
Tojo, Axel; Malm, Johan; Marko-Varga, György; Lilja, Hans; Laurell, Thomas
2014-01-01
The antibody microarrays have become widespread, but their use for quantitative analyses in clinical samples has not yet been established. We investigated an immunoassay based on nanoporous silicon antibody microarrays for quantification of total prostate-specific-antigen (PSA) in 80 clinical plasma samples, and provide quantitative data from a duplex microarray assay that simultaneously quantifies free and total PSA in plasma. To further develop the assay the porous silicon chips was placed into a standard 96-well microtiter plate for higher throughput analysis. The samples analyzed by this quantitative microarray were 80 plasma samples obtained from men undergoing clinical PSA testing (dynamic range: 0.14-44ng/ml, LOD: 0.14ng/ml). The second dataset, measuring free PSA (dynamic range: 0.40-74.9ng/ml, LOD: 0.47ng/ml) and total PSA (dynamic range: 0.87-295ng/ml, LOD: 0.76ng/ml), was also obtained from the clinical routine. The reference for the quantification was a commercially available assay, the ProStatus PSA Free/Total DELFIA. In an analysis of 80 plasma samples the microarray platform performs well across the range of total PSA levels. This assay might have the potential to substitute for the large-scale microtiter plate format in diagnostic applications. The duplex assay paves the way for a future quantitative multiplex assay, which analyses several prostate cancer biomarkers simultaneously. PMID:22921878
Wang, Zheng; Malanoski, Anthony P; Lin, Baochuan; Kidd, Carolyn; Long, Nina C; Blaney, Kate M; Thach, Dzung C; Tibbetts, Clark; Stenger, David A
2008-01-01
Background Febrile respiratory illness (FRI) has a high impact on public health and global economics and poses a difficult challenge for differential diagnosis. A particular issue is the detection of genetically diverse pathogens, i.e. human rhinoviruses (HRV) and enteroviruses (HEV) which are frequent causes of FRI. Resequencing Pathogen Microarray technology has demonstrated potential for differential diagnosis of several respiratory pathogens simultaneously, but a high confidence design method to select probes for genetically diverse viruses is lacking. Results Using HRV and HEV as test cases, we assess a general design strategy for detecting and serotyping genetically diverse viruses. A minimal number of probe sequences (26 for HRV and 13 for HEV), which were potentially capable of detecting all serotypes of HRV and HEV, were determined and implemented on the Resequencing Pathogen Microarray RPM-Flu v.30/31 (Tessarae RPM-Flu). The specificities of designed probes were validated using 34 HRV and 28 HEV strains. All strains were successfully detected and identified at least to species level. 33 HRV strains and 16 HEV strains could be further differentiated to serotype level. Conclusion This study provides a fundamental evaluation of simultaneous detection and differential identification of genetically diverse RNA viruses with a minimal number of prototype sequences. The results demonstrated that the newly designed RPM-Flu v.30/31 can provide comprehensive and specific analysis of HRV and HEV samples which implicates that this design strategy will be applicable for other genetically diverse viruses. PMID:19046445
2011-01-01
Background Understanding the genetic elements that contribute to key aspects of coffee biology will have an impact on future agronomical improvements for this economically important tree. During the past years, EST collections were generated in Coffee, opening the possibility to create new tools for functional genomics. Results The "PUCE CAFE" Project, organized by the scientific consortium NESTLE/IRD/CIRAD, has developed an oligo-based microarray using 15,721 unigenes derived from published coffee EST sequences mostly obtained from different stages of fruit development and leaves in Coffea Canephora (Robusta). Hybridizations for two independent experiments served to compare global gene expression profiles in three types of tissue matter (mature beans, leaves and flowers) in C. canephora as well as in the leaves of three different coffee species (C. canephora, C. eugenoides and C. arabica). Microarray construction, statistical analyses and validation by Q-PCR analysis are presented in this study. Conclusion We have generated the first 15 K coffee array during this PUCE CAFE project, granted by Génoplante (the French consortium for plant genomics). This new tool will help study functional genomics in a wide range of experiments on various plant tissues, such as analyzing bean maturation or resistance to pathogens or drought. Furthermore, the use of this array has proven to be valid in different coffee species (diploid or tetraploid), drastically enlarging its impact for high-throughput gene expression in the community of coffee research. PMID:21208403
NASA Technical Reports Server (NTRS)
Wilson, James W.; Ramamurthy, Rajee; Porwollik, Steffen; McClelland, Michael; Hammond, Timothy; Allen, Pat; Ott, C. Mark; Pierson, Duane L.; Nickerson, Cheryl A.
2002-01-01
The low-shear environment of optimized rotation suspension culture allows both eukaryotic and prokaryotic cells to assume physiologically relevant phenotypes that have led to significant advances in fundamental investigations of medical and biological importance. This culture environment has also been used to model microgravity for ground-based studies regarding the impact of space flight on eukaryotic and prokaryotic physiology. We have previously demonstrated that low-shear modeled microgravity (LSMMG) under optimized rotation suspension culture is a novel environmental signal that regulates the virulence, stress resistance, and protein expression levels of Salmonella enterica serovar Typhimurium. However, the mechanisms used by the cells of any species, including Salmonella, to sense and respond to LSMMG and identities of the genes involved are unknown. In this study, we used DNA microarrays to elucidate the global transcriptional response of Salmonella to LSMMG. When compared with identical growth conditions under normal gravity (1 x g), LSMMG differentially regulated the expression of 163 genes distributed throughout the chromosome, representing functionally diverse groups including transcriptional regulators, virulence factors, lipopolysaccharide biosynthetic enzymes, iron-utilization enzymes, and proteins of unknown function. Many of the LSMMG-regulated genes were organized in clusters or operons. The microarray results were further validated by RT-PCR and phenotypic analyses, and they indicate that the ferric uptake regulator is involved in the LSMMG response. The results provide important insight about the Salmonella LSMMG response and could provide clues for the functioning of known Salmonella virulence systems or the identification of uncharacterized bacterial virulence strategies.
Privat, Isabelle; Bardil, Amélie; Gomez, Aureliano Bombarely; Severac, Dany; Dantec, Christelle; Fuentes, Ivanna; Mueller, Lukas; Joët, Thierry; Pot, David; Foucrier, Séverine; Dussert, Stéphane; Leroy, Thierry; Journot, Laurent; de Kochko, Alexandre; Campa, Claudine; Combes, Marie-Christine; Lashermes, Philippe; Bertrand, Benoit
2011-01-05
Understanding the genetic elements that contribute to key aspects of coffee biology will have an impact on future agronomical improvements for this economically important tree. During the past years, EST collections were generated in Coffee, opening the possibility to create new tools for functional genomics. The "PUCE CAFE" Project, organized by the scientific consortium NESTLE/IRD/CIRAD, has developed an oligo-based microarray using 15,721 unigenes derived from published coffee EST sequences mostly obtained from different stages of fruit development and leaves in Coffea Canephora (Robusta). Hybridizations for two independent experiments served to compare global gene expression profiles in three types of tissue matter (mature beans, leaves and flowers) in C. canephora as well as in the leaves of three different coffee species (C. canephora, C. eugenoides and C. arabica). Microarray construction, statistical analyses and validation by Q-PCR analysis are presented in this study. We have generated the first 15 K coffee array during this PUCE CAFE project, granted by Génoplante (the French consortium for plant genomics). This new tool will help study functional genomics in a wide range of experiments on various plant tissues, such as analyzing bean maturation or resistance to pathogens or drought. Furthermore, the use of this array has proven to be valid in different coffee species (diploid or tetraploid), drastically enlarging its impact for high-throughput gene expression in the community of coffee research.
Patel, Isha R.; Gangiredla, Jayanthi; Lacher, David W.; Mammel, Mark K.; Jackson, Scott A.; Lampel, Keith A.
2016-01-01
ABSTRACT Most Escherichia coli strains are nonpathogenic. However, for clinical diagnosis and food safety analysis, current identification methods for pathogenic E. coli either are time-consuming and/or provide limited information. Here, we utilized a custom DNA microarray with informative genetic features extracted from 368 sequence sets for rapid and high-throughput pathogen identification. The FDA Escherichia coli Identification (FDA-ECID) platform contains three sets of molecularly informative features that together stratify strain identification and relatedness. First, 53 known flagellin alleles, 103 alleles of wzx and wzy, and 5 alleles of wzm provide molecular serotyping utility. Second, 41,932 probe sets representing the pan-genome of E. coli provide strain-level gene content information. Third, approximately 125,000 single nucleotide polymorphisms (SNPs) of available whole-genome sequences (WGS) were distilled to 9,984 SNPs capable of recapitulating the E. coli phylogeny. We analyzed 103 diverse E. coli strains with available WGS data, including those associated with past foodborne illnesses, to determine robustness and accuracy. The array was able to accurately identify the molecular O and H serotypes, potentially correcting serological failures and providing better resolution for H-nontypeable/nonmotile phenotypes. In addition, molecular risk assessment was possible with key virulence marker identifications. Epidemiologically, each strain had a unique comparative genomic fingerprint that was extended to an additional 507 food and clinical isolates. Finally, a 99.7% phylogenetic concordance was established between microarray analysis and WGS using SNP-level data for advanced genome typing. Our study demonstrates FDA-ECID as a powerful tool for epidemiology and molecular risk assessment with the capacity to profile the global landscape and diversity of E. coli. IMPORTANCE This study describes a robust, state-of-the-art platform developed from available whole-genome sequences of E. coli and Shigella spp. by distilling useful signatures for epidemiology and molecular risk assessment into one assay. The FDA-ECID microarray contains features that enable comprehensive molecular serotyping and virulence profiling along with genome-scale genotyping and SNP analysis. Hence, it is a molecular toolbox that stratifies strain identification and pathogenic potential in the contexts of epidemiology and phylogeny. We applied this tool to strains from food, environmental, and clinical sources, resulting in significantly greater phylogenetic and strain-specific resolution than previously reported for available typing methods. PMID:27037122
Schmid, Patrick; Yao, Hui; Galdzicki, Michal; Berger, Bonnie; Wu, Erxi; Kohane, Isaac S.
2009-01-01
Background Although microarray technology has become the most common method for studying global gene expression, a plethora of technical factors across the experiment contribute to the variable of genome gene expression profiling using peripheral whole blood. A practical platform needs to be established in order to obtain reliable and reproducible data to meet clinical requirements for biomarker study. Methods and Findings We applied peripheral whole blood samples with globin reduction and performed genome-wide transcriptome analysis using Illumina BeadChips. Real-time PCR was subsequently used to evaluate the quality of array data and elucidate the mode in which hemoglobin interferes in gene expression profiling. We demonstrated that, when applied in the context of standard microarray processing procedures, globin reduction results in a consistent and significant increase in the quality of beadarray data. When compared to their pre-globin reduction counterparts, post-globin reduction samples show improved detection statistics, lowered variance and increased sensitivity. More importantly, gender gene separation is remarkably clearer in post-globin reduction samples than in pre-globin reduction samples. Our study suggests that the poor data obtained from pre-globin reduction samples is the result of the high concentration of hemoglobin derived from red blood cells either interfering with target mRNA binding or giving the pseudo binding background signal. Conclusion We therefore recommend the combination of performing globin mRNA reduction in peripheral whole blood samples and hybridizing on Illumina BeadChips as the practical approach for biomarker study. PMID:19381341
Canales, Javier; Moyano, Tomás C.; Villarroel, Eva; Gutiérrez, Rodrigo A.
2014-01-01
Nitrogen (N) is an essential macronutrient for plant growth and development. Plants adapt to changes in N availability partly by changes in global gene expression. We integrated publicly available root microarray data under contrasting nitrate conditions to identify new genes and functions important for adaptive nitrate responses in Arabidopsis thaliana roots. Overall, more than 2000 genes exhibited changes in expression in response to nitrate treatments in Arabidopsis thaliana root organs. Global regulation of gene expression by nitrate depends largely on the experimental context. However, despite significant differences from experiment to experiment in the identity of regulated genes, there is a robust nitrate response of specific biological functions. Integrative gene network analysis uncovered relationships between nitrate-responsive genes and 11 highly co-expressed gene clusters (modules). Four of these gene network modules have robust nitrate responsive functions such as transport, signaling, and metabolism. Network analysis hypothesized G2-like transcription factors are key regulatory factors controlling transport and signaling functions. Our meta-analysis highlights the role of biological processes not studied before in the context of the nitrate response such as root hair development and provides testable hypothesis to advance our understanding of nitrate responses in plants. PMID:24570678
2016-01-01
Abstract Microarray gene expression data sets are jointly analyzed to increase statistical power. They could either be merged together or analyzed by meta-analysis. For a given ensemble of data sets, it cannot be foreseen which of these paradigms, merging or meta-analysis, works better. In this article, three joint analysis methods, Z -score normalization, ComBat and the inverse normal method (meta-analysis) were selected for survival prognosis and risk assessment of breast cancer patients. The methods were applied to eight microarray gene expression data sets, totaling 1324 patients with two clinical endpoints, overall survival and relapse-free survival. The performance derived from the joint analysis methods was evaluated using Cox regression for survival analysis and independent validation used as bias estimation. Overall, Z -score normalization had a better performance than ComBat and meta-analysis. Higher Area Under the Receiver Operating Characteristic curve and hazard ratio were also obtained when independent validation was used as bias estimation. With a lower time and memory complexity, Z -score normalization is a simple method for joint analysis of microarray gene expression data sets. The derived findings suggest further assessment of this method in future survival prediction and cancer classification applications. PMID:26504096
Oligonucleotide microarrays are a powerful tool for unsupervised analysis of chemical impacts on biological systems. However, the lack of well annotated biological pathways for many aquatic organisms, including fish, and the poor power of microarray-based analyses to detect diffe...
Ogunnaike, Babatunde A; Gelmi, Claudio A; Edwards, Jeremy S
2010-05-21
Gene expression studies generate large quantities of data with the defining characteristic that the number of genes (whose expression profiles are to be determined) exceed the number of available replicates by several orders of magnitude. Standard spot-by-spot analysis still seeks to extract useful information for each gene on the basis of the number of available replicates, and thus plays to the weakness of microarrays. On the other hand, because of the data volume, treating the entire data set as an ensemble, and developing theoretical distributions for these ensembles provides a framework that plays instead to the strength of microarrays. We present theoretical results that under reasonable assumptions, the distribution of microarray intensities follows the Gamma model, with the biological interpretations of the model parameters emerging naturally. We subsequently establish that for each microarray data set, the fractional intensities can be represented as a mixture of Beta densities, and develop a procedure for using these results to draw statistical inference regarding differential gene expression. We illustrate the results with experimental data from gene expression studies on Deinococcus radiodurans following DNA damage using cDNA microarrays. Copyright (c) 2010 Elsevier Ltd. All rights reserved.
Microarray-based screening of heat shock protein inhibitors.
Schax, Emilia; Walter, Johanna-Gabriela; Märzhäuser, Helene; Stahl, Frank; Scheper, Thomas; Agard, David A; Eichner, Simone; Kirschning, Andreas; Zeilinger, Carsten
2014-06-20
Based on the importance of heat shock proteins (HSPs) in diseases such as cancer, Alzheimer's disease or malaria, inhibitors of these chaperons are needed. Today's state-of-the-art techniques to identify HSP inhibitors are performed in microplate format, requiring large amounts of proteins and potential inhibitors. In contrast, we have developed a miniaturized protein microarray-based assay to identify novel inhibitors, allowing analysis with 300 pmol of protein. The assay is based on competitive binding of fluorescence-labeled ATP and potential inhibitors to the ATP-binding site of HSP. Therefore, the developed microarray enables the parallel analysis of different ATP-binding proteins on a single microarray. We have demonstrated the possibility of multiplexing by immobilizing full-length human HSP90α and HtpG of Helicobacter pylori on microarrays. Fluorescence-labeled ATP was competed by novel geldanamycin/reblastatin derivatives with IC50 values in the range of 0.5 nM to 4 μM and Z(*)-factors between 0.60 and 0.96. Our results demonstrate the potential of a target-oriented multiplexed protein microarray to identify novel inhibitors for different members of the HSP90 family. Copyright © 2014 Elsevier B.V. All rights reserved.
Watson, Christopher M.; Crinnion, Laura A.; Gurgel‐Gianetti, Juliana; Harrison, Sally M.; Daly, Catherine; Antanavicuite, Agne; Lascelles, Carolina; Markham, Alexander F.; Pena, Sergio D. J.; Bonthron, David T.
2015-01-01
ABSTRACT Autozygosity mapping is a powerful technique for the identification of rare, autosomal recessive, disease‐causing genes. The ease with which this category of disease gene can be identified has greatly increased through the availability of genome‐wide SNP genotyping microarrays and subsequently of exome sequencing. Although these methods have simplified the generation of experimental data, its analysis, particularly when disparate data types must be integrated, remains time consuming. Moreover, the huge volume of sequence variant data generated from next generation sequencing experiments opens up the possibility of using these data instead of microarray genotype data to identify disease loci. To allow these two types of data to be used in an integrated fashion, we have developed AgileVCFMapper, a program that performs both the mapping of disease loci by SNP genotyping and the analysis of potentially deleterious variants using exome sequence variant data, in a single step. This method does not require microarray SNP genotype data, although analysis with a combination of microarray and exome genotype data enables more precise delineation of disease loci, due to superior marker density and distribution. PMID:26037133
Automatic Identification and Quantification of Extra-Well Fluorescence in Microarray Images.
Rivera, Robert; Wang, Jie; Yu, Xiaobo; Demirkan, Gokhan; Hopper, Marika; Bian, Xiaofang; Tahsin, Tasnia; Magee, D Mitchell; Qiu, Ji; LaBaer, Joshua; Wallstrom, Garrick
2017-11-03
In recent studies involving NAPPA microarrays, extra-well fluorescence is used as a key measure for identifying disease biomarkers because there is evidence to support that it is better correlated with strong antibody responses than statistical analysis involving intraspot intensity. Because this feature is not well quantified by traditional image analysis software, identification and quantification of extra-well fluorescence is performed manually, which is both time-consuming and highly susceptible to variation between raters. A system that could automate this task efficiently and effectively would greatly improve the process of data acquisition in microarray studies, thereby accelerating the discovery of disease biomarkers. In this study, we experimented with different machine learning methods, as well as novel heuristics, for identifying spots exhibiting extra-well fluorescence (rings) in microarray images and assigning each ring a grade of 1-5 based on its intensity and morphology. The sensitivity of our final system for identifying rings was found to be 72% at 99% specificity and 98% at 92% specificity. Our system performs this task significantly faster than a human, while maintaining high performance, and therefore represents a valuable tool for microarray image analysis.
Thermodynamically optimal whole-genome tiling microarray design and validation.
Cho, Hyejin; Chou, Hui-Hsien
2016-06-13
Microarray is an efficient apparatus to interrogate the whole transcriptome of species. Microarray can be designed according to annotated gene sets, but the resulted microarrays cannot be used to identify novel transcripts and this design method is not applicable to unannotated species. Alternatively, a whole-genome tiling microarray can be designed using only genomic sequences without gene annotations, and it can be used to detect novel RNA transcripts as well as known genes. The difficulty with tiling microarray design lies in the tradeoff between probe-specificity and coverage of the genome. Sequence comparison methods based on BLAST or similar software are commonly employed in microarray design, but they cannot precisely determine the subtle thermodynamic competition between probe targets and partially matched probe nontargets during hybridizations. Using the whole-genome thermodynamic analysis software PICKY to design tiling microarrays, we can achieve maximum whole-genome coverage allowable under the thermodynamic constraints of each target genome. The resulted tiling microarrays are thermodynamically optimal in the sense that all selected probes share the same melting temperature separation range between their targets and closest nontargets, and no additional probes can be added without violating the specificity of the microarray to the target genome. This new design method was used to create two whole-genome tiling microarrays for Escherichia coli MG1655 and Agrobacterium tumefaciens C58 and the experiment results validated the design.
Usadel, Björn; Nagel, Axel; Steinhauser, Dirk; Gibon, Yves; Bläsing, Oliver E; Redestig, Henning; Sreenivasulu, Nese; Krall, Leonard; Hannah, Matthew A; Poree, Fabien; Fernie, Alisdair R; Stitt, Mark
2006-12-18
Microarray technology has become a widely accepted and standardized tool in biology. The first microarray data analysis programs were developed to support pair-wise comparison. However, as microarray experiments have become more routine, large scale experiments have become more common, which investigate multiple time points or sets of mutants or transgenics. To extract biological information from such high-throughput expression data, it is necessary to develop efficient analytical platforms, which combine manually curated gene ontologies with efficient visualization and navigation tools. Currently, most tools focus on a few limited biological aspects, rather than offering a holistic, integrated analysis. Here we introduce PageMan, a multiplatform, user-friendly, and stand-alone software tool that annotates, investigates, and condenses high-throughput microarray data in the context of functional ontologies. It includes a GUI tool to transform different ontologies into a suitable format, enabling the user to compare and choose between different ontologies. It is equipped with several statistical modules for data analysis, including over-representation analysis and Wilcoxon statistical testing. Results are exported in a graphical format for direct use, or for further editing in graphics programs.PageMan provides a fast overview of single treatments, allows genome-level responses to be compared across several microarray experiments covering, for example, stress responses at multiple time points. This aids in searching for trait-specific changes in pathways using mutants or transgenics, analyzing development time-courses, and comparison between species. In a case study, we analyze the results of publicly available microarrays of multiple cold stress experiments using PageMan, and compare the results to a previously published meta-analysis.PageMan offers a complete user's guide, a web-based over-representation analysis as well as a tutorial, and is freely available at http://mapman.mpimp-golm.mpg.de/pageman/. PageMan allows multiple microarray experiments to be efficiently condensed into a single page graphical display. The flexible interface allows data to be quickly and easily visualized, facilitating comparisons within experiments and to published experiments, thus enabling researchers to gain a rapid overview of the biological responses in the experiments.
Identification of candidate genes in osteoporosis by integrated microarray analysis.
Li, J J; Wang, B Q; Fei, Q; Yang, Y; Li, D
2016-12-01
In order to screen the altered gene expression profile in peripheral blood mononuclear cells of patients with osteoporosis, we performed an integrated analysis of the online microarray studies of osteoporosis. We searched the Gene Expression Omnibus (GEO) database for microarray studies of peripheral blood mononuclear cells in patients with osteoporosis. Subsequently, we integrated gene expression data sets from multiple microarray studies to obtain differentially expressed genes (DEGs) between patients with osteoporosis and normal controls. Gene function analysis was performed to uncover the functions of identified DEGs. A total of three microarray studies were selected for integrated analysis. In all, 1125 genes were found to be significantly differentially expressed between osteoporosis patients and normal controls, with 373 upregulated and 752 downregulated genes. Positive regulation of the cellular amino metabolic process (gene ontology (GO): 0033240, false discovery rate (FDR) = 1.00E + 00) was significantly enriched under the GO category for biological processes, while for molecular functions, flavin adenine dinucleotide binding (GO: 0050660, FDR = 3.66E-01) and androgen receptor binding (GO: 0050681, FDR = 6.35E-01) were significantly enriched. DEGs were enriched in many osteoporosis-related signalling pathways, including those of mitogen-activated protein kinase (MAPK) and calcium. Protein-protein interaction (PPI) network analysis showed that the significant hub proteins contained ubiquitin specific peptidase 9, X-linked (Degree = 99), ubiquitin specific peptidase 19 (Degree = 57) and ubiquitin conjugating enzyme E2 B (Degree = 57). Analysis of gene function of identified differentially expressed genes may expand our understanding of fundamental mechanisms leading to osteoporosis. Moreover, significantly enriched pathways, such as MAPK and calcium, may involve in osteoporosis through osteoblastic differentiation and bone formation.Cite this article: J. J. Li, B. Q. Wang, Q. Fei, Y. Yang, D. Li. Identification of candidate genes in osteoporosis by integrated microarray analysis. Bone Joint Res 2016;5:594-601. DOI: 10.1302/2046-3758.512.BJR-2016-0073.R1. © 2016 Fei et al.
Founds, Sandra A.; Conley, Yvette P.; Lyons-Weiler, James F.; Jeyabalan, Arun; Hogge, W. Allen; Conrad, Kirk P.
2009-01-01
Background Preeclampsia is a pregnancy-specific disorder that remains a leading cause of maternal, fetal and neonatal morbidity and mortality, and is associated with risk for future cardiovascular disease. There are no reliable predictors, specific preventative measures or treatments other than delivery. A widely-held view is that the antecedents of preeclampsia lie with impaired placentation in early pregnancy. Accordingly, we hypothesized dysregulation of global gene expression in first trimester placentas of women who later manifested preeclampsia. Methods Surplus chorionic villus sampling (CVS) tissues were collected at 10–12 weeks gestation in 160 patients with singleton fetuses. Four patients developed preeclampsia, and their banked CVS specimens were matched to 8 control samples from patients with unaffected pregnancies. Affymetrix HG-U133 Plus 2.0 GeneChips were utilized for microarray analysis. Naïve Bayes prediction modeling and pathway analysis were conducted. qRT-PCR examined three of the dysregulated genes. Results Thirty-six differentially expressed genes were identified in the preeclampsia placentas. qRT-PCR verified the microarray analysis. Thirty-one genes were down-regulated. Many were related to inflammation/immunoregulation and cell motility. Decidual gene dysregulation was prominent. No evidence was found for alterations in hypoxia and oxidative stress regulated genes. Conclusions To our knowledge, this is the first study to show dysregulation of gene expression in the early placentas of women ~6 months before developing preeclampsia, thereby reinforcing a placental origin of the disorder. We hypothesize that placentation in preeclampsia is compromised in the first trimester by maternal and fetal immune dysregulation, abnormal decidualization, or both, thereby impairing trophoblast invasion. Several of the genes provide potential targets for the development of clinical biomarkers in maternal blood during the first trimester. Supplementary materials are available for this article via the publisher’s online edition. PMID:19027158
Wei, Kai-Fa; Chen, Juan; Chen, Yan-Feng; Wu, Ling-Juan; Xie, Dao-Xin
2012-01-01
The WRKY transcription factors function in plant growth and development, and response to the biotic and abiotic stresses. Although many studies have focused on the functional identification of the WRKY transcription factors, much less is known about molecular phylogenetic and global expression analysis of the complete WRKY family in maize. In this study, we identified 136 WRKY proteins coded by 119 genes in the B73 inbred line from the complete genome and named them in an orderly manner. Then, a comprehensive phylogenetic analysis of five species was performed to explore the origin and evolutionary patterns of these WRKY genes, and the result showed that gene duplication is the major driving force for the origin of new groups and subgroups and functional divergence during evolution. Chromosomal location analysis of maize WRKY genes indicated that 20 gene clusters are distributed unevenly in the genome. Microarray-based expression analysis has revealed that 131 WRKY transcripts encoded by 116 genes may participate in the regulation of maize growth and development. Among them, 102 transcripts are stably expressed with a coefficient of variation (CV) value of <15%. The remaining 29 transcripts produced by 25 WRKY genes with the CV value of >15% are further analysed to discover new organ- or tissue-specific genes. In addition, microarray analyses of transcriptional responses to drought stress and fungal infection showed that maize WRKY proteins are involved in stress responses. All these results contribute to a deep probing into the roles of WRKY transcription factors in maize growth and development and stress tolerance. PMID:22279089
Wei, Kai-Fa; Chen, Juan; Chen, Yan-Feng; Wu, Ling-Juan; Xie, Dao-Xin
2012-04-01
The WRKY transcription factors function in plant growth and development, and response to the biotic and abiotic stresses. Although many studies have focused on the functional identification of the WRKY transcription factors, much less is known about molecular phylogenetic and global expression analysis of the complete WRKY family in maize. In this study, we identified 136 WRKY proteins coded by 119 genes in the B73 inbred line from the complete genome and named them in an orderly manner. Then, a comprehensive phylogenetic analysis of five species was performed to explore the origin and evolutionary patterns of these WRKY genes, and the result showed that gene duplication is the major driving force for the origin of new groups and subgroups and functional divergence during evolution. Chromosomal location analysis of maize WRKY genes indicated that 20 gene clusters are distributed unevenly in the genome. Microarray-based expression analysis has revealed that 131 WRKY transcripts encoded by 116 genes may participate in the regulation of maize growth and development. Among them, 102 transcripts are stably expressed with a coefficient of variation (CV) value of <15%. The remaining 29 transcripts produced by 25 WRKY genes with the CV value of >15% are further analysed to discover new organ- or tissue-specific genes. In addition, microarray analyses of transcriptional responses to drought stress and fungal infection showed that maize WRKY proteins are involved in stress responses. All these results contribute to a deep probing into the roles of WRKY transcription factors in maize growth and development and stress tolerance.
Workflows for microarray data processing in the Kepler environment.
Stropp, Thomas; McPhillips, Timothy; Ludäscher, Bertram; Bieda, Mark
2012-05-17
Microarray data analysis has been the subject of extensive and ongoing pipeline development due to its complexity, the availability of several options at each analysis step, and the development of new analysis demands, including integration with new data sources. Bioinformatics pipelines are usually custom built for different applications, making them typically difficult to modify, extend and repurpose. Scientific workflow systems are intended to address these issues by providing general-purpose frameworks in which to develop and execute such pipelines. The Kepler workflow environment is a well-established system under continual development that is employed in several areas of scientific research. Kepler provides a flexible graphical interface, featuring clear display of parameter values, for design and modification of workflows. It has capabilities for developing novel computational components in the R, Python, and Java programming languages, all of which are widely used for bioinformatics algorithm development, along with capabilities for invoking external applications and using web services. We developed a series of fully functional bioinformatics pipelines addressing common tasks in microarray processing in the Kepler workflow environment. These pipelines consist of a set of tools for GFF file processing of NimbleGen chromatin immunoprecipitation on microarray (ChIP-chip) datasets and more comprehensive workflows for Affymetrix gene expression microarray bioinformatics and basic primer design for PCR experiments, which are often used to validate microarray results. Although functional in themselves, these workflows can be easily customized, extended, or repurposed to match the needs of specific projects and are designed to be a toolkit and starting point for specific applications. These workflows illustrate a workflow programming paradigm focusing on local resources (programs and data) and therefore are close to traditional shell scripting or R/BioConductor scripting approaches to pipeline design. Finally, we suggest that microarray data processing task workflows may provide a basis for future example-based comparison of different workflow systems. We provide a set of tools and complete workflows for microarray data analysis in the Kepler environment, which has the advantages of offering graphical, clear display of conceptual steps and parameters and the ability to easily integrate other resources such as remote data and web services.
Goodman, Corey W.; Major, Heather J.; Walls, William D.; Sheffield, Val C.; Casavant, Thomas L.; Darbro, Benjamin W.
2016-01-01
Chromosomal microarrays (CMAs) are routinely used in both research and clinical laboratories; yet, little attention has been given to the estimation of genome-wide true and false negatives during the assessment of these assays and how such information could be used to calibrate various algorithmic metrics to improve performance. Low-throughput, locus-specific methods such as fluorescence in situ hybridization (FISH), quantitative PCR (qPCR), or multiplex ligation-dependent probe amplification (MLPA) preclude rigorous calibration of various metrics used by copy number variant (CNV) detection algorithms. To aid this task, we have established a comparative methodology, CNV-ROC, which is capable of performing a high throughput, low cost, analysis of CMAs that takes into consideration genome-wide true and false negatives. CNV-ROC uses a higher resolution microarray to confirm calls from a lower resolution microarray and provides for a true measure of genome-wide performance metrics at the resolution offered by microarray testing. CNV-ROC also provides for a very precise comparison of CNV calls between two microarray platforms without the need to establish an arbitrary degree of overlap. Comparison of CNVs across microarrays is done on a per-probe basis and receiver operator characteristic (ROC) analysis is used to calibrate algorithmic metrics, such as log2 ratio threshold, to enhance CNV calling performance. CNV-ROC addresses a critical and consistently overlooked aspect of analytical assessments of genome-wide techniques like CMAs which is the measurement and use of genome-wide true and false negative data for the calculation of performance metrics and comparison of CNV profiles between different microarray experiments. PMID:25595567
MICROARRAY DATA ANALYSIS USING MULTIPLE STATISTICAL MODELS
Microarray Data Analysis Using Multiple Statistical Models
Wenjun Bao1, Judith E. Schmid1, Amber K. Goetz1, Ming Ouyang2, William J. Welsh2,Andrew I. Brooks3,4, ChiYi Chu3,Mitsunori Ogihara3,4, Yinhe Cheng5, David J. Dix1. 1National Health and Environmental Effects Researc...
ERIC Educational Resources Information Center
Reiff, Marian; Giarelli, Ellen; Bernhardt, Barbara A.; Easley, Ebony; Spinner, Nancy B.; Sankar, Pamela L.; Mulchandani, Surabhi
2015-01-01
Clinical guidelines recommend chromosomal microarray analysis (CMA) for all children with autism spectrum disorders (ASDs). We explored the test's perceived usefulness among parents of children with ASD who had undergone CMA, and received a result categorized as pathogenic, variant of uncertain significance, or negative. Fifty-seven parents…
Oligonucleotide microarrays and other ‘omics’ approaches are powerful tools for unsupervised analysis of chemical impacts on biological systems. However, the lack of well annotated biological pathways for many aquatic organisms, including fish, and the poor power of microarray-b...
Bumm, Klaus; Zheng, Mingzhong; Bailey, Clyde; Zhan, Fenghuang; Chiriva-Internati, M; Eddlemon, Paul; Terry, Julian; Barlogie, Bart; Shaughnessy, John D
2002-02-01
Clinical GeneOrganizer (CGO) is a novel windows-based archiving, organization and data mining software for the integration of gene expression profiling in clinical medicine. The program implements various user-friendly tools and extracts data for further statistical analysis. This software was written for Affymetrix GeneChip *.txt files, but can also be used for any other microarray-derived data. The MS-SQL server version acts as a data mart and links microarray data with clinical parameters of any other existing database and therefore represents a valuable tool for combining gene expression analysis and clinical disease characteristics.
cluML: A markup language for clustering and cluster validity assessment of microarray data.
Bolshakova, Nadia; Cunningham, Pádraig
2005-01-01
cluML is a new markup language for microarray data clustering and cluster validity assessment. The XML-based format has been designed to address some of the limitations observed in traditional formats, such as inability to store multiple clustering (including biclustering) and validation results within a dataset. cluML is an effective tool to support biomedical knowledge representation in gene expression data analysis. Although cluML was developed for DNA microarray analysis applications, it can be effectively used for the representation of clustering and for the validation of other biomedical and physical data that has no limitations.
Kirby, Ralph; Herron, Paul; Hoskisson, Paul
2011-02-01
Based on available genome sequences, Actinomycetales show significant gene synteny across a wide range of species and genera. In addition, many genera show varying degrees of complex morphological development. Using the presence of gene synteny as a basis, it is clear that an analysis of gene conservation across the Streptomyces and various other Actinomycetales will provide information on both the importance of genes and gene clusters and the evolution of morphogenesis in these bacteria. Genome sequencing, although becoming cheaper, is still relatively expensive for comparing large numbers of strains. Thus, a heterologous DNA/DNA microarray hybridization dataset based on a Streptomyces coelicolor microarray allows a cheaper and greater depth of analysis of gene conservation. This study, using both bioinformatical and microarray approaches, was able to classify genes previously identified as involved in morphogenesis in Streptomyces into various subgroups in terms of conservation across species and genera. This will allow the targeting of genes for further study based on their importance at the species level and at higher evolutionary levels.
Cross species analysis of microarray expression data
Lu, Yong; Huggins, Peter; Bar-Joseph, Ziv
2009-01-01
Motivation: Many biological systems operate in a similar manner across a large number of species or conditions. Cross-species analysis of sequence and interaction data is often applied to determine the function of new genes. In contrast to these static measurements, microarrays measure the dynamic, condition-specific response of complex biological systems. The recent exponential growth in microarray expression datasets allows researchers to combine expression experiments from multiple species to identify genes that are not only conserved in sequence but also operated in a similar way in the different species studied. Results: In this review we discuss the computational and technical challenges associated with these studies, the approaches that have been developed to address these challenges and the advantages of cross-species analysis of microarray data. We show how successful application of these methods lead to insights that cannot be obtained when analyzing data from a single species. We also highlight current open problems and discuss possible ways to address them. Contact: zivbj@cs.cmu.edu PMID:19357096
Yamamura, Shohei; Yamada, Eriko; Kimura, Fukiko; Miyajima, Kumiko; Shigeto, Hajime
2017-10-21
A new single-cell microarray chip was designed and developed to separate and analyze single adherent and non-adherent cancer cells. The single-cell microarray chip is made of polystyrene with over 60,000 microchambers of 10 different size patterns (31-40 µm upper diameter, 11-20 µm lower diameter). A drop of suspension of adherent carcinoma (NCI-H1650) and non-adherent leukocyte (CCRF-CEM) cells was placed onto the chip, and single-cell occupancy of NCI-H1650 and CCRF-CEM was determined to be 79% and 84%, respectively. This was achieved by controlling the chip design and surface treatment. Analysis of protein expression in single NCI-H1650 and CCRF-CEM cells was performed on the single-cell microarray chip by multi-antibody staining. Additionally, with this system, we retrieved positive single cells from the microchambers by a micromanipulator. Thus, this system demonstrates the potential for easy and accurate separation and analysis of various types of single cells.
Jain, K K
2001-02-01
Cambridge Healthtech Institute's Third Annual Conference on Lab-on-a-Chip and Microarray technology covered the latest advances in this technology and applications in life sciences. Highlights of the meetings are reported briefly with emphasis on applications in genomics, drug discovery and molecular diagnostics. There was an emphasis on microfluidics because of the wide applications in laboratory and drug discovery. The lab-on-a-chip provides the facilities of a complete laboratory in a hand-held miniature device. Several microarray systems have been used for hybridisation and detection techniques. Oligonucleotide scanning arrays provide a versatile tool for the analysis of nucleic acid interactions and provide a platform for improving the array-based methods for investigation of antisense therapeutics. A method for analysing combinatorial DNA arrays using oligonucleotide-modified gold nanoparticle probes and a conventional scanner has considerable potential in molecular diagnostics. Various applications of microarray technology for high-throughput screening in drug discovery and single nucleotide polymorphisms (SNP) analysis were discussed. Protein chips have important applications in proteomics. With the considerable amount of data generated by the different technologies using microarrays, it is obvious that the reading of the information and its interpretation and management through the use of bioinformatics is essential. Various techniques for data analysis were presented. Biochip and microarray technology has an essential role to play in the evolving trends in healthcare, which integrate diagnosis with prevention/treatment and emphasise personalised medicines.
Wu, Baolin
2006-02-15
Differential gene expression detection and sample classification using microarray data have received much research interest recently. Owing to the large number of genes p and small number of samples n (p > n), microarray data analysis poses big challenges for statistical analysis. An obvious problem owing to the 'large p small n' is over-fitting. Just by chance, we are likely to find some non-differentially expressed genes that can classify the samples very well. The idea of shrinkage is to regularize the model parameters to reduce the effects of noise and produce reliable inferences. Shrinkage has been successfully applied in the microarray data analysis. The SAM statistics proposed by Tusher et al. and the 'nearest shrunken centroid' proposed by Tibshirani et al. are ad hoc shrinkage methods. Both methods are simple, intuitive and prove to be useful in empirical studies. Recently Wu proposed the penalized t/F-statistics with shrinkage by formally using the (1) penalized linear regression models for two-class microarray data, showing good performance. In this paper we systematically discussed the use of penalized regression models for analyzing microarray data. We generalize the two-class penalized t/F-statistics proposed by Wu to multi-class microarray data. We formally derive the ad hoc shrunken centroid used by Tibshirani et al. using the (1) penalized regression models. And we show that the penalized linear regression models provide a rigorous and unified statistical framework for sample classification and differential gene expression detection.
Best practices for hybridization design in two-colour microarray analysis.
Knapen, Dries; Vergauwen, Lucia; Laukens, Kris; Blust, Ronny
2009-07-01
Two-colour microarrays are a popular platform of choice in gene expression studies. Because two different samples are hybridized on a single microarray, and several microarrays are usually needed in a given experiment, there are many possible ways to combine samples on different microarrays. The actual combination employed is commonly referred to as the 'hybridization design'. Different types of hybridization designs have been developed, all aimed at optimizing the experimental setup for the detection of differentially expressed genes while coping with technical noise. Here, we first provide an overview of the different classes of hybridization designs, discussing their advantages and limitations, and then we illustrate the current trends in the use of different hybridization design types in contemporary research.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Green, Pamela J.
The long-term goal of this research was to better understand the influence of mRNA stability on gene regulation, particularly in response to hormones and the circadian clock. The primary aim of this project was to examine this using DNA microarrays, small RNA analysis and other approaches. We accomplished these objectives, although we were only able to detect small changes in mRNA stability in response to these stimuli. However, the work also contributed to a major breakthrough allowing the identification of small RNAs on a genomic scale in eukaryotes. Moreover, the project prompted us to develop a new way to analyzemore » mRNA decay genome wide. Thus, the research was hugely successful beyond our objectives.« less
Pathway results from the chicken data set using GOTM, Pathway Studio and Ingenuity softwares
Bonnet, Agnès; Lagarrigue, Sandrine; Liaubet, Laurence; Robert-Granié, Christèle; SanCristobal, Magali; Tosser-Klopp, Gwenola
2009-01-01
Background As presented in the introduction paper, three sets of differentially regulated genes were found after the analysis of the chicken infection data set from EADGENE. Different methods were used to interpret these results. Results GOTM, Pathway Studio and Ingenuity softwares were used to investigate the three lists of genes. The three softwares allowed the analysis of the data and highlighted different networks. However, only one set of genes, showing a differential expression between primary and secondary response gave significant biological interpretation. Conclusion Combining these databases that were developed independently on different annotation sources supplies a useful tool for a global biological interpretation of microarray data, even if they may contain some imperfections (e.g. gene not or not well annotated). PMID:19615111
Jin, S J; Liu, M; Long, W J; Luo, X P
2016-12-02
Objective: To explore the clinical phenotypes and the genetic cause for a boy with unexplained growth retardation, nephrocalcinosis, auditory anomalies and multi-organ/system developmental disorders. Method: Routine G-banding and chromosome microarray analysis were applied to a child with unexplained growth retardation, nephrocalcinosis, auditory anomalies and multi-organ/system developmental disorders treated in the Department of Pediatrics of Tongji Hospital Affiliated to Tongji Medical College of Huazhong University of Science and Technology in September 2015 and his parents to conduct the chromosomal karyotype analysis and the whole genome scanning. Deleted genes were searched in the Decipher and NCBI databases, and their relationships with the clinical phenotypes were analyzed. Result: A six-month-old boy was refered to us because of unexplained growth retardation and feeding intolerance.The affected child presented with abnormal manifestation such as special face, umbilical hernia, growth retardation, hypothyroidism, congenital heart disease, right ear sensorineural deafness, hypercalcemia and nephrocalcinosis. The child's karyotype was 46, XY, 16qh + , and his parents' karyotypes were normal. Chromosome microarray analysis revealed a 1 436 kb deletion on the 7q11.23(72701098_74136633) region of the child. This region included 23 protein-coding genes, which were reported to be corresponding to Williams-Beuren syndrome and its certain clinical phenotypes. His parents' results of chromosome microarray analysis were normal. Conclusion: A boy with characteristic manifestation of Williams-Beuren syndrome and rare nephrocalcinosis was diagnosed using chromosome microarray analysis. The deletion on the 7q11.23 might be related to the clinical phenotypes of Williams-Beuren syndrome, yet further studies are needed.
AFM 4.0: a toolbox for DNA microarray analysis
Breitkreutz, Bobby-Joe; Jorgensen, Paul; Breitkreutz, Ashton; Tyers, Mike
2001-01-01
We have developed a series of programs, collectively packaged as Array File Maker 4.0 (AFM), that manipulate and manage DNA microarray data. AFM 4.0 is simple to use, applicable to any organism or microarray, and operates within the familiar confines of Microsoft Excel. Given a database of expression ratios, AFM 4.0 generates input files for clustering, helps prepare colored figures and Venn diagrams, and can uncover aneuploidy in yeast microarray data. AFM 4.0 should be especially useful to laboratories that do not have access to specialized commercial or in-house software. PMID:11532221
DOE Office of Scientific and Technical Information (OSTI.GOV)
Zhou, J.; Wu, L.; Gentry, T.
2006-04-05
To effectively monitor microbial populations involved in various important processes, a 50-mer-based oligonucleotide microarray was developed based on known genes and pathways involved in: biodegradation, metal resistance and reduction, denitrification, nitrification, nitrogen fixation, methane oxidation, methanogenesis, carbon polymer decomposition, and sulfate reduction. This array contains approximately 2000 unique and group-specific probes with <85% similarity to their non-target sequences. Based on artificial probes, our results showed that at hybridization conditions of 50 C and 50% formamide, the 50-mer microarray hybridization can differentiate sequences having <88% similarity. Specificity tests with representative pure cultures indicated that the designed probes on the arrays appearedmore » to be specific to their corresponding target genes. Detection limits were about 5-10ng genomic DNA in the absence of background DNA, and 50-100ng ({approx}1.3{sup o} 10{sup 7} cells) in the presence background DNA. Strong linear relationships between signal intensity and target DNA and RNA concentration were observed (r{sup 2} = 0.95-0.99). Application of this microarray to naphthalene-amended enrichments and soil microcosms demonstrated that composition of the microflora varied depending on incubation conditions. While the naphthalene-degrading genes from Rhodococcus-type microorganisms were dominant in enrichments, the genes involved in naphthalene degradation from Gram-negative microorganisms such as Ralstonia, Comamonas, and Burkholderia were most abundant in the soil microcosms (as well as those for polyaromatic hydrocarbon and nitrotoluene degradation). Although naphthalene degradation is widely known and studied in Pseudomonas, Pseudomonas genes were not detected in either system. Real-time PCR analysis of 4 representative genes was consistent with microarray-based quantification (r{sup 2} = 0.95). Currently, we are also applying this microarray to the study of several different microbial communities and processes at the NABIR-FRC in Oak Ridge, TN. One project involves the monitoring of the development and dynamics of the microbial community of a fluidized bed reactor (FBR) used for reducing nitrate and the other project monitors microbial community responses to stimulation of uranium reducing populations via ethanol donor additions in situ and in a model system. Additionally, we are developing novel strategies for increasing microarray hybridization sensitivity. Finally, great improvements to our methods of probe design were made by the development of a new computer program, CommOligo. CommOligo designs unique and group-specific oligo probes for whole-genomes, metagenomes, and groups of environmental sequences and uses a new global alignment algorithm to design single or multiple probes for each gene or group. We are now using this program to design a more comprehensive functional gene array for environmental studies. Overall, our results indicate that the 50mer-based microarray technology has potential as a specific and quantitative tool to reveal the composition of microbial communities and their dynamics important to processes within contaminated environments.« less
Fully Automated Complementary DNA Microarray Segmentation using a Novel Fuzzy-based Algorithm.
Saberkari, Hamidreza; Bahrami, Sheyda; Shamsi, Mousa; Amoshahy, Mohammad Javad; Ghavifekr, Habib Badri; Sedaaghi, Mohammad Hossein
2015-01-01
DNA microarray is a powerful approach to study simultaneously, the expression of 1000 of genes in a single experiment. The average value of the fluorescent intensity could be calculated in a microarray experiment. The calculated intensity values are very close in amount to the levels of expression of a particular gene. However, determining the appropriate position of every spot in microarray images is a main challenge, which leads to the accurate classification of normal and abnormal (cancer) cells. In this paper, first a preprocessing approach is performed to eliminate the noise and artifacts available in microarray cells using the nonlinear anisotropic diffusion filtering method. Then, the coordinate center of each spot is positioned utilizing the mathematical morphology operations. Finally, the position of each spot is exactly determined through applying a novel hybrid model based on the principle component analysis and the spatial fuzzy c-means clustering (SFCM) algorithm. Using a Gaussian kernel in SFCM algorithm will lead to improving the quality in complementary DNA microarray segmentation. The performance of the proposed algorithm has been evaluated on the real microarray images, which is available in Stanford Microarray Databases. Results illustrate that the accuracy of microarray cells segmentation in the proposed algorithm reaches to 100% and 98% for noiseless/noisy cells, respectively.
Dellett, Margaret; O’Hagan, Kathleen Ann; Colyer, Hilary Ann Alexandra; Mills, Ken I.
2010-01-01
Around 80% of acute myeloid leukemia (AML) patients achieve a complete remission, however many will relapse and ultimately die of their disease. The association between karyotype and prognosis has been studied extensively and identified patient cohorts as having favourable [e.g. t(8; 21), inv (16)/t(16; 16), t(15; 17)], intermediate [e.g. cytogenetically normal (NK-AML)] or adverse risk [e.g. complex karyotypes]. Previous studies have shown that gene expression profiling signatures can classify the sub-types of AML, although few reports have shown a similar feature by using methylation markers. The global methylation patterns in 19 diagnostic AML samples were investigated using the Methylated CpG Island Amplification Microarray (MCAM) method and CpG island microarrays containing 12,000 CpG sites. The first analysis, comparing favourable and intermediate cytogenetic risk groups, revealed significantly differentially methylated CpG sites (594 CpG islands) between the two subgroups. Mutations in the NPM1 gene occur at a high frequency (40%) within the NK-AML subgroup and are associated with a more favourable prognosis in these patients. A second analysis comparing the NPM1 mutant and wild-type research study subjects again identified distinct methylation profiles between these two subgroups. Network and pathway analysis revealed possible molecular mechanisms associated with the different risk and/or mutation sub-groups. This may result in a better classification of the risk groups, improved monitoring targets, or the identification of novel molecular therapies. PMID:24179384
Amirhosseini, Mehdi; Andersson, Göran; Aspenberg, Per; Fahlgren, Anna
2017-12-01
Wear debris particles released from prosthetic bearing surfaces and mechanical instability of implants are two main causes of periprosthetic osteolysis. While particle-induced loosening has been studied extensively, mechanisms through which mechanical factors lead to implant loosening have been less investigated. This study compares the transcriptional profiles associated with osteolysis in a rat model for aseptic loosening, induced by either mechanical instability or titanium particles. Rats were exposed to mechanical instability or titanium particles. After 15 min, 3, 48 or 120 h from start of the stimulation, gene expression changes in periprosthetic bone tissue was determined by microarray analysis. Microarray data were analyzed by PANTHER Gene List Analysis tool and Ingenuity Pathway Analysis (IPA). Both types of osteolytic stimulation led to gene regulation in comparison to unstimulated controls after 3, 48 or 120 h. However, when mechanical instability was compared to titanium particles, no gene showed a statistically significant difference (fold change ≥ ± 1.5 and adjusted p-value ≤ 0.05) at any time point. There was a remarkable similarity in numbers and functional classification of regulated genes. Pathway analysis showed several inflammatory pathways activated by both stimuli, including Acute Phase Response signaling, IL-6 signaling and Oncostatin M signaling. Quantitative PCR confirmed the changes in expression of key genes involved in osteolysis observed by global transcriptomics. Inflammatory mediators including interleukin (IL)-6, IL-1β, chemokine (C-C motif) ligand (CCL)2, prostaglandin-endoperoxide synthase (Ptgs)2 and leukemia inhibitory factor (LIF) showed strong upregulation, as assessed by both microarray and qPCR. By investigating genome-wide expression changes we show that, despite the different nature of mechanical implant instability and titanium particles, osteolysis seems to be induced through similar biological and signaling pathways in this rat model for aseptic loosening. Pathways associated to the innate inflammatory response appear to be a major driver for osteolysis. Our findings implicate early restriction of inflammation to be critical to prevent or mitigate osteolysis and aseptic loosening of orthopedic implants.
Large-scale analysis of gene expression using cDNA microarrays promises the
rapid detection of the mode of toxicity for drugs and other chemicals. cDNA
microarrays were used to examine chemically-induced alterations of gene
expression in HepG2 cells exposed to oxidative ...
Where statistics and molecular microarray experiments biology meet.
Kelmansky, Diana M
2013-01-01
This review chapter presents a statistical point of view to microarray experiments with the purpose of understanding the apparent contradictions that often appear in relation to their results. We give a brief introduction of molecular biology for nonspecialists. We describe microarray experiments from their construction and the biological principles the experiments rely on, to data acquisition and analysis. The role of epidemiological approaches and sample size considerations are also discussed.
Grenville-Briggs, Laura J; Stansfield, Ian
2011-01-01
This report describes a linked series of Masters-level computer practical workshops. They comprise an advanced functional genomics investigation, based upon analysis of a microarray dataset probing yeast DNA damage responses. The workshops require the students to analyse highly complex transcriptomics datasets, and were designed to stimulate active learning through experience of current research methods in bioinformatics and functional genomics. They seek to closely mimic a realistic research environment, and require the students first to propose research hypotheses, then test those hypotheses using specific sections of the microarray dataset. The complexity of the microarray data provides students with the freedom to propose their own unique hypotheses, tested using appropriate sections of the microarray data. This research latitude was highly regarded by students and is a strength of this practical. In addition, the focus on DNA damage by radiation and mutagenic chemicals allows them to place their results in a human medical context, and successfully sparks broad interest in the subject material. In evaluation, 79% of students scored the practical workshops on a five-point scale as 4 or 5 (totally effective) for student learning. More broadly, the general use of microarray data as a "student research playground" is also discussed. Copyright © 2011 Wiley Periodicals, Inc.
Brodsky, Leonid; Leontovich, Andrei; Shtutman, Michael; Feinstein, Elena
2004-01-01
Mathematical methods of analysis of microarray hybridizations deal with gene expression profiles as elementary units. However, some of these profiles do not reflect a biologically relevant transcriptional response, but rather stem from technical artifacts. Here, we describe two technically independent but rationally interconnected methods for identification of such artifactual profiles. Our diagnostics are based on detection of deviations from uniformity, which is assumed as the main underlying principle of microarray design. Method 1 is based on detection of non-uniformity of microarray distribution of printed genes that are clustered based on the similarity of their expression profiles. Method 2 is based on evaluation of the presence of gene-specific microarray spots within the slides’ areas characterized by an abnormal concentration of low/high differential expression values, which we define as ‘patterns of differentials’. Applying two novel algorithms, for nested clustering (method 1) and for pattern detection (method 2), we can make a dual estimation of the profile’s quality for almost every printed gene. Genes with artifactual profiles detected by method 1 may then be removed from further analysis. Suspicious differential expression values detected by method 2 may be either removed or weighted according to the probabilities of patterns that cover them, thus diminishing their input in any further data analysis. PMID:14999086
Brunner, C; Hoffmann, K; Thiele, T; Schedler, U; Jehle, H; Resch-Genger, U
2015-04-01
Commercial platforms consisting of ready-to-use microarrays printed with target-specific DNA probes, a microarray scanner, and software for data analysis are available for different applications in medical diagnostics and food analysis, detecting, e.g., viral and bacteriological DNA sequences. The transfer of these tools from basic research to routine analysis, their broad acceptance in regulated areas, and their use in medical practice requires suitable calibration tools for regular control of instrument performance in addition to internal assay controls. Here, we present the development of a novel assay-adapted calibration slide for a commercialized DNA-based assay platform, consisting of precisely arranged fluorescent areas of various intensities obtained by incorporating different concentrations of a "green" dye and a "red" dye in a polymer matrix. These dyes present "Cy3" and "Cy5" analogues with improved photostability, chosen based upon their spectroscopic properties closely matching those of common labels for the green and red channel of microarray scanners. This simple tool allows to efficiently and regularly assess and control the performance of the microarray scanner provided with the biochip platform and to compare different scanners. It will be eventually used as fluorescence intensity scale for referencing of assays results and to enhance the overall comparability of diagnostic tests.
Stekel, Dov J.; Sarti, Donatella; Trevino, Victor; Zhang, Lihong; Salmon, Mike; Buckley, Chris D.; Stevens, Mark; Pallen, Mark J.; Penn, Charles; Falciani, Francesco
2005-01-01
A key step in the analysis of microarray data is the selection of genes that are differentially expressed. Ideally, such experiments should be properly replicated in order to infer both technical and biological variability, and the data should be subjected to rigorous hypothesis tests to identify the differentially expressed genes. However, in microarray experiments involving the analysis of very large numbers of biological samples, replication is not always practical. Therefore, there is a need for a method to select differentially expressed genes in a rational way from insufficiently replicated data. In this paper, we describe a simple method that uses bootstrapping to generate an error model from a replicated pilot study that can be used to identify differentially expressed genes in subsequent large-scale studies on the same platform, but in which there may be no replicated arrays. The method builds a stratified error model that includes array-to-array variability, feature-to-feature variability and the dependence of error on signal intensity. We apply this model to the characterization of the host response in a model of bacterial infection of human intestinal epithelial cells. We demonstrate the effectiveness of error model based microarray experiments and propose this as a general strategy for a microarray-based screening of large collections of biological samples. PMID:15800204
Hu, Guohong; Wang, Hui-Yun; Greenawalt, Danielle M.; Azaro, Marco A.; Luo, Minjie; Tereshchenko, Irina V.; Cui, Xiangfeng; Yang, Qifeng; Gao, Richeng; Shen, Li; Li, Honghua
2006-01-01
Microarray-based analysis of single nucleotide polymorphisms (SNPs) has many applications in large-scale genetic studies. To minimize the influence of experimental variation, microarray data usually need to be processed in different aspects including background subtraction, normalization and low-signal filtering before genotype determination. Although many algorithms are sophisticated for these purposes, biases are still present. In the present paper, new algorithms for SNP microarray data analysis and the software, AccuTyping, developed based on these algorithms are described. The algorithms take advantage of a large number of SNPs included in each assay, and the fact that the top and bottom 20% of SNPs can be safely treated as homozygous after sorting based on their ratios between the signal intensities. These SNPs are then used as controls for color channel normalization and background subtraction. Genotype calls are made based on the logarithms of signal intensity ratios using two cutoff values, which were determined after training the program with a dataset of ∼160 000 genotypes and validated by non-microarray methods. AccuTyping was used to determine >300 000 genotypes of DNA and sperm samples. The accuracy was shown to be >99%. AccuTyping can be downloaded from . PMID:16982644
The statistics of identifying differentially expressed genes in Expresso and TM4: a comparison
Sioson, Allan A; Mane, Shrinivasrao P; Li, Pinghua; Sha, Wei; Heath, Lenwood S; Bohnert, Hans J; Grene, Ruth
2006-01-01
Background Analysis of DNA microarray data takes as input spot intensity measurements from scanner software and returns differential expression of genes between two conditions, together with a statistical significance assessment. This process typically consists of two steps: data normalization and identification of differentially expressed genes through statistical analysis. The Expresso microarray experiment management system implements these steps with a two-stage, log-linear ANOVA mixed model technique, tailored to individual experimental designs. The complement of tools in TM4, on the other hand, is based on a number of preset design choices that limit its flexibility. In the TM4 microarray analysis suite, normalization, filter, and analysis methods form an analysis pipeline. TM4 computes integrated intensity values (IIV) from the average intensities and spot pixel counts returned by the scanner software as input to its normalization steps. By contrast, Expresso can use either IIV data or median intensity values (MIV). Here, we compare Expresso and TM4 analysis of two experiments and assess the results against qRT-PCR data. Results The Expresso analysis using MIV data consistently identifies more genes as differentially expressed, when compared to Expresso analysis with IIV data. The typical TM4 normalization and filtering pipeline corrects systematic intensity-specific bias on a per microarray basis. Subsequent statistical analysis with Expresso or a TM4 t-test can effectively identify differentially expressed genes. The best agreement with qRT-PCR data is obtained through the use of Expresso analysis and MIV data. Conclusion The results of this research are of practical value to biologists who analyze microarray data sets. The TM4 normalization and filtering pipeline corrects microarray-specific systematic bias and complements the normalization stage in Expresso analysis. The results of Expresso using MIV data have the best agreement with qRT-PCR results. In one experiment, MIV is a better choice than IIV as input to data normalization and statistical analysis methods, as it yields as greater number of statistically significant differentially expressed genes; TM4 does not support the choice of MIV input data. Overall, the more flexible and extensive statistical models of Expresso achieve more accurate analytical results, when judged by the yardstick of qRT-PCR data, in the context of an experimental design of modest complexity. PMID:16626497
Chang, Tzu-Hao; Chen, Mien-Cheng; Chang, Jen-Ping; Huang, Hsien-Da; Ho, Wan-Chun; Lin, Yu-Sheng; Pan, Kuo-Li; Huang, Yao-Kuang; Liu, Wen-Hao; Wu, Chia-Chen
2016-01-01
Background Left atrial enlargement in mitral regurgitation (MR) predicts a poor prognosis. The regulatory mechanisms of atrial myocyte hypertrophy of MR patients remain unknown. Methods and Results This study comprised 14 patients with MR, 7 patients with aortic valve disease (AVD), and 6 purchased samples from normal subjects (NC). We used microarrays, enrichment analysis and quantitative RT-PCR to study the gene expression profiles in the left atria. Microarray results showed that 112 genes were differentially up-regulated and 132 genes were differentially down-regulated in the left atria between MR patients and NC. Enrichment analysis of differentially expressed genes demonstrated that “NFAT in cardiac hypertrophy” pathway was not only one of the significant associated canonical pathways, but also the only one predicted with a non-zero score of 1.34 (i.e. activated) through Ingenuity Pathway Analysis molecule activity predictor. Ingenuity Pathway Analysis Global Molecular Network analysis exhibited that the highest score network also showed high association with cardiac related pathways and functions. Therefore, 5 NFAT associated genes (PPP3R1, PPP3CB, CAMK1, MEF2C, PLCE1) were studies for validation. The mRNA expressions of PPP3CB and MEF2C were significantly up-regulated, and CAMK1 and PPP3R1 were significantly down-regulated in MR patients compared to NC. Moreover, MR patients had significantly increased mRNA levels of PPP3CB, MEF2C and PLCE1 compared to AVD patients. The atrial myocyte size of MR patients significantly exceeded that of the AVD patients and NC. Conclusions Differentially expressed genes in the “NFAT in cardiac hypertrophy” pathway may play a critical role in the atrial myocyte hypertrophy of MR patients. PMID:27907007
ERIC Educational Resources Information Center
McGrew, Susan G.; Peters, Brittany R.; Crittendon, Julie A.; Veenstra-VanderWeele, Jeremy
2012-01-01
Genetic testing is recommended for patients with ASD; however specific recommendations vary by specialty. American Academy of Pediatrics and American Academy of Neurology guidelines recommend G-banded karyotype and Fragile X DNA. The American College of Medical Genetics recommends Chromosomal Microarray Analysis (CMA). We determined the yield of…
ERIC Educational Resources Information Center
Grenville-Briggs, Laura J.; Stansfield, Ian
2011-01-01
This report describes a linked series of Masters-level computer practical workshops. They comprise an advanced functional genomics investigation, based upon analysis of a microarray dataset probing yeast DNA damage responses. The workshops require the students to analyse highly complex transcriptomics datasets, and were designed to stimulate…
The observation of transcriptional changes following embryonic ethanol exposure may provide significant insights into the biological response to ethanol exposure. In this study, we used microarray analysis to examine the transcriptional response of the developing limb to a dose ...
DOE Office of Scientific and Technical Information (OSTI.GOV)
Ovacik, Meric A.; Sen, Banalata; Euling, Susan Y.
Pathway activity level analysis, the approach pursued in this study, focuses on all genes that are known to be members of metabolic and signaling pathways as defined by the KEGG database. The pathway activity level analysis entails singular value decomposition (SVD) of the expression data of the genes constituting a given pathway. We explore an extension of the pathway activity methodology for application to time-course microarray data. We show that pathway analysis enhances our ability to detect biologically relevant changes in pathway activity using synthetic data. As a case study, we apply the pathway activity level formulation coupled with significancemore » analysis to microarray data from two different rat testes exposed in utero to Dibutyl Phthalate (DBP). In utero DBP exposure in the rat results in developmental toxicity of a number of male reproductive organs, including the testes. One well-characterized mode of action for DBP and the male reproductive developmental effects is the repression of expression of genes involved in cholesterol transport, steroid biosynthesis and testosterone synthesis that lead to a decreased fetal testicular testosterone. Previous analyses of DBP testes microarray data focused on either individual gene expression changes or changes in the expression of specific genes that are hypothesized, or known, to be important in testicular development and testosterone synthesis. However, a pathway analysis may inform whether there are additional affected pathways that could inform additional modes of action linked to DBP developmental toxicity. We show that Pathway activity analysis may be considered for a more comprehensive analysis of microarray data.« less
Ghosh, Somiranjan; Zang, Shizhu; Mitra, Partha S; Ghimbovschi, Svetlana; Hoffman, Eric P; Dutta, Sisir K
2011-07-01
Several reports have indicated that low level of polychlorinated biphenyl (PCB) exposure can adversely affect a multitude of physiological disorders and diseases in in vitro, in vivo, and as reported in epidemiological studies. This investigation is focused on the possible contribution of two most prevalent PCB congeners in vitro in developing toxicities. We used PCBs 138 and 153 at the human equivalence level as model agents to test their specificity in developing toxicities. We chose a global approach using oligonucleotide microarray technology to investigate modulated gene expression for biological effects, upon exposure of PCBs, followed by Ingenuity Pathway Analysis (IPA), to understand the underlying consequence in developing disease and disorders. We performed in vitro studies with human peripheral blood mononuclear cells (PBMC), where PBMC cells were exposed to respective PCBs for 48 h. Overall, our observation on gene expression indicated that PCB produces a unique signature affecting different pathways, specific for each congener. While analyzing these data through IPA, the prominent and interesting disease and disorders were neurological disease, cancer, cardiovascular disease, respiratory disease, as well as endocrine system disorders, genetic disorders, and reproductive system disease. They showed strong resemblances with in vitro, in vivo, and in the epidemiological studies. A distinct difference was observed in renal and urological diseases, organisimal injury and abnormalities, dental disease, ophthalmic disease, and psychological disorders, which are only revealed by PCB 138 exposure, but not in PCB 153. The present study emphasizes the challenges of global gene expression in vitro and was correlated with the results of exposed human population. The microarray results give a molecular mechanistic insight and functional effects, following PCB exposure. The extent of changes in genes related to several possible mode(s) of action highlights the changes in cellular functions and signaling pathways that play major roles. In addition to understanding the pathways related to mode of action for chemicals, these data could lead to the identification of genomic signatures that could be used for screening of chemicals for their potential to cause disease and developmental disorders. Copyright © 2011 Elsevier Ltd. All rights reserved.
Optimal consistency in microRNA expression analysis using reference-gene-based normalization.
Wang, Xi; Gardiner, Erin J; Cairns, Murray J
2015-05-01
Normalization of high-throughput molecular expression profiles secures differential expression analysis between samples of different phenotypes or biological conditions, and facilitates comparison between experimental batches. While the same general principles apply to microRNA (miRNA) normalization, there is mounting evidence that global shifts in their expression patterns occur in specific circumstances, which pose a challenge for normalizing miRNA expression data. As an alternative to global normalization, which has the propensity to flatten large trends, normalization against constitutively expressed reference genes presents an advantage through their relative independence. Here we investigated the performance of reference-gene-based (RGB) normalization for differential miRNA expression analysis of microarray expression data, and compared the results with other normalization methods, including: quantile, variance stabilization, robust spline, simple scaling, rank invariant, and Loess regression. The comparative analyses were executed using miRNA expression in tissue samples derived from subjects with schizophrenia and non-psychiatric controls. We proposed a consistency criterion for evaluating methods by examining the overlapping of differentially expressed miRNAs detected using different partitions of the whole data. Based on this criterion, we found that RGB normalization generally outperformed global normalization methods. Thus we recommend the application of RGB normalization for miRNA expression data sets, and believe that this will yield a more consistent and useful readout of differentially expressed miRNAs, particularly in biological conditions characterized by large shifts in miRNA expression.
Multi-omic profiling to assess the effect of iron starvation in Streptococcus pneumoniae TIGR4
Jiménez-Munguía, Irene; Calderón-Santiago, Mónica; Rodríguez-Franco, Antonio; Priego-Capote, Feliciano
2018-01-01
We applied multi-omics approaches (transcriptomics, proteomics and metabolomics) to study the effect of iron starvation on the Gram-positive human pathogen Streptococcus pneumoniae to elucidate global changes in the bacterium in a condition similar to what can be found in the host during an infectious episode. We treated the reference strain TIGR4 with the iron chelator deferoxamine mesylate. DNA microarrays revealed changes in the expression of operons involved in multiple biological processes, with a prevalence of genes coding for ion binding proteins. We also studied the changes in protein abundance by 2-DE followed by MALDI-TOF/TOF analysis of total cell extracts and secretome fractions. The main proteomic changes were found in proteins related to the primary and amino sugar metabolism, especially in enzymes with divalent cations as cofactors. Finally, the metabolomic analysis of intracellular metabolites showed altered levels of amino sugars involved in the cell wall peptidoglycan metabolism. This work shows the utility of multi-perspective studies that can provide complementary results for the comprehension of how a given condition can influence global physiological changes in microorganisms.
Kanika, Nirmala; Chang, Jinsook; Tong, Yuehong; Tiplitsky, Scott; Lin, Juan; Yohannes, Elizabeth; Tar, Moses; Chance, Mark; Christ, George J.; Melman, Arnold; Davies, Kelvin
2010-01-01
Objectives To investigate the role that oxidative stress plays in the development of diabetic cystopathy. Materials and methods Comparative gene expression in the bladder of non-diabetic and streptozotocin (STZ)-induced 2-month-old diabetic rats was carried out using microarray analysis. Evidence of oxidative stress was investigated in the bladder by analyzing glutathione S-transferase activity, lipid peroxidation, and carbonylation and nitrosylation of proteins. The activity of protein degradation pathways was assessed using western blot analysis. Results Analysis of global gene expression showed that detrusor smooth muscle tissue of STZ-induced diabetes undergoes significant enrichment in targets involved in the production or regulation of reactive oxygen species (P = 1.27 × 10−10). The microarray analysis was confirmed by showing that markers of oxidative stress were all significantly increased in the diabetic bladder. It was hypothesized that the sequelae to oxidative stress would be increased protein damage and apoptosis. This was confirmed by showing that two key proteins involved in protein degradation (Nedd4 and LC3B) were greatly up-regulated in diabetic bladders compared to controls by 12.2 ± 0.76 and 4.4 ± 1.0-fold, respectively, and the apoptosis inducing protein, BAX, was up-regulated by 6.76 ± 0.76-fold. Conclusions Overall, the findings obtained in the present study add to the growing body of evidence showing that diabetic cystopathy is associated with oxidative damage of smooth muscle cells, and results in protein damage and activation of apoptotic pathways that may contribute to a deterioration in bladder function. PMID:21518418
Goodman, Corey W; Major, Heather J; Walls, William D; Sheffield, Val C; Casavant, Thomas L; Darbro, Benjamin W
2015-04-01
Chromosomal microarrays (CMAs) are routinely used in both research and clinical laboratories; yet, little attention has been given to the estimation of genome-wide true and false negatives during the assessment of these assays and how such information could be used to calibrate various algorithmic metrics to improve performance. Low-throughput, locus-specific methods such as fluorescence in situ hybridization (FISH), quantitative PCR (qPCR), or multiplex ligation-dependent probe amplification (MLPA) preclude rigorous calibration of various metrics used by copy number variant (CNV) detection algorithms. To aid this task, we have established a comparative methodology, CNV-ROC, which is capable of performing a high throughput, low cost, analysis of CMAs that takes into consideration genome-wide true and false negatives. CNV-ROC uses a higher resolution microarray to confirm calls from a lower resolution microarray and provides for a true measure of genome-wide performance metrics at the resolution offered by microarray testing. CNV-ROC also provides for a very precise comparison of CNV calls between two microarray platforms without the need to establish an arbitrary degree of overlap. Comparison of CNVs across microarrays is done on a per-probe basis and receiver operator characteristic (ROC) analysis is used to calibrate algorithmic metrics, such as log2 ratio threshold, to enhance CNV calling performance. CNV-ROC addresses a critical and consistently overlooked aspect of analytical assessments of genome-wide techniques like CMAs which is the measurement and use of genome-wide true and false negative data for the calculation of performance metrics and comparison of CNV profiles between different microarray experiments. Copyright © 2015 Elsevier Inc. All rights reserved.
Johnson, Keven R; Nicodemus-Johnson, Jessie; Spindler, Mathew J; Carnegie, Graeme K
2015-01-01
In the heart, scaffolding proteins such as A-Kinase Anchoring Proteins (AKAPs) play a crucial role in normal cellular function by serving as a signaling hub for multiple protein kinases including protein kinase D1 (PKD1). Under cardiac hypertrophic conditions AKAP13 anchored PKD1 activates the transcription factor MEF2 leading to subsequent fetal gene activation and hypertrophic response. We used an expression microarray to identify the global transcriptional response in the hearts of wild-type mice expressing the native form of AKAP13 compared to a gene-trap mouse model expressing a truncated form of AKAP13 that is unable to bind PKD1 (AKAP13-ΔPKD1). Microarray analysis showed that AKAP13-ΔPKD1 mice broadly failed to exhibit the transcriptional profile normally associated with compensatory cardiac hypertrophy following trans-aortic constriction (TAC). The identified differentially expressed genes in WT and AKAP13-ΔPKD1 hearts are vital for the compensatory hypertrophic response to pressure-overload and include myofilament, apoptotic, and cell growth/differentiation genes in addition to genes not previously identified as affected by AKAP13-anchored PKD1. Our results show that AKAP13-PKD1 signaling is critical for transcriptional regulation of key contractile, cell death, and metabolic pathways during the development of compensatory hypertrophy in vivo.
Johnson, Keven R.; Nicodemus-Johnson, Jessie; Spindler, Mathew J.
2015-01-01
In the heart, scaffolding proteins such as A-Kinase Anchoring Proteins (AKAPs) play a crucial role in normal cellular function by serving as a signaling hub for multiple protein kinases including protein kinase D1 (PKD1). Under cardiac hypertrophic conditions AKAP13 anchored PKD1 activates the transcription factor MEF2 leading to subsequent fetal gene activation and hypertrophic response. We used an expression microarray to identify the global transcriptional response in the hearts of wild-type mice expressing the native form of AKAP13 compared to a gene-trap mouse model expressing a truncated form of AKAP13 that is unable to bind PKD1 (AKAP13-ΔPKD1). Microarray analysis showed that AKAP13-ΔPKD1 mice broadly failed to exhibit the transcriptional profile normally associated with compensatory cardiac hypertrophy following trans-aortic constriction (TAC). The identified differentially expressed genes in WT and AKAP13-ΔPKD1 hearts are vital for the compensatory hypertrophic response to pressure-overload and include myofilament, apoptotic, and cell growth/differentiation genes in addition to genes not previously identified as affected by AKAP13-anchored PKD1. Our results show that AKAP13-PKD1 signaling is critical for transcriptional regulation of key contractile, cell death, and metabolic pathways during the development of compensatory hypertrophy in vivo. PMID:26192751
Mining microarrays for metabolic meaning: nutritional regulation of hypothalamic gene expression.
Mobbs, Charles V; Yen, Kelvin; Mastaitis, Jason; Nguyen, Ha; Watson, Elizabeth; Wurmbach, Elisa; Sealfon, Stuart C; Brooks, Andrew; Salton, Stephen R J
2004-06-01
DNA microarray analysis has been used to investigate relative changes in the level of gene expression in the CNS, including changes that are associated with disease, injury, psychiatric disorders, drug exposure or withdrawal, and memory formation. We have used oligonucleotide microarrays to identify hypothalamic genes that respond to nutritional manipulation. In addition to commonly used microarray analysis based on criteria such as fold-regulation, we have also found that simply carrying out multiple t tests then sorting by P value constitutes a highly reliable method to detect true regulation, as assessed by real-time polymerase chain reaction (PCR), even for relatively low abundance genes or relatively low magnitude of regulation. Such analyses directly suggested novel mechanisms that mediate effects of nutritional state on neuroendocrine function and are being used to identify regulated gene products that may elucidate the metabolic pathology of obese ob/ob, lean Vgf-/Vgf-, and other models with profound metabolic impairments.
Cardiac mesenchymal progenitors from postmortem cardiac tissues retained cellular characterization.
Kami, D; Kitani, T; Nakata, M; Gojo, S
2014-05-01
Currently, cells for transplantation in regenerative medicine are derived from either autologous or allogeneic tissue. The former has the drawbacks that the quality of donor cells may depend on the condition of the patient, while the quantity of the cells may also be limited. To solve these problems, we investigated the potential of allogeneic cardiac mesenchymal progenitors (CMPs) derived from postmortem hearts, which may be immunologically privileged similar to bone marrow-derived mesenchymal progenitors. We examined whether viable CMPs could be isolated from C57/B6 murine cardiac tissues harvested at 24 hours postmortem. After 2- to 3-week propagation with a high dose of basic fibroblast growth factor, we performed cellular characteristics analyses, which included proliferation and differentiation property flow cytometry and microarray analyses. Postmortem CMPs had a longer lag phase after seeding than CMPs obtained from living tissues, but otherwise had similar characteristics in all the analyses. In addition, global gene expression analysis by microarray showed that cells derived from postmortem and living tissues had similar characteristics. These results indicate that allogeneic postmortem CMPs have potential for cell transplantation because they circumvent the issue of both the quality and quantity of donor cells. Copyright © 2014 Elsevier Inc. All rights reserved.
Parallel human genome analysis: microarray-based expression monitoring of 1000 genes.
Schena, M; Shalon, D; Heller, R; Chai, A; Brown, P O; Davis, R W
1996-01-01
Microarrays containing 1046 human cDNAs of unknown sequence were printed on glass with high-speed robotics. These 1.0-cm2 DNA "chips" were used to quantitatively monitor differential expression of the cognate human genes using a highly sensitive two-color hybridization assay. Array elements that displayed differential expression patterns under given experimental conditions were characterized by sequencing. The identification of known and novel heat shock and phorbol ester-regulated genes in human T cells demonstrates the sensitivity of the assay. Parallel gene analysis with microarrays provides a rapid and efficient method for large-scale human gene discovery. Images Fig. 1 Fig. 2 Fig. 3 PMID:8855227
Chockalingam, Sriram; Aluru, Maneesha; Aluru, Srinivas
2016-09-19
Pre-processing of microarray data is a well-studied problem. Furthermore, all popular platforms come with their own recommended best practices for differential analysis of genes. However, for genome-scale network inference using microarray data collected from large public repositories, these methods filter out a considerable number of genes. This is primarily due to the effects of aggregating a diverse array of experiments with different technical and biological scenarios. Here we introduce a pre-processing pipeline suitable for inferring genome-scale gene networks from large microarray datasets. We show that partitioning of the available microarray datasets according to biological relevance into tissue- and process-specific categories significantly extends the limits of downstream network construction. We demonstrate the effectiveness of our pre-processing pipeline by inferring genome-scale networks for the model plant Arabidopsis thaliana using two different construction methods and a collection of 11,760 Affymetrix ATH1 microarray chips. Our pre-processing pipeline and the datasets used in this paper are made available at http://alurulab.cc.gatech.edu/microarray-pp.
Bikel, Shirley; Jacobo-Albavera, Leonor; Sánchez-Muñoz, Fausto; Cornejo-Granados, Fernanda; Canizales-Quinteros, Samuel; Soberón, Xavier; Sotelo-Mundo, Rogerio R; Del Río-Navarro, Blanca E; Mendoza-Vargas, Alfredo; Sánchez, Filiberto; Ochoa-Leyva, Adrian
2017-01-01
In spite of the emergence of RNA sequencing (RNA-seq), microarrays remain in widespread use for gene expression analysis in the clinic. There are over 767,000 RNA microarrays from human samples in public repositories, which are an invaluable resource for biomedical research and personalized medicine. The absolute gene expression analysis allows the transcriptome profiling of all expressed genes under a specific biological condition without the need of a reference sample. However, the background fluorescence represents a challenge to determine the absolute gene expression in microarrays. Given that the Y chromosome is absent in female subjects, we used it as a new approach for absolute gene expression analysis in which the fluorescence of the Y chromosome genes of female subjects was used as the background fluorescence for all the probes in the microarray. This fluorescence was used to establish an absolute gene expression threshold, allowing the differentiation between expressed and non-expressed genes in microarrays. We extracted the RNA from 16 children leukocyte samples (nine males and seven females, ages 6-10 years). An Affymetrix Gene Chip Human Gene 1.0 ST Array was carried out for each sample and the fluorescence of 124 genes of the Y chromosome was used to calculate the absolute gene expression threshold. After that, several expressed and non-expressed genes according to our absolute gene expression threshold were compared against the expression obtained using real-time quantitative polymerase chain reaction (RT-qPCR). From the 124 genes of the Y chromosome, three genes (DDX3Y, TXLNG2P and EIF1AY) that displayed significant differences between sexes were used to calculate the absolute gene expression threshold. Using this threshold, we selected 13 expressed and non-expressed genes and confirmed their expression level by RT-qPCR. Then, we selected the top 5% most expressed genes and found that several KEGG pathways were significantly enriched. Interestingly, these pathways were related to the typical functions of leukocytes cells, such as antigen processing and presentation and natural killer cell mediated cytotoxicity. We also applied this method to obtain the absolute gene expression threshold in already published microarray data of liver cells, where the top 5% expressed genes showed an enrichment of typical KEGG pathways for liver cells. Our results suggest that the three selected genes of the Y chromosome can be used to calculate an absolute gene expression threshold, allowing a transcriptome profiling of microarray data without the need of an additional reference experiment. Our approach based on the establishment of a threshold for absolute gene expression analysis will allow a new way to analyze thousands of microarrays from public databases. This allows the study of different human diseases without the need of having additional samples for relative expression experiments.
Wilson, James W.; Ramamurthy, Rajee; Porwollik, Steffen; McClelland, Michael; Hammond, Timothy; Allen, Pat; Ott, C. Mark; Pierson, Duane L.; Nickerson, Cheryl A.
2002-01-01
The low-shear environment of optimized rotation suspension culture allows both eukaryotic and prokaryotic cells to assume physiologically relevant phenotypes that have led to significant advances in fundamental investigations of medical and biological importance. This culture environment has also been used to model microgravity for ground-based studies regarding the impact of space flight on eukaryotic and prokaryotic physiology. We have previously demonstrated that low-shear modeled microgravity (LSMMG) under optimized rotation suspension culture is a novel environmental signal that regulates the virulence, stress resistance, and protein expression levels of Salmonella enterica serovar Typhimurium. However, the mechanisms used by the cells of any species, including Salmonella, to sense and respond to LSMMG and identities of the genes involved are unknown. In this study, we used DNA microarrays to elucidate the global transcriptional response of Salmonella to LSMMG. When compared with identical growth conditions under normal gravity (1 × g), LSMMG differentially regulated the expression of 163 genes distributed throughout the chromosome, representing functionally diverse groups including transcriptional regulators, virulence factors, lipopolysaccharide biosynthetic enzymes, iron-utilization enzymes, and proteins of unknown function. Many of the LSMMG-regulated genes were organized in clusters or operons. The microarray results were further validated by RT-PCR and phenotypic analyses, and they indicate that the ferric uptake regulator is involved in the LSMMG response. The results provide important insight about the Salmonella LSMMG response and could provide clues for the functioning of known Salmonella virulence systems or the identification of uncharacterized bacterial virulence strategies. PMID:12370447
Chen, Jie; Fu, Ziyi; Ji, Chenbo; Gu, Pingqing; Xu, Pengfei; Yu, Ningzhu; Kan, Yansheng; Wu, Xiaowei; Shen, Rong; Shen, Yan
2015-05-01
The human uterine cervix carcinoma is one of the most well-known malignancy reproductive system cancers, which threatens women health globally. However, the mechanisms of the oncogenesis and development process of cervix carcinoma are not yet fully understood. Long non-coding RNAs (lncRNAs) have been proved to play key roles in various biological processes, especially development of cancer. The function and mechanism of lncRNAs on cervix carcinoma is still rarely reported. We selected 3 cervix cancer and normal cervix tissues separately, then performed lncRNA microarray to detect the differentially expressed lncRNAs. Subsequently, we explored the potential function of these dysregulated lncRNAs through online bioinformatics databases. Finally, quantity real-time PCR was carried out to confirm the expression levels of these dysregulated lncRNAs in cervix cancer and normal tissues. We uncovered the profiles of differentially expressed lncRNAs between normal and cervix carcinoma tissues by using the microarray techniques, and found 1622 upregulated and 3026 downregulated lncRNAs (fold-change>2.0) in cervix carcinoma compared to the normal cervical tissue. Furthermore, we found HOXA11-AS might participate in cervix carcinogenesis by regulating HOXA11, which is involved in regulating biological processes of cervix cancer. This study afforded expression profiles of lncRNAs between cervix carcinoma tissue and normal cervical tissue, which could provide database for further research about the function and mechanism of key-lncRNAs in cervix carcinoma, and might be helpful to explore potential diagnosis factors and therapeutic targets for cervix carcinoma. Copyright © 2015 Elsevier Masson SAS. All rights reserved.
2011-01-01
Background Global transcriptional analysis of loblolly pine (Pinus taeda L.) is challenging due to limited molecular tools. PtGen2, a 26,496 feature cDNA microarray, was fabricated and used to assess drought-induced gene expression in loblolly pine propagule roots. Statistical analysis of differential expression and weighted gene correlation network analysis were used to identify drought-responsive genes and further characterize the molecular basis of drought tolerance in loblolly pine. Results Microarrays were used to interrogate root cDNA populations obtained from 12 genotype × treatment combinations (four genotypes, three watering regimes). Comparison of drought-stressed roots with roots from the control treatment identified 2445 genes displaying at least a 1.5-fold expression difference (false discovery rate = 0.01). Genes commonly associated with drought response in pine and other plant species, as well as a number of abiotic and biotic stress-related genes, were up-regulated in drought-stressed roots. Only 76 genes were identified as differentially expressed in drought-recovered roots, indicating that the transcript population can return to the pre-drought state within 48 hours. Gene correlation analysis predicts a scale-free network topology and identifies eleven co-expression modules that ranged in size from 34 to 938 members. Network topological parameters identified a number of central nodes (hubs) including those with significant homology (E-values ≤ 2 × 10-30) to 9-cis-epoxycarotenoid dioxygenase, zeatin O-glucosyltransferase, and ABA-responsive protein. Identified hubs also include genes that have been associated previously with osmotic stress, phytohormones, enzymes that detoxify reactive oxygen species, and several genes of unknown function. Conclusion PtGen2 was used to evaluate transcriptome responses in loblolly pine and was leveraged to identify 2445 differentially expressed genes responding to severe drought stress in roots. Many of the genes identified are known to be up-regulated in response to osmotic stress in pine and other plant species and encode proteins involved in both signal transduction and stress tolerance. Gene expression levels returned to control values within a 48-hour recovery period in all but 76 transcripts. Correlation network analysis indicates a scale-free network topology for the pine root transcriptome and identifies central nodes that may serve as drivers of drought-responsive transcriptome dynamics in the roots of loblolly pine. PMID:21609476
Kumar, Amod; Gaur, Gyanendra Kumar; Gandham, Ravi Kumar; Panigrahi, Manjit; Ghosh, Shrikant; Saravanan, B C; Bhushan, Bharat; Tiwari, Ashok Kumar; Sulabh, Sourabh; Priya, Bhuvana; V N, Muhasin Asaf; Gupta, Jay Prakash; Wani, Sajad Ahmad; Sahu, Amit Ranjan; Sahoo, Aditya Prasad
2017-01-01
Bovine tropical theileriosis is an important haemoprotozoan disease associated with high rates of morbidity and mortality particularly in exotic and crossbred cattle. It is one of the major constraints of the livestock development programmes in India and Southeast Asia. Indigenous cattle (Bos indicus) are reported to be comparatively less affected than exotic and crossbred cattle. However, genetic basis of resistance to tropical theileriosis in indigenous cattle is not well documented. Recent studies incited an idea that differentially expressed genes in exotic and indigenous cattle play significant role in breed specific resistance to tropical theileriosis. The present study was designed to determine the global gene expression profile in peripheral blood mononuclear cells derived from indigenous (Tharparkar) and cross-bred cattle following in vitro infection of T. annulata (Parbhani strain). Two separate microarray experiments were carried out each for cross-bred and Tharparkar cattle. The cross-bred cattle showed 1082 differentially expressed genes (DEGs). Out of total DEGs, 597 genes were down-regulated and 485 were up-regulated. Their fold change varied from 2283.93 to -4816.02. Tharparkar cattle showed 875 differentially expressed genes including 451 down-regulated and 424 up-regulated. The fold change varied from 94.93 to -19.20. A subset of genes was validated by qRT-PCR and results were correlated well with microarray data indicating that microarray results provided an accurate report of transcript level. Functional annotation study of DEGs confirmed their involvement in various pathways including response to oxidative stress, immune system regulation, cell proliferation, cytoskeletal changes, kinases activity and apoptosis. Gene network analysis of these DEGs plays an important role to understand the interaction among genes. It is therefore, hypothesized that the different susceptibility to tropical theileriosis exhibited by indigenous and crossbred cattle is due to breed-specific differences in the dealing of infected cells with other immune cells, which ultimately influence the immune response responded against T. annulata infection. Copyright © 2016 Elsevier B.V. All rights reserved.
Host Transcriptional Response to Ebola Virus Infection
Speranza, Emily; Connor, John H
2017-01-01
Ebola virus disease (EVD) is a serious illness that causes severe disease in humans and non-human primates (NHPs) and has mortality rates up to 90%. EVD is caused by the Ebolavirus and currently there are no licensed therapeutics or vaccines to treat EVD. Due to its high mortality rates and potential as a bioterrorist weapon, a better understanding of the disease is of high priority. Multiparametric analysis techniques allow for a more complete understanding of a disease and the host response. Analysis of RNA species present in a sample can lead to a greater understanding of activation or suppression of different states of the immune response. Transcriptomic analyses such as microarrays and RNA-Sequencing (RNA-Seq) have been important tools to better understand the global gene expression response to EVD. In this review, we outline the current knowledge gained by transcriptomic analysis of EVD. PMID:28930167
Chipster: user-friendly analysis software for microarray and other high-throughput data.
Kallio, M Aleksi; Tuimala, Jarno T; Hupponen, Taavi; Klemelä, Petri; Gentile, Massimiliano; Scheinin, Ilari; Koski, Mikko; Käki, Janne; Korpelainen, Eija I
2011-10-14
The growth of high-throughput technologies such as microarrays and next generation sequencing has been accompanied by active research in data analysis methodology, producing new analysis methods at a rapid pace. While most of the newly developed methods are freely available, their use requires substantial computational skills. In order to enable non-programming biologists to benefit from the method development in a timely manner, we have created the Chipster software. Chipster (http://chipster.csc.fi/) brings a powerful collection of data analysis methods within the reach of bioscientists via its intuitive graphical user interface. Users can analyze and integrate different data types such as gene expression, miRNA and aCGH. The analysis functionality is complemented with rich interactive visualizations, allowing users to select datapoints and create new gene lists based on these selections. Importantly, users can save the performed analysis steps as reusable, automatic workflows, which can also be shared with other users. Being a versatile and easily extendable platform, Chipster can be used for microarray, proteomics and sequencing data. In this article we describe its comprehensive collection of analysis and visualization tools for microarray data using three case studies. Chipster is a user-friendly analysis software for high-throughput data. Its intuitive graphical user interface enables biologists to access a powerful collection of data analysis and integration tools, and to visualize data interactively. Users can collaborate by sharing analysis sessions and workflows. Chipster is open source, and the server installation package is freely available.
Chipster: user-friendly analysis software for microarray and other high-throughput data
2011-01-01
Background The growth of high-throughput technologies such as microarrays and next generation sequencing has been accompanied by active research in data analysis methodology, producing new analysis methods at a rapid pace. While most of the newly developed methods are freely available, their use requires substantial computational skills. In order to enable non-programming biologists to benefit from the method development in a timely manner, we have created the Chipster software. Results Chipster (http://chipster.csc.fi/) brings a powerful collection of data analysis methods within the reach of bioscientists via its intuitive graphical user interface. Users can analyze and integrate different data types such as gene expression, miRNA and aCGH. The analysis functionality is complemented with rich interactive visualizations, allowing users to select datapoints and create new gene lists based on these selections. Importantly, users can save the performed analysis steps as reusable, automatic workflows, which can also be shared with other users. Being a versatile and easily extendable platform, Chipster can be used for microarray, proteomics and sequencing data. In this article we describe its comprehensive collection of analysis and visualization tools for microarray data using three case studies. Conclusions Chipster is a user-friendly analysis software for high-throughput data. Its intuitive graphical user interface enables biologists to access a powerful collection of data analysis and integration tools, and to visualize data interactively. Users can collaborate by sharing analysis sessions and workflows. Chipster is open source, and the server installation package is freely available. PMID:21999641
Smoot, L M; Smoot, J C; Graham, M R; Somerville, G A; Sturdevant, D E; Migliaccio, C A; Sylva, G L; Musser, J M
2001-08-28
Pathogens are exposed to different temperatures during an infection cycle and must regulate gene expression accordingly. However, the extent to which virulent bacteria alter gene expression in response to temperatures encountered in the host is unknown. Group A Streptococcus (GAS) is a human-specific pathogen that is responsible for illnesses ranging from superficial skin infections and pharyngitis to severe invasive infections such as necrotizing fasciitis and streptococcal toxic shock syndrome. GAS survives and multiplies at different temperatures during human infection. DNA microarray analysis was used to investigate the influence of temperature on global gene expression in a serotype M1 strain grown to exponential phase at 29 degrees C and 37 degrees C. Approximately 9% of genes were differentially expressed by at least 1.5-fold at 29 degrees C relative to 37 degrees C, including genes encoding transporter proteins, proteins involved in iron homeostasis, transcriptional regulators, phage-associated proteins, and proteins with no known homologue. Relatively few known virulence genes were differentially expressed at this threshold. However, transcription of 28 genes encoding proteins with predicted secretion signal sequences was altered, indicating that growth temperature substantially influences the extracellular proteome. TaqMan real-time reverse transcription-PCR assays confirmed the microarray data. We also discovered that transcription of genes encoding hemolysins, and proteins with inferred roles in iron regulation, transport, and homeostasis, was influenced by growth at 40 degrees C. Thus, GAS profoundly alters gene expression in response to temperature. The data delineate the spectrum of temperature-regulated gene expression in an important human pathogen and provide many unforeseen lines of pathogenesis investigation.
A database for the analysis of immunity genes in Drosophila: PADMA database.
Lee, Mark J; Mondal, Ariful; Small, Chiyedza; Paddibhatla, Indira; Kawaguchi, Akira; Govind, Shubha
2011-01-01
While microarray experiments generate voluminous data, discerning trends that support an existing or alternative paradigm is challenging. To synergize hypothesis building and testing, we designed the Pathogen Associated Drosophila MicroArray (PADMA) database for easy retrieval and comparison of microarray results from immunity-related experiments (www.padmadatabase.org). PADMA also allows biologists to upload their microarray-results and compare it with datasets housed within PADMA. We tested PADMA using a preliminary dataset from Ganaspis xanthopoda-infected fly larvae, and uncovered unexpected trends in gene expression, reshaping our hypothesis. Thus, the PADMA database will be a useful resource to fly researchers to evaluate, revise, and refine hypotheses.
ERIC Educational Resources Information Center
Tra, Yolande V.; Evans, Irene M.
2010-01-01
"BIO2010" put forth the goal of improving the mathematical educational background of biology students. The analysis and interpretation of microarray high-dimensional data can be very challenging and is best done by a statistician and a biologist working and teaching in a collaborative manner. We set up such a collaboration and designed a course on…
ERIC Educational Resources Information Center
Al-Mamari, Watfa; Al-Saegh, Abeer; Al-Kindy, Adila; Bruwer, Zandre; Al-Murshedi, Fathiya; Al-Thihli, Khalid
2015-01-01
Autism Spectrum Disorders are a complicated group of disorders characterized with heterogeneous genetic etiologies. The genetic investigations for this group of disorders have expanded considerably over the past decade. In our study we designed a tired approach and studied the diagnostic yield of chromosomal microarray analysis on patients…
Immunological Targeting of Tumor Initiating Prostate Cancer Cells
2014-10-01
clinically using well-accepted immuno-competent animal models. 2) Keywords: Prostate Cancer, Lymphocyte, Vaccine, Antibody 3) Overall Project Summary...castrate animals . Task 1: Identify and verify antigenic targets from CAstrate Resistant Luminal Epithelial Cells (CRLEC) (months 1-16... animals per group will be processed to derive sufficient RNA for microarray analysis; the experiment will be repeated x 3. Microarray analysis will
MiMiR – an integrated platform for microarray data sharing, mining and analysis
Tomlinson, Chris; Thimma, Manjula; Alexandrakis, Stelios; Castillo, Tito; Dennis, Jayne L; Brooks, Anthony; Bradley, Thomas; Turnbull, Carly; Blaveri, Ekaterini; Barton, Geraint; Chiba, Norie; Maratou, Klio; Soutter, Pat; Aitman, Tim; Game, Laurence
2008-01-01
Background Despite considerable efforts within the microarray community for standardising data format, content and description, microarray technologies present major challenges in managing, sharing, analysing and re-using the large amount of data generated locally or internationally. Additionally, it is recognised that inconsistent and low quality experimental annotation in public data repositories significantly compromises the re-use of microarray data for meta-analysis. MiMiR, the Microarray data Mining Resource was designed to tackle some of these limitations and challenges. Here we present new software components and enhancements to the original infrastructure that increase accessibility, utility and opportunities for large scale mining of experimental and clinical data. Results A user friendly Online Annotation Tool allows researchers to submit detailed experimental information via the web at the time of data generation rather than at the time of publication. This ensures the easy access and high accuracy of meta-data collected. Experiments are programmatically built in the MiMiR database from the submitted information and details are systematically curated and further annotated by a team of trained annotators using a new Curation and Annotation Tool. Clinical information can be annotated and coded with a clinical Data Mapping Tool within an appropriate ethical framework. Users can visualise experimental annotation, assess data quality, download and share data via a web-based experiment browser called MiMiR Online. All requests to access data in MiMiR are routed through a sophisticated middleware security layer thereby allowing secure data access and sharing amongst MiMiR registered users prior to publication. Data in MiMiR can be mined and analysed using the integrated EMAAS open source analysis web portal or via export of data and meta-data into Rosetta Resolver data analysis package. Conclusion The new MiMiR suite of software enables systematic and effective capture of extensive experimental and clinical information with the highest MIAME score, and secure data sharing prior to publication. MiMiR currently contains more than 150 experiments corresponding to over 3000 hybridisations and supports the Microarray Centre's large microarray user community and two international consortia. The MiMiR flexible and scalable hardware and software architecture enables secure warehousing of thousands of datasets, including clinical studies, from microarray and potentially other -omics technologies. PMID:18801157
MiMiR--an integrated platform for microarray data sharing, mining and analysis.
Tomlinson, Chris; Thimma, Manjula; Alexandrakis, Stelios; Castillo, Tito; Dennis, Jayne L; Brooks, Anthony; Bradley, Thomas; Turnbull, Carly; Blaveri, Ekaterini; Barton, Geraint; Chiba, Norie; Maratou, Klio; Soutter, Pat; Aitman, Tim; Game, Laurence
2008-09-18
Despite considerable efforts within the microarray community for standardising data format, content and description, microarray technologies present major challenges in managing, sharing, analysing and re-using the large amount of data generated locally or internationally. Additionally, it is recognised that inconsistent and low quality experimental annotation in public data repositories significantly compromises the re-use of microarray data for meta-analysis. MiMiR, the Microarray data Mining Resource was designed to tackle some of these limitations and challenges. Here we present new software components and enhancements to the original infrastructure that increase accessibility, utility and opportunities for large scale mining of experimental and clinical data. A user friendly Online Annotation Tool allows researchers to submit detailed experimental information via the web at the time of data generation rather than at the time of publication. This ensures the easy access and high accuracy of meta-data collected. Experiments are programmatically built in the MiMiR database from the submitted information and details are systematically curated and further annotated by a team of trained annotators using a new Curation and Annotation Tool. Clinical information can be annotated and coded with a clinical Data Mapping Tool within an appropriate ethical framework. Users can visualise experimental annotation, assess data quality, download and share data via a web-based experiment browser called MiMiR Online. All requests to access data in MiMiR are routed through a sophisticated middleware security layer thereby allowing secure data access and sharing amongst MiMiR registered users prior to publication. Data in MiMiR can be mined and analysed using the integrated EMAAS open source analysis web portal or via export of data and meta-data into Rosetta Resolver data analysis package. The new MiMiR suite of software enables systematic and effective capture of extensive experimental and clinical information with the highest MIAME score, and secure data sharing prior to publication. MiMiR currently contains more than 150 experiments corresponding to over 3000 hybridisations and supports the Microarray Centre's large microarray user community and two international consortia. The MiMiR flexible and scalable hardware and software architecture enables secure warehousing of thousands of datasets, including clinical studies, from microarray and potentially other -omics technologies.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Andersen, G.L.; He, Z.; DeSantis, T.Z.
Microarrays have proven to be a useful and high-throughput method to provide targeted DNA sequence information for up to many thousands of specific genetic regions in a single test. A microarray consists of multiple DNA oligonucleotide probes that, under high stringency conditions, hybridize only to specific complementary nucleic acid sequences (targets). A fluorescent signal indicates the presence and, in many cases, the abundance of genetic regions of interest. In this chapter we will look at how microarrays are used in microbial ecology, especially with the recent increase in microbial community DNA sequence data. Of particular interest to microbial ecologists, phylogeneticmore » microarrays are used for the analysis of phylotypes in a community and functional gene arrays are used for the analysis of functional genes, and, by inference, phylotypes in environmental samples. A phylogenetic microarray that has been developed by the Andersen laboratory, the PhyloChip, will be discussed as an example of a microarray that targets the known diversity within the 16S rRNA gene to determine microbial community composition. Using multiple, confirmatory probes to increase the confidence of detection and a mismatch probe for every perfect match probe to minimize the effect of cross-hybridization by non-target regions, the PhyloChip is able to simultaneously identify any of thousands of taxa present in an environmental sample. The PhyloChip is shown to reveal greater diversity within a community than rRNA gene sequencing due to the placement of the entire gene product on the microarray compared with the analysis of up to thousands of individual molecules by traditional sequencing methods. A functional gene array that has been developed by the Zhou laboratory, the GeoChip, will be discussed as an example of a microarray that dynamically identifies functional activities of multiple members within a community. The recent version of GeoChip contains more than 24,000 50mer oligonucleotide probes and covers more than 10,000 gene sequences in 150 gene categories involved in carbon, nitrogen, sulfur, and phosphorus cycling, metal resistance and reduction, and organic contaminant degradation. GeoChip can be used as a generic tool for microbial community analysis, and also link microbial community structure to ecosystem functioning. Examples of the application of both arrays in different environmental samples will be described in the two subsequent sections.« less
Gillet, Jean-Pierre; Molina, Thierry Jo; Jamart, Jacques; Gaulard, Philippe; Leroy, Karen; Briere, Josette; Theate, Ivan; Thieblemont, Catherine; Bosly, Andre; Herin, Michel; Hamels, Jacques; Remacle, Jose
2009-03-01
Lymphomas are classified according to the World Health Organisation (WHO) classification which defines subtypes on the basis of clinical, morphological, immunophenotypic, molecular and cytogenetic criteria. Differential diagnosis of the subtypes is sometimes difficult, especially for small B-cell lymphoma (SBCL). Standardisation of molecular genetic assays using multiple gene expression analysis by microarrays could be a useful complement to the current diagnosis. The aim of the present study was to develop a low density DNA microarray for the analysis of 107 genes associated with B-cell non-Hodgkin lymphoma and to evaluate its performance in the diagnosis of SBCL. A predictive tool based on Fisher discriminant analysis using a training set of 40 patients including four different subtypes (follicular lymphoma n = 15, mantle cell lymphoma n = 7, B-cell chronic lymphocytic leukemia n = 6 and splenic marginal zone lymphoma n = 12) was designed. A short additional preliminary analysis to gauge the accuracy of this signature was then performed on an external set of nine patients. Using this model, eight of nine of those samples were classified successfully. This pilot study demonstrates that such a microarray tool may be a promising diagnostic approach for small B-cell non-Hodgkin lymphoma.
MAGMA: analysis of two-channel microarrays made easy.
Rehrauer, Hubert; Zoller, Stefan; Schlapbach, Ralph
2007-07-01
The web application MAGMA provides a simple and intuitive interface to identify differentially expressed genes from two-channel microarray data. While the underlying algorithms are not superior to those of similar web applications, MAGMA is particularly user friendly and can be used without prior training. The user interface guides the novice user through the most typical microarray analysis workflow consisting of data upload, annotation, normalization and statistical analysis. It automatically generates R-scripts that document MAGMA's entire data processing steps, thereby allowing the user to regenerate all results in his local R installation. The implementation of MAGMA follows the model-view-controller design pattern that strictly separates the R-based statistical data processing, the web-representation and the application logic. This modular design makes the application flexible and easily extendible by experts in one of the fields: statistical microarray analysis, web design or software development. State-of-the-art Java Server Faces technology was used to generate the web interface and to perform user input processing. MAGMA's object-oriented modular framework makes it easily extendible and applicable to other fields and demonstrates that modern Java technology is also suitable for rather small and concise academic projects. MAGMA is freely available at www.magma-fgcz.uzh.ch.
Tomato Expression Database (TED): a suite of data presentation and analysis tools
Fei, Zhangjun; Tang, Xuemei; Alba, Rob; Giovannoni, James
2006-01-01
The Tomato Expression Database (TED) includes three integrated components. The Tomato Microarray Data Warehouse serves as a central repository for raw gene expression data derived from the public tomato cDNA microarray. In addition to expression data, TED stores experimental design and array information in compliance with the MIAME guidelines and provides web interfaces for researchers to retrieve data for their own analysis and use. The Tomato Microarray Expression Database contains normalized and processed microarray data for ten time points with nine pair-wise comparisons during fruit development and ripening in a normal tomato variety and nearly isogenic single gene mutants impacting fruit development and ripening. Finally, the Tomato Digital Expression Database contains raw and normalized digital expression (EST abundance) data derived from analysis of the complete public tomato EST collection containing >150 000 ESTs derived from 27 different non-normalized EST libraries. This last component also includes tools for the comparison of tomato and Arabidopsis digital expression data. A set of query interfaces and analysis, and visualization tools have been developed and incorporated into TED, which aid users in identifying and deciphering biologically important information from our datasets. TED can be accessed at . PMID:16381976
Tomato Expression Database (TED): a suite of data presentation and analysis tools.
Fei, Zhangjun; Tang, Xuemei; Alba, Rob; Giovannoni, James
2006-01-01
The Tomato Expression Database (TED) includes three integrated components. The Tomato Microarray Data Warehouse serves as a central repository for raw gene expression data derived from the public tomato cDNA microarray. In addition to expression data, TED stores experimental design and array information in compliance with the MIAME guidelines and provides web interfaces for researchers to retrieve data for their own analysis and use. The Tomato Microarray Expression Database contains normalized and processed microarray data for ten time points with nine pair-wise comparisons during fruit development and ripening in a normal tomato variety and nearly isogenic single gene mutants impacting fruit development and ripening. Finally, the Tomato Digital Expression Database contains raw and normalized digital expression (EST abundance) data derived from analysis of the complete public tomato EST collection containing >150,000 ESTs derived from 27 different non-normalized EST libraries. This last component also includes tools for the comparison of tomato and Arabidopsis digital expression data. A set of query interfaces and analysis, and visualization tools have been developed and incorporated into TED, which aid users in identifying and deciphering biologically important information from our datasets. TED can be accessed at http://ted.bti.cornell.edu.
Development and application of a microarray meter tool to optimize microarray experiments
Rouse, Richard JD; Field, Katrine; Lapira, Jennifer; Lee, Allen; Wick, Ivan; Eckhardt, Colleen; Bhasker, C Ramana; Soverchia, Laura; Hardiman, Gary
2008-01-01
Background Successful microarray experimentation requires a complex interplay between the slide chemistry, the printing pins, the nucleic acid probes and targets, and the hybridization milieu. Optimization of these parameters and a careful evaluation of emerging slide chemistries are a prerequisite to any large scale array fabrication effort. We have developed a 'microarray meter' tool which assesses the inherent variations associated with microarray measurement prior to embarking on large scale projects. Findings The microarray meter consists of nucleic acid targets (reference and dynamic range control) and probe components. Different plate designs containing identical probe material were formulated to accommodate different robotic and pin designs. We examined the variability in probe quality and quantity (as judged by the amount of DNA printed and remaining post-hybridization) using three robots equipped with capillary printing pins. Discussion The generation of microarray data with minimal variation requires consistent quality control of the (DNA microarray) manufacturing and experimental processes. Spot reproducibility is a measure primarily of the variations associated with printing. The microarray meter assesses array quality by measuring the DNA content for every feature. It provides a post-hybridization analysis of array quality by scoring probe performance using three metrics, a) a measure of variability in the signal intensities, b) a measure of the signal dynamic range and c) a measure of variability of the spot morphologies. PMID:18710498
USDA-ARS?s Scientific Manuscript database
Chickens were immunized subcutaneously with an Eimeria recombinant profilin protein plus ISA 70 VG (ISA 70) or ISA 71 VG (ISA 71) water-in-oil adjuvants, or with profilin alone, and comparative RNA microarray hybridizations were performed to ascertain global transcriptome changes induced by profilin...
Inoue, Daisuke; Hinoura, Takuji; Suzuki, Noriko; Pang, Junqin; Malla, Rabin; Shrestha, Sadhana; Chapagain, Saroj Kumar; Matsuzawa, Hiroaki; Nakamura, Takashi; Tanaka, Yasuhiro; Ike, Michihiko; Nishida, Kei; Sei, Kazunari
2015-01-01
Because of heavy dependence on groundwater for drinking water and other domestic use, microbial contamination of groundwater is a serious problem in the Kathmandu Valley, Nepal. This study investigated comprehensively the occurrence of pathogenic bacteria in shallow well groundwater in the Kathmandu Valley by applying DNA microarray analysis targeting 941 pathogenic bacterial species/groups. Water quality measurements found significant coliform (fecal) contamination in 10 of the 11 investigated groundwater samples and significant nitrogen contamination in some samples. The results of DNA microarray analysis revealed the presence of 1-37 pathogen species/groups, including 1-27 biosafety level 2 ones, in 9 of the 11 groundwater samples. While the detected pathogens included several feces- and animal-related ones, those belonging to Legionella and Arthrobacter, which were considered not to be directly associated with feces, were detected prevalently. This study could provide a rough picture of overall pathogenic bacterial contamination in the Kathmandu Valley, and demonstrated the usefulness of DNA microarray analysis as a comprehensive screening tool of a wide variety of pathogenic bacteria.
Microarray Analysis of Long Noncoding RNAs in Female Diabetic Peripheral Neuropathy Patients.
Luo, Lin; Ji, Lin-Dan; Cai, Jiang-Jia; Feng, Mei; Zhou, Mi; Hu, Su-Pei; Xu, Jin; Zhou, Wen-Hua
2018-01-01
Diabetic peripheral neuropathy (DPN) is the most common complication of diabetes mellitus (DM). Because of its controversial pathogenesis, DPN is still not diagnosed or managed properly in most patients. In this study, human lncRNA microarrays were used to identify the differentially expressed lncRNAs in DM and DPN patients, and some of the discovered lncRNAs were further validated in additional 78 samples by quantitative realtime PCR (qRT-PCR). The microarray analysis identified 446 and 1327 differentially expressed lncRNAs in DM and DPN, respectively. The KEGG pathway analysis further revealed that the differentially expressed lncRNA-coexpressed mRNAs between DPN and DM groups were significantly enriched in the MAPK signaling pathway. The lncRNA/mRNA coexpression network indicated that BDNF and TRAF2 correlated with 6 lncRNAs. The qRT-PCR confirmed the initial microarray results. These findings demonstrated that the interplay between lncRNAs and mRNA may be involved in the pathogenesis of DPN, especially the neurotrophin-MAPK signaling pathway, thus providing relevant information for future studies. © 2018 The Author(s). Published by S. Karger AG, Basel.
MASQOT: a method for cDNA microarray spot quality control
Bylesjö, Max; Eriksson, Daniel; Sjödin, Andreas; Sjöström, Michael; Jansson, Stefan; Antti, Henrik; Trygg, Johan
2005-01-01
Background cDNA microarray technology has emerged as a major player in the parallel detection of biomolecules, but still suffers from fundamental technical problems. Identifying and removing unreliable data is crucial to prevent the risk of receiving illusive analysis results. Visual assessment of spot quality is still a common procedure, despite the time-consuming work of manually inspecting spots in the range of hundreds of thousands or more. Results A novel methodology for cDNA microarray spot quality control is outlined. Multivariate discriminant analysis was used to assess spot quality based on existing and novel descriptors. The presented methodology displays high reproducibility and was found superior in identifying unreliable data compared to other evaluated methodologies. Conclusion The proposed methodology for cDNA microarray spot quality control generates non-discrete values of spot quality which can be utilized as weights in subsequent analysis procedures as well as to discard spots of undesired quality using the suggested threshold values. The MASQOT approach provides a consistent assessment of spot quality and can be considered an alternative to the labor-intensive manual quality assessment process. PMID:16223442
Xia, Yu; Yang, Yongchao; Huang, Shufang; Wu, Yueheng; Li, Ping; Zhuang, Jian
2018-03-24
This study aimed to determine chromosomal abnormalities and copy number variations (CNVs) in fetuses with congenital heart disease (CHD) by chromosomal microarray analysis (CMA). One hundred and ten cases with CHD detected by prenatal echocardiography were enrolled in the study; 27 cases were simple CHDs, and 83 were complex CHDs. Chromosomal microarray analysis was performed on the Affymetrix CytoScan HD platform. All annotated CNVs were validated by quantitative PCR. Chromosomal microarray analysis identified 6 cases with chromosomal abnormalities, including 2 cases with trisomy 21, 2 cases with trisomy 18, 1 case with trisomy 13, and 1 unusual case of mosaic trisomy 21. Pathogenic CNVs were detected in 15.5% (17/110) of the fetuses with CHDs, including 13 cases with CHD-associated CNVs. We further identified 10 genes as likely novel CHD candidate genes through gene functional enrichment analysis. We also found that pathogenic CMA results impacted the rate of pregnancy termination. This study shows that CMA is particularly effective for identifying chromosomal abnormalities and CNVs in fetuses with CHDs as well as having an effect on obstetrical outcomes. The elucidation of the genetic basis of CHDs will continue to expand our understanding of the etiology of CHDs. © 2018 John Wiley & Sons, Ltd.
Karsten, Stanislav L.; Van Deerlin, Vivianna M. D.; Sabatti, Chiara; Gill, Lisa H.; Geschwind, Daniel H.
2002-01-01
Archival formalin-fixed, paraffin-embedded and ethanol-fixed tissues represent a potentially invaluable resource for gene expression analysis, as they are the most widely available material for studies of human disease. Little data are available evaluating whether RNA obtained from fixed (archival) tissues could produce reliable and reproducible microarray expression data. Here we compare the use of RNA isolated from human archival tissues fixed in ethanol and formalin to frozen tissue in cDNA microarray experiments. Since an additional factor that can limit the utility of archival tissue is the often small quantities available, we also evaluate the use of the tyramide signal amplification method (TSA), which allows the use of small amounts of RNA. Detailed analysis indicates that TSA provides a consistent and reproducible signal amplification method for cDNA microarray analysis, across both arrays and the genes tested. Analysis of this method also highlights the importance of performing non-linear channel normalization and dye switching. Furthermore, archived, fixed specimens can perform well, but not surprisingly, produce more variable results than frozen tissues. Consistent results are more easily obtainable using ethanol-fixed tissues, whereas formalin-fixed tissue does not typically provide a useful substrate for cDNA synthesis and labeling. PMID:11788730
Wang, Shih-Han; Cheng, Chuen-Yu; Tang, Pin-Chi; Chen, Chih-Feng; Chen, Hsin-Hsin; Lee, Yen-Pai; Huang, San-Yuan
2013-01-15
Acute heat stress affects genes involved in spermatogenesis in mammals. However, there is apparently no elaborate research on the effects of acute heat stress on gene expression in avian testes. The purpose of this study was to investigate global gene expression in testes of the L2 strain of Taiwan country chicken after acute heat stress. Twelve roosters, 45 weeks old, were allocated into four groups, including control roosters kept at 25 °C, roosters subjected to 38 °C acute heat stress for 4 hours without recovery, with 2-hour recovery, and with 6-hour recovery, respectively. Testis samples were collected for RNA isolation and microarray analysis. Based on gene expression profiles, 169 genes were upregulated and 140 genes were downregulated after heat stress using a cutoff value of twofold or greater change. Based on gene ontology analysis, differentially expressed genes were mainly related to response to stress, transport, signal transduction, and metabolism. A functional network analysis displayed that heat shock protein genes and related chaperones were the major upregulated groups in chicken testes after acute heat stress. A quantitative real-time polymerase chain reaction analysis of mRNA expressions of HSP70, HSP90AA1, BAG3, SERPINB2, HSP25, DNAJA4, CYP3A80, CIRBP, and TAGLN confirmed the results of the microarray analysis. Because the HSP genes (HSP25, HSP70, and HSP90AA1) and the antiapoptotic BAG3 gene were dramatically altered in heat-stressed chicken testes, we concluded that these genes were important factors in the avian testes under acute heat stress. Whether these genes could be candidate genes for thermotolerance in roosters requires further investigation. Copyright © 2013 Elsevier Inc. All rights reserved.
MIGS-GPU: Microarray Image Gridding and Segmentation on the GPU.
Katsigiannis, Stamos; Zacharia, Eleni; Maroulis, Dimitris
2017-05-01
Complementary DNA (cDNA) microarray is a powerful tool for simultaneously studying the expression level of thousands of genes. Nevertheless, the analysis of microarray images remains an arduous and challenging task due to the poor quality of the images that often suffer from noise, artifacts, and uneven background. In this study, the MIGS-GPU [Microarray Image Gridding and Segmentation on Graphics Processing Unit (GPU)] software for gridding and segmenting microarray images is presented. MIGS-GPU's computations are performed on the GPU by means of the compute unified device architecture (CUDA) in order to achieve fast performance and increase the utilization of available system resources. Evaluation on both real and synthetic cDNA microarray images showed that MIGS-GPU provides better performance than state-of-the-art alternatives, while the proposed GPU implementation achieves significantly lower computational times compared to the respective CPU approaches. Consequently, MIGS-GPU can be an advantageous and useful tool for biomedical laboratories, offering a user-friendly interface that requires minimum input in order to run.
2012-01-01
Over the last decade, the introduction of microarray technology has had a profound impact on gene expression research. The publication of studies with dissimilar or altogether contradictory results, obtained using different microarray platforms to analyze identical RNA samples, has raised concerns about the reliability of this technology. The MicroArray Quality Control (MAQC) project was initiated to address these concerns, as well as other performance and data analysis issues. Expression data on four titration pools from two distinct reference RNA samples were generated at multiple test sites using a variety of microarray-based and alternative technology platforms. Here we describe the experimental design and probe mapping efforts behind the MAQC project. We show intraplatform consistency across test sites as well as a high level of interplatform concordance in terms of genes identified as differentially expressed. This study provides a resource that represents an important first step toward establishing a framework for the use of microarrays in clinical and regulatory settings. PMID:16964229
Women's experiences receiving abnormal prenatal chromosomal microarray testing results.
Bernhardt, Barbara A; Soucier, Danielle; Hanson, Karen; Savage, Melissa S; Jackson, Laird; Wapner, Ronald J
2013-02-01
Genomic microarrays can detect copy-number variants not detectable by conventional cytogenetics. This technology is diffusing rapidly into prenatal settings even though the clinical implications of many copy-number variants are currently unknown. We conducted a qualitative pilot study to explore the experiences of women receiving abnormal results from prenatal microarray testing performed in a research setting. Participants were a subset of women participating in a multicenter prospective study "Prenatal Cytogenetic Diagnosis by Array-based Copy Number Analysis." Telephone interviews were conducted with 23 women receiving abnormal prenatal microarray results. We found that five key elements dominated the experiences of women who had received abnormal prenatal microarray results: an offer too good to pass up, blindsided by the results, uncertainty and unquantifiable risks, need for support, and toxic knowledge. As prenatal microarray testing is increasingly used, uncertain findings will be common, resulting in greater need for careful pre- and posttest counseling, and more education of and resources for providers so they can adequately support the women who are undergoing testing.
Fuzzy support vector machine: an efficient rule-based classification technique for microarrays.
Hajiloo, Mohsen; Rabiee, Hamid R; Anooshahpour, Mahdi
2013-01-01
The abundance of gene expression microarray data has led to the development of machine learning algorithms applicable for tackling disease diagnosis, disease prognosis, and treatment selection problems. However, these algorithms often produce classifiers with weaknesses in terms of accuracy, robustness, and interpretability. This paper introduces fuzzy support vector machine which is a learning algorithm based on combination of fuzzy classifiers and kernel machines for microarray classification. Experimental results on public leukemia, prostate, and colon cancer datasets show that fuzzy support vector machine applied in combination with filter or wrapper feature selection methods develops a robust model with higher accuracy than the conventional microarray classification models such as support vector machine, artificial neural network, decision trees, k nearest neighbors, and diagonal linear discriminant analysis. Furthermore, the interpretable rule-base inferred from fuzzy support vector machine helps extracting biological knowledge from microarray data. Fuzzy support vector machine as a new classification model with high generalization power, robustness, and good interpretability seems to be a promising tool for gene expression microarray classification.
Haitsma, Jack J.; Furmli, Suleiman; Masoom, Hussain; Liu, Mingyao; Imai, Yumiko; Slutsky, Arthur S.; Beyene, Joseph; Greenwood, Celia M. T.; dos Santos, Claudia
2012-01-01
Objectives To perform a meta-analysis of gene expression microarray data from animal studies of lung injury, and to identify an injury-specific gene expression signature capable of predicting the development of lung injury in humans. Methods We performed a microarray meta-analysis using 77 microarray chips across six platforms, two species and different animal lung injury models exposed to lung injury with or/and without mechanical ventilation. Individual gene chips were classified and grouped based on the strategy used to induce lung injury. Effect size (change in gene expression) was calculated between non-injurious and injurious conditions comparing two main strategies to pool chips: (1) one-hit and (2) two-hit lung injury models. A random effects model was used to integrate individual effect sizes calculated from each experiment. Classification models were built using the gene expression signatures generated by the meta-analysis to predict the development of lung injury in human lung transplant recipients. Results Two injury-specific lists of differentially expressed genes generated from our meta-analysis of lung injury models were validated using external data sets and prospective data from animal models of ventilator-induced lung injury (VILI). Pathway analysis of gene sets revealed that both new and previously implicated VILI-related pathways are enriched with differentially regulated genes. Classification model based on gene expression signatures identified in animal models of lung injury predicted development of primary graft failure (PGF) in lung transplant recipients with larger than 80% accuracy based upon injury profiles from transplant donors. We also found that better classifier performance can be achieved by using meta-analysis to identify differentially-expressed genes than using single study-based differential analysis. Conclusion Taken together, our data suggests that microarray analysis of gene expression data allows for the detection of “injury" gene predictors that can classify lung injury samples and identify patients at risk for clinically relevant lung injury complications. PMID:23071521
Fasoli, Marianna; Dal Santo, Silvia; Zenoni, Sara; Tornielli, Giovanni Battista; Farina, Lorenzo; Zamboni, Anita; Porceddu, Andrea; Venturini, Luca; Bicego, Manuele; Murino, Vittorio; Ferrarini, Alberto; Delledonne, Massimo; Pezzotti, Mario
2012-09-01
We developed a genome-wide transcriptomic atlas of grapevine (Vitis vinifera) based on 54 samples representing green and woody tissues and organs at different developmental stages as well as specialized tissues such as pollen and senescent leaves. Together, these samples expressed ∼91% of the predicted grapevine genes. Pollen and senescent leaves had unique transcriptomes reflecting their specialized functions and physiological status. However, microarray and RNA-seq analysis grouped all the other samples into two major classes based on maturity rather than organ identity, namely, the vegetative/green and mature/woody categories. This division represents a fundamental transcriptomic reprogramming during the maturation process and was highlighted by three statistical approaches identifying the transcriptional relationships among samples (correlation analysis), putative biomarkers (O2PLS-DA approach), and sets of strongly and consistently expressed genes that define groups (topics) of similar samples (biclustering analysis). Gene coexpression analysis indicated that the mature/woody developmental program results from the reiterative coactivation of pathways that are largely inactive in vegetative/green tissues, often involving the coregulation of clusters of neighboring genes and global regulation based on codon preference. This global transcriptomic reprogramming during maturation has not been observed in herbaceous annual species and may be a defining characteristic of perennial woody plants.
Addressable droplet microarrays for single cell protein analysis.
Salehi-Reyhani, Ali; Burgin, Edward; Ces, Oscar; Willison, Keith R; Klug, David R
2014-11-07
Addressable droplet microarrays are potentially attractive as a way to achieve miniaturised, reduced volume, high sensitivity analyses without the need to fabricate microfluidic devices or small volume chambers. We report a practical method for producing oil-encapsulated addressable droplet microarrays which can be used for such analyses. To demonstrate their utility, we undertake a series of single cell analyses, to determine the variation in copy number of p53 proteins in cells of a human cancer cell line.
Kennedy, Laura; Vass, J. Keith; Haggart, D. Ross; Moore, Steve; Burczynski, Michael E.; Crowther, Dan; Miele, Gino
2008-01-01
Peripheral blood as a surrogate tissue for transcriptome profiling holds great promise for the discovery of diagnostic and prognostic disease biomarkers, particularly when target tissues of disease are not readily available. To maximize the reliability of gene expression data generated from clinical blood samples, both the sample collection and the microarray probe generation methods should be optimized to provide stabilized, reproducible and representative gene expression profiles faithfully representing the transcriptional profiles of the constituent blood cell types present in the circulation. Given the increasing innovation in this field in recent years, we investigated a combination of methodological advances in both RNA stabilisation and microarray probe generation with the goal of achieving robust, reliable and representative transcriptional profiles from whole blood. To assess the whole blood profiles, the transcriptomes of purified blood cell types were measured and compared with the global transcriptomes measured in whole blood. The results demonstrate that a combination of PAXgene™ RNA stabilising technology and single-stranded cDNA probe generation afforded by the NuGEN Ovation RNA amplification system V2™ enables an approach that yields faithful representation of specific hematopoietic cell lineage transcriptomes in whole blood without the necessity for prior sample fractionation, cell enrichment or globin reduction. Storage stability assessments of the PAXgene™ blood samples also advocate a short, fixed room temperature storage time for all PAXgene™ blood samples collected for the purposes of global transcriptional profiling in clinical studies. PMID:19578521
Popescu, F; Jaslow, C R; Kutteh, W H
2018-04-01
Will the addition of 24-chromosome microarray analysis on miscarriage tissue combined with the standard American Society for Reproductive Medicine (ASRM) evaluation for recurrent miscarriage explain most losses? Over 90% of patients with recurrent pregnancy loss (RPL) will have a probable or definitive cause identified when combining genetic testing on miscarriage tissue with the standard ASRM evaluation for recurrent miscarriage. RPL is estimated to occur in 2-4% of reproductive age couples. A probable cause can be identified in approximately 50% of patients after an ASRM recommended workup including an evaluation for parental chromosomal abnormalities, congenital and acquired uterine anomalies, endocrine imbalances and autoimmune factors including antiphospholipid syndrome. Single-center, prospective cohort study that included 100 patients seen in a private RPL clinic from 2014 to 2017. All 100 women had two or more pregnancy losses, a complete evaluation for RPL as defined by the ASRM, and miscarriage tissue evaluated by 24-chromosome microarray analysis after their second or subsequent miscarriage. Frequencies of abnormal results for evidence-based diagnostic tests considered definite or probable causes of RPL (karyotyping for parental chromosomal abnormalities, and 24-chromosome microarray evaluation for products of conception (POC); pelvic sonohysterography, hysterosalpingogram, or hysteroscopy for uterine anomalies; immunological tests for lupus anticoagulant and anticardiolipin antibodies; and blood tests for thyroid stimulating hormone (TSH), prolactin and hemoglobin A1c) were evaluated. We excluded cases where there was maternal cell contamination of the miscarriage tissue or if the ASRM evaluation was incomplete. A cost analysis for the evaluation of RPL was conducted to determine whether a proposed procedure of 24-chromome microarray evaluation followed by an ASRM RPL workup (for those RPL patients who had a normal 24-chromosome microarray evaluation) was more cost-efficient than conducting ASRM RPL workups on RPL patients followed by 24-chromosome microarray analysis (for those RPL patients who had a normal RPL workup). A definite or probable cause of pregnancy loss was identified in the vast majority (95/100; 95%) of RPL patients when a 24-chromosome pair microarray evaluation of POC testing is combined with the standard ASRM RPL workup evaluation at the time of the second or subsequent loss. The ASRM RPL workup identified an abnormality and a probable explanation for pregnancy loss in only 45/100 or 45% of all patients. A definite abnormality was identified in 67/100 patients or 67% when initial testing was performed using 24-chromosome microarray analyses on the miscarriage tissue. Only 5/100 (5%) patients, who had a euploid loss and a normal ASRM RPL workup, had a pregnancy loss without a probable or definitive cause identified. All other losses were explained by an abnormal 24-chromosome microarray analysis of the miscarriage tissue, an abnormal finding of the RPL workup, or a combination of both. Results from the cost analysis indicated that an initial approach of using a 24-chromosome microarray analysis on miscarriage tissue resulted in a 50% savings in cost to the health care system and to the patient. This is a single-center study on a small group of well-characterized women with RPL. There was an incomplete follow-up on subsequent pregnancy outcomes after evaluation, however this should not affect our principal results. The maternal age of patients varied from 26 to 45 years old. More aneuploid pregnancy losses would be expected in older women, particularly over the age of 35 years old. Evaluation of POC using 24-chromosome microarray analysis adds significantly to the ASRM recommended evaluation of RPL. Genetic evaluation on miscarriage tissue obtained at the time of the second and subsequent pregnancy losses should be offered to all couples with two or more consecutive pregnancy losses. The combination of a genetic evaluation on miscarriage tissue with an evidence-based evaluation for RPL will identify a probable or definitive cause in over 90% of miscarriages. No funding was received for this study and there are no conflicts of interest to declare. Not applicable.
A Platform for Combined DNA and Protein Microarrays Based on Total Internal Reflection Fluorescence
Asanov, Alexander; Zepeda, Angélica; Vaca, Luis
2012-01-01
We have developed a novel microarray technology based on total internal reflection fluorescence (TIRF) in combination with DNA and protein bioassays immobilized at the TIRF surface. Unlike conventional microarrays that exhibit reduced signal-to-background ratio, require several stages of incubation, rinsing and stringency control, and measure only end-point results, our TIRF microarray technology provides several orders of magnitude better signal-to-background ratio, performs analysis rapidly in one step, and measures the entire course of association and dissociation kinetics between target DNA and protein molecules and the bioassays. In many practical cases detection of only DNA or protein markers alone does not provide the necessary accuracy for diagnosing a disease or detecting a pathogen. Here we describe TIRF microarrays that detect DNA and protein markers simultaneously, which reduces the probabilities of false responses. Supersensitive and multiplexed TIRF DNA and protein microarray technology may provide a platform for accurate diagnosis or enhanced research studies. Our TIRF microarray system can be mounted on upright or inverted microscopes or interfaced directly with CCD cameras equipped with a single objective, facilitating the development of portable devices. As proof-of-concept we applied TIRF microarrays for detecting molecular markers from Bacillus anthracis, the pathogen responsible for anthrax. PMID:22438738
Validation of MIMGO: a method to identify differentially expressed GO terms in a microarray dataset
2012-01-01
Background We previously proposed an algorithm for the identification of GO terms that commonly annotate genes whose expression is upregulated or downregulated in some microarray data compared with in other microarray data. We call these “differentially expressed GO terms” and have named the algorithm “matrix-assisted identification method of differentially expressed GO terms” (MIMGO). MIMGO can also identify microarray data in which genes annotated with a differentially expressed GO term are upregulated or downregulated. However, MIMGO has not yet been validated on a real microarray dataset using all available GO terms. Findings We combined Gene Set Enrichment Analysis (GSEA) with MIMGO to identify differentially expressed GO terms in a yeast cell cycle microarray dataset. GSEA followed by MIMGO (GSEA + MIMGO) correctly identified (p < 0.05) microarray data in which genes annotated to differentially expressed GO terms are upregulated. We found that GSEA + MIMGO was slightly less effective than, or comparable to, GSEA (Pearson), a method that uses Pearson’s correlation as a metric, at detecting true differentially expressed GO terms. However, unlike other methods including GSEA (Pearson), GSEA + MIMGO can comprehensively identify the microarray data in which genes annotated with a differentially expressed GO term are upregulated or downregulated. Conclusions MIMGO is a reliable method to identify differentially expressed GO terms comprehensively. PMID:23232071
The use of open source bioinformatics tools to dissect transcriptomic data.
Nitsche, Benjamin M; Ram, Arthur F J; Meyer, Vera
2012-01-01
Microarrays are a valuable technology to study fungal physiology on a transcriptomic level. Various microarray platforms are available comprising both single and two channel arrays. Despite different technologies, preprocessing of microarray data generally includes quality control, background correction, normalization, and summarization of probe level data. Subsequently, depending on the experimental design, diverse statistical analysis can be performed, including the identification of differentially expressed genes and the construction of gene coexpression networks.We describe how Bioconductor, a collection of open source and open development packages for the statistical programming language R, can be used for dissecting microarray data. We provide fundamental details that facilitate the process of getting started with R and Bioconductor. Using two publicly available microarray datasets from Aspergillus niger, we give detailed protocols on how to identify differentially expressed genes and how to construct gene coexpression networks.
NASA Astrophysics Data System (ADS)
Liu, Robin H.; Lodes, Mike; Fuji, H. Sho; Danley, David; McShea, Andrew
Microarray assays typically involve multistage sample processing and fluidic handling, which are generally labor-intensive and time-consuming. Automation of these processes would improve robustness, reduce run-to-run and operator-to-operator variation, and reduce costs. In this chapter, a fully integrated and self-contained microfluidic biochip device that has been developed to automate the fluidic handling steps for microarray-based gene expression or genotyping analysis is presented. The device consists of a semiconductor-based CustomArray® chip with 12,000 features and a microfluidic cartridge. The CustomArray was manufactured using a semiconductor-based in situ synthesis technology. The micro-fluidic cartridge consists of microfluidic pumps, mixers, valves, fluid channels, and reagent storage chambers. Microarray hybridization and subsequent fluidic handling and reactions (including a number of washing and labeling steps) were performed in this fully automated and miniature device before fluorescent image scanning of the microarray chip. Electrochemical micropumps were integrated in the cartridge to provide pumping of liquid solutions. A micromixing technique based on gas bubbling generated by electrochemical micropumps was developed. Low-cost check valves were implemented in the cartridge to prevent cross-talk of the stored reagents. Gene expression study of the human leukemia cell line (K562) and genotyping detection and sequencing of influenza A subtypes have been demonstrated using this integrated biochip platform. For gene expression assays, the microfluidic CustomArray device detected sample RNAs with a concentration as low as 0.375 pM. Detection was quantitative over more than three orders of magnitude. Experiment also showed that chip-to-chip variability was low indicating that the integrated microfluidic devices eliminate manual fluidic handling steps that can be a significant source of variability in genomic analysis. The genotyping results showed that the device identified influenza A hemagglutinin and neuraminidase subtypes and sequenced portions of both genes, demonstrating the potential of integrated microfluidic and microarray technology for multiple virus detection. The device provides a cost-effective solution to eliminate labor-intensive and time-consuming fluidic handling steps and allows microarray-based DNA analysis in a rapid and automated fashion.
Peterson, Leif E
2002-01-01
CLUSFAVOR (CLUSter and Factor Analysis with Varimax Orthogonal Rotation) 5.0 is a Windows-based computer program for hierarchical cluster and principal-component analysis of microarray-based transcriptional profiles. CLUSFAVOR 5.0 standardizes input data; sorts data according to gene-specific coefficient of variation, standard deviation, average and total expression, and Shannon entropy; performs hierarchical cluster analysis using nearest-neighbor, unweighted pair-group method using arithmetic averages (UPGMA), or furthest-neighbor joining methods, and Euclidean, correlation, or jack-knife distances; and performs principal-component analysis. PMID:12184816
Integrative analysis of RUNX1 downstream pathways and target genes
Michaud, Joëlle; Simpson, Ken M; Escher, Robert; Buchet-Poyau, Karine; Beissbarth, Tim; Carmichael, Catherine; Ritchie, Matthew E; Schütz, Frédéric; Cannon, Ping; Liu, Marjorie; Shen, Xiaofeng; Ito, Yoshiaki; Raskind, Wendy H; Horwitz, Marshall S; Osato, Motomi; Turner, David R; Speed, Terence P; Kavallaris, Maria; Smyth, Gordon K; Scott, Hamish S
2008-01-01
Background The RUNX1 transcription factor gene is frequently mutated in sporadic myeloid and lymphoid leukemia through translocation, point mutation or amplification. It is also responsible for a familial platelet disorder with predisposition to acute myeloid leukemia (FPD-AML). The disruption of the largely unknown biological pathways controlled by RUNX1 is likely to be responsible for the development of leukemia. We have used multiple microarray platforms and bioinformatic techniques to help identify these biological pathways to aid in the understanding of why RUNX1 mutations lead to leukemia. Results Here we report genes regulated either directly or indirectly by RUNX1 based on the study of gene expression profiles generated from 3 different human and mouse platforms. The platforms used were global gene expression profiling of: 1) cell lines with RUNX1 mutations from FPD-AML patients, 2) over-expression of RUNX1 and CBFβ, and 3) Runx1 knockout mouse embryos using either cDNA or Affymetrix microarrays. We observe that our datasets (lists of differentially expressed genes) significantly correlate with published microarray data from sporadic AML patients with mutations in either RUNX1 or its cofactor, CBFβ. A number of biological processes were identified among the differentially expressed genes and functional assays suggest that heterozygous RUNX1 point mutations in patients with FPD-AML impair cell proliferation, microtubule dynamics and possibly genetic stability. In addition, analysis of the regulatory regions of the differentially expressed genes has for the first time systematically identified numerous potential novel RUNX1 target genes. Conclusion This work is the first large-scale study attempting to identify the genetic networks regulated by RUNX1, a master regulator in the development of the hematopoietic system and leukemia. The biological pathways and target genes controlled by RUNX1 will have considerable importance in disease progression in both familial and sporadic leukemia as well as therapeutic implications. PMID:18671852
NASA Technical Reports Server (NTRS)
Koizumi, Yoshikazu; Kelly, John J.; Nakagawa, Tatsunori; Urakawa, Hidetoshi; El-Fantroussi, Said; Al-Muzaini, Saleh; Fukui, Manabu; Urushigawa, Yoshikuni; Stahl, David A.
2002-01-01
A mesophilic toluene-degrading consortium (TDC) and an ethylbenzene-degrading consortium (EDC) were established under sulfate-reducing conditions. These consortia were first characterized by denaturing gradient gel electrophoresis (DGGE) fingerprinting of PCR-amplified 16S rRNA gene fragments, followed by sequencing. The sequences of the major bands (T-1 and E-2) belonging to TDC and EDC, respectively, were affiliated with the family Desulfobacteriaceae. Another major band from EDC (E-1) was related to an uncultured non-sulfate-reducing soil bacterium. Oligonucleotide probes specific for the 16S rRNAs of target organisms corresponding to T-1, E-1, and E-2 were designed, and hybridization conditions were optimized for two analytical formats, membrane and DNA microarray hybridization. Both formats were used to characterize the TDC and EDC, and the results of both were consistent with DGGE analysis. In order to assess the utility of the microarray format for analysis of environmental samples, oil-contaminated sediments from the coast of Kuwait were analyzed. The DNA microarray successfully detected bacterial nucleic acids from these samples, but probes targeting specific groups of sulfate-reducing bacteria did not give positive signals. The results of this study demonstrate the limitations and the potential utility of DNA microarrays for microbial community analysis.
Koizumi, Yoshikazu; Kelly, John J.; Nakagawa, Tatsunori; Urakawa, Hidetoshi; El-Fantroussi, Saïd; Al-Muzaini, Saleh; Fukui, Manabu; Urushigawa, Yoshikuni; Stahl, David A.
2002-01-01
A mesophilic toluene-degrading consortium (TDC) and an ethylbenzene-degrading consortium (EDC) were established under sulfate-reducing conditions. These consortia were first characterized by denaturing gradient gel electrophoresis (DGGE) fingerprinting of PCR-amplified 16S rRNA gene fragments, followed by sequencing. The sequences of the major bands (T-1 and E-2) belonging to TDC and EDC, respectively, were affiliated with the family Desulfobacteriaceae. Another major band from EDC (E-1) was related to an uncultured non-sulfate-reducing soil bacterium. Oligonucleotide probes specific for the 16S rRNAs of target organisms corresponding to T-1, E-1, and E-2 were designed, and hybridization conditions were optimized for two analytical formats, membrane and DNA microarray hybridization. Both formats were used to characterize the TDC and EDC, and the results of both were consistent with DGGE analysis. In order to assess the utility of the microarray format for analysis of environmental samples, oil-contaminated sediments from the coast of Kuwait were analyzed. The DNA microarray successfully detected bacterial nucleic acids from these samples, but probes targeting specific groups of sulfate-reducing bacteria did not give positive signals. The results of this study demonstrate the limitations and the potential utility of DNA microarrays for microbial community analysis. PMID:12088997
Ontology-based, Tissue MicroArray oriented, image centered tissue bank
Viti, Federica; Merelli, Ivan; Caprera, Andrea; Lazzari, Barbara; Stella, Alessandra; Milanesi, Luciano
2008-01-01
Background Tissue MicroArray technique is becoming increasingly important in pathology for the validation of experimental data from transcriptomic analysis. This approach produces many images which need to be properly managed, if possible with an infrastructure able to support tissue sharing between institutes. Moreover, the available frameworks oriented to Tissue MicroArray provide good storage for clinical patient, sample treatment and block construction information, but their utility is limited by the lack of data integration with biomolecular information. Results In this work we propose a Tissue MicroArray web oriented system to support researchers in managing bio-samples and, through the use of ontologies, enables tissue sharing aimed at the design of Tissue MicroArray experiments and results evaluation. Indeed, our system provides ontological description both for pre-analysis tissue images and for post-process analysis image results, which is crucial for information exchange. Moreover, working on well-defined terms it is then possible to query web resources for literature articles to integrate both pathology and bioinformatics data. Conclusions Using this system, users associate an ontology-based description to each image uploaded into the database and also integrate results with the ontological description of biosequences identified in every tissue. Moreover, it is possible to integrate the ontological description provided by the user with a full compliant gene ontology definition, enabling statistical studies about correlation between the analyzed pathology and the most commonly related biological processes. PMID:18460177
Richard, Arianne C; Lyons, Paul A; Peters, James E; Biasci, Daniele; Flint, Shaun M; Lee, James C; McKinney, Eoin F; Siegel, Richard M; Smith, Kenneth G C
2014-08-04
Although numerous investigations have compared gene expression microarray platforms, preprocessing methods and batch correction algorithms using constructed spike-in or dilution datasets, there remains a paucity of studies examining the properties of microarray data using diverse biological samples. Most microarray experiments seek to identify subtle differences between samples with variable background noise, a scenario poorly represented by constructed datasets. Thus, microarray users lack important information regarding the complexities introduced in real-world experimental settings. The recent development of a multiplexed, digital technology for nucleic acid measurement enables counting of individual RNA molecules without amplification and, for the first time, permits such a study. Using a set of human leukocyte subset RNA samples, we compared previously acquired microarray expression values with RNA molecule counts determined by the nCounter Analysis System (NanoString Technologies) in selected genes. We found that gene measurements across samples correlated well between the two platforms, particularly for high-variance genes, while genes deemed unexpressed by the nCounter generally had both low expression and low variance on the microarray. Confirming previous findings from spike-in and dilution datasets, this "gold-standard" comparison demonstrated signal compression that varied dramatically by expression level and, to a lesser extent, by dataset. Most importantly, examination of three different cell types revealed that noise levels differed across tissues. Microarray measurements generally correlate with relative RNA molecule counts within optimal ranges but suffer from expression-dependent accuracy bias and precision that varies across datasets. We urge microarray users to consider expression-level effects in signal interpretation and to evaluate noise properties in each dataset independently.
Weniger, Markus; Engelmann, Julia C; Schultz, Jörg
2007-01-01
Background Regulation of gene expression is relevant to many areas of biology and medicine, in the study of treatments, diseases, and developmental stages. Microarrays can be used to measure the expression level of thousands of mRNAs at the same time, allowing insight into or comparison of different cellular conditions. The data derived out of microarray experiments is highly dimensional and often noisy, and interpretation of the results can get intricate. Although programs for the statistical analysis of microarray data exist, most of them lack an integration of analysis results and biological interpretation. Results We have developed GEPAT, Genome Expression Pathway Analysis Tool, offering an analysis of gene expression data under genomic, proteomic and metabolic context. We provide an integration of statistical methods for data import and data analysis together with a biological interpretation for subsets of probes or single probes on the chip. GEPAT imports various types of oligonucleotide and cDNA array data formats. Different normalization methods can be applied to the data, afterwards data annotation is performed. After import, GEPAT offers various statistical data analysis methods, as hierarchical, k-means and PCA clustering, a linear model based t-test or chromosomal profile comparison. The results of the analysis can be interpreted by enrichment of biological terms, pathway analysis or interaction networks. Different biological databases are included, to give various information for each probe on the chip. GEPAT offers no linear work flow, but allows the usage of any subset of probes and samples as a start for a new data analysis. GEPAT relies on established data analysis packages, offers a modular approach for an easy extension, and can be run on a computer grid to allow a large number of users. It is freely available under the LGPL open source license for academic and commercial users at . Conclusion GEPAT is a modular, scalable and professional-grade software integrating analysis and interpretation of microarray gene expression data. An installation available for academic users can be found at . PMID:17543125
Shekhar, M S; Gomathi, A; Gopikrishna, G; Ponniah, A G
2015-06-01
White spot syndrome virus (WSSV) continues to be the most devastating viral pathogen infecting penaeid shrimp the world over. The genome of WSSV has been deciphered and characterized from three geographical isolates and significant progress has been made in developing various molecular diagnostic methods to detect the virus. However, the information on host immune gene response to WSSV pathogenesis is limited. Microarray analysis was carried out as an approach to analyse the gene expression in black tiger shrimp Penaeus monodon in response to WSSV infection. Gill tissues collected from the WSSV infected shrimp at 6, 24, 48 h and moribund stage were analysed for differential gene expression. Shrimp cDNAs of 40,059 unique sequences were considered for designing the microarray chip. The Cy3-labeled cRNA derived from healthy and WSSV-infected shrimp was subjected to hybridization with all the DNA spots in the microarray which revealed 8,633 and 11,147 as up- and down-regulated genes respectively at different time intervals post infection. The altered expression of these numerous genes represented diverse functions such as immune response, osmoregulation, apoptosis, nucleic acid binding, energy and metabolism, signal transduction, stress response and molting. The changes in gene expression profiles observed by microarray analysis provides molecular insights and framework of genes which are up- and down-regulated at different time intervals during WSSV infection in shrimp. The microarray data was validated by Real Time analysis of four differentially expressed genes involved in apoptosis (translationally controlled tumor protein, inhibitor of apoptosis protein, ubiquitin conjugated enzyme E2 and caspase) for gene expression levels. The role of apoptosis related genes in WSSV infected shrimp is discussed herein.
Reuse of imputed data in microarray analysis increases imputation efficiency
Kim, Ki-Yeol; Kim, Byoung-Jin; Yi, Gwan-Su
2004-01-01
Background The imputation of missing values is necessary for the efficient use of DNA microarray data, because many clustering algorithms and some statistical analysis require a complete data set. A few imputation methods for DNA microarray data have been introduced, but the efficiency of the methods was low and the validity of imputed values in these methods had not been fully checked. Results We developed a new cluster-based imputation method called sequential K-nearest neighbor (SKNN) method. This imputes the missing values sequentially from the gene having least missing values, and uses the imputed values for the later imputation. Although it uses the imputed values, the efficiency of this new method is greatly improved in its accuracy and computational complexity over the conventional KNN-based method and other methods based on maximum likelihood estimation. The performance of SKNN was in particular higher than other imputation methods for the data with high missing rates and large number of experiments. Application of Expectation Maximization (EM) to the SKNN method improved the accuracy, but increased computational time proportional to the number of iterations. The Multiple Imputation (MI) method, which is well known but not applied previously to microarray data, showed a similarly high accuracy as the SKNN method, with slightly higher dependency on the types of data sets. Conclusions Sequential reuse of imputed data in KNN-based imputation greatly increases the efficiency of imputation. The SKNN method should be practically useful to save the data of some microarray experiments which have high amounts of missing entries. The SKNN method generates reliable imputed values which can be used for further cluster-based analysis of microarray data. PMID:15504240
Hartmann, Luise; Stephenson, Christine F; Verkamp, Stephanie R; Johnson, Krystal R; Burnworth, Bettina; Hammock, Kelle; Brodersen, Lisa Eidenschink; de Baca, Monica E; Wells, Denise A; Loken, Michael R; Zehentner, Barbara K
2014-12-01
Array comparative genomic hybridization (aCGH) has become a powerful tool for analyzing hematopoietic neoplasms and identifying genome-wide copy number changes in a single assay. aCGH also has superior resolution compared with fluorescence in situ hybridization (FISH) or conventional cytogenetics. Integration of single nucleotide polymorphism (SNP) probes with microarray analysis allows additional identification of acquired uniparental disomy, a copy neutral aberration with known potential to contribute to tumor pathogenesis. However, a limitation of microarray analysis has been the inability to detect clonal heterogeneity in a sample. This study comprised 16 samples (acute myeloid leukemia, myelodysplastic syndrome, chronic lymphocytic leukemia, plasma cell neoplasm) with complex cytogenetic features and evidence of clonal evolution. We used an integrated manual peak reassignment approach combining analysis of aCGH and SNP microarray data for characterization of subclonal abnormalities. We compared array findings with results obtained from conventional cytogenetic and FISH studies. Clonal heterogeneity was detected in 13 of 16 samples by microarray on the basis of log2 values. Use of the manual peak reassignment analysis approach improved resolution of the sample's clonal composition and genetic heterogeneity in 10 of 13 (77%) patients. Moreover, in 3 patients, clonal disease progression was revealed by array analysis that was not evident by cytogenetic or FISH studies. Genetic abnormalities originating from separate clonal subpopulations can be identified and further characterized by combining aCGH and SNP hybridization results from 1 integrated microarray chip by use of the manual peak reassignment technique. Its clinical utility in comparison to conventional cytogenetic or FISH studies is demonstrated. © 2014 American Association for Clinical Chemistry.
Sharma, Nirmala; Anderson, Maureen; Kumar, Arvind; Zhang, Yan; Giblin, E Michael; Abrams, Suzanne R; Zaharia, L Irina; Taylor, David C; Fobert, Pierre R
2008-12-19
Seed oil accumulates primarily as triacylglycerol (TAG). While the biochemical pathway for TAG biosynthesis is known, its regulation remains unclear. Previous research identified microsomal diacylglycerol acyltransferase 1 (DGAT1, EC 2.3.1.20) as controlling a rate-limiting step in the TAG biosynthesis pathway. Of note, overexpression of DGAT1 results in substantial increases in oil content and seed size. To further analyze the global consequences of manipulating DGAT1 levels during seed development, a concerted transcriptome and metabolome analysis of transgenic B. napus prototypes was performed. Using a targeted Brassica cDNA microarray, about 200 genes were differentially expressed in two independent transgenic lines analyzed. Interestingly, 24-33% of the targets showing significant changes have no matching gene in Arabidopsis although these represent only 5% of the targets on the microarray. Further analysis of some of these novel transcripts indicated that several are inducible by ABA in microspore-derived embryos. Of the 200 Arabidopsis genes implicated in lipid biology present on the microarray, 36 were found to be differentially regulated in DGAT transgenic lines. Furthermore, kinetic reverse transcriptase Polymerase Chain Reaction (k-PCR) analysis revealed up-regulation of genes encoding enzymes of the Kennedy pathway involved in assembly of TAGs. Hormone profiling indicated that levels of auxins and cytokinins varied between transgenic lines and untransformed controls, while differences in the pool sizes of ABA and catabolites were only observed at later stages of development. Our results indicate that the increased TAG accumulation observed in transgenic DGAT1 plants is associated with modest transcriptional and hormonal changes during seed development that are not limited to the TAG biosynthesis pathway. These might be associated with feedback or feed-forward effects due to altered levels of DGAT1 activity. The fact that a large fraction of significant amplicons have no matching genes in Arabidopsis compromised our ability to draw concrete inferences from the data at this stage, but has led to the identification of novel genes of potential interest.
Analysis and modelling of septic shock microarray data using Singular Value Decomposition.
Allanki, Srinivas; Dixit, Madhulika; Thangaraj, Paul; Sinha, Nandan Kumar
2017-06-01
Being a high throughput technique, enormous amounts of microarray data has been generated and there arises a need for more efficient techniques of analysis, in terms of speed and accuracy. Finding the differentially expressed genes based on just fold change and p-value might not extract all the vital biological signals that occur at a lower gene expression level. Besides this, numerous mathematical models have been generated to predict the clinical outcome from microarray data, while very few, if not none, aim at predicting the vital genes that are important in a disease progression. Such models help a basic researcher narrow down and concentrate on a promising set of genes which leads to the discovery of gene-based therapies. In this article, as a first objective, we have used the lesser known and used Singular Value Decomposition (SVD) technique to build a microarray data analysis tool that works with gene expression patterns and intrinsic structure of the data in an unsupervised manner. We have re-analysed a microarray data over the clinical course of Septic shock from Cazalis et al. (2014) and have shown that our proposed analysis provides additional information compared to the conventional method. As a second objective, we developed a novel mathematical model that predicts a set of vital genes in the disease progression that works by generating samples in the continuum between health and disease, using a simple normal-distribution-based random number generator. We also verify that most of the predicted genes are indeed related to septic shock. Copyright © 2017 Elsevier Inc. All rights reserved.
Bikel, Shirley; Jacobo-Albavera, Leonor; Sánchez-Muñoz, Fausto; Cornejo-Granados, Fernanda; Canizales-Quinteros, Samuel; Soberón, Xavier; Sotelo-Mundo, Rogerio R.; del Río-Navarro, Blanca E.; Mendoza-Vargas, Alfredo; Sánchez, Filiberto
2017-01-01
Background In spite of the emergence of RNA sequencing (RNA-seq), microarrays remain in widespread use for gene expression analysis in the clinic. There are over 767,000 RNA microarrays from human samples in public repositories, which are an invaluable resource for biomedical research and personalized medicine. The absolute gene expression analysis allows the transcriptome profiling of all expressed genes under a specific biological condition without the need of a reference sample. However, the background fluorescence represents a challenge to determine the absolute gene expression in microarrays. Given that the Y chromosome is absent in female subjects, we used it as a new approach for absolute gene expression analysis in which the fluorescence of the Y chromosome genes of female subjects was used as the background fluorescence for all the probes in the microarray. This fluorescence was used to establish an absolute gene expression threshold, allowing the differentiation between expressed and non-expressed genes in microarrays. Methods We extracted the RNA from 16 children leukocyte samples (nine males and seven females, ages 6–10 years). An Affymetrix Gene Chip Human Gene 1.0 ST Array was carried out for each sample and the fluorescence of 124 genes of the Y chromosome was used to calculate the absolute gene expression threshold. After that, several expressed and non-expressed genes according to our absolute gene expression threshold were compared against the expression obtained using real-time quantitative polymerase chain reaction (RT-qPCR). Results From the 124 genes of the Y chromosome, three genes (DDX3Y, TXLNG2P and EIF1AY) that displayed significant differences between sexes were used to calculate the absolute gene expression threshold. Using this threshold, we selected 13 expressed and non-expressed genes and confirmed their expression level by RT-qPCR. Then, we selected the top 5% most expressed genes and found that several KEGG pathways were significantly enriched. Interestingly, these pathways were related to the typical functions of leukocytes cells, such as antigen processing and presentation and natural killer cell mediated cytotoxicity. We also applied this method to obtain the absolute gene expression threshold in already published microarray data of liver cells, where the top 5% expressed genes showed an enrichment of typical KEGG pathways for liver cells. Our results suggest that the three selected genes of the Y chromosome can be used to calculate an absolute gene expression threshold, allowing a transcriptome profiling of microarray data without the need of an additional reference experiment. Discussion Our approach based on the establishment of a threshold for absolute gene expression analysis will allow a new way to analyze thousands of microarrays from public databases. This allows the study of different human diseases without the need of having additional samples for relative expression experiments. PMID:29230367
DigOut: viewing differential expression genes as outliers.
Yu, Hui; Tu, Kang; Xie, Lu; Li, Yuan-Yuan
2010-12-01
With regards to well-replicated two-conditional microarray datasets, the selection of differentially expressed (DE) genes is a well-studied computational topic, but for multi-conditional microarray datasets with limited or no replication, the same task is not properly addressed by previous studies. This paper adopts multivariate outlier analysis to analyze replication-lacking multi-conditional microarray datasets, finding that it performs significantly better than the widely used limit fold change (LFC) model in a simulated comparative experiment. Compared with the LFC model, the multivariate outlier analysis also demonstrates improved stability against sample variations in a series of manipulated real expression datasets. The reanalysis of a real non-replicated multi-conditional expression dataset series leads to satisfactory results. In conclusion, a multivariate outlier analysis algorithm, like DigOut, is particularly useful for selecting DE genes from non-replicated multi-conditional gene expression dataset.
Prediction of regulatory gene pairs using dynamic time warping and gene ontology.
Yang, Andy C; Hsu, Hui-Huang; Lu, Ming-Da; Tseng, Vincent S; Shih, Timothy K
2014-01-01
Selecting informative genes is the most important task for data analysis on microarray gene expression data. In this work, we aim at identifying regulatory gene pairs from microarray gene expression data. However, microarray data often contain multiple missing expression values. Missing value imputation is thus needed before further processing for regulatory gene pairs becomes possible. We develop a novel approach to first impute missing values in microarray time series data by combining k-Nearest Neighbour (KNN), Dynamic Time Warping (DTW) and Gene Ontology (GO). After missing values are imputed, we then perform gene regulation prediction based on our proposed DTW-GO distance measurement of gene pairs. Experimental results show that our approach is more accurate when compared with existing missing value imputation methods on real microarray data sets. Furthermore, our approach can also discover more regulatory gene pairs that are known in the literature than other methods.
Temperature Gradient Effect on Gas Discrimination Power of a Metal-Oxide Thin-Film Sensor Microarray
Sysoev, Victor V.; Kiselev, Ilya; Frietsch, Markus; Goschnick, Joachim
2004-01-01
The paper presents results concerning the effect of spatial inhomogeneous operating temperature on the gas discrimination power of a gas-sensor microarray, with the latter based on a thin SnO2 film employed in the KAMINA electronic nose. Three different temperature distributions over the substrate are discussed: a nearly homogeneous one and two temperature gradients, equal to approx. 3.3 °C/mm and 6.7 °C/mm, applied across the sensor elements (segments) of the array. The gas discrimination power of the microarray is judged by using the Mahalanobis distance in the LDA (Linear Discrimination Analysis) coordinate system between the data clusters obtained by the response of the microarray to four target vapors: ethanol, acetone, propanol and ammonia. It is shown that the application of a temperature gradient increases the gas discrimination power of the microarray by up to 35 %.
Chondrocyte channel transcriptomics
Lewis, Rebecca; May, Hannah; Mobasheri, Ali; Barrett-Jolley, Richard
2013-01-01
To date, a range of ion channels have been identified in chondrocytes using a number of different techniques, predominantly electrophysiological and/or biomolecular; each of these has its advantages and disadvantages. Here we aim to compare and contrast the data available from biophysical and microarray experiments. This letter analyses recent transcriptomics datasets from chondrocytes, accessible from the European Bioinformatics Institute (EBI). We discuss whether such bioinformatic analysis of microarray datasets can potentially accelerate identification and discovery of ion channels in chondrocytes. The ion channels which appear most frequently across these microarray datasets are discussed, along with their possible functions. We discuss whether functional or protein data exist which support the microarray data. A microarray experiment comparing gene expression in osteoarthritis and healthy cartilage is also discussed and we verify the differential expression of 2 of these genes, namely the genes encoding large calcium-activated potassium (BK) and aquaporin channels. PMID:23995703
Salehi, Reza; Tsoi, Stephen C M; Colazo, Marcos G; Ambrose, Divakar J; Robert, Claude; Dyck, Michael K
2017-01-30
Early embryonic loss is a large contributor to infertility in cattle. Moreover, bovine becomes an interesting model to study human preimplantation embryo development due to their similar developmental process. Although genetic factors are known to affect early embryonic development, the discovery of such factors has been a serious challenge. Microarray technology allows quantitative measurement and gene expression profiling of transcript levels on a genome-wide basis. One of the main decisions that have to be made when planning a microarray experiment is whether to use a one- or two-color approach. Two-color design increases technical replication, minimizes variability, improves sensitivity and accuracy as well as allows having loop designs, defining the common reference samples. Although microarray is a powerful biological tool, there are potential pitfalls that can attenuate its power. Hence, in this technical paper we demonstrate an optimized protocol for RNA extraction, amplification, labeling, hybridization of the labeled amplified RNA to the array, array scanning and data analysis using the two-color analysis strategy.
Clustering gene expression data based on predicted differential effects of GV interaction.
Pan, Hai-Yan; Zhu, Jun; Han, Dan-Fu
2005-02-01
Microarray has become a popular biotechnology in biological and medical research. However, systematic and stochastic variabilities in microarray data are expected and unavoidable, resulting in the problem that the raw measurements have inherent "noise" within microarray experiments. Currently, logarithmic ratios are usually analyzed by various clustering methods directly, which may introduce bias interpretation in identifying groups of genes or samples. In this paper, a statistical method based on mixed model approaches was proposed for microarray data cluster analysis. The underlying rationale of this method is to partition the observed total gene expression level into various variations caused by different factors using an ANOVA model, and to predict the differential effects of GV (gene by variety) interaction using the adjusted unbiased prediction (AUP) method. The predicted GV interaction effects can then be used as the inputs of cluster analysis. We illustrated the application of our method with a gene expression dataset and elucidated the utility of our approach using an external validation.
GenePublisher: Automated analysis of DNA microarray data.
Knudsen, Steen; Workman, Christopher; Sicheritz-Ponten, Thomas; Friis, Carsten
2003-07-01
GenePublisher, a system for automatic analysis of data from DNA microarray experiments, has been implemented with a web interface at http://www.cbs.dtu.dk/services/GenePublisher. Raw data are uploaded to the server together with a specification of the data. The server performs normalization, statistical analysis and visualization of the data. The results are run against databases of signal transduction pathways, metabolic pathways and promoter sequences in order to extract more information. The results of the entire analysis are summarized in report form and returned to the user.
USDA-ARS?s Scientific Manuscript database
Background: To identify the genes involved in the development of low temperature (LT) tolerance in hexaploid wheat, we examined the global changes in expression in response to cold of the 55,052 potentially unique genes represented in the Affymetrix Wheat Genome microarray. We compared the expressi...
USDA-ARS?s Scientific Manuscript database
Chickens were immunized subcutaneously with an Eimeria recombinant profilin protein plus MontanideTM ISA 70 VG (ISA 70) or MontanideTM ISA 71 VG (ISA 71) water-in-oil adjuvants, or with profilin alone, and comparative RNA microarray analyses were performed to ascertain global transcriptomic changes ...
Microarray characterization of gene expression changes in blood during acute ethanol exposure
2013-01-01
Background As part of the civil aviation safety program to define the adverse effects of ethanol on flying performance, we performed a DNA microarray analysis of human whole blood samples from a five-time point study of subjects administered ethanol orally, followed by breathalyzer analysis, to monitor blood alcohol concentration (BAC) to discover significant gene expression changes in response to the ethanol exposure. Methods Subjects were administered either orange juice or orange juice with ethanol. Blood samples were taken based on BAC and total RNA was isolated from PaxGene™ blood tubes. The amplified cDNA was used in microarray and quantitative real-time polymerase chain reaction (RT-qPCR) analyses to evaluate differential gene expression. Microarray data was analyzed in a pipeline fashion to summarize and normalize and the results evaluated for relative expression across time points with multiple methods. Candidate genes showing distinctive expression patterns in response to ethanol were clustered by pattern and further analyzed for related function, pathway membership and common transcription factor binding within and across clusters. RT-qPCR was used with representative genes to confirm relative transcript levels across time to those detected in microarrays. Results Microarray analysis of samples representing 0%, 0.04%, 0.08%, return to 0.04%, and 0.02% wt/vol BAC showed that changes in gene expression could be detected across the time course. The expression changes were verified by qRT-PCR. The candidate genes of interest (GOI) identified from the microarray analysis and clustered by expression pattern across the five BAC points showed seven coordinately expressed groups. Analysis showed function-based networks, shared transcription factor binding sites and signaling pathways for members of the clusters. These include hematological functions, innate immunity and inflammation functions, metabolic functions expected of ethanol metabolism, and pancreatic and hepatic function. Five of the seven clusters showed links to the p38 MAPK pathway. Conclusions The results of this study provide a first look at changing gene expression patterns in human blood during an acute rise in blood ethanol concentration and its depletion because of metabolism and excretion, and demonstrate that it is possible to detect changes in gene expression using total RNA isolated from whole blood. The analysis approach for this study serves as a workflow to investigate the biology linked to expression changes across a time course and from these changes, to identify target genes that could serve as biomarkers linked to pilot performance. PMID:23883607
Maslow, Bat-Sheva L; Budinetz, Tara; Sueldo, Carolina; Anspach, Erica; Engmann, Lawrence; Benadiva, Claudio; Nulsen, John C
2015-07-01
To compare the analysis of chromosome number from paraffin-embedded products of conception using single-nucleotide polymorphism (SNP) microarray with the recommended screening for the evaluation of couples presenting with recurrent pregnancy loss who do not have previous fetal cytogenetic data. We performed a retrospective cohort study including all women who presented for a new evaluation of recurrent pregnancy loss over a 2-year period (January 1, 2012, to December 31, 2013). All participants had at least two documented first-trimester losses and both the recommended screening tests and SNP microarray performed on at least one paraffin-embedded products of conception sample. Single-nucleotide polymorphism microarray identifies all 24 chromosomes (22 autosomes, X, and Y). Forty-two women with a total of 178 losses were included in the study. Paraffin-embedded products of conception from 62 losses were sent for SNP microarray. Single-nucleotide polymorphism microarray successfully diagnosed fetal chromosome number in 71% (44/62) of samples, of which 43% (19/44) were euploid and 57% (25/44) were noneuploid. Seven of 42 (17%) participants had abnormalities on recurrent pregnancy loss screening. The per-person detection rate for a cause of pregnancy loss was significantly higher in the SNP microarray (0.50; 95% confidence interval [CI] 0.36-0.64) compared with recurrent pregnancy loss evaluation (0.17; 95% CI 0.08-0.31) (P=.002). Participants with one or more euploid loss identified on paraffin-embedded products of conception were significantly more likely to have an abnormality on recurrent pregnancy loss screening than those with only noneuploid results (P=.028). The significance remained when controlling for age, number of losses, number of samples, and total pregnancies. These results suggest that SNP microarray testing of paraffin-embedded products of conception is a valuable tool for the evaluation of recurrent pregnancy loss in patients without prior fetal cytogenetic results. Recommended recurrent pregnancy loss screening was unnecessary in almost half the patients in our study. II.
NASA Astrophysics Data System (ADS)
Liu, Robin H.; Longiaru, Mathew
2009-05-01
DNA microarrays are becoming a widespread tool used in life science and drug screening due to its many benefits of miniaturization and integration. Microarrays permit a highly multiplexed DNA analysis. Recently, the development of new detection methods and simplified methodologies has rapidly expanded the use of microarray technologies from predominantly gene expression analysis into the arena of diagnostics. Osmetech's eSensor® is an electrochemical detection platform based on a low-to- medium density DNA hybridization array on a cost-effective printed circuit board substrate. eSensor® has been cleared by FDA for Warfarin sensitivity test and Cystic Fibrosis Carrier Detection. Other genetic-based diagnostic and infectious disease detection tests are under development. The eSensor® platform eliminates the need for an expensive laser-based optical system and fluorescent reagents. It allows one to perform hybridization and detection in a single and small instrument without any fluidic processing and handling. Furthermore, the eSensor® platform is readily adaptable to on-chip sample-to-answer genetic analyses using microfluidics technology. The eSensor® platform provides a cost-effective solution to direct sample-to-answer genetic analysis, and thus have a potential impact in the fields of point-of-care genetic analysis, environmental testing, and biological warfare agent detection.
Chavan, Shweta S; Bauer, Michael A; Peterson, Erich A; Heuck, Christoph J; Johann, Donald J
2013-01-01
Transcriptome analysis by microarrays has produced important advances in biomedicine. For instance in multiple myeloma (MM), microarray approaches led to the development of an effective disease subtyping via cluster assignment, and a 70 gene risk score. Both enabled an improved molecular understanding of MM, and have provided prognostic information for the purposes of clinical management. Many researchers are now transitioning to Next Generation Sequencing (NGS) approaches and RNA-seq in particular, due to its discovery-based nature, improved sensitivity, and dynamic range. Additionally, RNA-seq allows for the analysis of gene isoforms, splice variants, and novel gene fusions. Given the voluminous amounts of historical microarray data, there is now a need to associate and integrate microarray and RNA-seq data via advanced bioinformatic approaches. Custom software was developed following a model-view-controller (MVC) approach to integrate Affymetrix probe set-IDs, and gene annotation information from a variety of sources. The tool/approach employs an assortment of strategies to integrate, cross reference, and associate microarray and RNA-seq datasets. Output from a variety of transcriptome reconstruction and quantitation tools (e.g., Cufflinks) can be directly integrated, and/or associated with Affymetrix probe set data, as well as necessary gene identifiers and/or symbols from a diversity of sources. Strategies are employed to maximize the annotation and cross referencing process. Custom gene sets (e.g., MM 70 risk score (GEP-70)) can be specified, and the tool can be directly assimilated into an RNA-seq pipeline. A novel bioinformatic approach to aid in the facilitation of both annotation and association of historic microarray data, in conjunction with richer RNA-seq data, is now assisting with the study of MM cancer biology.
Global mapping of transposon location.
Gabriel, Abram; Dapprich, Johannes; Kunkel, Mark; Gresham, David; Pratt, Stephen C; Dunham, Maitreya J
2006-12-15
Transposable genetic elements are ubiquitous, yet their presence or absence at any given position within a genome can vary between individual cells, tissues, or strains. Transposable elements have profound impacts on host genomes by altering gene expression, assisting in genomic rearrangements, causing insertional mutations, and serving as sources of phenotypic variation. Characterizing a genome's full complement of transposons requires whole genome sequencing, precluding simple studies of the impact of transposition on interindividual variation. Here, we describe a global mapping approach for identifying transposon locations in any genome, using a combination of transposon-specific DNA extraction and microarray-based comparative hybridization analysis. We use this approach to map the repertoire of endogenous transposons in different laboratory strains of Saccharomyces cerevisiae and demonstrate that transposons are a source of extensive genomic variation. We also apply this method to mapping bacterial transposon insertion sites in a yeast genomic library. This unique whole genome view of transposon location will facilitate our exploration of transposon dynamics, as well as defining bases for individual differences and adaptive potential.
Latorre, Mauricio; Ehrenfeld, Nicole; Cortés, María Paz; Travisany, Dante; Budinich, Marko; Aravena, Andrés; González, Mauricio; Bobadilla-Fazzini, Roberto A; Parada, Pilar; Maass, Alejandro
2016-01-01
In order to provide new information about the adaptation of Acidithiobacillus ferrooxidans during the bioleaching process, the current analysis presents the first report of the global transcriptional response of the native copper mine strain Wenelen (DSM 16786) oxidized under different sulfide minerals. Microarrays were used to measure the response of At. ferrooxidans Wenelen to shifts from iron supplemented liquid cultures (reference state) to the addition of solid substrates enriched in pyrite or chalcopyrite. Genes encoding for energy metabolism showed a similar transcriptional profile for the two sulfide minerals. Interestingly, four operons related to sulfur metabolism were over-expressed during growth on a reduced sulfur source. Genes associated with metal tolerance (RND and ATPases type P) were up-regulated in the presence of pyrite or chalcopyrite. These results suggest that At. ferrooxidans Wenelen presents an efficient transcriptional system developed to respond to environmental conditions, namely the ability to withstand high copper concentrations. Copyright © 2015 Elsevier Ltd. All rights reserved.
Optimization of cDNA microarrays procedures using criteria that do not rely on external standards.
Bruland, Torunn; Anderssen, Endre; Doseth, Berit; Bergum, Hallgeir; Beisvag, Vidar; Laegreid, Astrid
2007-10-18
The measurement of gene expression using microarray technology is a complicated process in which a large number of factors can be varied. Due to the lack of standard calibration samples such as are used in traditional chemical analysis it may be a problem to evaluate whether changes done to the microarray procedure actually improve the identification of truly differentially expressed genes. The purpose of the present work is to report the optimization of several steps in the microarray process both in laboratory practices and in data processing using criteria that do not rely on external standards. We performed a cDNA microarry experiment including RNA from samples with high expected differential gene expression termed "high contrasts" (rat cell lines AR42J and NRK52E) compared to self-self hybridization, and optimized a pipeline to maximize the number of genes found to be differentially expressed in the "high contrasts" RNA samples by estimating the false discovery rate (FDR) using a null distribution obtained from the self-self experiment. The proposed high-contrast versus self-self method (HCSSM) requires only four microarrays per evaluation. The effects of blocking reagent dose, filtering, and background corrections methodologies were investigated. In our experiments a dose of 250 ng LNA (locked nucleic acid) dT blocker, no background correction and weight based filtering gave the largest number of differentially expressed genes. The choice of background correction method had a stronger impact on the estimated number of differentially expressed genes than the choice of filtering method. Cross platform microarray (Illumina) analysis was used to validate that the increase in the number of differentially expressed genes found by HCSSM was real. The results show that HCSSM can be a useful and simple approach to optimize microarray procedures without including external standards. Our optimizing method is highly applicable to both long oligo-probe microarrays which have become commonly used for well characterized organisms such as man, mouse and rat, as well as to cDNA microarrays which are still of importance for organisms with incomplete genome sequence information such as many bacteria, plants and fish.
Optimization of cDNA microarrays procedures using criteria that do not rely on external standards
Bruland, Torunn; Anderssen, Endre; Doseth, Berit; Bergum, Hallgeir; Beisvag, Vidar; Lægreid, Astrid
2007-01-01
Background The measurement of gene expression using microarray technology is a complicated process in which a large number of factors can be varied. Due to the lack of standard calibration samples such as are used in traditional chemical analysis it may be a problem to evaluate whether changes done to the microarray procedure actually improve the identification of truly differentially expressed genes. The purpose of the present work is to report the optimization of several steps in the microarray process both in laboratory practices and in data processing using criteria that do not rely on external standards. Results We performed a cDNA microarry experiment including RNA from samples with high expected differential gene expression termed "high contrasts" (rat cell lines AR42J and NRK52E) compared to self-self hybridization, and optimized a pipeline to maximize the number of genes found to be differentially expressed in the "high contrasts" RNA samples by estimating the false discovery rate (FDR) using a null distribution obtained from the self-self experiment. The proposed high-contrast versus self-self method (HCSSM) requires only four microarrays per evaluation. The effects of blocking reagent dose, filtering, and background corrections methodologies were investigated. In our experiments a dose of 250 ng LNA (locked nucleic acid) dT blocker, no background correction and weight based filtering gave the largest number of differentially expressed genes. The choice of background correction method had a stronger impact on the estimated number of differentially expressed genes than the choice of filtering method. Cross platform microarray (Illumina) analysis was used to validate that the increase in the number of differentially expressed genes found by HCSSM was real. Conclusion The results show that HCSSM can be a useful and simple approach to optimize microarray procedures without including external standards. Our optimizing method is highly applicable to both long oligo-probe microarrays which have become commonly used for well characterized organisms such as man, mouse and rat, as well as to cDNA microarrays which are still of importance for organisms with incomplete genome sequence information such as many bacteria, plants and fish. PMID:17949480
Hu, Hejing; Zhang, Yannan; Shi, Yanfeng; Feng, Lin; Duan, Junchao; Sun, Zhiwei
2017-10-01
With rapid development of nanotechnology and growing environmental pollution, the combined toxic effects of SiNPs and pollutants of heavy metals like lead have received global attentions. The aim of this study was to explore the cardiovascular effects of the co-exposure of SiNPs and lead acetate (PbAc) in zebrafish using microarray and bioinformatics analysis. Although there was no other obvious cardiovascular malformation except bleeding phenotype, bradycardia, angiogenesis inhibition and declined cardiac output in zebrafish co-exposed of SiNPs and PbAc at NOAEL level, significant changes were observed in mRNA and microRNA (miRNA) expression patterns. STC-GO analysis indicated that the co-exposure might have more toxic effects on cardiovascular system than that exposure alone. Key differentially expressed genes were discerned out based on the Dynamic-gene-network, including stxbp1a, ndfip2, celf4 and gsk3b. Furthermore, several miRNAs obtained from the miRNA-Gene-Network might play crucial roles in cardiovascular disease, such as dre-miR-93, dre-miR-34a, dre-miR-181c, dre-miR-7145, dre-miR-730, dre-miR-129-5p, dre-miR-19d, dre-miR-218b, dre-miR-221. Besides, the analysis of miRNA-pathway-network indicated that the zebrafish were stimulated by the co-exposure of SiNPs and PbAc, which might cause the disturbance of calcium homeostasis and endoplasmic reticulum stress. As a result, cardiac muscle contraction might be deteriorated. In general, our data provide abundant fundamental research clues to the combined toxicity of environmental pollutants and further in-depth verifications are needed. Copyright © 2017 Elsevier Ltd. All rights reserved.
Kanika, Nirmala D; Chang, Jinsook; Tong, Yuehong; Tiplitsky, Scott; Lin, Juan; Yohannes, Elizabeth; Tar, Moses; Chance, Mark; Christ, George J; Melman, Arnold; Davies, Kelvin D
2011-05-01
• To investigate the role that oxidative stress plays in the development of diabetic cystopathy. • Comparative gene expression in the bladder of non-diabetic and streptozotocin (STZ)-induced 2-month- old diabetic rats was carried out using microarray analysis. • Evidence of oxidative stress was investigated in the bladder by analyzing glutathione S-transferase activity, lipid peroxidation, and carbonylation and nitrosylation of proteins. • The activity of protein degradation pathways was assessed using Western blot analysis. • Analysis of global gene expression showed that detrusor smooth muscle tissue of STZ-induced diabetes undergoes significant enrichment in targets involved in the production or regulation of reactive oxygen species (P = 1.27 × 10(-10)). The microarray analysis was confirmed by showing that markers of oxidative stress were all significantly increased in the diabetic bladder. • It was hypothesized that the sequelae to oxidative stress would be increased protein damage and apoptosis. • This was confirmed by showing that two key proteins involved in protein degradation (Nedd4 and LC3B) were greatly up-regulated in diabetic bladders compared to controls by 12.2 ± 0.76 and 4.4 ± 1.0-fold, respectively, and the apoptosis inducing protein, BAX, was up-regulated by 6.76 ± 0.76-fold. • Overall, the findings obtained in the present study add to the growing body of evidence showing that diabetic cystopathy is associated with oxidative damage of smooth muscle cells, and results in protein damage and activation of apoptotic pathways that may contribute to a deterioration in bladder function. © 2010 THE AUTHORS; BJU INTERNATIONAL © 2010 BJU INTERNATIONAL.
Krieg, S A; Fan, X; Hong, Y; Sang, Q-X; Giaccia, A; Westphal, L M; Lathi, R B; Krieg, A J; Nayak, N R
2012-09-01
Recurrent pregnancy loss (RPL) occurs in ∼5% of women. However, the etiology is still poorly understood. Defects in decidualization of the endometrium during early pregnancy contribute to several pregnancy complications, such as pre-eclampsia and intrauterine growth restriction (IUGR), and are believed to be important in the pathogenesis of idiopathic RPL. We performed microarray analysis to identify gene expression alterations in the deciduas of idiopathic RPL patients. Control patients had one antecedent term delivery, but were undergoing dilation and curettage for current aneuploid miscarriage. Gene expression differences were evaluated using both pathway and gene ontology (GO) analysis. Selected genes were validated using quantitative reverse transcription-polymerase chain reaction (qRT-PCR). A total of 155 genes were found to be significantly dysregulated in the deciduas of RPL patients (>2-fold change, P < 0.05), with 22 genes up-regulated and 133 genes down-regulated. GO analysis linked a large percentage of genes to discrete biological functions, including immune response (23%), cell signaling (18%) and cell invasion (17.1%), and pathway analysis revealed consistent changes in both the interleukin 1 (IL-1) and IL-8 pathways. All genes in the IL-8 pathway were up-regulated while genes in the IL-1 pathway were down-regulated. Although both pathways can promote inflammation, IL-1 pathway activity is important for normal implantation. Additionally, genes known to be critical for degradation of the extracellular matrix, including matrix metalloproteinase 26 and serine peptidase inhibitor Kazal-type 1, were also highly up-regulated. In this first microarray approach to decidual gene expression in RPL patients, our data suggest that dysregulation of genes associated with cell invasion and immunity may contribute significantly to idiopathic recurrent miscarriage.
Khan, Haseeb Ahmad
2004-01-01
The massive surge in the production of microarray data poses a great challenge for proper analysis and interpretation. In recent years numerous computational tools have been developed to extract meaningful interpretation of microarray gene expression data. However, a convenient tool for two-groups comparison of microarray data is still lacking and users have to rely on commercial statistical packages that might be costly and require special skills, in addition to extra time and effort for transferring data from one platform to other. Various statistical methods, including the t-test, analysis of variance, Pearson test and Mann-Whitney U test, have been reported for comparing microarray data, whereas the utilization of the Wilcoxon signed-rank test, which is an appropriate test for two-groups comparison of gene expression data, has largely been neglected in microarray studies. The aim of this investigation was to build an integrated tool, ArraySolver, for colour-coded graphical display and comparison of gene expression data using the Wilcoxon signed-rank test. The results of software validation showed similar outputs with ArraySolver and SPSS for large datasets. Whereas the former program appeared to be more accurate for 25 or fewer pairs (n < or = 25), suggesting its potential application in analysing molecular signatures that usually contain small numbers of genes. The main advantages of ArraySolver are easy data selection, convenient report format, accurate statistics and the familiar Excel platform.
2004-01-01
The massive surge in the production of microarray data poses a great challenge for proper analysis and interpretation. In recent years numerous computational tools have been developed to extract meaningful interpretation of microarray gene expression data. However, a convenient tool for two-groups comparison of microarray data is still lacking and users have to rely on commercial statistical packages that might be costly and require special skills, in addition to extra time and effort for transferring data from one platform to other. Various statistical methods, including the t-test, analysis of variance, Pearson test and Mann–Whitney U test, have been reported for comparing microarray data, whereas the utilization of the Wilcoxon signed-rank test, which is an appropriate test for two-groups comparison of gene expression data, has largely been neglected in microarray studies. The aim of this investigation was to build an integrated tool, ArraySolver, for colour-coded graphical display and comparison of gene expression data using the Wilcoxon signed-rank test. The results of software validation showed similar outputs with ArraySolver and SPSS for large datasets. Whereas the former program appeared to be more accurate for 25 or fewer pairs (n ≤ 25), suggesting its potential application in analysing molecular signatures that usually contain small numbers of genes. The main advantages of ArraySolver are easy data selection, convenient report format, accurate statistics and the familiar Excel platform. PMID:18629036
RECOVERING FILTER-BASED MICROARRAY DATA FOR PATHWAYS ANALYSIS USING A MULTIPOINT ALIGNMENT STRATEGY
The use of commercial microarrays are rapidly becoming the method of choice for profiling gene expression and assessing various disease states. Research Genetics has provided a series of well defined biological and software tools to the research community for these analyses. Th...
Estimating gene function with least squares nonnegative matrix factorization.
Wang, Guoli; Ochs, Michael F
2007-01-01
Nonnegative matrix factorization is a machine learning algorithm that has extracted information from data in a number of fields, including imaging and spectral analysis, text mining, and microarray data analysis. One limitation with the method for linking genes through microarray data in order to estimate gene function is the high variance observed in transcription levels between different genes. Least squares nonnegative matrix factorization uses estimates of the uncertainties on the mRNA levels for each gene in each condition, to guide the algorithm to a local minimum in normalized chi2, rather than a Euclidean distance or divergence between the reconstructed data and the data itself. Herein, application of this method to microarray data is demonstrated in order to predict gene function.
Microarray expression technology: from start to finish.
Elvidge, Gareth
2006-01-01
The recent introduction of new microarray expression technologies and the further development of established platforms ensure that the researcher is presented with a range of options for performing an experiment. Whilst this has opened up the possibilities for future applications, such as exon-specific arrays, increased sample throughput and 'chromatin immunoprecipitation (ChIP) on chip' experiments, the initial decision processes and experiment planning are made more difficult. This review will give an overview of the various technologies that are available to perform a microarray expression experiment, from the initial planning stages through to the final data analysis. Both practical aspects and data analysis options will be considered. The relative advantages and disadvantages will be discussed with insights provided for future directions of the technology.
Single molecule fluorescence microscopy for ultra-sensitive RNA expression profiling
NASA Astrophysics Data System (ADS)
Hesse, Jan; Jacak, Jaroslaw; Regl, Gerhard; Eichberger, Thomas; Aberger, Fritz; Schlapak, Robert; Howorka, Stefan; Muresan, Leila; Frischauf, Anna-Maria; Schütz, Gerhard J.
2007-02-01
We developed a microarray analysis platform for ultra-sensitive RNA expression profiling of minute samples. It utilizes a novel scanning system for single molecule fluorescence detection on cm2 size samples in combination with specialized biochips, optimized for low autofluorescence and weak unspecific adsorption. 20 μg total RNA was extracted from 10 6 cells of a human keratinocyte cell line (HaCaT) and reversely transcribed in the presence of Alexa647-aha-dUTP. 1% of the resulting labeled cDNA was used for complex hybridization to a custom-made oligonucleotide microarray representing a set of 125 different genes. For low abundant genes, individual cDNA molecules hybridized to the microarray spots could be resolved. Single cDNA molecules hybridized to the chip surface appeared as diffraction limited features in the fluorescence images. The à trous wavelet method was utilized for localization and counting of the separated cDNA signals. Subsequently, the degree of labeling of the localized cDNA molecules was determined by brightness analysis for the different genes. Variations by factors up to 6 were found, which in conventional microarray analysis would result in a misrepresentation of the relative abundance of mRNAs.
Schröder, Christoph; Jacob, Anette; Tonack, Sarah; Radon, Tomasz P.; Sill, Martin; Zucknick, Manuela; Rüffer, Sven; Costello, Eithne; Neoptolemos, John P.; Crnogorac-Jurcevic, Tatjana; Bauer, Andrea; Fellenberg, Kurt; Hoheisel, Jörg D.
2010-01-01
Antibody microarrays have the potential to enable comprehensive proteomic analysis of small amounts of sample material. Here, protocols are presented for the production, quality assessment, and reproducible application of antibody microarrays in a two-color mode with an array of 1,800 features, representing 810 antibodies that were directed at 741 cancer-related proteins. In addition to measures of array quality, we implemented indicators for the accuracy and significance of dual-color detection. Dual-color measurements outperform a single-color approach concerning assay reproducibility and discriminative power. In the analysis of serum samples, depletion of high-abundance proteins did not improve technical assay quality. On the contrary, depletion introduced a strong bias in protein representation. In an initial study, we demonstrated the applicability of the protocols to proteins derived from urine samples. We identified differences between urine samples from pancreatic cancer patients and healthy subjects and between sexes. This study demonstrates that biomedically relevant data can be produced. As demonstrated by the thorough quality analysis, the dual-color antibody array approach proved to be competitive with other proteomic techniques and comparable in performance to transcriptional microarray analyses. PMID:20164060
Estimating differential expression from multiple indicators
Ilmjärv, Sten; Hundahl, Christian Ansgar; Reimets, Riin; Niitsoo, Margus; Kolde, Raivo; Vilo, Jaak; Vasar, Eero; Luuk, Hendrik
2014-01-01
Regardless of the advent of high-throughput sequencing, microarrays remain central in current biomedical research. Conventional microarray analysis pipelines apply data reduction before the estimation of differential expression, which is likely to render the estimates susceptible to noise from signal summarization and reduce statistical power. We present a probe-level framework, which capitalizes on the high number of concurrent measurements to provide more robust differential expression estimates. The framework naturally extends to various experimental designs and target categories (e.g. transcripts, genes, genomic regions) as well as small sample sizes. Benchmarking in relation to popular microarray and RNA-sequencing data-analysis pipelines indicated high and stable performance on the Microarray Quality Control dataset and in a cell-culture model of hypoxia. Experimental-data-exhibiting long-range epigenetic silencing of gene expression was used to demonstrate the efficacy of detecting differential expression of genomic regions, a level of analysis not embraced by conventional workflows. Finally, we designed and conducted an experiment to identify hypothermia-responsive genes in terms of monotonic time-response. As a novel insight, hypothermia-dependent up-regulation of multiple genes of two major antioxidant pathways was identified and verified by quantitative real-time PCR. PMID:24586062
Curcumin modulates DNA methylation in colorectal cancer cells.
Link, Alexander; Balaguer, Francesc; Shen, Yan; Lozano, Juan Jose; Leung, Hon-Chiu E; Boland, C Richard; Goel, Ajay
2013-01-01
Recent evidence suggests that several dietary polyphenols may exert their chemopreventive effect through epigenetic modifications. Curcumin is one of the most widely studied dietary chemopreventive agents for colon cancer prevention, however, its effects on epigenetic alterations, particularly DNA methylation, remain unclear. Using systematic genome-wide approaches, we aimed to elucidate the effect of curcumin on DNA methylation alterations in colorectal cancer cells. To evaluate the effect of curcumin on DNA methylation, three CRC cell lines, HCT116, HT29 and RKO, were treated with curcumin. 5-aza-2'-deoxycytidine (5-aza-CdR) and trichostatin A treated cells were used as positive and negative controls for DNA methylation changes, respectively. Methylation status of LINE-1 repeat elements, DNA promoter methylation microarrays and gene expression arrays were used to assess global methylation and gene expression changes. Validation was performed using independent microarrays, quantitative bisulfite pyrosequencing, and qPCR. As expected, genome-wide methylation microarrays revealed significant DNA hypomethylation in 5-aza-CdR-treated cells (mean β-values of 0.12), however, non-significant changes in mean β-values were observed in curcumin-treated cells. In comparison to mock-treated cells, curcumin-induced DNA methylation alterations occurred in a time-dependent manner. In contrast to the generalized, non-specific global hypomethylation observed with 5-aza-CdR, curcumin treatment resulted in methylation changes at selected, partially-methylated loci, instead of fully-methylated CpG sites. DNA methylation alterations were supported by corresponding changes in gene expression at both up- and down-regulated genes in various CRC cell lines. Our data provide previously unrecognized evidence for curcumin-mediated DNA methylation alterations as a potential mechanism of colon cancer chemoprevention. In contrast to non-specific global hypomethylation induced by 5-aza-CdR, curcumin-induced methylation changes occurred only in a subset of partially-methylated genes, which provides additional mechanistic insights into the potent chemopreventive effect of this dietary nutraceutical.
Curcumin Modulates DNA Methylation in Colorectal Cancer Cells
Link, Alexander; Balaguer, Francesc; Shen, Yan; Lozano, Juan Jose; Leung, Hon-Chiu E.; Boland, C. Richard; Goel, Ajay
2013-01-01
Aim Recent evidence suggests that several dietary polyphenols may exert their chemopreventive effect through epigenetic modifications. Curcumin is one of the most widely studied dietary chemopreventive agents for colon cancer prevention, however, its effects on epigenetic alterations, particularly DNA methylation, remain unclear. Using systematic genome-wide approaches, we aimed to elucidate the effect of curcumin on DNA methylation alterations in colorectal cancer cells. Materials and Methods To evaluate the effect of curcumin on DNA methylation, three CRC cell lines, HCT116, HT29 and RKO, were treated with curcumin. 5-aza-2′-deoxycytidine (5-aza-CdR) and trichostatin A treated cells were used as positive and negative controls for DNA methylation changes, respectively. Methylation status of LINE-1 repeat elements, DNA promoter methylation microarrays and gene expression arrays were used to assess global methylation and gene expression changes. Validation was performed using independent microarrays, quantitative bisulfite pyrosequencing, and qPCR. Results As expected, genome-wide methylation microarrays revealed significant DNA hypomethylation in 5-aza-CdR-treated cells (mean β-values of 0.12), however, non-significant changes in mean β-values were observed in curcumin-treated cells. In comparison to mock-treated cells, curcumin-induced DNA methylation alterations occurred in a time-dependent manner. In contrast to the generalized, non-specific global hypomethylation observed with 5-aza-CdR, curcumin treatment resulted in methylation changes at selected, partially-methylated loci, instead of fully-methylated CpG sites. DNA methylation alterations were supported by corresponding changes in gene expression at both up- and down-regulated genes in various CRC cell lines. Conclusions Our data provide previously unrecognized evidence for curcumin-mediated DNA methylation alterations as a potential mechanism of colon cancer chemoprevention. In contrast to non-specific global hypomethylation induced by 5-aza-CdR, curcumin-induced methylation changes occurred only in a subset of partially-methylated genes, which provides additional mechanistic insights into the potent chemopreventive effect of this dietary nutraceutical. PMID:23460897
Kim, Chang Sup; Seo, Jeong Hyun; Cha, Hyung Joon
2012-08-07
The development of analytical tools is important for understanding the infection mechanisms of pathogenic bacteria or viruses. In the present work, a functional carbohydrate microarray combined with a fluorescence immunoassay was developed to analyze the interactions of Vibrio cholerae toxin (ctx) proteins and GM1-related carbohydrates. Ctx proteins were loaded onto the surface-immobilized GM1 pentasaccharide and six related carbohydrates, and their binding affinities were detected immunologically. The analysis of the ctx-carbohydrate interactions revealed that the intrinsic selectivity of ctx was GM1 pentasaccharide ≫ GM2 tetrasaccharide > asialo GM1 tetrasaccharide ≥ GM3trisaccharide, indicating that a two-finger grip formation and the terminal monosaccharides play important roles in the ctx-GM1 interaction. In addition, whole cholera toxin (ctxAB(5)) had a stricter substrate specificity and a stronger binding affinity than only the cholera toxin B subunit (ctxB). On the basis of the quantitative analysis, the carbohydrate microarray showed the sensitivity of detection of the ctxAB(5)-GM1 interaction with a limit-of-detection (LOD) of 2 ng mL(-1) (23 pM), which is comparable to other reported high sensitivity assay tools. In addition, the carbohydrate microarray successfully detected the actual toxin directly secreted from V. cholerae, without showing cross-reactivity to other bacteria. Collectively, these results demonstrate that the functional carbohydrate microarray is suitable for analyzing toxin protein-carbohydrate interactions and can be applied as a biosensor for toxin detection.
Bacterial identification and subtyping using DNA microarray and DNA sequencing.
Al-Khaldi, Sufian F; Mossoba, Magdi M; Allard, Marc M; Lienau, E Kurt; Brown, Eric D
2012-01-01
The era of fast and accurate discovery of biological sequence motifs in prokaryotic and eukaryotic cells is here. The co-evolution of direct genome sequencing and DNA microarray strategies not only will identify, isotype, and serotype pathogenic bacteria, but also it will aid in the discovery of new gene functions by detecting gene expressions in different diseases and environmental conditions. Microarray bacterial identification has made great advances in working with pure and mixed bacterial samples. The technological advances have moved beyond bacterial gene expression to include bacterial identification and isotyping. Application of new tools such as mid-infrared chemical imaging improves detection of hybridization in DNA microarrays. The research in this field is promising and future work will reveal the potential of infrared technology in bacterial identification. On the other hand, DNA sequencing by using 454 pyrosequencing is so cost effective that the promise of $1,000 per bacterial genome sequence is becoming a reality. Pyrosequencing technology is a simple to use technique that can produce accurate and quantitative analysis of DNA sequences with a great speed. The deposition of massive amounts of bacterial genomic information in databanks is creating fingerprint phylogenetic analysis that will ultimately replace several technologies such as Pulsed Field Gel Electrophoresis. In this chapter, we will review (1) the use of DNA microarray using fluorescence and infrared imaging detection for identification of pathogenic bacteria, and (2) use of pyrosequencing in DNA cluster analysis to fingerprint bacterial phylogenetic trees.
Comparisons of Robustness and Sensitivity between Cancer and Normal Cells by Microarray Data
Chu, Liang-Hui; Chen, Bor-Sen
2008-01-01
Robustness is defined as the ability to uphold performance in face of perturbations and uncertainties, and sensitivity is a measure of the system deviations generated by perturbations to the system. While cancer appears as a robust but fragile system, few computational and quantitative evidences demonstrate robustness tradeoffs in cancer. Microarrays have been widely applied to decipher gene expression signatures in human cancer research, and quantification of global gene expression profiles facilitates precise prediction and modeling of cancer in systems biology. We provide several efficient computational methods based on system and control theory to compare robustness and sensitivity between cancer and normal cells by microarray data. Measurement of robustness and sensitivity by linear stochastic model is introduced in this study, which shows oscillations in feedback loops of p53 and demonstrates robustness tradeoffs that cancer is a robust system with some extreme fragilities. In addition, we measure sensitivity of gene expression to perturbations in other gene expression and kinetic parameters, discuss nonlinear effects in feedback loops of p53 and extend our method to robustness-based cancer drug design. PMID:19259409
Removing technical variability in RNA-seq data using conditional quantile normalization.
Hansen, Kasper D; Irizarry, Rafael A; Wu, Zhijin
2012-04-01
The ability to measure gene expression on a genome-wide scale is one of the most promising accomplishments in molecular biology. Microarrays, the technology that first permitted this, were riddled with problems due to unwanted sources of variability. Many of these problems are now mitigated, after a decade's worth of statistical methodology development. The recently developed RNA sequencing (RNA-seq) technology has generated much excitement in part due to claims of reduced variability in comparison to microarrays. However, we show that RNA-seq data demonstrate unwanted and obscuring variability similar to what was first observed in microarrays. In particular, we find guanine-cytosine content (GC-content) has a strong sample-specific effect on gene expression measurements that, if left uncorrected, leads to false positives in downstream results. We also report on commonly observed data distortions that demonstrate the need for data normalization. Here, we describe a statistical methodology that improves precision by 42% without loss of accuracy. Our resulting conditional quantile normalization algorithm combines robust generalized regression to remove systematic bias introduced by deterministic features such as GC-content and quantile normalization to correct for global distortions.
Yang, Mingxing; Li, Xiumin; Li, Zhibin; Ou, Zhimin; Liu, Ming; Liu, Suhuan; Li, Xuejun; Yang, Shuyu
2013-01-01
DNA microarray analysis is characterized by obtaining a large number of gene variables from a small number of observations. Cluster analysis is widely used to analyze DNA microarray data to make classification and diagnosis of disease. Because there are so many irrelevant and insignificant genes in a dataset, a feature selection approach must be employed in data analysis. The performance of cluster analysis of this high-throughput data depends on whether the feature selection approach chooses the most relevant genes associated with disease classes. Here we proposed a new method using multiple Orthogonal Partial Least Squares-Discriminant Analysis (mOPLS-DA) models and S-plots to select the most relevant genes to conduct three-class disease classification and prediction. We tested our method using Golub's leukemia microarray data. For three classes with subtypes, we proposed hierarchical orthogonal partial least squares-discriminant analysis (OPLS-DA) models and S-plots to select features for two main classes and their subtypes. For three classes in parallel, we employed three OPLS-DA models and S-plots to choose marker genes for each class. The power of feature selection to classify and predict three-class disease was evaluated using cluster analysis. Further, the general performance of our method was tested using four public datasets and compared with those of four other feature selection methods. The results revealed that our method effectively selected the most relevant features for disease classification and prediction, and its performance was better than that of the other methods.
Changes in Global Transcriptional Profiling of Women Following Obesity Surgery Bypass.
Pinhel, Marcela Augusta de Souza; Noronha, Natalia Yumi; Nicoletti, Carolina Ferreira; de Oliveira, Bruno Affonso Parente; Cortes-Oliveira, Cristiana; Pinhanelli, Vitor Caressato; Salgado Junior, Wilson; Machry, Ana Julia; da Silva Junior, Wilson Araújo; Souza, Dorotéia Rossi Silva; Marchini, Júlio Sérgio; Nonino, Carla Barbosa
2018-01-01
Differential gene expression in peripheral blood mononuclear cells (PBMCs) after Roux-en-Y gastric bypass (RYGB) is poorly characterized. Markers of these processes may provide a deeper understanding of the mechanisms that underlie these events. The main goal of this study was to identify changes in PBMC gene expression in women with obesity before and 6 months after RYGB-induced weight loss. The ribonucleic acid (RNA) of PBMCs from 13 obese women was analyzed before and 6 months after RYGB; the RNA of PBMCs from nine healthy women served as control. The gene expression levels were determined by microarray analysis. Significant differences in gene expression were validated by real-time quantitative polymerase chain reaction (RT-qPCR). Microarray analysis for comparison of the pre- and postoperative periods showed that 1366 genes were differentially expressed genes (DEGs). The main pathways were related to gene transcription; lipid, energy, and glycide metabolism; inflammatory and immunological response; cell differentiation; oxidative stress regulation; response to endogenous and exogenous stimuli; substrate oxidation; mTOR signaling pathway; interferon signaling; mitogen-activated protein kinases (MAPK), cAMP response element binding protein (CREB1), heat shock factor 1 (HSF1), and sterol regulatory element binding protein 1c (SREBP-1c) gene expression; adipocyte differentiation; and methylation. Six months after bariatric surgery and significant weight loss, many molecular pathways involved in obesity and metabolic diseases change. These findings are an important tool to identify potential targets for therapeutic intervention and clinical practice of nutritional genomics in obesity.
Single cell gene expression profiling in Alzheimer's disease.
Ginsberg, Stephen D; Che, Shaoli; Counts, Scott E; Mufson, Elliott J
2006-07-01
Development and implementation of microarray techniques to quantify expression levels of dozens to hundreds to thousands of transcripts simultaneously within select tissue samples from normal control subjects and neurodegenerative diseased brains has enabled scientists to create molecular fingerprints of vulnerable neuronal populations in Alzheimer's disease (AD) and related disorders. A goal is to sample gene expression from homogeneous cell types within a defined region without potential contamination by expression profiles of adjacent neuronal subpopulations and nonneuronal cells. The precise resolution afforded by single cell and population cell RNA analysis in combination with microarrays and real-time quantitative polymerase chain reaction (qPCR)-based analyses allows for relative gene expression level comparisons across cell types under different experimental conditions and disease progression. The ability to analyze single cells is an important distinction from global and regional assessments of mRNA expression and can be applied to optimally prepared tissues from animal models of neurodegeneration as well as postmortem human brain tissues. Gene expression analysis in postmortem AD brain regions including the hippocampal formation and neocortex reveals selectively vulnerable cell types share putative pathogenetic alterations in common classes of transcripts, for example, markers of glutamatergic neurotransmission, synaptic-related markers, protein phosphatases and kinases, and neurotrophins/neurotrophin receptors. Expression profiles of vulnerable regions and neurons may reveal important clues toward the understanding of the molecular pathogenesis of various neurological diseases and aid in identifying rational targets toward pharmacotherapeutic interventions for progressive, late-onset neurodegenerative disorders such as mild cognitive impairment (MCI) and AD.
2010-01-01
Background The large amount of high-throughput genomic data has facilitated the discovery of the regulatory relationships between transcription factors and their target genes. While early methods for discovery of transcriptional regulation relationships from microarray data often focused on the high-throughput experimental data alone, more recent approaches have explored the integration of external knowledge bases of gene interactions. Results In this work, we develop an algorithm that provides improved performance in the prediction of transcriptional regulatory relationships by supplementing the analysis of microarray data with a new method of integrating information from an existing knowledge base. Using a well-known dataset of yeast microarrays and the Yeast Proteome Database, a comprehensive collection of known information of yeast genes, we show that knowledge-based predictions demonstrate better sensitivity and specificity in inferring new transcriptional interactions than predictions from microarray data alone. We also show that comprehensive, direct and high-quality knowledge bases provide better prediction performance. Comparison of our results with ChIP-chip data and growth fitness data suggests that our predicted genome-wide regulatory pairs in yeast are reasonable candidates for follow-up biological verification. Conclusion High quality, comprehensive, and direct knowledge bases, when combined with appropriate bioinformatic algorithms, can significantly improve the discovery of gene regulatory relationships from high throughput gene expression data. PMID:20122245
Seok, Junhee; Kaushal, Amit; Davis, Ronald W; Xiao, Wenzhong
2010-01-18
The large amount of high-throughput genomic data has facilitated the discovery of the regulatory relationships between transcription factors and their target genes. While early methods for discovery of transcriptional regulation relationships from microarray data often focused on the high-throughput experimental data alone, more recent approaches have explored the integration of external knowledge bases of gene interactions. In this work, we develop an algorithm that provides improved performance in the prediction of transcriptional regulatory relationships by supplementing the analysis of microarray data with a new method of integrating information from an existing knowledge base. Using a well-known dataset of yeast microarrays and the Yeast Proteome Database, a comprehensive collection of known information of yeast genes, we show that knowledge-based predictions demonstrate better sensitivity and specificity in inferring new transcriptional interactions than predictions from microarray data alone. We also show that comprehensive, direct and high-quality knowledge bases provide better prediction performance. Comparison of our results with ChIP-chip data and growth fitness data suggests that our predicted genome-wide regulatory pairs in yeast are reasonable candidates for follow-up biological verification. High quality, comprehensive, and direct knowledge bases, when combined with appropriate bioinformatic algorithms, can significantly improve the discovery of gene regulatory relationships from high throughput gene expression data.
Vartanian, Kristina; Slottke, Rachel; Johnstone, Timothy; Casale, Amanda; Planck, Stephen R; Choi, Dongseok; Smith, Justine R; Rosenbaum, James T; Harrington, Christina A
2009-01-01
Background Peripheral blood is an accessible and informative source of transcriptomal information for many human disease and pharmacogenomic studies. While there can be significant advantages to analyzing RNA isolated from whole blood, particularly in clinical studies, the preparation of samples for microarray analysis is complicated by the need to minimize artifacts associated with highly abundant globin RNA transcripts. The impact of globin RNA transcripts on expression profiling data can potentially be reduced by using RNA preparation and labeling methods that remove or block globin RNA during the microarray assay. We compared four different methods for preparing microarray hybridization targets from human whole blood collected in PAXGene tubes. Three of the methods utilized the Affymetrix one-cycle cDNA synthesis/in vitro transcription protocol but varied treatment of input RNA as follows: i. no treatment; ii. treatment with GLOBINclear; or iii. treatment with globin PNA oligos. In the fourth method cDNA targets were prepared with the Ovation amplification and labeling system. Results We find that microarray targets generated with labeling methods that reduce globin mRNA levels or minimize the impact of globin transcripts during hybridization detect more transcripts in the microarray assay compared with the standard Affymetrix method. Comparison of microarray results with quantitative PCR analysis of a panel of genes from the NF-kappa B pathway shows good correlation of transcript measurements produced with all four target preparation methods, although method-specific differences in overall correlation were observed. The impact of freezing blood collected in PAXGene tubes on data reproducibility was also examined. Expression profiles show little or no difference when RNA is extracted from either fresh or frozen blood samples. Conclusion RNA preparation and labeling methods designed to reduce the impact of globin mRNA transcripts can significantly improve the sensitivity of the DNA microarray expression profiling assay for whole blood samples. While blockage of globin transcripts during first strand cDNA synthesis with globin PNAs resulted in the best overall performance in this study, we conclude that selection of a protocol for expression profiling studies in blood should depend on several factors, including implementation requirements of the method and study design. RNA isolated from either freshly collected or frozen blood samples stored in PAXGene tubes can be used without altering gene expression profiles. PMID:19123946
Liu, Bao-Hong; Cai, Jian-Ping
2017-01-01
Salmonella enterica Pullorum is one of the leading causes of mortality in poultry. Understanding the molecular response in chickens in response to the infection by S. enterica is important in revealing the mechanisms of pathogenesis and disease progress. There have been studies on identifying genes associated with Salmonella infection by differential expression analysis, but the relationships among regulated genes have not been investigated. In this study, we employed weighted gene coexpression network analysis (WGCNA) and differential coexpression analysis (DCEA) to identify coexpression modules by exploring microarray data derived from chicken splenic tissues in response to the S. enterica infection. A total of 19 modules from 13,538 genes were associated with the Jak-STAT signaling pathway, the extracellular matrix, cytoskeleton organization, the regulation of the actin cytoskeleton, G-protein coupled receptor activity, Toll-like receptor signaling pathways, and immune system processes; among them, 14 differentially coexpressed modules (DCMs) and 2,856 differentially coexpressed genes (DCGs) were identified. The global expression of module genes between infected and uninfected chickens showed slight differences but considerable changes for global coexpression. Furthermore, DCGs were consistently linked to the hubs of the modules. These results will help prioritize candidate genes for future studies of Salmonella infection.
2017-01-01
Salmonella enterica Pullorum is one of the leading causes of mortality in poultry. Understanding the molecular response in chickens in response to the infection by S. enterica is important in revealing the mechanisms of pathogenesis and disease progress. There have been studies on identifying genes associated with Salmonella infection by differential expression analysis, but the relationships among regulated genes have not been investigated. In this study, we employed weighted gene coexpression network analysis (WGCNA) and differential coexpression analysis (DCEA) to identify coexpression modules by exploring microarray data derived from chicken splenic tissues in response to the S. enterica infection. A total of 19 modules from 13,538 genes were associated with the Jak-STAT signaling pathway, the extracellular matrix, cytoskeleton organization, the regulation of the actin cytoskeleton, G-protein coupled receptor activity, Toll-like receptor signaling pathways, and immune system processes; among them, 14 differentially coexpressed modules (DCMs) and 2,856 differentially coexpressed genes (DCGs) were identified. The global expression of module genes between infected and uninfected chickens showed slight differences but considerable changes for global coexpression. Furthermore, DCGs were consistently linked to the hubs of the modules. These results will help prioritize candidate genes for future studies of Salmonella infection. PMID:28529955
APPLICATION OF DNA MICROARRAYS TO REPRODUCTIVE TOXICOLOGY AND THE DEVELOPMENT OF A TESTIS ARRAY
With the advent of sequence information for entire mammalian genomes, it is now possible to analyze gene expression and gene polymorphisms on a genomic scale. The primary tool for analysis of gene expression is the DNA microarray. We have used commercially available cDNA micro...
Microarrays for Undergraduate Classes
ERIC Educational Resources Information Center
Hancock, Dale; Nguyen, Lisa L.; Denyer, Gareth S.; Johnston, Jill M.
2006-01-01
A microarray experiment is presented that, in six laboratory sessions, takes undergraduate students from the tissue sample right through to data analysis. The model chosen, the murine erythroleukemia cell line, can be easily cultured in sufficient quantities for class use. Large changes in gene expression can be induced in these cells by…
With the advent of sequence information for entire eukaryotic genomes, it is now possible to analyze gene expression on a genomic scale. The primary tool for genomic analysis of gene expression is the gene microarray. We have used commercially available and custom cDNA microarray...
Microarrays have the potential to significantly impact our ability to identify toxic hazards by the identification of mechanistically-relevant markers of toxicity. To be useful for risk assessment however, microarray data must be challenged to determine its reliability and inter...
USDA-ARS?s Scientific Manuscript database
To analyze transcriptome response to virus infection, we have assembled currently available microarray data on changes in gene expression levels in compatible Arabidopsis-virus interactions. We used the mean r (Pearson’s correlation coefficient) for neighboring pairs to estimate pairwise local simil...
Baumann, Antoine; Devaux, Yvan; Audibert, Gérard; Zhang, Lu; Bracard, Serge; Colnat-Coulbois, Sophie; Klein, Olivier; Zannad, Faiez; Charpentier, Claire; Longrois, Dan; Mertes, Paul-Michel
2013-01-01
Delayed cerebral ischemia (DCI) is a potentially devastating complication after intracranial aneurysm rupture and its mechanisms remain poorly elucidated. Early identification of the patients prone to developing DCI after rupture may represent a major breakthrough in its prevention and treatment. The single gene approach of DCI has demonstrated interest in humans. We hypothesized that whole genome expression profile of blood cells may be useful for better comprehension and prediction of aneurysmal DCI. Over a 35-month period, 218 patients with aneurysm rupture were included in this study. DCI was defined as the occurrence of a new delayed neurological deficit occurring within 2 weeks after aneurysm rupture with evidence of ischemia either on perfusion-diffusion MRI, CT angiography or CT perfusion imaging, or with cerebral angiography. DCI patients were matched against controls based on 4 out of 5 criteria (age, sex, Fisher grade, aneurysm location and smoking status). Genome-wide expression analysis of blood cells obtained at admission was performed by microarrays. Transcriptomic analysis was performed using long oligonucleotide microarrays representing 25,000 genes. Quantitative PCR: 1 µg of total RNA extracted was reverse-transcribed, and the resulting cDNA was diluted 10-fold before performing quantitative PCR. Microarray data were first analyzed by 'Significance Analysis of Microarrays' software which includes the Benjamini correction for multiple testing. In a second step, microarray data fold change was compared using a two-tailed, paired t test. Analysis of receiver-operating characteristic (ROC) curves and the area under the ROC curves were used for prediction analysis. Logistic regression models were used to investigate the additive value of multiple biomarkers. A total of 16 patients demonstrated DCI. Significance Analysis of Microarrays software failed to retrieve significant genes, most probably because of the heterogeneity of the patients included in the microarray experiments and the small size of the DCI population sample. Standard two-tailed paired t test and C-statistic revealed significant associations between gene expression and the occurrence of DCI: in particular, the expression of neuroregulin 1 was 1.6-fold upregulated in patients with DCI (p = 0.01) and predicted DCI with an area under the ROC curve of 0.96. Logistic regression analyses revealed a significant association between neuroregulin 1 and DCI (odds ratio 1.46, 95% confidence interval 1.02-2.09, p = 0.02). This pilot study suggests that blood cells may be a reservoir of prognostic biomarkers of DCI in patients with intracranial aneurysm rupture. Despite an evident lack of power, this study elicited neuroregulin 1, a vasoreactivity-, inflammation- and angiogenesis-related gene, as a possible candidate predictor of DCI. Larger cohort studies are needed but genome-wide microarray-based studies are promising research tools for the understanding of DCI after intracranial aneurysm rupture. © 2013 S. Karger AG, Basel.
Hook, S E
2010-12-01
The advent of any new technology is typically met with great excitement. So it was a few years ago, when the combination of advances in sequencing technology and the development of microarray technology made measurements of global gene expression in ecologically relevant species possible. Many of the review papers published around that time promised that these new technologies would revolutionize environmental biology as they had revolutionized medicine and related fields. A few years have passed since these technological advancements have been made, and the use of microarray studies in non-model fish species has been adopted in many laboratories internationally. Has the relatively widespread adoption of this technology really revolutionized the fields of environmental biology, including ecotoxicology, aquaculture and ecology, as promised? Or have these studies merely become a novelty and a potential distraction for scientists addressing environmentally relevant questions? In this review, the promises made in early review papers, in particular about the advances that the use of microarrays would enable, are summarized; these claims are compared to the results of recent studies to determine whether the forecasted changes have materialized. Some applications, as discussed in the paper, have been realized and have led to advances in their field, others are still under development. © 2010 CSIRO. Journal of Fish Biology © 2010 The Fisheries Society of the British Isles.
Giotis, Efstathios S; Robey, Rebecca C; Skinner, Natalie G; Tomlinson, Christopher D; Goodbourn, Stephen; Skinner, Michael A
2016-08-05
Viruses that infect birds pose major threats-to the global supply of chicken, the major, universally-acceptable meat, and as zoonotic agents (e.g. avian influenza viruses H5N1 and H7N9). Controlling these viruses in birds as well as understanding their emergence into, and transmission amongst, humans will require considerable ingenuity and understanding of how different species defend themselves. The type I interferon-coordinated response constitutes the major antiviral innate defence. Although interferon was discovered in chicken cells, details of the response, particularly the identity of hundreds of stimulated genes, are far better described in mammals. Viruses induce interferon-stimulated genes but they also regulate the expression of many hundreds of cellular metabolic and structural genes to facilitate their replication. This study focusses on the potentially anti-viral genes by identifying those induced just by interferon in primary chick embryo fibroblasts. Three transcriptomic technologies were exploited: RNA-seq, a classical 3'-biased chicken microarray and a high density, "sense target", whole transcriptome chicken microarray, with each recognising 120-150 regulated genes (curated for duplication and incorrect assignment of some microarray probesets). Overall, the results are considered robust because 128 of the compiled, curated list of 193 regulated genes were detected by two, or more, of the technologies.
Grade, Marian; Hörmann, Patrick; Becker, Sandra; Hummon, Amanda B.; Wangsa, Danny; Varma, Sudhir; Simon, Richard; Liersch, Torsten; Becker, Heinz; Difilippantonio, Michael J.; Ghadimi, B. Michael; Ried, Thomas
2016-01-01
To characterize patterns of global transcriptional deregulation in primary colon carcinomas, we did gene expression profiling of 73 tumors [Unio Internationale Contra Cancrum stage II (n = 33) and stage III (n = 40)] using oligonucleotide microarrays. For 30 of the tumors, expression profiles were compared with those from matched normal mucosa samples. We identified a set of 1,950 genes with highly significant deregulation between tumors and mucosa samples (P < 1e–7). A significant proportion of these genes mapped to chromosome 20 (P = 0.01). Seventeen genes had a >5-fold average expression difference between normal colon mucosa and carcinomas, including up-regulation of MYC and of HMGA1, a putative oncogene. Furthermore, we identified 68 genes that were significantly differentially expressed between lymph node–negative and lymph node–positive tumors (P < 0.001), the functional annotation of which revealed a preponderance of genes that play a role in cellular immune response and surveillance. The microarray-derived gene expression levels of 20 deregulated genes were validated using quantitative real-time reverse transcription-PCR in >40 tumor and normal mucosa samples with good concordance between the techniques. Finally, we established a relationship between specific genomic imbalances, which were mapped for 32 of the analyzed colon tumors by comparative genomic hybridization, and alterations of global transcriptional activity. Previously, we had conducted a similar analysis of primary rectal carcinomas. The systematic comparison of colon and rectal carcinomas revealed a significant overlap of genomic imbalances and transcriptional deregulation, including activation of the Wnt/β-catenin signaling cascade, suggesting similar pathogenic pathways. PMID:17210682
Grade, Marian; Hörmann, Patrick; Becker, Sandra; Hummon, Amanda B; Wangsa, Danny; Varma, Sudhir; Simon, Richard; Liersch, Torsten; Becker, Heinz; Difilippantonio, Michael J; Ghadimi, B Michael; Ried, Thomas
2007-01-01
To characterize patterns of global transcriptional deregulation in primary colon carcinomas, we did gene expression profiling of 73 tumors [Unio Internationale Contra Cancrum stage II (n = 33) and stage III (n = 40)] using oligonucleotide microarrays. For 30 of the tumors, expression profiles were compared with those from matched normal mucosa samples. We identified a set of 1,950 genes with highly significant deregulation between tumors and mucosa samples (P < 1e-7). A significant proportion of these genes mapped to chromosome 20 (P = 0.01). Seventeen genes had a >5-fold average expression difference between normal colon mucosa and carcinomas, including up-regulation of MYC and of HMGA1, a putative oncogene. Furthermore, we identified 68 genes that were significantly differentially expressed between lymph node-negative and lymph node-positive tumors (P < 0.001), the functional annotation of which revealed a preponderance of genes that play a role in cellular immune response and surveillance. The microarray-derived gene expression levels of 20 deregulated genes were validated using quantitative real-time reverse transcription-PCR in >40 tumor and normal mucosa samples with good concordance between the techniques. Finally, we established a relationship between specific genomic imbalances, which were mapped for 32 of the analyzed colon tumors by comparative genomic hybridization, and alterations of global transcriptional activity. Previously, we had conducted a similar analysis of primary rectal carcinomas. The systematic comparison of colon and rectal carcinomas revealed a significant overlap of genomic imbalances and transcriptional deregulation, including activation of the Wnt/beta-catenin signaling cascade, suggesting similar pathogenic pathways.
Szcześniak, K A; Ciecierska, A; Ostaszewski, P; Sadkowski, T
2016-01-01
Adult skeletal muscle myogenesis depends on the activation of satellite cells that have the potential to differentiate into new fibers. Gamma-oryzanol (GO), a commercially available nutriactive phytochemical, has gained global interest on account of its muscle-building and regenerating effects. Here, we investigated GO for its potential influence on myogenesis, using equine satellite cell culture model, since the horse is a unique animal, bred and exercised for competitive sport. To our knowledge, this is the first report where the global gene expression in cultured equine satellite cells has been described. Equine satellite cells were isolated from semitendinosus muscle and cultured until the second day of differentiation. Differentiating cells were incubated with GO for the next 24 h. Subsequently, total RNA from GO-treated and control cells was isolated, amplified, labeled, and hybridized to two-color Horse Gene Expression Microarray slides. Quantitative PCR was used for the validation of microarray data. Our results revealed 58 genes with changed expression in GO-treated vs. control cells. Analysis of expression changes suggests that various processes are reinforced by GO in differentiating equine satellite cells, including inhibition of myoblast differentiation, increased proliferation and differentiation, stress response, and increased myogenic lineage commitment. The present study may confirm putative muscle-enhancing abilities of GO; however, the collective role of GO in skeletal myogenesis remains equivocal. The diversity of these changes is likely due to heterogenous growth rate of cells in primary culture. Genes identified in our study, modulated by the presence of GO, may become potential targets of future research investigating impact of this supplement in skeletal muscle on proteomic and biochemical level.
Profiling In Situ Microbial Community Structure with an Amplification Microarray
Knickerbocker, Christopher; Bryant, Lexi; Golova, Julia; Wiles, Cory; Williams, Kenneth H.; Peacock, Aaron D.; Long, Philip E.
2013-01-01
The objectives of this study were to unify amplification, labeling, and microarray hybridization chemistries within a single, closed microfluidic chamber (an amplification microarray) and verify technology performance on a series of groundwater samples from an in situ field experiment designed to compare U(VI) mobility under conditions of various alkalinities (as HCO3−) during stimulated microbial activity accompanying acetate amendment. Analytical limits of detection were between 2 and 200 cell equivalents of purified DNA. Amplification microarray signatures were well correlated with 16S rRNA-targeted quantitative PCR results and hybridization microarray signatures. The succession of the microbial community was evident with and consistent between the two microarray platforms. Amplification microarray analysis of acetate-treated groundwater showed elevated levels of iron-reducing bacteria (Flexibacter, Geobacter, Rhodoferax, and Shewanella) relative to the average background profile, as expected. Identical molecular signatures were evident in the transect treated with acetate plus NaHCO3, but at much lower signal intensities and with a much more rapid decline (to nondetection). Azoarcus, Thaurea, and Methylobacterium were responsive in the acetate-only transect but not in the presence of bicarbonate. Observed differences in microbial community composition or response to bicarbonate amendment likely had an effect on measured rates of U reduction, with higher rates probable in the part of the field experiment that was amended with bicarbonate. The simplification in microarray-based work flow is a significant technological advance toward entirely closed-amplicon microarray-based tests and is generally extensible to any number of environmental monitoring applications. PMID:23160129
Page, Grier P; Coulibaly, Issa
2008-01-01
Microarrays are a very powerful tool for quantifying the amount of RNA in samples; however, their ability to query essentially every gene in a genome, which can number in the tens of thousands, presents analytical and interpretative problems. As a result, a variety of software and web-based tools have been developed to help with these issues. This article highlights and reviews some of the tools for the first steps in the analysis of a microarray study. We have tried for a balance between free and commercial systems. We have organized the tools by topics including image processing tools (Section 2), power analysis tools (Section 3), image analysis tools (Section 4), database tools (Section 5), databases of functional information (Section 6), annotation tools (Section 7), statistical and data mining tools (Section 8), and dissemination tools (Section 9).
Bioinformatics and Microarray Data Analysis on the Cloud.
Calabrese, Barbara; Cannataro, Mario
2016-01-01
High-throughput platforms such as microarray, mass spectrometry, and next-generation sequencing are producing an increasing volume of omics data that needs large data storage and computing power. Cloud computing offers massive scalable computing and storage, data sharing, on-demand anytime and anywhere access to resources and applications, and thus, it may represent the key technology for facing those issues. In fact, in the recent years it has been adopted for the deployment of different bioinformatics solutions and services both in academia and in the industry. Although this, cloud computing presents several issues regarding the security and privacy of data, that are particularly important when analyzing patients data, such as in personalized medicine. This chapter reviews main academic and industrial cloud-based bioinformatics solutions; with a special focus on microarray data analysis solutions and underlines main issues and problems related to the use of such platforms for the storage and analysis of patients data.
Identification of differentially expressed genes and false discovery rate in microarray studies.
Gusnanto, Arief; Calza, Stefano; Pawitan, Yudi
2007-04-01
To highlight the development in microarray data analysis for the identification of differentially expressed genes, particularly via control of false discovery rate. The emergence of high-throughput technology such as microarrays raises two fundamental statistical issues: multiplicity and sensitivity. We focus on the biological problem of identifying differentially expressed genes. First, multiplicity arises due to testing tens of thousands of hypotheses, rendering the standard P value meaningless. Second, known optimal single-test procedures such as the t-test perform poorly in the context of highly multiple tests. The standard approach of dealing with multiplicity is too conservative in the microarray context. The false discovery rate concept is fast becoming the key statistical assessment tool replacing the P value. We review the false discovery rate approach and argue that it is more sensible for microarray data. We also discuss some methods to take into account additional information from the microarrays to improve the false discovery rate. There is growing consensus on how to analyse microarray data using the false discovery rate framework in place of the classical P value. Further research is needed on the preprocessing of the raw data, such as the normalization step and filtering, and on finding the most sensitive test procedure.
Trio, Phoebe Zapanta; Kawahara, Atsuyoshi; Tanigawa, Shunsuke; Sakao, Kozue; Hou, De-Xing
2017-01-01
6-MSITC and 6-MTITC are sulforaphane (SFN) analogs found in Japanese Wasabi. As we reported previously, Wasabi isothiocyanates (ITCs) are activators of Nrf2-antioxidant response element pathway, and also inhibitors of pro-inflammatory cyclooxygenase-2. This study is the first to assess the global changes in transcript levels by Wasabi ITCs, comparing with SFN, in HepG2 cells. We performed comparative gene expression profiling by treating HepG2 cells with ITCs, followed by DNA microarray analyses using HG-U133 plus 2.0 oligonucleotide array. Partial array data on selected gene products were confirmed by RT-PCR and Western blotting. Ingenuity Pathway Analysis (IPA) was used to identify functional subsets of genes and biologically significant network pathways. 6-MTITC showed the highest number of differentially altered (≥2 folds) gene expression, of which 114 genes were upregulated and 75 were downregulated. IPA revealed that Nrf2-mediated pathway, together with glutamate metabolism, is the common significantly modulated pathway across treatments. Interestingly, 6-MSITC exhibited the most potent effect toward Nrf2-mediated pathway. Our data suggest that 6-MSITC could exert chemopreventive role against cancer through its underlying antioxidant activity via the activation of Nrf2-mediated subsequent induction of cytoprotective genes.
Sharova, Tatyana Y; Poterlowicz, Krzysztof; Botchkareva, Natalia V; Kondratiev, Nikita A; Aziz, Ahmar; Spiegel, Jeffrey H; Botchkarev, Vladimir A; Sharov, Andrey A
2014-12-01
Chemotherapy has severe side effects in normal rapidly proliferating organs, such as hair follicles, and causes massive apoptosis in hair matrix keratinocytes followed by hair loss. To define the molecular signature of hair follicle response to chemotherapy, human scalp hair follicles cultured ex vivo were treated with doxorubicin (DXR), and global microarray analysis was performed 3 hours after treatment. Microarray data revealed changes in expression of 504 genes in DXR-treated hair follicles versus controls. Among these genes, upregulations of several tumor necrosis factor family of apoptotic receptors (FAS, TRAIL (tumor necrosis factor-related apoptosis-inducing ligand) receptors 1/2), as well as of a large number of keratin-associated protein genes, were seen after DXR treatment. Hair follicle apoptosis induced by DXR was significantly inhibited by either TRAIL-neutralizing antibody or caspase-8 inhibitor, thus suggesting a previously unreported role for TRAIL receptor signaling in mediating DXR-induced hair loss. These data demonstrate that the early phase of the hair follicle response to DXR includes upregulation of apoptosis-associated markers, as well as substantial reorganization of the terminal differentiation programs in hair follicle keratinocytes. These data provide an important platform for further studies toward the design of effective approaches for the management of chemotherapy-induced hair loss.
Sharova, Tatyana Y.; Poterlowicz, Krzysztof; Botchkareva, Natalia V.; Kondratiev, Nikita A.; Aziz, Ahmar; Spiegel, Jeffrey H.; Botchkarev, Vladimir A.; Sharov, Andrey A.
2014-01-01
Chemotherapy has severe side-effects for normal rapidly proliferating organs, such as hair follicle, and causes massive apoptosis in hair matrix keratinocytes followed by hair loss. To define the molecular signature of hair follicle response to chemotherapy, human scalp hair follicles cultured ex vivo were treated with doxorubicin and global microarray analysis was performed 3 hours after treatment. Microarray data revealed changes in expression of 504 genes in doxorubicin-treated hair follicles versus the controls. Among these genes, upregulations of several tumor necrosis factor family of apoptotic receptors (FAS, TRAIL receptors 1/2), as well as of a large number of the keratin-associated protein genes were seen after doxorubicin treatment. Hair follicle apoptosis induced by doxorubicin was significantly inhibited by either TRAIL neutralizing antibody or caspase 8 inhibitor, thus suggesting a novel role for TRAIL receptor signaling in mediating doxorubicin-induced hair loss. These data demonstrate that the early phase of the hair follicle response to doxorubicin includes upregulation of apoptosis-associated markers, as well as substantial re-organization of the terminal differentiation programs in hair follicle keratinocytes. These data provide an important platform for further studies towards the design of novel approaches for management of chemotherapy-induced hair loss. PMID:24999588
An evaluation of two-channel ChIP-on-chip and DNA methylation microarray normalization strategies
2012-01-01
Background The combination of chromatin immunoprecipitation with two-channel microarray technology enables genome-wide mapping of binding sites of DNA-interacting proteins (ChIP-on-chip) or sites with methylated CpG di-nucleotides (DNA methylation microarray). These powerful tools are the gateway to understanding gene transcription regulation. Since the goals of such studies, the sample preparation procedures, the microarray content and study design are all different from transcriptomics microarrays, the data pre-processing strategies traditionally applied to transcriptomics microarrays may not be appropriate. Particularly, the main challenge of the normalization of "regulation microarrays" is (i) to make the data of individual microarrays quantitatively comparable and (ii) to keep the signals of the enriched probes, representing DNA sequences from the precipitate, as distinguishable as possible from the signals of the un-enriched probes, representing DNA sequences largely absent from the precipitate. Results We compare several widely used normalization approaches (VSN, LOWESS, quantile, T-quantile, Tukey's biweight scaling, Peng's method) applied to a selection of regulation microarray datasets, ranging from DNA methylation to transcription factor binding and histone modification studies. Through comparison of the data distributions of control probes and gene promoter probes before and after normalization, and assessment of the power to identify known enriched genomic regions after normalization, we demonstrate that there are clear differences in performance between normalization procedures. Conclusion T-quantile normalization applied separately on the channels and Tukey's biweight scaling outperform other methods in terms of the conservation of enriched and un-enriched signal separation, as well as in identification of genomic regions known to be enriched. T-quantile normalization is preferable as it additionally improves comparability between microarrays. In contrast, popular normalization approaches like quantile, LOWESS, Peng's method and VSN normalization alter the data distributions of regulation microarrays to such an extent that using these approaches will impact the reliability of the downstream analysis substantially. PMID:22276688
Rai, Muhammad Farooq; Tycksen, Eric D; Sandell, Linda J; Brophy, Robert H
2018-01-01
Microarrays and RNA-seq are at the forefront of high throughput transcriptome analyses. Since these methodologies are based on different principles, there are concerns about the concordance of data between the two techniques. The concordance of RNA-seq and microarrays for genome-wide analysis of differential gene expression has not been rigorously assessed in clinically derived ligament tissues. To demonstrate the concordance between RNA-seq and microarrays and to assess potential benefits of RNA-seq over microarrays, we assessed differences in transcript expression in anterior cruciate ligament (ACL) tissues based on time-from-injury. ACL remnants were collected from patients with an ACL tear at the time of ACL reconstruction. RNA prepared from torn ACL remnants was subjected to Agilent microarrays (N = 24) and RNA-seq (N = 8). The correlation of biological replicates in RNA-seq and microarrays data was similar (0.98 vs. 0.97), demonstrating that each platform has high internal reproducibility. Correlations between the RNA-seq data and the individual microarrays were low, but correlations between the RNA-seq values and the geometric mean of the microarrays values were moderate. The cross-platform concordance for differentially expressed transcripts or enriched pathways was linearly correlated (r = 0.64). RNA-Seq was superior in detecting low abundance transcripts and differentiating biologically critical isoforms. Additional independent validation of transcript expression was undertaken using microfluidic PCR for selected genes. PCR data showed 100% concordance (in expression pattern) with RNA-seq and microarrays data. These findings demonstrate that RNA-seq has advantages over microarrays for transcriptome profiling of ligament tissues when available and affordable. Furthermore, these findings are likely transferable to other musculoskeletal tissues where tissue collection is challenging and cells are in low abundance. © 2017 Orthopaedic Research Society. Published by Wiley Periodicals, Inc. J Orthop Res 36:484-497, 2018. © 2017 Orthopaedic Research Society. Published by Wiley Periodicals, Inc.
2012-01-01
Background DNA microarrays are used both for research and for diagnostics. In research, Affymetrix arrays are commonly used for genome wide association studies, resequencing, and for gene expression analysis. These arrays provide large amounts of data. This data is analyzed using statistical methods that quite often discard a large portion of the information. Most of the information that is lost comes from probes that systematically fail across chips and from batch effects. The aim of this study was to develop a comprehensive model for hybridization that predicts probe intensities for Affymetrix arrays and that could provide a basis for improved microarray analysis and probe development. The first part of the model calculates probe binding affinities to all the possible targets in the hybridization solution using the Langmuir isotherm. In the second part of the model we integrate details that are specific to each experiment and contribute to the differences between hybridization in solution and on the microarray. These details include fragmentation, wash stringency, temperature, salt concentration, and scanner settings. Furthermore, the model fits probe synthesis efficiency and target concentration parameters directly to the data. All the parameters used in the model have a well-established physical origin. Results For the 302 chips that were analyzed the mean correlation between expected and observed probe intensities was 0.701 with a range of 0.88 to 0.55. All available chips were included in the analysis regardless of the data quality. Our results show that batch effects arise from differences in probe synthesis, scanner settings, wash strength, and target fragmentation. We also show that probe synthesis efficiencies for different nucleotides are not uniform. Conclusions To date this is the most complete model for binding on microarrays. This is the first model that includes both probe synthesis efficiency and hybridization kinetics/cross-hybridization. These two factors are sequence dependent and have a large impact on probe intensity. The results presented here provide novel insight into the effect of probe synthesis errors on Affymetrix microarrays; furthermore, the algorithms developed in this work provide useful tools for the analysis of cross-hybridization, probe synthesis efficiency, fragmentation, wash stringency, temperature, and salt concentration on microarray intensities. PMID:23270536
Thormar, Hans G; Gudmundsson, Bjarki; Eiriksdottir, Freyja; Kil, Siyoen; Gunnarsson, Gudmundur H; Magnusson, Magnus Karl; Hsu, Jason C; Jonsson, Jon J
2013-04-01
The causes of imprecision in microarray expression analysis are poorly understood, limiting the use of this technology in molecular diagnostics. Two-dimensional strandness-dependent electrophoresis (2D-SDE) separates nucleic acid molecules on the basis of length and strandness, i.e., double-stranded DNA (dsDNA), single-stranded DNA (ssDNA), and RNA·DNA hybrids. We used 2D-SDE to measure the efficiency of cDNA synthesis and its importance for the imprecision of an in vitro transcription-based microarray expression analysis. The relative amount of double-stranded cDNA formed in replicate experiments that used the same RNA sample template was highly variable, ranging between 0% and 72% of the total DNA. Microarray experiments showed an inverse relationship between the difference between sample pairs in probe variance and the relative amount of dsDNA. Approximately 15% of probes showed between-sample variation (P < 0.05) when the dsDNA percentage was between 12% and 35%. In contrast, only 3% of probes showed between-sample variation when the dsDNA percentage was 69% and 72%. Replication experiments of the 35% dsDNA and 72% dsDNA samples were used to separate sample variation from probe replication variation. The estimated SD of the sample-to-sample variation and of the probe replicates was lower in 72% dsDNA samples than in 35% dsDNA samples. Variation in the relative amount of double-stranded cDNA synthesized can be an important component of the imprecision in T7 RNA polymerase-based microarray expression analysis. © 2013 American Association for Clinical Chemistry
Li, Lingyun; Li, Qingbo; Rohlin, Lars; Kim, UnMi; Salmon, Kirsty; Rejtar, Tomas; Gunsalus, Robert P.; Karger, Barry L.; Ferry, James G.
2008-01-01
Summary Methanosarcina acetivorans strain C2A is an acetate- and methanol-utilizing methane-producing organism for which the genome, the largest yet sequenced among the Archaea, reveals extensive physiological diversity. LC linear ion trap-FTICR mass spectrometry was employed to analyze acetate- vs. methanol-grown cells metabolically labeled with 14N vs. 15N, respectively, to obtain quantitative protein abundance ratios. DNA microarray analyses of acetate- vs. methanol-grown cells was also performed to determine gene expression ratios. The combined approaches were highly complementary, extending the physiological understanding of growth and methanogenesis. Of the 1081 proteins detected, 255 were ≥ 3-fold differentially abundant. DNA microarray analysis revealed 410 genes that were ≥ 2.5-fold differentially expressed of 1972 genes with detected expression. The ratios of differentially abundant proteins were in good agreement with expression ratios of the encoding genes. Taken together, the results suggest several novel roles for electron transport components specific to acetate-grown cells, including two flavodoxins each specific for growth on acetate or methanol. Protein abundance ratios indicated that duplicate CO dehydrogenase/acetyl-CoA complexes function in the conversion of acetate to methane. Surprisingly, the protein abundance and gene expression ratios indicated a general stress response in acetate- vs. methanol-grown cells that included enzymes specific for polyphosphate accumulation and oxidative stress. The microarray analysis identified transcripts of several genes encoding regulatory proteins with identity to the PhoU, MarR, GlnK, and TetR families commonly found in the Bacteria domain. An analysis of neighboring genes suggested roles in controlling phosphate metabolism (PhoU), ammonia assimilation (GlnK), and molybdopterin cofactor biosynthesis (TetR). Finally, the proteomic and microarray results suggested roles for two-component regulatory systems specific for each growth substrate. PMID:17269732
Integrated analysis of chromosome copy number variation and gene expression in cervical carcinoma
Yan, Deng; Yi, Song; Chiu, Wang Chi; Qin, Liu Gui; Kin, Wong Hoi; Kwok Hung, Chung Tony; Linxiao, Han; Wai, Choy Kwong; Yi, Sui; Tao, Yang; Tao, Tang
2017-01-01
Objective This study was conducted to explore chromosomal copy number variations (CNV) and transcript expression and to examine pathways in cervical pathogenesis using genome-wide high resolution microarrays. Methods Genome-wide chromosomal CNVs were investigated in 6 cervical cancer cell lines by Human Genome CGH Microarray Kit (4x44K). Gene expression profiles in cervical cancer cell lines, primary cervical carcinoma and normal cervical epithelium tissues were also studied using the Whole Human Genome Microarray Kit (4x44K). Results Fifty common chromosomal CNVs were identified in the cervical cancer cell lines. Correlation analysis revealed that gene up-regulation or down-regulation is significantly correlated with genomic amplification (P=0.009) or deletion (P=0.006) events. Expression profiles were identified through cluster analysis. Gene annotation analysis pinpointed cell cycle pathways was significantly (P=1.15E-08) affected in cervical cancer. Common CNVs were associated with cervical cancer. Conclusion Chromosomal CNVs may contribute to their transcript expression in cervical cancer. PMID:29312578
BioconductorBuntu: a Linux distribution that implements a web-based DNA microarray analysis server.
Geeleher, Paul; Morris, Dermot; Hinde, John P; Golden, Aaron
2009-06-01
BioconductorBuntu is a custom distribution of Ubuntu Linux that automatically installs a server-side microarray processing environment, providing a user-friendly web-based GUI to many of the tools developed by the Bioconductor Project, accessible locally or across a network. System installation is via booting off a CD image or by using a Debian package provided to upgrade an existing Ubuntu installation. In its current version, several microarray analysis pipelines are supported including oligonucleotide, dual-or single-dye experiments, including post-processing with Gene Set Enrichment Analysis. BioconductorBuntu is designed to be extensible, by server-side integration of further relevant Bioconductor modules as required, facilitated by its straightforward underlying Python-based infrastructure. BioconductorBuntu offers an ideal environment for the development of processing procedures to facilitate the analysis of next-generation sequencing datasets. BioconductorBuntu is available for download under a creative commons license along with additional documentation and a tutorial from (http://bioinf.nuigalway.ie).
De Hertogh, Benoît; De Meulder, Bertrand; Berger, Fabrice; Pierre, Michael; Bareke, Eric; Gaigneaux, Anthoula; Depiereux, Eric
2010-01-11
Recent reanalysis of spike-in datasets underscored the need for new and more accurate benchmark datasets for statistical microarray analysis. We present here a fresh method using biologically-relevant data to evaluate the performance of statistical methods. Our novel method ranks the probesets from a dataset composed of publicly-available biological microarray data and extracts subset matrices with precise information/noise ratios. Our method can be used to determine the capability of different methods to better estimate variance for a given number of replicates. The mean-variance and mean-fold change relationships of the matrices revealed a closer approximation of biological reality. Performance analysis refined the results from benchmarks published previously.We show that the Shrinkage t test (close to Limma) was the best of the methods tested, except when two replicates were examined, where the Regularized t test and the Window t test performed slightly better. The R scripts used for the analysis are available at http://urbm-cluster.urbm.fundp.ac.be/~bdemeulder/.
Integrated analysis of chromosome copy number variation and gene expression in cervical carcinoma.
Yan, Deng; Yi, Song; Chiu, Wang Chi; Qin, Liu Gui; Kin, Wong Hoi; Kwok Hung, Chung Tony; Linxiao, Han; Wai, Choy Kwong; Yi, Sui; Tao, Yang; Tao, Tang
2017-12-12
This study was conducted to explore chromosomal copy number variations (CNV) and transcript expression and to examine pathways in cervical pathogenesis using genome-wide high resolution microarrays. Genome-wide chromosomal CNVs were investigated in 6 cervical cancer cell lines by Human Genome CGH Microarray Kit (4x44K). Gene expression profiles in cervical cancer cell lines, primary cervical carcinoma and normal cervical epithelium tissues were also studied using the Whole Human Genome Microarray Kit (4x44K). Fifty common chromosomal CNVs were identified in the cervical cancer cell lines. Correlation analysis revealed that gene up-regulation or down-regulation is significantly correlated with genomic amplification ( P =0.009) or deletion ( P =0.006) events. Expression profiles were identified through cluster analysis. Gene annotation analysis pinpointed cell cycle pathways was significantly ( P =1.15E-08) affected in cervical cancer. Common CNVs were associated with cervical cancer. Chromosomal CNVs may contribute to their transcript expression in cervical cancer.
Hayashi, Ken-Go; Hosoe, Misa; Kizaki, Keiichiro; Fujii, Shiori; Kanahara, Hiroko; Takahashi, Toru; Sakumoto, Ryosuke
2017-03-23
Repeat breeding directly affects reproductive efficiency in cattle due to an increase in services per conception and calving interval. This study aimed to investigate whether changes in endometrial gene expression profile are involved in repeat breeding in cows. Differential gene expression profiles of the endometrium were investigated during the mid-luteal phase of the estrous cycle between repeat breeder (RB) and non-RB cows using microarray analysis. The caruncular (CAR) and intercaruncular (ICAR) endometrium of both ipsilateral and contralateral uterine horns to the corpus luteum were collected from RB (inseminated at least three times but not pregnant) and non-RB cows on Day 15 of the estrous cycle (4 cows/group). Global gene expression profiles of these endometrial samples were analyzed with a 15 K custom-made oligo-microarray for cattle. Immunohistochemistry was performed to investigate the cellular localization of proteins of three identified transcripts in the endometrium. Microarray analysis revealed that 405 and 397 genes were differentially expressed in the CAR and ICAR of the ipsilateral uterine horn of RB, respectively when compared with non-RB cows. In the contralateral uterine horn, 443 and 257 differentially expressed genes were identified in the CAR and ICAR of RB, respectively when compared with non-RB cows. Gene ontology analysis revealed that genes involved in development and morphogenesis were mainly up-regulated in the CAR of RB cows. In the ICAR of both the ipsilateral and contralateral uterine horns, genes related to the metabolic process were predominantly enriched in the RB cows when compared with non-RB cows. In the analysis of the whole uterus (combining the data above four endometrial compartments), RB cows showed up-regulation of 37 genes including PRSS2, GSTA3 and PIPOX and down-regulation of 39 genes including CHGA, KRT35 and THBS4 when compared with non-RB cows. Immunohistochemistry revealed that CHGA, GSTA3 and PRSS2 proteins were localized in luminal and glandular epithelial cells and stroma of the endometrium. The present study showed that endometrial gene expression profiles are different between RB and non-RB cows. The identified candidate endometrial genes and functions in each endometrial compartment may contribute to bovine reproductive performance.
Transcriptome analysis and related databases of Lactococcus lactis.
Kuipers, Oscar P; de Jong, Anne; Baerends, Richard J S; van Hijum, Sacha A F T; Zomer, Aldert L; Karsens, Harma A; den Hengst, Chris D; Kramer, Naomi E; Buist, Girbe; Kok, Jan
2002-08-01
Several complete genome sequences of Lactococcus lactis and their annotations will become available in the near future, next to the already published genome sequence of L. lactis ssp. lactis IL 1403. This will allow intraspecies comparative genomics studies as well as functional genomics studies aimed at a better understanding of physiological processes and regulatory networks operating in lactococci. This paper describes the initial set-up of a DNA-microarray facility in our group, to enable transcriptome analysis of various Gram-positive bacteria, including a ssp. lactis and a ssp. cremoris strain of Lactococcus lactis. Moreover a global description will be given of the hardware and software requirements for such a set-up, highlighting the crucial integration of relevant bioinformatics tools and methods. This includes the development of MolGenIS, an information system for transcriptome data storage and retrieval, and LactococCye, a metabolic pathway/genome database of Lactococcus lactis.
Identifying novel glioma associated pathways based on systems biology level meta-analysis.
Hu, Yangfan; Li, Jinquan; Yan, Wenying; Chen, Jiajia; Li, Yin; Hu, Guang; Shen, Bairong
2013-01-01
With recent advances in microarray technology, including genomics, proteomics, and metabolomics, it brings a great challenge for integrating this "-omics" data to analysis complex disease. Glioma is an extremely aggressive and lethal form of brain tumor, and thus the study of the molecule mechanism underlying glioma remains very important. To date, most studies focus on detecting the differentially expressed genes in glioma. However, the meta-analysis for pathway analysis based on multiple microarray datasets has not been systematically pursued. In this study, we therefore developed a systems biology based approach by integrating three types of omics data to identify common pathways in glioma. Firstly, the meta-analysis has been performed to study the overlapping of signatures at different levels based on the microarray gene expression data of glioma. Among these gene expression datasets, 12 pathways were found in GeneGO database that shared by four stages. Then, microRNA expression profiles and ChIP-seq data were integrated for the further pathway enrichment analysis. As a result, we suggest 5 of these pathways could be served as putative pathways in glioma. Among them, the pathway of TGF-beta-dependent induction of EMT via SMAD is of particular importance. Our results demonstrate that the meta-analysis based on systems biology level provide a more useful approach to study the molecule mechanism of complex disease. The integration of different types of omics data, including gene expression microarrays, microRNA and ChIP-seq data, suggest some common pathways correlated with glioma. These findings will offer useful potential candidates for targeted therapeutic intervention of glioma.
Whole-Genome Analysis of the SHORT-ROOT Developmental Pathway in Arabidopsis
Busch, Wolfgang; Cui, Hongchang; Wang, Jean Y; Blilou, Ikram; Hassan, Hala; Nakajima, Keiji; Matsumoto, Noritaka; Lohmann, Jan U; Scheres, Ben
2006-01-01
Stem cell function during organogenesis is a key issue in developmental biology. The transcription factor SHORT-ROOT (SHR) is a critical component in a developmental pathway regulating both the specification of the root stem cell niche and the differentiation potential of a subset of stem cells in the Arabidopsis root. To obtain a comprehensive view of the SHR pathway, we used a statistical method called meta-analysis to combine the results of several microarray experiments measuring the changes in global expression profiles after modulating SHR activity. Meta-analysis was first used to identify the direct targets of SHR by combining results from an inducible form of SHR driven by its endogenous promoter, ectopic expression, followed by cell sorting and comparisons of mutant to wild-type roots. Eight putative direct targets of SHR were identified, all with expression patterns encompassing subsets of the native SHR expression domain. Further evidence for direct regulation by SHR came from binding of SHR in vivo to the promoter regions of four of the eight putative targets. A new role for SHR in the vascular cylinder was predicted from the expression pattern of several direct targets and confirmed with independent markers. The meta-analysis approach was then used to perform a global survey of the SHR indirect targets. Our analysis suggests that the SHR pathway regulates root development not only through a large transcription regulatory network but also through hormonal pathways and signaling pathways using receptor-like kinases. Taken together, our results not only identify the first nodes in the SHR pathway and a new function for SHR in the development of the vascular tissue but also reveal the global architecture of this developmental pathway. PMID:16640459
Engelmann, Brett W
2017-01-01
The Src Homology 2 (SH2) domain family primarily recognizes phosphorylated tyrosine (pY) containing peptide motifs. The relative affinity preferences among competing SH2 domains for phosphopeptide ligands define "specificity space," and underpins many functional pY mediated interactions within signaling networks. The degree of promiscuity exhibited and the dynamic range of affinities supported by individual domains or phosphopeptides is best resolved by a carefully executed and controlled quantitative high-throughput experiment. Here, I describe the fabrication and application of a cellulose-peptide conjugate microarray (CPCMA) platform to the quantitative analysis of SH2 domain specificity space. Included herein are instructions for optimal experimental design with special attention paid to common sources of systematic error, phosphopeptide SPOT synthesis, microarray fabrication, analyte titrations, data capture, and analysis.
Kostić, Tanja; Sessitsch, Angela
2011-01-01
Reliable and sensitive pathogen detection in clinical and environmental (including food and water) samples is of greatest importance for public health. Standard microbiological methods have several limitations and improved alternatives are needed. Most important requirements for reliable analysis include: (i) specificity; (ii) sensitivity; (iii) multiplexing potential; (iv) robustness; (v) speed; (vi) automation potential; and (vii) low cost. Microarray technology can, through its very nature, fulfill many of these requirements directly and the remaining challenges have been tackled. In this review, we attempt to compare performance characteristics of the microbial diagnostic microarrays developed for the detection and typing of food and water pathogens, and discuss limitations, points still to be addressed and issues specific for the analysis of food, water and environmental samples. PMID:27605332
Laser capture microdissection of embryonic cells and preparation of RNA for microarray assays.
Redmond, Latasha C; Pang, Christopher J; Dumur, Catherine; Haar, Jack L; Lloyd, Joyce A
2014-01-01
In order to compare the global gene expression profiles of different embryonic cell types, it is first necessary to isolate the specific cells of interest. The purpose of this chapter is to provide a step-by-step protocol to perform laser capture microdissection (LCM) on embryo samples and obtain sufficient amounts of high-quality RNA for microarray hybridizations. Using the LCM/microarray strategy on mouse embryo samples has some challenges, because the cells of interest are available in limited quantities. The first step in the protocol is to obtain embryonic tissue, and immediately cryoprotect and freeze it in a cryomold containing Optimal Cutting Temperature freezing media (Sakura Finetek), using a dry ice-isopentane bath. The tissue is then cryosectioned, and the microscope slides are processed to fix, stain, and dehydrate the cells. LCM is employed to isolate specific cell types from the slides, identified under the microscope by virtue of their morphology. Detailed protocols are provided for using the currently available ArcturusXT LCM instrument and CapSure(®) LCM Caps, to which the selected cells adhere upon laser capture. To maintain RNA integrity, upon removing a slide from the final processing step, or attaching the first cells on the LCM cap, LCM is completed within 20 min. The cells are then immediately recovered from the LCM cap using a denaturing solution that stabilizes RNA integrity. RNA is prepared using standard methods, modified for working with small samples. To ensure the validity of the microarray data, the quality of the RNA is assessed using the Agilent bioanalyzer. Only RNA that is of sufficient integrity and quantity is used to perform microarray assays. This chapter provides guidance regarding troubleshooting and optimization to obtain high-quality RNA from cells of limited availability, obtained from embryo samples by LCM.
Laser Capture Microdissection of Embryonic Cells and Preparation of RNA for Microarray Assays
Redmond, Latasha C.; Pang, Christopher J.; Dumur, Catherine; Haar, Jack L.; Lloyd, Joyce A.
2014-01-01
In order to compare the global gene expression profiles of different embryonic cell types, it is first necessary to isolate the specific cells of interest. The purpose of this chapter is to provide a step-by-step protocol to perform laser capture microdissection (LCM) on embryo samples and obtain sufficient amounts of high-quality RNA for microarray hybridizations. Using the LCM/microarray strategy on mouse embryo samples has some challenges, because the cells of interest are available in limited quantities. The first step in the protocol is to obtain embryonic tissue, and immediately cryoprotect and freeze it in a cryomold containing Optimal Cutting Temperature freezing media (Sakura Finetek), using a dry ice–isopentane bath. The tissue is then cryosectioned, and the microscope slides are processed to fix, stain, and dehydrate the cells. LCM is employed to isolate specific cell types from the slides, identified under the microscope by virtue of their morphology. Detailed protocols are provided for using the currently available ArcturusXT LCM instrument and CapSure® LCM Caps, to which the selected cells adhere upon laser capture. To maintain RNA integrity, upon removing a slide from the final processing step, or attaching the first cells on the LCM cap, LCM is completed within 20 min. The cells are then immediately recovered from the LCM cap using a denaturing solution that stabilizes RNA integrity. RNA is prepared using standard methods, modified for working with small samples. To ensure the validity of the microarray data, the quality of the RNA is assessed using the Agilent bioanalyzer. Only RNA that is of sufficient integrity and quantity is used to perform microarray assays. This chapter provides guidance regarding troubleshooting and optimization to obtain high-quality RNA from cells of limited availability, obtained from embryo samples by LCM. PMID:24318813
Wang, Wen; Li, Hao; Zhao, Zheng; Wang, Haoyuan; Zhang, Dong; Zhang, Yan; Lan, Qing; Wang, Jiangfei; Cao, Yong; Zhao, Jizong
2018-04-01
Abdominal aortic aneurysms (AAAs) and intracranial saccular aneurysms (IAs) are the most common types of aneurysms. This study was to investigate the common pathogenesis shared between these two kinds of aneurysms. We collected 12 IAs samples and 12 control arteries from the Beijing Tiantan Hospital and performed microarray analysis. In addition, we utilized the microarray datasets of IAs and AAAs from the Gene Expression Omnibus (GEO), in combination with our microarray results, to generate messenger RNA expression profiles for both AAAs and IAs in our study. Functional exploration and protein-protein interaction (PPI) analysis were performed. A total of 727 common genes were differentially expressed (404 was upregulated; 323 was downregulated) for both AAAs and IAs. The GO and pathway analyses showed that the common dysregulated genes were mainly enriched in vascular smooth muscle contraction, muscle contraction, immune response, defense response, cell activation, IL-6 signaling and chemokine signaling pathways, etc. The further protein-protein analysis identified 35 hub nodes, including TNF, IL6, MAPK13, and CCL5. These hub node genes were enriched in inflammatory response, positive regulation of IL-6 production, chemokine signaling pathway, and T/B cell receptor signaling pathway. Our study will gain new insight into the molecular mechanisms for the pathogenesis of both types of aneurysms and provide new therapeutic targets for the patients harboring AAAs and IAs.
Wang, Hong; Bi, Yongyi; Tao, Ning; Wang, Chunhong
2005-08-01
To detect the differential expression of cell signal transduction genes associated with benzene poisoning, and to explore the pathogenic mechanisms of blood system damage induced by benzene. Peripheral white blood cell gene expression profile of 7 benzene poisoning patients, including one aplastic anemia, was determined by cDNA microarray. Seven chips from normal workers were served as controls. Cluster analysis of gene expression profile was performed. Among the 4265 target genes, 176 genes associated with cell signal transduction were differentially expressed. 35 up-regulated genes including PTPRC, STAT4, IFITM1 etc were found in at least 6 pieces of microarray; 45 down-regulated genes including ARHB, PPP3CB, CDC37 etc were found in at least 5 pieces of microarray. cDNA microarray technology is an effective technique for screening the differentially expressed genes of cell signal transduction. Disorder in cell signal transduction may play certain role in the pathogenic mechanism of benzene poisoning.
A study of metaheuristic algorithms for high dimensional feature selection on microarray data
NASA Astrophysics Data System (ADS)
Dankolo, Muhammad Nasiru; Radzi, Nor Haizan Mohamed; Sallehuddin, Roselina; Mustaffa, Noorfa Haszlinna
2017-11-01
Microarray systems enable experts to examine gene profile at molecular level using machine learning algorithms. It increases the potentials of classification and diagnosis of many diseases at gene expression level. Though, numerous difficulties may affect the efficiency of machine learning algorithms which includes vast number of genes features comprised in the original data. Many of these features may be unrelated to the intended analysis. Therefore, feature selection is necessary to be performed in the data pre-processing. Many feature selection algorithms are developed and applied on microarray which including the metaheuristic optimization algorithms. This paper discusses the application of the metaheuristics algorithms for feature selection in microarray dataset. This study reveals that, the algorithms have yield an interesting result with limited resources thereby saving computational expenses of machine learning algorithms.
USDA-ARS?s Scientific Manuscript database
Puccinia striiformis f. sp. tritici (Pst) causes stripe rust, one of the most important diseases of wheat worldwide. To identify Pst genes involved in infection and sporulation, a custom oligonucleotide Genechip was made using sequences of 442 genes selected from Pst cDNA libraries. Microarray analy...
Alexiev, Borislav A; Zou, Ying S
2014-12-01
Chromosomal microarray analysis using novel Molecular Inversion Probe (MIP) technology demonstrated 2,570 kb copy neutral LOH of 10q11.22 in two clear cell papillary renal cell carcinomas. In addition, one of the tumors had a big 29,784 kb deletion of 13q11-q14.2. There were two variants of unknown significance, a 2,509 kb gain of Xp22.33 and a 257 kb homozygous deletion of 8p11.22. The somatic mutation panel containing 74 mutations in nine genes did not reveal any mutations. Besides identification of submicroscopic duplications or deletions, SNP microarrays can reveal abnormal allelic imbalances including LOH and copy neutral LOH, which cannot be recognized by chromosome, FISH, and non-SNP microarray arrays. To the best of our knowledge, this is the first study demonstrating copy neutral LOH of 10q11.22 in clear cell papillary renal cell carcinomas using the new MIP SNP OncoScan FFPE Assay Kit on formalin-fixed paraffin-embedded tumor samples. Copyright © 2014 Elsevier GmbH. All rights reserved.
Sugii, Yuh; Kasai, Tomonari; Ikeda, Masashi; Vaidyanath, Arun; Kumon, Kazuki; Mizutani, Akifumi; Seno, Akimasa; Tokutaka, Heizo; Kudoh, Takayuki; Seno, Masaharu
2016-01-01
To identify cell-specific markers, we designed a DNA microarray platform with oligonucleotide probes for human membrane-anchored proteins. Human glioma cell lines were analyzed using microarray and compared with normal and fetal brain tissues. For the microarray analysis, we employed a spherical self-organizing map, which is a clustering method suitable for the conversion of multidimensional data into two-dimensional data and displays the relationship on a spherical surface. Based on the gene expression profile, the cell surface characteristics were successfully mirrored onto the spherical surface, thereby distinguishing normal brain tissue from the disease model based on the strength of gene expression. The clustered glioma-specific genes were further analyzed by polymerase chain reaction procedure and immunocytochemical staining of glioma cells. Our platform and the following procedure were successfully demonstrated to categorize the genes coding for cell surface proteins that are specific to glioma cells. Our assessment demonstrates that a spherical self-organizing map is a valuable tool for distinguishing cell surface markers and can be employed in marker discovery studies for the treatment of cancer.
Feng, Yinling; Wang, Xuefeng
2017-03-01
In order to investigate commonly disturbed genes and pathways in various brain regions of patients with Parkinson's disease (PD), microarray datasets from previous studies were collected and systematically analyzed. Different normalization methods were applied to microarray datasets from different platforms. A strategy combining gene co‑expression networks and clinical information was adopted, using weighted gene co‑expression network analysis (WGCNA) to screen for commonly disturbed genes in different brain regions of patients with PD. Functional enrichment analysis of commonly disturbed genes was performed using the Database for Annotation, Visualization, and Integrated Discovery (DAVID). Co‑pathway relationships were identified with Pearson's correlation coefficient tests and a hypergeometric distribution‑based test. Common genes in pathway pairs were selected out and regarded as risk genes. A total of 17 microarray datasets from 7 platforms were retained for further analysis. Five gene coexpression modules were identified, containing 9,745, 736, 233, 101 and 93 genes, respectively. One module was significantly correlated with PD samples and thus the 736 genes it contained were considered to be candidate PD‑associated genes. Functional enrichment analysis demonstrated that these genes were implicated in oxidative phosphorylation and PD. A total of 44 pathway pairs and 52 risk genes were revealed, and a risk gene pathway relationship network was constructed. Eight modules were identified and were revealed to be associated with PD, cancers and metabolism. A number of disturbed pathways and risk genes were unveiled in PD, and these findings may help advance understanding of PD pathogenesis.
Gene set analysis approaches for RNA-seq data: performance evaluation and application guideline
Rahmatallah, Yasir; Emmert-Streib, Frank
2016-01-01
Transcriptome sequencing (RNA-seq) is gradually replacing microarrays for high-throughput studies of gene expression. The main challenge of analyzing microarray data is not in finding differentially expressed genes, but in gaining insights into the biological processes underlying phenotypic differences. To interpret experimental results from microarrays, gene set analysis (GSA) has become the method of choice, in particular because it incorporates pre-existing biological knowledge (in a form of functionally related gene sets) into the analysis. Here we provide a brief review of several statistically different GSA approaches (competitive and self-contained) that can be adapted from microarrays practice as well as those specifically designed for RNA-seq. We evaluate their performance (in terms of Type I error rate, power, robustness to the sample size and heterogeneity, as well as the sensitivity to different types of selection biases) on simulated and real RNA-seq data. Not surprisingly, the performance of various GSA approaches depends only on the statistical hypothesis they test and does not depend on whether the test was developed for microarrays or RNA-seq data. Interestingly, we found that competitive methods have lower power as well as robustness to the samples heterogeneity than self-contained methods, leading to poor results reproducibility. We also found that the power of unsupervised competitive methods depends on the balance between up- and down-regulated genes in tested gene sets. These properties of competitive methods have been overlooked before. Our evaluation provides a concise guideline for selecting GSA approaches, best performing under particular experimental settings in the context of RNA-seq. PMID:26342128
Comparison of RNA-seq and microarray-based models for clinical endpoint prediction.
Zhang, Wenqian; Yu, Ying; Hertwig, Falk; Thierry-Mieg, Jean; Zhang, Wenwei; Thierry-Mieg, Danielle; Wang, Jian; Furlanello, Cesare; Devanarayan, Viswanath; Cheng, Jie; Deng, Youping; Hero, Barbara; Hong, Huixiao; Jia, Meiwen; Li, Li; Lin, Simon M; Nikolsky, Yuri; Oberthuer, André; Qing, Tao; Su, Zhenqiang; Volland, Ruth; Wang, Charles; Wang, May D; Ai, Junmei; Albanese, Davide; Asgharzadeh, Shahab; Avigad, Smadar; Bao, Wenjun; Bessarabova, Marina; Brilliant, Murray H; Brors, Benedikt; Chierici, Marco; Chu, Tzu-Ming; Zhang, Jibin; Grundy, Richard G; He, Min Max; Hebbring, Scott; Kaufman, Howard L; Lababidi, Samir; Lancashire, Lee J; Li, Yan; Lu, Xin X; Luo, Heng; Ma, Xiwen; Ning, Baitang; Noguera, Rosa; Peifer, Martin; Phan, John H; Roels, Frederik; Rosswog, Carolina; Shao, Susan; Shen, Jie; Theissen, Jessica; Tonini, Gian Paolo; Vandesompele, Jo; Wu, Po-Yen; Xiao, Wenzhong; Xu, Joshua; Xu, Weihong; Xuan, Jiekun; Yang, Yong; Ye, Zhan; Dong, Zirui; Zhang, Ke K; Yin, Ye; Zhao, Chen; Zheng, Yuanting; Wolfinger, Russell D; Shi, Tieliu; Malkas, Linda H; Berthold, Frank; Wang, Jun; Tong, Weida; Shi, Leming; Peng, Zhiyu; Fischer, Matthias
2015-06-25
Gene expression profiling is being widely applied in cancer research to identify biomarkers for clinical endpoint prediction. Since RNA-seq provides a powerful tool for transcriptome-based applications beyond the limitations of microarrays, we sought to systematically evaluate the performance of RNA-seq-based and microarray-based classifiers in this MAQC-III/SEQC study for clinical endpoint prediction using neuroblastoma as a model. We generate gene expression profiles from 498 primary neuroblastomas using both RNA-seq and 44 k microarrays. Characterization of the neuroblastoma transcriptome by RNA-seq reveals that more than 48,000 genes and 200,000 transcripts are being expressed in this malignancy. We also find that RNA-seq provides much more detailed information on specific transcript expression patterns in clinico-genetic neuroblastoma subgroups than microarrays. To systematically compare the power of RNA-seq and microarray-based models in predicting clinical endpoints, we divide the cohort randomly into training and validation sets and develop 360 predictive models on six clinical endpoints of varying predictability. Evaluation of factors potentially affecting model performances reveals that prediction accuracies are most strongly influenced by the nature of the clinical endpoint, whereas technological platforms (RNA-seq vs. microarrays), RNA-seq data analysis pipelines, and feature levels (gene vs. transcript vs. exon-junction level) do not significantly affect performances of the models. We demonstrate that RNA-seq outperforms microarrays in determining the transcriptomic characteristics of cancer, while RNA-seq and microarray-based models perform similarly in clinical endpoint prediction. Our findings may be valuable to guide future studies on the development of gene expression-based predictive models and their implementation in clinical practice.
A PCR primer bank for quantitative gene expression analysis.
Wang, Xiaowei; Seed, Brian
2003-12-15
Although gene expression profiling by microarray analysis is a useful tool for assessing global levels of transcriptional activity, variability associated with the data sets usually requires that observed differences be validated by some other method, such as real-time quantitative polymerase chain reaction (real-time PCR). However, non-specific amplification of non-target genes is frequently observed in the latter, confounding the analysis in approximately 40% of real-time PCR attempts when primer-specific labels are not used. Here we present an experimentally validated algorithm for the identification of transcript-specific PCR primers on a genomic scale that can be applied to real-time PCR with sequence-independent detection methods. An online database, PrimerBank, has been created for researchers to retrieve primer information for their genes of interest. PrimerBank currently contains 147 404 primers encompassing most known human and mouse genes. The primer design algorithm has been tested by conventional and real-time PCR for a subset of 112 primer pairs with a success rate of 98.2%.
Li, Yong-Fang; Mahalingam, Ramamurthy; Sunkar, Ramanjulu
2017-01-01
Alteration of gene expression is an essential mechanism, which allows plants to respond and adapt to adverse environmental conditions. Transcriptome and proteome analyses in plants exposed to abiotic stresses revealed that protein levels are not correlated with the changes in corresponding mRNAs, indicating regulation at translational level is another major regulator for gene expression. Analysis of translatome, which refers to all mRNAs associated with ribosomes, thus has the potential to bridge the gap between transcriptome and proteome. Polysomal RNA profiling and recently developed ribosome profiling (Ribo-seq) are two main methods for translatome analysis at global level. Here, we describe the classical procedure for polysomal RNA isolation by sucrose gradient ultracentrifugation followed by highthroughput RNA-seq to identify genes regulated at translational level. Polysomal RNA can be further used for a variety of downstream applications including Northern blot analysis, qRT-PCR, RNase protection assay, and microarray-based gene expression profiling.
2009-01-01
Background Large discrepancies in signature composition and outcome concordance have been observed between different microarray breast cancer expression profiling studies. This is often ascribed to differences in array platform as well as biological variability. We conjecture that other reasons for the observed discrepancies are the measurement error associated with each feature and the choice of preprocessing method. Microarray data are known to be subject to technical variation and the confidence intervals around individual point estimates of expression levels can be wide. Furthermore, the estimated expression values also vary depending on the selected preprocessing scheme. In microarray breast cancer classification studies, however, these two forms of feature variability are almost always ignored and hence their exact role is unclear. Results We have performed a comprehensive sensitivity analysis of microarray breast cancer classification under the two types of feature variability mentioned above. We used data from six state of the art preprocessing methods, using a compendium consisting of eight diferent datasets, involving 1131 hybridizations, containing data from both one and two-color array technology. For a wide range of classifiers, we performed a joint study on performance, concordance and stability. In the stability analysis we explicitly tested classifiers for their noise tolerance by using perturbed expression profiles that are based on uncertainty information directly related to the preprocessing methods. Our results indicate that signature composition is strongly influenced by feature variability, even if the array platform and the stratification of patient samples are identical. In addition, we show that there is often a high level of discordance between individual class assignments for signatures constructed on data coming from different preprocessing schemes, even if the actual signature composition is identical. Conclusion Feature variability can have a strong impact on breast cancer signature composition, as well as the classification of individual patient samples. We therefore strongly recommend that feature variability is considered in analyzing data from microarray breast cancer expression profiling experiments. PMID:19941644
Ávila-Fernández, Almudena; Cantalapiedra, Diego; Aller, Elena; Vallespín, Elena; Aguirre-Lambán, Jana; Blanco-Kelly, Fiona; Corton, M; Riveiro-Álvarez, Rosa; Allikmets, Rando; Trujillo-Tiebas, María José; Millán, José M; Cremers, Frans P M; Ayuso, Carmen
2010-12-03
Retinitis pigmentosa (RP) is a genetically heterogeneous disorder characterized by progressive loss of vision. The aim of this study was to identify the causative mutations in 272 Spanish families using a genotyping microarray. 272 unrelated Spanish families, 107 with autosomal recessive RP (arRP) and 165 with sporadic RP (sRP), were studied using the APEX genotyping microarray. The families were also classified by clinical criteria: 86 juveniles and 186 typical RP families. Haplotype and sequence analysis were performed to identify the second mutated allele. At least one-gene variant was found in 14% and 16% of the juvenile and typical RP groups respectively. Further study identified four new mutations, providing both causative changes in 11% of the families. Retinol Dehydrogenase 12 (RDH12) was the most frequently mutated gene in the juvenile RP group, and Usher Syndrome 2A (USH2A) and Ceramide Kinase-Like (CERKL) were the most frequently mutated genes in the typical RP group. The only variant found in CERKL was p.Arg257Stop, the most frequent mutation. The genotyping microarray combined with segregation and sequence analysis allowed us to identify the causative mutations in 11% of the families. Due to the low number of characterized families, this approach should be used in tandem with other techniques.
Lovell, Peter V; Huizinga, Nicole A; Getachew, Abel; Mees, Brianna; Friedrich, Samantha R; Wirthlin, Morgan; Mello, Claudio V
2018-05-18
Zebra finches are a major model organism for investigating mechanisms of vocal learning, a trait that enables spoken language in humans. The development of cDNA collections with expressed sequence tags (ESTs) and microarrays has allowed for extensive molecular characterizations of circuitry underlying vocal learning and production. However, poor database curation can lead to errors in transcriptome and bioinformatics analyses, limiting the impact of these resources. Here we used genomic alignments and synteny analysis for orthology verification to curate and reannotate ~ 35% of the oligonucleotides and corresponding ESTs/cDNAs that make-up Agilent microarrays for gene expression analysis in finches. We found that: (1) 5475 out of 43,084 oligos (a) failed to align to the zebra finch genome, (b) aligned to multiple loci, or (c) aligned to Chr_un only, and thus need to be flagged until a better genome assembly is available, or (d) reflect cloning artifacts; (2) Out of 9635 valid oligos examined further, 3120 were incorrectly named, including 1533 with no known orthologs; and (3) 2635 oligos required name update. The resulting curated dataset provides a reference for correcting gene identification errors in previous finch microarrays studies, and avoiding such errors in future studies.
A Model-Based Joint Identification of Differentially Expressed Genes and Phenotype-Associated Genes
Seo, Minseok; Shin, Su-kyung; Kwon, Eun-Young; Kim, Sung-Eun; Bae, Yun-Jung; Lee, Seungyeoun; Sung, Mi-Kyung; Choi, Myung-Sook; Park, Taesung
2016-01-01
Over the last decade, many analytical methods and tools have been developed for microarray data. The detection of differentially expressed genes (DEGs) among different treatment groups is often a primary purpose of microarray data analysis. In addition, association studies investigating the relationship between genes and a phenotype of interest such as survival time are also popular in microarray data analysis. Phenotype association analysis provides a list of phenotype-associated genes (PAGs). However, it is sometimes necessary to identify genes that are both DEGs and PAGs. We consider the joint identification of DEGs and PAGs in microarray data analyses. The first approach we used was a naïve approach that detects DEGs and PAGs separately and then identifies the genes in an intersection of the list of PAGs and DEGs. The second approach we considered was a hierarchical approach that detects DEGs first and then chooses PAGs from among the DEGs or vice versa. In this study, we propose a new model-based approach for the joint identification of DEGs and PAGs. Unlike the previous two-step approaches, the proposed method identifies genes simultaneously that are DEGs and PAGs. This method uses standard regression models but adopts different null hypothesis from ordinary regression models, which allows us to perform joint identification in one-step. The proposed model-based methods were evaluated using experimental data and simulation studies. The proposed methods were used to analyze a microarray experiment in which the main interest lies in detecting genes that are both DEGs and PAGs, where DEGs are identified between two diet groups and PAGs are associated with four phenotypes reflecting the expression of leptin, adiponectin, insulin-like growth factor 1, and insulin. Model-based approaches provided a larger number of genes, which are both DEGs and PAGs, than other methods. Simulation studies showed that they have more power than other methods. Through analysis of data from experimental microarrays and simulation studies, the proposed model-based approach was shown to provide a more powerful result than the naïve approach and the hierarchical approach. Since our approach is model-based, it is very flexible and can easily handle different types of covariates. PMID:26964035
A novel X-linked disorder with developmental delay and autistic features.
Kaya, Namik; Colak, Dilek; Albakheet, Albandary; Al-Owain, Mohammad; Abu-Dheim, Nada; Al-Younes, Banan; Al-Zahrani, Jawaher; Mukaddes, Nahit M; Dervent, Aysin; Al-Dosari, Naji; Al-Odaib, Ali; Kayaalp, Inci V; Al-Sayed, Moeenaladin; Al-Hassnan, Zuhair; Nester, Michael J; Al-Dosari, Mohammad; Al-Dhalaan, Hesham; Chedrawi, Aziza; Gunoz, Hulya; Karakas, Bedri; Sakati, Nadia; Alkuraya, Fowzan S; Gascon, Generaso G; Ozand, Pinar T
2012-04-01
Genomic duplications that lead to autism and other human diseases are interesting pathological lesions since the underlying mechanism almost certainly involves dosage sensitive genes. We aim to understand a novel genomic disorder with profound phenotypic consequences, most notably global developmental delay, autism, psychosis, and anorexia nervosa. We evaluated the affected individuals, all maternally related, using childhood autism rating scale (CARS) and Vineland Adaptive scales, magnetic resonance imaging (MRI) and magnetic resonance spectroscopy (MRS) brain, electroencephalography (EEG), electromyography (EMG), muscle biopsy, high-resolution molecular karyotype arrays, Giemsa banding (G-banding) and fluorescent in situ hybridization (FISH) experiments, mitochondrial DNA (mtDNA) sequencing, X-chromosome inactivation study, global gene expression analysis on Epstein-Barr virus (EBV)-transformed lymphoblasts, and quantitative reverse-transcription polymerase chain reaction (qRT-PCR). We have identified a novel Xq12-q13.3 duplication in an extended family. Clinically normal mothers were completely skewed in favor of the normal chromosome X. Global transcriptional profiling of affected individuals and controls revealed significant alterations of genes and pathways in a pattern consistent with previous microarray studies of autism spectrum disorder patients. Moreover, expression analysis revealed copy number-dependent increased messenger RNA (mRNA) levels in affected patients compared to control individuals. A subset of differentially expressed genes was validated using qRT-PCR. Xq12-q13.3 duplication is a novel global developmental delay and autism-predisposing chromosomal aberration; pathogenesis of which may be mediated by increased dosage of genes contained in the duplication, including NLGN3, OPHN1, AR, EFNB1, TAF1, GJB1, and MED12. Copyright © 2011 American Neurological Association.
Gao, Hui; Zhao, Chunyan
2018-01-01
Chromatin immunoprecipitation (ChIP) has become the most effective and widely used tool to study the interactions between specific proteins or modified forms of proteins and a genomic DNA region. Combined with genome-wide profiling technologies, such as microarray hybridization (ChIP-on-chip) or massively parallel sequencing (ChIP-seq), ChIP could provide a genome-wide mapping of in vivo protein-DNA interactions in various organisms. Here, we describe a protocol of ChIP-on-chip that uses tiling microarray to obtain a genome-wide profiling of ChIPed DNA.
A hybrid approach to device integration on a genetic analysis platform
NASA Astrophysics Data System (ADS)
Brennan, Des; Jary, Dorothee; Kurg, Ants; Berik, Evgeny; Justice, John; Aherne, Margaret; Macek, Milan; Galvin, Paul
2012-10-01
Point-of-care (POC) systems require significant component integration to implement biochemical protocols associated with molecular diagnostic assays. Hybrid platforms where discrete components are combined in a single platform are a suitable approach to integration, where combining multiple device fabrication steps on a single substrate is not possible due to incompatible or costly fabrication steps. We integrate three devices each with a specific system functionality: (i) a silicon electro-wetting-on-dielectric (EWOD) device to move and mix sample and reagent droplets in an oil phase, (ii) a polymer microfluidic chip containing channels and reservoirs and (iii) an aqueous phase glass microarray for fluorescence microarray hybridization detection. The EWOD device offers the possibility of fully integrating on-chip sample preparation using nanolitre sample and reagent volumes. A key challenge is sample transfer from the oil phase EWOD device to the aqueous phase microarray for hybridization detection. The EWOD device, waveguide performance and functionality are maintained during the integration process. An on-chip biochemical protocol for arrayed primer extension (APEX) was implemented for single nucleotide polymorphism (SNiP) analysis. The prepared sample is aspirated from the EWOD oil phase to the aqueous phase microarray for hybridization. A bench-top instrumentation system was also developed around the integrated platform to drive the EWOD electrodes, implement APEX sample heating and image the microarray after hybridization.
Genome-wide identification of WRKY family genes and their response to cold stress in Vitis vinifera
2014-01-01
Background WRKY transcription factors are one of the largest families of transcriptional regulators in plants. WRKY genes are not only found to play significant roles in biotic and abiotic stress response, but also regulate growth and development. Grapevine (Vitis vinifera) production is largely limited by stressful climate conditions such as cold stress and the role of WRKY genes in the survival of grapevine under these conditions remains unknown. Results We identified a total of 59 VvWRKYs from the V. vinifera genome, belonging to four subgroups according to conserved WRKY domains and zinc-finger structure. The majority of VvWRKYs were expressed in more than one tissue among the 7 tissues examined which included young leaves, mature leaves, tendril, stem apex, root, young fruits and ripe fruits. Publicly available microarray data suggested that a subset of VvWRKYs was activated in response to diverse stresses. Quantitative real-time PCR (qRT-PCR) results demonstrated that the expression levels of 36 VvWRKYs are changed following cold exposure. Comparative analysis was performed on data from publicly available microarray experiments, previous global transcriptome analysis studies, and qRT-PCR. We identified 15 VvWRKYs in at least two of these databases which may relate to cold stress. Among them, the transcription of three genes can be induced by exogenous ABA application, suggesting that they can be involved in an ABA-dependent signaling pathway in response to cold stress. Conclusions We identified 59 VvWRKYs from the V. vinifera genome and 15 of them showed cold stress-induced expression patterns. These genes represented candidate genes for future functional analysis of VvWRKYs involved in the low temperature-related signal pathways in grape. PMID:24755338
NASA Astrophysics Data System (ADS)
Kikuchi, Shoshi
2009-02-01
Completion of the high-precision genome sequence analysis of rice led to the collection of about 35,000 full-length cDNA clones and the determination of their complete sequences. Mapping of these full-length cDNA sequences has given us information on (1) the number of genes expressed in the rice genome; (2) the start and end positions and exon-intron structures of rice genes; (3) alternative transcripts; (4) possible encoded proteins; (5) non-protein-coding (np) RNAs; (6) the density of gene localization on the chromosome; (7) setting the parameters of gene prediction programs; and (8) the construction of a microarray system that monitors global gene expression. Manual curation for rice gene annotation by using mapping information on full-length cDNA and EST assemblies has revealed about 32,000 expressed genes in the rice genome. Analysis of major gene families, such as those encoding membrane transport proteins (pumps, ion channels, and secondary transporters), along with the evolution from bacteria to higher animals and plants, reveals how gene numbers have increased through adaptation to circumstances. Family-based gene annotation also gives us a new way of comparing organisms. Massive amounts of data on gene expression under many kinds of physiological conditions are being accumulated in rice oligoarrays (22K and 44K) based on full-length cDNA sequences. Cluster analyses of genes that have the same promoter cis-elements, that have similar expression profiles, or that encode enzymes in the same metabolic pathways or signal transduction cascades give us clues to understanding the networks of gene expression in rice. As a tool for that purpose, we recently developed "RiCES", a tool for searching for cis-elements in the promoter regions of clustered genes.
Changes in Polysome Association of mRNA Throughout Growth and Development in Arabidopsis thaliana.
Yamasaki, Shotaro; Matsuura, Hideyuki; Demura, Taku; Kato, Ko
2015-11-01
Translational control is a key regulatory step in the expression of genes as proteins. In plant cells, the translational efficiency of mRNAs differs for different mRNA species, and the efficiency dynamically changes in various conditions. To gain a global view of translational control throughout growth and development, we performed genome-wide analysis of polysome association of mRNA during growth and leaf development in Arabidopsis thaliana by subjecting the mRNAs in polysomes to DNA microarray. This analysis revealed that the degree of polysome association of mRNA was different depending on the mRNA species, and the polysome association changed greatly throughout growth and development for each. In the growth stage, transcripts showed varying changes in polysome association from strongly depressed to unchanged, with the majority of transcripts showing dissociation from ribosomes. On the other hand, during leaf development, the polysome association of transcripts showed a normal distribution from repressed to activated mRNAs when comparing expanding and expanded leaves. In addition, functional category analysis of the microarray data suggested that translational control has a physiological significance in the plant growth and development process, especially in the categories of signaling and protein synthesis. In addition to this, we compared changes in polysome association of mRNAs between various conditions and characterized translational controls in each. This result suggested that mRNA translation might be controlled by complicated mechanisms for response to each condition. Our results highlight the importance of dynamic changes in mRNA translation in plant development and growth. © The Author 2015. Published by Oxford University Press on behalf of Japanese Society of Plant Physiologists. All rights reserved. For permissions, please email: journals.permissions@oup.com.
Selection of higher order regression models in the analysis of multi-factorial transcription data.
Prazeres da Costa, Olivia; Hoffman, Arthur; Rey, Johannes W; Mansmann, Ulrich; Buch, Thorsten; Tresch, Achim
2014-01-01
Many studies examine gene expression data that has been obtained under the influence of multiple factors, such as genetic background, environmental conditions, or exposure to diseases. The interplay of multiple factors may lead to effect modification and confounding. Higher order linear regression models can account for these effects. We present a new methodology for linear model selection and apply it to microarray data of bone marrow-derived macrophages. This experiment investigates the influence of three variable factors: the genetic background of the mice from which the macrophages were obtained, Yersinia enterocolitica infection (two strains, and a mock control), and treatment/non-treatment with interferon-γ. We set up four different linear regression models in a hierarchical order. We introduce the eruption plot as a new practical tool for model selection complementary to global testing. It visually compares the size and significance of effect estimates between two nested models. Using this methodology we were able to select the most appropriate model by keeping only relevant factors showing additional explanatory power. Application to experimental data allowed us to qualify the interaction of factors as either neutral (no interaction), alleviating (co-occurring effects are weaker than expected from the single effects), or aggravating (stronger than expected). We find a biologically meaningful gene cluster of putative C2TA target genes that appear to be co-regulated with MHC class II genes. We introduced the eruption plot as a tool for visual model comparison to identify relevant higher order interactions in the analysis of expression data obtained under the influence of multiple factors. We conclude that model selection in higher order linear regression models should generally be performed for the analysis of multi-factorial microarray data.
Howat, William J; Blows, Fiona M; Provenzano, Elena; Brook, Mark N; Morris, Lorna; Gazinska, Patrycja; Johnson, Nicola; McDuffus, Leigh‐Anne; Miller, Jodi; Sawyer, Elinor J; Pinder, Sarah; van Deurzen, Carolien H M; Jones, Louise; Sironen, Reijo; Visscher, Daniel; Caldas, Carlos; Daley, Frances; Coulson, Penny; Broeks, Annegien; Sanders, Joyce; Wesseling, Jelle; Nevanlinna, Heli; Fagerholm, Rainer; Blomqvist, Carl; Heikkilä, Päivi; Ali, H Raza; Dawson, Sarah‐Jane; Figueroa, Jonine; Lissowska, Jolanta; Brinton, Louise; Mannermaa, Arto; Kataja, Vesa; Kosma, Veli‐Matti; Cox, Angela; Brock, Ian W; Cross, Simon S; Reed, Malcolm W; Couch, Fergus J; Olson, Janet E; Devillee, Peter; Mesker, Wilma E; Seyaneve, Caroline M; Hollestelle, Antoinette; Benitez, Javier; Perez, Jose Ignacio Arias; Menéndez, Primitiva; Bolla, Manjeet K; Easton, Douglas F; Schmidt, Marjanka K; Pharoah, Paul D; Sherman, Mark E
2014-01-01
Abstract Breast cancer risk factors and clinical outcomes vary by tumour marker expression. However, individual studies often lack the power required to assess these relationships, and large‐scale analyses are limited by the need for high throughput, standardized scoring methods. To address these limitations, we assessed whether automated image analysis of immunohistochemically stained tissue microarrays can permit rapid, standardized scoring of tumour markers from multiple studies. Tissue microarray sections prepared in nine studies containing 20 263 cores from 8267 breast cancers stained for two nuclear (oestrogen receptor, progesterone receptor), two membranous (human epidermal growth factor receptor 2 and epidermal growth factor receptor) and one cytoplasmic (cytokeratin 5/6) marker were scanned as digital images. Automated algorithms were used to score markers in tumour cells using the Ariol system. We compared automated scores against visual reads, and their associations with breast cancer survival. Approximately 65–70% of tissue microarray cores were satisfactory for scoring. Among satisfactory cores, agreement between dichotomous automated and visual scores was highest for oestrogen receptor (Kappa = 0.76), followed by human epidermal growth factor receptor 2 (Kappa = 0.69) and progesterone receptor (Kappa = 0.67). Automated quantitative scores for these markers were associated with hazard ratios for breast cancer mortality in a dose‐response manner. Considering visual scores of epidermal growth factor receptor or cytokeratin 5/6 as the reference, automated scoring achieved excellent negative predictive value (96–98%), but yielded many false positives (positive predictive value = 30–32%). For all markers, we observed substantial heterogeneity in automated scoring performance across tissue microarrays. Automated analysis is a potentially useful tool for large‐scale, quantitative scoring of immunohistochemically stained tissue microarrays available in consortia. However, continued optimization, rigorous marker‐specific quality control measures and standardization of tissue microarray designs, staining and scoring protocols is needed to enhance results. PMID:27499890
USDA-ARS?s Scientific Manuscript database
A computer algorithm was created to inspect scanned images from DNA microarray slides developed to rapidly detect and genotype E. Coli O157 virulent strains. The algorithm computes centroid locations for signal and background pixels in RGB space and defines a plane perpendicular to the line connect...
Assessing probe-specific dye and slide biases in two-color microarray data
USDA-ARS?s Scientific Manuscript database
A primary reason for using two-color microarrays is that the use of two samples labeled with different dyes on the same slide and that bind to probes on the same spot is supposed to adjust for many factors that introduce noise and errors into the analysis. Most users assume that any differences bet...
Reif, David M.; Israel, Mark A.; Moore, Jason H.
2007-01-01
The biological interpretation of gene expression microarray results is a daunting challenge. For complex diseases such as cancer, wherein the body of published research is extensive, the incorporation of expert knowledge provides a useful analytical framework. We have previously developed the Exploratory Visual Analysis (EVA) software for exploring data analysis results in the context of annotation information about each gene, as well as biologically relevant groups of genes. We present EVA as a flexible combination of statistics and biological annotation that provides a straightforward visual interface for the interpretation of microarray analyses of gene expression in the most commonly occuring class of brain tumors, glioma. We demonstrate the utility of EVA for the biological interpretation of statistical results by analyzing publicly available gene expression profiles of two important glial tumors. The results of a statistical comparison between 21 malignant, high-grade glioblastoma multiforme (GBM) tumors and 19 indolent, low-grade pilocytic astrocytomas were analyzed using EVA. By using EVA to examine the results of a relatively simple statistical analysis, we were able to identify tumor class-specific gene expression patterns having both statistical and biological significance. Our interactive analysis highlighted the potential importance of genes involved in cell cycle progression, proliferation, signaling, adhesion, migration, motility, and structure, as well as candidate gene loci on a region of Chromosome 7 that has been implicated in glioma. Because EVA does not require statistical or computational expertise and has the flexibility to accommodate any type of statistical analysis, we anticipate EVA will prove a useful addition to the repertoire of computational methods used for microarray data analysis. EVA is available at no charge to academic users and can be found at http://www.epistasis.org. PMID:19390666
Autoregressive-model-based missing value estimation for DNA microarray time series data.
Choong, Miew Keen; Charbit, Maurice; Yan, Hong
2009-01-01
Missing value estimation is important in DNA microarray data analysis. A number of algorithms have been developed to solve this problem, but they have several limitations. Most existing algorithms are not able to deal with the situation where a particular time point (column) of the data is missing entirely. In this paper, we present an autoregressive-model-based missing value estimation method (ARLSimpute) that takes into account the dynamic property of microarray temporal data and the local similarity structures in the data. ARLSimpute is especially effective for the situation where a particular time point contains many missing values or where the entire time point is missing. Experiment results suggest that our proposed algorithm is an accurate missing value estimator in comparison with other imputation methods on simulated as well as real microarray time series datasets.
Karim, Ahmad Faisal; Chandra, Pallavi; Chopra, Aanchal; Siddiqui, Zaved; Bhaskar, Ashima; Singh, Amit; Kumar, Dhiraj
2011-11-18
Global gene expression profiling has emerged as a major tool in understanding complex response patterns of biological systems to perturbations. However, a lack of unbiased analytical approaches has restricted the utility of complex microarray data to gain novel system level insights. Here we report a strategy, express path analysis (EPA), that helps to establish various pathways differentially recruited to achieve specific cellular responses under contrasting environmental conditions in an unbiased manner. The analysis superimposes differentially regulated genes between contrasting environments onto the network of functional protein associations followed by a series of iterative enrichments and network analysis. To test the utility of the approach, we infected THP1 macrophage cells with a virulent Mycobacterium tuberculosis strain (H37Rv) or the attenuated non-virulent strain H37Ra as contrasting perturbations and generated the temporal global expression profiles. EPA of the results provided details of response-specific and time-dependent host molecular network perturbations. Further analysis identified tyrosine kinase Src as the major regulatory hub discriminating the responses between wild-type and attenuated Mtb infection. We were then able to verify this novel role of Src experimentally and show that Src executes its role through regulating two vital antimicrobial processes of the host cells (i.e. autophagy and acidification of phagolysosome). These results bear significant potential for developing novel anti-tuberculosis therapy. We propose that EPA could prove extremely useful in understanding complex cellular responses for a variety of perturbations, including pathogenic infections.
Drost, Derek R; Novaes, Evandro; Boaventura-Novaes, Carolina; Benedict, Catherine I; Brown, Ryan S; Yin, Tongming; Tuskan, Gerald A; Kirst, Matias
2009-06-01
Microarrays have demonstrated significant power for genome-wide analyses of gene expression, and recently have also revolutionized the genetic analysis of segregating populations by genotyping thousands of loci in a single assay. Although microarray-based genotyping approaches have been successfully applied in yeast and several inbred plant species, their power has not been proven in an outcrossing species with extensive genetic diversity. Here we have developed methods for high-throughput microarray-based genotyping in such species using a pseudo-backcross progeny of 154 individuals of Populus trichocarpa and P. deltoides analyzed with long-oligonucleotide in situ-synthesized microarray probes. Our analysis resulted in high-confidence genotypes for 719 single-feature polymorphism (SFP) and 1014 gene expression marker (GEM) candidates. Using these genotypes and an established microsatellite (SSR) framework map, we produced a high-density genetic map comprising over 600 SFPs, GEMs and SSRs. The abundance of gene-based markers allowed us to localize over 35 million base pairs of previously unplaced whole-genome shotgun (WGS) scaffold sequence to putative locations in the genome of P. trichocarpa. A high proportion of sampled scaffolds could be verified for their placement with independently mapped SSRs, demonstrating the previously un-utilized power that high-density genotyping can provide in the context of map-based WGS sequence reassembly. Our results provide a substantial contribution to the continued improvement of the Populus genome assembly, while demonstrating the feasibility of microarray-based genotyping in a highly heterozygous population. The strategies presented are applicable to genetic mapping efforts in all plant species with similarly high levels of genetic diversity.
Cruella: developing a scalable tissue microarray data management system.
Cowan, James D; Rimm, David L; Tuck, David P
2006-06-01
Compared with DNA microarray technology, relatively little information is available concerning the special requirements, design influences, and implementation strategies of data systems for tissue microarray technology. These issues include the requirement to accommodate new and different data elements for each new project as well as the need to interact with pre-existing models for clinical, biological, and specimen-related data. To design and implement a flexible, scalable tissue microarray data storage and management system that could accommodate information regarding different disease types and different clinical investigators, and different clinical investigation questions, all of which could potentially contribute unforeseen data types that require dynamic integration with existing data. The unpredictability of the data elements combined with the novelty of automated analysis algorithms and controlled vocabulary standards in this area require flexible designs and practical decisions. Our design includes a custom Java-based persistence layer to mediate and facilitate interaction with an object-relational database model and a novel database schema. User interaction is provided through a Java Servlet-based Web interface. Cruella has become an indispensable resource and is used by dozens of researchers every day. The system stores millions of experimental values covering more than 300 biological markers and more than 30 disease types. The experimental data are merged with clinical data that has been aggregated from multiple sources and is available to the researchers for management, analysis, and export. Cruella addresses many of the special considerations for managing tissue microarray experimental data and the associated clinical information. A metadata-driven approach provides a practical solution to many of the unique issues inherent in tissue microarray research, and allows relatively straightforward interoperability with and accommodation of new data models.
Evaluation of the skin irritation using a DNA microarray on a reconstructed human epidermal model.
Niwa, Makoto; Nagai, Kanji; Oike, Hideaki; Kobori, Masuko
2009-02-01
To avoid the need to use animals to test the skin irritancy potential of chemicals and cosmetics, it is important to establish an in vitro method based on the reconstructed human epidermal model. To evaluate skin irritancy efficiently and sensitively, we determined the gene expression induced by a topically-applied mild irritant sodium dodecyl sulfate (SDS) in a reconstructed human epidermal model LabCyte EPI-MODEL (LabCyte) using a DNA microarray carrying genes that were related to inflammation, immunity, stress and housekeeping. The expression and secretion of IL-1alpha in reconstructed human epidermal culture is known to be induced by irritation. We detected the induction of IL-1alpha expression and its secretion into the cell culture medium by treatment with 0.075% SDS for 18 h in LabCyte culture using DNA microarray, quantitative reverse-transcription polymerase chain reaction (RT-PCR) and ELISA. DNA microarray analysis indicated that the expression of 10 of the 205 genes carried on the DNA microarray was significantly induced in a LabCyte culture by 0.05% or 0.075% SDS irritation for 18 h. RT-PCR analysis confirmed that SDS treatment significantly induced the expressions of interleukin-1 receptor antagonist (IL-1RN), FOS-like antigen 1 (FOSL1), heat shock 70 kDa protein 1A (HSPA1) and myeloid differentiation primary response gene (88) (MYD88), as well as the known marker genes for irritation IL-1beta and IL-8 in a LabCyte culture. Our results showed that a DNA microarray is a useful tool for efficiently evaluating mild skin irritation using a reconstructed human epidermal model.
Booman, Marije; Borza, Tudor; Feng, Charles Y; Hori, Tiago S; Higgins, Brent; Culf, Adrian; Léger, Daniel; Chute, Ian C; Belkaid, Anissa; Rise, Marlies; Gamperl, A Kurt; Hubert, Sophie; Kimball, Jennifer; Ouellette, Rodney J; Johnson, Stewart C; Bowman, Sharen; Rise, Matthew L
2011-08-01
The collapse of Atlantic cod (Gadus morhua) wild populations strongly impacted the Atlantic cod fishery and led to the development of cod aquaculture. In order to improve aquaculture and broodstock quality, we need to gain knowledge of genes and pathways involved in Atlantic cod responses to pathogens and other stressors. The Atlantic Cod Genomics and Broodstock Development Project has generated over 150,000 expressed sequence tags from 42 cDNA libraries representing various tissues, developmental stages, and stimuli. We used this resource to develop an Atlantic cod oligonucleotide microarray containing 20,000 unique probes. Selection of sequences from the full range of cDNA libraries enables application of the microarray for a broad spectrum of Atlantic cod functional genomics studies. We included sequences that were highly abundant in suppression subtractive hybridization (SSH) libraries, which were enriched for transcripts responsive to pathogens or other stressors. These sequences represent genes that potentially play an important role in stress and/or immune responses, making the microarray particularly useful for studies of Atlantic cod gene expression responses to immune stimuli and other stressors. To demonstrate its value, we used the microarray to analyze the Atlantic cod spleen response to stimulation with formalin-killed, atypical Aeromonas salmonicida, resulting in a gene expression profile that indicates a strong innate immune response. These results were further validated by quantitative PCR analysis and comparison to results from previous analysis of an SSH library. This study shows that the Atlantic cod 20K oligonucleotide microarray is a valuable new tool for Atlantic cod functional genomics research.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Jaing, C; Gardner, S
The goal of this project is to develop forensic genotyping assays for select agent viruses, enhancing the current capabilities for the viral bioforensics and law enforcement community. We used a multipronged approach combining bioinformatics analysis, PCR-enriched samples, microarrays and TaqMan assays to develop high resolution and cost effective genotyping methods for strain level forensic discrimination of viruses. We have leveraged substantial experience and efficiency gained through year 1 on software development, SNP discovery, TaqMan signature design and phylogenetic signature mapping to scale up the development of forensics signatures in year 2. In this report, we have summarized the whole genomemore » wide SNP analysis and microarray probe design for forensics characterization of South American hemorrhagic fever viruses, tick-borne encephalitis viruses and henipaviruses, Old World Arenaviruses, filoviruses, Crimean-Congo hemorrhagic fever virus, Rift Valley fever virus and Japanese encephalitis virus.« less
Jeong, J; Bong, J; Kim, G D; Joo, S T; Lee, H-J; Baik, M
2013-10-01
Castration increases intramuscular fat (IMF) deposition, improving beef quality in cattle. The present study was performed to determine the global transcriptome changes following castration of bulls and to identify genes associated with IMF deposition in the longissimus dorsi (LM) of Korean cattle. A customized bovine CombiMatrix oligonucleotide microarray was constructed, and transcriptome changes following castration were determined by microarray hybridization. Transcriptome comparison between bulls and steers indicated that 428 of 8,407 genes were differentially expressed in the LM by greater than two fold (P < 0.05). Gene expression profiling indicated alterations in several pathways, including adipogenesis, fatty acid oxidation, tricarboxylic acid (TCA) cycle, and oxidative phosphorylation (OP), following castration. Castration upregulated transcription of adipogenic perilipin 2 (PLIN2) and visfatin, lipogenic fatty acid synthase, fatty acid esterification 1-acylglycerol-3-phosphate O-acyltransferase 5, and many fatty acid oxidation-related genes. Many TCA cycle and OP genes were also transcriptionally upregulated. Correlation analysis indicated that the IMF content in the LM was highly correlated with mRNA levels of PLIN2 (r = 0.70, P < 0.001), adenosine triphosphatase (ATPase), H(+)-transporting, lysosomal 42 kDa, V1 subunit C1 (ATP6V1C1: r = 0.66, P < 0.001), and cytochrome c oxidase assembly homolog 11 (COX11: r = 0.72, P < 0.001) genes in a pooled animal group of steers plus bulls, and significant correlations in the steer-alone group were maintained in the 3 genes, PLIN2 (r = 0.47, P < 0.05), ATP6V1C1 (r = 0.50, P < 0.05), and COX11 (r = 0.60, P < 0.01). In conclusion, our study provided evidence that castration shifts transcription of lipid metabolism genes, favoring IMF deposition by increasing adipogenesis, lipogenesis, and triglyceride synthesis. This study also indicated that castration increases transcription of genes involved in fatty acid oxidation and subsequent energy production (TCA and OP genes). Our microarray analysis provided novel information that castration alters the transcriptome associated with lipid/energy metabolism, favoring IMF deposition in the LM.
Multi-membership gene regulation in pathway based microarray analysis
2011-01-01
Background Gene expression analysis has been intensively researched for more than a decade. Recently, there has been elevated interest in the integration of microarray data analysis with other types of biological knowledge in a holistic analytical approach. We propose a methodology that can be facilitated for pathway based microarray data analysis, based on the observation that a substantial proportion of genes present in biochemical pathway databases are members of a number of distinct pathways. Our methodology aims towards establishing the state of individual pathways, by identifying those truly affected by the experimental conditions based on the behaviour of such genes. For that purpose it considers all the pathways in which a gene participates and the general census of gene expression per pathway. Results We utilise hill climbing, simulated annealing and a genetic algorithm to analyse the consistency of the produced results, through the application of fuzzy adjusted rand indexes and hamming distance. All algorithms produce highly consistent genes to pathways allocations, revealing the contribution of genes to pathway functionality, in agreement with current pathway state visualisation techniques, with the simulated annealing search proving slightly superior in terms of efficiency. Conclusions We show that the expression values of genes, which are members of a number of biochemical pathways or modules, are the net effect of the contribution of each gene to these biochemical processes. We show that by manipulating the pathway and module contribution of such genes to follow underlying trends we can interpret microarray results centred on the behaviour of these genes. PMID:21939531
Multi-membership gene regulation in pathway based microarray analysis.
Pavlidis, Stelios P; Payne, Annette M; Swift, Stephen M
2011-09-22
Gene expression analysis has been intensively researched for more than a decade. Recently, there has been elevated interest in the integration of microarray data analysis with other types of biological knowledge in a holistic analytical approach. We propose a methodology that can be facilitated for pathway based microarray data analysis, based on the observation that a substantial proportion of genes present in biochemical pathway databases are members of a number of distinct pathways. Our methodology aims towards establishing the state of individual pathways, by identifying those truly affected by the experimental conditions based on the behaviour of such genes. For that purpose it considers all the pathways in which a gene participates and the general census of gene expression per pathway. We utilise hill climbing, simulated annealing and a genetic algorithm to analyse the consistency of the produced results, through the application of fuzzy adjusted rand indexes and hamming distance. All algorithms produce highly consistent genes to pathways allocations, revealing the contribution of genes to pathway functionality, in agreement with current pathway state visualisation techniques, with the simulated annealing search proving slightly superior in terms of efficiency. We show that the expression values of genes, which are members of a number of biochemical pathways or modules, are the net effect of the contribution of each gene to these biochemical processes. We show that by manipulating the pathway and module contribution of such genes to follow underlying trends we can interpret microarray results centred on the behaviour of these genes.
caCORRECT2: Improving the accuracy and reliability of microarray data in the presence of artifacts
2011-01-01
Background In previous work, we reported the development of caCORRECT, a novel microarray quality control system built to identify and correct spatial artifacts commonly found on Affymetrix arrays. We have made recent improvements to caCORRECT, including the development of a model-based data-replacement strategy and integration with typical microarray workflows via caCORRECT's web portal and caBIG grid services. In this report, we demonstrate that caCORRECT improves the reproducibility and reliability of experimental results across several common Affymetrix microarray platforms. caCORRECT represents an advance over state-of-art quality control methods such as Harshlighting, and acts to improve gene expression calculation techniques such as PLIER, RMA and MAS5.0, because it incorporates spatial information into outlier detection as well as outlier information into probe normalization. The ability of caCORRECT to recover accurate gene expressions from low quality probe intensity data is assessed using a combination of real and synthetic artifacts with PCR follow-up confirmation and the affycomp spike in data. The caCORRECT tool can be accessed at the website: http://cacorrect.bme.gatech.edu. Results We demonstrate that (1) caCORRECT's artifact-aware normalization avoids the undesirable global data warping that happens when any damaged chips are processed without caCORRECT; (2) When used upstream of RMA, PLIER, or MAS5.0, the data imputation of caCORRECT generally improves the accuracy of microarray gene expression in the presence of artifacts more than using Harshlighting or not using any quality control; (3) Biomarkers selected from artifactual microarray data which have undergone the quality control procedures of caCORRECT are more likely to be reliable, as shown by both spike in and PCR validation experiments. Finally, we present a case study of the use of caCORRECT to reliably identify biomarkers for renal cell carcinoma, yielding two diagnostic biomarkers with potential clinical utility, PRKAB1 and NNMT. Conclusions caCORRECT is shown to improve the accuracy of gene expression, and the reproducibility of experimental results in clinical application. This study suggests that caCORRECT will be useful to clean up possible artifacts in new as well as archived microarray data. PMID:21957981
Giancarlo, R; Scaturro, D; Utro, F
2015-02-01
The prediction of the number of clusters in a dataset, in particular microarrays, is a fundamental task in biological data analysis, usually performed via validation measures. Unfortunately, it has received very little attention and in fact there is a growing need for software tools/libraries dedicated to it. Here we present ValWorkBench, a software library consisting of eleven well known validation measures, together with novel heuristic approximations for some of them. The main objective of this paper is to provide the interested researcher with the full software documentation of an open source cluster validation platform having the main features of being easily extendible in a homogeneous way and of offering software components that can be readily re-used. Consequently, the focus of the presentation is on the architecture of the library, since it provides an essential map that can be used to access the full software documentation, which is available at the supplementary material website [1]. The mentioned main features of ValWorkBench are also discussed and exemplified, with emphasis on software abstraction design and re-usability. A comparison with existing cluster validation software libraries, mainly in terms of the mentioned features, is also offered. It suggests that ValWorkBench is a much needed contribution to the microarray software development/algorithm engineering community. For completeness, it is important to mention that previous accurate algorithmic experimental analysis of the relative merits of each of the implemented measures [19,23,25], carried out specifically on microarray data, gives useful insights on the effectiveness of ValWorkBench for cluster validation to researchers in the microarray community interested in its use for the mentioned task. Copyright © 2014 Elsevier Ireland Ltd. All rights reserved.
Nguyen, Doan H.; Toshida, Hiroshi; Schurr, Jill; Beuerman, Roger W.
2010-01-01
Previous studies showed that loss of muscarinic parasympathetic input to the lacrimal gland (LG) leads to a dramatic reduction in tear secretion and profound changes to LG structure. In this study, we used DNA microarrays to examine the regulation of the gene expression of the genes for secretory function and organization of the LG. Long-Evans rats anesthetized with a mixture of ketamine/xylazine (80:10 mg/kg) underwent unilateral sectioning of the greater superficial petrosal nerve, the input to the pterygopalatine ganglion. After 7 days, tear secretion was measured, the animals were killed, and structural changes in the LG were examined by light microscopy. Total RNA from control and experimental LGs (n = 5) was used for DNA microarray analysis employing the U34A GeneChip. Three statistical algorithms (detection, change call, and signal log ratio) were used to determine differential gene expression using the Microarray Suite (5.0) and Data Mining Tools (3.0). Tear secretion was significantly reduced and corneal ulcers developed in all experimental eyes. Light microscopy showed breakdown of the acinar structure of the LG. DNA microarray analysis showed downregulation of genes associated with the endoplasmic reticulum and Golgi, including genes involved in protein folding and processing. Conversely, transcripts for cytoskeleton and extracellular matrix components, inflammation, and apoptosis were upregulated. The number of significantly upregulated genes (116) was substantially greater than the number of downregulated genes (49). Removal of the main secretory input to the rat LG resulted in clinical symptoms associated with severe dry eye. Components of the secretory pathway were negatively affected, and the increase in cell proliferation and inflammation may lead to loss of organization in the parasympathectomized lacrimal gland. PMID:15084711
Vallée, Maud; Gravel, Catherine; Palin, Marie-France; Reghenas, Hélène; Stothard, Paul; Wishart, David S; Sirard, Marc-André
2005-07-01
The main objective of the present study was to identify novel oocyte-specific genes in three different species: bovine, mouse, and Xenopus laevis. To achieve this goal, two powerful technologies were combined: a polymerase chain reaction (PCR)-based cDNA subtraction, and cDNA microarrays. Three subtractive libraries consisting of 3456 clones were established and enriched for oocyte-specific transcripts. Sequencing analysis of the positive insert-containing clones resulted in the following classification: 53% of the clones corresponded to known cDNAs, 26% were classified as uncharacterized cDNAs, and a final 9% were classified as novel sequences. All these clones were used for cDNA microarray preparation. Results from these microarray analyses revealed that in addition to already known oocyte-specific genes, such as GDF9, BMP15, and ZP, known genes with unknown function in the oocyte were identified, such as a MLF1-interacting protein (MLF1IP), B-cell translocation gene 4 (BTG4), and phosphotyrosine-binding protein (xPTB). Furthermore, 15 novel oocyte-specific genes were validated by reverse transcription-PCR to confirm their preferential expression in the oocyte compared to somatic tissues. The results obtained in the present study confirmed that microarray analysis is a robust technique to identify true positives from the suppressive subtractive hybridization experiment. Furthermore, obtaining oocyte-specific genes from three species simultaneously allowed us to look at important genes that are conserved across species. Further characterization of these novel oocyte-specific genes will lead to a better understanding of the molecular mechanisms related to the unique functions found in the oocyte.
Microarray analysis of gene expression profiles in ripening pineapple fruits.
Koia, Jonni H; Moyle, Richard L; Botella, Jose R
2012-12-18
Pineapple (Ananas comosus) is a tropical fruit crop of significant commercial importance. Although the physiological changes that occur during pineapple fruit development have been well characterized, little is known about the molecular events that occur during the fruit ripening process. Understanding the molecular basis of pineapple fruit ripening will aid the development of new varieties via molecular breeding or genetic modification. In this study we developed a 9277 element pineapple microarray and used it to profile gene expression changes that occur during pineapple fruit ripening. Microarray analyses identified 271 unique cDNAs differentially expressed at least 1.5-fold between the mature green and mature yellow stages of pineapple fruit ripening. Among these 271 sequences, 184 share significant homology with genes encoding proteins of known function, 53 share homology with genes encoding proteins of unknown function and 34 share no significant homology with any database accession. Of the 237 pineapple sequences with homologs, 160 were up-regulated and 77 were down-regulated during pineapple fruit ripening. DAVID Functional Annotation Cluster (FAC) analysis of all 237 sequences with homologs revealed confident enrichment scores for redox activity, organic acid metabolism, metalloenzyme activity, glycolysis, vitamin C biosynthesis, antioxidant activity and cysteine peptidase activity, indicating the functional significance and importance of these processes and pathways during pineapple fruit development. Quantitative real-time PCR analysis validated the microarray expression results for nine out of ten genes tested. This is the first report of a microarray based gene expression study undertaken in pineapple. Our bioinformatic analyses of the transcript profiles have identified a number of genes, processes and pathways with putative involvement in the pineapple fruit ripening process. This study extends our knowledge of the molecular basis of pineapple fruit ripening and non-climacteric fruit ripening in general.
Microarray analysis of gene expression profiles in ripening pineapple fruits
2012-01-01
Background Pineapple (Ananas comosus) is a tropical fruit crop of significant commercial importance. Although the physiological changes that occur during pineapple fruit development have been well characterized, little is known about the molecular events that occur during the fruit ripening process. Understanding the molecular basis of pineapple fruit ripening will aid the development of new varieties via molecular breeding or genetic modification. In this study we developed a 9277 element pineapple microarray and used it to profile gene expression changes that occur during pineapple fruit ripening. Results Microarray analyses identified 271 unique cDNAs differentially expressed at least 1.5-fold between the mature green and mature yellow stages of pineapple fruit ripening. Among these 271 sequences, 184 share significant homology with genes encoding proteins of known function, 53 share homology with genes encoding proteins of unknown function and 34 share no significant homology with any database accession. Of the 237 pineapple sequences with homologs, 160 were up-regulated and 77 were down-regulated during pineapple fruit ripening. DAVID Functional Annotation Cluster (FAC) analysis of all 237 sequences with homologs revealed confident enrichment scores for redox activity, organic acid metabolism, metalloenzyme activity, glycolysis, vitamin C biosynthesis, antioxidant activity and cysteine peptidase activity, indicating the functional significance and importance of these processes and pathways during pineapple fruit development. Quantitative real-time PCR analysis validated the microarray expression results for nine out of ten genes tested. Conclusions This is the first report of a microarray based gene expression study undertaken in pineapple. Our bioinformatic analyses of the transcript profiles have identified a number of genes, processes and pathways with putative involvement in the pineapple fruit ripening process. This study extends our knowledge of the molecular basis of pineapple fruit ripening and non-climacteric fruit ripening in general. PMID:23245313
Puinean, Alin M; Foster, Stephen P; Oliphant, Linda; Denholm, Ian; Field, Linda M; Millar, Neil S; Williamson, Martin S; Bass, Chris
2010-06-24
The aphid Myzus persicae is a globally significant crop pest that has evolved high levels of resistance to almost all classes of insecticide. To date, the neonicotinoids, an economically important class of insecticides that target nicotinic acetylcholine receptors (nAChRs), have remained an effective control measure; however, recent reports of resistance in M. persicae represent a threat to the long-term efficacy of this chemical class. In this study, the mechanisms underlying resistance to the neonicotinoid insecticides were investigated using biological, biochemical, and genomic approaches. Bioassays on a resistant M. persicae clone (5191A) suggested that P450-mediated detoxification plays a primary role in resistance, although additional mechanism(s) may also contribute. Microarray analysis, using an array populated with probes corresponding to all known detoxification genes in M. persicae, revealed constitutive over-expression (22-fold) of a single P450 gene (CYP6CY3); and quantitative PCR showed that the over-expression is due, at least in part, to gene amplification. This is the first report of a P450 gene amplification event associated with insecticide resistance in an agriculturally important insect pest. The microarray analysis also showed over-expression of several gene sequences that encode cuticular proteins (2-16-fold), and artificial feeding assays and in vivo penetration assays using radiolabeled insecticide provided direct evidence of a role for reduced cuticular penetration in neonicotinoid resistance. Conversely, receptor radioligand binding studies and nucleotide sequencing of nAChR subunit genes suggest that target-site changes are unlikely to contribute to resistance to neonicotinoid insecticides in M. persicae.
2011-01-01
Background Polygalacturonase-inhibiting proteins (PGIPs) directly limit the effective ingress of fungal pathogens by inhibiting cell wall-degrading endopolygalacturonases (ePGs). Transgenic tobacco plants over-expressing grapevine (Vitis vinifera) Vvpgip1 have previously been shown to be resistant to Botrytis infection. In this study we characterized two of these PGIP over-expressing lines with known resistance phenotypes by gene expression and hormone profiling in the absence of pathogen infection. Results Global gene expression was performed by a cross-species microarray approach using a potato cDNA microarray. The degree of potential cross-hybridization between probes was modeled by a novel computational workflow designed in-house. Probe annotations were updated by predicting probe-to-transcript hybridizations and combining information derived from other plant species. Comparing uninfected Vvpgip1-overexpressing lines to wild-type (WT), 318 probes showed significant change in expression. Functional groups of genes involved in metabolism and associated to the cell wall were identified and consequent cell wall analysis revealed increased lignin-levels in the transgenic lines, but no major differences in cell wall-derived polysaccharides. GO enrichment analysis also identified genes responsive to auxin, which was supported by elevated indole-acetic acid (IAA) levels in the transgenic lines. Finally, a down-regulation of xyloglucan endotransglycosylase/hydrolases (XTHs), which are important in cell wall remodeling, was linked to a decrease in total XTH activity. Conclusions This evaluation of PGIP over-expressing plants performed under pathogen-free conditions to exclude the classical PGIP-ePG inhibition interaction indicates additional roles for PGIPs beyond the inhibition of ePGs. PMID:22078230
Controlling false-negative errors in microarray differential expression analysis: a PRIM approach.
Cole, Steve W; Galic, Zoran; Zack, Jerome A
2003-09-22
Theoretical considerations suggest that current microarray screening algorithms may fail to detect many true differences in gene expression (Type II analytic errors). We assessed 'false negative' error rates in differential expression analyses by conventional linear statistical models (e.g. t-test), microarray-adapted variants (e.g. SAM, Cyber-T), and a novel strategy based on hold-out cross-validation. The latter approach employs the machine-learning algorithm Patient Rule Induction Method (PRIM) to infer minimum thresholds for reliable change in gene expression from Boolean conjunctions of fold-induction and raw fluorescence measurements. Monte Carlo analyses based on four empirical data sets show that conventional statistical models and their microarray-adapted variants overlook more than 50% of genes showing significant up-regulation. Conjoint PRIM prediction rules recover approximately twice as many differentially expressed transcripts while maintaining strong control over false-positive (Type I) errors. As a result, experimental replication rates increase and total analytic error rates decline. RT-PCR studies confirm that gene inductions detected by PRIM but overlooked by other methods represent true changes in mRNA levels. PRIM-based conjoint inference rules thus represent an improved strategy for high-sensitivity screening of DNA microarrays. Freestanding JAVA application at http://microarray.crump.ucla.edu/focus
Severgnini, Marco; Bicciato, Silvio; Mangano, Eleonora; Scarlatti, Francesca; Mezzelani, Alessandra; Mattioli, Michela; Ghidoni, Riccardo; Peano, Clelia; Bonnal, Raoul; Viti, Federica; Milanesi, Luciano; De Bellis, Gianluca; Battaglia, Cristina
2006-06-01
Meta-analysis of microarray data is increasingly important, considering both the availability of multiple platforms using disparate technologies and the accumulation in public repositories of data sets from different laboratories. We addressed the issue of comparing gene expression profiles from two microarray platforms by devising a standardized investigative strategy. We tested this procedure by studying MDA-MB-231 cells, which undergo apoptosis on treatment with resveratrol. Gene expression profiles were obtained using high-density, short-oligonucleotide, single-color microarray platforms: GeneChip (Affymetrix) and CodeLink (Amersham). Interplatform analyses were carried out on 8414 common transcripts represented on both platforms, as identified by LocusLink ID, representing 70.8% and 88.6% of annotated GeneChip and CodeLink features, respectively. We identified 105 differentially expressed genes (DEGs) on CodeLink and 42 DEGs on GeneChip. Among them, only 9 DEGs were commonly identified by both platforms. Multiple analyses (BLAST alignment of probes with target sequences, gene ontology, literature mining, and quantitative real-time PCR) permitted us to investigate the factors contributing to the generation of platform-dependent results in single-color microarray experiments. An effective approach to cross-platform comparison involves microarrays of similar technologies, samples prepared by identical methods, and a standardized battery of bioinformatic and statistical analyses.
Janse, Ingmar; Bok, Jasper M.; Hamidjaja, Raditijo A.; Hodemaekers, Hennie M.; van Rotterdam, Bart J.
2012-01-01
Microarrays provide a powerful analytical tool for the simultaneous detection of multiple pathogens. We developed diagnostic suspension microarrays for sensitive and specific detection of the biothreat pathogens Bacillus anthracis, Yersinia pestis, Francisella tularensis and Coxiella burnetii. Two assay chemistries for amplification and labeling were developed, one method using direct hybridization and the other using target-specific primer extension, combined with hybridization to universal arrays. Asymmetric PCR products for both assay chemistries were produced by using a multiplex asymmetric PCR amplifying 16 DNA signatures (16-plex). The performances of both assay chemistries were compared and their advantages and disadvantages are discussed. The developed microarrays detected multiple signature sequences and an internal control which made it possible to confidently identify the targeted pathogens and assess their virulence potential. The microarrays were highly specific and detected various strains of the targeted pathogens. Detection limits for the different pathogen signatures were similar or slightly higher compared to real-time PCR. Probit analysis showed that even a few genomic copies could be detected with 95% confidence. The microarrays detected DNA from different pathogens mixed in different ratios and from spiked or naturally contaminated samples. The assays that were developed have a potential for application in surveillance and diagnostics. PMID:22355407
Microarray platform affords improved product analysis in mammalian cell growth studies
Li, Lingyun; Migliore, Nicole; Schaefer, Eugene; Sharfstein, Susan T.; Dordick, Jonathan S.; Linhardt, Robert J.
2014-01-01
High throughput (HT) platforms serve as cost-efficient and rapid screening method for evaluating the effect of cell culture conditions and screening of chemicals. The aim of the current study was to develop a high-throughput cell-based microarray platform to assess the effect of culture conditions on Chinese hamster ovary (CHO) cells. Specifically, growth, transgene expression and metabolism of a GS/MSX CHO cell line, which produces a therapeutic monoclonal antibody, was examined using microarray system in conjunction with conventional shake flask platform in a non-proprietary medium. The microarray system consists of 60 nl spots of cells encapsulated in alginate and separated in groups via an 8-well chamber system attached to the chip. Results show the non-proprietary medium developed allows cell growth, production and normal glycosylation of recombinant antibody and metabolism of the recombinant CHO cells in both the microarray and shake flask platforms. In addition, 10.3 mM glutamate addition to the defined base media results in lactate metabolism shift in the recombinant GS/MSX CHO cells in the shake flask platform. Ultimately, the results demonstrate that the high-throughput microarray platform has the potential to be utilized for evaluating the impact of media additives on cellular processes, such as, cell growth, metabolism and productivity. PMID:24227746
Janse, Ingmar; Bok, Jasper M; Hamidjaja, Raditijo A; Hodemaekers, Hennie M; van Rotterdam, Bart J
2012-01-01
Microarrays provide a powerful analytical tool for the simultaneous detection of multiple pathogens. We developed diagnostic suspension microarrays for sensitive and specific detection of the biothreat pathogens Bacillus anthracis, Yersinia pestis, Francisella tularensis and Coxiella burnetii. Two assay chemistries for amplification and labeling were developed, one method using direct hybridization and the other using target-specific primer extension, combined with hybridization to universal arrays. Asymmetric PCR products for both assay chemistries were produced by using a multiplex asymmetric PCR amplifying 16 DNA signatures (16-plex). The performances of both assay chemistries were compared and their advantages and disadvantages are discussed. The developed microarrays detected multiple signature sequences and an internal control which made it possible to confidently identify the targeted pathogens and assess their virulence potential. The microarrays were highly specific and detected various strains of the targeted pathogens. Detection limits for the different pathogen signatures were similar or slightly higher compared to real-time PCR. Probit analysis showed that even a few genomic copies could be detected with 95% confidence. The microarrays detected DNA from different pathogens mixed in different ratios and from spiked or naturally contaminated samples. The assays that were developed have a potential for application in surveillance and diagnostics.
Yang, Yunfeng; Zhu, Mengxia; Wu, Liyou; Zhou, Jizhong
2008-09-16
Using genomic DNA as common reference in microarray experiments has recently been tested by different laboratories. Conflicting results have been reported with regard to the reliability of microarray results using this method. To explain it, we hypothesize that data processing is a critical element that impacts the data quality. Microarray experiments were performed in a gamma-proteobacterium Shewanella oneidensis. Pair-wise comparison of three experimental conditions was obtained either with two labeled cDNA samples co-hybridized to the same array, or by employing Shewanella genomic DNA as a standard reference. Various data processing techniques were exploited to reduce the amount of inconsistency between both methods and the results were assessed. We discovered that data quality was significantly improved by imposing the constraint of minimal number of replicates, logarithmic transformation and random error analyses. These findings demonstrate that data processing significantly influences data quality, which provides an explanation for the conflicting evaluation in the literature. This work could serve as a guideline for microarray data analysis using genomic DNA as a standard reference.
Employing image processing techniques for cancer detection using microarray images.
Dehghan Khalilabad, Nastaran; Hassanpour, Hamid
2017-02-01
Microarray technology is a powerful genomic tool for simultaneously studying and analyzing the behavior of thousands of genes. The analysis of images obtained from this technology plays a critical role in the detection and treatment of diseases. The aim of the current study is to develop an automated system for analyzing data from microarray images in order to detect cancerous cases. The proposed system consists of three main phases, namely image processing, data mining, and the detection of the disease. The image processing phase performs operations such as refining image rotation, gridding (locating genes) and extracting raw data from images the data mining includes normalizing the extracted data and selecting the more effective genes. Finally, via the extracted data, cancerous cell is recognized. To evaluate the performance of the proposed system, microarray database is employed which includes Breast cancer, Myeloid Leukemia and Lymphomas from the Stanford Microarray Database. The results indicate that the proposed system is able to identify the type of cancer from the data set with an accuracy of 95.45%, 94.11%, and 100%, respectively. Copyright © 2017 Elsevier Ltd. All rights reserved.
Improved microarray methods for profiling the yeast knockout strain collection
Yuan, Daniel S.; Pan, Xuewen; Ooi, Siew Loon; Peyser, Brian D.; Spencer, Forrest A.; Irizarry, Rafael A.; Boeke, Jef D.
2005-01-01
A remarkable feature of the Yeast Knockout strain collection is the presence of two unique 20mer TAG sequences in almost every strain. In principle, the relative abundances of strains in a complex mixture can be profiled swiftly and quantitatively by amplifying these sequences and hybridizing them to microarrays, but TAG microarrays have not been widely used. Here, we introduce a TAG microarray design with sophisticated controls and describe a robust method for hybridizing high concentrations of dye-labeled TAGs in single-stranded form. We also highlight the importance of avoiding PCR contamination and provide procedures for detection and eradication. Validation experiments using these methods yielded false positive (FP) and false negative (FN) rates for individual TAG detection of 3–6% and 15–18%, respectively. Analysis demonstrated that cross-hybridization was the chief source of FPs, while TAG amplification defects were the main cause of FNs. The materials, protocols, data and associated software described here comprise a suite of experimental resources that should facilitate the use of TAG microarrays for a wide variety of genetic screens. PMID:15994458
An efficient method to identify differentially expressed genes in microarray experiments
Qin, Huaizhen; Feng, Tao; Harding, Scott A.; Tsai, Chung-Jui; Zhang, Shuanglin
2013-01-01
Motivation Microarray experiments typically analyze thousands to tens of thousands of genes from small numbers of biological replicates. The fact that genes are normally expressed in functionally relevant patterns suggests that gene-expression data can be stratified and clustered into relatively homogenous groups. Cluster-wise dimensionality reduction should make it feasible to improve screening power while minimizing information loss. Results We propose a powerful and computationally simple method for finding differentially expressed genes in small microarray experiments. The method incorporates a novel stratification-based tight clustering algorithm, principal component analysis and information pooling. Comprehensive simulations show that our method is substantially more powerful than the popular SAM and eBayes approaches. We applied the method to three real microarray datasets: one from a Populus nitrogen stress experiment with 3 biological replicates; and two from public microarray datasets of human cancers with 10 to 40 biological replicates. In all three analyses, our method proved more robust than the popular alternatives for identification of differentially expressed genes. Availability The C++ code to implement the proposed method is available upon request for academic use. PMID:18453554
Clustering approaches to identifying gene expression patterns from DNA microarray data.
Do, Jin Hwan; Choi, Dong-Kug
2008-04-30
The analysis of microarray data is essential for large amounts of gene expression data. In this review we focus on clustering techniques. The biological rationale for this approach is the fact that many co-expressed genes are co-regulated, and identifying co-expressed genes could aid in functional annotation of novel genes, de novo identification of transcription factor binding sites and elucidation of complex biological pathways. Co-expressed genes are usually identified in microarray experiments by clustering techniques. There are many such methods, and the results obtained even for the same datasets may vary considerably depending on the algorithms and metrics for dissimilarity measures used, as well as on user-selectable parameters such as desired number of clusters and initial values. Therefore, biologists who want to interpret microarray data should be aware of the weakness and strengths of the clustering methods used. In this review, we survey the basic principles of clustering of DNA microarray data from crisp clustering algorithms such as hierarchical clustering, K-means and self-organizing maps, to complex clustering algorithms like fuzzy clustering.
Shin, Hwa Hui; Hwang, Byeong Hee; Seo, Jeong Hyun
2014-01-01
It is important to rapidly and selectively detect and analyze pathogenic Salmonella enterica subsp. enterica in contaminated food to reduce the morbidity and mortality of Salmonella infection and to guarantee food safety. In the present work, we developed an oligonucleotide microarray containing duplicate specific capture probes based on the carB gene, which encodes the carbamoyl phosphate synthetase large subunit, as a competent biomarker evaluated by genetic analysis to selectively and efficiently detect and discriminate three S. enterica subsp. enterica serotypes: Choleraesuis, Enteritidis, and Typhimurium. Using the developed microarray system, three serotype targets were successfully analyzed in a range as low as 1.6 to 3.1 nM and were specifically discriminated from each other without nonspecific signals. In addition, the constructed microarray did not have cross-reactivity with other common pathogenic bacteria and even enabled the clear discrimination of the target Salmonella serotype from a bacterial mixture. Therefore, these results demonstrated that our novel carB-based oligonucleotide microarray can be used as an effective and specific detection system for S. enterica subsp. enterica serotypes. PMID:24185846
Shin, Hwa Hui; Hwang, Byeong Hee; Seo, Jeong Hyun; Cha, Hyung Joon
2014-01-01
It is important to rapidly and selectively detect and analyze pathogenic Salmonella enterica subsp. enterica in contaminated food to reduce the morbidity and mortality of Salmonella infection and to guarantee food safety. In the present work, we developed an oligonucleotide microarray containing duplicate specific capture probes based on the carB gene, which encodes the carbamoyl phosphate synthetase large subunit, as a competent biomarker evaluated by genetic analysis to selectively and efficiently detect and discriminate three S. enterica subsp. enterica serotypes: Choleraesuis, Enteritidis, and Typhimurium. Using the developed microarray system, three serotype targets were successfully analyzed in a range as low as 1.6 to 3.1 nM and were specifically discriminated from each other without nonspecific signals. In addition, the constructed microarray did not have cross-reactivity with other common pathogenic bacteria and even enabled the clear discrimination of the target Salmonella serotype from a bacterial mixture. Therefore, these results demonstrated that our novel carB-based oligonucleotide microarray can be used as an effective and specific detection system for S. enterica subsp. enterica serotypes.
USDA-ARS?s Scientific Manuscript database
The long-term goal of our study is to understand the genetic and epigenetic mechanisms of breast cancer metastasis in human and to discover new possible genetic markers for use in clinical practice. We have used microarray technology (Human OneArray microarray, phylanxbiotech.com) to compare gene ex...
USDA-ARS?s Scientific Manuscript database
The objectives of this study were (1) to evaluate differential gene expression levels for resistance to A. flavus kernel infection in susceptible (Va35) and resistant (Mp313E) maize lines using Oligonucleotide and cDNA microarray analysis, (2) to evaluate differences in A. flavus accumulation betwee...
Microarrays have had a significant impact on many areas of biology. However, there are still many fertile research areas that would benefit from microarray analysis but are limited by the amount of biological material that can be obtained (e.g. samples obtained by small biopsy, f...
Customizing microarrays for neuroscience drug discovery.
Girgenti, Matthew J; Newton, Samuel S
2007-08-01
Microarray-based gene profiling has become the centerpiece of gene expression studies in the biological sciences. The ability to now interrogate the entire genome using a single chip demonstrates the progress in technology and instrumentation that has been made over the last two decades. Although this unbiased approach provides researchers with an immense quantity of data, obtaining meaningful insight is not possible without intensive data analysis and processing. Custom developed arrays have emerged as a viable and attractive alternative that can take advantage of this robust technology and tailor it to suit the needs and requirements of individual investigations. The ability to simplify data analysis, reduce noise and carefully optimize experimental conditions makes it a suitable tool that can be effectively utilized in neuroscience drug discovery efforts. Furthermore, incorporating recent advancements in fine focusing gene profiling to include specific cellular phenotypes can help resolve the complex cellular heterogeneity of the brain. This review surveys the use of microarray technology in neuroscience paying special attention to customized arrays and their potential in drug discovery. Novel applications of microarrays and ancillary techniques, such as laser microdissection, FAC sorting and RNA amplification, have also been discussed. The notion that a hypothesis-driven approach can be integrated into drug development programs is highlighted.
Peschl, Patrick; Ramberger, Melanie; Höftberger, Romana; Jöhrer, Karin; Baumann, Matthias; Rostásy, Kevin; Reindl, Markus
2017-01-01
Acute disseminated encephalomyelitis (ADEM) is a rare autoimmune-mediated demyelinating disease affecting mainly children and young adults. Differentiation to multiple sclerosis is not always possible, due to overlapping clinical symptoms and recurrent and multiphasic forms. Until now, immunoglobulins reactive to myelin oligodendrocyte glycoprotein (MOG antibodies) have been found in a subset of patients with ADEM. However, there are still patients lacking autoantibodies, necessitating the identification of new autoantibodies as biomarkers in those patients. Therefore, we aimed to identify novel autoantibody targets in ADEM patients. Sixteen ADEM patients (11 seronegative, 5 seropositive for MOG antibodies) were analysed for potential new biomarkers, using a protein microarray and immunohistochemistry on rat brain tissue to identify antibodies against intracellular and surface neuronal and glial antigens. Nine candidate antigens were identified in the protein microarray analysis in at least two patients per group. Immunohistochemistry on rat brain tissue did not reveal new target antigens. Although no new autoantibody targets could be found here, future studies should aim to identify new biomarkers for therapeutic and prognostic purposes. The microarray analysis and immunohistochemistry methods used here have several limitations, which should be considered in future searches for biomarkers. PMID:28327523
Microarray gene expression profiling analysis combined with bioinformatics in multiple sclerosis.
Liu, Mingyuan; Hou, Xiaojun; Zhang, Ping; Hao, Yong; Yang, Yiting; Wu, Xiongfeng; Zhu, Desheng; Guan, Yangtai
2013-05-01
Multiple sclerosis (MS) is the most prevalent demyelinating disease and the principal cause of neurological disability in young adults. Recent microarray gene expression profiling studies have identified several genetic variants contributing to the complex pathogenesis of MS, however, expressional and functional studies are still required to further understand its molecular mechanism. The present study aimed to analyze the molecular mechanism of MS using microarray analysis combined with bioinformatics techniques. We downloaded the gene expression profile of MS from Gene Expression Omnibus (GEO) and analysed the microarray data using the differentially coexpressed genes (DCGs) and links package in R and Database for Annotation, Visualization and Integrated Discovery. The regulatory impact factor (RIF) algorithm was used to measure the impact factor of transcription factor. A total of 1,297 DCGs between MS patients and healthy controls were identified. Functional annotation indicated that these DCGs were associated with immune and neurological functions. Furthermore, the RIF result suggested that IKZF1, BACH1, CEBPB, EGR1, FOS may play central regulatory roles in controlling gene expression in the pathogenesis of MS. Our findings confirm the presence of multiple molecular alterations in MS and indicate the possibility for identifying prognostic factors associated with MS pathogenesis.
Mining microarray data at NCBI's Gene Expression Omnibus (GEO)*.
Barrett, Tanya; Edgar, Ron
2006-01-01
The Gene Expression Omnibus (GEO) at the National Center for Biotechnology Information (NCBI) has emerged as the leading fully public repository for gene expression data. This chapter describes how to use Web-based interfaces, applications, and graphics to effectively explore, visualize, and interpret the hundreds of microarray studies and millions of gene expression patterns stored in GEO. Data can be examined from both experiment-centric and gene-centric perspectives using user-friendly tools that do not require specialized expertise in microarray analysis or time-consuming download of massive data sets. The GEO database is publicly accessible through the World Wide Web at http://www.ncbi.nlm.nih.gov/geo.
Payne, Adrienne C.; Clarkson, Graham J.J.; Rothwell, Steve; Taylor, Gail
2015-01-01
Watercress (Nasturtium officinale R. Br.) is a nutrient intense, leafy crop that is consumed raw or in soups across the globe, but for which, currently no genomic resources or breeding programme exists. Promising morphological, biochemical and functional genomic variation was identified for the first time in a newly established watercress germplasm collection, consisting of 48 watercress accessions sourced from contrasting global locations. Stem length, stem diameter and anti-oxidant (AO) potential varied across the accessions. This variation was used to identify three extreme contrasting accessions for further analysis. Variation in global gene expression was investigated using an Affymetrix Arabidopsis ATH1 microarray gene chip, using the commercial control (C), an accession selected for dwarf phenotype with a high AO potential (dwarfAO, called ‘Boldrewood’) and one with high AO potential alone. A set of transcripts significantly differentially expressed between these three accessions, were identified, including transcripts involved in the regulation of growth and development and those involved in secondary metabolism. In particular, when differential gene expression was compared between C and dwarfAO, the dwarfAO was characterised by increased expression of genes encoding glucosinolates, which are known precursors of phenethyl isothiocyanate, linked to the anti-carcinogenic effects well-documented in watercress. This study provides the first analysis of natural variation across the watercress genome and has identified important underpinning information for future breeding for enhanced anti-carcinogenic properties and morphology traits in this nutrient-intense crop. PMID:26504575
Gene Expression Profiling of Gastric Cancer
Marimuthu, Arivusudar; Jacob, Harrys K.C.; Jakharia, Aniruddha; Subbannayya, Yashwanth; Keerthikumar, Shivakumar; Kashyap, Manoj Kumar; Goel, Renu; Balakrishnan, Lavanya; Dwivedi, Sutopa; Pathare, Swapnali; Dikshit, Jyoti Bajpai; Maharudraiah, Jagadeesha; Singh, Sujay; Sameer Kumar, Ghantasala S; Vijayakumar, M.; Veerendra Kumar, Kariyanakatte Veeraiah; Premalatha, Chennagiri Shrinivasamurthy; Tata, Pramila; Hariharan, Ramesh; Roa, Juan Carlos; Prasad, T.S.K; Chaerkady, Raghothama; Kumar, Rekha Vijay; Pandey, Akhilesh
2015-01-01
Gastric cancer is the second leading cause of cancer death worldwide, both in men and women. A genomewide gene expression analysis was carried out to identify differentially expressed genes in gastric adenocarcinoma tissues as compared to adjacent normal tissues. We used Agilent’s whole human genome oligonucleotide microarray platform representing ~41,000 genes to carry out gene expression analysis. Two-color microarray analysis was employed to directly compare the expression of genes between tumor and normal tissues. Through this approach, we identified several previously known candidate genes along with a number of novel candidate genes in gastric cancer. Testican-1 (SPOCK1) was one of the novel molecules that was 10-fold upregulated in tumors. Using tissue microarrays, we validated the expression of testican-1 by immunohistochemical staining. It was overexpressed in 56% (160/282) of the cases tested. Pathway analysis led to the identification of several networks in which SPOCK1 was among the topmost networks of interacting genes. By gene enrichment analysis, we identified several genes involved in cell adhesion and cell proliferation to be significantly upregulated while those corresponding to metabolic pathways were significantly downregulated. The differentially expressed genes identified in this study are candidate biomarkers for gastric adenoacarcinoma. PMID:27030788
The Ser/Thr Protein Kinase Protein-Protein Interaction Map of M. tuberculosis.
Wu, Fan-Lin; Liu, Yin; Jiang, He-Wei; Luan, Yi-Zhao; Zhang, Hai-Nan; He, Xiang; Xu, Zhao-Wei; Hou, Jing-Li; Ji, Li-Yun; Xie, Zhi; Czajkowsky, Daniel M; Yan, Wei; Deng, Jiao-Yu; Bi, Li-Jun; Zhang, Xian-En; Tao, Sheng-Ce
2017-08-01
Mycobacterium tuberculosis (Mtb) is the causative agent of tuberculosis, the leading cause of death among all infectious diseases. There are 11 eukaryotic-like serine/threonine protein kinases (STPKs) in Mtb, which are thought to play pivotal roles in cell growth, signal transduction and pathogenesis. However, their underlying mechanisms of action remain largely uncharacterized. In this study, using a Mtb proteome microarray, we have globally identified the binding proteins in Mtb for all of the STPKs, and constructed the first STPK protein interaction (KPI) map that includes 492 binding proteins and 1,027 interactions. Bioinformatics analysis showed that the interacting proteins reflect diverse functions, including roles in two-component system, transcription, protein degradation, and cell wall integrity. Functional investigations confirmed that PknG regulates cell wall integrity through key components of peptidoglycan (PG) biosynthesis, e.g. MurC. The global STPK-KPIs network constructed here is expected to serve as a rich resource for understanding the key signaling pathways in Mtb, thus facilitating drug development and effective control of Mtb. © 2017 by The American Society for Biochemistry and Molecular Biology, Inc.
Dolled-Filhart, Marisa P; Gustavson, Mark D
2012-11-01
Translational oncology has been improved by using tissue microarrays (TMAs), which facilitate biomarker analysis of large cohorts on a single slide. This has allowed for rapid analysis and validation of potential biomarkers for prognostic and predictive value, as well as for evaluation of biomarker prevalence. Coupled with quantitative analysis of immunohistochemical (IHC) staining, objective and standardized biomarker data from tumor samples can further advance companion diagnostic approaches for the identification of drug-responsive or resistant patient subpopulations. This review covers the advantages, disadvantages and applications of TMAs for biomarker research. Research literature and reviews of TMAs and quantitative image analysis methodology have been surveyed for this review (with an AQUA® analysis focus). Applications such as multi-marker diagnostic development and pathway-based biomarker subpopulation analyses are described. Tissue microarrays are a useful tool for biomarker analyses including prevalence surveys, disease progression assessment and addressing potential prognostic or predictive value. By combining quantitative image analysis with TMAs, analyses will be more objective and reproducible, allowing for more robust IHC-based diagnostic test development. Quantitative multi-biomarker IHC diagnostic tests that can predict drug response will allow for greater success of clinical trials for targeted therapies and provide more personalized clinical decision making.
Chawade, Aakash; Lindlöf, Angelica; Olsson, Björn; Olsson, Olof
2013-01-01
Low temperature is a key factor that limits growth and productivity of many important agronomical crops worldwide. Rice (Oryza sativa L.) is negatively affected already at temperatures below +10°C and is therefore denoted as chilling sensitive. However, chilling tolerant rice cultivars exist and can be commercially cultivated at altitudes up to 3,050 meters with temperatures reaching as low as +4°C. In this work, the global transcriptional response to cold stress (+4°C) was studied in the Nepalese highland variety Jumli Marshi (spp. japonica) and 4,636 genes were identified as significantly differentially expressed within 24 hours of cold stress. Comparison with previously published microarray data from one chilling tolerant and two sensitive rice cultivars identified 182 genes differentially expressed (DE) upon cold stress in all four rice cultivars and 511 genes DE only in the chilling tolerant rice. Promoter analysis of the 182 genes suggests a complex cross-talk between ABRE and CBF regulons. Promoter analysis of the 511 genes identified over-represented ABRE motifs but not DRE motifs, suggesting a role for ABA signaling in cold tolerance. Moreover, 2,101 genes were DE in Jumli Marshi alone. By chromosomal localization analysis, 473 of these cold responsive genes were located within 13 different QTLs previously identified as cold associated. PMID:24349120
Expression profiling and pathway analysis of Krüppel-like factor 4 in mouse embryonic fibroblasts
Hagos, Engda G; Ghaleb, Amr M; Kumar, Amrita; Neish, Andrew S; Yang, Vincent W
2011-01-01
Background: Krüppel-like factor 4 (KLF4) is a zinc-finger transcription factor with diverse regulatory functions in proliferation, differentiation, and development. KLF4 also plays a role in inflammation, tumorigenesis, and reprogramming of somatic cells to induced pluripotent stem (iPS) cells. To gain insight into the mechanisms by which KLF4 regulates these processes, we conducted DNA microarray analyses to identify differentially expressed genes in mouse embryonic fibroblasts (MEFs) wild type and null for Klf4. Methods: Expression profiles of fibroblasts isolated from mouse embryos wild type or null for the Klf4 alleles were examined by DNA microarrays. Differentially expressed genes were subjected to the Database for Annotation, Visualization and Integrated Discovery (DAVID). The microarray data were also interrogated with the Ingenuity Pathway Analysis (IPA) and Gene Set Enrichment Analysis (GSEA) for pathway identification. Results obtained from the microarray analysis were confirmed by Western blotting for select genes with biological relevance to determine the correlation between mRNA and protein levels. Results: One hundred and sixty three up-regulated and 88 down-regulated genes were identified that demonstrated a fold-change of at least 1.5 and a P-value < 0.05 in Klf4-null MEFs compared to wild type MEFs. Many of the up-regulated genes in Klf4-null MEFs encode proto-oncogenes, growth factors, extracellular matrix, and cell cycle activators. In contrast, genes encoding tumor suppressors and those involved in JAK-STAT signaling pathways are down-regulated in Klf4-null MEFs. IPA and GSEA also identified various pathways that are regulated by KLF4. Lastly, Western blotting of select target genes confirmed the changes revealed by microarray data. Conclusions: These data are not only consistent with previous functional studies of KLF4's role in tumor suppression and somatic cell reprogramming, but also revealed novel target genes that mediate KLF4's functions. PMID:21892412
An efficient pseudomedian filter for tiling microrrays.
Royce, Thomas E; Carriero, Nicholas J; Gerstein, Mark B
2007-06-07
Tiling microarrays are becoming an essential technology in the functional genomics toolbox. They have been applied to the tasks of novel transcript identification, elucidation of transcription factor binding sites, detection of methylated DNA and several other applications in several model organisms. These experiments are being conducted at increasingly finer resolutions as the microarray technology enjoys increasingly greater feature densities. The increased densities naturally lead to increased data analysis requirements. Specifically, the most widely employed algorithm for tiling array analysis involves smoothing observed signals by computing pseudomedians within sliding windows, a O(n2logn) calculation in each window. This poor time complexity is an issue for tiling array analysis and could prove to be a real bottleneck as tiling microarray experiments become grander in scope and finer in resolution. We therefore implemented Monahan's HLQEST algorithm that reduces the runtime complexity for computing the pseudomedian of n numbers to O(nlogn) from O(n2logn). For a representative tiling microarray dataset, this modification reduced the smoothing procedure's runtime by nearly 90%. We then leveraged the fact that elements within sliding windows remain largely unchanged in overlapping windows (as one slides across genomic space) to further reduce computation by an additional 43%. This was achieved by the application of skip lists to maintaining a sorted list of values from window to window. This sorted list could be maintained with simple O(log n) inserts and deletes. We illustrate the favorable scaling properties of our algorithms with both time complexity analysis and benchmarking on synthetic datasets. Tiling microarray analyses that rely upon a sliding window pseudomedian calculation can require many hours of computation. We have eased this requirement significantly by implementing efficient algorithms that scale well with genomic feature density. This result not only speeds the current standard analyses, but also makes possible ones where many iterations of the filter may be required, such as might be required in a bootstrap or parameter estimation setting. Source code and executables are available at http://tiling.gersteinlab.org/pseudomedian/.
An efficient pseudomedian filter for tiling microrrays
Royce, Thomas E; Carriero, Nicholas J; Gerstein, Mark B
2007-01-01
Background Tiling microarrays are becoming an essential technology in the functional genomics toolbox. They have been applied to the tasks of novel transcript identification, elucidation of transcription factor binding sites, detection of methylated DNA and several other applications in several model organisms. These experiments are being conducted at increasingly finer resolutions as the microarray technology enjoys increasingly greater feature densities. The increased densities naturally lead to increased data analysis requirements. Specifically, the most widely employed algorithm for tiling array analysis involves smoothing observed signals by computing pseudomedians within sliding windows, a O(n2logn) calculation in each window. This poor time complexity is an issue for tiling array analysis and could prove to be a real bottleneck as tiling microarray experiments become grander in scope and finer in resolution. Results We therefore implemented Monahan's HLQEST algorithm that reduces the runtime complexity for computing the pseudomedian of n numbers to O(nlogn) from O(n2logn). For a representative tiling microarray dataset, this modification reduced the smoothing procedure's runtime by nearly 90%. We then leveraged the fact that elements within sliding windows remain largely unchanged in overlapping windows (as one slides across genomic space) to further reduce computation by an additional 43%. This was achieved by the application of skip lists to maintaining a sorted list of values from window to window. This sorted list could be maintained with simple O(log n) inserts and deletes. We illustrate the favorable scaling properties of our algorithms with both time complexity analysis and benchmarking on synthetic datasets. Conclusion Tiling microarray analyses that rely upon a sliding window pseudomedian calculation can require many hours of computation. We have eased this requirement significantly by implementing efficient algorithms that scale well with genomic feature density. This result not only speeds the current standard analyses, but also makes possible ones where many iterations of the filter may be required, such as might be required in a bootstrap or parameter estimation setting. Source code and executables are available at . PMID:17555595
Jani, Saurin D; Argraves, Gary L; Barth, Jeremy L; Argraves, W Scott
2010-04-01
An important objective of DNA microarray-based gene expression experimentation is determining inter-relationships that exist between differentially expressed genes and biological processes, molecular functions, cellular components, signaling pathways, physiologic processes and diseases. Here we describe GeneMesh, a web-based program that facilitates analysis of DNA microarray gene expression data. GeneMesh relates genes in a query set to categories available in the Medical Subject Headings (MeSH) hierarchical index. The interface enables hypothesis driven relational analysis to a specific MeSH subcategory (e.g., Cardiovascular System, Genetic Processes, Immune System Diseases etc.) or unbiased relational analysis to broader MeSH categories (e.g., Anatomy, Biological Sciences, Disease etc.). Genes found associated with a given MeSH category are dynamically linked to facilitate tabular and graphical depiction of Entrez Gene information, Gene Ontology information, KEGG metabolic pathway diagrams and intermolecular interaction information. Expression intensity values of groups of genes that cluster in relation to a given MeSH category, gene ontology or pathway can be displayed as heat maps of Z score-normalized values. GeneMesh operates on gene expression data derived from a number of commercial microarray platforms including Affymetrix, Agilent and Illumina. GeneMesh is a versatile web-based tool for testing and developing new hypotheses through relating genes in a query set (e.g., differentially expressed genes from a DNA microarray experiment) to descriptors making up the hierarchical structure of the National Library of Medicine controlled vocabulary thesaurus, MeSH. The system further enhances the discovery process by providing links between sets of genes associated with a given MeSH category to a rich set of html linked tabular and graphic information including Entrez Gene summaries, gene ontologies, intermolecular interactions, overlays of genes onto KEGG pathway diagrams and heatmaps of expression intensity values. GeneMesh is freely available online at http://proteogenomics.musc.edu/genemesh/.
Micro-Analyzer: automatic preprocessing of Affymetrix microarray data.
Guzzi, Pietro Hiram; Cannataro, Mario
2013-08-01
A current trend in genomics is the investigation of the cell mechanism using different technologies, in order to explain the relationship among genes, molecular processes and diseases. For instance, the combined use of gene-expression arrays and genomic arrays has been demonstrated as an effective instrument in clinical practice. Consequently, in a single experiment different kind of microarrays may be used, resulting in the production of different types of binary data (images and textual raw data). The analysis of microarray data requires an initial preprocessing phase, that makes raw data suitable for use on existing analysis platforms, such as the TIGR M4 (TM4) Suite. An additional challenge to be faced by emerging data analysis platforms is the ability to treat in a combined way those different microarray formats coupled with clinical data. In fact, resulting integrated data may include both numerical and symbolic data (e.g. gene expression and SNPs regarding molecular data), as well as temporal data (e.g. the response to a drug, time to progression and survival rate), regarding clinical data. Raw data preprocessing is a crucial step in analysis but is often performed in a manual and error prone way using different software tools. Thus novel, platform independent, and possibly open source tools enabling the semi-automatic preprocessing and annotation of different microarray data are needed. The paper presents Micro-Analyzer (Microarray Analyzer), a cross-platform tool for the automatic normalization, summarization and annotation of Affymetrix gene expression and SNP binary data. It represents the evolution of the μ-CS tool, extending the preprocessing to SNP arrays that were not allowed in μ-CS. The Micro-Analyzer is provided as a Java standalone tool and enables users to read, preprocess and analyse binary microarray data (gene expression and SNPs) by invoking TM4 platform. It avoids: (i) the manual invocation of external tools (e.g. the Affymetrix Power Tools), (ii) the manual loading of preprocessing libraries, and (iii) the management of intermediate files, such as results and metadata. Micro-Analyzer users can directly manage Affymetrix binary data without worrying about locating and invoking the proper preprocessing tools and chip-specific libraries. Moreover, users of the Micro-Analyzer tool can load the preprocessed data directly into the well-known TM4 platform, extending in such a way also the TM4 capabilities. Consequently, Micro Analyzer offers the following advantages: (i) it reduces possible errors in the preprocessing and further analysis phases, e.g. due to the incorrect choice of parameters or due to the use of old libraries, (ii) it enables the combined and centralized pre-processing of different arrays, (iii) it may enhance the quality of further analysis by storing the workflow, i.e. information about the preprocessing steps, and (iv) finally Micro-Analzyer is freely available as a standalone application at the project web site http://sourceforge.net/projects/microanalyzer/. Copyright © 2013 Elsevier Ireland Ltd. All rights reserved.
A Versatile Microarray Platform for Capturing Rare Cells
NASA Astrophysics Data System (ADS)
Brinkmann, Falko; Hirtz, Michael; Haller, Anna; Gorges, Tobias M.; Vellekoop, Michael J.; Riethdorf, Sabine; Müller, Volkmar; Pantel, Klaus; Fuchs, Harald
2015-10-01
Analyses of rare events occurring at extremely low frequencies in body fluids are still challenging. We established a versatile microarray-based platform able to capture single target cells from large background populations. As use case we chose the challenging application of detecting circulating tumor cells (CTCs) - about one cell in a billion normal blood cells. After incubation with an antibody cocktail, targeted cells are extracted on a microarray in a microfluidic chip. The accessibility of our platform allows for subsequent recovery of targets for further analysis. The microarray facilitates exclusion of false positive capture events by co-localization allowing for detection without fluorescent labelling. Analyzing blood samples from cancer patients with our platform reached and partly outreached gold standard performance, demonstrating feasibility for clinical application. Clinical researchers free choice of antibody cocktail without need for altered chip manufacturing or incubation protocol, allows virtual arbitrary targeting of capture species and therefore wide spread applications in biomedical sciences.
He, Xianmin; Wei, Qing; Sun, Meiqian; Fu, Xuping; Fan, Sichang; Li, Yao
2006-05-01
Biological techniques such as Array-Comparative genomic hybridization (CGH), fluorescent in situ hybridization (FISH) and affymetrix single nucleotide pleomorphism (SNP) array have been used to detect cytogenetic aberrations. However, on genomic scale, these techniques are labor intensive and time consuming. Comparative genomic microarray analysis (CGMA) has been used to identify cytogenetic changes in hepatocellular carcinoma (HCC) using gene expression microarray data. However, CGMA algorithm can not give precise localization of aberrations, fails to identify small cytogenetic changes, and exhibits false negatives and positives. Locally un-weighted smoothing cytogenetic aberrations prediction (LS-CAP) based on local smoothing and binomial distribution can be expected to address these problems. LS-CAP algorithm was built and used on HCC microarray profiles. Eighteen cytogenetic abnormalities were identified, among them 5 were reported previously, and 12 were proven by CGH studies. LS-CAP effectively reduced the false negatives and positives, and precisely located small fragments with cytogenetic aberrations.
2013-01-01
Background The synthesis of information across microarray studies has been performed by combining statistical results of individual studies (as in a mosaic), or by combining data from multiple studies into a large pool to be analyzed as a single data set (as in a melting pot of data). Specific issues relating to data heterogeneity across microarray studies, such as differences within and between labs or differences among experimental conditions, could lead to equivocal results in a melting pot approach. Results We applied statistical theory to determine the specific effect of different means and heteroskedasticity across 19 groups of microarray data on the sign and magnitude of gene-to-gene Pearson correlation coefficients obtained from the pool of 19 groups. We quantified the biases of the pooled coefficients and compared them to the biases of correlations estimated by an effect-size model. Mean differences across the 19 groups were the main factor determining the magnitude and sign of the pooled coefficients, which showed largest values of bias as they approached ±1. Only heteroskedasticity across the pool of 19 groups resulted in less efficient estimations of correlations than did a classical meta-analysis approach of combining correlation coefficients. These results were corroborated by simulation studies involving either mean differences or heteroskedasticity across a pool of N > 2 groups. Conclusions The combination of statistical results is best suited for synthesizing the correlation between expression profiles of a gene pair across several microarray studies. PMID:23822712
Microarray analysis of genes associated with cell surface NIS protein levels in breast cancer.
Beyer, Sasha J; Zhang, Xiaoli; Jimenez, Rafael E; Lee, Mei-Ling T; Richardson, Andrea L; Huang, Kun; Jhiang, Sissy M
2011-10-11
Na+/I- symporter (NIS)-mediated iodide uptake allows radioiodine therapy for thyroid cancer. NIS is also expressed in breast tumors, raising potential for radionuclide therapy of breast cancer. However, NIS expression in most breast cancers is low and may not be sufficient for radionuclide therapy. We aimed to identify biomarkers associated with NIS expression such that mechanisms underlying NIS modulation in human breast tumors may be elucidated. Published oligonucleotide microarray data within the National Center for Biotechnology Information Gene Expression Omnibus database were analyzed to identify gene expression tightly correlated with NIS mRNA level among human breast tumors. NIS immunostaining was performed in a tissue microarray composed of 28 human breast tumors which had corresponding oligonucleotide microarray data available for each tumor such that gene expression associated with cell surface NIS protein level could be identified. NIS mRNA levels do not vary among breast tumors or when compared to normal breast tissues when detected by Affymetrix oligonucleotide microarray platforms. Cell surface NIS protein levels are much more variable than their corresponding NIS mRNA levels. Despite a limited number of breast tumors examined, our analysis identified cysteinyl-tRNA synthetase as a biomarker that is highly associated with cell surface NIS protein levels in the ER-positive breast cancer subtype. Further investigation on genes associated with cell surface NIS protein levels within each breast cancer molecular subtype may lead to novel targets for selectively increasing NIS expression/function in a subset of breast cancers patients.
Zinke, Ingo; Schütz, Christina S.; Katzenberger, Jörg D.; Bauer, Matthias; Pankratz, Michael J.
2002-01-01
We have identified genes regulated by starvation and sugar signals in Drosophila larvae using whole-genome microarrays. Based on expression profiles in the two nutrient conditions, they were organized into different categories that reflect distinct physiological pathways mediating sugar and fat metabolism, and cell growth. In the category of genes regulated in sugar-fed, but not in starved, animals, there is an upregulation of genes encoding key enzymes of the fat biosynthesis pathway and a downregulation of genes encoding lipases. The highest and earliest activated gene upon sugar ingestion is sugarbabe, a zinc finger protein that is induced in the gut and the fat body. Identification of potential targets using microarrays suggests that sugarbabe functions to repress genes involved in dietary fat breakdown and absorption. The current analysis provides a basis for studying the genetic mechanisms underlying nutrient signalling. PMID:12426388
Finding Groups in Gene Expression Data
2005-01-01
The vast potential of the genomic insight offered by microarray technologies has led to their widespread use since they were introduced a decade ago. Application areas include gene function discovery, disease diagnosis, and inferring regulatory networks. Microarray experiments enable large-scale, high-throughput investigations of gene activity and have thus provided the data analyst with a distinctive, high-dimensional field of study. Many questions in this field relate to finding subgroups of data profiles which are very similar. A popular type of exploratory tool for finding subgroups is cluster analysis, and many different flavors of algorithms have been used and indeed tailored for microarray data. Cluster analysis, however, implies a partitioning of the entire data set, and this does not always match the objective. Sometimes pattern discovery or bump hunting tools are more appropriate. This paper reviews these various tools for finding interesting subgroups. PMID:16046827
Prostate Cancer Prevention Through Induction of Phase 2 Enzymes
2001-04-01
enzymes. During our Phase I Award, we identified sulforaphane as the most potent inducer of carcinogen defenses in the prostate cell. We have...characterized global effects of sulforaphane in prostate cancer cell lines using cDNA microarray technology that allows large-scale determination of changes...of sulforaphane ) and decreased risk of prostate cancer. These findings argue strongly for a preventive intervention trial involving supplementation
Caroline M. Press; Niklaus J. Grunwald
2008-01-01
The release of the draft genome sequence of P. ramorum strain Pr102, enabled the construction of an oligonucleotide microarray of the entire genome of Pr102. The array contains 344,680 features (oligos) that represent the transcriptome of Pr102. P. ramorum RNA was extracted from mycelium and sporangia and used to compare gene...
Alonso, Sergio; Suzuki, Koichi; Yamamoto, Fumiichiro; Perucho, Manuel
2018-01-01
Somatic, and in a minor scale also germ line, epigenetic aberrations are fundamental to carcinogenesis, cancer progression, and tumor phenotype. DNA methylation is the most extensively studied and arguably the best understood epigenetic mechanisms that become altered in cancer. Both somatic loss of methylation (hypomethylation) and gain of methylation (hypermethylation) are found in the genome of malignant cells. In general, the cancer cell epigenome is globally hypomethylated, while some regions-typically gene-associated CpG islands-become hypermethylated. Given the profound impact that DNA methylation exerts on the transcriptional profile and genomic stability of cancer cells, its characterization is essential to fully understand the complexity of cancer biology, improve tumor classification, and ultimately advance cancer patient management and treatment. A plethora of methods have been devised to analyze and quantify DNA methylation alterations. Several of the early-developed methods relied on the use of methylation-sensitive restriction enzymes, whose activity depends on the methylation status of their recognition sequences. Among these techniques, methylation-sensitive amplification length polymorphism (MS-AFLP) was developed in the early 2000s, and successfully adapted from its original gel electrophoresis fingerprinting format to a microarray format that notably increased its throughput and allowed the quantification of the methylation changes. This array-based platform interrogates over 9500 independent loci putatively amplified by the MS-AFLP technique, corresponding to the NotI sites mapped throughout the human genome.
The first Korean patient with Potocki-Shaffer syndrome: a rare cause of multiple exostoses.
Sohn, Young Bae; Yim, Shin-Young; Cho, Eun-Hae; Kim, Ok-Hwa
2015-02-01
Potocki-Shaffer syndrome (PSS, OMIM #601224) is a rare contiguous gene deletion syndrome caused by haploinsufficiency of genes located on the 11p11.2p12. Affected individuals have a number of characteristic features including multiple exostoses, biparietal foramina, abnormalities of genitourinary system, hypotonia, developmental delay, and intellectual disability. We report here on the first Korean case of an 8-yr-old boy with PSS diagnosed by high resolution microarray. Initial evaluation was done at age 6 months because of a history of developmental delay, hypotonia, and dysmorphic face. Coronal craniosynostosis and enlarged parietal foramina were found on skull radiographs. At age 6 yr, he had severe global developmental delay. Multiple exostoses of long bones were detected during a radiological check-up. Based on the clinical and radiological features, PSS was highly suspected. Subsequently, chromosomal microarray analysis identified an 8.6 Mb deletion at 11p11.2 [arr 11p12p11.2 (Chr11:39,204,770-47,791,278)×1]. The patient continued rehabilitation therapy for profound developmental delay. The progression of multiple exostosis has being monitored. This case confirms and extends data on the genetic basis of PSS. In clinical and radiologic aspect, a patient with multiple exostoses accompanying with syndromic features, including craniofacial abnormalities and mental retardation, the diagnosis of PSS should be considered.
Harripaul, R; Vasli, N; Mikhailov, A; Rafiq, M A; Mittal, K; Windpassinger, C; Sheikh, T I; Noor, A; Mahmood, H; Downey, S; Johnson, M; Vleuten, K; Bell, L; Ilyas, M; Khan, F S; Khan, V; Moradi, M; Ayaz, M; Naeem, F; Heidari, A; Ahmed, I; Ghadami, S; Agha, Z; Zeinali, S; Qamar, R; Mozhdehipanah, H; John, P; Mir, A; Ansar, M; French, L; Ayub, M; Vincent, J B
2018-04-01
Approximately 1% of the global population is affected by intellectual disability (ID), and the majority receive no molecular diagnosis. Previous studies have indicated high levels of genetic heterogeneity, with estimates of more than 2500 autosomal ID genes, the majority of which are autosomal recessive (AR). Here, we combined microarray genotyping, homozygosity-by-descent (HBD) mapping, copy number variation (CNV) analysis, and whole exome sequencing (WES) to identify disease genes/mutations in 192 multiplex Pakistani and Iranian consanguineous families with non-syndromic ID. We identified definite or candidate mutations (or CNVs) in 51% of families in 72 different genes, including 26 not previously reported for ARID. The new ARID genes include nine with loss-of-function mutations (ABI2, MAPK8, MPDZ, PIDD1, SLAIN1, TBC1D23, TRAPPC6B, UBA7 and USP44), and missense mutations include the first reports of variants in BDNF or TET1 associated with ID. The genes identified also showed overlap with de novo gene sets for other neuropsychiatric disorders. Transcriptional studies showed prominent expression in the prenatal brain. The high yield of AR mutations for ID indicated that this approach has excellent clinical potential and should inform clinical diagnostics, including clinical whole exome and genome sequencing, for populations in which consanguinity is common. As with other AR disorders, the relevance will also apply to outbred populations.
Gene expression profiling of dendritic cells by microarray.
Foti, Maria; Ricciardi-Castagnoli, Paola; Granucci, Francesca
2007-01-01
The immune system of vertebrate animals has evolved to respond to different types of perturbations (invading pathogens, stress signals), limiting self-tissue damage. The decision to activate an immune response is made by antigen-presenting cells (APCs) that are quiescent until they encounter a foreign microorganism or inflammatory stimuli. Early activated APCs trigger innate immune responses that represent the first line of reaction against invading pathogens to limit the infections. At later times, activated APCs acquire the ability to prime antigen-specific immune responses that clear the infections and give rise to memory. During the immune response self-tissue damage is limited and tolerance to self is maintained through life. Among the cells that constitute the immune system, dendritic cells (DC) play a central role. They are extremely versatile APCs involved in the initiation of both innate and adaptive immunity and also in the differentiation of regulatory T cells required for the maintenance of self-tolerance. How DC can mediate these diverse and almost contradictory functions has recently been investigated. The plasticity of these cells allows them to undergo a complete genetic reprogramming in response to external microbial stimuli with the sequential acquisition of different regulatory functions in innate and adaptive immunity. The specific genetic reprogramming DC undergo upon activation can be easily investigated by using microarrays to perform global gene expression analysis in different conditions.
Yin, Liang; Xue, Yanfen; Ma, Yanhe
2015-01-01
The alkaliphilic halotolerant bacterium Bacillus sp. N16-5 is often exposed to salt stress in its natural habitats. In this study, we used one-colour microarrays to investigate adaptive responses of Bacillus sp. N16-5 transcriptome to long-term growth at different salinity levels (0%, 2%, 8%, and 15% NaCl) and to a sudden salt increase from 0% to 8% NaCl. The common strategies used by bacteria to survive and grow at high salt conditions, such as K+ uptake, Na+ efflux, and the accumulation of organic compatible solutes (glycine betaine and ectoine), were observed in Bacillus sp. N16-5. The genes of SigB regulon involved in general stress responses and chaperone-encoding genes were also induced by high salt concentration. Moreover, the genes regulating swarming ability and the composition of the cytoplasmic membrane and cell wall were also differentially expressed. The genes involved in iron uptake were down-regulated, whereas the iron homeostasis regulator Fur was up-regulated, suggesting that Fur may play a role in the salt adaption of Bacillus sp. N16-5. In summary, we present a comprehensive gene expression profiling of alkaliphilic Bacillus sp. N16-5 cells exposed to high salt stress, which would help elucidate the mechanisms underlying alkaliphilic Bacillus spp. survival in and adaptation to salt stress. PMID:26030352
van Haaften, Rachel I M; Luceri, Cristina; van Erk, Arie; Evelo, Chris T A
2009-06-01
Omics technology used for large-scale measurements of gene expression is rapidly evolving. This work pointed out the need of an extensive bioinformatics analyses for array quality assessment before and after gene expression clustering and pathway analysis. A study focused on the effect of red wine polyphenols on rat colon mucosa was used to test the impact of quality control and normalisation steps on the biological conclusions. The integration of data visualization, pathway analysis and clustering revealed an artifact problem that was solved with an adapted normalisation. We propose a possible point to point standard analysis procedure, based on a combination of clustering and data visualization for the analysis of microarray data.
MeV+R: using MeV as a graphical user interface for Bioconductor applications in microarray analysis
Chu, Vu T; Gottardo, Raphael; Raftery, Adrian E; Bumgarner, Roger E; Yeung, Ka Yee
2008-01-01
We present MeV+R, an integration of the JAVA MultiExperiment Viewer program with Bioconductor packages. This integration of MultiExperiment Viewer and R is easily extensible to other R packages and provides users with point and click access to traditionally command line driven tools written in R. We demonstrate the ability to use MultiExperiment Viewer as a graphical user interface for Bioconductor applications in microarray data analysis by incorporating three Bioconductor packages, RAMA, BRIDGE and iterativeBMA. PMID:18652698
Jison, Maria L.; Munson, Peter J.; Barb, Jennifer J.; Suffredini, Anthony F.; Talwar, Shefali; Logun, Carolea; Raghavachari, Nalini; Beigel, John H.; Shelhamer, James H.; Danner, Robert L.; Gladwin, Mark T.
2016-01-01
In sickle cell disease, deoxygenation of intra-erythrocytic hemoglobin S leads to hemoglobin polymerization, erythrocyte rigidity, hemolysis, and microvascular occlusion. Ischemia-reperfusion injury, plasma hemoglobin-mediated nitric oxide consumption, and free radical generation activate systemic inflammatory responses. To characterize the role of circulating leukocytes in sickle cell pathogenesis we performed global transcriptional analysis of blood mononuclear cells from 27 patients in steady-state sickle cell disease (10 patients treated and 17 patients untreated with hydroxyurea) compared with 13 control subjects. We used gender-specific gene expression to validate human microarray experiments. Patients with sickle cell disease demonstrated differential gene expression of 112 genes involved in heme metabolism, cell-cycle regulation, antioxidant and stress responses, inflammation, and angiogenesis. Inducible heme oxygenase-1 and downstream proteins biliverdin reductase and p21, a cyclin-dependent kinase, were up-regulated, potentially contributing to phenotypic heterogeneity and absence of atherosclerosis in patients with sickle cell disease despite endothelial dysfunction and vascular inflammation. Hydroxyurea therapy did not significantly affect leukocyte gene expression, suggesting that such therapy has limited direct anti-inflammatory activity beyond leukoreduction. Global transcriptional analysis of circulating leukocytes highlights the intense oxidant and inflammatory nature of steady-state sickle cell disease and provides insight into the broad compensatory responses to vascular injury. PMID:15031206
2012-01-01
Background Biological systems respond to changes in both the Earth's magnetic and gravitational fields, but as experiments in space are expensive and infrequent, Earth-based simulation techniques are required. A high gradient magnetic field can be used to levitate biological material, thereby simulating microgravity and can also create environments with a reduced or an enhanced level of gravity (g), although special attention should be paid to the possible effects of the magnetic field (B) itself. Results Using diamagnetic levitation, we exposed Arabidopsis thaliana in vitro callus cultures to five environments with different levels of effective gravity and magnetic field strengths. The environments included levitation, i.e. simulated μg* (close to 0 g* at B = 10.1 T), intermediate g* (0.1 g* at B = 14.7 T) and enhanced gravity levels (1.9 g* at B = 14.7 T and 2 g* at B = 10.1 T) plus an internal 1 g* control (B = 16.5 T). The asterisk denotes the presence of the background magnetic field, as opposed to the effective gravity environments in the absence of an applied magnetic field, created using a Random Position Machine (simulated μg) and a Large Diameter Centrifuge (2 g). Microarray analysis indicates that changes in the overall gene expression of cultured cells exposed to these unusual environments barely reach significance using an FDR algorithm. However, it was found that gravitational and magnetic fields produce synergistic variations in the steady state of the transcriptional profile of plants. Transcriptomic results confirm that high gradient magnetic fields (i.e. to create μg* and 2 g* conditions) have a significant effect, mainly on structural, abiotic stress genes and secondary metabolism genes, but these subtle gravitational effects are only observable using clustering methodologies. Conclusions A detailed microarray dataset analysis, based on clustering of similarly expressed genes (GEDI software), can detect underlying global-scale responses, which cannot be detected by means of individual gene expression techniques using raw or corrected p values (FDR). A subtle, but consistent, genome-scale response to hypogravity environments was found, which was opposite to the response in a hypergravity environment. PMID:22435851
ArrayWiki: an enabling technology for sharing public microarray data repositories and meta-analyses
Stokes, Todd H; Torrance, JT; Li, Henry; Wang, May D
2008-01-01
Background A survey of microarray databases reveals that most of the repository contents and data models are heterogeneous (i.e., data obtained from different chip manufacturers), and that the repositories provide only basic biological keywords linking to PubMed. As a result, it is difficult to find datasets using research context or analysis parameters information beyond a few keywords. For example, to reduce the "curse-of-dimension" problem in microarray analysis, the number of samples is often increased by merging array data from different datasets. Knowing chip data parameters such as pre-processing steps (e.g., normalization, artefact removal, etc), and knowing any previous biological validation of the dataset is essential due to the heterogeneity of the data. However, most of the microarray repositories do not have meta-data information in the first place, and do not have a a mechanism to add or insert this information. Thus, there is a critical need to create "intelligent" microarray repositories that (1) enable update of meta-data with the raw array data, and (2) provide standardized archiving protocols to minimize bias from the raw data sources. Results To address the problems discussed, we have developed a community maintained system called ArrayWiki that unites disparate meta-data of microarray meta-experiments from multiple primary sources with four key features. First, ArrayWiki provides a user-friendly knowledge management interface in addition to a programmable interface using standards developed by Wikipedia. Second, ArrayWiki includes automated quality control processes (caCORRECT) and novel visualization methods (BioPNG, Gel Plots), which provide extra information about data quality unavailable in other microarray repositories. Third, it provides a user-curation capability through the familiar Wiki interface. Fourth, ArrayWiki provides users with simple text-based searches across all experiment meta-data, and exposes data to search engine crawlers (Semantic Agents) such as Google to further enhance data discovery. Conclusions Microarray data and meta information in ArrayWiki are distributed and visualized using a novel and compact data storage format, BioPNG. Also, they are open to the research community for curation, modification, and contribution. By making a small investment of time to learn the syntax and structure common to all sites running MediaWiki software, domain scientists and practioners can all contribute to make better use of microarray technologies in research and medical practices. ArrayWiki is available at . PMID:18541053
Digital microarray analysis for digital artifact genomics
NASA Astrophysics Data System (ADS)
Jaenisch, Holger; Handley, James; Williams, Deborah
2013-06-01
We implement a Spatial Voting (SV) based analogy of microarray analysis for digital gene marker identification in malware code sections. We examine a famous set of malware formally analyzed by Mandiant and code named Advanced Persistent Threat (APT1). APT1 is a Chinese organization formed with specific intent to infiltrate and exploit US resources. Manidant provided a detailed behavior and sting analysis report for the 288 malware samples available. We performed an independent analysis using a new alternative to the traditional dynamic analysis and static analysis we call Spatial Analysis (SA). We perform unsupervised SA on the APT1 originating malware code sections and report our findings. We also show the results of SA performed on some members of the families associated by Manidant. We conclude that SV based SA is a practical fast alternative to dynamics analysis and static analysis.
Hybrid genetic algorithm-neural network: feature extraction for unpreprocessed microarray data.
Tong, Dong Ling; Schierz, Amanda C
2011-09-01
Suitable techniques for microarray analysis have been widely researched, particularly for the study of marker genes expressed to a specific type of cancer. Most of the machine learning methods that have been applied to significant gene selection focus on the classification ability rather than the selection ability of the method. These methods also require the microarray data to be preprocessed before analysis takes place. The objective of this study is to develop a hybrid genetic algorithm-neural network (GANN) model that emphasises feature selection and can operate on unpreprocessed microarray data. The GANN is a hybrid model where the fitness value of the genetic algorithm (GA) is based upon the number of samples correctly labelled by a standard feedforward artificial neural network (ANN). The model is evaluated by using two benchmark microarray datasets with different array platforms and differing number of classes (a 2-class oligonucleotide microarray data for acute leukaemia and a 4-class complementary DNA (cDNA) microarray dataset for SRBCTs (small round blue cell tumours)). The underlying concept of the GANN algorithm is to select highly informative genes by co-evolving both the GA fitness function and the ANN weights at the same time. The novel GANN selected approximately 50% of the same genes as the original studies. This may indicate that these common genes are more biologically significant than other genes in the datasets. The remaining 50% of the significant genes identified were used to build predictive models and for both datasets, the models based on the set of genes extracted by the GANN method produced more accurate results. The results also suggest that the GANN method not only can detect genes that are exclusively associated with a single cancer type but can also explore the genes that are differentially expressed in multiple cancer types. The results show that the GANN model has successfully extracted statistically significant genes from the unpreprocessed microarray data as well as extracting known biologically significant genes. We also show that assessing the biological significance of genes based on classification accuracy may be misleading and though the GANN's set of extra genes prove to be more statistically significant than those selected by other methods, a biological assessment of these genes is highly recommended to confirm their functionality. Copyright © 2011 Elsevier B.V. All rights reserved.
arrayCGHbase: an analysis platform for comparative genomic hybridization microarrays
Menten, Björn; Pattyn, Filip; De Preter, Katleen; Robbrecht, Piet; Michels, Evi; Buysse, Karen; Mortier, Geert; De Paepe, Anne; van Vooren, Steven; Vermeesch, Joris; Moreau, Yves; De Moor, Bart; Vermeulen, Stefan; Speleman, Frank; Vandesompele, Jo
2005-01-01
Background The availability of the human genome sequence as well as the large number of physically accessible oligonucleotides, cDNA, and BAC clones across the entire genome has triggered and accelerated the use of several platforms for analysis of DNA copy number changes, amongst others microarray comparative genomic hybridization (arrayCGH). One of the challenges inherent to this new technology is the management and analysis of large numbers of data points generated in each individual experiment. Results We have developed arrayCGHbase, a comprehensive analysis platform for arrayCGH experiments consisting of a MIAME (Minimal Information About a Microarray Experiment) supportive database using MySQL underlying a data mining web tool, to store, analyze, interpret, compare, and visualize arrayCGH results in a uniform and user-friendly format. Following its flexible design, arrayCGHbase is compatible with all existing and forthcoming arrayCGH platforms. Data can be exported in a multitude of formats, including BED files to map copy number information on the genome using the Ensembl or UCSC genome browser. Conclusion ArrayCGHbase is a web based and platform independent arrayCGH data analysis tool, that allows users to access the analysis suite through the internet or a local intranet after installation on a private server. ArrayCGHbase is available at . PMID:15910681
2010-01-01
Background The zebra mussel (Dreissena polymorpha) has been well known for its expertise in attaching to substances under the water. Studies in past decades on this underwater adhesion focused on the adhesive protein isolated from the byssogenesis apparatus of the zebra mussel. However, the mechanism of the initiation, maintenance, and determination of the attachment process remains largely unknown. Results In this study, we used a zebra mussel cDNA microarray previously developed in our lab and a factorial analysis to identify the genes that were involved in response to the changes of four factors: temperature (Factor A), current velocity (Factor B), dissolved oxygen (Factor C), and byssogenesis status (Factor D). Twenty probes in the microarray were found to be modified by one of the factors. The transcription products of four selected genes, DPFP-BG20_A01, EGP-BG97/192_B06, EGP-BG13_G05, and NH-BG17_C09 were unique to the zebra mussel foot based on the results of quantitative reverse transcription PCR (qRT-PCR). The expression profiles of these four genes under the attachment and non-attachment were also confirmed by qRT-PCR and the result is accordant to that from microarray assay. The in situ hybridization with the RNA probes of two identified genes DPFP-BG20_A01 and EGP-BG97/192_B06 indicated that both of them were expressed by a type of exocrine gland cell located in the middle part of the zebra mussel foot. Conclusions The results of this study suggested that the changes of D. polymorpha byssogenesis status and the environmental factors can dramatically affect the expression profiles of the genes unique to the foot. It turns out that the factorial design and analysis of the microarray experiment is a reliable method to identify the influence of multiple factors on the expression profiles of the probesets in the microarray; therein it provides a powerful tool to reveal the mechanism of zebra mussel underwater attachment. PMID:20509938
Lengger, Sandra; Otto, Johannes; Elsässer, Dennis; Schneider, Oliver; Tiehm, Andreas; Fleischer, Jens; Niessner, Reinhard; Seidel, Michael
2014-05-01
Pathogenic viruses are emerging contaminants in water which should be analyzed for water safety to preserve public health. A strategy was developed to quantify RNA and DNA viruses in parallel on chemiluminescence flow-through oligonucleotide microarrays. In order to show the proof of principle, bacteriophage MS2, ΦX174, and the human pathogenic adenovirus type 2 (hAdV2) were analyzed in spiked tap water samples on the analysis platform MCR 3. The chemiluminescence microarray imaging unit was equipped with a Peltier heater for a controlled heating of the flow cell. The efficiency and selectivity of DNA hybridization could be increased resulting in higher signal intensities and lower cross-reactivities of polymerase chain reaction (PCR) products from other viruses. The total analysis time for DNA/RNA extraction, cDNA synthesis for RNA viruses, polymerase chain reaction, single-strand separation, and oligonucleotide microarray analysis was performed in 4-4.5 h. The parallel quantification was possible in a concentration range of 9.6 × 10(5)-1.4 × 10(10) genomic units (GU)/mL for bacteriophage MS2, 1.4 × 10(5)-3.7 × 10(8) GU/mL for bacteriophage ΦX174, and 6.5 × 10(3)-1.2 × 10(5) for hAdV2, respectively, by using a measuring temperature of 40 °C. Detection limits could be calculated to 6.6 × 10(5) GU/mL for MS2, 5.3 × 10(3) GU/mL for ΦX174, and 1.5 × 10(2) GU/mL for hAdV2, respectively. Real samples of surface water and treated wastewater were tested. Generally, found concentrations of hAdV2, bacteriophage MS2, and ΦX174 were at the detection limit. Nevertheless, bacteriophages could be identified with similar results by means of quantitative PCR and oligonucleotide microarray analysis on the MCR 3.